The search on the PGA website spans multiple pages, and the URL follows this pattern:
http://www.pga.com/golf-courses/search?page=1 # Additional info after page parameter here
This means you can read the content of one page, then increase the value of page by 1 and read the next page, and so on.
import csv
import requests
from bs4 import BeautifulSoup

for i in range(907):  # Number of pages plus one
    url = "http://www.pga.com/golf-courses/search?page={}&searchbox=Course+Name&searchbox_zip=ZIP&distance=50&price_range=0&course_type=both&has_events=0".format(i)
    r = requests.get(url)
    # Name the parser explicitly so the result does not depend on which parsers are installed
    soup = BeautifulSoup(r.content, "html.parser")
    # Your code for each individual page here
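If the long format string gets hard to maintain, the page URLs can also be built from a dictionary of query parameters. The following is only a sketch: the names `BASE_URL`, `FIXED_PARAMS`, `build_page_url` and `page_urls` are made up for illustration, and the parameter values are simply copied from the URL above. It makes no network requests.

```python
from urllib.parse import urlencode

# Assumption: base address and parameters are taken verbatim from the URL above.
BASE_URL = "http://www.pga.com/golf-courses/search"

# Everything except `page` stays the same from one request to the next.
FIXED_PARAMS = {
    "searchbox": "Course Name",
    "searchbox_zip": "ZIP",
    "distance": 50,
    "price_range": 0,
    "course_type": "both",
    "has_events": 0,
}


def build_page_url(page):
    """Return the search URL for one page number (hypothetical helper)."""
    params = {"page": page, **FIXED_PARAMS}
    # urlencode encodes the space in "Course Name" as "+"
    return BASE_URL + "?" + urlencode(params)


def page_urls(page_count):
    """Yield the URLs for pages 0 .. page_count - 1 (hypothetical helper)."""
    for page in range(page_count):
        yield build_page_url(page)
```

Each URL produced by `page_urls(907)` can then be passed to `requests.get` exactly as in the loop above.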


