October 2021: scraping the 唯美女孩 (vmgirls) site with Python, requests + re

Python | Updated: 2026-05-21 21:32:41 | Published: 1679 days ago

No more chatter, straight to the code.

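import os
import re
import time

import requests

headers = {
    "user-agent": "your own browser's User-Agent string"
}
# Only the detail page is scraped; replace the placeholder with the page's five-digit id.
response = requests.get('https://www.vmgirls.com/<five-digit id>.html', headers=headers)
time.sleep(1)


def cunchu(data):
    """Download every image referenced in the page HTML into ./img/."""
    os.makedirs('./img', exist_ok=True)  # make sure the output directory exists
    img_all = re.findall('', data)
    for i in img_all:
        img_url = 'https:' + i  # prepend the scheme to each matched URL
        print(img_url)
        time.sleep(1)  # throttle requests
        file = './img/' + i[26:34] + '.jpg'  # characters 26-33 of the match serve as the file name
        img_res = requests.get(img_url, headers=headers)
        with open(file, "wb") as f:
            f.write(img_res.content)


if re.findall('', response.text):
    print("No redirect")
    cunchu(response.text)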
else:
    href = re.findall('.href ="(.*?)";  
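Key idea: a request for a detail page may return a redirect page instead of the page itself. When that happens, use re to pull the detail page URL out of the redirect page and issue another GET.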
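
The redirect handling described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the original code: the pattern in `REDIRECT_RE` is hypothetical (it assumes the redirect page sets something like `window.location.href ="...";` in a script), and `find_redirect` and `resolve_detail_page` are names made up for this example. Note that it tests for the presence of a redirect rather than for a marker of the detail page, which is a variation on the check used in the post.

```python
import re
from typing import Callable, Optional

# Hypothetical pattern: assumes the redirect page contains a script statement
# such as  window.location.href ="https://...";  (not confirmed by the source).
REDIRECT_RE = re.compile(r'\.href\s*=\s*"(.*?)"\s*;')


def find_redirect(html: str) -> Optional[str]:
    """Return the redirect target found in html, or None if there is none."""
    match = REDIRECT_RE.search(html)
    return match.group(1) if match else None


def resolve_detail_page(html: str, fetch: Callable[[str], str], max_hops: int = 3) -> str:
    """Follow script redirects until a page without one is reached.

    fetch takes a URL and returns that page's HTML. max_hops bounds the
    loop in case a page keeps redirecting.
    """
    for _ in range(max_hops):
        target = find_redirect(html)
        if target is None:
            break
        html = fetch(target)
    return html
```

With requests, `fetch` could be something like `lambda url: requests.get(url, headers=headers).text`, and the returned HTML would then be passed to `cunchu`. A real detail page might itself contain a matching `.href = "...";` statement, so the pattern would likely need tightening against the actual redirect page.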

When reposting, please credit the source: reposted from www.mshxw.com

Article URL: https://www.mshxw.com/it/324046.html
