python 使用xpath获取网页标签内容

Python 更新时间：2026-05-22 03:00:20 发布时间：1490天前 IT归档最新发布模块sitemap 名妆网法律咨询聚返吧英语巴士网伯小乐网商动力

获取指定html的标签内容

打开网页的开发者模式,得到路径标签，然后加上/text() 即可得到标签的文本内容//*[@id="sonsyuanwen"]/div[1]/h1

对于网页爬取来说，还是很方便的

# -*- ecoding: utf-8 -*-
# @ModuleName: test005
# @Function: 
# @Author: darling
# @Time: 2022-04-18 13:58

import requests

from lxml import etree


def get_url():
    resource = requests.get('https://so.gushiwen.cn/shiwenv_444df93c9bdf.aspx')
    html = etree.HTML(resource.text)
    title = html.xpath('//*[@id="sonsyuanwen"]/div[1]/h1/text()')
    neir=html.xpath('//*[@id="contson444df93c9bdf"]/text()')
    print(title,neir)
    return resource


if __name__ == "__main__":
    res = get_url()
    print(res)

转载请注明：文章转载自 www.mshxw.com

本文地址：https://www.mshxw.com/it/822328.html

上一篇 AttributeError: module ‘pandas‘ has no attribute ‘read

下一篇《Python编程：从入门到实践》读书笔记：第12章武装飞船

Python相关栏目本月热门文章

关于我们文章归档网站地图联系我们

python 使用xpath获取网页标签内容

获取指定html的标签内容 打开网页的开发者模式,得到路径标签，然后加上/text() 即可得到标签的文本内容//*[@id="sonsyuanwen"]/div[1]/h1

Python相关栏目本月热门文章

获取指定html的标签内容

打开网页的开发者模式,得到路径标签，然后加上/text() 即可得到标签的文本内容//*[@id="sonsyuanwen"]/div[1]/h1