我不知道您的访问方式,
tag但以下内容对我有用:
import BeautifulSoupif __name__ == "__main__": data = """<html><h2 >some title</h2><ul> <li>Lorem ipsum dolor sit amet, consectetuer adipiscing elit.</li> <li>Aliquam tincidunt mauris eu risus.</li> <li>Vestibulum auctor dapibus neque.</li></ul></html> """ soup = BeautifulSoup.BeautifulSoup(data) h2 = soup.find('h2') h2.name = 'h1' print soupprint soup命令输出为:
<html><h1 >some title</h1><ul><li>Lorem ipsum dolor sit amet, consectetuer adipiscing elit.</li><li>Aliquam tincidunt mauris eu risus.</li><li>Vestibulum auctor dapibus neque.</li></ul></html>
如您所见,
h2变成了
h1。文件中的其他内容都没有改变。我正在使用Python 2.6和BeautifulSoup 3.2.0。
如果您有多个,
h2并且想要全部更改,则可以执行以下操作:
soup = BeautifulSoup.BeautifulSoup(your_data)while True: h2 = soup.find('h2') if not h2: break h2.name = 'h1'


