从difflib获取更多粒度的diff（或对diff进行后处理以实现同一目的的方法）

面试问答更新时间：2026-05-21 21:08:28 发布时间：1631天前 IT归档最新发布模块sitemap 名妆网法律咨询聚返吧英语巴士网伯小乐网商动力

您可以

nltk.sent_tokenize()

用来将汤串分割成句子：

from nltk import sent_tokenizesentences = [sentence for string in soup.stripped_strings for sentence in sent_tokenize(string)]sentences2 = [sentence for string in soup2.stripped_strings for sentence in sent_tokenize(string)]diff = d.compare(sentences, sentences2)changes = [change for change in diff if change.startswith('-') or  change.startswith('+')]for change in changes:    print(change)

仅在检测到更改的地方打印适当的句子：

- It contains a Title II provision that changes the age at which workers compensation/public disability offset ends for disability beneficiaries from age 65 to full retirement age (FRA).+ It contains a Title II provision that changes the age at which workers compensation/public disability offset ends for disability beneficiaries from age 68 to full retirement age (FRA).

转载请注明：文章转载自 www.mshxw.com

本文地址：https://www.mshxw.com/it/625502.html

上一篇无法使用Python遍历分页的API响应

下一篇使用for循环迭代并引用lst [i]时发生TypeError / IndexError

面试问答相关栏目本月热门文章

关于我们文章归档网站地图联系我们