报错:
open函数报错:'gbk' codec can't encode character 'xa0' in position 12863: illegal multibyte sequence
错误原因:
在windows下,新txt文件的默认编码是GBK,python解释器会用GBK编码去解析网络数据流txt,但该txt已经是decode过的unicode编码,因此无法解析。
解决方案:
给open函数加上参数encoding='UTF-8':
with open(EachSource, 'w', encoding='UTF-8') as f:
f.write(html)


