content = re.sub(r'[^a-zA-Z0-9u3002uff1buff0cuff1au201cu201duff08uff09u3001uff1fu300au300bu4e00-u9fa5]','',content)
上述一行代码就能搞定
匹配数字和大小写字母:[a-zA-Z0-9]
匹配中文标点符号: [u3002uff1buff0cuff1au201cu201duff08uff09u3001uff1fu300au300b]
匹配中文字符的正则表达式: [u4e00-u9fa5]
^代表非



