如何使用Python识别二进制文件和文本文件？[重复]

面试问答更新时间：2026-04-03 20:24:36 发布时间：1572天前 IT归档最新发布模块sitemap 名妆网法律咨询聚返吧英语巴士网伯小乐网商动力

谢谢大家，我找到了适合我问题的解决方案。我在http://pre.activestate.com/recipes/173220/上找到了此代码，并做了一些修改以适合我。

它工作正常。

from __future__ import divisionimport stringdef istext(filename):    s=open(filename).read(512)    text_characters = "".join(map(chr, range(32, 127)) + list("nrtb"))    _null_trans = string.maketrans("", "")    if not s:        # Empty files are considered text        return True    if "" in s:        # Files with null bytes are likely binary        return False    # Get the non-text characters (maps a character to itself then    # use the 'remove' option to get rid of the text characters.)    t = s.translate(_null_trans, text_characters)    # If more than 30% non-text characters, then    # this is considered a binary file    if float(len(t))/float(len(s)) > 0.30:        return False    return True

转载请注明：文章转载自 www.mshxw.com

本文地址：https://www.mshxw.com/it/660154.html

上一篇假设有很多重复，使用numpy向量化“纯”函数

下一篇在Python中使用<128KB的字符串时发生内存泄漏？

面试问答相关栏目本月热门文章

关于我们文章归档网站地图联系我们