[已解决]一个问题代码，好像是编码格式错误，不知道怎么弄，特来求助

忽视 · 发表于 2017-8-16 10:57:24

import os
m = []
x = []
temp = input('请将该代码放在要查找的文件夹内,请输入关键子:')
for each in os.walk('E:/to come again'):
m.append(each)
y = len(m)
while True:
if y != (-1):
for each in m[y-1][2]:
x.append(m[y-1][0]+'/'+each)
else:
break
y-=1
for each in x:
r = open(each)
for i in r:
if '曹植' in i:
print(i)
else:
print('没找到')

复制代码

上面是我的代码
下面是我的代码的错误提示:
Traceback (most recent call last):
File "<pyshell#86>", line 3, in <module>
for i in r:
UnicodeDecodeError: 'gbk' codec can't decode byte 0x80 in position 66: illegal multibyte sequence
研究了好久，不知道是不是编码格式错误

最佳答案

月排行榜 / 总排行榜

ba21

2017-8-16 10:57:25

https://pypi.python.org/pypi/chardet

import chardet
#以rb读取文件返回文件的编码(用到了chardet类)
      with open(file_name, 'rb') as f:
         raw = f.read()
         result = chardet.detect(raw)
         encoding = result['encoding']

      lines = 0
      with open(file_name,encoding=encoding) as f:
         print('正在分析文件：%s ...' % file_name)
         try:
            for each_line in f:
                  lines += 1
         except Exception as reason:
            print(str(reason)) # 读取出错显示错误信息......
      print('%s -> %s' % (file_name,lines))
      return lines

import os
import chardet
m = []
x = []
temp = input('请将该代码放在要查找的文件夹内,请输入关键子:')
for each in os.walk('E:/Json60r8'):
m.append(each)
y = len(m)
while True:
if y != (-1):
for each in m[y-1][2]:
x.append(m[y-1][0]+'/'+each)
else:
break
y-=1
for each in x:
#以rb读取文件返回文件的编码(用到了chardet类)
with open(each, 'rb') as f:
raw = f.read()
result = chardet.detect(raw)
encoding = result['encoding']
r = open(each, encoding=encoding)
try:
for i in r:
if '曹植' in i:
print(i)
else:
print('没找到')
except Exception as reason:
print(str(reason)) # 读取出错显示错误信息......

复制代码

跳转到最佳答案楼层

ba21 · 发表于 2017-8-16 10:57:25

https://pypi.python.org/pypi/chardet

import chardet
#以rb读取文件返回文件的编码(用到了chardet类)
      with open(file_name, 'rb') as f:
         raw = f.read()
         result = chardet.detect(raw)
         encoding = result['encoding']

      lines = 0
      with open(file_name,encoding=encoding) as f:
         print('正在分析文件：%s ...' % file_name)
         try:
            for each_line in f:
                  lines += 1
         except Exception as reason:
            print(str(reason)) # 读取出错显示错误信息......
      print('%s -> %s' % (file_name,lines))
      return lines

import os
import chardet
m = []
x = []
temp = input('请将该代码放在要查找的文件夹内,请输入关键子:')
for each in os.walk('E:/Json60r8'):
m.append(each)
y = len(m)
while True:
if y != (-1):
for each in m[y-1][2]:
x.append(m[y-1][0]+'/'+each)
else:
break
y-=1
for each in x:
#以rb读取文件返回文件的编码(用到了chardet类)
with open(each, 'rb') as f:
raw = f.read()
result = chardet.detect(raw)
encoding = result['encoding']
r = open(each, encoding=encoding)
try:
for i in r:
if '曹植' in i:
print(i)
else:
print('没找到')
except Exception as reason:
print(str(reason)) # 读取出错显示错误信息......

复制代码

丢丢yhj · 发表于 2017-8-16 11:43:24

for each in os.walk('E:/to come again'，encoding='UTF-8'):
改成这样试试还有我记得E：后面应该是//

新手·ing · 发表于 2017-8-16 11:56:13

http://bbs.fishc.com/thread-56452-1-1.html

kinght1147 · 发表于 2017-8-16 12:04:33

第6行 for each in os.walk('E:/to come again'):应该是路径的斜杠错了

忽视 · 发表于 2017-8-26 12:06:16

ba21 发表于 2017-8-16 10:57
https://pypi.python.org/pypi/chardet

import chardet

好久没登录，现在才上线，不好意思

账号		自动登录	找回密码
密码			立即注册

[已解决]一个问题代码，好像是编码格式错误，不知道怎么弄，特来求助

最佳答案

评分