"for line in…" results in UnicodeDecodeError: 'utf-8' codec can't decode byte

Question 1

Here is my code:

for line in open('u.item'):
    pass  # read each line

Whenever I run this code, it gives the following error:

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 2892: invalid continuation byte

I tried to fix this by passing an extra parameter to open(), so the code now looks like this:

for line in open('u.item', encoding='utf-8'):
    pass  # read each line

But it gives the same error again. What should I do? Please help.

Question 2

As suggested by Mark Ransom, I found the right encoding for this problem. The encoding was "ISO-8859-1", so replacing open('u.item', encoding="utf-8") with open('u.item', encoding="ISO-8859-1") solves the problem.

Question 3

This also worked for me. ISO 8859-1 is going to save a lot of trouble, especially when working with speech recognition APIs.

Example:

file = open('../Resources/' + filename, 'r', encoding="ISO-8859-1")

Question 4

Your file doesn't actually contain UTF-8 encoded data; it uses some other encoding. Figure out what that encoding is and pass it to the open call.

In Windows-1252, for example, the byte 0xe9 is the character é.
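One way to narrow down the encoding is to try a short list of candidates against the raw bytes. The following is only a sketch: the sample bytes and the helper name try_decode are made up for illustration, and the candidate list is an assumption about which encodings are plausible for the file.

```python
# Hypothetical sample: the word "café" as written by a Latin-1 / Windows-1252 editor.
raw = b"caf\xe9"

def try_decode(data, encodings=("utf-8", "cp1252", "iso-8859-1")):
    """Return (encoding, text) for the first candidate that decodes without error."""
    for name in encodings:
        try:
            return name, data.decode(name)
        except UnicodeDecodeError:
            continue
    raise ValueError("none of the candidate encodings matched")

encoding, text = try_decode(raw)
print(encoding, text)
```

Keep in mind that ISO-8859-1 maps every possible byte to a character, so it never raises; a clean decode with it does not prove the guess is right. Check that the resulting text actually looks correct.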

Question 5

Try reading the file with pandas (m_cols is your own list of column names):

import pandas as pd
pd.read_csv('u.item', sep='|', names=m_cols, encoding='latin-1')

Question 6

If you are using Python 2, the following will be the solution:

import io
for line in io.open("u.item", encoding="ISO-8859-1"):
    # do something

This is needed because the built-in open() in Python 2 does not accept an encoding parameter. If you pass one, you get the following error:

TypeError: 'encoding' is an invalid keyword argument for this function

Question 7

You could resolve the problem with:

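for line in open(your_file_path, 'rb'):
    pass  # each line is a bytes object, not a str

'rb' opens the file in binary mode, so Python does not try to decode the contents and no UnicodeDecodeError is raised. Hope this helps!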
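Binary mode only postpones the decoding: to work with text you still have to decode each line yourself. A minimal sketch, assuming the file is pipe-separated and encoded as ISO-8859-1 (the temporary file and its two records are invented stand-ins for u.item):

```python
import os
import tempfile

# Hypothetical stand-in for u.item: two pipe-separated records stored as ISO-8859-1.
with tempfile.NamedTemporaryFile(delete=False, suffix=".item") as tmp:
    tmp.write("1|Café Society\n2|Toy Story\n".encode("iso-8859-1"))
    path = tmp.name

titles = []
with open(path, "rb") as f:
    for line in f:                         # each line is a bytes object
        text = line.decode("iso-8859-1")   # decode explicitly with the known encoding
        titles.append(text.rstrip("\n").split("|")[1])

os.remove(path)
print(titles)
```

If the encoding is known up front, passing it to open() in text mode is shorter; binary mode is mostly useful when you want to inspect the raw bytes first.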

Question 8

This works:

open('filename', encoding='latin-1')

or, equivalently:

open('filename', encoding='ISO-8859-1')

Question 9

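If anyone is looking for this, here is an example for converting a CSV file in Python 3:

import csv
from sys import argv

try:
    # Decode the input file as ISO-8859-1 rather than the default encoding
    inputReader = csv.reader(open(argv[1], encoding='ISO-8859-1'), delimiter=',', quotechar='"')
except IOError:
    pass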

Question 10

Sometimes the same error shows up when the filepath passed to open(filepath) is not actually a file, so first make sure the file you are trying to open exists:

import os
assert os.path.isfile(filepath)

Hope this helps.

Question 11

You can try this, which skips any bytes that cannot be decoded as UTF-8:

open('u.item', encoding='utf8', errors='ignore')
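Be aware that errors='ignore' silently discards data. The sketch below uses bytes.decode, which accepts the same error handler names as open(); the sample bytes are made up for illustration.

```python
# Hypothetical Latin-1 bytes containing 0xe9 (é), which is not valid UTF-8 here.
raw = b"Caf\xe9 Society (1995)"

ignored = raw.decode("utf-8", errors="ignore")    # drops the undecodable byte
replaced = raw.decode("utf-8", errors="replace")  # substitutes U+FFFD for it
correct = raw.decode("iso-8859-1")                # decodes with the matching encoding

print(ignored)
print(replaced)
print(correct)
```

With 'ignore' the accented character simply disappears, and with 'replace' it becomes a placeholder character, so both are lossy. Using the file's real encoding is the only option here that preserves the original text.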
