Illegal multibyte sequence
xiaoyongzhu opened this issue · comments
Describe the bug
感谢这个repo,非常实用!
在使用的过程中经常遇到这个error:illegal multibyte sequence,感觉是和相关的encoding有关。并不是经常出现,但是当内容有一些比较奇怪字符的时候就容易出现这个问题。
使用的系统是windows 10 x64中文版,但感觉这个问题是在insert into database时候出现的问题。
--- Logging error ---
Traceback (most recent call last):
File "logging\__init__.py", line 1036, in emit
UnicodeEncodeError: 'gbk' codec can't encode character '\uc190' in position 322: illegal multibyte sequence
Call stack:
File "manage.py", line 116, in <module>
File "site-packages\flask_script\__init__.py", line 417, in run
File "site-packages\flask_script\__init__.py", line 386, in handle
File "site-packages\flask_script\commands.py", line 216, in __call__
File "manage.py", line 41, in fetch
File "fetch.py", line 99, in fetch_user
File "fetch.py", line 76, in fetch_blog
File "crawl\blog.py", line 83, in get_blogs
File "crawl\blog.py", line 49, in load_blog_list
File "crawl\utils.py", line 124, in get_comments
File "site-packages\peewee.py", line 1574, in inner
File "site-packages\peewee.py", line 1645, in execute
File "site-packages\peewee.py", line 2288, in _execute
File "site-packages\peewee.py", line 2063, in _execute
File "site-packages\peewee.py", line 2653, in execute
File "site-packages\peewee.py", line 2628, in execute_sql
File "logging\__init__.py", line 1371, in debug
File "logging\__init__.py", line 1519, in _log
File "logging\__init__.py", line 1529, in handle
File "logging\__init__.py", line 1591, in callHandlers
File "logging\__init__.py", line 905, in handle
File "logging\handlers.py", line 479, in emit
File "logging\__init__.py", line 1132, in emit
File "logging\__init__.py", line 1040, in emit
Message: ('INSERT OR REPLACE INTO "comment" ("id", "t", "entry_id", "entry_type", "authorId", "authorName", "content") VALUES (?, ?, ?, ?, ?, ?, ?)', [36028797501124092, datetime.datetime(2008, 4, 8, 7, 22, 45), 282116120, 'blog', 172790766, '赵欢', '回复孙鹤中손학중:过奖啦!呵呵<img src="<a href=\'http://uu.ren/kRMsjR\' target=\'_blank\' title=\'http://static.xiaonei.com/img/editor/emot/emot-10.gif\'>http://uu.ren/kRMsjR </a> "/>'])
貌似我用的code base太老了...最新的code base已经没有这个问题了。多谢!