whusnoopy / renrenBackup

A backup tool for renren.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Illegal multibyte sequence

xiaoyongzhu opened this issue · comments

Describe the bug
感谢这个repo,非常实用!
在使用的过程中经常遇到这个error:illegal multibyte sequence,感觉是和相关的encoding有关。并不是经常出现,但是当内容有一些比较奇怪字符的时候就容易出现这个问题。

使用的系统是windows 10 x64中文版,但感觉这个问题是在insert into database时候出现的问题。

--- Logging error ---
Traceback (most recent call last):
  File "logging\__init__.py", line 1036, in emit
UnicodeEncodeError: 'gbk' codec can't encode character '\uc190' in position 322: illegal multibyte sequence
Call stack:
  File "manage.py", line 116, in <module>
  File "site-packages\flask_script\__init__.py", line 417, in run
  File "site-packages\flask_script\__init__.py", line 386, in handle
  File "site-packages\flask_script\commands.py", line 216, in __call__
  File "manage.py", line 41, in fetch
  File "fetch.py", line 99, in fetch_user
  File "fetch.py", line 76, in fetch_blog
  File "crawl\blog.py", line 83, in get_blogs
  File "crawl\blog.py", line 49, in load_blog_list
  File "crawl\utils.py", line 124, in get_comments
  File "site-packages\peewee.py", line 1574, in inner
  File "site-packages\peewee.py", line 1645, in execute
  File "site-packages\peewee.py", line 2288, in _execute
  File "site-packages\peewee.py", line 2063, in _execute
  File "site-packages\peewee.py", line 2653, in execute
  File "site-packages\peewee.py", line 2628, in execute_sql
  File "logging\__init__.py", line 1371, in debug
  File "logging\__init__.py", line 1519, in _log
  File "logging\__init__.py", line 1529, in handle
  File "logging\__init__.py", line 1591, in callHandlers
  File "logging\__init__.py", line 905, in handle
  File "logging\handlers.py", line 479, in emit
  File "logging\__init__.py", line 1132, in emit
  File "logging\__init__.py", line 1040, in emit
Message: ('INSERT OR REPLACE INTO "comment" ("id", "t", "entry_id", "entry_type", "authorId", "authorName", "content") VALUES (?, ?, ?, ?, ?, ?, ?)', [36028797501124092, datetime.datetime(2008, 4, 8, 7, 22, 45), 282116120, 'blog', 172790766, '赵欢', '回复孙鹤中손학중:过奖啦!呵呵<img src="<a href=\'http://uu.ren/kRMsjR\' target=\'_blank\' title=\'http://static.xiaonei.com/img/editor/emot/emot-10.gif\'>http://uu.ren/kRMsjR </a> "/>'])

貌似我用的code base太老了...最新的code base已经没有这个问题了。多谢!