will4906 / PatentCrawler

Scrapy patent crawler (no longer maintained)

Parsed abstract contains HTML code

fallleave001 opened this issue

Changing line 145 of patent.py to the following fixes it:
sipocrawler['abstract'] = BeautifulSoup(detail.get('abstractInfoDTO').get('abIndexList')[0].get('value')).text.replace('\n','')
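
For anyone who wants to try the idea outside the crawler, here is a minimal sketch of the same tag stripping. It deliberately swaps BeautifulSoup for the standard library's `html.parser`, so it runs without extra dependencies. The helper names `strip_html` and `extract_abstract` are hypothetical, the key names are taken from the one-liner above, and the empty-string fallback for a missing abstract is an assumption, not something the project does.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects only the text nodes of an HTML fragment."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        # Called for text between tags; the tags themselves are ignored.
        self.parts.append(data)


def strip_html(fragment):
    """Return the text of an HTML fragment with tags and newlines removed."""
    parser = _TextExtractor()
    parser.feed(fragment or "")
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.parts).replace("\n", "")


def extract_abstract(detail):
    """Read the first abstract entry from `detail` and strip its HTML.

    Assumption: a missing or empty abstract yields an empty string
    instead of raising, unlike the one-liner above.
    """
    entries = (detail.get("abstractInfoDTO") or {}).get("abIndexList") or []
    if not entries:
        return ""
    return strip_html(entries[0].get("value"))
```

In the crawler this would be used as `sipocrawler['abstract'] = extract_abstract(detail)`. The guard on `abIndexList` is the main design difference: the one-liner raises if a record has no abstract, while this sketch returns an empty string.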

OK, I'll make that change.