Soryu23 / weibo-crawler

Weibo-crawler is a crawler project based on golang colly framework to crawl weibo sites and get information. It crawls web content by regular expressions and Xpath selector, spatially transforms keywords using word vector model, and clusters text content by HDBSCAN clustering algorithm.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Soryu23/weibo-crawler Watchers