nirenxiaoxiao's repositories
similarity
similarity:相似度计算工具包,java编写。用于词语、短语、句子、词法分析、情感分析、语义分析等相关的相似度计算。
Administrative-divisions-of-China
中华人民共和国行政区划:省级(省份直辖市自治区)、 地级(城市)、 县级(区县)、 乡级(乡镇街道)、 村级(村委会居委会) ,**省市区镇村二级三级四级五级联动地址数据 Node.js 爬虫。
china-metro-info
**城市地铁站数据库
Company-Names-Corpus
公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。
CrawlerProject
爬虫项目:链家网(普通/scrapy)、虎扑、维基百科、百度地图api、房天下(分布式爬虫)、微信公众号(代理池爬取)
DesignPattern
C++编程**设计模式代码
docker_identidock
using docker to compose a group micro service
jd_maotai_seckill
优化版本的京东茅台抢购神器
learn-regex
Learn regex the easy way
linux_drvier_kernel
Linux驱动,内核经典书籍
machine-learning-for-software-engineers
A complete daily plan for studying to become a machine learning engineer.
MicroTokenizer
一个微型&算法全面的中文分词引擎 | A micro tokenizer for Chinese
nlp-in-practice
NLP, Text Mining and Machine Learning starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, keyword extraction with TFIDF, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
python_get_city_datas
爬取**省市资料
WebCollector-Python
WebCollector-Python is an open source web crawler framework based on Python.It provides some simple interfaces for crawling the Web,you can setup a multi-threaded web crawler in less than 5 minutes.
YouCompleteMe
A code-completion engine for Vim