uyru's starred repositories
ArticleSpider
Crawls Zhihu, Jobbole, and Lagou with Scrapy, and uses Elasticsearch + Django to build a search engine website. See README_zh.md for the implementation roadmap, the distributed crawler, and strategies for coping with anti-crawling measures.
NLPwebsite
An annotation system website for natural language processing, a Python web application written with the Django framework.
word2vec-tensorflow
Training Chinese word vectors with word2vec.
Chinese-Word-Vectors
100+ Chinese Word Vectors: over a hundred pre-trained Chinese word vectors.
word2vec-Finance
Word vectors trained on 200,000 financial news items.
Min-Graph-Equipartition
Min-Graph Equipartition Problem with Simulated Annealing
Plagiarism-Detection
A utility that detects text plagiarism between two documents.
ProductAnalysis
Scrapes data from ZOL, implements full-text search with django-haystack, visualizes the data with bokeh, and analyzes it with pandas.
ajaxSearch
A simple search demo that imitates the Baidu search bar.
Measuring-Sentence-Similarity
The project aims to measure the similarity between sentences using Natural Language Processing tools such as WordNet and NLTK. The application works on semantic and syntactic features and then evaluates them using Machine Learning classifiers such as Logistic Regression and SVM (sklearn).
SemanticModellingPaperSimilarity
Bachelor Thesis "Semantic Modelling of Scientific Publications and Similarity Measures"
CIKM-AnalytiCup-2018
[ACM-CIKM] 2nd place solution at CIKM AnalytiCup 2018, a task for determining short text similarities.
Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
fast-remover
A function for removing nodes that have degree k
ExemplarQueries
Exemplar queries is a new query paradigm. This library is able to use Freebase to process exemplar queries at run-time.
Plagiarize3
Detection of plagiarism
plagiarism
Plagiarism detection
pycode_similar
A simple plagiarism detection tool for python code
CheckArticle
Duplicate checking for science and technology projects, implemented in Python using the cosine similarity algorithm.
DuplicateDetector
Document duplication and plagiarism detection implemented in Python.