Fan's repositories
bdbk-kb
Baidu Baike knowledge base
berkeleycoref-bansalklein
Extractor for Bansal & Klein features used in the Berkeley Coreference System modified with distributional features
chinese_correct_wsd
Simple Chinese error correction and word sense disambiguation
chinese_text_cluster
MachineLearning
ChineseNgoKnowledgeGraph
Creating a Chinese NGO knowledge graph is my graduation project. It is developed in Python, with Django as the web framework, the graph database Neo4j as the database, and D3 for visualization. Very cool.
ChineseWordSegmentation
Chinese word segmentation algorithm that does not require a corpus
corpusZh
A Chinese corpus annotated with part-of-speech tags
DeepLearning
Deep learning code by Hinton
dict_build
Automatically builds a Chinese lexicon from large Chinese text using an unsupervised method. Algorithm: http://www.matrix67.com/blog/archives/5044
distribute_crawler
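A distributed web crawler built with Scrapy, Redis, MongoDB, and Graphite. Underlying storage is a MongoDB cluster, distribution is implemented with Redis, and crawler status is displayed with Graphite.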
entity-linking-and-retrieval-tutorial
Entity Linking and Retrieval Tutorial
maxent
Maximum Entropy Modeling Toolkit for Python and C++
Mutual-Information
In probability theory and information theory, the mutual information (MI) of two random variables is a quantity that measures the mutual dependence of the two random variables. This script computes MI over discrete random variables.
neo-mblog
A micro-blogging web application powered by Neo4j
ngender
Guesses gender from a name
reverb
Web-Scale Open Information Extraction
scir-training-day
A small training program for new members of HIT-SCIR
word-clustering
Implementation of monolingual and multilingual word clustering algorithms, mostly in Python