Wei He's repositories
2014-Alibaba-Big-Data-Competition
2014 Alibaba Big Data Competition, Team leader, implemented a recommendation algorithm with logistic regression and participated in final season(500 out of over 5000)
2014-NovelData
The python scripts used for processing novel data.
2012-BPlusTree
BplusTree according to sqlite
2012-HFFSM
a algorithm HFFSM based on FFSM is presented. First equally convert directed multigraphs to undirected simple graphs, and then construct suboptimal CAM tree and mine frequent subgraphs with the top-down depth-first method. The empirical study shows that HFFSM can deal with directed multigraphs well, reduce false alarms when the algorithm is applied, and is a little better than FFSM in efficiency, but more useful than FSSM
2013-Assessing-Single-pair-Similarity-over-Graphs
a more efficient way to access single-pair similarity over graphs
2014-Automatic-Tagging-in-Social-Networks
Automatic tagging users in social networks according to their interests
ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
atom
The hackable editor
Automatic-Tagging-in-Social-Networks
Automatic Tagging in Social Networks
ConfigCenter
An interface for config management with zookeeper
for_scikit
a long way to expert
GoogleSearchCrawler
a tool for crawl Google search results
howdoi
howdoi - instant coding answers via the command line
meibenjin.github.io
Resume
MultitouchAttributuion
One method of our way to credit attribution
nstools
Some meaningless nscripter tools.
scikit-learn
scikit-learn: machine learning in Python
snippet
snippet code for personnal
WeiboCrawler
新浪微博搜索工具