bn's repositories
500lines
500 Lines or Less
storm
Mirror of Apache Storm
hadoop-book
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
ansj_seg
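ansj word segmentation: a true Java implementation of ICT. Both segmentation quality and speed exceed the open-source version of ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries.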
book
Study notes
bootstrap
The most popular front-end framework for developing responsive, mobile-first projects on the web.
cola
A distributed crawling framework.
word2vec
Python interface to Google word2vec
word2vec-1
Word2Vec in C++11
TextRank-3
Java implementation of keyword extraction with the TextRank algorithm
Word2VEC_java
A Java implementation of word2vec
RAKE
A Python implementation of the Rapid Automatic Keyword Extraction (RAKE) algorithm
textrank-1
Keyword extraction using the TextRank topic model
AppInfoCrawler
Get a list of packages by crawling Play Store
coursera
Script for downloading Coursera.org videos and naming them.
TextRank
Python implementation of TextRank algorithm (http://acl.ldc.upenn.edu/acl2004/emnlp/pdf/Mihalcea.pdf) for automatic keyword extraction and summarization using Levenshtein distance as relation between text units.
AutomaticKeyphraseExtraction
Data for Automatic Keyphrase Extraction Task
manber-introduction-to-algorithms-solutions
Collection of solutions for the exercises proposed in Udi Manber's book: Introduction to Algorithms -- A Creative Approach.
hexo
A fast, simple & powerful blog framework, powered by Node.js.
zh-google-styleguide
Google open-source project style guides (Chinese edition)
HiTune
HiTune is a Hadoop performance analyzer. See troubleshooting and known issues here
textrank-2
Java implementation of the TextRank algorithm by Mihalcea, et al. http://lit.csci.unt.edu/index.php/Graph-based_NLP