bruce's repositories
jd_text_analysis
used to analysis JD comments, use nlp techiques
abnormal_enterprise_detection
use model to find out abnormal enterprise
char-rnn
Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch
dgk_lost_conv
dgk_lost_conv 中文对白语料 chinese conversation corpus
movie_data
description of movie data,data derived from kaggle.com
population-analysis-
create 2D mapview by R, utilize multiple packagess(shiny etc)
pytorch_ctpn
This is a pytorch implementation of CTPN(Detecting Text in Natural Image with Connectionist Text Proposal Network). You may want to finetune from: https://drive.google.com/open?id=1JHhI4sEIXfs5gDa1I9AgJBY477HTzAd0
scrapy
web crawler use python3.5
similar_company
to describe similarity among companies, parameters including person,company_name,address
TextInfoExp
自然语言处理相关实验(基于sougou数据集),包含文本特征提取(TF-IDF),文本分类,文本聚类,word2vec训练词向量及同义词词林中文词语相似度计算、文档自动摘要,信息抽取,情感分析与观点挖掘等。
thulac_practice
company commit
tianchi_buyerprediction
use tianchi_buyer prediction data, to predict whether a buyer is a repeat buyer or not
wechat_jump
used for automatic wechat jump game
weibospider
:zap: A distributed crawler for weibo, building with celery and requests.