haoybl's repositories
allennlp
An open-source NLP research library, built on PyTorch.
bqplot
Plotting library for IPython/Jupyter Notebooks
DBDA-python
Doing Bayesian Data Analysis, 2nd Edition (Kruschke, 2015): Python/PyMC3 code
DeepRec
推荐、广告工业界经典以及最前沿的论文、资料集合/ Must-read Papers on Recommendation System and CTR Prediction
demo-spark-analytics
Demo about realtime analytics of user behavior using elk stack/apache spark streaming+mllib/redis/slamdata
funNLP
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌&零件词库、时间抽取、连续英文切割、中文词向量大全、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、历史名人词库、诗词词库、医学词库、饮食词库、法律词库、汽车词库、动物词库、中文聊天语料、中文谣言数据。
Image-category-understanding-and-application
此库为2017-2018年度工程实践项目,主要目的是能够识别图像类别,尤其是医学类,然后在医学类中再进行更为细致的类别识别,以达到医学影像这一垂直领域的应用目的。
Kaggle-Competition
Grant dataset
odo
Data Migration for the Blaze Project
orange3
:tangerine: Orange 3 data mining suite: http://orange.biolab.si
pentaho-kettle
Pentaho Data Integration ( ETL ) a.k.a Kettle
power-architect
Automatically exported from code.google.com/p/power-architect
pyexcel-io
One interface to read and write the data in various excel formats, import the data into and export the data from databases
Python-Algorithms
All Algorithms implemented in Python
python-cx_Oracle
Python interface to Oracle Database conforming to the Python DB API 2.0 specification.
Quadtree_KNN
基于Quadtree的KNN最近邻算法
Reco-papers
Classic papers and resources on recommendation
scikit-feature
open-source feature selection repository in python (DMML Lab@ASU)
Scrapy-Zhihu-Internet-Practitioners
运用Scrapy框架,采用递归算法,爬取知乎海量用户信息,用于分析互联网从业者的地理分布。
soar
SQL Optimizer And Rewriter
sparkit-learn
PySpark + Scikit-learn = Sparkit-learn
sqlalchemy
See the development link for contribution guidelines
StatisticalRethinkingPythonWithPyMC
A selection of examples and exercises from R. McElreath's Statistical Rethinking book on Bayesian data analysis.
stats-dvb
Notes and Exercises for 'Stats - Data and Models' (De Veaux, Velleman, Bock)