JAY LEE's repositories
DataScienceCourse
数据科学与大数据--python入门与爬虫
Topic_Evolution_developing
主题模型演化的开发测试
cs224n_2019
My_Homework_cs224n
Wechat_chat_bot
微信聊天机器人v-0.2
Best_README_template
🌩最好的中文README模板⚡️Best README template
crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
data_mining_models
Basic data mining model, including feature importance display
docker-hadoop
Apache Hadoop docker image
docker-jmeter
Docker image for Apache JMeter
greenplum-oss-docker
Greenplum OSS docker
Greenplum-tutorial
GP的部署学习记录
JA1lE1.github.io
BY Blog ->
Kubernetes
kerbernetes学习
kubernetes-handbook
Kubernetes中文指南/云原生应用架构实践手册 - https://jimmysong.io/kubernetes-handbook
Lda2vec-Tensorflow
Tensorflow 1.5 implementation of Chris Moody's Lda2vec, adapted from @meereeum
LightGBM-binary-classification-example
A model that predicts the default rate of credit card holders using the LightGBM classifier. Trained the LightGBM classifier with Scikit-learn's GridSearchCV.
mybatis-3
MyBatis SQL mapper framework for Java
mysql-tutorial
🌱 This is a tutorial of MySQL. In this tutorial, you can leran how to use MySQL and optimize SQL.
nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
platform-ds
This is a quick-and-dirty data analytics platform based on Spark, Hadoop and Jupyterhub. All this tools are deployed automatically with docker and docker-compose.
postgresql-tutorial
Postgresql- 学习
pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Python_Full_Stack_Developer
python开发
stopwords
中文常用停用词表(哈工大停用词表、百度停用词表等)
studyFiles
一些经典且高质量的电子书分享
time_news_crawl
带有时间信息的新闻文本的爬取