Yan Xiaole's repositories
spark-imbalanced-learn
Spark Toolbox for imbalanced dataset in machine learning
airbyte
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
alfred2-password-generator
alfred workflow to generate some random memorable passwords
arrow
Better dates & times for Python
caltech_ml
caltech machine learning homework
camus
Mirror of Linkedin's Camus
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Capslock
ultimate macOS keyboard re-mapping
char-rnn
Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch
creamsoda
spark imitate exercise
cs109-dl-videos
Shell script to scrape Harvard CS109 (Intro to Data Science) lecture videos
db-readings
Readings in Databases
geektime-nginx
极客时间:nginx核心知识100讲配置文件与代码分享
google-interview-university
A complete daily plan for studying to become a Google software engineer.
imbalanced-learn
Python module to perform under sampling and over sampling with various techniques.
myhpsc
coursera uwhpsc homework
pinyin.py
汉字转拼音,With Python
scikit-learn
scikit-learn: machine learning in Python
scipy_2015_sklearn_tutorial
Scikit-Learn tutorial material for Scipy 2015
seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
snakebite
A pure python HDFS client
spark
Apache Spark - A unified analytics engine for large-scale data processing
spark-avro
Using Avro data format in Spark, SQL, and DataFrames
spark-redshift
Redshift data source for Spark
streamingpro
Unify Big Data and Machine Learning.
torch-rnn
Efficient, reusable RNNs and LSTMs for torch
UCI
one UCI repo DA per week