Shan Dou's repositories
amazon-sagemaker-examples
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
awesome-machine-learning-interpretability
A curated list of awesome machine learning interpretability resources.
awesome-search
Awesome Search - this is all about the (e-commerce) search and its awesomeness
codespell
check code for common misspellings
course-nlp
A Code-First Introduction to NLP course
design-patterns-for-humans
An ultra-simplified explanation to design patterns
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
faiss
A library for efficient similarity search and clustering of dense vectors.
fastbook
The fastai book, published as Jupyter Notebooks
feature_engine
Feature engineering package with sklearn like functionality
JamSpell
Modern spell checking library - accurate, fast, multi-language
kaggle-solutions
🏅 Collection of Kaggle Solutions and Ideas 🏅
kglab
Graph Data Science: an abstraction layer in Python for building knowledge graphs, integrated with popular graph libraries – atop Pandas, NetworkX, RAPIDS, RDFlib, pySHACL, PyVis, morph-kgc, pslpython, pyarrow, etc.
machine-learning-imbalanced-data
Code repository for the online course Machine Learning with Imbalanced Data
ptranking
Learning to Rank in PyTorch
pyspellchecker
Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/
pytorchTutorial
PyTorch Tutorials from my YouTube channel
scikit-learn
scikit-learn: machine learning in Python
scikit-survival
Survival analysis built on top of scikit-learn
trax
Trax — Deep Learning with Clear Code and Speed
tutorials-1
CatBoost tutorials repository
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow