Eric Hahn's repositories
pylint
It's not just a linter that annoys you!
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
spark
Apache Spark - A unified analytics engine for large-scale data processing
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
mlflow
Open source platform for the machine learning lifecycle
gensim
Topic Modelling for Humans
SynapseML
Simple and Distributed Machine Learning
nltk
NLTK Source
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
dask
Parallel computing with task scheduling
faiss
A library for efficient similarity search and clustering of dense vectors.
LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
featuretools
An open source python library for automated feature engineering
kubeflow
Machine Learning Toolkit for Kubernetes