JP's starred repositories

transformers

πŸ€— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:131789Issues:1117Issues:15645

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:34655Issues:342Issues:2712

applied-ml

πŸ“š Papers & tech blogs by companies sharing their work on data science & machine learning in production.

dash

Data Apps & Dashboards for Python. No JavaScript Required.

Language:PythonLicense:MITStargazers:21151Issues:418Issues:1770

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8530Issues:131Issues:1077

optimum

πŸš€ Accelerate training and inference of πŸ€— Transformers and πŸ€— Diffusers with easy to use hardware optimization tools

Language:PythonLicense:Apache-2.0Stargazers:2446Issues:56Issues:731

transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for πŸ€— Hugging Face transformer models πŸš€

Language:PythonLicense:Apache-2.0Stargazers:1642Issues:27Issues:121

transformers-interpret

Model explainability that works seamlessly with πŸ€— transformers. Explain your transformers model in just 2 lines of code.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1261Issues:20Issues:76

baal

Bayesian active learning library for research and industrial usecases.

Language:PythonLicense:Apache-2.0Stargazers:858Issues:18Issues:114

DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

transformers-tutorials

Github repo with tutorials to fine tune transformers for diff NLP tasks

Language:Jupyter NotebookLicense:MITStargazers:814Issues:11Issues:24

NL-Augmenter

NL-Augmenter 🦎 β†’ 🐍 A Collaborative Repository of Natural Language Transformations

Language:PythonLicense:MITStargazers:770Issues:23Issues:52

Few-NERD

Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"

Language:PythonLicense:Apache-2.0Stargazers:385Issues:12Issues:44

ISIC-Archive-Downloader

A script to download the ISIC Archive of lesion images

Language:PythonLicense:Apache-2.0Stargazers:198Issues:8Issues:30

label-studio-transformers

Label data using HuggingFace's transformers and automatically get a prediction service

Language:PythonLicense:Apache-2.0Stargazers:174Issues:9Issues:14

d3graph

Creation of interactive networks using d3 Javascript

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:171Issues:7Issues:33

diffbot-python-client

Python Diffbot API Client

Language:PythonStargazers:118Issues:16Issues:0

EUNN-tensorflow

Efficient Unitary Neural Network(EUNN) implementation in Tensorflow

Language:PythonLicense:MITStargazers:74Issues:10Issues:3

Interpret

The PyTorch implementation the Smooth Grad [https://arxiv.org/pdf/1706.03825.pdf] and Integrated Gradients [https://arxiv.org/pdf/1703.01365.pdf] for NLP Models.

FrenchLefffLemmatizer

A French Lemmatizer in Python based on the LEFFF

Language:PythonLicense:NOASSERTIONStargazers:34Issues:1Issues:1

torch_eunn

A Pytorch implementation of an efficient unitary neural network (https://arxiv.org/abs/1612.05231)

Language:PythonLicense:MITStargazers:32Issues:3Issues:0

AMOS

[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators

Language:PythonLicense:MITStargazers:24Issues:5Issues:2

benchmark-for-transformers

Tool for easily comparing and evaluating the performance of transformers under different scenarios.

Language:PythonLicense:MITStargazers:6Issues:1Issues:0

nlu_data_diets

NLU on Data Diets

Language:PythonLicense:Apache-2.0Stargazers:6Issues:2Issues:0

toxic-comment-server

Models to detect hateful comments served with flask trained on Kaggle's Toxic Comment Classification Challenge dataset.

Language:PythonStargazers:4Issues:2Issues:0

covid19-transmission-ukf

With this repository, I derive the time-dependent R0 coefficient of the COVID-19 with the Unscented Kalman Filter from the data gathered by John Hopkins assuming the SEIR model.

Language:PythonLicense:MITStargazers:3Issues:2Issues:2

pure-matrix

Pure python matrix code to do algebra with PCA (naive power iteration) and KMean (random initialization) implementations.

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

orsum2020_collaborative_datasets

Anonymized train set and test set used in RecSys2020 experiment to optimize the hyperparameters.