Ryan Paul Gozum's repositories
next-word-prediction
Generative Pretrained Transformer 2 (GPT-2) for Language Modeling using the PyTorch-Transformers library.
dexpression-pytorch
A PyTorch implementation of DeXpression for facial expression recognition.
quant-trading
Python trading and backtesting platform.
faiss
A library for efficient similarity search and clustering of dense vectors.
python-orb
Reusable Python orb for your CircleCI pipeline.
sparkit-learn
PySpark + Scikit-learn = Sparkit-learn
BentoML
Unified Model Serving Framework 🍱
convolution-visualizer
Convolution visualizations
datasets
🤗 Fast, efficient, open-access datasets and evaluation metrics for Natural Language Processing and more in PyTorch, TensorFlow, NumPy and Pandas
flink-training
Apache Flink Training Excercises
hey
HTTP load generator, ApacheBench (ab) replacement, formerly known as rakyll/boom
kedro
A Python framework for creating reproducible, maintainable and modular data science code.
lazynlp
Library to scrape and clean web pages to create massive datasets.
lightning-flash
Collection of tasks for fast prototyping, baselining, finetuning and solving problems with deep learning.
onnx
Open standard for machine learning interoperability
Orb-Project-Template
A starter template for orb projects. Build, test, and publish orbs automatically on CircleCI.
regex-notebooks
Sample usage of the Python module for regular expressions.
SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in combination with self-training and knowledge-distillation, or for retrieving paraphrases.
simpletransformers
Transformers for Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
spark_submit_airflow
Simple repo to demonstrate how to submit a spark job to EMR from Airflow
tf-quant-finance
High-performance TensorFlow library for quantitative finance.
tokenizers
💥Fast State-of-the-Art Tokenizers optimized for Research and Production
tpot
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
txtai
AI-powered search engine
word-counter
An API which counts how many times a word exists in the webpage source.
yolov5
YOLOv5 in PyTorch > ONNX > CoreML > TFLite