jason_1i's repositories
awesome-mlops
A curated list of references for MLOps
Branching-out-of-the-Notebook
This repository will take you through creating a FastAPI StableDiffusion app (including Dockerfile) all the way to adding a new feature using industry standard branch development!
bullet3
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
client
DAGsHub client libraries
CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC
course22p2
course.fast.ai 2022 part 2 - under construction
DI-star
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
ElegantRL
Cloud-native Deep Reinforcement Learning. 🔥
FastChat
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.
FinRL-Meta
FinRL-Meta: Data-Driven Metaverse for Financial Reinforcement Learning. 🔥
GPTeam
GPTeam: An open-source multi-agent simulation
langchain
⚡ Building applications with LLMs through composability ⚡
LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
optuna
A hyperparameter optimization framework
panda-gym
Set of robotic environments based on PyBullet physics engine and gymnasium.
Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
reincarnating_rl
[NeurIPS 2022] Open source code for reusing prior computational work in RL.
river
🌊 Online machine learning in Python
rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
s2client-proto
StarCraft II Client - protocol definitions used to communicate with StarCraft II.
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
serving
A flexible, high-performance serving system for machine learning models
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
trax
Trax — Deep Learning with Clear Code and Speed