jason_1i's repositories

awesome-mlops

A curated list of references for MLOps

Stargazers:0Issues:0Issues:0

Branching-out-of-the-Notebook

This repository will take you through creating a FastAPI StableDiffusion app (including Dockerfile) all the way to adding a new feature using industry standard branch development!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

client

DAGsHub client libraries

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CORL

High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

course22p2

course.fast.ai 2022 part 2 - under construction

License:Apache-2.0Stargazers:0Issues:0Issues:0

DI-star

An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.

License:Apache-2.0Stargazers:0Issues:0Issues:0

ElegantRL

Cloud-native Deep Reinforcement Learning. 🔥

License:NOASSERTIONStargazers:0Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.

License:Apache-2.0Stargazers:0Issues:0Issues:0

FinRL-Meta

FinRL­-Meta: Data-Driven Metaverse for Financial Reinforcement Learning. 🔥

License:MITStargazers:0Issues:0Issues:0

GPTeam

GPTeam: An open-source multi-agent simulation

License:MITStargazers:0Issues:0Issues:0

langchain

⚡ Building applications with LLMs through composability ⚡

License:MITStargazers:0Issues:0Issues:0

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

License:MITStargazers:0Issues:0Issues:0

ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

License:NOASSERTIONStargazers:0Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

License:MITStargazers:0Issues:0Issues:0

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

License:Apache-2.0Stargazers:0Issues:0Issues:0

optuna

A hyperparameter optimization framework

License:NOASSERTIONStargazers:0Issues:0Issues:0

panda-gym

Set of robotic environments based on PyBullet physics engine and gymnasium.

License:MITStargazers:0Issues:0Issues:0

Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

License:Apache-2.0Stargazers:0Issues:0Issues:0

reincarnating_rl

[NeurIPS 2022] Open source code for reusing prior computational work in RL.

License:Apache-2.0Stargazers:0Issues:0Issues:0

river

🌊 Online machine learning in Python

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

License:MITStargazers:0Issues:0Issues:0

s2client-proto

StarCraft II Client - protocol definitions used to communicate with StarCraft II.

License:MITStargazers:0Issues:0Issues:0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

License:Apache-2.0Stargazers:0Issues:0Issues:0

serving

A flexible, high-performance serving system for machine learning models

License:Apache-2.0Stargazers:0Issues:0Issues:0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

License:MITStargazers:0Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

License:Apache-2.0Stargazers:0Issues:0Issues:0

trax

Trax — Deep Learning with Clear Code and Speed

License:Apache-2.0Stargazers:0Issues:0Issues:0