willdphan

William Phan's starred repositories

ragapp

The easiest way to use Agentic RAG in any enterprise

Language:TypeScriptApache-2.0142300

Dot

Text-To-Speech, RAG, and LLMs. All local!

Language:JavaScriptGPL-3.0100900

distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Language:PythonApache-2.099700

bonito

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Language:PythonBSD-3-Clause53700

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Language:PythonMIT69200

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonMIT44900

llm-data-creation

Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"

Language:PythonMIT7600

awesome-deep-learning-papers

The most cited deep learning papers

Language:TeX2519200

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

CC0-1.01513300

torchtitan

A native PyTorch Library for large model training

Language:PythonBSD-3-Clause117100

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonMIT1196200

dora

Dataflow-Oriented Robotic Application is middleware that streamlines and simplifies the creation of AI-based robotic applications with low latency, composable, and distributed dataflow.

Language:RustApache-2.0115100

minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Language:PythonMIT276300

llm.c

LLM training in simple, raw C/CUDA

Language:CudaMIT1940600

pytorch-rl

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

Language:Jupyter NotebookMIT25600

Simple-MuJoCo-PickNPlace

Very simple MuJoCo Pick and Place task using Panda

Language:Python700

mujoco_menagerie

A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.

Language:Jupyter NotebookNOASSERTION104900

MuJoCo_RL_UR5

A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.

Language:PythonMIT35100

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonMIT596200

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT1543000

Robot-Learning-UT

Simulation of a neural network model using Deep Deterministic Policy Gradient (DDPG) improved with Hindsight Experience Replay (HER) in the Fetch Reach and Pick and Place environments of Gym Open AI.

Language:PythonMIT800

hindsight-experience-replay

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

Language:PythonMIT37700

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonMIT812100

rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Language:PythonMIT182400

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonNOASSERTION462100

CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Language:PythonApache-2.040100

Hands-On-Reinforcement-Learning-With-Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Language:Jupyter Notebook82700

Tutorial on how to get started with MuJoCo Simulation Platform. MuJoCo stands for Multi-Joint dynamics with Contact. It was acquired and made freely available by DeepMind in October 2021, and open sourced in May 2022. Feel free to contribute. Show your support by ✨this repository.

Language:Jupyter NotebookMIT11300

tokencost

Easy token price estimates for LLMs

Language:PythonMIT21300

act

Language:PythonMIT46700