William Phan's starred repositories

ragapp

The easiest way to use Agentic RAG in any enterprise

Language:TypeScriptLicense:Apache-2.0Stargazers:1423Issues:0Issues:0

Dot

Text-To-Speech, RAG, and LLMs. All local!

Language:JavaScriptLicense:GPL-3.0Stargazers:1009Issues:0Issues:0

distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Language:PythonLicense:Apache-2.0Stargazers:997Issues:0Issues:0

bonito

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Language:PythonLicense:BSD-3-ClauseStargazers:537Issues:0Issues:0

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Language:PythonLicense:MITStargazers:692Issues:0Issues:0

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonLicense:MITStargazers:449Issues:0Issues:0

llm-data-creation

Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"

Language:PythonLicense:MITStargazers:76Issues:0Issues:0

awesome-deep-learning-papers

The most cited deep learning papers

Language:TeXStargazers:25192Issues:0Issues:0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License:CC0-1.0Stargazers:15133Issues:0Issues:0

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1171Issues:0Issues:0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:11962Issues:0Issues:0

dora

Dataflow-Oriented Robotic Application is middleware that streamlines and simplifies the creation of AI-based robotic applications with low latency, composable, and distributed dataflow.

Language:RustLicense:Apache-2.0Stargazers:1151Issues:0Issues:0

minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Language:PythonLicense:MITStargazers:2763Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:19406Issues:0Issues:0

pytorch-rl

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

Language:Jupyter NotebookLicense:MITStargazers:256Issues:0Issues:0

Simple-MuJoCo-PickNPlace

Very simple MuJoCo Pick and Place task using Panda

Language:PythonStargazers:7Issues:0Issues:0

mujoco_menagerie

A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1049Issues:0Issues:0

MuJoCo_RL_UR5

A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.

Language:PythonLicense:MITStargazers:351Issues:0Issues:0

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonLicense:MITStargazers:5962Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:15430Issues:0Issues:0

Robot-Learning-UT

Simulation of a neural network model using Deep Deterministic Policy Gradient (DDPG) improved with Hindsight Experience Replay (HER) in the Fetch Reach and Pick and Place environments of Gym Open AI.

Language:PythonLicense:MITStargazers:8Issues:0Issues:0

hindsight-experience-replay

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

Language:PythonLicense:MITStargazers:377Issues:0Issues:0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:8121Issues:0Issues:0

rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Language:PythonLicense:MITStargazers:1824Issues:0Issues:0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonLicense:NOASSERTIONStargazers:4621Issues:0Issues:0

CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Language:PythonLicense:Apache-2.0Stargazers:401Issues:0Issues:0

Hands-On-Reinforcement-Learning-With-Python

Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow

Language:Jupyter NotebookStargazers:827Issues:0Issues:0

MuJoCo-Tutorial

Tutorial on how to get started with MuJoCo Simulation Platform. MuJoCo stands for Multi-Joint dynamics with Contact. It was acquired and made freely available by DeepMind in October 2021, and open sourced in May 2022. Feel free to contribute. Show your support by ✨this repository.

Language:Jupyter NotebookLicense:MITStargazers:113Issues:0Issues:0

tokencost

Easy token price estimates for LLMs

Language:PythonLicense:MITStargazers:213Issues:0Issues:0
Language:PythonLicense:MITStargazers:467Issues:0Issues:0