MrClownC's starred repositories

Language: Python | Stargazers: 1725 | Issues: 0

ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Language: Go | License: MIT | Stargazers: 85422 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 24222 | Issues: 0

openvla

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Language: Python | License: MIT | Stargazers: 841 | Issues: 0

LIV

Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)

Language: Python | License: MIT | Stargazers: 77 | Issues: 0

DecisionNCE

[ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"

Language: Python | License: MIT | Stargazers: 61 | Issues: 0

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language: Python | License: MIT | Stargazers: 2145 | Issues: 0

rl_games

RL implementations

Language: Jupyter Notebook | License: MIT | Stargazers: 829 | Issues: 0

Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Language: Jupyter Notebook | License: MIT | Stargazers: 2759 | Issues: 0

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 45 | Issues: 0

powerlevel10k

A Zsh theme

Language: Shell | License: MIT | Stargazers: 45091 | Issues: 0

IsaacGymEnvs

Isaac Gym Reinforcement Learning Environments

Language: Python | License: NOASSERTION | Stargazers: 1850 | Issues: 0

CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Language: Python | License: Apache-2.0 | Stargazers: 439 | Issues: 0

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 736 | Issues: 0

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python | License: NOASSERTION | Stargazers: 5076 | Issues: 0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python | License: MIT | Stargazers: 8572 | Issues: 0

LLM-Tuning

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

Language: HTML | Stargazers: 952 | Issues: 0

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 25558 | Issues: 0

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 5952 | Issues: 0

ReMax

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

Language: Python | Stargazers: 139 | Issues: 0

open_clip

An open source implementation of CLIP.

Language: Python | License: NOASSERTION | Stargazers: 9553 | Issues: 0

RoboFlamingo

Code for RoboFlamingo

Language: Python | License: MIT | Stargazers: 275 | Issues: 0

smac

SMAC: The StarCraft Multi-Agent Challenge

Language: Python | License: MIT | Stargazers: 1058 | Issues: 0

open_flamingo

An open-source framework for training large multimodal models.

Language: Python | License: MIT | Stargazers: 3612 | Issues: 0

Language: Jupyter Notebook | Stargazers: 84 | Issues: 0

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language: Jupyter Notebook | License: MIT | Stargazers: 2554 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 9010 | Issues: 0

RVT

Official Code for RVT-2 and RVT

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 247 | Issues: 0

3d_diffuser_actor

Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"

Language: Python | License: MIT | Stargazers: 182 | Issues: 0

calvin

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Language: Python | License: MIT | Stargazers: 340 | Issues: 0