NagisaZj's repositories
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
ContextWM
Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://arxiv.org/abs/2305.18499
decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
diffusion_policy
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
diffusion_reward
[arXiv'23] Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"
dreamerv3
Mastering Diverse Domains through World Models
DrM
DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements in sample efficiency and asymptotic performance across diverse domains.
evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Graphormer
Graphormer is a general-purpose deep learning backbone for molecular modeling.
HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
hypnettorch
Package for working with hypernetworks in PyTorch.
icl-alignment
Is In-Context Learning Sufficient for Instruction Following in LLMs?
llama3
The official Meta Llama 3 GitHub site
MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2
octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
opro
official code for "Large Language Models as Optimizers"
universal_manipulation_interface
Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots
viper_rl
Using advances in generative modeling to learn reward functions from unlabeled videos.