Runze Liu's repositories
CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
dpss-exp3-VC-BNF
Voice Conversion Experiments for THUHCSI Course : <Digital Processing of Speech Signals>
eat_pytorch_in_20_days
Pytorch🍊🍉 is delicious, just eat it! 😋😋
RyanLiu112.github.io
Personal website
eai-vc
The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).
factor-world
Decomposing the Generalization Gap in Imitation Learning for Visual Robotic Manipulation (2023)
ITTS
Code for "Information-theoretic Task Selection for Meta-Reinforcement Learning".
LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
LLaVA
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
mPLUG-Owl
mPLUG-Owl🦉: Modularization Empowers Large Language Models with Multimodality
open_flamingo
An open-source framework for training large multimodal models.
pykan
Kolmogorov Arnold Networks
PyRep
A toolkit for robot learning research.
rl-teacher
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
rl-teacher-tf
Open source implementation of "Deep Reinforcement Learning from Human Preferences", updating with evolutionary strategies and augmented morphologies from "Reinforcement Learning for Improved Agent Design"
RLBench
A large-scale benchmark and learning environment.
rune
Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
RyanLiu112
Config files for my GitHub profile.
Video-LLaMA
Video-LLaMA: An Instruction-Finetuned Visual Language Model for Video Understanding
vip
Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"
VTF_PAR
[CVPR-2023 Workshop@NFVLR] Official PyTorch implementation of Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition