Xuweiyi Chen's repositories
ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
blender
Official mirror of Blender
ControlNet
Let us control diffusion models!
DAT
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
deformable-attention
Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"
dream-in-4d
Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]
dust3r
DUSt3R: Geometric 3D Vision Made Easy
ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
embodied-generalist
Official code repository for 3D embodied generalist agent LEO
FeatUp
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
grok-1
Grok open release
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
InternVL
[CVPR 2024] InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks —— An Open-Source Alternative to ViT-22B
IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim
jtd-remote
Example of Just the Docs as a remote theme
LLaVA_Attn_Control
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
open_flamingo
An open-source framework for training large multimodal models.
ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
StoryDiffusion
Create Magic Story!
transformers_attn_control
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
trl
Train transformer language models with reinforcement learning.
VideoCrafter
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
zeno-hub-uva
AI Evaluation Platform