XuweiyiChen

Xuweiyi Chen's repositories

UniCtrl

Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control

MIT53 4 3

Pix2Gif

Language:Python600

ArCHer

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

Language:Python000

blender

Official mirror of Blender

NOASSERTION000

busy_gpu

Language:Shell000

CameraCtrl

Language:PythonApache-2.0000

ControlNet

Let us control diffusion models!

Language:PythonApache-2.0000

DAT

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

Language:PythonApache-2.0000

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Apache-2.0000

deformable-attention

Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention"

MIT000

dream-in-4d

Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]

Language:PythonNOASSERTION000

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language:PythonNOASSERTION000

ELLA

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Apache-2.0000

embodied-generalist

Official code repository for 3D embodied generalist agent LEO

Language:PythonMIT000

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookMIT000

grok-1

Grok open release

Language:PythonApache-2.0000

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonApache-2.0000

InternVL

[CVPR 2024] InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks —— An Open-Source Alternative to ViT-22B

Language:Jupyter NotebookMIT000

IsaacLab

Unified framework for robot learning built on NVIDIA Isaac Sim

NOASSERTION000

jtd-remote

Example of Just the Docs as a remote theme

Language:SCSS000

LLaVA_Attn_Control

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.0000

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonMIT000

ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

000

StoryDiffusion

Create Magic Story!

Apache-2.0000

transformer-debugger

Language:PythonMIT000

transformers_attn_control

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0000

TripoSR

Language:PythonMIT000

trl

Train transformer language models with reinforcement learning.

Language:PythonApache-2.0000

VideoCrafter

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

Language:PythonNOASSERTION000

zeno-hub-uva

AI Evaluation Platform

MIT000