Wang-Xiaodong1899

Xiaodong Wang's starred repositories

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language: Jupyter Notebook | License: CC-BY-4.0 | 29170 357 1522

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | 24401 191 3861

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language: Python | License: Apache-2.0 | 18230 170 1248

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

5906 122 12

AgentVerse

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

Language: JavaScript | License: Apache-2.0 | 3932 57 76

pytorch-fid

Compute FID scores with PyTorch.

Language: Python | License: Apache-2.0 | 3275 15 85

UniAD

[CVPR'23 Best Paper Award] Planning-oriented Autonomous Driving

Language: Python | License: Apache-2.0 | 3182 34 171

BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Language: Python | License: Apache-2.0 | 3130 69 260

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

2942 124 18

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language: Python | License: MIT | 2438 34 260

nuscenes-devkit

The devkit of the nuScenes dataset.

Language: Python | License: NOASSERTION | 2210 51 780

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language: Makefile | License: MIT | 1352 23 32

tomesd

Speed up Stable Diffusion with this one simple trick!

Language: Python | License: MIT | 1252 19 48

FateZero

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Language: Jupyter Notebook | License: MIT | 1080 14 33

text2room

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

Language: Python | License: MIT | 998 10 31

UniRepLKNet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Language: Python | License: Apache-2.0 | 877 12 18

Awesome-LLM4AD

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

License: Apache-2.0 | 814 34 5

Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language: Python | License: Apache-2.0 | 736 7 54

AGIEval

Language: Python | License: MIT | 674 9 27

VAD

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

Language: Python | License: Apache-2.0 | 540 27 71

Awesome-Papers-Autonomous-Agent

A collection of recent papers on building autonomous agents. Two topics included: RL-based / LLM-based agents.

494 9 1

MovieChat

[CVPR 2024] 🎬💭 chat with over 10K frames of video!

Language: Python | License: BSD-3-Clause | 471 10 71

Agent-Attention

Official repository of Agent Attention (ECCV2024)

Language: Python | 448 3 38

TATS

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Language: Python | License: MIT | 259 11 31

ChatEval

Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"

Language: Python | License: Apache-2.0 | 216 3 8

multi-lora-fine-tune

Provide Efficient LLM Fine-Tune via Multi-LoRA Optimization

Language: Python | License: Apache-2.0 | 190 3 42

DriveMLM

140 18 5

ReCo

ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023

Language: Jupyter Notebook | License: MIT | 112 5 10

InstructionGPT-4

Language: Python | License: MIT | 35 10

HealGPT

Language: Python | License: Apache-2.0 | 4 10