ZhaoQiiii

ZhaoQiiii's starred repositories

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Jupyter Notebook | MIT | 7608 76 203

mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

Language: Python | Apache-2.0 | 5846 55 1479

cook

🍲 OK, Let's Cook!

Language: Vue | MIT | 5070 23 47

Wonder3D

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

Language: Python | AGPL-3.0 | 4799 47 180

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language: Python | MIT | 4379 61 96

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python | MIT | 3068 36 231

Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language: Python | Apache-2.0 | 2989 28 185

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language: Python | Apache-2.0 | 2804 470

DynamiCrafter

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language: Python | Apache-2.0 | 2581 32 132

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language: Python | Apache-2.0 | 2521 43 389

meditron

Meditron is a suite of open-source medical Large Language Models (LLMs).

Language: Python | Apache-2.0 | 1887 32 29

Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

1805 53 14

garss

Collects RSS feeds with GitHub Actions to build an ad-free front page of high-quality headlines

Language: Python | 1202 38 13

Show-1

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Language: Python | NOASSERTION | 1103 39 19

SyncDreamer

[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

Language: Python | MIT | 906 22 69

Yuan-2.0

Yuan 2.0 Large Language Model

Language: Python | NOASSERTION | 681 5 93

dobb-e

Dobb·E: An open-source, general framework for learning household robotic manipulation

Language: G-code | MIT | 576 15 7

OpenLane-V2

[NeurIPS 2023 Track Datasets and Benchmarks] OpenLane-V2: The First Perception and Reasoning Benchmark for Road Driving

Language: Jupyter Notebook | Apache-2.0 | 557 21 108

LLMRiddles

Open-Source Reproduction/Demo of the LLM Riddles Game

Language: Python | Apache-2.0 | 525 5 8

X-Pose

[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"

Language: Python | NOASSERTION | 503 22 32

agentlego

Enhance LLM agents with rich tool APIs

Language: Python | Apache-2.0 | 351 9 7

InterpAny-Clearer

[ECCV2024 Oral] Clearer anytime frame interpolation & Manipulated interpolation of anything

Language: Python | MIT | 229 6 18

stable-video-diffusion-colab

Language: Jupyter Notebook | 212 8 5

resume

Chinese and English personal résumés compiled with LaTeX

Language: TeX | MIT | 210 40

BotChat

Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.

Language: Jupyter Notebook | Apache-2.0 | 139 2 1

Video-Bench

A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!

Language: Python | 117 3 9

DI-1024

1024 + Deep Reinforcement Learning (Deep Reinforcement Learning + 1024 Game / 2048 Game)

Language: Python | Apache-2.0 | 114 2 1

CodeMorpheus

CodeMorpheus: Generate code self-portraits with one click (decision-making AI + generative AI)

Language: Python | Apache-2.0 | 47 2 1

BiSTNet

30 6 4

NTIRE23-VIDEO-COLORIZATION

Language: Python | 18 2 13