Jiaqi Wang (myownskyW7)

Company: @open-mmlab

Location: Hong Kong

Home Page: https://myownskyw7.github.io/

Jiaqi Wang's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 67030 | Issues: 559 | Issues: 0

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | Stargazers: 30305 | Issues: 172 | Issues: 494

LLM101n

LLM101n: Let's build a Storyteller

mem0

The memory layer for Personalized AI

Language: Python | License: Apache-2.0 | Stargazers: 20647 | Issues: 121 | Issues: 610

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language: Python | License: MIT | Stargazers: 16617 | Issues: 99 | Issues: 425

QAnything

Question and Answer based on Anything.

Language: Python | License: AGPL-3.0 | Stargazers: 11258 | Issues: 101 | Issues: 372

jukebox

Code for the paper "Jukebox: A Generative Model for Music"

Language: Python | License: NOASSERTION | Stargazers: 7752 | Issues: 301 | Issues: 260

Omost

Your image is almost there!

Language: Python | License: Apache-2.0 | Stargazers: 7164 | Issues: 44 | Issues: 75

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language: Python | License: MIT | Stargazers: 4334 | Issues: 39 | Issues: 158

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 4057 | Issues: 90 | Issues: 1022

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3967 | Issues: 116 | Issues: 78

swift

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language: Python | License: Apache-2.0 | Stargazers: 2735 | Issues: 19 | Issues: 758

4DGaussians

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 2041 | Issues: 35 | Issues: 160

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language: Python | License: Apache-2.0 | Stargazers: 1724 | Issues: 27 | Issues: 111

OS-Copilot

A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.

Language: Python | License: MIT | Stargazers: 1426 | Issues: 20 | Issues: 26

LucidDreamer

Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes".

Language: Python | License: NOASSERTION | Stargazers: 1307 | Issues: 23 | Issues: 63

ShareGPT4Video

An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

GRUtopia

GRUtopia: Dream General Robots in a City at Scale

Language: Python | License: MIT | Stargazers: 434 | Issues: 9 | Issues: 14
Language: Python | License: Apache-2.0 | Stargazers: 399 | Issues: 12 | Issues: 15

MotionClone

Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language: Python | License: Apache-2.0 | Stargazers: 234 | Issues: 11 | Issues: 11

MMStar

This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"

RegionSpot

Recognize Any Regions

Language: Python | License: NOASSERTION | Stargazers: 116 | Issues: 1 | Issues: 15

LongICLBench

Code and Data for "Long-context LLMs Struggle with Long In-context Learning"

Language: Python | License: MIT | Stargazers: 87 | Issues: 3 | Issues: 4

MMDU

Official repository of MMDU dataset

Language: Python | License: Apache-2.0 | Stargazers: 58 | Issues: 2 | Issues: 3

MixPL

Mixed Pseudo Labels for Semi-Supervised Object Detection

Language: Python | License: Apache-2.0 | Stargazers: 53 | Issues: 4 | Issues: 16

Soda

Search, organize, discover anything!

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 43 | Issues: 0 | Issues: 0

Prism

A Framework for Decoupling and Assessing the Capabilities of VLMs

Language: Python | License: Apache-2.0 | Stargazers: 36 | Issues: 1 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 8 | Issues: 0 | Issues: 0

swift

SWIFT (Scalable lightWeight Infrastructure for Fine-Tuning) is an extensible framework designed to facilitate lightweight model fine-tuning.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0