Chenxi (chenxwh)

Company: University of Cambridge

Home Page: https://chenxwh.github.io/

Twitter: @chenxi_jw


Organizations
replicate

Chenxi's repositories

insanely-fast-whisper

Incredibly fast Whisper-large-v3

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1827 | Issues: 14 | Issues: 0

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 86 | Issues: 3 | Issues: 0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language: Python | License: Apache-2.0 | Stargazers: 51 | Issues: 1 | Issues: 0

VideoCrafter

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

Language: Python | License: NOASSERTION | Stargazers: 13 | Issues: 0 | Issues: 0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Python | License: NOASSERTION | Stargazers: 6 | Issues: 0 | Issues: 0

AudioSep

Official implementation of "Separate Anything You Describe"

Language: Python | License: MIT | Stargazers: 5 | Issues: 0 | Issues: 0

UnIVAL

Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0

Cutie

[arXiv 2023] Putting the Object Back Into Video Object Segmentation

Language: Python | License: GPL-3.0 | Stargazers: 2 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

ScaleCrafter

Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

T2I-Adapter

T2I-Adapter

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

chenxwh.github.io

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language: JavaScript | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 1 | Issues: 1 | Issues: 0

gorilla

Gorilla: An API store for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

ProphetNet

A research project for natural language generation, containing the official implementations by MSRA NLC team.

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

Show-1

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Language: Python | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0

Wuerstchen

Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models

Language: Jupyter Notebook | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

cog-whisperv3

Run OpenAI Whisper as a Cog model

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

daclip-uir

PyTorch implementation of the paper "Controlling Vision-Language Models for Universal Image Restoration". Currently aiming for *academic researches* 😋

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

InstructCV

Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

LongerCrafter

Code for FreeNoise

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

LongLoRA

Efficient long-context fine-tuning, supervised fine-tuning, LongQA dataset.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Magic123

Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0