chenxwh

followers

following

stars

University of Cambridge

https://chenxwh.github.io/

Organizations

replicate

Chenxi's repositories

insanely-fast-whisper

Incredibly fast Whisper-large-v3

Language:Jupyter NotebookApache-2.01827 140

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookApache-2.086 30

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonApache-2.051 10

VideoCrafter

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

Language:PythonNOASSERTION1300

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:PythonNOASSERTION600

AudioSep

Official implementation of "Separate Anything You Describe"

Language:PythonMIT500

UnIVAL

Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.

Language:Jupyter NotebookApache-2.0300

cog-I2VGen-XL

Language:Python2 10

Cutie

[arXiv 2023] Putting the Object Back Into Video Object Segmentation

Language:PythonGPL-3.0200

InternLM-XComposer

Language:Python200

ScaleCrafter

Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.

Language:Python200

T2I-Adapter

T2I-Adapter

Language:PythonApache-2.0200

chenxwh.github.io

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language:JavaScriptMIT100

cog-idefics

Language:Python1 10

gorilla

Gorilla: An API store for LLMs

Language:PythonApache-2.0100

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language:PythonApache-2.0100

ProphetNet

A research project for natural language generation, containing the official implementations by MSRA NLC team.

Language:PythonMIT100

Show-1

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Language:PythonNOASSERTION100

Wuerstchen

Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models

Language:Jupyter NotebookMIT100

cog-whisperv3

Run OpenAI Whisper as a Cog model

Language:PythonApache-2.0000

cog-wuerstchen

Language:Python010

daclip-uir

PyTorch implementation of the paper "Controlling Vision-Language Models for Universal Image Restoration". Currently aiming for *academic researches* 😋

Language:PythonMIT000

distil-whisper

Language:PythonMIT000

InstructCV

Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"

Language:PythonNOASSERTION000

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonMIT000

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonApache-2.0000

LongerCrafter

Code for FreeNoise

Language:Python000

LongLoRA

Efficient long-context fine-tuning, supervised fine-tuning, LongQA dataset.

Language:PythonApache-2.0000

Magic123

Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Language:Jupyter NotebookApache-2.0000

MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Language:PythonApache-2.0000