Chenxi (chenxwh)

chenxwh

Geek Repo

Company:University of Cambridge

Home Page:https://chenxwh.github.io/

Twitter:@chenxi_jw

Github PK Tool:Github PK Tool


Organizations
replicate

Chenxi's repositories

bark

🔊 Text-Prompted Generative Audio Model

Language:PythonLicense:NOASSERTIONStargazers:95Issues:6Issues:0

SadTalker

(CVPR 2023)SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:23Issues:4Issues:0

Semantic-Segment-Anything

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Language:PythonLicense:Apache-2.0Stargazers:23Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:16Issues:0Issues:0

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Audio Inputs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:13Issues:0Issues:0

ControlVideo

Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

Language:PythonLicense:MITStargazers:8Issues:0Issues:0

cog-stable-diffusion

Diffusers Stable Diffusion as a Cog model

Language:PythonLicense:Apache-2.0Stargazers:6Issues:0Issues:0
Language:PythonStargazers:4Issues:0Issues:0

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

Prompt-Free-Diffusion

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:3Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

StableSR

Exploiting Diffusion Prior for Real-World Image Super-Resolution

Language:PythonLicense:NOASSERTIONStargazers:2Issues:0Issues:0

StyleDrop-PyTorch

Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)

Language:PythonLicense:MITStargazers:2Issues:0Issues:0
Language:PythonStargazers:1Issues:1Issues:0

FastChat

The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

shap-e

Generate 3D objects conditioned on text or images

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

tango

Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"

Language:PythonLicense:NOASSERTIONStargazers:1Issues:0Issues:0

difformer

The offical codebase for Difformer: Empowering Diffusion Models on the Embedding Space for Text Generation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

FastSAM

Fast Segment Anything

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

lorahub

The official repository of paper "LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition".

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ProFusion

Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

recognize-anything

Code for the Recognize Anything Model (RAM) and Tag2Text Model

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

ResShift

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (PyTorch)

Language:PythonStargazers:0Issues:0Issues:0

Text2Video-Zero

Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

webie

Dataset for web-scaled information extraction.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0