Beast code in Giters

Chenxi's repositories

bark

🔊 Text-Prompted Generative Audio Model

Language: Python | License: NOASSERTION | 95 60

SadTalker

(CVPR 2023) SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language: Python | License: NOASSERTION | 23 40

Semantic-Segment-Anything

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Language: Python | License: Apache-2.0 | 2300

cog-deforum-stable-diffusion

Language: Python | License: NOASSERTION | 1600

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect, Segment and Generate Anything with Image, Text, and Audio Inputs

Language: Jupyter Notebook | License: Apache-2.0 | 1300

ControlVideo

Official PyTorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"

Language: Python | License: MIT | 800

replicate-sd-textual-inversion

Language: Python | 7 10

cog-stable-diffusion

Diffusers Stable Diffusion as a Cog model

Language: Python | License: Apache-2.0 | 600

cog-bark

Language: Python | 400

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language: Python | License: MIT | 300

Prompt-Free-Diffusion

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models

Language: Python | License: MIT | 300

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | 300

cog-dolly

Language: Python | 200

dolly

Language: Python | License: Apache-2.0 | 200

StableSR

Exploiting Diffusion Prior for Real-World Image Super-Resolution

Language: Python | License: NOASSERTION | 200

StyleDrop-PyTorch

Unofficial implementation of [StyleDrop](https://arxiv.org/abs/2306.00983)

Language: Python | License: MIT | 200

cog-ledits

Language: Python | 1 10

cog-segment-anything

100

FastChat

The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"

Language: Python | License: Apache-2.0 | 100

shap-e

Generate 3D objects conditioned on text or images

Language: Python | License: MIT | 1 10

tango

Code and model for the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"

Language: Python | License: NOASSERTION | 100

difformer

The official codebase for Difformer: Empowering Diffusion Models on the Embedding Space for Text Generation

Language: Python | License: MIT | 000

fastcomposer

Language: Python | License: MIT | 000

FastSAM

Fast Segment Anything

Language: Python | License: Apache-2.0 | 000

lorahub

The official repository of the paper "LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition".

Language: Python | License: MIT | 000

ProFusion

Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach

Language: Jupyter Notebook | License: Apache-2.0 | 000

recognize-anything

Code for the Recognize Anything Model (RAM) and Tag2Text Model

Language: Python | License: MIT | 000

ResShift

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (PyTorch)

Language: Python | 000

Text2Video-Zero

Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language: Python | License: NOASSERTION | 000

webie

Dataset for web-scale information extraction.

Language: Python | License: NOASSERTION | 000