zhangshushu15's starred repositories

License: CC-BY-4.0, Stargazers: 782, Issues: 0

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language: Python, License: Apache-2.0, Stargazers: 4141, Issues: 0

AnimateDiff

Official implementation of AnimateDiff.

Language: Python, License: Apache-2.0, Stargazers: 9646, Issues: 0

Mr.-Ranedeer-AI-Tutor

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

Stargazers: 27826, Issues: 0

edm

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Language: Python, License: NOASSERTION, Stargazers: 1172, Issues: 0

consistency_models

Official repo for consistency models.

Language: Python, License: MIT, Stargazers: 6020, Issues: 0

conceptual-12m

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

License: NOASSERTION, Stargazers: 333, Issues: 0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language: Python, License: AGPL-3.0, Stargazers: 2506, Issues: 0

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 2455, Issues: 0

Fooocus

Focus on prompting and generating

Language: Python, License: GPL-3.0, Stargazers: 37803, Issues: 0

style2paints

sketch + style = paints (TOG2018/SIGGRAPH2018ASIA)

Language: JavaScript, License: Apache-2.0, Stargazers: 17881, Issues: 0

ControlNet-v1-1-nightly

Nightly release of ControlNet 1.1

Language: Python, Stargazers: 4469, Issues: 0

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language: Python, License: MIT, Stargazers: 3322, Issues: 0

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language: Python, License: MIT, Stargazers: 4184, Issues: 0

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 10493, Issues: 0

lyra

A Very Low-Bitrate Codec for Speech Compression

Language: C++, License: Apache-2.0, Stargazers: 3796, Issues: 0

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language: Python, License: MIT, Stargazers: 1867, Issues: 0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language: Python, License: MIT, Stargazers: 3291, Issues: 0

Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Language: Python, License: MIT, Stargazers: 719, Issues: 0

riffusion

Stable diffusion for real-time music generation

Language: Python, License: MIT, Stargazers: 3283, Issues: 0

Mubert-Text-to-Music

A simple notebook demonstrating prompt-based music generation via Mubert API

Language: Jupyter Notebook, Stargazers: 2725, Issues: 0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python, License: MIT, Stargazers: 20133, Issues: 0

generative-models

Generative Models by Stability AI

Language: Python, License: MIT, Stargazers: 23141, Issues: 0

AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Language: Python, License: MIT, Stargazers: 4044, Issues: 0

GPTQ-for-LLaMa

4-bit quantization of LLaMA using GPTQ

Language: Python, License: Apache-2.0, Stargazers: 2945, Issues: 0

skypilot

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Language: Python, License: Apache-2.0, Stargazers: 6159, Issues: 0

openai-python

The official Python library for the OpenAI API

Language: Python, License: Apache-2.0, Stargazers: 20994, Issues: 0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python, License: Apache-2.0, Stargazers: 38272, Issues: 0

InternLM

Official release of InternLM2 7B and 20B base and chat models. 200K context support

Language: Python, License: Apache-2.0, Stargazers: 5475, Issues: 0

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language: Python, License: Apache-2.0, Stargazers: 3146, Issues: 0