Beast code in Giters

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

MIT000

magic-image-refiner

000

xtts-trainer-no-ui-auto

This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for accelerated training.

000

xtts-api-server

A simple FastAPI Server to run XTTSv2

MIT000

p5.brush

Unlock custom brushes, natural fill effects and intuitive hatching in p5.js

MIT000

SAM

Official Implementation for "Only a Matter of Style: Age Transformation Using a Style-Based Regression Model" (SIGGRAPH 2021) https://arxiv.org/abs/2102.02754

MIT000

i2vgen-xl

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language:Python000

Giant-Music-Transformer

[SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-velocity and outro tokens

Apache-2.0000

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookApache-2.0000

cog-sdxl-img-blend

Cog wrapper for SDXL img blend using compel

Language:Python000

RenderAI

Render-AI

RenderAI's repositories

rubra

cog-metavoice

Cog-SDXL-ControlNet-LoRA

AudioSep

cog-musicgen-fine-tuner

versatile_audio_super_resolution

cog-xtts-v2

xtts-webui

WhisperSpeech

ml-mgie

DynamiCrafter-576x1024-replicate

cog-comfyui-image-merge

CogVLM

nexrender

InstantID-Juggernaut

LLMs-from-scratch

Lumos

InstantID

replicate-local

GPT-SoVITS

cog-audiocraft

magic-image-refiner

xtts-trainer-no-ui-auto

xtts-api-server

p5.brush

SAM

i2vgen-xl

Giant-Music-Transformer

IP-Adapter

cog-sdxl-img-blend