RenderAI's repositories
rubra
AI Assistants, LLMs and tools made easy
Cog-SDXL-ControlNet-LoRA
A Cog implementation of canny SDXL ControlNet supporting Replicate's LoRA weights.
AudioSep
Official implementation of "Separate Anything You Describe"
cog-musicgen-fine-tuner
A Cog implementation of Meta's 1.5B MusicGen Melody model.
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
cog-xtts-v2
Cog wrapper for Coqui / xtts-v2
xtts-webui
Web UI for using and fine-tuning XTTS
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
CogVLM
A state-of-the-art open visual language model | multimodal pretrained model
nexrender
📹 Data-driven render automation for After Effects
InstantID-Juggernaut
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
Lumos
A RAG LLM co-pilot for browsing the web, powered by local LLMs
InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds
replicate-local
Retrieve the source code for any model made available on replicate.com!
GPT-SoVITS
Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
cog-audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
xtts-trainer-no-ui-auto
This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script uses custom datasets and CUDA for accelerated training.
xtts-api-server
A simple FastAPI Server to run XTTSv2
p5.brush
Unlock custom brushes, natural fill effects and intuitive hatching in p5.js
SAM
Official Implementation for "Only a Matter of Style: Age Transformation Using a Style-Based Regression Model" (SIGGRAPH 2021) https://arxiv.org/abs/2102.02754
i2vgen-xl
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
Giant-Music-Transformer
[SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-velocity and outro tokens
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
cog-sdxl-img-blend
Cog wrapper for SDXL img blend using compel