Ahsen Khaliq's repositories
generative-models
Generative Models by Stability AI
Mubert-Text-to-Music
A simple notebook demonstrating prompt-based music generation via Mubert API
discord-md-badge
Add to your GitHub readme a badge that shows your Discord username and presence, or a server invite!
CogVideo
Text-to-video generation.
Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion (tweaks focused on training faces)
fast-stable-diffusion
fast-stable-diffusion, +25% speed increase + memory efficient.
g1
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
japanese-stable-diffusion
Japanese Stable Diffusion is a Japanese specific latent text-to-image diffusion model capable of generating photo-realistic images given any text input.
knn-transformers
PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT
Mengzi
Mengzi Pretrained Models
min-dalle
min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch
point-e
Point cloud diffusion for 3D model synthesis
resefa
[ICML 2022] Region-Based Semantic Factorization in GANs
riffusion-app
Stable diffusion for real-time music generation
sd-webui-colab-simplified
A one-click version of sd-webui-colab
stable-diffusion-tensorflow
Stable Diffusion in TensorFlow / Keras
stable-diffusion-webui
Stable Diffusion web UI
stable-diffusion-webui-1
Stable Diffusion web UI
VToonify
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone