Alexander Varlamov's starred repositories
stable-diffusion-webui
Stable Diffusion web UI
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
torchdiffeq
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
parler-tts
Inference and training library for high-quality TTS models.
riffusion-hobby
Stable diffusion for real-time music generation
sd-webui-animatediff
AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI
stable-audio-tools
Generative models for conditional audio generation
IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
VQ-Diffusion
Official implementation of VQ-Diffusion
Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
DeepImageSearch
DeepImageSearch is a Python library for fast and accurate image search. It offers seamless integration with Python, GPU support, and advanced capabilities for identifying complex image patterns using the Vision Transformer models.
VoiceFlow-TTS
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"