Alphonsce

MIPT

Russia, Moscow

https://www.kaggle.com/alphonsce

Alexander Varlamov's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | 140248 1073 7640

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Python | License: MIT | 38577 444 305

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | 31033 179 515

faiss

A library for efficient similarity search and clustering of dense vectors.

Language: C++ | License: MIT | 30665 481 2480

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | License: MIT | 14689 110 385

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | 13177 92 16

mmcv

OpenMMLab Computer Vision Foundation

Language: Python | License: Apache-2.0 | 5836 84 1150

torchdiffeq

Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.

Language: Python | License: MIT | 5489 125 216

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Language: Python | License: MIT | 4477 77 169

parler-tts

Inference and training library for high-quality TTS models.

Language: Python | License: Apache-2.0 | 4257 56 97

riffusion-hobby

Stable diffusion for real-time music generation

Language: Python | License: MIT | 3368 39 93

sd-webui-animatediff

AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI

Language: Python | License: NOASSERTION | 3051 23 371

stable-audio-tools

Generative models for conditional audio generation

Language: Python | License: MIT | 2549 43 87

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language: Python | License: NOASSERTION | 2397 42 105

LyCORIS

Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.

Language: Python | License: Apache-2.0 | 2166 20 140

IMS-Toucan

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

Language: Python | License: Apache-2.0 | 1388 21 158

VQ-Diffusion

Official implementation of VQ-Diffusion

Language: Python | License: MIT | 880 10 37

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language: Python | License: MIT | 662 25 46

Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language: Jupyter Notebook | License: MIT | 633 16 63

DeepImageSearch

DeepImageSearch is a Python library for fast and accurate image search. It offers seamless integration with Python, GPU support, and advanced capabilities for identifying complex image patterns using the Vision Transformer models.

Language: Python | License: MIT | 374 7 25

mustango

Mustango: Toward Controllable Text-to-Music Generation

Language: Python | License: MIT | 324 16 13

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language: Python | 301 15 15

laughter-detection

Language: Python | License: MIT | 212 9 11

emospeech

Language: Python | License: Apache-2.0 | 99 4 5

Yin

Fast Python implementation of the Yin algorithm: a fundamental frequency estimator

Language: Python | License: MIT | 91 3 2

huggingface-tokenizer-in-cxx

Language: C++ | 49 1 9

NISQA-s

Language: Python | License: Apache-2.0 | 34 2 2

palmistry

2022-2 SNU Computer Vision Project - Fortune On Your Hand: View-Invariant Machine Palmistry

Language: Jupyter Notebook | 23 1 2

dsp

Digital Signal Processing course

Language: Python | License: Apache-2.0 | 21 40

metr

🚜 METR: Message Enhanced Tree-Ring

Language: Jupyter Notebook | License: Apache-2.0 | 1000