Alexander Varlamov (Alphonsce)

Company: MIPT

Location: Russia, Moscow

Home Page: https://www.kaggle.com/alphonsce

Alexander Varlamov's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | Stargazers: 140248 | Issues: 1073 | Issues: 7640

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Python | License: MIT | Stargazers: 38577 | Issues: 444 | Issues: 305

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | Stargazers: 31033 | Issues: 179 | Issues: 515

faiss

A library for efficient similarity search and clustering of dense vectors.

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | License: MIT | Stargazers: 14689 | Issues: 110 | Issues: 385

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | Stargazers: 13177 | Issues: 92 | Issues: 16

mmcv

OpenMMLab Computer Vision Foundation

Language: Python | License: Apache-2.0 | Stargazers: 5836 | Issues: 84 | Issues: 1150

torchdiffeq

Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.

Language: Python | License: MIT | Stargazers: 5489 | Issues: 125 | Issues: 216

muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Language: Python | License: MIT | Stargazers: 4477 | Issues: 77 | Issues: 169

parler-tts

Inference and training library for high-quality TTS models.

Language: Python | License: Apache-2.0 | Stargazers: 4257 | Issues: 56 | Issues: 97

riffusion-hobby

Stable diffusion for real-time music generation

Language: Python | License: MIT | Stargazers: 3368 | Issues: 39 | Issues: 93

sd-webui-animatediff

AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI

Language: Python | License: NOASSERTION | Stargazers: 3051 | Issues: 23 | Issues: 371

stable-audio-tools

Generative models for conditional audio generation

Language: Python | License: MIT | Stargazers: 2549 | Issues: 43 | Issues: 87

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language: Python | License: NOASSERTION | Stargazers: 2397 | Issues: 42 | Issues: 105

LyCORIS

Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.

Language: Python | License: Apache-2.0 | Stargazers: 2166 | Issues: 20 | Issues: 140

IMS-Toucan

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

Language: Python | License: Apache-2.0 | Stargazers: 1388 | Issues: 21 | Issues: 158

VQ-Diffusion

Official implementation of VQ-Diffusion

Language: Python | License: MIT | Stargazers: 880 | Issues: 10 | Issues: 37

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language: Python | License: MIT | Stargazers: 662 | Issues: 25 | Issues: 46

Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Language: Jupyter Notebook | License: MIT | Stargazers: 633 | Issues: 16 | Issues: 63

DeepImageSearch

DeepImageSearch is a Python library for fast and accurate image search. It offers seamless integration with Python, GPU support, and advanced capabilities for identifying complex image patterns using the Vision Transformer models.

Language: Python | License: MIT | Stargazers: 374 | Issues: 7 | Issues: 25

mustango

Mustango: Toward Controllable Text-to-Music Generation

Language: Python | License: MIT | Stargazers: 324 | Issues: 16 | Issues: 13

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language: Python | License: Apache-2.0 | Stargazers: 99 | Issues: 4 | Issues: 5

Yin

Fast Python implementation of the Yin algorithm: a fundamental frequency estimator

Language: Python | License: MIT | Stargazers: 91 | Issues: 3 | Issues: 2

Language: Python | License: Apache-2.0 | Stargazers: 34 | Issues: 2 | Issues: 2

palmistry

2022-2 SNU Computer Vision Project - Fortune On Your Hand: View-Invariant Machine Palmistry

Language: Jupyter Notebook | Stargazers: 23 | Issues: 1 | Issues: 2

dsp

Digital Signal Processing course

Language: Python | License: Apache-2.0 | Stargazers: 21 | Issues: 4 | Issues: 0

metr

🚜 METR: Message Enhanced Tree-Ring

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 10 | Issues: 0 | Issues: 0