Dmitry Tarasov's repositories
memes-dataset
Imgflip memes dataset parser
aac-datasets
Audio Captioning datasets for PyTorch.
audiocaps
🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps
audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
frechet-audio-distance
A lightweight library for Frechet Audio Distance calculation.
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
icons
A set of SVG icons provided as React components.
LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
ml-course-hse
Machine Learning at the HSE Faculty of Computer Science
Perl-Critic-Git
Perl module to connect Git and Perl::Critic, to blame the right people for violations.
Pointnet_Pointnet2_pytorch
PointNet and PointNet++ implemented in PyTorch (pure Python) for ModelNet, ShapeNet and S3DIS.
riffusion
Stable diffusion for real-time music generation
Test-Perl-Critic-Git
Run Perl::Critic as a unit test for git diff
text-translator
Extension for gnome-shell
TorchLRP
A PyTorch 1.6 implementation of Layer-Wise Relevance Propagation (LRP).
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
vk-teamsru.mail.biz.VKTeams
Flatpak for VK Teams (MyTeam)
vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
wild-workouts-go-ddd-example
Go DDD example application. Complete project to show how to apply DDD, Clean Architecture, and CQRS by practical refactoring.