Vladimir's repositories
awesome-e2k
awesome-e2k
awesome-oss-alternatives
Awesome list of open-source startup alternatives to well-known SaaS products 🚀
BC-ResNet
BC-ResNet for Keyword Spotting
deepiler
A neural-based decompiler using Deep Learning with Transformer model.
HierSpeechpp
The official implementation of HierSpeech++
OmniXAI
OmniXAI: A Library for eXplainable AI
Only-Noisy-Training
A self-supervised speech denoising strategy named Only-Noisy Training (ONT), which solves the speech denoising problem with only noisy audio signals in audio space for the first time.
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
PromptCS
A Prompt Learning Framework for Source Code Summarization
PytorchVAD
Code to train voice activity detection model with pytorch
rete
JavaScript framework for visual programming and creating node editor
SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
SimCSE
EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings
Speech-enhancement
Deep learning for audio denoising
StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
vosk-tts
Text To Speech Synthesis with Vosk
whisper.cpp
Port of OpenAI's Whisper model in C/C++
YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel
Yolov5_StrongSORT_OSNet
Real-time multi-camera multi-object tracker using YOLOv5 and StrongSORT with OSNet
yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone