magicse's repositories
ncnn-hifi-GAN
ncnn HiFi-GAN
Aladdin-Persson-AI-Watermark-Destroy
Aladdin-Persson-AI-Watermark-Destroy Public
caffe-windows-dependencies
Build scripts to compile caffe dependencies on Windows
Gyver-Lamp
Home Assistant компонент для интеграции лампы Гайвера на оригинальной прошивке
LJSpeechTools
Tools for making LJSpeech datasets
ncnn-SpyNet-opticalflow
ncnn SpyNet opticalflow
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
sherpa-onnx
Speech-to-text and text-to-speech using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go
SoniTranslate
Synchronized Translation for Videos
stable-diffusion-webui-depthmap-script
High Resolution Depth Maps for Stable Diffusion WebUI
stt_normalization
Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks
StyleTTS-VC
Official Implementation of StyleTTS-VC
Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
VALL-E-X-Trainer-by-CustomData
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io