magicse

followers

following

stars

at home

magicse's repositories

ncnn-colorization-siggraph17

Language:C++31 2 1

ncnn-hifi-GAN

ncnn HiFi-GAN

Language:C++22 4 1

ai-video-dubber

Language:Python010

Aladdin-Persson-AI-Watermark-Destroy

Aladdin-Persson-AI-Watermark-Destroy Public

Language:Python010

caffe-windows-dependencies

Build scripts to compile caffe dependencies on Windows

Language:C++010

E2FGVI

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

Language:PythonNOASSERTION010

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Language:CGPL-3.0010

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0010

ggml

Tensor library for machine learning

Language:CMIT010

Gyver-Lamp

Home Assistant компонент для интеграции лампы Гайвера на оригинальной прошивке

Language:Python010

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonMIT010

homeassistant

Language:DockerfileGPL-3.0020

LJSpeechTools

Tools for making LJSpeech datasets

Language:PythonMIT010

McJSON

A Delphi / Lazarus / C++Builder simple and small class for fast JSON parsing.

Language:PascalMIT010

ncnn-SpyNet-opticalflow

ncnn SpyNet opticalflow

Language:C++020

opencv_mingw64_windows

020

portaudio

PortAudio is a cross-platform, open-source C language library for real-time audio input and output.

Language:CNOASSERTION010

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT010

radtts

Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.

Language:RoffMIT010

sherpa-onnx

Speech-to-text and text-to-speech using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go

Language:C++Apache-2.0010

SoniTranslate

Synchronized Translation for Videos

Language:PythonApache-2.0010

stable-diffusion-webui-depthmap-script

High Resolution Depth Maps for Stable Diffusion WebUI

Language:PythonMIT010

stt_normalization

Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks

Language:PythonGPL-3.0010

StyleTTS

Official Implementation of StyleTTS

Language:PythonMIT010

StyleTTS-VC

Official Implementation of StyleTTS-VC

Language:PythonMIT010

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language:PythonMIT010

uk-sherpa-onnx-model

020

VALL-E-X-Trainer-by-CustomData

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT010

ViTA

ViTA: Video Transformer Adaptor for Robust Video Depth Estimation

Language:PythonNOASSERTION010

voice_conversion

Language:PythonMIT010