magicse

magicse

Geek Repo

Company:at home

Github PK Tool:Github PK Tool

magicse's repositories

Language:PythonStargazers:0Issues:1Issues:0

Aladdin-Persson-AI-Watermark-Destroy

Aladdin-Persson-AI-Watermark-Destroy Public

Language:PythonStargazers:0Issues:1Issues:0

caffe-windows-dependencies

Build scripts to compile caffe dependencies on Windows

Language:C++Stargazers:0Issues:1Issues:0

E2FGVI

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

espeak-ng

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

Language:CLicense:GPL-3.0Stargazers:0Issues:1Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ggml

Tensor library for machine learning

Language:CLicense:MITStargazers:0Issues:1Issues:0

Gyver-Lamp

Home Assistant компонент для интеграции лампы Гайвера на оригинальной прошивке

Language:PythonStargazers:0Issues:1Issues:0

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:DockerfileLicense:GPL-3.0Stargazers:0Issues:2Issues:0

LJSpeechTools

Tools for making LJSpeech datasets

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

McJSON

A Delphi / Lazarus / C++Builder simple and small class for fast JSON parsing.

Language:PascalLicense:MITStargazers:0Issues:1Issues:0

ncnn-SpyNet-opticalflow

ncnn SpyNet opticalflow

Language:C++Stargazers:0Issues:2Issues:0
Stargazers:0Issues:2Issues:0

portaudio

PortAudio is a cross-platform, open-source C language library for real-time audio input and output.

Language:CLicense:NOASSERTIONStargazers:0Issues:1Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

radtts

Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained Control over of Low Dimensional (F0 and Energy) Speech Attributes.

Language:RoffLicense:MITStargazers:0Issues:1Issues:0

sherpa-onnx

Speech-to-text and text-to-speech using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

SoniTranslate

Synchronized Translation for Videos

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

stable-diffusion-webui-depthmap-script

High Resolution Depth Maps for Stable Diffusion WebUI

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

stt_normalization

Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

StyleTTS

Official Implementation of StyleTTS

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

StyleTTS-VC

Official Implementation of StyleTTS-VC

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0

VALL-E-X-Trainer-by-CustomData

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ViTA

ViTA: Video Transformer Adaptor for Robust Video Depth Estimation

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0