gachaun's repositories

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

awesome-LLM-resourses

🧑‍🚀 全世界最好的中文LLM资料总结

Stargazers:0Issues:0Issues:0

Awesome-MLLM-Hallucination

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

Stargazers:0Issues:0Issues:0

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

cat-catch

猫抓 浏览器资源嗅探扩展 / cat-catch Browser Resource Sniffing Extension

Language:JavaScriptLicense:GPL-3.0Stargazers:0Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

DeepFaceLive

Real-time face swap for PC streaming or video calls

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

NativeSpeaker

make your Speaker talking as Native style with own voice!

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

NDK_OpenGLES_3_0

Android OpenGL ES 3.0 从入门到精通系统性学习教程

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

OpenGLCamera2

🔥 Android OpenGL Camera 2.0 实现 30 多种滤镜和抖音特效

Language:C++Stargazers:0Issues:0Issues:0

PL-BERT

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

vits_chinese-1

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support streaming out!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

interview

📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。

License:NOASSERTIONStargazers:0Issues:0Issues:0

mnist-onnx-runtime

MoE model with onnx runtime

Stargazers:0Issues:0Issues:0

roop-cam

real time face swap and one-click video deepfake with only a single image (Uncensored)

License:AGPL-3.0Stargazers:0Issues:0Issues:0

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript

License:Apache-2.0Stargazers:0Issues:0Issues:0

stable-ts

Transcription, forced alignment, and audio indexing with OpenAI's Whisper

License:MITStargazers:0Issues:0Issues:0

talking-face-arxiv-daily

🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:0Issues:0Issues:0

TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

vits-simple-api

A simple VITS HTTP API, developed by extending Moegoe with additional features.

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Stargazers:0Issues:0Issues:0