YasinLin's repositories
chatgpt-web
A ChatGPT demo web page built with Express and Vue3
AnyDoor
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
Bert-VITS2
vits2 backbone with multilingual-bert
chatgpt-web-midjourney-proxy
ChatGPT web, Midjourney, GPTs, TTS, Whisper: everything handled in one UI
data-workflow
Data processing workflow, ELT
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
HierSpeechpp
The official implementation of HierSpeech++
FFCreator
A fast video processing library based on node.js
FFCreatorLite
A lightweight and fast short video processing library based on node.js
FFmpeg
Mirror of https://git.ffmpeg.org/ffmpeg.git
ffmpeg-build-script
The FFmpeg build script provides an easy way to build a static FFmpeg on OSX and Linux with non-free codecs included.
ffmpeg-docker
Docker build for FFmpeg on Ubuntu / Alpine / CentOS / Scratch / nvidia / vaapi
ffmpeg-gl-transition
FFmpeg filter for applying GLSL transitions between video streams.
ffmpeg-windows-build-helpers
Helper script for cross compiling some media tools for windows, like customizable ffmpeg.exe (with or without non-free components, etc), and some other bonuses like mplayer, mp4box, mxf, etc.
glew
The OpenGL Extension Wrangler Library
GPT-SoVITS
1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning)
NativeSpeaker
Make your speaker talk in a native style, in their own voice!
OpenVoice
Instant voice cloning by MyShell
pingora
A library for building fast, reliable and evolvable network services.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
VALL-E-X
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
whisper.cpp
Port of OpenAI's Whisper model in C/C++
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)