叶大侠's starred repositories
whisper.cpp
Port of OpenAI's Whisper model in C/C++
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
flexsearch
Next-Generation full text search library for Browser and Node.js
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并支持api调用
clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
Dango-Translator
团子翻译器 —— 个人兴趣制作的一款基于OCR技术的翻译器
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
CTranslate2
Fast inference engine for Transformer models
DashPlayer
为英语学习者量身打造的视频播放器,助你通过观看视频、沉浸真实语境,轻松提升英语水平。#美剧 #播放器 #听力
WhisperLive
A nearly-live implementation of OpenAI's Whisper.
rag-search
RAG Search API
vite-vue3-chrome-extension-v3
Another vite powered web extension (chrome, firefox, etc.) starter template.
clothes-swap-salvton-comfyui-workflow
A ComfyUI workflow to dress your virtual influencer with real clothes. Made with 💚 by the CozyMantis squad.