叶大侠's starred repositories

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:51691Issues:382Issues:3244

whisper.cpp

Port of OpenAI's Whisper model in C/C++

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:33068Issues:202Issues:1214

RSSHub

🧡 Everything is RSSible

Language:TypeScriptLicense:MITStargazers:32424Issues:344Issues:5503

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:31008Issues:178Issues:514

mlx

MLX: An array framework for Apple silicon

flexsearch

Next-Generation full text search library for Browser and Node.js

Language:JavaScriptLicense:Apache-2.0Stargazers:12354Issues:101Issues:319

ggml

Tensor library for machine learning

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10785Issues:141Issues:349

pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并支持api调用

Language:PythonLicense:GPL-3.0Stargazers:10095Issues:66Issues:515

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7832Issues:107Issues:440
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7416Issues:65Issues:188

clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

Language:PythonLicense:NOASSERTIONStargazers:7277Issues:38Issues:127
Language:PythonLicense:Apache-2.0Stargazers:7091Issues:66Issues:71

Dango-Translator

团子翻译器 —— 个人兴趣制作的一款基于OCR技术的翻译器

Language:PythonLicense:LGPL-2.1Stargazers:6949Issues:84Issues:129

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:6115Issues:58Issues:1097

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5242Issues:39Issues:37

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language:PythonLicense:MITStargazers:4452Issues:39Issues:163

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonLicense:MITStargazers:3520Issues:64Issues:101

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:3357Issues:46Issues:165

CTranslate2

Fast inference engine for Transformer models

Language:C++License:MITStargazers:3246Issues:57Issues:690

aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Language:PythonLicense:AGPL-3.0Stargazers:2484Issues:72Issues:209

DashPlayer

为英语学习者量身打造的视频播放器,助你通过观看视频、沉浸真实语境,轻松提升英语水平。#美剧 #播放器 #听力

Language:TypeScriptLicense:AGPL-3.0Stargazers:2251Issues:15Issues:54

WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Language:PythonLicense:MITStargazers:1831Issues:30Issues:171

lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Language:PythonLicense:Apache-2.0Stargazers:1113Issues:11Issues:82

rag-search

RAG Search API

Language:PythonLicense:Apache-2.0Stargazers:976Issues:6Issues:7

noScribe

Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)

Language:PythonLicense:GPL-3.0Stargazers:409Issues:12Issues:44

vite-vue3-chrome-extension-v3

Another vite powered web extension (chrome, firefox, etc.) starter template.

clothes-swap-salvton-comfyui-workflow

A ComfyUI workflow to dress your virtual influencer with real clothes. Made with 💚 by the CozyMantis squad.