douzi (douzi0248)

Company: lili

Location: yixia

douzi's starred repositories

nlp-competitions-list-review

A review of the top solutions from all NLP competitions. Covers NLP competitions only; continuously updated.

Stargazers: 2662, Issues: 0

Tianchi-LLM-QA

Alibaba Tianchi: 2023 Global Intelligent Vehicle AI Challenge, Track 1: LLM retrieval-based question answering, baseline 80+

Language: Python, Stargazers: 71, Issues: 0

BetterMixture-Top1-Solution

First-place (top-1) solution for the Tianchi algorithm competition "BetterMixture - Large Model Data Mixing Challenge"

Language: Python, Stargazers: 22, Issues: 0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Jupyter Notebook, License: MIT, Stargazers: 6811, Issues: 0

edgeai-tidl-tools

Edgeai TIDL Tools and Examples - This repository contains tools and examples developed for the deep learning runtime (DLRT) offering provided by TI's edge AI solutions.

Language: Python, License: NOASSERTION, Stargazers: 135, Issues: 0

open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Language: Svelte, License: MIT, Stargazers: 44858, Issues: 0

qwen2-sft

Qwen1.5-SFT (Alibaba): fine-tuning (transformers), LoRA (peft), and inference for Qwen_Qwen1.5-2B-Chat and Qwen_Qwen1.5-7B-Chat

Language: Python, License: Apache-2.0, Stargazers: 43, Issues: 0
Language: C++, License: MIT, Stargazers: 3, Issues: 0

SummerTTS

SummerTTS is a standalone C++ speech synthesis (TTS) project for Chinese and English. It runs locally without a network connection, has no additional dependencies, and is ready for Chinese and English speech synthesis after a one-step build.

Language: C++, Stargazers: 400, Issues: 0

tts-demo

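Supports male and female voices with a range of emotions; supports real-time and offline text-to-speech synthesis; supports voice conversion for a single voice model, speech rate adjustment, and volume adjustment; supports custom voice models.

Language: Java, Stargazers: 57, Issues: 0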

ollama-python

Ollama Python library

Language: Python, License: MIT, Stargazers: 4367, Issues: 0

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 7087, Issues: 0

sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter, Object Pascal, Lazarus, Rust

Language: C++, License: Apache-2.0, Stargazers: 3474, Issues: 0

OpenVoiceChat

Have a natural voice conversation with an LLM

Language: Python, License: Apache-2.0, Stargazers: 220, Issues: 0

RealtimeSTT_LLM_TTS

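Real-time STT connected to the OpenAI API or Zhipu AI (streaming LLM) and to GPT-SOVITS or Edge-TTS; services are called across the network through a web page to achieve real-time conversation.

Language: Python, License: MIT, Stargazers: 244, Issues: 0
Language: Python, License: MIT, Stargazers: 280, Issues: 0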

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python, License: Apache-2.0, Stargazers: 12431, Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python, License: MPL-2.0, Stargazers: 35096, Issues: 0

OpenVoice

Instant voice cloning by MIT and MyShell.

Language: Python, License: MIT, Stargazers: 29570, Issues: 0

fish-speech

Brand new TTS solution

Language: Python, License: NOASSERTION, Stargazers: 13694, Issues: 0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python, License: MIT, Stargazers: 34927, Issues: 0

ChatTTS

A generative speech model for daily dialogue.

Language: Python, License: AGPL-3.0, Stargazers: 31957, Issues: 0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language: Python, License: Apache-2.0, Stargazers: 5975, Issues: 0

self-llm

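"Open-Source LLM Cookbook": quickly deploy open-source large models in a Linux environment; a deployment tutorial better suited to ** beginners

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 8988, Issues: 0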

SenseVoice

Multilingual Voice Understanding Model

Language: Python, License: NOASSERTION, Stargazers: 3259, Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python, License: Apache-2.0, Stargazers: 4147, Issues: 0

faster-whisper-GUI

faster_whisper GUI with PySide6

Language: Python, License: AGPL-3.0, Stargazers: 1585, Issues: 0

WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Language: Python, License: MIT, Stargazers: 1988, Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python, License: MIT, Stargazers: 70544, Issues: 0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language: Python, License: NOASSERTION, Stargazers: 6754, Issues: 0