MarineHuang

MarineHuang

Geek Repo

Github PK Tool:Github PK Tool

MarineHuang's starred repositories

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonLicense:MITStargazers:1242Issues:0Issues:0

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonLicense:Apache-2.0Stargazers:1953Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:7456Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonLicense:MITStargazers:3327Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19240Issues:0Issues:0

lobe-tts

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

Language:TypeScriptLicense:MITStargazers:375Issues:0Issues:0

pyRobBot

Chat with GPT LLMs over voice, UI & terminal, all with access to the internet. Powered by OpenAI.

Language:PythonLicense:MITStargazers:80Issues:0Issues:0

transcribe-video-audio

An OpenAI's Whisper-based full-stack project to transcribe audio and video files using React & Django.

Language:TypeScriptStargazers:36Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32217Issues:0Issues:0

YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:861Issues:0Issues:0

FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Language:PythonLicense:MITStargazers:319Issues:0Issues:0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonLicense:Apache-2.0Stargazers:6929Issues:0Issues:0

WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Language:PythonLicense:MITStargazers:1588Issues:0Issues:0

SpokenLanguageAssessment

A spoken language assessment tool by which you can use your speech to determine how better are you in your english speaking capabalities.

Language:PythonStargazers:6Issues:0Issues:0

IntroventsEnglishCorner

A spoken English education chatbot based on ChatGPT/whsiper and gTTS.社恐人士的英语角

Language:PythonLicense:MITStargazers:16Issues:0Issues:0

dlib

A toolkit for making real world machine learning and data analysis applications in C++

Language:C++License:BSL-1.0Stargazers:13258Issues:0Issues:0

lobe-chat

🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.

Language:TypeScriptLicense:NOASSERTIONStargazers:35699Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++License:MITStargazers:33325Issues:0Issues:0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:6158Issues:0Issues:0

CTranslate2

Fast inference engine for Transformer models

Language:C++License:MITStargazers:3085Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:10481Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:10213Issues:0Issues:0

langchain

🦜🔗 Build context-aware reasoning applications

Language:Jupyter NotebookLicense:MITStargazers:89686Issues:0Issues:0

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

Stargazers:5856Issues:0Issues:0

cnpy

library to read/write .npy and .npz files in C/C++

Language:C++License:MITStargazers:1288Issues:0Issues:0

aiges

AI Serving framework loader

Language:PythonLicense:Apache-2.0Stargazers:272Issues:0Issues:0

zipper

[Lib][Version 2.1.0][Functional] C++ wrapper around minizip compression library

Language:C++License:MITStargazers:65Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:62357Issues:0Issues:0

Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Language:PythonLicense:Apache-2.0Stargazers:7011Issues:0Issues:0

The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

Language:PostScriptLicense:CC0-1.0Stargazers:16358Issues:0Issues:0