Tsai Meng-Ting's starred repositories
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
youre-the-os
A game where you are a computer's OS and you have to manage processes, memory and I/O events.
vits2_pytorch
unofficial vits2-TTS implementation in pytorch
voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
vocode-core
🤖 Build voice-based LLM agents. Modular + open source.
Auto_Tuning_Zeroshot_TTS_and_VC
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis", Interspeech 2023
Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
elevenlabs-python
The official Python API for ElevenLabs Text to Speech.
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
spear-tts-pytorch
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
quivr
Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Efficient retrieval augmented generation framework
audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
audiowmark
Audio Watermarking
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
S.A.T.U.R.D.A.Y
A toolbox for working with WebRTC, Audio and AI
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
chordfinder
不囉唆的和弦代號查詢器 by NiceChord 好和弦