Guan-Ting (Daniel) Lin's starred repositories
leetcode-master
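《代码随想录》 (Code Thoughts) LeetCode practice guide: a recommended order for working through 200 classic problems, about 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages. No more feeling lost when studying algorithms! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀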
Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
LLMsPracticalGuide
A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)
VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
parler-tts
Inference and training library for high-quality TTS models.
stable-audio-tools
Generative models for conditional audio generation
Awesome-Graph-LLM
A collection of AWESOME things about Graph-Related LLMs.
HierSpeechpp
The official implementation of HierSpeech++
acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
control-vc
This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"
DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
SpeechAgents
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
Interspeech2024_DiscreteSpeechChallenge
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
Spatial-AST
🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)
emphassess
This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAssess paper (de Seyssel et al., 2023).