Simon Lee's starred repositories
fish-speech
Brand new TTS solution
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
OS-Copilot
A self-improving embodied conversational agent seamlessly integrated into the operating system to automate daily tasks.
distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
UMOE-Scaling-Unified-Multimodal-LLMs
Code for "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
MInference
To speed up long-context LLM inference, MInference computes attention approximately using dynamic sparsity, reducing pre-filling latency by up to 10x on an A100 while maintaining accuracy.
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Flash-VStream
Please refer to our official repo at https://github.com/IVGSZ/Flash-VStream.
EMO-SUPERB-submission
EMO-SUPERB submission