dudulu's starred repositories
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
tensorrtllm_backend
The Triton TensorRT-LLM Backend
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
flash-attention
Fast and memory-efficient exact attention
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
Qwen-Agent
Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Llama3-Chinese-Chat
The first chat model fine-tuned specifically for Chinese with ORPO, based on Meta-Llama-3-8B-Instruct.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Chinese-LLaMA-Alpaca-3
Chinese Llama-Alpaca large language model project, phase three (Chinese Llama-3 LLMs), developed from Meta Llama 3
mlx-examples
Examples in the MLX framework
opentelemetry-python
OpenTelemetry Python API and SDK