xuhzyy's starred repositories
parler-tts
Inference and training library for high-quality TTS models.
RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
RealtimeTTS
Converts text to speech in real time.
HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
cpp-httplib
A C++ header-only HTTP/HTTPS server and client library
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
agentscope
Start building LLM-empowered multi-agent applications in an easier way.
Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
Qwen-Agent
Agent framework and applications built upon Qwen1.5 & Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs