DrakeYang1's starred repositories
whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, GraphRAG, DeepSpeed, vLLM, FastChat, Axolotl, etc.
Qwen2-VL-Finetune
An open-source implementation for fine-tuning the Qwen2-VL series by Alibaba Cloud.
SillyTavern
LLM Frontend for Power Users.
KoboldAI-Client
For GGUF support, see KoboldCPP: https://github.com/LostRuins/koboldcpp
GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Efficient-Live-Portrait
Fast-running LivePortrait with TensorRT and ONNX models
MeshAnythingV2
From anything to mesh like human artists. Official implementation of "MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization"
Stable-Hair
Stable-Hair: Real-World Hair Transfer via Diffusion Model
LivePortrait
Bring portraits to life!