Kyle Huang's starred repositories
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
so-vits-svc
SoftVC VITS Singing Voice Conversion
AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
DiffSynth-Studio
Enjoy the magic of Diffusion models!
ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
elevenlabs-python
The official Python API for ElevenLabs Text to Speech.
Chinese-LLaMA-Alpaca-3
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
python-wechaty
Python Wechaty is a Conversational RPA SDK for Chatbot Makers written in Python
HuixiangDou
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
cloudflare-docker-proxy
A docker registry proxy run on cloudflare worker.
ComfyUI-GGUF
GGUF Quantization support for native ComfyUI models
RealBasicVSR
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"
Micro-Wheeled_leg-Robot
全球最小的桌面级双轮腿机器人!
phoenix-battleship
The Good Old game, built with Elixir, Phoenix, React and Redux
tennis_analysis
This project analyzes Tennis players in a video to measure their speed, ball shot speed and number of shots. This project will detect players and the tennis ball using YOLO and also utilizes CNNs to extract court keypoints. This hands on project is perfect for polishing your machine learning, and computer vision skills.
ComfyUI-LLMs
An extremely simple call to the LLMs model node