yuys0602's starred repositories
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
ControlNet
Let us control diffusion models!
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
RTranslator
Open source real-time translation app for Android that runs locally
efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
torchscale
Foundation Architecture for (M)LLMs
Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
sd-webui-reactor
Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)
chinese-llm-benchmark
中文大模型能力评测榜单:目前已囊括106个大模型,覆盖chatgpt、gpt4o、百度文心一言、阿里通义千问、讯飞星火、商汤senseChat、minimax等商用模型, 以及百川、qwen2、glm4、yi、书生internLM2、llama3等开源大模型,多维度能力评测。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!
MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
Awesome-LLM-Watermark
UP-TO-DATE LLM Watermark paper. 🔥🔥🔥
AIGC_text_detector
The official codes of our work on AIGC detection: "Multiscale Positive-Unlabeled Detection of AI-Generated Texts" (ICLR'24 Spotlight)
FourierKAN-mnist
MNIST example using Kolmogorov-Arnold Networks
WaterBench
[ACL2024-Main] Data and Code for WaterBench: Towards Holistic Evaluation of LLM Watermarks