Kyle Huang's starred repositories

alist

🗂️ A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs.

Language: Go | License: AGPL-3.0 | Stargazers: 42875 | Issues: 195 | Issues: 3616

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 33670 | Issues: 207 | Issues: 5160

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language: Python | License: AGPL-3.0 | Stargazers: 25808 | Issues: 178 | Issues: 130

ExoPlayer

This project is deprecated and stale. The latest ExoPlayer code is available in https://github.com/androidx/media

Language: Java | License: Apache-2.0 | Stargazers: 21735 | Issues: 838 | Issues: 10132

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language: C++ | License: NOASSERTION | Stargazers: 20410 | Issues: 573 | Issues: 3529

CogVideo

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 8435 | Issues: 121 | Issues: 351

AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 8038 | Issues: 96 | Issues: 104

bypy

Python client for Baidu Yun / Baidu Netdisk (Personal Cloud Storage)

Language: Python | License: MIT | Stargazers: 7916 | Issues: 297 | Issues: 580

clappr

🎬 An extensible media player for the web.

Language: JavaScript | License: BSD-3-Clause | Stargazers: 7107 | Issues: 235 | Issues: 1504

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative AI (AIGC), easy-to-use APIs, an awesome model zoo, and diffusion models for text-to-image generation, image/video restoration/enhancement, etc.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6936 | Issues: 97 | Issues: 707

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language: Python | License: Apache-2.0 | Stargazers: 6542 | Issues: 57 | Issues: 152

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Language: Python | License: Apache-2.0 | Stargazers: 5150 | Issues: 32 | Issues: 552

ms-swift

Use PEFT or full-parameter training to fine-tune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

Language: Python | License: Apache-2.0 | Stargazers: 4085 | Issues: 22 | Issues: 1239

Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 2914 | Issues: 26 | Issues: 337

openmv

OpenMV Camera Module

elevenlabs-python

The official Python API for ElevenLabs Text to Speech.

Language: Python | License: MIT | Stargazers: 2175 | Issues: 37 | Issues: 256

media

Jetpack Media3 support libraries for media use cases, including ExoPlayer, an extensible media player for Android

Language: Java | License: Apache-2.0 | Stargazers: 1679 | Issues: 46 | Issues: 1611

Chinese-LLaMA-Alpaca-3

Chinese Alpaca LLM project, phase 3 (Chinese Llama-3 LLMs), developed from Meta Llama 3

Language: Python | License: Apache-2.0 | Stargazers: 1658 | Issues: 20 | Issues: 78

python-wechaty

Python Wechaty is a Conversational RPA SDK for Chatbot Makers written in Python

Language: Python | License: Apache-2.0 | Stargazers: 1642 | Issues: 27 | Issues: 280

HuixiangDou

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Language: Python | License: BSD-3-Clause | Stargazers: 1507 | Issues: 23 | Issues: 36

cloudflare-docker-proxy

A Docker registry proxy that runs on Cloudflare Workers.

ComfyUI-GGUF

GGUF Quantization support for native ComfyUI models

Language: Python | License: Apache-2.0 | Stargazers: 962 | Issues: 12 | Issues: 136

RealBasicVSR

Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

Language: Python | License: Apache-2.0 | Stargazers: 916 | Issues: 13 | Issues: 87

Micro-Wheeled_leg-Robot

The world's smallest desktop-grade two-wheeled legged robot!

phoenix-battleship

The good old game, built with Elixir, Phoenix, React and Redux

Language: Elixir | License: MIT | Stargazers: 526 | Issues: 20 | Issues: 7

tennis_analysis

This project analyzes tennis players in a video to measure their speed, ball shot speed, and number of shots. It detects the players and the tennis ball using YOLO and uses CNNs to extract court keypoints. This hands-on project is perfect for polishing your machine learning and computer vision skills.

Language: Jupyter Notebook | Stargazers: 441 | Issues: 12 | Issues: 10

ffmpegcv

ffmpegcv is an FFmpeg backbone for an OpenCV-like video reader and writer

tty2048

Terminal-based 2048 game written in Elixir

Language: Elixir | License: ISC | Stargazers: 155 | Issues: 10 | Issues: 0

ComfyUI-LLMs

An extremely simple node for calling LLMs

Language: Python | License: GPL-3.0 | Stargazers: 20 | Issues: 2 | Issues: 4