Jack's repositories
agents
A collection of production-ready subagents for Claude Code
clash-for-linux-install
😼 优雅地部署基于 clash/mihomo 的代理环境
context-engineering-intro
Context engineering is the new vibe coding - it's the way to actually make AI coding assistants work. Claude Code is the best for this so that's what this repo is centered around, but you can apply this strategy with any AI coding assistant!
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
easy-dataset
A powerful tool for creating fine-tuning datasets for LLM
easy-llm-cli
An open-source AI agent that is compatible with multiple LLM models
EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
gemini-cli
An open-source AI agent that brings the power of Gemini directly into your terminal.
Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
InternNav
InternRobotics' open platform for building generalized navigation foundation models.
Isaac-GR00T
NVIDIA Isaac GR00T N1 is the world's first open foundation model for generalized humanoid robot reasoning and skills.
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
llama_index
LlamaIndex is a data framework for your LLM applications
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
mem0
The Memory layer for your AI apps
MiniCPM-o
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and Video Understanding on Your Phone
molmo
Code for the Molmo Vision-Language Model
NavRL
[IEEE RA-L'25] NavRL: Learning Safe Flight in Dynamic Environments (NVIDIA Isaac/Python/ROS1/ROS2)
NeuPAN
[TRO 2025] NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning.
Open-Reasoner-Zero
Official Repo for Open-Reasoner-Zero
owl
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Qwen3-SmVL
将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
rlds_dataset_builder
An example RLDS dataset builder for X-embodiment dataset conversion.
SenseVoice
Multilingual Voice Understanding Model
v2rayN
A GUI client for Windows, Linux and macOS, support Xray and sing-box and others