Hertz's repositories
paper-list
autoupdate paper list
LLM-101-Bootcamp
🚀这里是LLM-AI 101创造营,人人都是全民制作人
house-of-model-cards
(HOMC)house-of-model-cards
multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
Awesome-LLM-Post-training
Awesome Reasoning LLM Tutorial/Survey/Guide
awesome-LLM-resourses
🧑🚀 全世界最好的LLM资料总结(数据处理、模型训练、模型部署、o1 模型、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.
Awesome-RL-based-Reasoning-MLLMs
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!
byte-langmanus-web
The web UI for LangManus.
ColossalAI
Making large AI models cheaper, faster and more accessible
ComfyUI_examples
Examples of ComfyUI workflows
comfyui_LLM_party
LLM Agent Framework in ComfyUI includes MCP sever, Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfaces, such as o1,ollama, gemini, grok, qwen, GLM, deepseek, kimi,doubao. Adapted to local llms, vlm, gguf such as llama-3.3 Janus-Pro, Linkage graphRAG
DeepSeek-VL2
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
fairseq2
FAIR Sequence Modeling Toolkit 2
Griffon
Official repo of Griffon series including v1(ECCV 2024), v2, and G
langmanus
A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search, crawling, and Python code execution, while giving back to the community that made this possible.
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
maestro
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
motia
AI Agent Framework For Software Engineers
OpenSeek
OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next-generation models that surpass DeepSeek.
oumi
Everything you need to build state-of-the-art foundation models, end-to-end.
trl
Train transformer language models with reinforcement learning.
unsloth
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
zju-icicles
浙江大学课程攻略共享计划