R3xpook's starred repositories

crawl4ai

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Language:PythonLicense:Apache-2.0Stargazers:53962Issues:299Issues:842

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Language:PythonLicense:MITStargazers:44482Issues:266Issues:6698

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language:PythonLicense:Apache-2.0Stargazers:40050Issues:192Issues:6036

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Language:PythonLicense:MITStargazers:38585Issues:326Issues:1528

BitNet

Official inference framework for 1-bit LLMs

Language:PythonLicense:MITStargazers:22094Issues:205Issues:213

Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Language:PythonLicense:MITStargazers:17562Issues:151Issues:185

ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).

Language:PythonLicense:Apache-2.0Stargazers:10129Issues:44Issues:3457

chat-ui

Open source codebase powering the HuggingChat app

Language:TypeScriptLicense:Apache-2.0Stargazers:9182Issues:90Issues:707

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonLicense:Apache-2.0Stargazers:8363Issues:80Issues:799

smol-course

A course on aligning smol models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6412Issues:46Issues:53

Liger-Kernel

Efficient Triton Kernels for LLM Training

Language:PythonLicense:BSD-2-ClauseStargazers:5702Issues:51Issues:262

swarms

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

Language:PythonLicense:Apache-2.0Stargazers:5283Issues:55Issues:401

lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Language:PythonLicense:BSD-3-ClauseStargazers:4718Issues:29Issues:52

OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Language:Jupyter NotebookLicense:MITStargazers:4271Issues:87Issues:177

entropix

Entropy Based Sampling and Parallel CoT Decoding

Language:PythonLicense:Apache-2.0Stargazers:3422Issues:71Issues:40

smollm

Everything about the SmolLM and SmolVLM family of models

Language:PythonLicense:Apache-2.0Stargazers:3279Issues:25Issues:61

distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Language:PythonLicense:Apache-2.0Stargazers:2898Issues:26Issues:477

Smart-AutoClicker

An open-source auto clicker on images for Android

Language:KotlinLicense:GPL-3.0Stargazers:2751Issues:42Issues:527

maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Language:PythonLicense:Apache-2.0Stargazers:2633Issues:34Issues:45

Qwen2-VL-Finetune

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:1204Issues:9Issues:178

self-adaptive-llms

A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!

Language:PythonLicense:Apache-2.0Stargazers:1149Issues:16Issues:15

search-and-learn

Recipes to scale inference-time compute of open models

Language:PythonLicense:Apache-2.0Stargazers:1109Issues:9Issues:28

magpie

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

Language:PythonLicense:MITStargazers:774Issues:5Issues:39

multi1

multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at once.

Language:PythonLicense:MITStargazers:351Issues:6Issues:6

lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Language:PythonLicense:Apache-2.0Stargazers:335Issues:8Issues:62

Meissonic

[ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Language:PythonLicense:Apache-2.0Stargazers:329Issues:7Issues:20

joy-caption-batch

A batch captioning tool for joy_caption

Language:PythonLicense:MITStargazers:186Issues:6Issues:30

Zamba2

PyTorch implementation of models from the Zamba2 series.

Language:PythonLicense:Apache-2.0Stargazers:185Issues:4Issues:3

open-agentinstruct

An open-source recreation of the AgentInstruct agentic workflow for synthetic data generation

Language:PythonLicense:MITStargazers:14Issues:3Issues:0