Beast code in Giters

R3xpook's starred repositories

crawl4ai

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Language:PythonApache-2.053962 299 842

llama_index

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Language:PythonMIT44482 266 6698

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language:PythonApache-2.040050 192 6036

crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Language:PythonMIT38585 326 1528

BitNet

Official inference framework for 1-bit LLMs

Language:PythonMIT22094 205 213

Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Language:PythonMIT17562 151 185

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).

Language:PythonApache-2.010129 44 3457

chat-ui

Open source codebase powering the HuggingChat app

Language:TypeScriptApache-2.09182 90 707

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonApache-2.08363 80 799

smol-course

A course on aligning smol models.

Language:Jupyter NotebookApache-2.06412 46 53

Liger-Kernel

Efficient Triton Kernels for LLM Training

Language:PythonBSD-2-Clause5702 51 262

swarms

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

Language:PythonApache-2.05283 55 401

lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Language:PythonBSD-3-Clause4718 29 52

OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Language:Jupyter NotebookMIT4271 87 177

entropix

Entropy Based Sampling and Parallel CoT Decoding

Language:PythonApache-2.03422 71 40

smollm

Everything about the SmolLM and SmolVLM family of models

Language:PythonApache-2.03279 25 61

distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Language:PythonApache-2.02898 26 477

Smart-AutoClicker

An open-source auto clicker on images for Android

Language:KotlinGPL-3.02751 42 527

maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Language:PythonApache-2.02633 34 45

Qwen2-VL-Finetune

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Language:PythonApache-2.01204 9 178

self-adaptive-llms

A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!

Language:PythonApache-2.01149 16 15

search-and-learn

Recipes to scale inference-time compute of open models

Language:PythonApache-2.01109 9 28

magpie

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

Language:PythonMIT774 5 39

multi1

multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at once.

Language:PythonMIT351 6 6

lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Language:PythonApache-2.0335 8 62

Meissonic

[ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Language:PythonApache-2.0329 7 20

joy-caption-batch

A batch captioning tool for joy_caption

Language:PythonMIT186 6 30

Zamba2

PyTorch implementation of models from the Zamba2 series.

Language:PythonApache-2.0185 4 3

open-agentinstruct

An open-source recreation of the AgentInstruct agentic workflow for synthetic data generation

Language:PythonMIT14 30

R3xpook