iiLaurens's starred repositories
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Liger-Kernel
Efficient Triton Kernels for LLM Training
transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
promptbench
A unified evaluation framework for large language models
Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
MInference
[NeurIPS'24 Spotlight] Speeds up long-context LLM inference by computing attention with approximate, dynamic sparse methods, reducing pre-filling latency by up to 10x on an A100 while maintaining accuracy.
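The core idea behind dynamic sparse attention can be illustrated with a toy sketch: score every key cheaply, then attend only over the top-k keys instead of all of them. This is a stdlib-only illustration of the general technique, not MInference's actual kernels or API.

```python
import math

def softmax(xs):
    # numerically stable softmax over a list of floats
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def sparse_attention(q, keys, values, k=2):
    """Toy dynamic sparse attention: attend only over the k
    highest-scoring keys (illustrative sketch, not MInference)."""
    scale = math.sqrt(len(q))
    scores = [sum(qi * ki for qi, ki in zip(q, key)) / scale for key in keys]
    # indices of the k largest scores, chosen per-query ("dynamic")
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    weights = softmax([scores[i] for i in top])
    dim = len(values[0])
    out = [0.0] * dim
    for w, i in zip(weights, top):
        for d in range(dim):
            out[d] += w * values[i][d]
    return out
```

With k much smaller than the sequence length, the weighted sum touches only k value vectors instead of all of them, which is where the pre-filling speedup comes from.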
dom-to-semantic-markdown
DOM to Semantic-Markdown for use with LLMs
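The DOM-to-Markdown transformation can be sketched with Python's stdlib `html.parser`: walk the element tree and emit Markdown equivalents for a few common tags. This is a minimal illustration of the idea, not dom-to-semantic-markdown's actual API or tag coverage.

```python
from html.parser import HTMLParser

class MarkdownConverter(HTMLParser):
    """Minimal HTML-to-Markdown sketch (stdlib only)."""
    def __init__(self):
        super().__init__()
        self.out = []
        self.href = None

    def handle_starttag(self, tag, attrs):
        if tag == "h1":
            self.out.append("# ")
        elif tag == "h2":
            self.out.append("## ")
        elif tag in ("strong", "b"):
            self.out.append("**")
        elif tag == "a":
            self.href = dict(attrs).get("href", "")
            self.out.append("[")
        elif tag == "li":
            self.out.append("- ")

    def handle_endtag(self, tag):
        if tag in ("strong", "b"):
            self.out.append("**")
        elif tag == "a":
            self.out.append(f"]({self.href})")
            self.href = None
        elif tag in ("h1", "h2", "p", "li"):
            self.out.append("\n")

    def handle_data(self, data):
        self.out.append(data)

def to_markdown(html):
    conv = MarkdownConverter()
    conv.feed(html)
    return "".join(conv.out)
```

The resulting Markdown is far more token-efficient for an LLM than raw HTML, since tags, attributes, and scripts are stripped while headings, links, and emphasis survive.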
formatspread
Code accompanying "How I learned to start worrying about prompt formatting".
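The kind of format spread the paper studies can be sketched by enumerating semantically equivalent prompt templates that differ only in separators and casing; the function and parameter names below are illustrative, not formatspread's actual API.

```python
import itertools

def format_variants(field_seps=(": ", " - "), item_seps=("\n", "; "),
                    cases=(str.title, str.upper)):
    """Yield renderers for every combination of formatting choices;
    each renders the same fields with different surface formatting."""
    for fs, isep, case in itertools.product(field_seps, item_seps, cases):
        def render(fields, fs=fs, isep=isep, case=case):
            return isep.join(f"{case(k)}{fs}{v}" for k, v in fields)
        yield render

# The same question-answer pair rendered in 2 * 2 * 2 = 8 formats
prompts = [render([("question", "2+2?"), ("answer", "")])
           for render in format_variants()]
```

Measuring a model's accuracy across such a spread (rather than on one arbitrary format) is the paper's point: performance can vary substantially between formats that a human would consider interchangeable.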