iiLaurens

iiLaurens's starred repositories

lazygit

simple terminal UI for git commands

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 32236 | Issues: 204 | Issues: 4960

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language: Python | License: MIT | Stargazers: 17985 | Issues: 112 | Issues: 473

haystack

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python | License: Apache-2.0 | Stargazers: 17087 | Issues: 138 | Issues: 3533

MinerU

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and e-book extraction.

Language: Python | License: AGPL-3.0 | Stargazers: 12760 | Issues: 71 | Issues: 428

FlexiGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python | License: Apache-2.0 | Stargazers: 9161 | Issues: 112 | Issues: 82

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | Stargazers: 8548 | Issues: 46 | Issues: 586

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language: C++ | License: MIT | Stargazers: 7913 | Issues: 77 | Issues: 162

gotenberg

A developer-friendly API for converting numerous document formats into PDF files, and more!

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language: Python | License: Apache-2.0 | Stargazers: 5543 | Issues: 56 | Issues: 555

aim

Aim 💫: An easy-to-use & supercharged open-source experiment tracker.

Language: Python | License: Apache-2.0 | Stargazers: 5176 | Issues: 45 | Issues: 1024

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 4394 | Issues: 35 | Issues: 1410

Liger-Kernel

Efficient Triton Kernels for LLM Training

Language: Python | License: BSD-2-Clause | Stargazers: 3183 | Issues: 35 | Issues: 71

transformer-explainer

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

Language: JavaScript | License: MIT | Stargazers: 2730 | Issues: 29 | Issues: 17

promptbench

A unified evaluation framework for large language models

Language: Python | License: MIT | Stargazers: 2413 | Issues: 21 | Issues: 53

langroid

Harness LLMs with Multi-Agent Programming

Language: Python | License: MIT | Stargazers: 2378 | Issues: 19 | Issues: 156

curl_cffi

Python binding for the curl-impersonate fork via cffi. An HTTP client that can impersonate browser TLS/JA3/HTTP2 fingerprints.

Language: Python | License: MIT | Stargazers: 2174 | Issues: 33 | Issues: 311

usearch

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

Language: C++ | License: Apache-2.0 | Stargazers: 2155 | Issues: 26 | Issues: 150

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language: Python | License: Apache-2.0 | Stargazers: 2136 | Issues: 33 | Issues: 238

infinity

Infinity is a high-throughput, low-latency REST API for serving text embeddings, reranking models, and CLIP.

Language: Python | License: MIT | Stargazers: 1349 | Issues: 18 | Issues: 154

thepipe

Extract clean data from anywhere, powered by vision-language models ⚡

Language: Python | License: MIT | Stargazers: 1139 | Issues: 11 | Issues: 17

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

MInference

[NeurIPS'24 Spotlight] To speed up long-context LLM inference, MInference computes attention with approximate, dynamic sparsity, which reduces pre-filling latency by up to 10x on an A100 while maintaining accuracy.

Language: Python | License: MIT | Stargazers: 733 | Issues: 6 | Issues: 54

dom-to-semantic-markdown

DOM to Semantic-Markdown for use with LLMs

Language: TypeScript | License: MIT | Stargazers: 654 | Issues: 7 | Issues: 14

puncc

👋 Puncc is a Python library for predictive uncertainty quantification using conformal prediction.

INTERS

This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"

Language: Python | License: MIT | Stargazers: 196 | Issues: 23 | Issues: 6

pyftpsync

Synchronize directories using FTP(S), SFTP, or file system access.

Language: Python | License: MIT | Stargazers: 117 | Issues: 10 | Issues: 54

formatspread

Code accompanying "How I learned to start worrying about prompt formatting".

Language: Python | License: MIT | Stargazers: 89 | Issues: 1 | Issues: 2