hppanev's starred repositories

llama.cpp

LLM inference in C/C++

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 28147 | Issues: 187 | Issues: 4440

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 25020 | Issues: 206 | Issues: 213

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 21002 | Issues: 178 | Issues: 420

llamafile

Distribute and run LLMs with a single file.

Language: C++ | License: NOASSERTION | Stargazers: 17970 | Issues: 166 | Issues: 382

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | License: MIT | Stargazers: 13954 | Issues: 107 | Issues: 305

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 9548 | Issues: 161 | Issues: 625

storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Language: Python | License: MIT | Stargazers: 8748 | Issues: 68 | Issues: 67

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | Stargazers: 8504 | Issues: 99 | Issues: 1237

ai-town

An MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

Language: TypeScript | License: MIT | Stargazers: 7231 | Issues: 60 | Issues: 90

SillyTavern

LLM Frontend for Power Users.

Language: JavaScript | License: AGPL-3.0 | Stargazers: 7016 | Issues: 61 | Issues: 1372

LLMLingua

To speed up LLM inference and enhance LLMs' perception of key information, LLMLingua compresses the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.

Language: Python | License: MIT | Stargazers: 4285 | Issues: 31 | Issues: 106

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 3110 | Issues: 26 | Issues: 129

cohere-toolkit

Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.

Language: TypeScript | License: MIT | Stargazers: 2625 | Issues: 37 | Issues: 33

secret-llama

Fully private LLM chatbot that runs entirely in a browser with no server needed. Supports Mistral and Llama 3.

Language: TypeScript | License: Apache-2.0 | Stargazers: 2376 | Issues: 15 | Issues: 24

mteb

MTEB: Massive Text Embedding Benchmark

Language: Python | License: Apache-2.0 | Stargazers: 1691 | Issues: 11 | Issues: 365

pywinassistant

The first open source Large Action Model generalist Artificial Narrow Intelligence that fully controls human user interfaces using only natural language. PyWinAssistant utilizes "Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models".

Language: Python | License: MIT | Stargazers: 1250 | Issues: 31 | Issues: 16

lms

LM Studio CLI. Written in TypeScript/Node

Language: TypeScript | License: MIT | Stargazers: 1188 | Issues: 21 | Issues: 43

llm-reasoners

A library for advanced large language model reasoning

Language: Python | License: Apache-2.0 | Stargazers: 1029 | Issues: 14 | Issues: 31

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language: Python | License: Apache-2.0 | Stargazers: 564 | Issues: 9 | Issues: 36

configs

LM Studio JSON configuration file format and a collection of example config files.

llamaduo

This project showcases an LLMOps pipeline that fine-tunes a small LLM to prepare for an outage of the service LLM. For this project, we initially chose Gemini 1.0 Pro as the service LLM and Gemma 2B/7B as the small LLM. It now supports other service LLMs such as GPT4 and Claude3.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 175 | Issues: 5 | Issues: 7

reka-vibe-eval

Multimodal language model benchmark, featuring challenging examples

Language: Python | License: Apache-2.0 | Stargazers: 139 | Issues: 10 | Issues: 1

Retrieval_Head

Open-source code for the paper "Retrieval Head Mechanistically Explains Long-Context Factuality"

Mantis

Official code for the paper "Mantis: Multi-Image Instruction Tuning"

Language: Python | License: Apache-2.0 | Stargazers: 128 | Issues: 8 | Issues: 13

llm-compression-intelligence

Official GitHub repo for the paper "Compression Represents Intelligence Linearly"

Language: Python | License: MIT | Stargazers: 112 | Issues: 3 | Issues: 7

LongEmbed

Official implementation for the paper "LongEmbed: Extending Embedding Models for Long Context Retrieval"

ReAlign

Reformatted Alignment

MAmmoTH2

Official code for "MAmmoTH2: Scaling Instructions from the Web"

Language: Python | License: MIT | Stargazers: 93 | Issues: 2 | Issues: 7

NeoSapiens

The next evolution of Agents

Language: Python | License: MIT | Stargazers: 43 | Issues: 4 | Issues: 2