horizon94's starred repositories

guardrails

Adding guardrails to large language models.

Language:PythonLicense:Apache-2.0Stargazers:3780Issues:0Issues:0

jsonformer

A Bulletproof Way to Generate Structured JSON from Language Models

Language:Jupyter NotebookLicense:MITStargazers:4297Issues:0Issues:0

Viscacha

Viscacha:通用信息抽取数据集收集

License:Apache-2.0Stargazers:21Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:63885Issues:0Issues:0

Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3170Issues:0Issues:0

kor

LLM(😽)

Language:PythonLicense:MITStargazers:1592Issues:0Issues:0

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:18518Issues:0Issues:0

lm-format-enforcer

Enforce the output format (JSON Schema, Regex etc) of a language model

Language:PythonLicense:MITStargazers:1305Issues:0Issues:0

CogVideo

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language:PythonLicense:Apache-2.0Stargazers:5792Issues:0Issues:0

outlines

Structured Text Generation

Language:PythonLicense:Apache-2.0Stargazers:7994Issues:0Issues:0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:3950Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:25264Issues:0Issues:0

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonLicense:Apache-2.0Stargazers:636Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7978Issues:0Issues:0

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:29552Issues:0Issues:0

KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language:PythonStargazers:266Issues:0Issues:0

nanoLM

An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales

Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0

rationalizing-neural-predictions

[in progress] pytorch implementation of Tao Lei's rationalizing neural predictions

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1396Issues:0Issues:0
Language:PythonStargazers:130Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:861Issues:0Issues:0

LLMTest_NeedleInAHaystack2

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2Issues:0Issues:0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1400Issues:0Issues:0

InternLM

Official release of InternLM2.5 base and chat models. 1M context support

Language:PythonLicense:Apache-2.0Stargazers:6123Issues:0Issues:0

Long-Context-Data-Engineering

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Language:PythonStargazers:407Issues:0Issues:0

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonLicense:MITStargazers:518Issues:0Issues:0

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:926Issues:0Issues:0

SEED

Official implementation of SEED-LLaMA (ICLR 2024).

Language:PythonLicense:NOASSERTIONStargazers:552Issues:0Issues:0
Language:RustLicense:Apache-2.0Stargazers:1079Issues:0Issues:0

AI-Gaokao

Gaokao Benchmark for AI

Stargazers:104Issues:0Issues:0