simonsanvil

Simon S. Viloria's starred repositories

llama.cpp

LLM inference in C/C++

Language:C++MIT5879600

Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Language:Jupyter NotebookApache-2.0306700

lago

Open Source Metering and Usage Based Billing API ⭐️ Consumption tracking, Subscription management, Pricing iterations, Payment orchestration & Revenue analytics

Language:ShellAGPL-3.0623400

outlines

Structured Text Generation

Language:PythonApache-2.0604700

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Language:PythonMIT304900

awesome-public-real-time-datasets

A list of publicly available datasets with real-time data maintained by the team at bytewax.io

CC0-1.041400

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonApache-2.0809000

ensemble-instruct

codebase release for EMNLP2023 paper publication

Language:PythonApache-2.01900

instructlab

InstructLab Command-Line Interface. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.

Language:PythonApache-2.041900

LLMLingua

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Language:PythonMIT400100

dspy-redteam

Red-Teaming Language Models with DSPy

Language:Python6800

starlark

Starlark Language

Language:StarlarkApache-2.0224600

logfire

Uncomplicated Observability for Python and beyond! 🪵🔥

Language:PythonMIT140300

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.03487000

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language:PythonApache-2.0715200

cognita

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Language:PythonApache-2.0169600

[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative for human evaluation and GPT-4 evaluation.

Language:PythonMIT26500

simonsanvil

Simon S. Viloria's starred repositories

llama.cpp

Promptify

lago

outlines

exllamav2

awesome-public-real-time-datasets

text-generation-inference

ensemble-instruct

instructlab

LLMLingua

LookaheadDecoding

dspy-redteam

starlark

logfire

FastChat

txtai

cognita

prometheus

pytype

text-generation-inference

argilla

llm4regression

ml-engineering

mlx

distilabel

unsloth

spacy-llm

presidio-research

terraform-ibm-cloud-pak

cloud-pak-cli