Simon S. Viloria (simonsanvil)

simonsanvil

Geek Repo

Company:IBM

Location:Madrid, Spain

Home Page:simonsviloria.notion.site

Twitter:@simon_s_viloria

Github PK Tool:Github PK Tool

Simon S. Viloria's starred repositories

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:58796Issues:0Issues:0

Promptify

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3067Issues:0Issues:0

lago

Open Source Metering and Usage Based Billing API ⭐️ Consumption tracking, Subscription management, Pricing iterations, Payment orchestration & Revenue analytics

Language:ShellLicense:AGPL-3.0Stargazers:6234Issues:0Issues:0

outlines

Structured Text Generation

Language:PythonLicense:Apache-2.0Stargazers:6047Issues:0Issues:0

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Language:PythonLicense:MITStargazers:3049Issues:0Issues:0

awesome-public-real-time-datasets

A list of publicly available datasets with real-time data maintained by the team at bytewax.io

License:CC0-1.0Stargazers:414Issues:0Issues:0

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8090Issues:0Issues:0

ensemble-instruct

codebase release for EMNLP2023 paper publication

Language:PythonLicense:Apache-2.0Stargazers:19Issues:0Issues:0

instructlab

InstructLab Command-Line Interface. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.

Language:PythonLicense:Apache-2.0Stargazers:419Issues:0Issues:0

LLMLingua

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Language:PythonLicense:MITStargazers:4001Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:998Issues:0Issues:0

dspy-redteam

Red-Teaming Language Models with DSPy

Language:PythonStargazers:68Issues:0Issues:0

starlark

Starlark Language

Language:StarlarkLicense:Apache-2.0Stargazers:2246Issues:0Issues:0

logfire

Uncomplicated Observability for Python and beyond! 🪵🔥

Language:PythonLicense:MITStargazers:1403Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:34870Issues:0Issues:0

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language:PythonLicense:Apache-2.0Stargazers:7152Issues:0Issues:0

cognita

RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry

Language:PythonLicense:Apache-2.0Stargazers:1696Issues:0Issues:0

prometheus

[ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically designed for fine-grained evaluation on a customized score rubric, Prometheus is a good alternative for human evaluation and GPT-4 evaluation.

Language:PythonLicense:MITStargazers:265Issues:0Issues:0

pytype

A static type analyzer for Python code

Language:PythonLicense:NOASSERTIONStargazers:4632Issues:0Issues:0

text-generation-inference

IBM development fork of https://github.com/huggingface/text-generation-inference

Language:PythonLicense:Apache-2.0Stargazers:41Issues:0Issues:0

argilla

Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.

Language:PythonLicense:Apache-2.0Stargazers:3156Issues:0Issues:0

llm4regression

Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their context, without any parameter update

Language:PythonStargazers:97Issues:0Issues:0

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:9982Issues:0Issues:0

mlx

MLX: An array framework for Apple silicon

Language:C++License:MITStargazers:14888Issues:0Issues:0

distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Language:PythonLicense:Apache-2.0Stargazers:983Issues:0Issues:0

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:10306Issues:0Issues:0

spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

Language:PythonLicense:MITStargazers:964Issues:0Issues:0

presidio-research

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.

Language:PythonLicense:MITStargazers:153Issues:0Issues:0

terraform-ibm-cloud-pak

Terraform modules and examples to support installation for IBM Cloud Paks onto OpenShift clusters

Language:HCLLicense:Apache-2.0Stargazers:8Issues:0Issues:0

cloud-pak-cli

Cloudctl is a command line tool to manage Container Application Software for Enterprises (CASE)

License:NOASSERTIONStargazers:19Issues:0Issues:0