Riccorl

Riccardo Orlando's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | 20843 197 2982

fx

Terminal JSON viewer & processor

Language: Go | License: MIT | 18630 120 199

OpenLLM

Run any open-source LLM, such as Llama 2 or Mistral, as an OpenAI-compatible API endpoint in the cloud.

Language: Python | License: Apache-2.0 | 9104 54 251

WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Language: Python | 9006 112 189

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language: Python | License: Apache-2.0 | 8082 73 388

Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, vLLM for local or cloud deployment. Demo apps to showcase Llama2 for WhatsApp & Messenger.

Language: Jupyter Notebook | License: NOASSERTION | 7850 68 227

llm

An ecosystem of Rust libraries for working with large language models

Language: Rust | License: Apache-2.0 | 6016 50 231

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language: Python | License: MIT | 3272 56 94

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python | License: NOASSERTION | 2583 36 131

langui

UI for your AI. Open Source Tailwind components tailored for your GPT, generative AI, and LLM projects.

Language: HTML | License: MIT | 2017 16 16

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language: Python | License: MIT | 2002 23 155

trulens

Evaluation and Tracking for LLM Experiments

Language: Jupyter Notebook | License: MIT | 1760 16 238

instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language: Python | License: Apache-2.0 | 1757 18 104

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language: Python | License: Apache-2.0 | 1575 42 20

keras-core

A multi-backend implementation of the Keras API, with support for TensorFlow, JAX, and PyTorch.

Language: Python | License: Apache-2.0 | 1268 27 172

catalog

🏆 📚 A list of awesome MkDocs projects and plugins.

Language: Python | License: CC-BY-SA-4.0 | 877 15 21

tinyvector

A tiny nearest-neighbor embedding database built with SQLite and PyTorch. (In development!)

Language: Python | License: MIT | 769 10 10

mosec

A high-performance ML model serving framework that offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

Language: Python | License: Apache-2.0 | 714 13 96

tokenmonster

Ungreedy subword tokenizer and vocabulary trainer for Python, Go & JavaScript

Language: Go | License: MIT | 516 10 26

neurips_llm_efficiency_challenge

NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day

Language: Python | 240 16 16

llama2.rs

Inference Llama 2 in one file of pure Rust 🦀

Language: Python | License: MIT | 222 40

llm_qlora

Fine-tuning LLMs using QLoRA

Language: Jupyter Notebook | License: MIT | 215 4 9

optimum-benchmark

A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.

Language: Python | License: Apache-2.0 | 206 6 68