Eric Buehler's repositories
mistral.rs
Blazingly fast LLM inference.
candle-vllm
Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.
diffusion-rs
Blazingly fast inference of diffusion models.
candle_graphs
Graph model execution API for Candle
edge-u-cation
Edge(u)cation: Cutting-edge multimodal LLMs on the edge with mistral.rs, using F8Q8
uqff_maker
Automated generation of UQFF models with mistral.rs.
dora
DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
Graph-Aware-Transformers
Graph-Aware Attention for Adaptive Dynamics in Transformers
llguidance
Super-fast Structured Outputs
llama_index
LlamaIndex is a data framework for your LLM applications
loc
Count lines of code quickly.
mirage
A multi-level tensor algebra superoptimizer
RustPython
A Python Interpreter written in Rust
tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production