Filippo Pedrazzini's repositories
petals-model-converter
Convert any HF Model to Petals Optimized Format
llama-explorer
🚀 Get instant insights in the open-source AI landscape. Most starred repositories, top contirbutors and most used programming languages in a single simple UI.
chat.petals.dev
Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client
burn-model-serving
End to end example on How to Serve a Burn Model
burn-with-wasm
Example of using Burn with Web Assembly
deep-learning-with-rust
Examples and snippets to get started with Deep Learning and Rust.
gprc-microservice
Simple Microservice with gRPC
health.petals.dev
Health monitor for a Petals swarm
hello-world-nomad
Testing Nomad as Cross Platform Orchestrator
litellm
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
llama2-burn
Llama2 LLM ported to Rust burn
LocalAI
:robot: The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs ggml, gguf, GPTQ, onnx, TF compatible models: llama, llama2, rwkv, whisper, vicuna, koala, cerebras, falcon, dolly, starcoder, and many others
localai-website
LocalAI website, powered by Hugo
machine-learning-with-rust
Repository containing Machine Learning snippets and examples using Rust and the most known ML frameworks.
openplayground
An LLM playground you can run on your laptop
prem-app
Prem provides a unified environment to develop AI applications and deploy AI models on your infrastructure
promptfoo
Test your prompts. Evaluate and compare LLM outputs, catch regressions, and improve prompt quality.
prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
q-learning-with-rust
Q-Learning Implementation with Rust
s3gw
Container able to run on a Kubernetes cluster, providing S3-compatible endpoints to applications.
state-of-open-source-ai
Clarity in the current fast-paced mess of Open Source innovation
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (GGUF), Llama models.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
whisper.cpp
Port of OpenAI's Whisper model in C/C++
whispercpp
Pybind11 bindings for Whisper.cpp