adrien's starred repositories
llama-stack
Model components of the Llama Stack APIs
RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
graphql-engine
Blazing fast, instant realtime GraphQL APIs on your DB with fine grained access control, also trigger webhooks on database events.
ExtractThinker
ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.
neural-speed
An innovative library for efficient LLM inference via low-bit quantization
mteb-french
MTEB: Massive Text Embedding Benchmark French extended
pgvecto.rs
Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.
argo-rollouts
Progressive Delivery for Kubernetes