moise-g's starred repositories
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
ml-engineering
Machine Learning Engineering Open Book
mistral-inference
Official inference library for Mistral models
RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
text-embeddings-inference
A blazing fast inference solution for text embeddings models
semantic-router
Superfast AI decision making and intelligent processing of multi-modal data.
ir_datasets
Provides a common interface to many IR ranking datasets.
AiTimeline
A timeline of notable generative AI events