Beast code in Giters

moise-g's starred repositories

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

llama.cpp

LLM inference in C/C++

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.035466 346 2822

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

NOASSERTION27229 286 42

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.020239 156 1537

[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

Language:PythonMIT13708 97 386

pandas-ai

Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

Language:PythonNOASSERTION13501 109 734

mamba

Mamba SSM architecture

Language:PythonApache-2.013204 99 548

ml-engineering

Machine Learning Engineering Open Book

Language:PythonCC-BY-SA-4.011644 117 30

ggml

Tensor library for machine learning

Language:C++MIT11209 131 419

OpenLLM

Run any open-source LLMs, such as Llama, Gemma, as OpenAI compatible API endpoint in the cloud.

Language:PythonApache-2.010051 56 268

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookApache-2.09720 126 145

gpt-prompt-engineer

Language:Jupyter NotebookMIT9368 85 30

RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

Language:Jupyter NotebookNOASSERTION8527 106 16

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++MIT7964 78 168

axolotl

Go ahead and axolotl questions

Language:PythonApache-2.07915 44 676

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT6666 65 83

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonApache-2.06064 56 635

ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Language:PythonMIT3066 41 268

text-embeddings-inference

A blazing fast inference solution for text embeddings models

Language:RustApache-2.02836 34 259

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language:Jupyter NotebookApache-2.02577 30 388

semantic-router

Superfast AI decision making and intelligent processing of multi-modal data.

Language:PythonMIT2107 22 167

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonMIT1833 17 83

optillm

Optimizing inference proxy for LLMs

Language:PythonApache-2.01504 25 38

wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Language:PythonMIT727 13 76

FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

Language:PythonApache-2.0657 7 26

RankGPT

Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]

Language:PythonApache-2.0525 7 21

ir_datasets

Provides a common interface to many IR ranking datasets.

Language:PythonApache-2.0322 10 164

pylate

Late Interaction Models Training & Retrieval

Language:PythonMIT165 9 20

AiTimeline

A timeline of notable generative AI events

Language:HTMLMIT37 2 1