moise-g

moise-g's starred repositories

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

llama.cpp

LLM inference in C/C++

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 35466 | Issues: 346 | Issues: 2822

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 20239 | Issues: 156 | Issues: 1537

SWE-agent

[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.

Language: Python | License: MIT | Stargazers: 13708 | Issues: 97 | Issues: 386

pandas-ai

Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

Language: Python | License: NOASSERTION | Stargazers: 13501 | Issues: 109 | Issues: 734

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 13204 | Issues: 99 | Issues: 548

ml-engineering

Machine Learning Engineering Open Book

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 11644 | Issues: 117 | Issues: 30

ggml

Tensor library for machine learning

OpenLLM

Run any open-source LLMs, such as Llama, Gemma, as OpenAI compatible API endpoint in the cloud.

Language: Python | License: Apache-2.0 | Stargazers: 10051 | Issues: 56 | Issues: 268

mistral-inference

Official inference library for Mistral models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9720 | Issues: 126 | Issues: 145

Language: Jupyter Notebook | License: MIT | Stargazers: 9368 | Issues: 85 | Issues: 30

RAG_Techniques

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 8527 | Issues: 106 | Issues: 16

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language: C++ | License: MIT | Stargazers: 7964 | Issues: 78 | Issues: 168

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | Stargazers: 7915 | Issues: 44 | Issues: 676

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6666 | Issues: 65 | Issues: 83

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language: Python | License: Apache-2.0 | Stargazers: 6064 | Issues: 56 | Issues: 635

ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Language: Python | License: MIT | Stargazers: 3066 | Issues: 41 | Issues: 268

text-embeddings-inference

A blazing fast inference solution for text embeddings models

Language: Rust | License: Apache-2.0 | Stargazers: 2836 | Issues: 34 | Issues: 259

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2577 | Issues: 30 | Issues: 388

semantic-router

Superfast AI decision making and intelligent processing of multi-modal data.

Language: Python | License: MIT | Stargazers: 2107 | Issues: 22 | Issues: 167

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language: Python | License: MIT | Stargazers: 1833 | Issues: 17 | Issues: 83

optillm

Optimizing inference proxy for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 1504 | Issues: 25 | Issues: 38

wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Language: Python | License: MIT | Stargazers: 727 | Issues: 13 | Issues: 76

FlashRank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

Language: Python | License: Apache-2.0 | Stargazers: 657 | Issues: 7 | Issues: 26

RankGPT

Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]

Language: Python | License: Apache-2.0 | Stargazers: 525 | Issues: 7 | Issues: 21

ir_datasets

Provides a common interface to many IR ranking datasets.

Language: Python | License: Apache-2.0 | Stargazers: 322 | Issues: 10 | Issues: 164

pylate

Late Interaction Models Training & Retrieval

Language: Python | License: MIT | Stargazers: 165 | Issues: 9 | Issues: 20

AiTimeline

A timeline of notable generative AI events

Language: HTML | License: MIT | Stargazers: 37 | Issues: 2 | Issues: 1