3lLobo's starred repositories

ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

code-server

VS Code in the browser

Language:TypeScriptLicense:MITStargazers:67455Issues:726Issues:3500

llama.cpp

LLM inference in C/C++

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookLicense:MITStargazers:61569Issues:524Issues:111

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:34609Issues:343Issues:2705

whisper.cpp

Port of OpenAI's Whisper model in C/C++

autogen

A programming framework for agentic AI 🤖

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:30461Issues:373Issues:1597

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:25950Issues:223Issues:4274

fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15698Issues:104Issues:1012

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:15109Issues:98Issues:768

QAnything

Question and Answer based on Anything.

Language:PythonLicense:AGPL-3.0Stargazers:11272Issues:101Issues:372

h2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

Language:PythonLicense:Apache-2.0Stargazers:11187Issues:159Issues:1122

llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

Language:TypeScriptLicense:MITStargazers:10676Issues:81Issues:127

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:10626Issues:110Issues:21

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:9841Issues:162Issues:691

promptflow

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Language:PythonLicense:MITStargazers:9137Issues:100Issues:538

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language:PythonLicense:Apache-2.0Stargazers:8630Issues:85Issues:744

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:8098Issues:86Issues:1763

rags

Build ChatGPT over your data, all with natural language

Language:PythonLicense:MITStargazers:6192Issues:56Issues:39

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5486Issues:64Issues:98

deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Language:PythonLicense:NOASSERTIONStargazers:2969Issues:56Issues:136

twinny

The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but completely free and 100% private.

Language:TypeScriptLicense:MITStargazers:2279Issues:14Issues:154

zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Language:PythonLicense:Apache-2.0Stargazers:1657Issues:30Issues:72

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Language:C++License:Apache-2.0Stargazers:1637Issues:33Issues:639

auditd

Best Practice Auditd Configuration

pico-tpmsniffer

A simple, very experimental TPM sniffer for LPC bus

Language:CLicense:NOASSERTIONStargazers:506Issues:24Issues:1

docker-suricata

A Suricata Docker image.

Language:ShellLicense:MITStargazers:248Issues:13Issues:28

nvtrust

Ancillary open source software to support confidential computing on NVIDIA GPUs

Language:PythonLicense:Apache-2.0Stargazers:185Issues:16Issues:46

.dotfiles

Dotfiles for nvim, oh-my-zsh, and other stuff…

Language:LuaStargazers:6Issues:2Issues:0