Trần Nhật Quý's starred repositories

generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language: Jupyter Notebook | License: MIT | Stargazers: 65023 | Issues: 554 | Issues: 128

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 27079 | Issues: 225 | Issues: 262

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language: Python | License: MIT | Stargazers: 18982 | Issues: 122 | Issues: 510

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | License: MIT | Stargazers: 18728 | Issues: 140 | Issues: 811

unsloth

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 18050 | Issues: 124 | Issues: 994

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 12557 | Issues: 103 | Issues: 576

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 9179 | Issues: 85 | Issues: 36

FlexiGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python | License: Apache-2.0 | Stargazers: 9178 | Issues: 112 | Issues: 82

ragas

Supercharge Your LLM Application Evaluations 🚀

Language: Python | License: Apache-2.0 | Stargazers: 7187 | Issues: 36 | Issues: 869

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language: Python | License: Apache-2.0 | Stargazers: 6005 | Issues: 57 | Issues: 629

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language: Python | License: BSD-3-Clause | Stargazers: 5663 | Issues: 61 | Issues: 104

promptbase

All things prompt engineering

Language: Python | License: MIT | Stargazers: 5421 | Issues: 59 | Issues: 15

Language: Jupyter Notebook | License: MIT | Stargazers: 3949 | Issues: 68 | Issues: 23

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language: Python | License: Apache-2.0 | Stargazers: 1430 | Issues: 19 | Issues: 54

llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Language: Python | License: MIT | Stargazers: 1265 | Issues: 22 | Issues: 119

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language: Python | License: Apache-2.0 | Stargazers: 1227 | Issues: 40 | Issues: 11

pyreft

ReFT: Representation Finetuning for Language Models

Language: Python | License: Apache-2.0 | Stargazers: 1150 | Issues: 16 | Issues: 90

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language: Python | License: MIT | Stargazers: 1003 | Issues: 15 | Issues: 38

JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Language: Python | License: Apache-2.0 | Stargazers: 960 | Issues: 8 | Issues: 9

optimum-quanto

A pytorch quantization backend for optimum

Language: Python | License: Apache-2.0 | Stargazers: 821 | Issues: 8 | Issues: 129

hqq

Official implementation of Half-Quadratic Quantization (HQQ)

Language: Python | License: Apache-2.0 | Stargazers: 696 | Issues: 16 | Issues: 102

Language: Python | License: Apache-2.0 | Stargazers: 517 | Issues: 6 | Issues: 12

reward-bench

RewardBench: the first evaluation tool for reward models.

Language: Python | License: Apache-2.0 | Stargazers: 428 | Issues: 5 | Issues: 69

mergoo

A library for easily merging multiple LLM experts and efficiently training the merged LLM.

Language: Python | License: LGPL-3.0 | Stargazers: 404 | Issues: 4 | Issues: 14

HPT

HPT - Open Multimodal LLMs from HyperGAI

Language: Python | License: Apache-2.0 | Stargazers: 313 | Issues: 7 | Issues: 11

stripedhyena

Repository for StripedHyena, a state-of-the-art beyond Transformer architecture

Language: Python | License: Apache-2.0 | Stargazers: 269 | Issues: 7 | Issues: 10

LLM-Benchmark-Logs

Just a bunch of benchmark logs for different LLMs

License: MIT | Stargazers: 114 | Issues: 7 | Issues: 0