chiennv2000

Chien Nguyen's starred repositories

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language:PythonApache-2.017186 165 1067

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonMIT11482 113 456

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause11095 104 806

readme-md-generator

📄 CLI that generates beautiful README.md files

Language:JavaScriptMIT10768 74 101

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.010206 189 2078

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.09610 73 347

shell_gpt

A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

Language:PythonMIT8401 80 289

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.06765 82 1292

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonApache-2.04054 41 147

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonMIT3859 33 415

llmware

Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.

Language:PythonApache-2.03810 37 107

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonApache-2.02527 24 793

Olive

Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.

Language:PythonMIT1239 25 137