Chien Nguyen (chiennv2000)

chiennv2000

Geek Repo

Company:University of Oregon

Location:Oregon, USA

Home Page:https://chiennv2000.github.io/

Twitter:@chiennv2000

Github PK Tool:Github PK Tool

Chien Nguyen's starred repositories

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language:PythonLicense:Apache-2.0Stargazers:16840Issues:160Issues:1013

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:10750Issues:104Issues:779

readme-md-generator

📄 CLI that generates beautiful README.md files

Language:JavaScriptLicense:MITStargazers:10738Issues:74Issues:101

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10040Issues:183Issues:2049

shell_gpt

A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

Language:PythonLicense:MITStargazers:8261Issues:78Issues:274

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:6694Issues:54Issues:264

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:6510Issues:81Issues:1212

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:3947Issues:40Issues:138

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:3754Issues:32Issues:398

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:3327Issues:40Issues:199

llmware

Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.

Language:PythonLicense:Apache-2.0Stargazers:3107Issues:38Issues:103

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:2253Issues:22Issues:724

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonLicense:Apache-2.0Stargazers:1238Issues:39Issues:48

Olive

Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.

Language:PythonLicense:MITStargazers:1211Issues:26Issues:129

fsdp_qlora

Training LLMs with QLoRA + FSDP

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1093Issues:17Issues:25

fastRAG

Efficient Retrieval Augmentation and Generation Framework

Language:PythonLicense:Apache-2.0Stargazers:895Issues:10Issues:22

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:778Issues:43Issues:45

mamba.py

A simple and efficient Mamba implementation in PyTorch and MLX.

Language:PythonLicense:MITStargazers:544Issues:5Issues:9

llm-autoeval

Automatically evaluate your LLMs in Google Colab

Language:PythonLicense:MITStargazers:377Issues:7Issues:14

gritlm

Generative Representational Instruction Tuning

Language:Jupyter NotebookLicense:MITStargazers:370Issues:8Issues:22

veScale

A PyTorch Native LLM Training Framework

Language:PythonLicense:Apache-2.0Stargazers:340Issues:29Issues:4

open_lm

A repository for research on medium sized language models.

Language:PythonLicense:MITStargazers:293Issues:22Issues:52

TransformerCompression

For releasing code related to compression methods for transformers, accompanying our publications

Language:PythonLicense:MITStargazers:285Issues:8Issues:30

st-moe-pytorch

Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch

Language:PythonLicense:MITStargazers:218Issues:5Issues:9

mend

MEND: Fast Model Editing at Scale

Language:PythonLicense:MITStargazers:213Issues:7Issues:12

KnowledgeEditor

Code for Editing Factual Knowledge in Language Models

Language:PythonLicense:MITStargazers:128Issues:5Issues:9

Everything-of-Thoughts-XoT

An implemtation of Everyting of Thoughts (XoT).

Language:PythonLicense:NOASSERTIONStargazers:83Issues:9Issues:3

DiffusionNER

Code for the paper "DiffusionNER: Boundary Diffusion for Named Entity Recognition", accepted at ACL 2023.

ST-LLM

Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

Language:PythonLicense:Apache-2.0Stargazers:27Issues:4Issues:0

Rainbow-Table

Group Project for UO CS631 (Advanced Parallel Computing)

Language:CStargazers:1Issues:0Issues:0