Chien Nguyen (chiennv2000)

Company: University of Oregon

Location: Oregon, USA

Home Page: https://chiennv2000.github.io/

Twitter: @chiennv2000

Chien Nguyen's starred repositories

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language: Python | License: Apache-2.0 | Stargazers: 17186 | Issues: 165 | Issues: 1067

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | License: MIT | Stargazers: 11482 | Issues: 113 | Issues: 456

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 11095 | Issues: 104 | Issues: 806

readme-md-generator

📄 CLI that generates beautiful README.md files

Language: JavaScript | License: MIT | Stargazers: 10768 | Issues: 74 | Issues: 101

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python | License: Apache-2.0 | Stargazers: 10206 | Issues: 189 | Issues: 2078

unsloth

Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 9610 | Issues: 73 | Issues: 347

shell_gpt

A command-line productivity tool powered by AI large language models like GPT-4 that will help you accomplish your tasks faster and more efficiently.

Language: Python | License: MIT | Stargazers: 8401 | Issues: 80 | Issues: 289

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 6765 | Issues: 82 | Issues: 1292

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | Stargazers: 4054 | Issues: 41 | Issues: 147

AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Language: Python | License: MIT | Stargazers: 3859 | Issues: 33 | Issues: 415

llmware

Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.

Language: Python | License: Apache-2.0 | Stargazers: 3810 | Issues: 37 | Issues: 107

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 2527 | Issues: 24 | Issues: 793

Olive

Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.

Language: Python | License: MIT | Stargazers: 1239 | Issues: 25 | Issues: 137

fsdp_qlora

Training LLMs with QLoRA + FSDP

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1234 | Issues: 19 | Issues: 34

fastRAG

Efficient Retrieval Augmentation and Generation Framework

Language: Python | License: Apache-2.0 | Stargazers: 935 | Issues: 10 | Issues: 24

pyreft

ReFT: Representation Finetuning for Language Models

Language: Python | License: Apache-2.0 | Stargazers: 740 | Issues: 11 | Issues: 40

mamba.py

A simple and efficient Mamba implementation in PyTorch and MLX.

Language: Python | License: MIT | Stargazers: 626 | Issues: 4 | Issues: 21

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | Stargazers: 485 | Issues: 17 | Issues: 10

gritlm

Generative Representational Instruction Tuning

Language: Jupyter Notebook | License: MIT | Stargazers: 407 | Issues: 8 | Issues: 29

veScale

A PyTorch Native LLM Training Framework

Language: Python | License: Apache-2.0 | Stargazers: 366 | Issues: 33 | Issues: 4

open_lm

A repository for research on medium-sized language models.

Language: Python | License: MIT | Stargazers: 312 | Issues: 22 | Issues: 58

TransformerCompression

For releasing code related to compression methods for transformers, accompanying our publications

Language: Python | License: MIT | Stargazers: 310 | Issues: 8 | Issues: 35

Language: Python | License: Apache-2.0 | Stargazers: 241 | Issues: 9 | Issues: 4

st-moe-pytorch

Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in PyTorch

Language: Python | License: MIT | Stargazers: 228 | Issues: 5 | Issues: 10

mend

MEND: Fast Model Editing at Scale

Language: Python | License: MIT | Stargazers: 217 | Issues: 7 | Issues: 12

KnowledgeEditor

Code for Editing Factual Knowledge in Language Models

Language: Python | License: MIT | Stargazers: 128 | Issues: 5 | Issues: 9

Everything-of-Thoughts-XoT

An implementation of Everything of Thoughts (XoT).

Language: Python | License: NOASSERTION | Stargazers: 86 | Issues: 9 | Issues: 3

DiffusionNER

Code for the paper "DiffusionNER: Boundary Diffusion for Named Entity Recognition", accepted at ACL 2023.

ST-LLM

Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

Language: Python | License: Apache-2.0 | Stargazers: 51 | Issues: 7 | Issues: 12

Rainbow-Table

Group Project for UO CS631 (Advanced Parallel Computing)

Language: C | Stargazers: 1 | Issues: 0 | Issues: 0