Chien Nguyen (chiennv2000)

chiennv2000

Geek Repo

Company:University of Oregon

Location:Oregon, USA

Home Page:https://chiennv2000.github.io/

Twitter:@chiennv2000

Github PK Tool:Github PK Tool

Chien Nguyen's starred repositories

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

Stargazers:269898Issues:0Issues:0

representation-engineering

Representation Engineering: A Top-Down Approach to AI Transparency

Language:Jupyter NotebookLicense:MITStargazers:597Issues:0Issues:0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:23644Issues:0Issues:0

streaming

A Data Streaming Library for Efficient Neural Network Training

Language:PythonLicense:Apache-2.0Stargazers:978Issues:0Issues:0

composer

Supercharge Your Model Training

Language:PythonLicense:Apache-2.0Stargazers:5044Issues:0Issues:0

hallucination-leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

License:Apache-2.0Stargazers:1080Issues:0Issues:0

LLM-Workshop

LLM Workshop by Sourab Mangrulkar

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:284Issues:0Issues:0

bigscience

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Language:ShellLicense:NOASSERTIONStargazers:952Issues:0Issues:0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonLicense:NOASSERTIONStargazers:1259Issues:0Issues:0

PASTA

PASTA: Post-hoc Attention Steering for LLMs

Language:PythonLicense:MITStargazers:85Issues:0Issues:0

consistencydecoder

Consistency Distilled Diff VAE

Language:PythonLicense:MITStargazers:2082Issues:0Issues:0

CTranslate2

Fast inference engine for Transformer models

Language:C++License:MITStargazers:2927Issues:0Issues:0

OpenELM

Evolution Through Large Models

Language:PythonLicense:MITStargazers:655Issues:0Issues:0

mmengine

OpenMMLab Foundational Library for Training Deep Learning Models

Language:PythonLicense:Apache-2.0Stargazers:1068Issues:0Issues:0

determined

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.

Language:GoLicense:Apache-2.0Stargazers:2891Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:33237Issues:0Issues:0

natural-instructions

Expanding natural instructions

Language:PythonLicense:Apache-2.0Stargazers:912Issues:0Issues:0

DecT

Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding

Language:PythonLicense:MITStargazers:45Issues:0Issues:0

ModelCenter

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain

Language:PythonLicense:Apache-2.0Stargazers:218Issues:0Issues:0

BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models

Language:PythonLicense:Apache-2.0Stargazers:521Issues:0Issues:0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonLicense:Apache-2.0Stargazers:4046Issues:0Issues:0

intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Language:PythonLicense:Apache-2.0Stargazers:1999Issues:0Issues:0

FewDocAE

Few-Shot Document-Level Event Argument Extraction: https://arxiv.org/abs/2209.02203

Language:PythonLicense:GPL-3.0Stargazers:11Issues:0Issues:0

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1530Issues:0Issues:0

chat-ui

Open source codebase powering the HuggingChat app

Language:TypeScriptLicense:Apache-2.0Stargazers:6533Issues:0Issues:0

M3rlin-fmengine

M3 Training Using FMengine

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

DoLa

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Language:PythonStargazers:354Issues:0Issues:0

HojiChar

The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.

Language:PythonLicense:Apache-2.0Stargazers:110Issues:0Issues:0

freshqa

Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:288Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38114Issues:0Issues:0