yuzc19

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language:PythonApache-2.0000

lm-evaluation-harness

A framework for few-shot evaluation of language models.

MIT000

Megatron-LM

Ongoing research training transformer models at scale

NOASSERTION000

NeMo-Curator

Scalable data pre processing and curation toolkit for LLMs

Language:Jupyter NotebookApache-2.0000

Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Apache-2.0000

SemDeDup

Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).

Language:PythonNOASSERTION000

yuzc19

Zichun Yu's repositories

yuzc19.github.io

BioLAMA-1

dataset-recommendation-pub

FiD

LM-BFF

OOP_QA

pet

Ray-tracing-engine

SimCSE

unifiedqa

WA-AC

zcore-tests

dclm

doremi

galactic

lit-gpt

lm-evaluation-harness

Megatron-LM

NeMo-Curator

Pai-Megatron-Patch

SemDeDup