Zichun Yu (yuzc19)


User data from GitHub: https://github.com/yuzc19

Company: Carnegie Mellon University

GitHub: @yuzc19

Zichun Yu's repositories

yuzc19.github.io

Personal homepage for Zichun Yu

Language: HTML, License: MIT, Stargazers: 2, Issues: 0

BioLAMA-1

EMNLP'2021: Can Language Models be Biomedical Knowledge Bases?

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 0
Language: Python, Stargazers: 1, Issues: 0

FiD

Fusion-in-Decoder

Language: Python, License: NOASSERTION, Stargazers: 1, Issues: 0

LM-BFF

ACL'2021: LM-BFF: Better Few-shot Fine-tuning of Language Models

Language: Python, License: MIT, Stargazers: 1, Issues: 0

OOP_QA

OOP QAList

Language: C++, Stargazers: 1, Issues: 0

pet

This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 0

SimCSE

EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language: Python, License: MIT, Stargazers: 1, Issues: 0

unifiedqa

UnifiedQA: Crossing Format Boundaries With a Single QA System

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 0

WA-AC

This repository is used to save algorithm learning materials.

Language: C++, Stargazers: 1, Issues: 0

zcore-tests

Test scripts for zCore OS

Language: Python, Stargazers: 1, Issues: 0

dclm

DataComp for Language Models

Language: HTML, License: MIT, Stargazers: 0, Issues: 0

doremi

PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Language: Python, License: MIT, Stargazers: 0, Issues: 0

galactic

Data cleaning and curation for unstructured text

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

lit-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

License: MIT, Stargazers: 0, Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

License: NOASSERTION, Stargazers: 0, Issues: 0

NeMo-Curator

Scalable data preprocessing and curation toolkit for LLMs

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 0, Issues: 0

Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for large-scale LLM & VLM training, developed by Alibaba Cloud.

License: Apache-2.0, Stargazers: 0, Issues: 0

SemDeDup

Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0