Amy Hyunji Lee's starred repositories

llama

Inference code for LLaMA models

Language: Python | License: NOASSERTION | Stargazers: 50895 | Issues: 499 | Issues: 872

text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Language: Python | License: Apache-2.0 | Stargazers: 6079 | Issues: 104 | Issues: 405

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language: Python | License: MIT | Stargazers: 3349 | Issues: 27 | Issues: 267

galai

Model API for GALACTICA

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2673 | Issues: 43 | Issues: 71

awesome-chatgpt

⚡ Everything about ChatGPT

AutoChain

AutoChain: Build lightweight, extensible, and testable LLM Agents

Language: Python | License: MIT | Stargazers: 1771 | Issues: 11 | Issues: 10

awesome-semi-supervised-learning

😎 An up-to-date & curated list of awesome semi-supervised learning papers, methods & resources.

DPR

Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.

Language: Python | License: NOASSERTION | Stargazers: 1685 | Issues: 24 | Issues: 210

AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated

Language: Python | License: Apache-2.0 | Stargazers: 1481 | Issues: 27 | Issues: 25

natural-instructions

Expanding natural instructions

Language: Python | License: Apache-2.0 | Stargazers: 934 | Issues: 21 | Issues: 161

acl2020-openqa-tutorial

ACL2020 Tutorial: Open-Domain Question Answering

Korpora

Korean corpus repository

Language: Python | License: CC-BY-4.0 | Stargazers: 684 | Issues: 26 | Issues: 101

FiD

Fusion-in-Decoder

Language: Python | License: NOASSERTION | Stargazers: 543 | Issues: 10 | Issues: 32

LMkor

Pretrained Language Models for Korean

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 389 | Issues: 15 | Issues: 7

FActScore

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Language: Python | License: MIT | Stargazers: 259 | Issues: 4 | Issues: 34

style-transfer-paraphrase

Official code and data repository for our EMNLP 2020 long paper "Reformulating Unsupervised Style Transfer as Paraphrase Generation" (https://arxiv.org/abs/2010.05700).

Language: HTML | License: MIT | Stargazers: 227 | Issues: 11 | Issues: 36

korean-sentence-splitter

Split Korean text into sentences using a heuristic algorithm.

Language: C++ | License: BSD-3-Clause | Stargazers: 205 | Issues: 5 | Issues: 8

KG-BART

KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning

NPM

The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper: https://arxiv.org/abs/2212.01349)

Language: Python | License: NOASSERTION | Stargazers: 155 | Issues: 8 | Issues: 4

bart-closed-book-qa

A BART version of an open-domain QA model in a closed-book setup

Flipped-Learning

[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners

wikidata-simplequestions

Mapping of the SimpleQuestions dataset to Wikidata

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 81 | Issues: 3 | Issues: 8

ICIL

Official implementation of "In-Context Instruction Learning"

Language: Python | License: MIT | Stargazers: 76 | Issues: 6 | Issues: 3

SLURM

SLURM Example Scripts

knowledge-unlearning

[ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models

EDMem

Code for EMNLP 2022 paper "A Unified Encoder-Decoder Framework with Entity Memory"

Language: Python | Stargazers: 15 | Issues: 10 | Issues: 0

unified-prompt-selection

[TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9 | Issues: 2 | Issues: 0