Yang Liu's starred repositories

CodeGen

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Language:PythonLicense:Apache-2.0Stargazers:4796Issues:79Issues:74

TextAttack

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

Language:PythonLicense:MITStargazers:2781Issues:38Issues:269

fast-transformers

Pytorch library for fast transformer implementations

Language:PythonLicense:Apache-2.0Stargazers:1410Issues:31Issues:75

state-spaces

Sequence Modeling with Structured State Spaces

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1389Issues:45Issues:111

Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Language:PythonLicense:Apache-2.0Stargazers:726Issues:7Issues:35

prize

A prize for finding tasks that cause large language models to show inverse scaling

unify-parameter-efficient-tuning

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)

Language:PythonLicense:Apache-2.0Stargazers:494Issues:7Issues:16

TransformerSum

Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.

Language:PythonLicense:GPL-3.0Stargazers:425Issues:6Issues:52

Pytorch-PCGrad

Pytorch reimplementation for "Gradient Surgery for Multi-Task Learning"

Language:PythonLicense:BSD-3-ClauseStargazers:276Issues:5Issues:17
Language:PythonLicense:BSD-3-ClauseStargazers:174Issues:9Issues:32

UniEval

Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation

Language:PythonLicense:MITStargazers:165Issues:4Issues:7

dialogsum

DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021

Language:PythonLicense:MITStargazers:164Issues:5Issues:12

DialogLM

Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."

Language:PythonLicense:MITStargazers:134Issues:13Issues:9
Language:PythonLicense:MITStargazers:116Issues:5Issues:4

Module-0

Module 0 - Fundamentals

LongDocSum

Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"

UniSumm

UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning

Language:PythonLicense:MITStargazers:60Issues:10Issues:2

MVLPT

code for "Multitask Vision-Language Prompt Tuning" https://arxiv.org/abs/2211.11720

Language:PythonLicense:MITStargazers:49Issues:2Issues:6
Language:Jupyter NotebookStargazers:48Issues:2Issues:9

summary-explorer

Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.

Language:CSSLicense:MITStargazers:43Issues:16Issues:1

marge

Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources

light-transformer-emnlp2021

EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling

Language:PythonLicense:MITStargazers:31Issues:3Issues:3

text-sum-uncertainty

Code for "Understanding Neural Abstractive Summarization Models via Uncertainty" (EMNLP20)

Language:PythonLicense:MITStargazers:30Issues:2Issues:1

MACSum

Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.

Language:PythonLicense:NOASSERTIONStargazers:30Issues:7Issues:2

RE-T5

Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021).

Language:PythonLicense:Apache-2.0Stargazers:28Issues:1Issues:3

NoisySumm

Codes for NAACL 2021 paper 'Noisy Self-Knowledge Distillation for Text Summarization'

Language:PythonLicense:MITStargazers:23Issues:2Issues:3