ZhangShiyue

followers

following

stars

UNC-CH

Chapel Hill, NC, US

https://www.cs.unc.edu/~shiyue/

ShiyueZhang's starred repositories

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonApache-2.028996 341 266

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonNOASSERTION14174 263 201

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Language:PythonMIT10435 284 1544

gpt-2-output-dataset

Dataset of GPT-2 outputs for research in detection, biases, and more

Language:PythonMIT1906 76 47

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language:PythonApache-2.01574 42 20

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookNOASSERTION1164 12 25

pytorch-struct

Fast, general, and tested differentiable structured prediction in PyTorch

Language:Jupyter NotebookMIT1097 34 55

test

Measuring Massive Multitask Language Understanding | ICLR 2021

Language:PythonMIT1004 20 19

berkeley-doc-summarizer

The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploits syntactic information to compress it, and uses coreference constraints to ensure clarity.

Language:ScalaGPL-3.0742 26 6

chatgpt-failures

Failure archive for ChatGPT and similar models

Language:Python584 24 9

SummEval

Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper

Language:PythonMIT349 9 41

factCC

Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper

Language:PythonBSD-3-Clause266 10 14

Multi-News

Large-scale multi-document summarization dataset and code

Language:PythonNOASSERTION263 3 36

mauve

Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.

Language:PythonNOASSERTION263 4 13

simple-web-audio-recorder-demo

A simple HTML/JS demo that uses WebAudioRecorder.js to record audio on a web page

Language:JavaScript181 11 12

gold-off-policy-text-gen-iclr21

Language:PythonMIT49 30

qamr

Question-Answer Meaning Representation

Language:ScalaMIT48 8 4

repro

Repro is a library for easily running code from published papers via Docker.

Language:PythonApache-2.040 1 10

longeval-summarization

Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https://arxiv.org/abs/2301.13298).

Language:PythonApache-2.039 10

MTAdam

MTAdam: Automatic Balancing of Multiple Training Loss Terms

Language:Python34 1 2

ROSE

Language:PythonBSD-3-Clause31 13 1

SummarizationPrograms

[ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees

Language:PythonMIT23 4 2

QANom

Language:PythonMIT20 50

MixCE-acl2023

Implementation of MixCE method described in ACL 2023 paper by Zhang et al.

Language:PythonApache-2.017 80

qasrl

Tools for working with QA-SRL data and annotating it with crowdsourcing.

Language:ScalaMIT1200

LitePyramids

Method for evaluating system summaries manually, via crowdsourcing, using a summarization dataset that includes reference summaries.

Language:Python11 10

truncation-sampling

Codebase describing experiments in Truncation Sampling as Language Model Desmoothing

Language:Jupyter Notebook9 10

QA-ALIGN

QA-ALIGN: Representing Cross-Text Content Overlap by Aligning Question-Answer Propositions

Language:Jupyter Notebook800

ginn

A minimalistic, header only neural net library

Language:C++Apache-2.04 1 32

qasrl-crowdsourcing

Language:ScalaMIT100