Yung-Sung Chuang's starred repositories
curiosity_redteam
Official implementation of the ICLR'24 paper "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizXgXU)
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
traveling-words
Code repository for the paper "Traveling Words: A Geometric Interpretation of Transformers"
DUAL-textless-SQA
Textless (ASR-transcript-free) Spoken Question Answering. The official release of the NMSQA dataset and the implementation of the paper "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning".
Taiwan-LLM
Traditional Mandarin LLMs for Taiwan
speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper "Speech Resynthesis from Discrete Disentangled Self-Supervised Representations".
lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"