voidism

Yung-Sung Chuang's starred repositories

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION2181900

LongChat

Official repository for LongChat and LongEval

Language:PythonApache-2.049500

curiosity_redteam

Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizXgXU)

Language:Jupyter NotebookMIT4300

bocoel

Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few lines of modular code.

Language:PythonApache-2.026400

dinosr

DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning

Language:Python4200

hyde

HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels

Language:Jupyter Notebook38300

cfrs

An extremely minimal drawing language consisting of only 6 simple commands: C, F, R, S, [, and ].

Language:HTMLMIT23800

bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.

Language:PythonApache-2.067600

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonMIT48100

tuned-lens

Tools for understanding how transformer predictions are built layer-by-layer

Language:PythonMIT37200

litellm

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

Language:PythonNOASSERTION958500

LangCode

LangCode - Improving alignment and reasoning of large language models (LLMs) with natural language embedded program (NLEP).

Language:PythonMIT2400

traveling-words

Code repository for the paper "Traveling Words: A Geometric Interpretation of Transformers"

Language:Python700

Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning" paper.

Language:PythonCC-BY-SA-4.03400

EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Language:PythonApache-2.0227400

DoLa

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Language:Python35400

Taiwan-LLM

Traditional Mandarin LLMs for Taiwan

Language:PythonApache-2.093000

silo-lm

SILO Language Models code repository

Language:PythonMIT7900

speech-resynthesis

An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Language:PythonNOASSERTION36000