Tom Young's starred repositories
ColossalAI
Making large AI models cheaper, faster and more accessible
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
lm-evaluation-harness
A framework for few-shot evaluation of language models.
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
Dataset_Quantization
[ICCV2023] Dataset Quantization
thesis_template_ntu
Thesis Latex Template for Nanyang Technological University (NTU)
MultiWOZ_Evaluation
Unified MultiWOZ evaluation scripts for the context-to-response task.
experiments
My exploration on new technologies.
MLM_inconsistencies
Inconsistencies in Masked Language Models