smj0's starred repositories
TRAM-Benchmark
TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of ACL 2024)
FollowBench
Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)"
awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
SFT_function_learning
Reference implementation for DPO (Direct Preference Optimization)
AIMasterDevelopers
Find the masters: get to know the people behind major projects
arxiv-ai-analysis
A visualization experience of AI/ML academic papers hosted on arXiv - for project work at the University of California, Berkeley MIDS program (W209, Data Visualization).
arxiv-public-datasets
A set of scripts to grab public datasets from resources related to arXiv
arxiv-tools
Tools to bulk download arxiv data
SuperCLUE-Math6
SuperCLUE-Math6: Exploring a new generation of native Chinese multi-turn, multi-step mathematical reasoning datasets
Math_Word_Problem_Collection
A collection of math word problem (MWP) work, including datasets and algorithms.
temporal-llms
Materials for paper "Are Large Language Models Temporally Grounded?"
flash-attention
Fast and memory-efficient exact attention
UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
alignment-handbook
Robust recipes to align language models with human and AI preferences
protoqa-data
ProtoQA ("Family Feud") dataset
Awesome-LLMs-Evaluation-Papers
The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.
data_tooling
Tools for managing datasets for governance and training.