zchuz / TimeBench

The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"

TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models

This repository contains the evaluation data, instructions, and demonstrations for the paper TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models (Chu et al., 2023).

Datasets

Symbolic Temporal Reasoning

Commonsense Temporal Reasoning

Event Temporal Reasoning

Models

Performance

[Figure: model performance on TimeBench]

Citation

If you find our work helpful, you can cite this paper as:

@misc{chu2023timebench,
      title={TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models}, 
      author={Zheng Chu and Jingchang Chen and Qianglong Chen and Weijiang Yu and Haotian Wang and Ming Liu and Bing Qin},
      year={2023},
      eprint={2311.17667},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2311.17667}
}

About

License: MIT License

