There are 0 repository under complex-reasoning topic.
Paper collection on building and evaluating language model agents via executable language grounding
RECKONING is a bi-level learning algorithm that improves language models' reasoning ability by folding contextual knowledge into parametric knowledge through back-propagation.
RUPBench: Benchmarking Reasoning Under Perturbations for Robustness Evaluation in Large Language Models