CriticBench: Benchmarking LLMs for Critique-Correct Reasoning Paper • 2402.14809 • Published Feb 22 • 2
Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs Paper • 2312.17080 • Published Dec 28, 2023 • 1
TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools Paper • 2406.03618 • Published Jun 5 • 2