TAUR-Lab/Taur_CoT_Analysis_Project___internlm__internlm2_5-7b-chat Viewer • Updated 2 days ago • 63.7k • 39
TAUR-Lab/Taur_CoT_Analysis_Project___OpenGVLab__InternVL2_5-8B Viewer • Updated 2 days ago • 63.7k • 45
TAUR-Lab/Taur_CoT_Analysis_Project___Qwen__Qwen2-VL-7B-Instruct Viewer • Updated 2 days ago • 63.7k • 34
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18 • 36
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics Paper • 2102.01672 • Published Feb 2, 2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation Paper • 2112.02721 • Published Dec 6, 2021
X-PARADE: Cross-Lingual Textual Entailment and Information Divergence across Paragraphs Paper • 2309.08873 • Published Sep 16, 2023
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18 • 36
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18 • 36
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18 • 36
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18 • 36
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18 • 36
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents Paper • 2404.10774 • Published Apr 16 • 3
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization Paper • 2402.13249 • Published Feb 20 • 11
Using Natural Language Explanations to Rescale Human Judgments Paper • 2305.14770 • Published May 24, 2023
A Long Way to Go: Investigating Length Correlations in RLHF Paper • 2310.03716 • Published Oct 5, 2023 • 9