HINT-lab/DeepSeek-R1-Distill-Qwen-1.5B-Self-Calibration • Text Generation • Updated about 8 hours ago • 243
On Grounded Planning for Embodied Tasks with Language Models • Paper • 2209.00465 • Published Aug 29, 2022
Optimizing Language Model's Reasoning Abilities with Weak Supervision • Paper • 2405.04086 • Published May 7, 2024 • 1
Divide, Reweight, and Conquer: A Logit Arithmetic Approach for In-Context Learning • Paper • 2410.10074 • Published Oct 14, 2024
Self-Calibration • Collection • Efficient Test-Time Scaling via Self-Calibration (https://arxiv.org/abs/2503.00031) • 6 items • Updated 1 day ago • 1