Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published Nov 12 • 62
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering Paper • 2411.11504 • Published 27 days ago • 19
Auto-Evolve: Enhancing Large Language Model's Performance via Self-Reasoning Framework Paper • 2410.06328 • Published Oct 8 • 1
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability Paper • 2411.19943 • Published 15 days ago • 49