Revisiting Hierarchical Text Classification: Inference and Metrics Paper • 2410.01305 • Published Oct 2, 2024
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning Paper • 2502.06533 • Published Feb 10 • 18