Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published 2 days ago • 56
When an LLM is apprehensive about its answers -- and when its uncertainty is justified Paper • 2503.01688 • Published 2 days ago • 18
Predictive Data Selection: The Data That Predicts Is the Data That Teaches Paper • 2503.00808 • Published 4 days ago • 48
Multi-Turn Code Generation Through Single-Step Rewards Paper • 2502.20380 • Published 6 days ago • 28