Low-Rank Adapters Meet Neural Architecture Search for LLM Compression • Paper • 2501.16372 • Published about 1 month ago • 9
Optimizing Large Language Model Training Using FP4 Quantization • Paper • 2501.17116 • Published 25 days ago • 35
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation • Paper • 2501.17433 • Published 25 days ago • 9
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate • Paper • 2501.17703 • Published 24 days ago • 55
GuardReasoner: Towards Reasoning-based LLM Safeguards • Paper • 2501.18492 • Published 23 days ago • 81
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs • Paper • 2501.18585 • Published 23 days ago • 55
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models • Paper • 2501.18119 • Published 24 days ago • 24
Reward-Guided Speculative Decoding for Efficient LLM Reasoning • Paper • 2501.19324 • Published 22 days ago • 37
Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 3 items • Updated 27 days ago • 359
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario • Paper • 2501.10132 • Published Jan 17 • 19
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos • Paper • 2501.12375 • Published Jan 21 • 22
UI-TARS: Pioneering Automated GUI Interaction with Native Agents • Paper • 2501.12326 • Published Jan 21 • 51