How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published 14 days ago • 83 • 8
ActionPiece: Contextually Tokenizing Action Sequences for Generative Recommendation Paper • 2502.13581 • Published 15 days ago • 5 • 3
Large Language Models and Mathematical Reasoning Failures Paper • 2502.11574 • Published 17 days ago • 3 • 3
We Can't Understand AI Using our Existing Vocabulary Paper • 2502.07586 • Published 23 days ago • 10 • 4
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper • 2502.06060 • Published 25 days ago • 34 • 3
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Paper • 2502.05003 • Published 27 days ago • 42 • 3
Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression Paper • 2502.04296 • Published 28 days ago • 6 • 3
Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach Paper • 2502.03639 • Published 29 days ago • 8 • 3
PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published about 1 month ago • 9 • 6
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published Jan 30 • 25 • 3
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Paper • 2501.18837 • Published Jan 31 • 10 • 5
Unraveling the Capabilities of Language Models in News Summarization Paper • 2501.18128 • Published Jan 30 • 4 • 3
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding Paper • 2501.16411 • Published Jan 27 • 18 • 3
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published Jan 30 • 19 • 4
Large Language Models Think Too Fast To Explore Effectively Paper • 2501.18009 • Published Jan 29 • 23 • 3
Aria: An Open Multimodal Native Mixture-of-Experts Model Paper • 2410.05993 • Published Oct 8, 2024 • 109 • 7