-
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 53 -
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models
Paper • 2411.14257 • Published • 9 -
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Paper • 2411.12580 • Published • 2 -
Features that Make a Difference: Leveraging Gradients for Improved Dictionary Learning
Paper • 2411.10397 • Published • 1
Collections
Discover the best community collections!
Collections including paper arxiv:2412.06769