Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models Paper • 2411.14257 • Published 3 days ago • 8
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models Paper • 2411.12580 • Published 5 days ago • 2
Controllable Context Sensitivity and the Knob Behind It Paper • 2411.07404 • Published 13 days ago • 1
Features that Make a Difference: Leveraging Gradients for Improved Dictionary Learning Paper • 2411.10397 • Published 9 days ago • 1
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated 3 days ago • 177
The Geometry of Concepts: Sparse Autoencoder Feature Structure Paper • 2410.19750 • Published Oct 10 • 1
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders Paper • 2410.20526 • Published 28 days ago • 1
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics Paper • 2410.21272 • Published 27 days ago • 1
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering Paper • 2410.15999 • Published Oct 21 • 19
Automatically Interpreting Millions of Features in Large Language Models Paper • 2410.13928 • Published Oct 17 • 1
How Do Multilingual Models Remember? Investigating Multilingual Factual Recall Mechanisms Paper • 2410.14387 • Published Oct 18 • 1
Towards Interpreting Visual Information Processing in Vision-Language Models Paper • 2410.07149 • Published Oct 9 • 1
Geometric Signatures of Compositionality Across a Language Model's Lifetime Paper • 2410.01444 • Published Oct 2 • 1
ITA-Bench: Italian Benchmarks for LLMs Collection A collection of Italian benchmarks for Large Language Models. See also our Github repo: https://github.com/SapienzaNLP/ita-bench • 19 items • Updated Sep 23 • 6
A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders Paper • 2409.14507 • Published Sep 22 • 1