A Primer on the Inner Workings of Transformer-based Language Models • Paper 2405.00208 • Published Apr 30
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models • Paper 2404.07004 • Published Apr 10
Calibrating Reasoning in Language Models with Internal Consistency • Paper 2405.18711 • Published May 29
🔍 Daily Picks in Interpretability & Analysis of LMs • Collection • Outstanding research in interpretability and evaluation of language models, summarized • 82 items