ReAct: Synergizing Reasoning and Acting in Language Models Paper • 2210.03629 • Published Oct 6, 2022 • 17
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 7 days ago • 48
🇮🇹 Italian NLP Resources Collection Collection of models, datasets and demos relevant to Italian NLP 🇮🇹 • 282 items • Updated 3 days ago • 24
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 101 items • Updated 5 days ago • 97
Building Bridges, Not Walls -- Advancing Interpretability by Unifying Feature, Data, and Model Component Attribution Paper • 2501.18887 • Published 10 days ago • 1
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 101 items • Updated 5 days ago • 97
Propositional Interpretability in Artificial Intelligence Paper • 2501.15740 • Published 14 days ago • 1
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 101 items • Updated 5 days ago • 97
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 101 items • Updated 5 days ago • 97
Sparse Autoencoders Trained on the Same Data Learn Different Features Paper • 2501.16615 • Published 13 days ago • 1
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 101 items • Updated 5 days ago • 97
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders Paper • 2501.17148 • Published 12 days ago • 1
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 101 items • Updated 5 days ago • 97