Multi-property Steering of Large Language Models with Dynamic Activation Composition Paper โข 2406.17563 โข Published 6 days ago โข 4
๐ Daily Picks in Interpretability & Analysis of LMs Collection Outstanding research in interpretability and evaluation of language models, summarized โข 57 items โข Updated 5 days ago โข 61