Collections

Collections including paper arxiv:2408.05147
Papers I want to read
Papers in my to-read list
mechanistic interpretability with sparse autoencoders
A collection of papers I found useful for learning how to use sparse autoencoders to find interpretable features in language models
🔍 Daily Picks in Interpretability & Analysis of LMs
Outstanding research in interpretability and evaluation of language models, summarized
LLM
Multimodal LLM