arxiv:2411.14257
Neel Nanda
NeelNanda
AI & ML interests
Mechanistic Interpretability
Recent Activity
authored
a paper
29 days ago
Do I Know This Entity? Knowledge Awareness and Hallucinations in
Language Models
updated
a model
about 2 months ago
NeelNanda/crosscoders-gpt2-small
Organizations
None yet
Papers
10
models
65
NeelNanda/crosscoders-gpt2-small
Updated
•
5
NeelNanda/GELU_1L512W_C4_Code
Updated
•
3.1k
•
2
NeelNanda/gpt-neox-tokenizer-digits
Updated
•
2
NeelNanda/sparse_autoencoder
Updated
•
3
NeelNanda/redwood-attn-only-2l
Updated
•
11
NeelNanda/Othello-GPT-Transformer-Lens
Updated
NeelNanda/full_pred_log_probs
Updated
NeelNanda/SoLU_1L256W_C4_Width_Scan
Updated
•
8
NeelNanda/SoLU_1L128W_C4_Width_Scan
Updated
•
7
NeelNanda/SoLU_1L64W_C4_Width_Scan
Updated
•
7
datasets
15
NeelNanda/pile-small-tokenized-2b
Viewer
•
Updated
•
10.8M
•
3.26k
NeelNanda/pile-tokenized-10b
Viewer
•
Updated
•
10.8M
•
1.03k
NeelNanda/openwebtext-tokenized-9b
Viewer
•
Updated
•
8.83M
•
357
NeelNanda/code-10k
Viewer
•
Updated
•
10k
•
77
•
1
NeelNanda/wiki-10k
Viewer
•
Updated
•
10k
•
53
NeelNanda/c4-code-20k
Viewer
•
Updated
•
20k
•
140
•
4
NeelNanda/c4-10k
Viewer
•
Updated
•
10k
•
122
NeelNanda/c4-tokenized-2b
Viewer
•
Updated
•
1.36M
•
301
NeelNanda/code-tokenized
Viewer
•
Updated
•
297k
•
68
NeelNanda/c4-code-tokenized-2b
Viewer
•
Updated
•
1.66M
•
95
•
1