quintic commited on
Commit
b567978
1 Parent(s): 1b7b199

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -3
README.md CHANGED
@@ -1,3 +1,11 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+
5
+ Data: c4 and codeparrot, about 1:1 sample-wise but 1:4 token-wise mix. Significantly biased for codes (python, go, java, javascript, c, c++).
6
+ 1 epoch but use 48x instead of 32x default sae.
7
+
8
+ Params:
9
+ - batch size 64 * 2048 * 8 = 1048576 tokens
10
+ - lr automatically according to EAI sae codebase
11
+ - auxk_alpha 0.03