quintic commited on
Commit
b3451cf
1 Parent(s): d7a2f5d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -3
README.md CHANGED
@@ -1,3 +1,9 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+
5
+ Data: c4 and codeparrot, about 1:1 sample-wise but 1:4 token-wise mix. Significantly biased for codes (python, go, java, javascript, c, c++).
6
+ Params:
7
+ - batch size 64 * 2048 = 131072 tokens
8
+ - lr automatically according to EAI sae codebase
9
+ - auxk_alpha 0.03