add 2b token runs
c833498
-
concat-z-gelu-21-l1-lr-sweep-3
add 2b token runs
-
dec-9-reinit-runs
add reinit runs
-
1.52 kB
initial commit
0.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
68 kB
First model version
-
152 Bytes
First model version
1.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
68 kB
First model version
10.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
8.46 MB
First model version
-
487 Bytes
First model version
-
152 Bytes
First model version
2.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
68 kB
First model version
-
481 Bytes
First model version
3.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
134 kB
First model version
-
481 Bytes
First model version
4.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
134 kB
First model version
-
481 Bytes
First model version
5.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
530 kB
First model version
-
484 Bytes
First model version
6.pt
Detected Pickle imports (3)
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict"
530 kB
First model version
-
484 Bytes
First model version
7.pt
Detected Pickle imports (3)
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict"
2.12 MB
First model version
-
484 Bytes
First model version
8.pt
Detected Pickle imports (3)
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict"
2.12 MB
First model version
-
484 Bytes
First model version
9.pt
Detected Pickle imports (3)
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict"
8.46 MB
First model version
-
487 Bytes
First model version
-
85 Bytes
Update README.md
-
2.12 MB
add 1B token run
-
485 Bytes
add 1B token run
-
2.12 MB
no resample run
-
497 Bytes
no resample run
-
2.12 MB
no resample run
-
498 Bytes
no resample run
-
2.12 MB
no resample run
-
482 Bytes
no resample run
-
2.12 MB
add in 500M training run
-
484 Bytes
add in 500M training run
-
67.2 MB
add hook_z concat
-
475 Bytes
add hook_z concat
-
67.2 MB
add hook_z concat
-
67.2 MB
add hook_z concat
-
474 Bytes
add hook_z concat
-
475 Bytes
add hook_z concat
-
67.2 MB
add hook_z concat
-
475 Bytes
add hook_z concat
-
67.2 MB
add hook_z concat
-
473 Bytes
add hook_z concat
-
67.2 MB
add hook_z concat
-
474 Bytes
add hook_z concat
-
67.2 MB
add hook_z concat
-
475 Bytes
add hook_z concat
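
The "Detected Pickle imports" entries listed for each `.pt` file name the globals that the file's pickle stream asks the loader to import. As a rough illustration of how such a list can be derived, here is a minimal sketch using only the Python standard library. It is not the scanner the hosting site uses; the function name `pickle_imports` is hypothetical, and the sketch assumes you already have the raw bytes of a single pickle stream (extracting that stream from a real checkpoint file is not shown).

```python
import pickle
import pickletools
from collections import OrderedDict


def pickle_imports(data):
    """Return the 'module.name' globals referenced by one pickle stream.

    Hypothetical helper for illustration. It walks the opcodes without
    executing them, so nothing is imported or instantiated.
    """
    found = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Older protocols: argument is "module name" in one string.
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name were pushed as the two preceding
            # strings. Simplification: strings fetched from the memo are
            # not tracked, so some references may be missed.
            if len(strings) >= 2:
                found.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return found


# Example: an OrderedDict, one of the three imports listed for each file.
print(pickle_imports(pickle.dumps(OrderedDict(a=1), protocol=2)))
```

A short list limited to tensor-rebuilding helpers and `collections.OrderedDict`, as reported for the files here, is what one would typically expect from a saved PyTorch state dict; any additional entry would be worth inspecting before loading the file.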