empty-michael
/

tinystories_1layer_attn_mlp_C25k_k16_mse_weighted

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

tinystories_1layer_attn_mlp_C25k_k16_mse_weighted

1 contributor

History: 2 commits

empty-michael's picture

Training in progress, step 500

ce61d09 verified 10 months ago

.gitattributes

1.52 kB

initial commit 10 months ago
config.json

711 Bytes

Training in progress, step 500 10 months ago
merges.txt

456 kB

Training in progress, step 500 10 months ago
model.safetensors

347 MB
LFS

Training in progress, step 500 10 months ago
special_tokens_map.json

438 Bytes

Training in progress, step 500 10 months ago
tokenizer.json

2.11 MB

Training in progress, step 500 10 months ago
tokenizer_config.json

514 Bytes

Training in progress, step 500 10 months ago
training_args.bin
Detected Pickle imports (8)
- "transformers.trainer_utils.IntervalStrategy",
- "accelerate.utils.dataclasses.DistributedType",
- "codebook_features.run_clm.TrainingArguments",
- "transformers.trainer_utils.HubStrategy",
- "accelerate.state.PartialState",
- "torch.device",
- "transformers.trainer_utils.SchedulerType",
- "transformers.training_args.OptimizerNames"
How to fix it?
4.86 kB
LFS

Training in progress, step 500 10 months ago
vocab.json

798 kB

Training in progress, step 500 10 months ago