Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
distily
/
distily_validate_extra_grad_stats2
like
0
TensorBoard
Safetensors
wikimedia/wikipedia
Distily
gpt2
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
ce35aa4
distily_validate_extra_grad_stats2
1 contributor
History:
4 commits
lapp0
Training in progress, step 2475
ce35aa4
verified
26 days ago
logs
Training in progress, step 2475
26 days ago
.gitattributes
1.52 kB
initial commit
26 days ago
README.md
3.58 kB
End of training
26 days ago
benchmarks.shelve.bak
pickle
0 Bytes
End of training
26 days ago
benchmarks.shelve.dat
pickle
0 Bytes
End of training
26 days ago
benchmarks.shelve.dir
pickle
0 Bytes
End of training
26 days ago
config.json
1.02 kB
Training in progress, step 2475
26 days ago
generation_config.json
119 Bytes
End of training
26 days ago
merges.txt
456 kB
Training in progress, step 2475
26 days ago
model.safetensors
164 MB
LFS
Training in progress, step 2475
26 days ago
special_tokens_map.json
131 Bytes
Training in progress, step 2475
26 days ago
tokenizer.json
2.11 MB
Training in progress, step 2475
26 days ago
tokenizer_config.json
476 Bytes
Training in progress, step 2475
26 days ago
training_args.bin
pickle
Detected Pickle imports (9)
"torch.device"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.trainer_utils.SchedulerType"
,
"distily.args.DistillationTrainingArguments"
,
"transformers.trainer_utils.IntervalStrategy"
,
"accelerate.state.PartialState"
,
"accelerate.utils.dataclasses.DistributedType"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"transformers.training_args.OptimizerNames"
How to fix it?
5.56 kB
LFS
Training in progress, step 2475
26 days ago
vocab.json
798 kB
Training in progress, step 2475
26 days ago