Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
distily
/
distily_attn_distilgpt2_sweep
like
0
TensorBoard
Safetensors
wikimedia/wikipedia
Distily
gpt2
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
3d68e2a
distily_attn_distilgpt2_sweep
1 contributor
History:
48 commits
lapp0
Training in progress, step 115000
3d68e2a
verified
29 days ago
logs
Training in progress, step 115000
29 days ago
.gitattributes
1.52 kB
initial commit
30 days ago
README.md
3.64 kB
Training in progress, step 115000
29 days ago
benchmarks.shelve.bak
pickle
0 Bytes
End of training
30 days ago
benchmarks.shelve.dat
pickle
0 Bytes
End of training
30 days ago
benchmarks.shelve.dir
pickle
0 Bytes
End of training
30 days ago
config.json
1.02 kB
Training in progress, step 115000
29 days ago
generation_config.json
119 Bytes
Training in progress, step 115000
29 days ago
merges.txt
456 kB
Training in progress, step 99000
30 days ago
model.safetensors
164 MB
LFS
Training in progress, step 115000
29 days ago
special_tokens_map.json
131 Bytes
Training in progress, step 99000
30 days ago
tokenizer.json
2.11 MB
Training in progress, step 99000
30 days ago
tokenizer_config.json
476 Bytes
Training in progress, step 99000
30 days ago
training_args.bin
pickle
Detected Pickle imports (9)
"torch.device"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.trainer_utils.SchedulerType"
,
"distily.args.DistillationTrainingArguments"
,
"transformers.trainer_utils.IntervalStrategy"
,
"accelerate.state.PartialState"
,
"accelerate.utils.dataclasses.DistributedType"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"transformers.training_args.OptimizerNames"
How to fix it?
5.56 kB
LFS
Training in progress, step 115000
29 days ago
vocab.json
798 kB
Training in progress, step 99000
30 days ago