tsavage68
/

Summary4500_M2_750steps_1e5rate_SFT

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Summary4500_M2_750steps_1e5rate_SFT

1 contributor

History: 2 commits

tsavage68's picture

End of training

f24051f verified 20 days ago

final_checkpoint
End of training 20 days ago
.gitattributes

1.52 kB

initial commit 20 days ago
README.md

2.21 kB

End of training 20 days ago
config.json

653 Bytes

End of training 20 days ago
generation_config.json

111 Bytes

End of training 20 days ago
model-00001-of-00003.safetensors
4.94 GB
LFS

End of training 20 days ago
model-00002-of-00003.safetensors
5 GB
LFS

End of training 20 days ago
model-00003-of-00003.safetensors
4.54 GB
LFS

End of training 20 days ago
model.safetensors.index.json

24 kB

End of training 20 days ago
special_tokens_map.json

437 Bytes

End of training 20 days ago
tokenizer.json

1.8 MB

End of training 20 days ago
tokenizer_config.json

1.5 kB

End of training 20 days ago
training_args.bin
Detected Pickle imports (9)
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.trainer_utils.HubStrategy",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.training_args.OptimizerNames",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.IntervalStrategy",
- "accelerate.state.PartialState",
- "torch.device",
- "transformers.training_args.TrainingArguments"
How to fix it?
4.67 kB
LFS

End of training 20 days ago