tsavage68
/

Summary_L3_50steps_1e6rate_05beta_CSFTDPO

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Summary_L3_50steps_1e6rate_05beta_CSFTDPO

1 contributor

History: 2 commits

tsavage68's picture

End of training

89a682d verified 15 days ago

final_checkpoint
End of training 15 days ago
.gitattributes

1.52 kB

initial commit 15 days ago
README.md

2.15 kB

End of training 15 days ago
config.json

736 Bytes

End of training 15 days ago
generation_config.json

194 Bytes

End of training 15 days ago
model-00001-of-00004.safetensors
4.98 GB
LFS

End of training 15 days ago
model-00002-of-00004.safetensors
5 GB
LFS

End of training 15 days ago
model-00003-of-00004.safetensors
4.92 GB
LFS

End of training 15 days ago
model-00004-of-00004.safetensors

1.17 GB
LFS

End of training 15 days ago
model.safetensors.index.json

24 kB

End of training 15 days ago
special_tokens_map.json

325 Bytes

End of training 15 days ago
tokenizer.json

9.09 MB

End of training 15 days ago
tokenizer_config.json

51.1 kB

End of training 15 days ago
training_args.bin
Detected Pickle imports (9)
- "transformers.training_args.TrainingArguments",
- "accelerate.state.PartialState",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.training_args.OptimizerNames",
- "torch.device",
- "transformers.trainer_utils.HubStrategy",
- "transformers.trainer_utils.SchedulerType"
How to fix it?
4.67 kB
LFS

End of training 15 days ago