jacobthebanana/deepseek_lora_128

Generated from Trainer

Inference Endpoints

1 contributor

History: 5 commits

Training in progress, step 500

4ce7fe0 verified 15 days ago

File                       Size           Last commit                     Updated
.gitattributes             1.57 kB        Training in progress, step 350  19 days ago
README.md                  1.8 kB         End of training                 19 days ago
adapter_config.json        732 Bytes      Training in progress, step 500  15 days ago
adapter_model.safetensors  59 MB (LFS)    Training in progress, step 500  15 days ago
added_tokens.json          605 Bytes      Training in progress, step 350  19 days ago
merges.txt                 1.67 MB        Training in progress, step 350  19 days ago
special_tokens_map.json    496 Bytes      Training in progress, step 500  15 days ago
tokenizer.json             11.4 MB (LFS)  Training in progress, step 350  19 days ago
tokenizer_config.json      7.33 kB        Training in progress, step 500  15 days ago
training_args.bin          7.16 kB (LFS)  Training in progress, step 500  15 days ago
vocab.json                 2.78 MB        Training in progress, step 350  19 days ago

training_args.bin: Detected Pickle imports (14)
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.HubStrategy",
- "torch.bfloat16",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "transformers.trainer_utils.IntervalStrategy",
- "accelerate.state.PartialState",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "transformers.training_args.OptimizerNames",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "transformers.trainer_utils.SaveStrategy",
- "torch.device",
- "trl.trainer.sft_config.SFTConfig"
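
The pickle imports reported for training_args.bin can be checked locally without unpickling the file. The following is a minimal sketch using only the Python standard library; it is not the scanner the Hub uses. The function name list_pickle_imports is hypothetical, and the sketch assumes you already have the raw pickle stream as bytes (files written by torch.save are typically zip archives with the pickle stored inside, so it would need to be extracted first). The STACK_GLOBAL handling is a heuristic that does not follow memoized strings, so it may miss some imports in larger files.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> list:
    """List 'module.name' globals referenced by a pickle stream.

    The stream is only disassembled with pickletools.genops; nothing is
    unpickled, so no code from the file is executed.
    """
    imports = set()
    strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Older protocols: the argument is "module name" in one string.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name were pushed as the two most
            # recent strings (heuristic: memoized strings are not tracked).
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return sorted(imports)


# Illustrative input: a pickle that references one class by name.
demo = pickle.dumps(collections.OrderedDict, protocol=4)
print(list_pickle_imports(demo))
```

Comparing the result against the 14 names listed above is one way to confirm that a downloaded copy references only the expected transformers, accelerate, torch, and trl classes before loading it.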