Aratako/sarashina2.1-1b-sft
Tags: Text Generation, Transformers, PyTorch, Safetensors, Japanese, llama, axolotl, Generated from Trainer, conversational, text-generation-inference, Inference Endpoints
License: other
sarashina2.1-1b-sft at revision 2b4a075
1 contributor, History: 5 commits
Latest commit: Aratako, "Update tokenizer_config.json" (2b4a075, verified), 11 days ago
| File | Size | Storage | Last commit | Updated |
| --- | --- | --- | --- | --- |
| .gitattributes | 1.52 kB | | initial commit | 12 days ago |
| LICENSE | 11.6 kB | | Update LICENSE | 11 days ago |
| README.md | 6.21 kB | | End of training | 11 days ago |
| added_tokens.json | 53 Bytes | | End of training | 11 days ago |
| config.json | 728 Bytes | | End of training | 11 days ago |
| generation_config.json | 132 Bytes | | End of training | 11 days ago |
| model.safetensors | 2.82 GB | LFS | End of training | 11 days ago |
| pytorch_model.bin (pickle) | 2.82 GB | LFS | End of training | 11 days ago |
| special_tokens_map.json | 968 Bytes | | End of training | 11 days ago |
| tokenizer.json | 6.72 MB | | End of training | 11 days ago |
| tokenizer.model | 1.83 MB | LFS | End of training | 11 days ago |
| tokenizer_config.json | 4.46 kB | | Update tokenizer_config.json | 11 days ago |
| training_args.bin (pickle) | 8.12 kB | LFS | End of training | 11 days ago |

pytorch_model.bin: Detected Pickle imports (3)
"torch.BFloat16Storage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"

training_args.bin: Detected Pickle imports (13)
"transformers.trainer_utils.IntervalStrategy", "axolotl.core.trainer_builder.AxolotlTrainingArguments", "accelerate.utils.dataclasses.DistributedType", "transformers.trainer_pt_utils.AcceleratorConfig", "transformers.trainer_utils.SchedulerType", "transformers.training_args.OptimizerNames", "transformers.integrations.deepspeed.HfDeepSpeedConfig", "accelerate.state.PartialState", "torch.device", "accelerate.utils.dataclasses.DeepSpeedPlugin", "torch.bfloat16", "transformers.trainer_utils.HubStrategy", "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig"
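
The "Detected Pickle imports" entries for pytorch_model.bin and training_args.bin name the globals each file would import when unpickled. As a rough illustration of how such a list can be derived without loading the file, the sketch below walks a pickle stream with Python's standard `pickletools` module. It is not the Hub's actual scanner: the function name `list_pickle_imports` is hypothetical, the `STACK_GLOBAL` handling is a simplification that ignores memoized strings, and it expects a raw pickle stream (files written by `torch.save` usually wrap the stream in a zip archive, which this sketch does not unpack).

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> set:
    """Return 'module.name' for each global referenced by a raw pickle stream.

    Hypothetical helper for illustration; it only reads opcodes and never
    executes the pickle.
    """
    imports = set()
    strings = []  # string arguments seen so far, consulted for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: arg is "module name", separated by one space.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are pushed as strings beforehand.
            # Simplification: assume they are the two most recent strings.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Usage: an OrderedDict pickles to a stream that references one global.
found = list_pickle_imports(pickle.dumps(OrderedDict(a=1), protocol=4))
print(sorted(found))
```

Because the scan never calls `pickle.load`, it can be run on untrusted files. Where a safetensors copy of the weights exists, as model.safetensors does in this listing, it avoids the pickle format altogether.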