shizhediao2/mamba1b-4nodes-lr5e-5-ep1-bsz1024-packing-nemo-sft1-new

1 contributor

History: 10 commits

Latest commit: Upload train_results.json with huggingface_hub (e64f666, verified, 2 months ago)

File                     Size           Last commit                                          Updated
.gitattributes           1.52 kB        initial commit                                       2 months ago
README.md                31 Bytes       initial commit                                       2 months ago
config.json              4.03 kB        Upload config.json with huggingface_hub              2 months ago
generation_config.json   154 Bytes      Upload generation_config.json with huggingface_hub   2 months ago
model.safetensors        3.15 GB (LFS)  Upload model.safetensors with huggingface_hub        2 months ago
special_tokens_map.json  434 Bytes      Upload special_tokens_map.json with huggingface_hub  2 months ago
tokenizer.json           1.84 MB        Upload tokenizer.json with huggingface_hub           2 months ago
tokenizer.model          500 kB (LFS)   Upload tokenizer.model with huggingface_hub          2 months ago
tokenizer_config.json    2.5 kB         Upload tokenizer_config.json with huggingface_hub    2 months ago
train_results.json       226 Bytes      Upload train_results.json with huggingface_hub       2 months ago
training_args.bin        7.48 kB (LFS)  Upload training_args.bin with huggingface_hub        2 months ago

Detected Pickle imports (12) in training_args.bin:
- "transformers.trainer_utils.SchedulerType",
- "torch.bfloat16",
- "lmflow.args.FinetunerArguments",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.training_args.OptimizerNames",
- "transformers.trainer_utils.HubStrategy",
- "accelerate.state.PartialState",
- "torch.device",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "accelerate.utils.dataclasses.DistributedType"
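
The pickle imports reported for training_args.bin are the classes and functions a pickle stream would import when loaded. As a rough illustration of how such a list can be produced without unpickling anything, the sketch below walks the opcodes with the standard-library `pickletools` module. It is not the Hub's scanner: the helper name `list_pickle_imports` is hypothetical, the string tracking for `STACK_GLOBAL` is deliberately simplified (it ignores memo lookups), and it expects a raw pickle stream. If training_args.bin is a zip-based `torch.save` archive, which is an assumption here, the embedded pickle would have to be extracted first, and that step is not shown.

```python
import pickle
import pickletools
from collections import OrderedDict

# Opcodes that push a plain string, which STACK_GLOBAL may later consume.
_STRING_OPS = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a pickle stream.

    The stream is only parsed, never executed. Hypothetical helper:
    a simplified sketch, not a complete or hardened scanner.
    """
    imports = set()
    recent_strings = []  # simplified stand-in for the unpickler's stack
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument arrives as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name in _STRING_OPS:
            recent_strings.append(arg)
        elif opcode.name == "STACK_GLOBAL" and len(recent_strings) >= 2:
            # Protocol 4+: module and name are the two strings pushed last.
            # Assumption: neither string was fetched from the memo.
            module, name = recent_strings[-2], recent_strings[-1]
            imports.add(f"{module}.{name}")
    return imports


if __name__ == "__main__":
    # Pickle a class reference under an older and a newer protocol.
    for protocol in (3, 4):
        payload = pickle.dumps(OrderedDict, protocol=protocol)
        print(protocol, sorted(list_pickle_imports(payload)))
```

Parsing opcodes rather than calling `pickle.load` matters here because loading an untrusted pickle can execute arbitrary code, whereas `pickletools.genops` only reads the byte stream.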