---
version: main
family: smollm2-1.7b
model_name: -all_raw_folders_metadata-600B
license: mit
tags:
- model
- transformer
- smollm2
---

# SmolLM2 -all_raw_folders_metadata-600B (Version: main)

## Model Details

- **Architecture:** SmolLM2
- **Parameters:** 1.7B

## Training Configuration

```yaml
optimizer:
  class_path: torch.optim.AdamW
  init_args:
    lr: 0.0005
    weight_decay: 0.01
precision: bf16-mixed
seed: 42
train:
  global_batch_size: 1024
  max_seq_length: 2048
  max_tokens: 600000000000
  micro_batch_size: 8
```
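
To make the scale of this configuration easier to read, the sketch below derives a few quantities from the values above. It assumes that `global_batch_size` counts sequences per optimizer step and that every sequence is filled to `max_seq_length` tokens; neither is stated in this card. The device count passed to `grad_accum_steps` is purely hypothetical, since the training hardware is not documented here.

```python
import math

# Values copied from the training configuration above.
GLOBAL_BATCH_SIZE = 1024          # assumed: sequences per optimizer step
MAX_SEQ_LENGTH = 2048             # tokens per sequence
MAX_TOKENS = 600_000_000_000      # total token budget
MICRO_BATCH_SIZE = 8              # sequences per device per forward pass

# Tokens consumed by one optimizer step, assuming full-length sequences.
tokens_per_step = GLOBAL_BATCH_SIZE * MAX_SEQ_LENGTH

# Optimizer steps needed to reach the token budget (rounded up).
optimizer_steps = math.ceil(MAX_TOKENS / tokens_per_step)


def grad_accum_steps(num_devices: int) -> int:
    """Gradient accumulation steps for a hypothetical device count."""
    sequences_per_pass = MICRO_BATCH_SIZE * num_devices
    if GLOBAL_BATCH_SIZE % sequences_per_pass != 0:
        raise ValueError("global batch size must be divisible by "
                         "micro_batch_size * num_devices")
    return GLOBAL_BATCH_SIZE // sequences_per_pass


print(f"tokens per step: {tokens_per_step:,}")   # 2,097,152
print(f"optimizer steps: {optimizer_steps:,}")   # 286,103
print(f"accumulation steps on 8 devices (hypothetical): {grad_accum_steps(8)}")  # 16
```

Under these assumptions, each optimizer step covers roughly 2.1M tokens, and the 600B-token budget corresponds to about 286k steps.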

## Model Loading and Revision System

This repository hosts multiple revisions of the model.
To load a specific revision, pass the `revision` argument to `from_pretrained` for both the model and the tokenizer. For example:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the model weights and the tokenizer from the same revision.
model = AutoModelForCausalLM.from_pretrained("locuslab/-all_raw_folders_metadata-600B", revision="final")
tokenizer = AutoTokenizer.from_pretrained("locuslab/-all_raw_folders_metadata-600B", revision="final")
```

Replace `"final"` with the name of the revision you want to load.
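
This card does not list the available revision names. One way to discover them is to query the Hub for the repository's branches and tags, as in the minimal sketch below. It assumes the `huggingface_hub` package is installed and that revisions are published as branches or tags; the helper names `merge_revision_names` and `list_revisions` are illustrative and not part of any library.

```python
def merge_revision_names(branch_names, tag_names):
    """Return a sorted, de-duplicated list of revision names."""
    return sorted(set(branch_names) | set(tag_names))


def list_revisions(repo_id):
    """Fetch branch and tag names for a Hub repository (needs network access)."""
    # Imported lazily so merge_revision_names works without the dependency.
    from huggingface_hub import list_repo_refs

    refs = list_repo_refs(repo_id)
    return merge_revision_names(
        [branch.name for branch in refs.branches],
        [tag.name for tag in refs.tags],
    )


# Example call (requires network access):
# names = list_revisions("locuslab/-all_raw_folders_metadata-600B")
```

Any name returned this way can be passed as `revision` to `from_pretrained`.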