---
version: main
family: smollm2-1.7b
model_name: -all_raw_folders_metadata-600B
license: mit
tags:
- model
- transformer
- smollm2
---
# SmolLM2 -all_raw_folders_metadata-600B (Version: main)
## Model Details
- **Architecture:** SmolLM2
- **Parameters:** 1.7B
## Training Configuration
```yaml
optimizer:
  class_path: torch.optim.AdamW
  init_args:
    lr: 0.0005
    weight_decay: 0.01
precision: bf16-mixed
seed: 42
train:
  global_batch_size: 1024
  max_seq_length: 2048
  max_tokens: 600000000000
  micro_batch_size: 8
```
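As a rough illustration of what these settings imply, the sketch below derives the tokens processed per optimizer step and the approximate number of steps needed to reach `max_tokens`. It assumes every sequence is packed to the full `max_seq_length`. The `num_devices` argument is hypothetical, since the device count is not stated in this card.

```python
# Values copied from the training configuration above.
GLOBAL_BATCH_SIZE = 1024
MAX_SEQ_LENGTH = 2048
MAX_TOKENS = 600_000_000_000
MICRO_BATCH_SIZE = 8

# Tokens consumed by one optimizer step, assuming fully packed sequences.
tokens_per_step = GLOBAL_BATCH_SIZE * MAX_SEQ_LENGTH

# Whole optimizer steps that fit in the token budget.
steps = MAX_TOKENS // tokens_per_step


def accumulation_steps(num_devices: int) -> int:
    """Gradient-accumulation steps per device.

    `num_devices` is a hypothetical value; the card does not state
    how many devices were used for training.
    """
    per_pass = MICRO_BATCH_SIZE * num_devices
    if GLOBAL_BATCH_SIZE % per_pass != 0:
        raise ValueError("global batch size must be divisible by micro_batch_size * num_devices")
    return GLOBAL_BATCH_SIZE // per_pass


print(f"tokens per step: {tokens_per_step:,}")
print(f"optimizer steps: {steps:,}")
```

Under these assumptions, each step covers about 2.1M tokens and the 600B-token budget corresponds to roughly 286K optimizer steps.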
## Model Loading and Revision System
This repository hosts multiple revisions of the model.
To load a specific revision, use the `revision` parameter. For example:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("locuslab/-all_raw_folders_metadata-600B", revision="final")
tokenizer = AutoTokenizer.from_pretrained("locuslab/-all_raw_folders_metadata-600B", revision="final")
```
Replace `"final"` with the desired revision; `revision` accepts a branch name, a tag name, or a commit hash.
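To find out which revision names exist, one option is to query the Hub for the repository's branches and tags. The following is a minimal sketch using `list_repo_refs` from the `huggingface_hub` package; the helper name `list_revisions` is hypothetical, and the call needs network access to the Hub.

```python
def list_revisions(repo_id: str) -> list[str]:
    """Return the sorted branch and tag names of a Hub repository.

    Hypothetical helper for illustration; it is not part of this repository.
    """
    # Imported here so the function can be defined without the package installed.
    from huggingface_hub import list_repo_refs

    refs = list_repo_refs(repo_id)
    # Branches and tags are both valid values for `revision`.
    return sorted(ref.name for ref in refs.branches + refs.tags)
```

Calling `list_revisions(...)` with the repository id used in the loading example returns names that can be passed directly as the `revision` argument of `from_pretrained`.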