AmirMohseni committed on
Commit 6c63a6b
1 Parent(s): f98dfc6

Update README.md

Files changed (1)
  1. README.md +16 -10
README.md CHANGED
@@ -1,7 +1,19 @@
- ```markdown
  ---
  library_name: transformers
- tags: [language-model, fine-tuned, instruction-following, SmolLM, HelpSteer2, NVIDIA, A100, English]
+ tags:
+ - language-model
+ - fine-tuned
+ - instruction-following
+ - SmolLM
+ - HelpSteer2
+ - NVIDIA
+ - A100
+ - English
+ language: en
+ license: apache-2.0
+ datasets:
+ - nvidia/HelpSteer2
+ model_name: SmolLM-360M-Instruct-finetuned-sft
  ---
  
  # Model Card for `SmolLM-360M-Instruct-finetuned-sft`
@@ -60,11 +72,9 @@ print(response)
  ## Training Details
  
  ### Training Data
- 
- The model was fine-tuned using the [HelpSteer2](https://huggingface.co/datasets/nvidia/HelpSteer2) dataset, which consists of approximately 21,400 examples of instruction-based prompts and corresponding responses. The dataset is designed to enhance AI models' ability to generate helpful, correct, and coherent outputs.
+ The model was fine-tuned using the [HelpSteer2 dataset](https://huggingface.co/datasets/nvidia/HelpSteer2), which consists of approximately 21,400 examples of instruction-based prompts and corresponding responses. The dataset is designed to enhance AI models' ability to generate helpful, correct, and coherent outputs.
  
  ### Training Procedure
- 
  The fine-tuning was performed using the following hyperparameters:
  
  - **Training regime:** Mixed precision (FP16)
@@ -78,16 +88,13 @@ The fine-tuning was performed using the following hyperparameters:
  ## Evaluation
  
  ### Testing Data, Factors & Metrics
- 
  The model was evaluated using a validation subset of the HelpSteer2 dataset.
  
  #### Metrics
- 
  - **Training Loss:** Final loss was 5.4814.
  - **Validation Loss:** Final loss was 5.4625.
  
  ### Results
- 
  The model demonstrated a consistent decrease in both training and validation losses, indicating effective learning and good generalization.
  
  ## Environmental Impact
@@ -95,5 +102,4 @@ The model demonstrated a consistent decrease in both training and validation los
  Carbon emissions for the training process were minimal due to the efficient use of the NVIDIA A100 GPU, which allowed for rapid fine-tuning within an hour.
  
  - **Hardware Type:** NVIDIA A100 GPU
- - **Hours used:** Less than 1 hour
- ```
+ - **Hours used:** Less than 1 hour
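The second hunk header shows the card's usage example ending in `print(response)`, so the README already walks through loading and querying the model. For readers of this commit, a minimal inference sketch along those lines, assuming the Hub repo id is `AmirMohseni/SmolLM-360M-Instruct-finetuned-sft` (inferred from the committer and model name, not stated in the diff):

```python
# Minimal inference sketch. The repo id below is an assumption inferred from the
# committer and model name; the card's own example similarly ends in print(response).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "AmirMohseni/SmolLM-360M-Instruct-finetuned-sft"  # assumed Hub id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# SmolLM instruct checkpoints ship a chat template, so format the prompt with it.
messages = [{"role": "user", "content": "Explain supervised fine-tuning in two sentences."}]
inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")

outputs = model.generate(inputs, max_new_tokens=200, do_sample=True, temperature=0.7)
response = tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True)
print(response)
```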
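The Training Procedure section touched by this diff only pins down mixed-precision FP16 on an A100; the concrete hyperparameters sit outside the changed hunks. A rough sketch of the kind of SFT run the card describes, in which the HelpSteer2 `prompt`/`response` field names, the base checkpoint id, and every setting other than `fp16=True` are illustrative assumptions:

```python
# Sketch of the kind of SFT run the card describes. The HelpSteer2 column names
# ("prompt"/"response"), the base checkpoint id, and every hyperparameter other
# than fp16 mixed precision are illustrative assumptions, not values from the card.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

base_id = "HuggingFaceTB/SmolLM-360M-Instruct"  # assumed base checkpoint
tokenizer = AutoTokenizer.from_pretrained(base_id)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(base_id)

dataset = load_dataset("nvidia/HelpSteer2")  # train / validation splits

def to_text(example):
    # Join each prompt with its reference response into one training string.
    return {"text": example["prompt"] + "\n" + example["response"]}

def tokenize(example):
    return tokenizer(example["text"], truncation=True, max_length=1024)

tokenized = (dataset.map(to_text)
                    .map(tokenize, remove_columns=dataset["train"].column_names + ["text"]))

args = TrainingArguments(
    output_dir="smollm-360m-helpsteer2-sft",
    fp16=True,                      # the only setting the card states explicitly
    per_device_train_batch_size=8,  # placeholder
    num_train_epochs=1,             # placeholder
    logging_steps=50,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["validation"],
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
trainer.evaluate()
```

A 360M-parameter model and a roughly 21k-example dataset fit comfortably in a single-A100 run, which is consistent with the card's "less than 1 hour" figure.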