SriSanth2345's picture
Update README.md
b6b3e4c verified
metadata
datasets:
  - TIGER-Lab/MathInstruct
base_model:
  - meta-llama/Llama-3.2-3B-Instruct
tags:
  - lora
  - adapters
  - llama

LLAMA-3.2-3B-MathInstruct_LORA_SFT

This model is a fine-tuned version of meta-llama/Llama-3.2-3B-Instruct on the MathInstruct dataset. The fine-tuning process was designed to enhance the model's performance for mathematical instruction-following tasks, ensuring improved accuracy and precision when solving math-related problems.

It achieves the following results on the evaluation set:

  • Loss: 0.6895

Model Description

This model is specifically fine-tuned for mathematical reasoning, problem-solving, and instruction-following tasks. Leveraging the LLaMA-3.2-3B-Instruct base model, it has been optimized to handle mathematical queries and tasks with improved efficiency and context understanding.


Training and Evaluation Data

The model was fine-tuned on the MathInstruct dataset.

  • Dataset Source: TIGER-Lab.
  • Dataset Focus: Mathematical instruction-following and reasoning tasks.
  • Scope: A wide range of math topics, including arithmetic, algebra, calculus, and problem-solving.

The dataset was carefully curated to align with instructional objectives for solving mathematical problems and understanding step-by-step reasoning.


Training Procedure

Hyperparameters

  • Learning rate: 0.0001
  • Train batch size: 1
  • Eval batch size: 1
  • Gradient accumulation steps: 8
  • Total effective batch size: 8
  • Optimizer: AdamW (torch)
    • Betas: (0.9, 0.999)
    • Epsilon: 1e-08
  • Learning rate scheduler: Cosine schedule with 10% warmup.
  • Number of epochs: 3.0

Framework Versions

  • PEFT: 0.12.0
  • Transformers: 4.46.1
  • PyTorch: 2.5.1+cu124
  • Datasets: 3.1.0
  • Tokenizers: 0.20.3

Training Results

  • Loss: 0.6895
  • Evaluation indicates strong performance on math instruction-following tasks. Further testing on specific use cases is recommended to assess the model’s generalizability.

Additional Information

  • Author: Sri Santh M
  • Purpose: Fine-tuned for educational and development purposes, particularly for math-related tasks.
  • Dataset Link: MathInstruct Dataset

This model represents a focused effort to adapt the LLaMA-3.2-3B-Instruct model for specialized mathematical use cases. It can be further fine-tuned or extended for more specific mathematical domains or applications.