Training procedure

  • total batch size: 32
  • epochs: 3
  • learning rate: 1.0e-4
  • warm-up ratio: 0.1
  • fine-tuning type: LoRA
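
As a rough illustration of what these settings imply, the sketch below derives the number of optimizer steps and the length of the warm-up phase from the batch size, epoch count, and warm-up fraction of 0.1 listed above. The dataset size (`num_examples`) is a hypothetical input, since the card does not state it. Linear warm-up and the rounding-up of step counts are assumptions of this sketch, and because the card does not name the decay schedule used after warm-up, the learning rate is simply held at its peak here.

```python
import math

# Values taken from the list above.
TOTAL_BATCH_SIZE = 32
EPOCHS = 3
PEAK_LR = 1.0e-4
WARMUP_RATIO = 0.1


def schedule_summary(num_examples: int) -> dict:
    """Derive step counts for a dataset of `num_examples` (hypothetical size)."""
    # Assumption: the last partial batch of each epoch still counts as a step.
    steps_per_epoch = math.ceil(num_examples / TOTAL_BATCH_SIZE)
    total_steps = steps_per_epoch * EPOCHS
    # Assumption: the warm-up length is rounded up to a whole step.
    warmup_steps = math.ceil(total_steps * WARMUP_RATIO)
    return {
        "steps_per_epoch": steps_per_epoch,
        "total_steps": total_steps,
        "warmup_steps": warmup_steps,
    }


def lr_at(step: int, warmup_steps: int) -> float:
    """Learning rate at a given optimizer step.

    Assumption: linear warm-up from 0 to PEAK_LR. The post-warm-up decay
    is not specified in the card, so the rate stays at PEAK_LR afterwards.
    """
    if step < warmup_steps:
        return PEAK_LR * step / max(1, warmup_steps)
    return PEAK_LR


if __name__ == "__main__":
    summary = schedule_summary(num_examples=10_000)  # hypothetical dataset size
    print(summary)
    for step in (0, summary["warmup_steps"] // 2, summary["warmup_steps"]):
        print(step, lr_at(step, summary["warmup_steps"]))
```

With a different dataset size only `num_examples` changes; the three quantities scale accordingly.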

Framework versions

  • LLaMA-Factory: v0.9.0

Paper

  • link: arxiv.org/abs/2412.04905

Data

Base model

  • Qwen/Qwen2-7B (iiiiwis/DEMO_Agent is fine-tuned from this model)