iiiiwis
/

DEMO_Agent

Text Generation

Model card Files Files and versions Community

Training procedure

total_batch_size: 32
epoch: 3
lr: 1.0e-4
warm-up rate: 0.1
type: Lora

Framework versions

LLaMA-Factory: v0.9.0

Paper

link: arxiv.org/abs/2412.04905

Data

link: https://github.com/MozerWang/DEMO

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Examples

Text Generation

Unable to determine this model's library. Check the docs .

Model tree for iiiiwis/DEMO_Agent

Base model

Qwen/Qwen2-7B

Finetuned

Qwen/Qwen2-7B-Instruct

Finetuned

(59)

this model