PEFT
Safetensors

Sky-T1-32B with Step-by-Step Thinking

This repo contains a LoRA adapter to NovaSky-AI/Sky-T1-32B-Preview for step-by-step thinking. It breaks down the thinking process to steps, allowing better tracking of the thought process.

The model was finetuned without any system prompt.

Each step is outputted in a specific format:

<|step|> TITLE
TEXT

Thus the model's response should adhere to the following format:

<|begin_of_thought|>
<|step|> STEP_1_TITLE
STEP_1_THINKING
<|step|> STEP_2_TITLE
STEP_2_THINKING
...
<|end_of_thought|>
<|begin_of_solution|>
...
<|end_of_solution|>
Downloads last month
0
Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for shakedzy/Sky-T1-32B-Steps

Base model

Qwen/Qwen2.5-32B
Adapter
(1)
this model

Dataset used to train shakedzy/Sky-T1-32B-Steps