Art-v0-3B-GGUF / README.md
aashish1904's picture
Upload README.md with huggingface_hub
fc30d80 verified
|
raw
history blame
1.83 kB
metadata
license: other

QuantFactory Banner

QuantFactory/Art-v0-3B-GGUF

This is quantized version of AGI-0/Art-v0-3B created using llama.cpp

Original Model Card

Auto Regressive Thinker (Art) v0 3B

Art v0 3B is our inaugural model in the Art series, fine-tuned from Qwen/Qwen2.5-3B-Instruct using a specialized dataset generated with Gemini 2.0 Flash Thinking. Read more about the Art series

Model Details

  • Base Model: Qwen2.5-3B-Instruct
  • Architecture: Transformer
  • Size: 3B parameters

Usage

The model incorporates a reasoning mechanism using specific tags:

<|start_reasoning|> model's reasoning process <|end_reasoning|> model's response

Recommendations

  • Use the model without quantization
  • Use the tokenizer chat template
  • Use a low temperature 0.1-0.3 and repetition_penalty of 1.1

Training Details

This experimental model was trained on a curated dataset generated using Gemini 2.0 Flash Thinking. Detailed training methodology, dataset, and code are available exclusively to our community members.

About Us

We are a community-funded AI research lab focused on advancing open-source AGI development. Our community members support us through Patreon donations.

Community Access

Our supporters get exclusive access to:

  • Training dataset
  • Training code and methodology
  • Behind-the-scenes development insights
  • Future model previews

Join Our Community