Edit model card

ShuttleAI Thumbnail

💻 Use via API

Shuttle-3 (beta) [2024/10/25]

We are excited to introduce Shuttle-3, our next-generation state-of-the-art language model designed to excel in complex chat, multilingual communication, reasoning, and agent tasks.

  • Shuttle-3 is a fine-tuned version of Qwen-2.5-72b-Instruct, emulating the writing style of Claude 3 models and thoroughly trained on role-playing data.

Model Details

Key Features

  • Pretrained on a large proportion of multilingual and code data
  • Finetuned to emulate the prose quality of Claude 3 models and extensively on role play data

Fine-Tuning Details

  • Training Setup: Trained on 130 million tokens for 12 hours using 4 A100 PCIe GPUs.

Prompting

Shuttle-3 uses ChatML as its prompting format:

<|im_start|>system
You are a pirate! Yardy harr harr!<|im_end|>
<|im_start|>user
Where are you currently!<|im_end|>
<|im_start|>assistant
Look ahoy ye scallywag! We're on the high seas!<|im_end|>
Downloads last month
606
Safetensors
Model size
72.7B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for shuttleai/shuttle-3

Base model

Qwen/Qwen2.5-72B
Finetuned
(21)
this model
Merges
2 models
Quantizations
4 models

Collection including shuttleai/shuttle-3