---
license: apache-2.0
tags:
- text-generation
base_model: Locutusque/TinyMistral-248M
datasets:
- OpenAssistant/oasst_top1_2023-08-25
widget:
- messages:
  - role: user
    content: Invited some friends to come home today. Give me some ideas for games
      to play with them!
- messages:
  - role: user
    content: How do meteorologists predict how much air pollution will be produced
      in the next year?
- messages:
  - role: user
    content: Who is Mona Lisa?
- messages:
  - role: user
    content: Heya!
  - role: assistant
    content: Hi! How may I help you today?
  - role: user
    content: I need to build a simple website. Where should I start learning about
      web development?
- messages:
  - role: user
    content: What are some potential applications for quantum computing?
- messages:
  - role: user
    content: Got a question for you!
  - role: assistant
    content: Sure! What's it?
  - role: user
    content: Why do you love cats so much!? 🐈
- messages:
  - role: user
    content: Tell me about the pros and cons of social media.
- messages:
  - role: user
    content: 'Question: What is a dog?'
  - role: assistant
    content: A dog is a four-legged, domesticated animal that is a member of the class
      Mammalia, which includes all mammals. Dogs are known for their loyalty, playfulness,
      and ability to be trained for various tasks. They are also used for hunting,
      herding, and as service animals.
  - role: user
    content: 'Question: What is the capital of France?'
  - role: assistant
    content: The capital of France is Paris. Paris is located in the north-central
      region of France and is known for its famous landmarks, such as the Eiffel Tower,
      the Louvre Museum, and Notre-Dame Cathedral.
  - role: user
    content: 'Question: What is the color of an apple?'
inference: false
model-index:
- name: TinyMistral-248M-SFT-v3
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: AI2 Reasoning Challenge (25-Shot)
      type: ai2_arc
      config: ARC-Challenge
      split: test
      args:
        num_few_shot: 25
    metrics:
    - type: acc_norm
      value: 21.93
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/TinyMistral-248M-SFT-v3
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: HellaSwag (10-Shot)
      type: hellaswag
      split: validation
      args:
        num_few_shot: 10
    metrics:
    - type: acc_norm
      value: 28.26
      name: normalized accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/TinyMistral-248M-SFT-v3
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU (5-Shot)
      type: cais/mmlu
      config: all
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 22.91
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/TinyMistral-248M-SFT-v3
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: TruthfulQA (0-shot)
      type: truthful_qa
      config: multiple_choice
      split: validation
      args:
        num_few_shot: 0
    metrics:
    - type: mc2
      value: 40.03
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/TinyMistral-248M-SFT-v3
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: Winogrande (5-shot)
      type: winogrande
      config: winogrande_xl
      split: validation
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 51.54
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/TinyMistral-248M-SFT-v3
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GSM8k (5-shot)
      type: gsm8k
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 0.0
      name: accuracy
    source:
      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Felladrin/TinyMistral-248M-SFT-v3
      name: Open LLM Leaderboard
---

# Locutusque's TinyMistral-248M trained on OpenAssistant TOP-1 Conversation Threads

- Base model: [Locutusque/TinyMistral-248M](https://huggingface.co/Locutusque/TinyMistral-248M)
- Dataset: [OpenAssistant/oasst_top1_2023-08-25](https://huggingface.co/datasets/OpenAssistant/oasst_top1_2023-08-25)
- Availability in other ML formats:
  - GGUF: [Felladrin/gguf-TinyMistral-248M-SFT-v4](https://huggingface.co/Felladrin/gguf-TinyMistral-248M-SFT-v4)
  - ONNX: [Felladrin/onnx-TinyMistral-248M-SFT-v4](https://huggingface.co/Felladrin/onnx-TinyMistral-248M-SFT-v4)

## Where to try out this model

The [inference widget from HuggingFace was not working properly for this model](https://discuss.huggingface.co/t/api-endpoint-not-working-as-expected/69457), so it was temporarily disabled.

To try out this model online, please visit this HuggingFace Space: [Felladrin/ModelsPlayground](https://huggingface.co/spaces/Felladrin/ModelsPlayground)

## Recommended Prompt Format

```
<|im_start|>user
{message}<|im_end|>
<|im_start|>assistant
```

## Recommended Inference Parameters

```yml
penalty_alpha: 0.5
top_k: 5
```

# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Felladrin__TinyMistral-248M-SFT-v3)

|             Metric              |Value|
|---------------------------------|----:|
|Avg.                             |27.45|
|AI2 Reasoning Challenge (25-Shot)|21.93|
|HellaSwag (10-Shot)              |28.26|
|MMLU (5-Shot)                    |22.91|
|TruthfulQA (0-shot)              |40.03|
|Winogrande (5-shot)              |51.54|
|GSM8k (5-shot)                   | 0.00|