---
language:
  - en
  - ko
tags:
  - generation
license: apache-2.0
model-index:
  - name: task_1
    results:
      - task:
          type: natural-language-generation
        dataset:
          type: hellaswag
          name: hellaswag(10 shots)
        metrics:
          - type: acc_norm
            value: 27.7
  - name: task_2
    results:
      - task:
          type: natural-language-generation
        dataset:
          type: ARC
          name: ARC(25 shots)
        metrics:
          - type: acc_norm
            value: 23.8
  - name: task_3
    results:
      - task:
          type: natural-language-generation
        dataset:
          type: MMLU
          name: MMLU(5 shots)
        metrics:
          - type: acc
            value: 24.9

  - name: task_4
    results:
      - task:
          type: natural-language-generation
        dataset:
          type: TruthfulQA 
          name: TruthfulQA(0 shots)
        metrics:
          - type: mc2
            value: 46.5
---

Pretrained GPT2 with expanded n_ctx up to 2048(also with expanded embedding dimension to 1536) in Korean.
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_psyche__kogpt)

| Metric                | Value                     |
|-----------------------|---------------------------|
| Avg.                  | 24.27   |
| ARC (25-shot)         | 21.16          |
| HellaSwag (10-shot)   | 28.11    |
| MMLU (5-shot)         | 26.56         |
| TruthfulQA (0-shot)   | 42.06   |
| Winogrande (5-shot)   | 49.09   |
| GSM8K (5-shot)        | 0.0        |
| DROP (3-shot)         | 2.89         |