Crataco's picture
Update README.md
466f5b0
---
language:
- en
tags:
- ggml
- causal-lm
- gpt2
- text-generation
license: mit
---
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6251b9851842c08ef3111c4f/NtCV4dSUaHb7XnMYrm1iF.png)
### This repository contains quantized conversions of the AI Dungeon 2 checkpoint, "model_v5".
*For use with frontends that support GGML quantized GPT-2 models. This model works best with KoboldCpp's "Adventure" mode.*
*Last updated on 2023-09-23.*
Model | RAM usage (KoboldCpp) | RAM usage (Oobabooga)
:--:|:--:|:--:
aid2classic-ggml-q4_0.bin | 984.1 MiB | 1.4 GiB
aid2classic-ggml-q4_1.bin | 1.1 GiB | 1.5 GiB
aid2classic-ggml-q5_0.bin | 1.2 GiB | 1.6 GiB
aid2classic-ggml-q5_1.bin | 1.2 GiB | 1.7 GiB
aid2classic-ggml-q8_0.bin | 1.7 GiB | 2.2 GiB
aid2classic-ggml-f16.bin | 3.2 GiB | 3.6 GiB
**Description:**
- 2019 AI Dungeon users may recognize this model as the same one that powered [the open-source AI Dungeon 2 project](https://github.com/Latitude-Archives/AIDungeon) and its various forks. This was before AI Dungeon 2 moved to its own website and consequently rebranded to "AI Dungeon".
- 2020-2021 AI Dungeon users may recognize this model as "Classic", the free tier below Griffin (free, but later used "energy") and Dragon (subscription).
- If you want a better model trained on the same dataset at the cost of higher hardware requirements, check out [Spring Dragon 13B](https://huggingface.co/TheBloke/Spring-Dragon-GGUF), intended to replicate 2020 AI Dungeon's "Dragon" experience on local hardware.
- The motivation behind these quantizations was that [Henk717/ai-dungeon2-classic-ggml](https://huggingface.co/Henk717/ai-dungeon2-classic-ggml) was older and lacked other quantization formats. The workflow for this quantization was also different: henk717's mentions being converted to a 16-bit Pytorch checkpoint before being converted to GGML. This one was converted straight from Tensorflow to 16-bit GGML before being quantized.
**Notes:**
- KoboldCpp [[bfc696f]](https://github.com/LostRuins/koboldcpp/tree/bfc696fcc452975dbe8967c39301ba856d04a030) was tested without OpenBLAS.
- Oobabooga [[895ec9d]](https://github.com/oobabooga/text-generation-webui/tree/895ec9dadb96120e8202a83052bf9032ca3245ae) was tested with with the `--model <model> --loader ctransformers --model_type gpt2` launch arguments.
- ggerganov/ggml [[8ca2c19]](https://github.com/ggerganov/ggml/tree/8ca2c19a3bb8622954d858fbf6383522684eaf34)'s gpt-2 conversion script was used for conversion and quantization.
- The original model was found in the `generator/gpt2/models/model_v5` directory of [AI Dungeon 2 Unleashed](https://henk.tech/aid/).