---
language:
- en
tags:
- ggml
- causal-lm
- gpt2
- text-generation
license: mit
---

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6251b9851842c08ef3111c4f/NtCV4dSUaHb7XnMYrm1iF.png)

### This repository contains quantized conversions of the AI Dungeon 2 checkpoint, "model_v5".

*For use with frontends that support GGML quantized GPT-2 models. This model works best with KoboldCpp's "Adventure" mode.*

*Last updated on 2023-09-23.*

Model | RAM usage (KoboldCpp) | RAM usage (Oobabooga)
:--:|:--:|:--:
aid2classic-ggml-q4_0.bin | 984.1 MiB | 1.4 GiB
aid2classic-ggml-q4_1.bin | 1.1 GiB | 1.5 GiB
aid2classic-ggml-q5_0.bin | 1.2 GiB | 1.6 GiB
aid2classic-ggml-q5_1.bin | 1.2 GiB | 1.7 GiB
aid2classic-ggml-q8_0.bin | 1.7 GiB | 2.2 GiB
aid2classic-ggml-f16.bin | 3.2 GiB | 3.6 GiB

**Description:**
- 2019 AI Dungeon users may recognize this model as the one that powered [the open-source AI Dungeon 2 project](https://github.com/Latitude-Archives/AIDungeon) and its various forks. This was before AI Dungeon 2 moved to its own website and was consequently rebranded as "AI Dungeon".
- 2020-2021 AI Dungeon users may recognize this model as "Classic", the free tier below Griffin (free, but later used "energy") and Dragon (subscription).
- If you want a better model trained on the same dataset at the cost of higher hardware requirements, check out [Spring Dragon 13B](https://huggingface.co/TheBloke/Spring-Dragon-GGUF), which is intended to replicate 2020 AI Dungeon's "Dragon" experience on local hardware.
- These quantizations were made because [Henk717/ai-dungeon2-classic-ggml](https://huggingface.co/Henk717/ai-dungeon2-classic-ggml) was older and lacked other quantization formats. The workflow was also different: henk717's repository mentions conversion to a 16-bit PyTorch checkpoint before conversion to GGML, whereas this model was converted straight from TensorFlow to 16-bit GGML before being quantized.

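
The RAM figures in the table above can be used to choose a file for a given machine. The following is a minimal sketch, not part of any frontend: the `pick_model` helper and its `free_ram_mib` parameter are hypothetical names introduced here, the numbers are copied from the KoboldCpp column, and it assumes that files listed later in the table (higher-bit quantizations) are preferable when they fit.

```python
# RAM usage in MiB, copied from the KoboldCpp column of the table above
# and kept in table order (smallest to largest). 1 GiB = 1024 MiB.
KOBOLDCPP_RAM_MIB = [
    ("aid2classic-ggml-q4_0.bin", 984.1),
    ("aid2classic-ggml-q4_1.bin", 1.1 * 1024),
    ("aid2classic-ggml-q5_0.bin", 1.2 * 1024),
    ("aid2classic-ggml-q5_1.bin", 1.2 * 1024),
    ("aid2classic-ggml-q8_0.bin", 1.7 * 1024),
    ("aid2classic-ggml-f16.bin", 3.2 * 1024),
]


def pick_model(free_ram_mib, table=KOBOLDCPP_RAM_MIB):
    """Return the last file in the table whose listed RAM usage fits.

    Hypothetical helper: returns None when even the smallest file
    exceeds the given budget.
    """
    choice = None
    for name, ram_mib in table:
        if ram_mib <= free_ram_mib:
            choice = name  # later rows overwrite earlier ones
    return choice
```

For example, `pick_model(1300)` selects `aid2classic-ggml-q5_1.bin`, while a budget below 984.1 MiB yields `None`. The listed figures were measured with the specific builds named in the notes, so actual usage on other versions may differ.
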
**Notes:**
- KoboldCpp [[bfc696f]](https://github.com/LostRuins/koboldcpp/tree/bfc696fcc452975dbe8967c39301ba856d04a030) was tested without OpenBLAS.
- Oobabooga [[895ec9d]](https://github.com/oobabooga/text-generation-webui/tree/895ec9dadb96120e8202a83052bf9032ca3245ae) was tested with the `--model --loader ctransformers --model_type gpt2` launch arguments.
- ggerganov/ggml [[8ca2c19]](https://github.com/ggerganov/ggml/tree/8ca2c19a3bb8622954d858fbf6383522684eaf34)'s gpt-2 conversion script was used for conversion and quantization.
- The original model was found in the `generator/gpt2/models/model_v5` directory of [AI Dungeon 2 Unleashed](https://henk.tech/aid/).
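
The Oobabooga note above uses the `ctransformers` loader with the model type `gpt2`. As a rough illustration, the same settings could be passed to the `ctransformers` Python package directly. This is a sketch under assumptions, not a tested recipe for this repository: it assumes `ctransformers` is installed and still accepts these GGML files, and the `generate` helper, its parameters, and the example path are hypothetical.

```python
# Matches the --model_type value from the Oobabooga note above.
MODEL_TYPE = "gpt2"


def generate(model_path, prompt, max_new_tokens=64):
    """Load a local GGML file with ctransformers and continue `prompt`.

    Hypothetical helper; `model_path` is the path to one of the .bin
    files from this repository after downloading it.
    """
    # Imported lazily so this module loads even without the package.
    from ctransformers import AutoModelForCausalLM

    llm = AutoModelForCausalLM.from_pretrained(model_path, model_type=MODEL_TYPE)
    # Calling the model returns the generated continuation as a string.
    return llm(prompt, max_new_tokens=max_new_tokens)
```

A call might look like `generate("models/aid2classic-ggml-q4_0.bin", "You enter the cave.")`, where the path is a placeholder for wherever the file was saved.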