Crataco
/

AI-Dungeon-2-Classic-GGML

Text Generation

Model card Files Files and versions Community

AI-Dungeon-2-Classic-GGML / README.md

Crataco's picture

Update README.md

466f5b0 10 months ago

|

history blame contribute delete

No virus

2.65 kB

	---
	language:
	- en
	tags:
	- ggml
	- causal-lm
	- gpt2
	- text-generation
	license: mit
	---
	![image/png](https://cdn-uploads.huggingface.co/production/uploads/6251b9851842c08ef3111c4f/NtCV4dSUaHb7XnMYrm1iF.png)
	### This repository contains quantized conversions of the AI Dungeon 2 checkpoint, "model_v5".
	For use with frontends that support GGML quantized GPT-2 models. This model works best with KoboldCpp's "Adventure" mode.

	Last updated on 2023-09-23.

	Model \| RAM usage (KoboldCpp) \| RAM usage (Oobabooga)
	:--:\|:--:\|:--:
	aid2classic-ggml-q4_0.bin \| 984.1 MiB \| 1.4 GiB
	aid2classic-ggml-q4_1.bin \| 1.1 GiB \| 1.5 GiB
	aid2classic-ggml-q5_0.bin \| 1.2 GiB \| 1.6 GiB
	aid2classic-ggml-q5_1.bin \| 1.2 GiB \| 1.7 GiB
	aid2classic-ggml-q8_0.bin \| 1.7 GiB \| 2.2 GiB
	aid2classic-ggml-f16.bin \| 3.2 GiB \| 3.6 GiB

	Description:
	- 2019 AI Dungeon users may recognize this model as the same one that powered [the open-source AI Dungeon 2 project](https://github.com/Latitude-Archives/AIDungeon) and its various forks. This was before AI Dungeon 2 moved to its own website and consequently rebranded to "AI Dungeon".
	- 2020-2021 AI Dungeon users may recognize this model as "Classic", the free tier below Griffin (free, but later used "energy") and Dragon (subscription).
	- If you want a better model trained on the same dataset at the cost of higher hardware requirements, check out [Spring Dragon 13B](https://huggingface.co/TheBloke/Spring-Dragon-GGUF), intended to replicate 2020 AI Dungeon's "Dragon" experience on local hardware.
	- The motivation behind these quantizations was that [Henk717/ai-dungeon2-classic-ggml](https://huggingface.co/Henk717/ai-dungeon2-classic-ggml) was older and lacked other quantization formats. The workflow for this quantization was also different: henk717's mentions being converted to a 16-bit Pytorch checkpoint before being converted to GGML. This one was converted straight from Tensorflow to 16-bit GGML before being quantized.

	Notes:
	- KoboldCpp [[bfc696f]](https://github.com/LostRuins/koboldcpp/tree/bfc696fcc452975dbe8967c39301ba856d04a030) was tested without OpenBLAS.
	- Oobabooga [[895ec9d]](https://github.com/oobabooga/text-generation-webui/tree/895ec9dadb96120e8202a83052bf9032ca3245ae) was tested with with the `--model <model> --loader ctransformers --model_type gpt2` launch arguments.
	- ggerganov/ggml [[8ca2c19]](https://github.com/ggerganov/ggml/tree/8ca2c19a3bb8622954d858fbf6383522684eaf34)'s gpt-2 conversion script was used for conversion and quantization.
	- The original model was found in the `generator/gpt2/models/model_v5` directory of [AI Dungeon 2 Unleashed](https://henk.tech/aid/).