|
--- |
|
license: other |
|
language: |
|
- en |
|
pipeline_tag: text-generation |
|
inference: false |
|
tags: |
|
- transformers |
|
- gguf |
|
- imatrix |
|
- GRMR-2B-Instruct |
|
--- |
|
Quantizations of https://huggingface.co/qingy2024/GRMR-2B-Instruct |
|
|
|
### Inference Clients/UIs |
|
* [llama.cpp](https://github.com/ggerganov/llama.cpp) |
|
* [KoboldCPP](https://github.com/LostRuins/koboldcpp) |
|
* [ollama](https://github.com/ollama/ollama) |
|
* [jan](https://github.com/janhq/jan) |
|
* [text-generation-webui](https://github.com/oobabooga/text-generation-webui) |
|
* [GPT4All](https://github.com/nomic-ai/gpt4all) |
|
--- |
|
|
|
**My note:** Use with llama.cpp like this: |
|
```bash |
|
llama-cli -m GRMR-2B-Instruct_quant.gguf -ngl 99 --conversation --temp 0.0 --reverse-prompt "Below is the original text. Please rewrite it to correct any grammatical errors if any, improve clarity, and enhance overall readability." --in-prefix "### Original Text:" --in-suffix "### Corrected Text:" --prompt " " --repeat-penalty 1.0 |
|
``` |
|
|
|
--- |
|
|
|
# From original readme |
|
|
|
This fine-tune of Gemma 2 2B is trained to take any input text and repeat it (with fixed grammar). |
|
|
|
Example: |
|
|
|
|
|
**User**: Find a clip from a professional production of any musical within the past 50 years. The Tony awards have a lot of great options of performances of Tony nominated performances in the archives on their websites. |
|
|
|
**GRMR-2B-Instruct**: Find a clip from a professional production of any musical within the past 50 years. The Tony Awards have a lot of great options of performances of Tony-nominated performances in their archives on their websites. |
|
|
|
Note: This model uses a custom chat template: |
|
|
|
``` |
|
Below is the original text. Please rewrite it to correct any grammatical errors if any, improve clarity, and enhance overall readability. |
|
|
|
### Original Text: |
|
{PROMPT HERE} |
|
|
|
### Corrected Text: |
|
{MODEL'S OUTPUT HERE} |
|
``` |
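If you drive the model from code rather than `llama-cli`, you have to apply this template yourself. A minimal sketch that only builds the prompt string (the helper name `build_grmr_prompt` is my own; the template text is copied verbatim from above — pass the result to whatever GGUF runtime you use):

```python
# Instruction header from GRMR-2B-Instruct's custom chat template.
TEMPLATE_HEADER = (
    "Below is the original text. Please rewrite it to correct any "
    "grammatical errors if any, improve clarity, and enhance overall readability."
)

def build_grmr_prompt(text: str) -> str:
    """Wrap user text in the model's Original/Corrected section markers."""
    return (
        f"{TEMPLATE_HEADER}\n\n"
        f"### Original Text:\n{text}\n\n"
        f"### Corrected Text:\n"
    )

print(build_grmr_prompt("Their going to the store."))
```

The model then completes the text after `### Corrected Text:`.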
|
|
|
I recommend a temperature of 0.0 and a repeat penalty of 1.0 for this model to get optimal results. |
|
|
|
|
|
*Disclaimer: I ran this text through the model itself to correct the grammar.* |