|
# GGML 4-bit/5-bit quantized IDEA-CCNL/Ziya-LLaMA-13B-v1
|
* You need the latest version of llama.cpp or llama-cpp-python (to support GGML format v3).
|
* llama.cpp currently cannot tokenize the `<human>` and `<bot>` special tokens, so I changed them to the 🧑 and 🤖 emojis.
|
* Prompt like this:
|
```python
inputs = '🧑:' + query.strip() + '\n🤖:'
```
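The prompt construction above can be wrapped in a small helper so every query is formatted consistently; the function name `build_prompt` is illustrative, not part of the original repository:

```python
def build_prompt(query: str) -> str:
    # Use the emoji stand-ins that replace the original <human>/<bot>
    # special tokens in this GGML conversion.
    return '🧑:' + query.strip() + '\n🤖:'
```

The model's reply then follows the trailing `🤖:` marker in the generated text.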
|
* If you want to quantize Ziya to GGML yourself, override its 'add_tokens.json' file with the one provided in this repository.
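The override step above can be sketched as a one-liner; `override_added_tokens` and the paths are hypothetical names for illustration, assuming the replacement file from this repository sits next to your local model checkout:

```python
import shutil
from pathlib import Path

def override_added_tokens(model_dir: str, replacement_file: str) -> None:
    # Copy the fixed token map over the model's original 'add_tokens.json'
    # so the GGML conversion picks up the emoji replacements.
    shutil.copy(replacement_file, Path(model_dir) / 'add_tokens.json')
```

Run this before invoking the llama.cpp conversion script on the model directory.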
|
---
license: gpl-3.0
---
|
|