Spaces:

Mr-Bhaskar
/

test3

Runtime error

Upload 399 files

20efbc0 verified 5 months ago

2.06 kB

	## What Works

	\| Loader \| Loading 1 LoRA \| Loading 2 or more LoRAs \| Training LoRAs \| Multimodal extension \| Perplexity evaluation \|
	\|----------------\|----------------\|-------------------------\|----------------\|----------------------\|-----------------------\|
	\| Transformers \| ✅ \| ✅\\\* \| ✅\* \| ✅ \| ✅ \|
	\| llama.cpp \| ❌ \| ❌ \| ❌ \| ❌ \| use llamacpp_HF \|
	\| llamacpp_HF \| ❌ \| ❌ \| ❌ \| ❌ \| ✅ \|
	\| ExLlamav2_HF \| ✅ \| ✅ \| ❌ \| ❌ \| ✅ \|
	\| ExLlamav2 \| ✅ \| ✅ \| ❌ \| ❌ \| use ExLlamav2_HF \|
	\| AutoGPTQ \| ✅ \| ❌ \| ❌ \| ✅ \| ✅ \|
	\| AutoAWQ \| ? \| ❌ \| ? \| ? \| ✅ \|
	\| GPTQ-for-LLaMa \| ✅\\ \| ✅\\\* \| ✅ \| ✅ \| ✅ \|
	\| QuIP# \| ? \| ? \| ? \| ? \| ✅ \|
	\| HQQ \| ? \| ? \| ? \| ? \| ✅ \|

	❌ = not implemented

	✅ = implemented

	\* Training LoRAs with GPTQ models also works with the Transformers loader. Make sure to check "auto-devices" and "disable_exllama" before loading the model.

	\\ Requires the monkey-patch. The instructions can be found [here](https://github.com/oobabooga/text-generation-webui/wiki/08-%E2%80%90-Additional-Tips#using-loras-with-gptq-for-llama).

	\\\* Multi-LoRA in PEFT is tricky and the current implementation does not work reliably in all cases.

	## What Works

	\| Loader \| Loading 1 LoRA \| Loading 2 or more LoRAs \| Training LoRAs \| Multimodal extension \| Perplexity evaluation \|
	\|----------------\|----------------\|-------------------------\|----------------\|----------------------\|-----------------------\|
	\| Transformers \| ✅ \| ✅\\\* \| ✅\* \| ✅ \| ✅ \|
	\| llama.cpp \| ❌ \| ❌ \| ❌ \| ❌ \| use llamacpp_HF \|
	\| llamacpp_HF \| ❌ \| ❌ \| ❌ \| ❌ \| ✅ \|
	\| ExLlamav2_HF \| ✅ \| ✅ \| ❌ \| ❌ \| ✅ \|
	\| ExLlamav2 \| ✅ \| ✅ \| ❌ \| ❌ \| use ExLlamav2_HF \|
	\| AutoGPTQ \| ✅ \| ❌ \| ❌ \| ✅ \| ✅ \|
	\| AutoAWQ \| ? \| ❌ \| ? \| ? \| ✅ \|
	\| GPTQ-for-LLaMa \| ✅\\ \| ✅\\\* \| ✅ \| ✅ \| ✅ \|
	\| QuIP# \| ? \| ? \| ? \| ? \| ✅ \|
	\| HQQ \| ? \| ? \| ? \| ? \| ✅ \|

	❌ = not implemented

	✅ = implemented

	\* Training LoRAs with GPTQ models also works with the Transformers loader. Make sure to check "auto-devices" and "disable_exllama" before loading the model.

	\\ Requires the monkey-patch. The instructions can be found [here](https://github.com/oobabooga/text-generation-webui/wiki/08-%E2%80%90-Additional-Tips#using-loras-with-gptq-for-llama).

	\\\* Multi-LoRA in PEFT is tricky and the current implementation does not work reliably in all cases.