## What Works

| Loader         | Loading 1 LoRA | Loading 2 or more LoRAs | Training LoRAs | Multimodal extension | Perplexity evaluation |
|----------------|----------------|-------------------------|----------------|----------------------|-----------------------|
| Transformers   | ✅             | ✅ \*\*\*               | ✅ \*          | ✅                   | ✅                    |
| ExLlamav2_HF   | ✅             | ✅                      | ❌             | ❌                   | ✅                    |
| ExLlamav2      | ✅             | ✅                      | ❌             | ❌                   | use ExLlamav2_HF      |
| AutoGPTQ       | ✅             | ❌                      | ❌             | ✅                   | ✅                    |
| GPTQ-for-LLaMa | ✅ \*\*        | ✅ \*\*\*               | ✅             | ✅                   | ✅                    |
| llama.cpp      | ❌             | ❌                      | ❌             | ❌                   | use llamacpp_HF       |
| llamacpp_HF    | ❌             | ❌                      | ❌             | ❌                   | ✅                    |
| ctransformers  | ❌             | ❌                      | ❌             | ❌                   | ❌                    |
| AutoAWQ        | ?              | ❌                      | ?              | ?                    | ✅                    |

❌ = not implemented

✅ = implemented

\* Training LoRAs with GPTQ models also works with the Transformers loader. Make sure to check "auto-devices" and "disable_exllama" before loading the model.

\*\* Requires the monkey-patch. The instructions can be found [here](https://github.com/oobabooga/text-generation-webui/wiki/08-%E2%80%90-Additional-Tips#using-loras-with-gptq-for-llama).

\*\*\* Multi-LoRA in PEFT is tricky and the current implementation does not work reliably in all cases.