mirror of
https://github.com/oobabooga/text-generation-webui.git
synced 2024-11-25 09:19:23 +01:00
25 lines
2.0 KiB
Markdown
25 lines
2.0 KiB
Markdown
## What Works
|
|
|
|
| Loader | Loading 1 LoRA | Loading 2 or more LoRAs | Training LoRAs | Multimodal extension | Perplexity evaluation |
|
|
|----------------|----------------|-------------------------|----------------|----------------------|-----------------------|
|
|
| Transformers | ✅ | ✅\*\*\* | ✅\* | ✅ | ✅ |
|
|
| llama.cpp | ❌ | ❌ | ❌ | ❌ | use llamacpp_HF |
|
|
| llamacpp_HF | ❌ | ❌ | ❌ | ❌ | ✅ |
|
|
| ExLlamav2_HF | ✅ | ✅ | ❌ | ❌ | ✅ |
|
|
| ExLlamav2 | ✅ | ✅ | ❌ | ❌ | use ExLlamav2_HF |
|
|
| AutoGPTQ | ✅ | ❌ | ❌ | ✅ | ✅ |
|
|
| AutoAWQ | ? | ❌ | ? | ? | ✅ |
|
|
| GPTQ-for-LLaMa | ✅\*\* | ✅\*\*\* | ✅ | ✅ | ✅ |
|
|
| QuIP# | ? | ? | ? | ? | ✅ |
|
|
| HQQ | ? | ? | ? | ? | ✅ |
|
|
|
|
❌ = not implemented
|
|
|
|
✅ = implemented
|
|
|
|
\* Training LoRAs with GPTQ models also works with the Transformers loader. Make sure to check "auto-devices" and "disable_exllama" before loading the model.
|
|
|
|
\*\* Requires the monkey-patch. The instructions can be found [here](https://github.com/oobabooga/text-generation-webui/wiki/08-%E2%80%90-Additional-Tips#using-loras-with-gptq-for-llama).
|
|
|
|
\*\*\* Multi-LoRA in PEFT is tricky and the current implementation does not work reliably in all cases.
|