text-generation-webui/What Works.md at 232c07bf1fef2e234edd13e74a8bee51fcde57c9

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2024-10-30 22:20:14 +01:00

oobabooga 0e54a09bcb

Remove exllamav1 loaders (#5128)

2023-12-31 01:57:06 -03:00

What Works

| Loader | Loading 1 LoRA | Loading 2 or more LoRAs | Training LoRAs | Multimodal extension | Perplexity evaluation |
|---|---|---|---|---|---|
| Transformers | ✅ | ✅*** | ✅* | ✅ | ✅ |
| ExLlamav2_HF | ✅ | ✅ | ❌ | ❌ | ✅ |
| ExLlamav2 | ✅ | ✅ | ❌ | ❌ | use ExLlamav2_HF |
| AutoGPTQ | ✅ | ❌ | ❌ | ✅ | ✅ |
| GPTQ-for-LLaMa | ✅** | ✅*** | ✅ | ✅ | ✅ |
| llama.cpp | ❌ | ❌ | ❌ | ❌ | use llamacpp_HF |
| llamacpp_HF | ❌ | ❌ | ❌ | ❌ | ✅ |
| ctransformers | ❌ | ❌ | ❌ | ❌ | ❌ |
| AutoAWQ | ? | ❌ | ? | ? | ✅ |

❌ = not implemented

✅ = implemented

* Training LoRAs with GPTQ models also works with the Transformers loader. Make sure to check "auto-devices" and "disable_exllama" before loading the model.

** Requires the monkey-patch. The instructions can be found here.

*** Multi-LoRA in PEFT is tricky and the current implementation does not work reliably in all cases.
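
For scripting against this table, the matrix can be transcribed into a small lookup. The following is only a sketch: the names `FEATURES`, `SUPPORT`, `PERPLEXITY_REDIRECT`, `loaders_supporting`, and `perplexity_loader` are hypothetical and are not part of text-generation-webui. It encodes "?" as `None`, treats the "use ..." cells as unsupported with a redirect, and ignores the footnote caveats (`*`, `**`, `***`).

```python
# Hypothetical transcription of the "What Works" table; not a project API.
# True = implemented, False = not implemented, None = unknown ("?").
FEATURES = ("lora_single", "lora_multi", "lora_training", "multimodal", "perplexity")

SUPPORT = {
    "Transformers":   (True, True, True, True, True),
    "ExLlamav2_HF":   (True, True, False, False, True),
    "ExLlamav2":      (True, True, False, False, False),
    "AutoGPTQ":       (True, False, False, True, True),
    "GPTQ-for-LLaMa": (True, True, True, True, True),
    "llama.cpp":      (False, False, False, False, False),
    "llamacpp_HF":    (False, False, False, False, True),
    "ctransformers":  (False, False, False, False, False),
    "AutoAWQ":        (None, False, None, None, True),
}

# Cells that say "use <other loader>" for perplexity evaluation.
PERPLEXITY_REDIRECT = {
    "ExLlamav2": "ExLlamav2_HF",
    "llama.cpp": "llamacpp_HF",
}


def loaders_supporting(feature):
    """Return the loaders marked as implemented for the given feature."""
    column = FEATURES.index(feature)
    return [name for name, row in SUPPORT.items() if row[column] is True]


def perplexity_loader(loader):
    """Return the loader to use for perplexity evaluation, or None if there is none."""
    if SUPPORT[loader][FEATURES.index("perplexity")]:
        return loader
    return PERPLEXITY_REDIRECT.get(loader)


if __name__ == "__main__":
    print(loaders_supporting("lora_training"))
    print(perplexity_loader("llama.cpp"))
```

Unknown cells are deliberately excluded from `loaders_supporting`, so AutoAWQ only shows up for features the table marks with ✅.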
