text-generation-webui/docs/ExLlama.md

# ExLlama

### About

ExLlama is an extremely optimized GPTQ backend for LLaMA models. It features much lower VRAM usage and much higher speeds due to not relying on unoptimized transformers code.

### Usage

Configure text-generation-webui to use exllama via the UI or command line:
   - In the "Model" tab, set "Loader" to "exllama"
   - Specify `--loader exllama` on the command line

### Manual setup

No additional installation steps are necessary since an exllama package is already included in the requirements.txt. If this package fails to install for some reason, you can use the following manual procedure:

1) Clone the ExLlama repository into your `text-generation-webui/repositories` folder:

```
mkdir repositories
cd repositories
git clone https://github.com/turboderp/exllama
```
Add ExLlama support (#2444) 2023-06-17 01:35:38 +02:00			`# ExLlama`

Update ExLlama.md 2023-06-25 01:23:01 +02:00			`### About`
Add ExLlama support (#2444) 2023-06-17 01:35:38 +02:00
Update ExLlama.md 2023-06-25 01:23:01 +02:00			`ExLlama is an extremely optimized GPTQ backend for LLaMA models. It features much lower VRAM usage and much higher speeds due to not relying on unoptimized transformers code.`
Add ExLlama support (#2444) 2023-06-17 01:35:38 +02:00
Update ExLlama.md 2023-06-25 01:23:01 +02:00			`### Usage`

			`Configure text-generation-webui to use exllama via the UI or command line:`
			`- In the "Model" tab, set "Loader" to "exllama"`
			- Specify `--loader exllama` on the command line

			`### Manual setup`

			`No additional installation steps are necessary since an exllama package is already included in the requirements.txt. If this package fails to install for some reason, you can use the following manual procedure:`
Add ExLlama support (#2444) 2023-06-17 01:35:38 +02:00
Update ExLlama.md 2023-06-17 01:40:12 +02:00			1) Clone the ExLlama repository into your `text-generation-webui/repositories` folder:
Add ExLlama support (#2444) 2023-06-17 01:35:38 +02:00
			```
Update ExLlama.md 2023-06-17 01:40:12 +02:00			`mkdir repositories`
Add ExLlama support (#2444) 2023-06-17 01:35:38 +02:00			`cd repositories`
			`git clone https://github.com/turboderp/exllama`
			```