Update GPTQ-models-(4-bit-mode).md

2024-11-25 09:19:23 +01:00 · 2023-05-29 22:35:49 -03:00 · 2023-05-29 22:35:49 -03:00 · e763ace593
commit e763ace593
parent 86ef695d37
1 changed files with 14 additions and 1 deletions
--- a/docs/GPTQ-models-(4-bit-mode).md
+++ b/docs/GPTQ-models-(4-bit-mode).md
@ -39,7 +39,20 @@ Overall, I recommend using the old CUDA branch. It is included by default in the

 ### Installation using precompiled wheels

-https://github.com/jllllll/GPTQ-for-LLaMa-Wheels
+Kindly provided by our friend jllllll: https://github.com/jllllll/GPTQ-for-LLaMa-Wheels
+
+Windows:
+
+```
+pip install https://github.com/jllllll/GPTQ-for-LLaMa-Wheels/raw/main/quant_cuda-0.0.0-cp310-cp310-win_amd64.whl
+```
+
+Linux:
+
+```
+pip install https://github.com/jllllll/GPTQ-for-LLaMa-Wheels/raw/Linux-x64/quant_cuda-0.0.0-cp310-cp310-linux_x86_64.whl
+```
+

 ### Manual installation