Mirror of https://github.com/ggerganov/llama.cpp.git, synced 2025-02-06 08:30:33 +01:00
66ee4f297c
* vulkan: initial support for IQ3_S
* vulkan: initial support for IQ3_XXS
* vulkan: initial support for IQ2_XXS
* vulkan: initial support for IQ2_XS
* vulkan: optimize Q3_K by removing branches
* vulkan: implement dequantize variants for coopmat2
* vulkan: initial support for IQ2_S
* vulkan: vertically realign code
* port failing dequant callbacks from mul_mm
* Fix array length mismatches
* vulkan: avoid using workgroup size before it is referenced
* tests: increase timeout for Vulkan llvmpipe backend

Co-authored-by: Jeff Bolz <jbolz@nvidia.com>

| Name |
|---|
| bench.yml.disabled |
| build.yml |
| close-issue.yml |
| docker.yml |
| editorconfig.yml |
| gguf-publish.yml |
| labeler.yml |
| python-check-requirements.yml |
| python-lint.yml |
| python-type-check.yml |
| server.yml |