llama.cpp/ggml
Latest commit: 44e18ef939 — Jeff Bolz
vulkan: fix coopmat2 flash attention for non-contiguous inputs (#11281)
Add code similar to mul_mm_cm2 to force alignment of strides, to avoid
a performance regression.

Add noncontiguous FA tests in test-backend-ops.

Fixes #11268.
2025-01-18 09:26:50 +01:00
include         rpc : early register backend devices (#11262)                             2025-01-17 10:57:09 +02:00
src             vulkan: fix coopmat2 flash attention for non-contiguous inputs (#11281)   2025-01-18 09:26:50 +01:00
.gitignore      vulkan : cmake integration (#8119)                                        2024-07-13 18:12:39 +02:00
CMakeLists.txt  fix: ggml: fix vulkan-shaders-gen build (#10448)                          2025-01-15 14:17:42 +01:00