llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-27 12:33:06 +01:00

History

Johannes Gäßler 76e9e58b78 CUDA: fix MMV kernel being used for FP16 src1 (#10357 )		2024-11-17 23:20:42 +01:00
..
include	ggml: new optimization interface (ggml/988)	2024-11-17 08:30:29 +02:00
src	CUDA: fix MMV kernel being used for FP16 src1 (#10357 )	2024-11-17 23:20:42 +01:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	CUDA: remove DMMV, consolidate F16 mult mat vec (#10318 )	2024-11-17 09:09:55 +01:00