llama.cpp/ggml
Jeff Bolz bd38ddea01
vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl (#11166)
* vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl

Shaders are based on cpy.cu.

* vulkan: support copy from q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl to f32

* ggml: copy q->f32 assumes some contiguity in the destination
2025-01-16 22:47:10 +01:00
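The PR adds Vulkan shaders (ported from the CUDA cpy.cu kernels) that perform these quantizing and dequantizing copies on the GPU. The C sketch below illustrates the equivalent per-block work for the simplest of the listed formats, q8_0, where 32 weights share one scale. It is a hedged illustration only: ggml stores the scale as fp16 and implements this in its own quantize/dequantize routines and in the new GLSL shaders, none of which is reproduced here, so the struct and helper names below are hypothetical.

```c
// Minimal, self-contained sketch of what an f32 -> q8_0 copy has to do per
// block, and the reverse q8_0 -> f32 direction added in the same PR.
// Constants mirror ggml's q8_0 layout (32 quants per block); the f32 scale
// and the function names are simplifications for illustration.
#include <math.h>
#include <stdint.h>
#include <stdio.h>

#define QK8_0 32

typedef struct {
    float  d;           /* per-block scale (ggml stores this as fp16)   */
    int8_t qs[QK8_0];   /* 32 signed 8-bit quants, value = qs[i] * d    */
} block_q8_0_f32;       /* hypothetical f32-scale variant for clarity   */

/* f32 -> q8_0: find the absolute max, derive the scale, then round. */
static void quantize_block_q8_0(const float *x, block_q8_0_f32 *b) {
    float amax = 0.0f;
    for (int i = 0; i < QK8_0; ++i) {
        float ax = fabsf(x[i]);
        if (ax > amax) amax = ax;
    }
    const float d  = amax / 127.0f;
    const float id = d != 0.0f ? 1.0f / d : 0.0f;
    b->d = d;
    for (int i = 0; i < QK8_0; ++i) {
        b->qs[i] = (int8_t) roundf(x[i] * id);
    }
}

/* q8_0 -> f32: scale each quant back up. */
static void dequantize_block_q8_0(const block_q8_0_f32 *b, float *x) {
    for (int i = 0; i < QK8_0; ++i) {
        x[i] = b->qs[i] * b->d;
    }
}

int main(void) {
    float src[QK8_0], dst[QK8_0];
    for (int i = 0; i < QK8_0; ++i) src[i] = sinf((float) i);

    block_q8_0_f32 blk;
    quantize_block_q8_0(src, &blk);
    dequantize_block_q8_0(&blk, dst);

    printf("scale=%f  src[3]=%f  roundtrip[3]=%f\n", blk.d, src[3], dst[3]);
    return 0;
}
```

The contiguity note in the commit message presumably follows from this block structure: a quantized tensor is only addressable in whole blocks, so the q->f32 path writes (and the f32->q path reads) groups of 32 elements at a time and expects them to be laid out contiguously along the copied row.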
include         CUDA: backwards pass for misc. ops, add tests (#11257)                     2025-01-16 16:43:38 +01:00
src             vulkan: support copy from f32 to q4_0/q4_1/q5_0/q5_1/q8_0/iq4_nl (#11166)  2025-01-16 22:47:10 +01:00
.gitignore      vulkan : cmake integration (#8119)                                         2024-07-13 18:12:39 +02:00
CMakeLists.txt  fix: ggml: fix vulkan-shaders-gen build (#10448)                           2025-01-15 14:17:42 +01:00