llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-02-04 15:43:53 +01:00

History

Srihari-mcw 1e7b9299c6 ggml : AVX512 gemm for Q4_0_8_8 (#9532 ) * AVX512 version of ggml_gemm_q4_0_8x8_q8_0 * Remove zero vector parameter passing * Rename functions and rearrange order of macros * Edit commments * style : minor adjustments * Update x to start from 0 --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>		2024-09-23 17:06:38 +03:00
..
cmake	llama : reorganize source code + improve CMake (#8006 )	2024-06-26 18:33:02 +03:00
include	ggml/examples: add backend support for numerical optimization (ggml/949)	2024-09-20 21:15:05 +03:00
src	ggml : AVX512 gemm for Q4_0_8_8 (#9532 )	2024-09-23 17:06:38 +03:00
.gitignore	vulkan : cmake integration (#8119 )	2024-07-13 18:12:39 +02:00
CMakeLists.txt	cmake : do not hide GGML options + rename option (#9465 )	2024-09-16 10:27:50 +03:00