llama.cpp/ggml/src
Changyeon Kim 409dc4f8bb
ggml : fix build break for the vulkan-debug (#9265)
- windows build : Ok.
- linux build : Ok.

Signed-off-by: Changyeon Kim <cyzero.kim@samsung.com>
2024-09-06 15:54:50 +03:00
..
ggml-cann ggml : move rope type enum to ggml.h (#8949) 2024-08-13 21:13:15 +02:00
ggml-cuda cuda : fix defrag with quantized KV (#9319) 2024-09-05 11:13:11 +02:00
ggml-sycl [SYCL] Fix DMMV dequantization (#9279) 2024-09-04 16:26:33 +01:00
kompute@4565194ed7 llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
kompute-shaders ggml : move rope type enum to ggml.h (#8949) 2024-08-13 21:13:15 +02:00
llamafile sgemm : improved Q4_0 and Q8_0 performance via 4xN and Mx4 gemm (#8908) 2024-08-31 11:20:35 +03:00
vulkan-shaders Improve Vulkan shader build system (#9239) 2024-09-06 08:56:17 +02:00
CMakeLists.txt Improve Vulkan shader build system (#9239) 2024-09-06 08:56:17 +02:00
ggml-aarch64.c ggml : AVX2 support for Q4_0_8_8 (#8713) 2024-09-04 19:51:22 +03:00
ggml-aarch64.h ggml : minor naming changes (#8433) 2024-07-12 10:46:02 +03:00
ggml-alloc.c ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
ggml-backend-impl.h llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
ggml-backend.c cuda : fix defrag with quantized KV (#9319) 2024-09-05 11:13:11 +02:00
ggml-blas.cpp ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
ggml-cann.cpp [CANN]: Fix ggml_backend_cann_buffer_get_tensor (#8871) 2024-08-06 12:42:42 +08:00
ggml-common.h ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151) 2024-09-05 21:48:47 -04:00
ggml-cuda.cu cuda : fix defrag with quantized KV (#9319) 2024-09-05 11:13:11 +02:00
ggml-impl.h ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151) 2024-09-05 21:48:47 -04:00
ggml-kompute.cpp ggml : reduce hash table reset cost (#8698) 2024-07-27 04:41:55 +02:00
ggml-metal.m sync : ggml 2024-08-27 22:41:27 +03:00
ggml-metal.metal sync : ggml 2024-08-27 22:41:27 +03:00
ggml-quants.c ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151) 2024-09-05 21:48:47 -04:00
ggml-quants.h ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151) 2024-09-05 21:48:47 -04:00
ggml-rpc.cpp rpc : print error message when failed to connect endpoint (#9042) 2024-08-19 10:11:45 +03:00
ggml-sycl.cpp [SYCL] Add oneDNN primitive support (#9091) 2024-08-22 12:50:10 +08:00
ggml-vulkan.cpp ggml : fix build break for the vulkan-debug (#9265) 2024-09-06 15:54:50 +03:00
ggml.c ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151) 2024-09-05 21:48:47 -04:00