.. |
ggml-cann
|
ggml : move rope type enum to ggml.h (#8949)
|
2024-08-13 21:13:15 +02:00 |
ggml-cuda
|
cuda : fix defrag with quantized KV (#9319)
|
2024-09-05 11:13:11 +02:00 |
ggml-sycl
|
[SYCL] Fix DMMV dequantization (#9279)
|
2024-09-04 16:26:33 +01:00 |
kompute@4565194ed7
|
llama : reorganize source code + improve CMake (#8006)
|
2024-06-26 18:33:02 +03:00 |
kompute-shaders
|
ggml : move rope type enum to ggml.h (#8949)
|
2024-08-13 21:13:15 +02:00 |
llamafile
|
sgemm : improved Q4_0 and Q8_0 performance via 4xN and Mx4 gemm (#8908)
|
2024-08-31 11:20:35 +03:00 |
vulkan-shaders
|
Improve Vulkan shader build system (#9239)
|
2024-09-06 08:56:17 +02:00 |
CMakeLists.txt
|
Improve Vulkan shader build system (#9239)
|
2024-09-06 08:56:17 +02:00 |
ggml-aarch64.c
|
ggml : AVX2 support for Q4_0_8_8 (#8713)
|
2024-09-04 19:51:22 +03:00 |
ggml-aarch64.h
|
ggml : minor naming changes (#8433)
|
2024-07-12 10:46:02 +03:00 |
ggml-alloc.c
|
ggml : reduce hash table reset cost (#8698)
|
2024-07-27 04:41:55 +02:00 |
ggml-backend-impl.h
|
llama : reorganize source code + improve CMake (#8006)
|
2024-06-26 18:33:02 +03:00 |
ggml-backend.c
|
cuda : fix defrag with quantized KV (#9319)
|
2024-09-05 11:13:11 +02:00 |
ggml-blas.cpp
|
ggml : reduce hash table reset cost (#8698)
|
2024-07-27 04:41:55 +02:00 |
ggml-cann.cpp
|
[CANN]: Fix ggml_backend_cann_buffer_get_tensor (#8871)
|
2024-08-06 12:42:42 +08:00 |
ggml-common.h
|
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)
|
2024-09-05 21:48:47 -04:00 |
ggml-cuda.cu
|
cuda : fix defrag with quantized KV (#9319)
|
2024-09-05 11:13:11 +02:00 |
ggml-impl.h
|
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)
|
2024-09-05 21:48:47 -04:00 |
ggml-kompute.cpp
|
ggml : reduce hash table reset cost (#8698)
|
2024-07-27 04:41:55 +02:00 |
ggml-metal.m
|
sync : ggml
|
2024-08-27 22:41:27 +03:00 |
ggml-metal.metal
|
sync : ggml
|
2024-08-27 22:41:27 +03:00 |
ggml-quants.c
|
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)
|
2024-09-05 21:48:47 -04:00 |
ggml-quants.h
|
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)
|
2024-09-05 21:48:47 -04:00 |
ggml-rpc.cpp
|
rpc : print error message when failed to connect endpoint (#9042)
|
2024-08-19 10:11:45 +03:00 |
ggml-sycl.cpp
|
[SYCL] Add oneDNN primitive support (#9091)
|
2024-08-22 12:50:10 +08:00 |
ggml-vulkan.cpp
|
ggml : fix build break for the vulkan-debug (#9265)
|
2024-09-06 15:54:50 +03:00 |
ggml.c
|
ggml-quants : ternary packing for TriLMs and BitNet b1.58 (#8151)
|
2024-09-05 21:48:47 -04:00 |