llama.cpp/ggml
2024-11-14 13:00:15 +01:00
..
cmake llama : reorganize source code + improve CMake (#8006) 2024-06-26 18:33:02 +03:00
include metal : optimize FA kernels (#10171) 2024-11-08 13:47:22 +02:00
src CUDA: no -sm row for very small matrices (#10185) 2024-11-14 13:00:15 +01:00
.gitignore vulkan : cmake integration (#8119) 2024-07-13 18:12:39 +02:00
CMakeLists.txt metal : opt-in compile flag for BF16 (#10218) 2024-11-08 21:59:46 +02:00