llama.cpp/ggml
2025-02-03 14:25:56 +02:00
Name            Last commit                                                 Date
cmake           cmake: add ggml find package (#11369)                       2025-01-26 12:07:48 -04:00
include         CUDA: use mma PTX instructions for FlashAttention (#11583)  2025-02-02 19:31:09 +01:00
src             CUDA: fix Volta FlashAttention logic (#11615)               2025-02-03 14:25:56 +02:00
.gitignore      vulkan : cmake integration (#8119)                          2024-07-13 18:12:39 +02:00
CMakeLists.txt  cmake: add ggml find package (#11369)                       2025-01-26 12:07:48 -04:00