llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-10 12:30:50 +01:00


ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (#9217)

* ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels

* added fallback mechanism when the offline re-quantized model is not
optimized for the underlying target.

* fix for build errors

* remove prints from the low-level code

* Rebase to the latest upstream

2024-09-25 16:12:20 +03:00

cmake

llama : reorganize source code + improve CMake (#8006)

2024-06-26 18:33:02 +03:00

include

examples : adapt to ggml.h changes (ggml/0)

2024-09-24 11:00:52 +03:00

src

ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (#9217)

2024-09-25 16:12:20 +03:00

.gitignore

vulkan : cmake integration (#8119)

2024-07-13 18:12:39 +02:00

CMakeLists.txt

cmake : do not hide GGML options + rename option (#9465)

2024-09-16 10:27:50 +03:00