ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (#9217)

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-27 04:23:06 +01:00

* ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels

* added fallback mechanism when the offline re-quantized model is not
optimized for the underlying target.

* fix for build errors

* remove prints from the low-level code

* Rebase to the latest upstream

This commit is contained in:

Charles Xu

2024-09-25 15:12:20 +02:00

committed by

GitHub

parent afbbfaa537

commit 1e43630218

No known key found for this signature in database

GPG Key ID: B5690EEEBB952194

1 changed files with 1591 additions and 1635 deletions

3226

ggml/src/ggml-aarch64.c

View File

File diff suppressed because it is too large Load Diff

ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (#9217)

3226 ggml/src/ggml-aarch64.c View File

3226

ggml/src/ggml-aarch64.c

View File