Adrien Gallouët
|
0c39f44d70
|
ggml-cpu: replace AArch64 NEON assembly with intrinsics in ggml_gemv_q4_0_4x4_q8_0() (#10567)
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
|
2024-11-30 09:13:18 -08:00 |
|
Shupei Fan
|
4b3242bbea
|
ggml-cpu: fix typo in gemv/gemm iq4_nl_4_4 (#10580)
|
2024-11-29 14:49:02 +01:00 |
|
Georgi Gerganov
|
dc22344088
|
ggml : remove redundant copyright notice + update authors
|
2024-11-28 20:46:40 +02:00 |
|
Shupei Fan
|
c202cef168
|
ggml-cpu: support IQ4_NL_4_4 by runtime repack (#10541)
* ggml-cpu: support IQ4_NL_4_4 by runtime repack
* ggml-cpu: add __ARM_FEATURE_DOTPROD guard
|
2024-11-28 13:52:03 +01:00 |
|
Dan Johansson
|
1e58ee1318
|
ggml : optimize Q4_0 into Q4_0_X_Y repack (#10324)
|
2024-11-16 01:53:37 +01:00 |
|
Charles Xu
|
1607a5e5b0
|
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (#9921)
* backend-cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels
---------
Co-authored-by: Diego Devesa <slarengh@gmail.com>
|
2024-11-15 01:28:50 +01:00 |
|
Diego Devesa
|
ae8de6d50a
|
ggml : build backends as libraries (#10256)
* ggml : build backends as libraries
---------
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: R0CKSTAR <xiaodong.ye@mthreads.com>
|
2024-11-14 18:04:35 +01:00 |
|