mirror of https://github.com/ggerganov/llama.cpp.git, synced 2025-01-14 06:19:02 +01:00
2025fa67e9
* kompute: op_unary: reject unsupported parameters
* kompute: softmax: implement ALiBi support
* kompute: rope: implement neox and phi3 support
* kompute: op_mul_mat_q4_k permutted support
* kompute: op_mul_mat_[q4_0|q4_1|q8_0] permutted support
* kompute: op_mul_mat_f16 permutted support
* kompute: op_mul_mat_q6_k permutted support

Signed-off-by: Sergio Lopez <slp@redhat.com>

Name | ||
---|---|---|
ggml-amx | ||
ggml-blas | ||
ggml-cann | ||
ggml-cpu | ||
ggml-cuda | ||
ggml-hip | ||
ggml-kompute | ||
ggml-metal | ||
ggml-musa | ||
ggml-rpc | ||
ggml-sycl | ||
ggml-vulkan | ||
CMakeLists.txt | ||
ggml-aarch64.c | ||
ggml-aarch64.h | ||
ggml-alloc.c | ||
ggml-backend-impl.h | ||
ggml-backend-reg.cpp | ||
ggml-backend.cpp | ||
ggml-common.h | ||
ggml-impl.h | ||
ggml-opt.cpp | ||
ggml-quants.c | ||
ggml-quants.h | ||
ggml-threading.cpp | ||
ggml-threading.h | ||
ggml.c |