llama.cpp/tests at compilade/refactor-kv-cache

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-13 13:52:22 +01:00

compilade 4134999e01

gguf-py : Numpy dequantization for most types (#8939)

* gguf-py : Numpy dequantization for most types

* gguf-py : Numpy dequantization for grid-based i-quants

2024-08-11 14:45:41 -04:00

__init__.py

convert-*.py: GGUF Naming Convention Refactor and Metadata Override Refactor (#7499)

2024-07-18 20:40:15 +10:00

test_metadata.py

gguf-py : fix some metadata name extraction edge cases (#8591)

2024-07-20 21:58:49 -04:00

test_quants.py

gguf-py : Numpy dequantization for most types (#8939)

2024-08-11 14:45:41 -04:00