llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-22 09:39:08 +01:00

History

Francis Couture-Harpin 2ff601fc32 gguf-py : fix and simplify quantized shape round-trip		2024-05-22 23:42:36 -04:00
..
__init__.py	convert : support models with multiple chat templates (#6588 )	2024-04-18 14:49:01 +03:00
gguf-convert-endian.py	convert.py : add python logging instead of print() (#6511 )	2024-05-03 22:36:41 +03:00
gguf-dump.py	convert-hf : save memory with lazy evaluation (#7075 )	2024-05-08 18:16:38 -04:00
gguf-new-metadata.py	gguf-py : fix and simplify quantized shape round-trip	2024-05-22 23:42:36 -04:00
gguf-set-metadata.py	convert.py : add python logging instead of print() (#6511 )	2024-05-03 22:36:41 +03:00