mirror of
https://github.com/ggerganov/llama.cpp.git
synced 2024-12-25 05:48:47 +01:00
8800226d65
* Fix --split-max-size Byte size calculation was done on int and overflowed. * add tests.sh * add examples test scripts to ci run Will autodiscover examples/*/tests.sh scripts and run them. * move WORK_PATH to a subdirectory * clean up before and after test * explicitly define which scripts to run * add --split-max-size to readme |
||
---|---|---|
.. | ||
CMakeLists.txt | ||
gguf-split.cpp | ||
README.md | ||
tests.sh |
GGUF split Example
CLI to split / merge GGUF files.
Command line options:
--split
: split GGUF to multiple GGUF, default operation.--split-max-size
: max size per split inM
orG
, f.ex.500M
or2G
.--split-max-tensors
: maximum tensors in each split: default(128)--merge
: merge multiple GGUF to a single GGUF.