Georgi Gerganov
307e09cd85
Merge branch 'gguf' into gguf-write-single-pass
2023-08-17 21:51:15 +03:00
Georgi Gerganov
e426b3cfc8
gguf.py : fix vertical alignment
2023-08-17 21:50:01 +03:00
Georgi Gerganov
acaa98234a
convert.py : fix HF tensor permuting / unpacking
...
ggml-ci
2023-08-17 21:06:45 +03:00
M. Yusuf Sarıgöz
4dbce7d009
gguf : rm file_type key and method
2023-08-17 20:02:38 +03:00
M. Yusuf Sarıgöz
2f8fc92d86
gguf : fix conflicts
2023-08-17 18:51:14 +03:00
Georgi Gerganov
e0429d38e4
convert-new.py : output gguf ( #2635 )
...
* convert-new.py : output gguf (WIP)
* convert-new.py : add gguf key-value pairs
* llama : add hparams.ctx_train + no longer print ftype
* convert-new.py : minor fixes
* convert-new.py : vocab-only option should work now
* llama : fix tokenizer to use llama_char_to_byte
* tests : add new ggml-vocab-llama.gguf
* convert-new.py : tensor name mapping
* convert-new.py : add map for skipping tensor serialization
* convert-new.py : convert script now works
* gguf.py : pick some of the refactoring from #2644
* convert-new.py : minor fixes
2023-08-17 17:19:52 +03:00
M. Yusuf Sarıgöz
f31e9230ad
gguf : single pass for writing tensors + refactoring writer
2023-08-17 15:19:30 +03:00
Georgi Gerganov
c8ee87f141
gguf.py : merge all files in gguf.py
2023-08-16 19:55:49 +03:00
klosax
b6056c3db8
gguf.py : add tensor data layout
2023-08-15 19:53:44 +02:00
klosax
cedb4870c6
gguf.py : add token types
2023-08-14 22:08:40 +02:00
Georgi Gerganov
62490f1380
gguf : use UNIX line ending
2023-08-14 13:04:35 +03:00
klosax
5c5a95ba2d
gguf.py : dont add empty strings
2023-08-14 11:22:06 +02:00
M. Yusuf Sarıgöz
91d4bfd536
convert : write more metadata for LLaMA
2023-08-13 13:29:46 +03:00
klosax
d2ce9cfe8d
gguf.py : n_layer --> n_block
2023-08-13 00:01:20 +02:00
klosax
5d81a715d4
gguf.py : no need to convert tensors twice
2023-08-12 21:45:45 +02:00
klosax
4357e692ac
gguf.py : use custom alignment if present
2023-08-07 13:51:26 +02:00
klosax
278ada9572
gguf.py : bytesarray for gpt2bpe tokenizer
2023-08-04 04:07:57 +02:00
klosax
49380a23a3
gguf.py : accumulate kv and tensor info data + special tokens
2023-08-01 23:37:48 +02:00
M. Yusuf Sarıgöz
7aa0a0e7f7
gguf : support custom alignment value
2023-07-31 09:59:36 +03:00
klosax
ccd81a751b
gguf.py : add layer norm eps and merges
2023-07-30 14:48:14 +02:00
M. Yusuf Sarıgöz
32e037ffbe
gguf : fix set is not subscriptable
2023-07-30 01:01:13 +03:00
klosax
9577821487
gguf.py : support any type
2023-07-29 21:29:07 +02:00
M. Yusuf Sarıgöz
0f5e57f01d
gguf : handle already encoded string
2023-07-29 19:56:06 +03:00
M. Yusuf Sarıgöz
cc3dd7f042
gguf : write tokenizer data
2023-07-29 13:30:22 +03:00
M. Yusuf Sarıgöz
8a76dd8a85
gguf : write tensors one by one
2023-07-29 13:17:28 +03:00
M. Yusuf Sarıgöz
c861e234f4
gguf : write tensors one by one
2023-07-29 12:49:01 +03:00
M. Yusuf Sarıgöz
0c219fb5b5
gguf : fix writing gguf arrays
2023-07-29 12:42:54 +03:00
M. Yusuf Sarıgöz
93f7f7aef7
gguf : write tensors one by one and code reuse
2023-07-29 12:34:35 +03:00
M. Yusuf Sarıgöz
ea5f9ad2ca
gguf : fix writing gguf arrays
2023-07-29 12:25:43 +03:00
M. Yusuf Sarıgöz
06f423a8e1
gguf : write sample tensors to read
2023-07-29 10:26:26 +03:00
M. Yusuf Sarıgöz
1495735aac
gguf : fix writing tensors
2023-07-29 00:26:22 +03:00
M. Yusuf Sarıgöz
11ef380c2a
GGUF : write tensor ( #2426 )
...
* WIP: Write tensor
* GGUF : Support writing tensors in Python
* refactor : rm unused import and upd todos
* fix : fix errors upd writing example
* rm example.gguf
* gitignore *.gguf
* undo formatting
2023-07-28 11:34:16 +03:00
Georgi Gerganov
158be8f7f4
gguf.py : some code style changes
2023-07-27 15:37:06 +03:00
M. Yusuf Sarıgöz
c85d3178b3
refactor : reduce code duplication and better API ( #2415 )
2023-07-27 10:29:29 +03:00
M. Yusuf Sarıgöz
bae6b125f6
wip : implement GGUF ( #2397 )
...
* Add LLAMA_DEFAULT_RMS_EPS so we can change the default (#2384 )
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
* WIP: python class to write GGUF, incomplete C apı for reading
---------
Co-authored-by: Kawrakow <48489457+ikawrakow@users.noreply.github.com>
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2023-07-26 18:21:13 +03:00