klosax | 30c4ea47e6 | add gptneox gguf example | 2023-07-30 16:59:26 +02:00
klosax | 2fabc176ce | Update convert-llama-h5-to-gguf.py | 2023-07-30 16:28:08 +02:00
klosax | f175b05872 | Makefile : add gptneox gguf example | 2023-07-30 15:08:37 +02:00
klosax | e9192b0135 | add gptneox gguf example | 2023-07-30 15:05:37 +02:00
klosax | 4ed98bf1ab | Update convert-llama-h5-to-gguf.py | 2023-07-30 15:01:47 +02:00
klosax | b19c11750b | ggml.c : add gguf_get_arr_n | 2023-07-30 14:58:50 +02:00
klosax | b4676ee447 | ggml.h : increase GGML_MAX_NAME to 64 | 2023-07-30 14:51:37 +02:00
klosax | ccd81a751b | gguf.py : add layer norm eps and merges | 2023-07-30 14:48:14 +02:00
klosax | 0790c121aa | constants.py : add layer norm eps | 2023-07-30 14:46:36 +02:00
M. Yusuf Sarıgöz | 87c34e4dd4 | gguf : update convert-llama-h5-to-gguf.py | 2023-07-30 01:09:22 +03:00
M. Yusuf Sarıgöz | 32e037ffbe | gguf : fix set is not subscriptable | 2023-07-30 01:01:13 +03:00
klosax | 06c3e4a1a7 | Update convert-llama-h5-to-gguf.py | 2023-07-29 21:38:01 +02:00
klosax | 9577821487 | gguf.py : support any type | 2023-07-29 21:29:07 +02:00
klosax | 2c22e3bcdb | ggml.c : get arr str and f32 | 2023-07-29 20:37:47 +02:00
klosax | 34469b9ea7 | ggml.h : get array str and f32 | 2023-07-29 20:36:06 +02:00
M. Yusuf Sarıgöz | 0f5e57f01d | gguf : handle already encoded string | 2023-07-29 19:56:06 +03:00
klosax | 8ad7cd49fb | Update convert-llama-h5-to-gguf.py | 2023-07-29 16:47:00 +02:00
M. Yusuf Sarıgöz | 0317c41d98 | gguf : upd gguf conversion script | 2023-07-29 13:31:07 +03:00
M. Yusuf Sarıgöz | cc3dd7f042 | gguf : write tokenizer data | 2023-07-29 13:30:22 +03:00
M. Yusuf Sarıgöz | 8a76dd8a85 | gguf : write tensors one by one | 2023-07-29 13:17:28 +03:00
M. Yusuf Sarıgöz | c861e234f4 | gguf : write tensors one by one | 2023-07-29 12:49:01 +03:00
M. Yusuf Sarıgöz | 0c219fb5b5 | gguf : fix writing gguf arrays | 2023-07-29 12:42:54 +03:00
M. Yusuf Sarıgöz | 93f7f7aef7 | gguf : write tensors one by one and code reuse | 2023-07-29 12:34:35 +03:00
M. Yusuf Sarıgöz | aa99562d70 | Merge branch 'gguf' of https://github.com//ggerganov/llama.cpp into gguf | 2023-07-29 12:26:11 +03:00
M. Yusuf Sarıgöz | ea5f9ad2ca | gguf : fix writing gguf arrays | 2023-07-29 12:25:43 +03:00
klosax | 999431c4b6 | quick and dirty conversion example | 2023-07-29 11:20:05 +02:00
M. Yusuf Sarıgöz | d54f53ca51 | gguf : add tokenization constants | 2023-07-29 12:04:45 +03:00
M. Yusuf Sarıgöz | 06f423a8e1 | gguf : write sample tensors to read | 2023-07-29 10:26:26 +03:00
M. Yusuf Sarıgöz | 08dc8fd884 | gguf : do not hardcode tensor names to read | 2023-07-29 10:24:46 +03:00
M. Yusuf Sarıgöz | 9475cdb7a3 | Merge branch 'gguf-write-tokenization' into gguf | 2023-07-29 00:36:35 +03:00
M. Yusuf Sarıgöz | 1495735aac | gguf : fix writing tensors | 2023-07-29 00:26:22 +03:00
klosax | 3492f848d7 | gguf : add gguf_find_key (#2438) | 2023-07-28 23:45:24 +03:00
    * gguf.cpp : find key example
    * ggml.h : add gguf_find_key
    * ggml.c : add gguf_find_key
M. Yusuf Sarıgöz | 11ef380c2a | GGUF : write tensor (#2426) | 2023-07-28 11:34:16 +03:00
    * WIP: Write tensor
    * GGUF : Support writing tensors in Python
    * refactor : rm unused import and upd todos
    * fix : fix errors upd writing example
    * rm example.gguf
    * gitignore *.gguf
    * undo formatting
Georgi Gerganov | d2bb3ac10b | convert.py : remove GGML vocab + other obsolete stuff | 2023-07-27 16:36:35 +03:00
Georgi Gerganov | 68f53485e4 | convert.py : start a new simplified implementation by removing old stuff | 2023-07-27 15:56:53 +03:00
Georgi Gerganov | 158be8f7f4 | gguf.py : some code style changes | 2023-07-27 15:37:06 +03:00
Georgi Gerganov | d2b6ca13ad | gguf : add array support | 2023-07-27 14:53:07 +03:00
Georgi Gerganov | d89533dff6 | gguf : expose the gguf_type enum through the API for now | 2023-07-27 11:10:34 +03:00
M. Yusuf Sarıgöz | c85d3178b3 | refactor : reduce code duplication and better API (#2415) | 2023-07-27 10:29:29 +03:00
Georgi Gerganov | d8491fc7e3 | gguf : add comments | 2023-07-26 23:00:24 +03:00
Georgi Gerganov | 5628ec7163 | gguf : read / write sample models | 2023-07-26 22:40:45 +03:00
Georgi Gerganov | e46870f5af | gguf : gguf.c is now part of ggml.c | 2023-07-26 18:55:32 +03:00
Georgi Gerganov | d313c0fa33 | gguf : simplify gguf_get_val | 2023-07-26 18:53:57 +03:00
Georgi Gerganov | cb871fa022 | gguf : do not support passing existing ggml_context to gguf_init | 2023-07-26 18:48:52 +03:00
Georgi Gerganov | 860c9c63ce | gguf : add gguf_get_tensor_name() | 2023-07-26 18:21:14 +03:00
Georgi Gerganov | 78b226a959 | gguf : initial model loading - not tested | 2023-07-26 18:21:14 +03:00
Georgi Gerganov | d91b985d2d | gguf : read tensor info | 2023-07-26 18:21:13 +03:00
Georgi Gerganov | 8d6acfec12 | gguf : read header + meta data | 2023-07-26 18:21:13 +03:00
Georgi Gerganov | 6873148771 | gguf : first API pass | 2023-07-26 18:21:13 +03:00
Georgi Gerganov | 7e82d25f40 | ci : disable CI temporary to not waste energy | 2023-07-26 18:21:13 +03:00