Georgi Gerganov
|
afa8a9ec9b
|
llama : add llama_vocab , functions -> methods, naming (#11110)
* llama : functions -> methods (#11110)
* llama : add struct llama_vocab to the API (#11156)
ggml-ci
* hparams : move vocab params to llama_vocab (#11159)
ggml-ci
* vocab : more pimpl (#11165)
ggml-ci
* vocab : minor tokenization optimizations (#11160)
ggml-ci
Co-authored-by: Diego Devesa <slarengh@gmail.com>
* lora : update API names (#11167)
ggml-ci
* llama : update API names to use correct prefix (#11174)
* llama : update API names to use correct prefix
ggml-ci
* cont
ggml-ci
* cont
ggml-ci
* minor [no ci]
* vocab : llama_vocab_add_[be]os -> llama_vocab_get_add_[be]os (#11174)
ggml-ci
* vocab : llama_vocab_n_vocab -> llama_vocab_n_tokens (#11174)
ggml-ci
---------
Co-authored-by: Diego Devesa <slarengh@gmail.com>
|
2025-01-12 11:32:42 +02:00 |
|
Georgi Gerganov
|
c2a16c0bdb
|
server : fix free of spec context and batch (#10651)
ggml-ci
|
2024-12-07 11:52:44 +02:00 |
|
Georgi Gerganov
|
9fd8c2687f
|
server : add more information about error (#10455)
|
2024-11-25 22:28:59 +02:00 |
|
Georgi Gerganov
|
d9d54e498d
|
speculative : refactor and add a simpler example (#10362)
* speculative : refactor and add a simpler example
ggml-ci
* speculative : clean-up and add comments and TODOs [no ci]
* speculative : manage context in common_speculative
ggml-ci
* speculative : simplify
ggml-ci
* speculative : simplify (cont)
ggml-ci
* speculative : add --draft-min CLI arg
* speculative : minor fixup
* make : build fixes
* speculative : do not redraft previous drafts
ggml-ci
* speculative : fix the draft sampling
ggml-ci
* speculative : fix compile warning
* common : refactor args
ggml-ci
* common : change defaults [no ci]
* common : final touches
ggml-ci
|
2024-11-25 09:58:41 +02:00 |
|