Georgi Gerganov
|
6df37bc28b
|
llama : update API names to use correct prefix (#11174)
* llama : update API names to use correct prefix
ggml-ci
* cont
ggml-ci
* cont
ggml-ci
* minor [no ci]
|
2025-01-11 16:41:56 +02:00 |
|
Georgi Gerganov
|
c725f691ea
|
llama : add struct llama_vocab to the API (#11156)
ggml-ci
|
2025-01-10 11:24:41 +02:00 |
|
Georgi Gerganov
|
c2a16c0bdb
|
server : fix free of spec context and batch (#10651)
ggml-ci
|
2024-12-07 11:52:44 +02:00 |
|
Georgi Gerganov
|
9fd8c2687f
|
server : add more information about error (#10455)
|
2024-11-25 22:28:59 +02:00 |
|
Georgi Gerganov
|
d9d54e498d
|
speculative : refactor and add a simpler example (#10362)
* speculative : refactor and add a simpler example
ggml-ci
* speculative : clean-up and add comments and TODOs [no ci]
* speculative : manage context in common_speculative
ggml-ci
* speculative : simplify
ggml-ci
* speculative : simplify (cont)
ggml-ci
* speculative : add --draft-min CLI arg
* speculative : minor fixup
* make : build fixes
* speculative : do not redraft previous drafts
ggml-ci
* speculative : fix the draft sampling
ggml-ci
* speculative : fix compile warning
* common : refactor args
ggml-ci
* common : change defaults [no ci]
* common : final touches
ggml-ci
|
2024-11-25 09:58:41 +02:00 |
|