Georgi Gerganov 1da7b76569
server : fix speculative decoding with context shift (#10641)
* server : fix speculative decoding with context shift

ggml-ci

* server : take into account speculative limits

ggml-ci

* server : add tests
2024-12-04 22:38:20 +02:00
..
2024-12-02 21:22:53 +02:00
2024-12-04 01:26:37 +01:00
2023-03-29 20:21:09 +03:00