Georgi Gerganov bc21975084
speculative : fix handling of some input params (#9963)
* speculative : fix batch sizes at initialization

ggml-ci

* speculative : handle params.n_predict == -1

* speculative : limit batch size to llama_n_batch
2024-10-21 09:37:12 +03:00
..
2024-08-30 01:20:53 +02:00
2023-03-29 20:21:09 +03:00