llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-10 20:40:24 +01:00

History

Georgi Gerganov d1031cf49c

sampling : refactor init to use llama_sampling_params (#3696 )

* sampling : refactor init to use llama_sampling_params

* llama : combine repetition, frequency and presence penalties in 1 call

* examples : remove embd-input and gptneox-wip

* sampling : rename penalty params + reduce size of "prev" vector

* sampling : add llama_sampling_print helper

* sampling : hide prev behind API and apply #3661

ggml-ci

2023-10-20 21:07:23 +03:00

CMakeLists.txt

common : fix mirostat state when using multiple sequences (#3543 )

2023-10-11 22:35:46 +03:00

common.cpp

sampling : refactor init to use llama_sampling_params (#3696 )

2023-10-20 21:07:23 +03:00

common.h

sampling : refactor init to use llama_sampling_params (#3696 )

2023-10-20 21:07:23 +03:00

console.cpp

check C++ code with -Wmissing-declarations (#3184 )