1
0
mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-23 09:59:18 +01:00
llama.cpp/examples/speculative
Georgi Gerganov ee1a0ec9cb
llama : add option for greedy sampling with probs ()
* llama : add option for greedy sampling with probs

* llama : add comment about llama_sample_token_greedy() missing probs

* sampling : temp == 0.0 -> no probs, temp < 0.0 -> probs
2023-10-28 14:23:11 +03:00
..
CMakeLists.txt speculative : PoC for speeding-up inference via speculative sampling () 2023-09-03 15:12:08 +03:00
speculative.cpp llama : add option for greedy sampling with probs () 2023-10-28 14:23:11 +03:00