1
0
mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-26 20:22:25 +01:00
Commit Graph

1 Commits

Author SHA1 Message Date
Georgi Gerganov
47068e5170
speculative : PoC for speeding-up inference via speculative sampling ()
* speculative : initial example

* speculative : print encoding speed

* speculative : add --draft CLI arg
2023-09-03 15:12:08 +03:00