mirror of
https://github.com/ggerganov/llama.cpp.git
synced 2024-12-25 13:58:46 +01:00
# llama.cpp/examples/speculative

Demonstration of speculative decoding and tree-based speculative decoding techniques.

More info:

- https://github.com/ggerganov/llama.cpp/pull/2926
- https://github.com/ggerganov/llama.cpp/pull/3624
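
For readers new to the idea, here is a minimal sketch of greedy speculative decoding. It is not the code of this example and does not use the llama.cpp API: the two "models" are hypothetical toy functions (`toy_target`, `toy_draft`) that map a context to its next token, and `speculative_generate` is a name made up for illustration. A small draft model proposes a few tokens, and the target model keeps them up to the first disagreement. The tree-based variant is not covered.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

using Token  = int;
using Tokens = std::vector<Token>;

// Hypothetical stand-in for a model: returns the greedy next token for a context.
using NextTokenFn = std::function<Token(const Tokens &)>;

// Toy target model (assumption for illustration): counts upwards modulo 10.
Token toy_target(const Tokens & ctx) {
    return ctx.empty() ? 0 : (ctx.back() + 1) % 10;
}

// Toy draft model: agrees with the target except that it predicts 0 instead of 5.
Token toy_draft(const Tokens & ctx) {
    const Token t = toy_target(ctx);
    return t == 5 ? 0 : t;
}

// Appends n_new tokens to ctx. The draft proposes up to n_draft tokens per round
// and the target verifies them in order. The result is identical to greedy
// decoding with the target alone; only the number of target rounds changes.
Tokens speculative_generate(const NextTokenFn & target,
                            const NextTokenFn & draft,
                            Tokens              ctx,
                            std::size_t         n_new,
                            std::size_t         n_draft,
                            std::size_t *       n_accepted = nullptr) {
    assert(n_draft > 0 && "need at least one drafted token per round");

    const std::size_t n_end    = ctx.size() + n_new;
    std::size_t       accepted = 0;

    while (ctx.size() < n_end) {
        // 1. The draft model proposes tokens autoregressively.
        Tokens proposal = ctx;
        Tokens drafted;
        while (drafted.size() < n_draft && proposal.size() < n_end) {
            const Token t = draft(proposal);
            drafted.push_back(t);
            proposal.push_back(t);
        }

        // 2. The target model checks each drafted token. A real implementation
        //    would evaluate all drafted positions in one batch; this sketch
        //    calls the target once per position for clarity.
        for (const Token d : drafted) {
            const Token t = target(ctx);
            ctx.push_back(t);      // the target's token is always the one kept
            if (t != d) {
                break;             // first mismatch: drop the rest of the draft
            }
            ++accepted;
        }
    }

    if (n_accepted != nullptr) {
        *n_accepted = accepted;
    }
    return ctx;
}
```

Calling `speculative_generate(toy_target, toy_draft, Tokens{0}, 8, 4)` generates 8 tokens in 3 verification rounds instead of 8 sequential target steps, because most drafted tokens are accepted. The speedup in practice depends on how often the draft model agrees with the target.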