llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2025-01-10 12:30:50 +01:00

History

Georgi Gerganov 99bd4ac28c

* llama : infill sampling handle very long tokens

ggml-ci

* cont : better indices

ggml-ci

2024-10-17 22:32:47 +03:00

llama.h

2024-10-17 22:32:47 +03:00