llama.cpp

Mirror of https://github.com/ggerganov/llama.cpp.git, synced 2025-01-10 12:30:50 +01:00

Georgi Gerganov d197545530

* llama : bump max layers from 256 to 512

* llama : replace asserts with exceptions

2024-07-19 16:50:47 +03:00

llama.h
