Georgi Gerganov
|
c725f691ea
|
llama : add struct llama_vocab to the API (#11156)
ggml-ci
|
2025-01-10 11:24:41 +02:00 |
|
Georgi Gerganov
|
47182dd03f
|
llama : update llama_model API names (#11063)
* llama : deprecate llama_free_model, add llama_model_free
ggml-ci
* llama : change `llama_load_model_from_file` -> `llama_model_load_from_file`
ggml-ci
|
2025-01-06 10:55:18 +02:00 |
|
Diego Devesa
|
7cc2d2c889
|
ggml : move AMX to the CPU backend (#10570)
* ggml : move AMX to the CPU backend
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
2024-11-29 21:54:58 +01:00 |
|
Diego Devesa
|
5931c1f233
|
ggml : add support for dynamic loading of backends (#10469)
* ggml : add support for dynamic loading of backends
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
2024-11-25 15:13:39 +01:00 |
|
Diego Devesa
|
b634f8a26f
|
simple-chat : only add bos on first prompt (#10129)
|
2024-11-02 13:08:53 +01:00 |
|
Diego Devesa
|
a6744e43e8
|
llama : add simple-chat example (#10124)
* llama : add simple-chat example
---------
Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>
|
2024-11-01 23:50:59 +01:00 |
|