Diego Devesa
|
7cc2d2c889
|
ggml : move AMX to the CPU backend (#10570)
* ggml : move AMX to the CPU backend
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
2024-11-29 21:54:58 +01:00 |
|
Diego Devesa
|
7eee341bee
|
common : use common_ prefix for common library functions (#9805)
* common : use common_ prefix for common library functions
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
2024-10-10 22:57:42 +02:00 |
|
Xuan Son Nguyen
|
afbbfaa537
|
server : add more env vars, improve gen-docs (#9635)
* server : add more env vars, improve gen-docs
* update server docs
* LLAMA_ARG_NO_CONTEXT_SHIFT
|
2024-09-25 14:05:13 +02:00 |
|
Xuan Son Nguyen
|
bfe76d4a17
|
common : move arg parser code to arg.cpp (#9388)
* common : move arg parser to arg.cpp
* better categorize args
* add cmake
* missing climits
* missing cstdarg
* common : more explicit includes
* fix build
* refactor gpt_params_parse
* update server readme
* fix test
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
2024-09-09 23:36:09 +02:00 |
|
Xuan Son Nguyen
|
1b9ae5189c
|
common : refactor arg parser (#9308)
* (wip) argparser v3
* migrated
* add test
* handle env
* fix linux build
* add export-docs example
* fix build (2)
* skip build test-arg-parser on windows
* update server docs
* bring back missing --alias
* bring back --n-predict
* clarify test-arg-parser
* small correction
* add comments
* fix args with 2 values
* refine example-specific args
* no more lamba capture
Co-authored-by: slaren@users.noreply.github.com
* params.sparams
* optimize more
* export-docs --> gen-docs
|
2024-09-07 20:43:51 +02:00 |
|