Georgi Gerganov
|
d9d54e498d
|
speculative : refactor and add a simpler example (#10362)
* speculative : refactor and add a simpler example
ggml-ci
* speculative : clean-up and add comments and TODOs [no ci]
* speculative : manage context in common_speculative
ggml-ci
* speculative : simplify
ggml-ci
* speculative : simplify (cont)
ggml-ci
* speculative : add --draft-min CLI arg
* speculative : minor fixup
* make : build fixes
* speculative : do not redraft previous drafts
ggml-ci
* speculative : fix the draft sampling
ggml-ci
* speculative : fix compile warning
* common : refactor args
ggml-ci
* common : change defaults [no ci]
* common : final touches
ggml-ci
|
2024-11-25 09:58:41 +02:00 |
|
Diego Devesa
|
7eee341bee
|
common : use common_ prefix for common library functions (#9805)
* common : use common_ prefix for common library functions
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
2024-10-10 22:57:42 +02:00 |
|
Georgi Gerganov
|
6262d13e0b
|
common : reimplement logging (#9418)
https://github.com/ggerganov/llama.cpp/pull/9418
|
2024-09-15 20:46:12 +03:00 |
|
Xuan Son Nguyen
|
bfe76d4a17
|
common : move arg parser code to arg.cpp (#9388)
* common : move arg parser to arg.cpp
* better categorize args
* add cmake
* missing climits
* missing cstdarg
* common : more explicit includes
* fix build
* refactor gpt_params_parse
* update server readme
* fix test
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
|
2024-09-09 23:36:09 +02:00 |
|
Xuan Son Nguyen
|
3f7ccfd649
|
common : bring back missing args, add env var duplication check (#9375)
* common : bring back missing args
* move duplication check to test-arg-parser
* add check for duplicated env var
* correct default values
|
2024-09-08 18:08:55 +02:00 |
|
Xuan Son Nguyen
|
1b9ae5189c
|
common : refactor arg parser (#9308)
* (wip) argparser v3
* migrated
* add test
* handle env
* fix linux build
* add export-docs example
* fix build (2)
* skip build test-arg-parser on windows
* update server docs
* bring back missing --alias
* bring back --n-predict
* clarify test-arg-parser
* small correction
* add comments
* fix args with 2 values
* refine example-specific args
* no more lambda capture
Co-authored-by: slaren@users.noreply.github.com
* params.sparams
* optimize more
* export-docs --> gen-docs
|
2024-09-07 20:43:51 +02:00 |
|