doc: fix outdated default value of batch size (#6336)

* doc: fix outdated default value of batch size * doc: add doc for ubatch-size
2025-01-24 18:39:19 +01:00 · 2024-03-28 16:51:06 +08:00 · 2024-03-28 16:51:06 +08:00 · cfc4d75df6
commit cfc4d75df6
parent 6902cb7f2e
1 changed files with 3 additions and 1 deletions
--- a/examples/main/README.md
+++ b/examples/main/README.md
@ -296,7 +296,9 @@ These options help improve the performance and memory usage of the LLaMA models.

 ### Batch Size

-   `-b N, --batch-size N`: Set the batch size for prompt processing (default: 512). This large batch size benefits users who have BLAS installed and enabled it during the build. If you don't have BLAS enabled ("BLAS=0"), you can use a smaller number, such as 8, to see the prompt progress as it's evaluated in some situations.
+-   `-b N, --batch-size N`: Set the batch size for prompt processing (default: `2048`). This large batch size benefits users who have BLAS installed and enabled it during the build. If you don't have BLAS enabled ("BLAS=0"), you can use a smaller number, such as 8, to see the prompt progress as it's evaluated in some situations.
+
+- `-ub N`, `--ubatch-size N`: physical maximum batch size. This is for pipeline parallelization. Default: `512`.

 ### Prompt Caching