From cfc4d75df6399b36153ef739f2c1abee4c114bb8 Mon Sep 17 00:00:00 2001
From: Ting Sun
Date: Thu, 28 Mar 2024 16:51:06 +0800
Subject: [PATCH] doc: fix outdated default value of batch size (#6336)

* doc: fix outdated default value of batch size

* doc: add doc for ubatch-size
---
 examples/main/README.md | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/examples/main/README.md b/examples/main/README.md
index 9c83fd3bf..bb696b562 100644
--- a/examples/main/README.md
+++ b/examples/main/README.md
@@ -296,7 +296,9 @@ These options help improve the performance and memory usage of the LLaMA models.
 
 ### Batch Size
 
-- `-b N, --batch-size N`: Set the batch size for prompt processing (default: 512). This large batch size benefits users who have BLAS installed and enabled it during the build. If you don't have BLAS enabled ("BLAS=0"), you can use a smaller number, such as 8, to see the prompt progress as it's evaluated in some situations.
+- `-b N, --batch-size N`: Set the batch size for prompt processing (default: `2048`). This large batch size benefits users who have BLAS installed and enabled it during the build. If you don't have BLAS enabled ("BLAS=0"), you can use a smaller number, such as 8, to see the prompt progress as it's evaluated in some situations.
+
+- `-ub N`, `--ubatch-size N`: physical maximum batch size. This is for pipeline parallelization. Default: `512`.
 
 ### Prompt Caching
 
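
Note on how the two defaults relate: the sketch below is not code from the project. It only illustrates the arithmetic, under the assumption that a prompt is first cut into logical batches of at most `-b` tokens and that each logical batch is then evaluated in physical chunks of at most `-ub` tokens. The function name `split_prompt` and the clamping of the physical size to the logical size are hypothetical choices made for this illustration.

```python
def split_prompt(n_tokens, n_batch=2048, n_ubatch=512):
    """Return one list of physical chunk sizes per logical batch.

    Hypothetical helper: n_batch mirrors `-b`, n_ubatch mirrors `-ub`.
    """
    if n_tokens < 0 or n_batch <= 0 or n_ubatch <= 0:
        raise ValueError("n_tokens must be >= 0 and batch sizes must be > 0")

    # Assumption: a physical chunk is never larger than the logical batch.
    n_ubatch = min(n_ubatch, n_batch)

    batches = []
    for start in range(0, n_tokens, n_batch):
        # Size of this logical batch (the last one may be shorter).
        logical = min(n_batch, n_tokens - start)
        # Cut the logical batch into physical chunks of at most n_ubatch tokens.
        chunks = [min(n_ubatch, logical - off) for off in range(0, logical, n_ubatch)]
        batches.append(chunks)
    return batches


if __name__ == "__main__":
    # A 5000-token prompt with the documented defaults (2048 and 512).
    for i, chunks in enumerate(split_prompt(5000)):
        print(f"logical batch {i}: {sum(chunks)} tokens in {len(chunks)} chunks")
```

Under these assumptions, lowering `-b` to a small value such as 8 gives many small logical batches, which is consistent with the README's remark that a smaller batch lets you watch prompt progress as it is evaluated.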