diff --git a/extensions/multimodal/README.md b/extensions/multimodal/README.md
index 24996b3b..0f515ae6 100644
--- a/extensions/multimodal/README.md
+++ b/extensions/multimodal/README.md
@@ -57,7 +57,10 @@ This extension uses the following parameters (from `settings.json`):
## Usage through API
-You can run the multimodal inference through API, by inputting the images to prompt. Images are embedded like so: `f''`, where `img_str` is base-64 jpeg data. Python example:
+You can run the multimodal inference through API, by inputting the images to prompt. Images are embedded like so: `f''`, where `img_str` is base-64 jpeg data. Note that you will need to launch `server.py` with the arguments `--api --extensions multimodal`.
+
+Python example:
+
```Python
import base64
import requests