Commit 1b30fcd
Parent(s): 59231af

Update vLLM support
README.md CHANGED
@@ -194,13 +194,23 @@ You can run the TensorRT-LLM server by following steps:
 
 2. Run server with the configuration
 ```bash
-trtllm-serve serve
+trtllm-serve serve LGAI-EXAONE/EXAONE-4.0-32B --backend pytorch --extra_llm_api_options extra_llm_api_config.yaml
 ```
 
 For more details, please refer to [the documentation](https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/models/core/exaone) of EXAONE from TensorRT-LLM.
 
+### vLLM
+
+vLLM officially supports EXAONE 4.0 models as of version `0.10.0`. You can run the vLLM server with the following command:
+
+```bash
+vllm serve LGAI-EXAONE/EXAONE-4.0-32B --enable-auto-tool-choice --tool-call-parser hermes --reasoning-parser qwen3
+```
+
+For more details, please refer to [the vLLM documentation](https://docs.vllm.ai/en/stable/).
+
 > [!NOTE]
-> Other inference engines including `
+> Other inference engines, including `sglang`, do not officially support EXAONE 4.0 yet. We will update this section as soon as these libraries add support.
 
 
 ## Performance
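To sanity-check the updated `trtllm-serve` command, a request like the one below can be used. This is a minimal sketch, assuming `trtllm-serve` exposes its OpenAI-compatible API on `localhost:8000` (the default in recent TensorRT-LLM releases); the prompt and `max_tokens` value are purely illustrative.

```bash
# Sketch of a smoke test against the served model, assuming the
# OpenAI-compatible endpoint is at localhost:8000 (trtllm-serve default).
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "LGAI-EXAONE/EXAONE-4.0-32B",
    "messages": [{"role": "user", "content": "Hello!"}],
    "max_tokens": 64
  }'
```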
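Because the new vLLM command enables automatic tool choice with the `hermes` parser, a tool-calling request is a natural way to exercise those flags. The sketch below assumes vLLM's OpenAI-compatible server is running on its default `localhost:8000`; the `get_weather` function is hypothetical, defined only to illustrate the request shape.

```bash
# Hypothetical tool-calling request; the "get_weather" function is
# invented for illustration. With --enable-auto-tool-choice, the model
# decides on its own whether to emit a tool call for this prompt.
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "LGAI-EXAONE/EXAONE-4.0-32B",
    "messages": [{"role": "user", "content": "What is the weather in Seoul?"}],
    "tools": [{
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Get current weather for a city",
        "parameters": {
          "type": "object",
          "properties": {"city": {"type": "string"}},
          "required": ["city"]
        }
      }
    }]
  }'
```

With `--reasoning-parser` set, recent vLLM versions should also split the model's reasoning into a separate `reasoning_content` field of the response message, so clients can display or hide it independently of the final answer.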