Commit 8abcad7 · committed by jiaxin · Parent: b86f414

update README

Files changed:
- docs/sglang_deploy_guide.md +26 -6
- docs/sglang_deploy_guide_cn.md +19 -4
- docs/vllm_deploy_guide.md +1 -1
- docs/vllm_deploy_guide_cn.md +1 -1
docs/sglang_deploy_guide.md
CHANGED
@@ -35,13 +35,30 @@ It is recommended to use a virtual environment (such as **venv**, **conda**, or
 We recommend installing SGLang in a fresh Python environment. Since it has not been released yet, you need to manually build it from the source code:
 
 ```bash
-git clone https://github.com/sgl-project/sglang.git
+git clone -b v0.5.4.post3 https://github.com/sgl-project/sglang.git
 cd sglang
-
+
+# Install the python packages
+pip install --upgrade pip
+pip install -e "python"
 ```
 
 Run the following command to start the SGLang server. SGLang will automatically download and cache the MiniMax-M2 model from Hugging Face.
 
+4-GPU deployment command:
+
+```bash
+python -m sglang.launch_server \
+--model-path MiniMaxAI/MiniMax-M2 \
+--tp-size 4 \
+--tool-call-parser minimax-m2 \
+--reasoning-parser minimax-append-think \
+--host 0.0.0.0 \
+--trust-remote-code \
+--port 8000 \
+--mem-fraction-static 0.7
+```
+
 8-GPU deployment command:
 
 ```bash
@@ -50,15 +67,18 @@ python -m sglang.launch_server \
 --tp-size 8 \
 --ep-size 8 \
 --tool-call-parser minimax-m2 \
---reasoning-parser minimax \
+--reasoning-parser minimax-append-think \
+--host 0.0.0.0 \
 --trust-remote-code \
 --port 8000 \
 --mem-fraction-static 0.7
 ```
 
+
+
 ## Testing Deployment
 
-After startup, you can test the
+After startup, you can test the SGLang OpenAI-compatible API with the following command:
 
 ```bash
 curl http://localhost:8000/v1/chat/completions \
@@ -84,13 +104,13 @@ export HF_ENDPOINT=https://hf-mirror.com
 
 ### MiniMax-M2 model is not currently supported
 
-This
+This SGLang version is outdated. Please upgrade to the latest version.
 
 ## Getting Support
 
 If you encounter any issues while deploying the MiniMax model:
 
-- Contact our technical support team through official channels such as email at [
+- Contact our technical support team through official channels such as email at [model@minimax.io](mailto:model@minimax.io)
 
 - Submit an issue on our [GitHub](https://github.com/MiniMax-AI) repository
 
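The "Testing Deployment" hunk above is cut off at the hunk boundary, so the guide's full curl example is not visible here. As a rough sketch, a complete smoke test against the OpenAI-compatible endpoint started above could look like this (the JSON payload is illustrative, not the guide's exact body):

```bash
# Minimal smoke test of the SGLang OpenAI-compatible endpoint
# (illustrative payload; the model name mirrors the --model-path above)
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "MiniMaxAI/MiniMax-M2",
    "messages": [{"role": "user", "content": "Hello, who are you?"}]
  }'
```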
docs/sglang_deploy_guide_cn.md
CHANGED
@@ -1,6 +1,6 @@
 # MiniMax M2 Model SGLang Deployment Guide
 
-We recommend using [SGLang](https://github.com/sgl-project/sglang) to deploy the [MiniMax-M2](https://huggingface.co/MiniMaxAI/MiniMax-M2) model.
+We recommend using [SGLang](https://github.com/sgl-project/sglang) to deploy the [MiniMax-M2](https://huggingface.co/MiniMaxAI/MiniMax-M2) model. SGLang is a high-performance inference engine with excellent serving throughput, efficient and intelligent memory management, strong batch-request handling, and deeply optimized low-level performance. We recommend reviewing SGLang's official documentation to check hardware compatibility before deployment.
 
 ## Models Covered by This Document
 
@@ -41,6 +41,20 @@ uv pip install ./python --torch-backend=auto
 
 Run the following command to start the SGLang server. SGLang will automatically download and cache the MiniMax-M2 model from Hugging Face.
 
+4-GPU deployment command:
+
+```bash
+python -m sglang.launch_server \
+--model-path MiniMaxAI/MiniMax-M2 \
+--tp-size 4 \
+--tool-call-parser minimax-m2 \
+--reasoning-parser minimax-append-think \
+--host 0.0.0.0 \
+--trust-remote-code \
+--port 8000 \
+--mem-fraction-static 0.7
+```
+
 8-GPU deployment command:
 
 ```bash
@@ -49,7 +63,8 @@ python -m sglang.launch_server \
 --tp-size 8 \
 --ep-size 8 \
 --tool-call-parser minimax-m2 \
---reasoning-parser minimax \
+--reasoning-parser minimax-append-think \
+--host 0.0.0.0 \
 --trust-remote-code \
 --port 8000 \
 --mem-fraction-static 0.7
@@ -83,13 +98,13 @@ export HF_ENDPOINT=https://hf-mirror.com
 
 ### MiniMax-M2 model is not currently supported
 
-This
+This SGLang version is outdated. Please upgrade to the latest version.
 
 ## Getting Support
 
 If you encounter any issues while deploying the MiniMax model:
 
-- Contact via email at [
+- Contact our technical support team through official channels such as email at [model@minimax.io](mailto:model@minimax.io)
 
 - Submit an issue on our [GitHub](https://github.com/MiniMax-AI) repository
 We will continue to optimize the model deployment experience; feedback is welcome!
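Both guides' troubleshooting sections point at an outdated SGLang build when the model is reported as unsupported. A quick way to confirm what is installed before upgrading, assuming a pip-based environment (querying `sglang.__version__` directly is an assumption about the package layout):

```bash
# Show the installed SGLang version via pip metadata
pip show sglang | grep -i '^version'

# Or ask the package itself (assumes sglang exposes __version__)
python -c "import sglang; print(sglang.__version__)"
```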
docs/vllm_deploy_guide.md
CHANGED
@@ -87,7 +87,7 @@ This vLLM version is outdated. Please upgrade to the latest version.
 
 If you encounter any issues while deploying the MiniMax model:
 
-- Contact our technical support team through official channels such as email at [
+- Contact our technical support team through official channels such as email at [model@minimax.io](mailto:model@minimax.io)
 
 - Submit an issue on our [GitHub](https://github.com/MiniMax-AI) repository
 
docs/vllm_deploy_guide_cn.md
CHANGED
@@ -86,7 +86,7 @@ export HF_ENDPOINT=https://hf-mirror.com
 
 If you encounter any issues while deploying the MiniMax model:
 
-- Contact via email at [
+- Contact our technical support team through official channels such as email at [model@minimax.io](mailto:model@minimax.io)
 
 - Submit an issue on our [GitHub](https://github.com/MiniMax-AI) repository
 We will continue to optimize the model deployment experience; feedback is welcome!