Text Generation
Transformers
Safetensors
minimax
conversational
fp8
zhaochenyang20 commited on
Commit
17642e0
·
verified ·
1 Parent(s): c4075e3

Update docs/sglang_deploy_guide.md

Browse files
Files changed (1) hide show
  1. docs/sglang_deploy_guide.md +6 -8
docs/sglang_deploy_guide.md CHANGED
@@ -32,7 +32,7 @@ The following are recommended configurations; actual requirements should be adju
32
 
33
  It is recommended to use a virtual environment (such as **venv**, **conda**, or **uv**) to avoid dependency conflicts.
34
 
35
- We recommend installing SGLang in a fresh Python environment. Since it has not been released yet, you need to manually build it from the source code:
36
 
37
  ```bash
38
  git clone -b v0.5.4.post3 https://github.com/sgl-project/sglang.git
@@ -56,7 +56,7 @@ python -m sglang.launch_server \
56
  --host 0.0.0.0 \
57
  --trust-remote-code \
58
  --port 8000 \
59
- --mem-fraction-static 0.7
60
  ```
61
 
62
  8-GPU deployment command:
@@ -67,15 +67,13 @@ python -m sglang.launch_server \
67
  --tp-size 8 \
68
  --ep-size 8 \
69
  --tool-call-parser minimax-m2 \
70
- --reasoning-parser minimax-append-think \
71
- --host 0.0.0.0 \
72
  --trust-remote-code \
 
 
73
  --port 8000 \
74
- --mem-fraction-static 0.7
75
  ```
76
 
77
-
78
-
79
  ## Testing Deployment
80
 
81
  After startup, you can test the SGLang OpenAI-compatible API with the following command:
@@ -104,7 +102,7 @@ export HF_ENDPOINT=https://hf-mirror.com
104
 
105
  ### MiniMax-M2 model is not currently supported
106
 
107
- This SGLang version is outdated. Please upgrade to the latest version.
108
 
109
  ## Getting Support
110
 
 
32
 
33
  It is recommended to use a virtual environment (such as **venv**, **conda**, or **uv**) to avoid dependency conflicts.
34
 
35
+ We recommend installing SGLang in a fresh Python environment:
36
 
37
  ```bash
38
  git clone -b v0.5.4.post3 https://github.com/sgl-project/sglang.git
 
56
  --host 0.0.0.0 \
57
  --trust-remote-code \
58
  --port 8000 \
59
+ --mem-fraction-static 0.85
60
  ```
61
 
62
  8-GPU deployment command:
 
67
  --tp-size 8 \
68
  --ep-size 8 \
69
  --tool-call-parser minimax-m2 \
 
 
70
  --trust-remote-code \
71
+ --host 0.0.0.0 \
72
+ --reasoning-parser minimax-append-think \
73
  --port 8000 \
74
+ --mem-fraction-static 0.85
75
  ```
76
 
 
 
77
  ## Testing Deployment
78
 
79
  After startup, you can test the SGLang OpenAI-compatible API with the following command:
 
102
 
103
  ### MiniMax-M2 model is not currently supported
104
 
105
+ Please upgrade to the latest stable version, >= v0.5.4.post3.
106
 
107
  ## Getting Support
108