Update README.md
Browse files
    	
        README.md
    CHANGED
    
    | 
         @@ -211,12 +211,12 @@ python -m sglang.launch_server --model-path HuggingFaceTB/SmolLM3-3B 
     | 
|
| 211 | 
         
             
            #### vLLM
         
     | 
| 212 | 
         | 
| 213 | 
         
             
            ```bash
         
     | 
| 214 | 
         
            -
            vllm serve HuggingFaceTB/SmolLM3-3B
         
     | 
| 215 | 
         
             
            ```
         
     | 
| 216 | 
         | 
| 217 | 
         
             
            #### Setting `chat_template_kwargs`
         
     | 
| 218 | 
         | 
| 219 | 
         
            -
            You can specify `chat_template_kwargs` such as `enable_thinking`  
     | 
| 220 | 
         | 
| 221 | 
         
             
            ```bash
         
     | 
| 222 | 
         
             
            curl http://localhost:8000/v1/chat/completions -H "Content-Type: application/json" -d '{
         
     | 
| 
         | 
|
| 211 | 
         
             
            #### vLLM
         
     | 
| 212 | 
         | 
| 213 | 
         
             
            ```bash
         
     | 
| 214 | 
         
            +
            vllm serve HuggingFaceTB/SmolLM3-3B --enable-auto-tool-choice --tool-call-parser=hermes
         
     | 
| 215 | 
         
             
            ```
         
     | 
| 216 | 
         | 
| 217 | 
         
             
            #### Setting `chat_template_kwargs`
         
     | 
| 218 | 
         | 
| 219 | 
         
            +
            You can specify `chat_template_kwargs` such as `enable_thinking` to a deployed model by passing the `chat_template_kwargs` parameter in the API request.
         
     | 
| 220 | 
         | 
| 221 | 
         
             
            ```bash
         
     | 
| 222 | 
         
             
            curl http://localhost:8000/v1/chat/completions -H "Content-Type: application/json" -d '{
         
     |