Update README.md
README.md CHANGED

@@ -98,7 +98,7 @@ You can refer to the content in [Tencent-Hunyuan-Large](https://github.com/Tence
 
 ### Inference Performance
 
-This section presents the efficiency test results of deploying various models
+This section presents the efficiency test results of deploying various models using vLLM, including inference speed (tokens/s) under different batch sizes.
 
 | Inference Framework | Model | Number of GPUs (series 1) | input_length | batch=1 | batch=4 |
 |------|------------|-------------------------|-------------------------|---------------------|----------------------|
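Tokens/s figures like the ones this table reports are typically obtained by timing a batched `generate` call and dividing the number of newly generated tokens by the elapsed wall-clock time. A minimal sketch of that arithmetic follows; `generate_fn` is a hypothetical stand-in for the inference backend (the table's real runs use vLLM, which needs a GPU, so a dummy backend is used here for illustration):

```python
import time


def tokens_per_second(total_new_tokens: int, elapsed_s: float) -> float:
    """Throughput in tokens/s: generated tokens divided by wall-clock time."""
    if elapsed_s <= 0:
        raise ValueError("elapsed time must be positive")
    return total_new_tokens / elapsed_s


def benchmark(generate_fn, prompts):
    """Time one batched generate call and return its throughput.

    `generate_fn` is a hypothetical wrapper around an inference backend
    (e.g. vLLM's LLM.generate); here it must return the total number of
    new tokens produced for the batch of prompts.
    """
    start = time.perf_counter()
    total_new_tokens = generate_fn(prompts)
    elapsed = time.perf_counter() - start
    return tokens_per_second(total_new_tokens, elapsed)


if __name__ == "__main__":
    # Dummy backend: pretend each prompt yields 128 new tokens.
    fake_backend = lambda prompts: 128 * len(prompts)
    for batch in (1, 4):  # mirrors the batch=1 / batch=4 table columns
        rate = benchmark(fake_backend, ["hello"] * batch)
        print(f"batch={batch}: {rate:.0f} tokens/s")
```

With a real backend, larger batches usually raise aggregate tokens/s because the GPU is better utilized, which is why the table reports batch=1 and batch=4 separately.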