Update README.md
README.md
CHANGED
|
@@ -125,6 +125,25 @@ User prompt...
|
|
| 125 |
|
| 126 |
In this format, the system section is optional, and the conversation can be either single-turn or multi-turn. At inference time, always end your input string with "\<s\>bot" to prompt the model to generate an answer.
|
| 127 |
|
| 128 |
|
| 129 |
## Quickstart
|
| 130 |
|
|
@@ -293,6 +312,26 @@ User prompt...
|
|
| 293 |
|
| 294 |
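In this format, the System prompt is optional (set it as needed), and both single-turn and multi-turn conversations are supported. At inference time, make sure the concatenated prompt string ends with "\<s\>bot\n" to prompt the model to generate an answer.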
|
| 295 |
|
| 296 |
## Quickstart
|
| 297 |
|
| 298 |
```python
|
|
|
|
| 125 |
|
| 126 |
In this format, the system section is optional, and the conversation can be either single-turn or multi-turn. At inference time, always end your input string with "\<s\>bot" to prompt the model to generate an answer.
|
| 127 |
|
| 128 |
+
For example, the prompt format used for inference on HumanEval looks like this:
|
| 129 |
+
|
| 130 |
+
```python
|
| 131 |
+
<s>human
|
| 132 |
+
# language: Python
|
| 133 |
+
from typing import List
|
| 134 |
+
def separate_paren_groups(paren_string: str) -> List[str]:
|
| 135 |
+
""" Input to this function is a string containing multiple groups of nested parentheses. Your goal is to
|
| 136 |
+
separate those group into separate strings and return the list of those.
|
| 137 |
+
Separate groups are balanced (each open brace is properly closed) and not nested within each other
|
| 138 |
+
Ignore any spaces in the input string.
|
| 139 |
+
>>> separate_paren_groups('( ) (( )) (( )( ))')
|
| 140 |
+
['()', '(())', '(()())']
|
| 141 |
+
"""
|
| 142 |
+
<s>bot
|
| 143 |
+
|
| 144 |
+
```
|
| 145 |
+
|
| 146 |
+
Note that we also add the programming language tag used by CodeGeeX models (e.g. ```# language: Python``` for Python).
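As an illustration of the format above, here is a minimal sketch of assembling a single-turn prompt in Python. The helper name `build_humaneval_prompt` is hypothetical and not part of the README. The sketch also assumes that the `<s>bot` marker is followed by a newline, as the blank line after `<s>bot` in the example suggests, and that the language tag is written as a `#` comment, which only suits languages with `#` comments.

```python
def build_humaneval_prompt(task_prompt: str, language: str = "Python") -> str:
    """Wrap a task prompt in the single-turn format (hypothetical helper)."""
    # Language tag line, e.g. "# language: Python".
    # Assumption: a "#" comment prefix, which fits Python-style languages only.
    language_tag = f"# language: {language}"
    # Human turn: role marker, language tag, then the task text
    # with any trailing whitespace or newlines stripped.
    human_turn = f"<s>human\n{language_tag}\n{task_prompt.rstrip()}\n"
    # End with the bot role marker so the model starts generating the answer.
    # Assumption: the marker is followed by a newline.
    return human_turn + "<s>bot\n"


task = (
    "from typing import List\n"
    "def separate_paren_groups(paren_string: str) -> List[str]:\n"
    '    """Split balanced parenthesis groups into separate strings."""\n'
)
prompt = build_humaneval_prompt(task)
print(prompt)
```

The returned string can be passed to a tokenizer as-is. Multi-turn conversations and the optional system section are left out, because their exact markers are not shown in this excerpt.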
|
| 147 |
|
| 148 |
## Quickstart
|
| 149 |
|
|
|
|
| 312 |
|
| 313 |
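In this format, the System prompt is optional (set it as needed), and both single-turn and multi-turn conversations are supported. At inference time, make sure the concatenated prompt string ends with "\<s\>bot\n" to prompt the model to generate an answer.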
|
| 314 |
|
| 315 |
+
For example, the prompt format used for inference on HumanEval data looks like this:
|
| 316 |
+
|
| 317 |
+
```python
|
| 318 |
+
<s>human
|
| 319 |
+
# language: Python
|
| 320 |
+
from typing import List
|
| 321 |
+
def separate_paren_groups(paren_string: str) -> List[str]:
|
| 322 |
+
""" Input to this function is a string containing multiple groups of nested parentheses. Your goal is to
|
| 323 |
+
separate those group into separate strings and return the list of those.
|
| 324 |
+
Separate groups are balanced (each open brace is properly closed) and not nested within each other
|
| 325 |
+
Ignore any spaces in the input string.
|
| 326 |
+
>>> separate_paren_groups('( ) (( )) (( )( ))')
|
| 327 |
+
['()', '(())', '(()())']
|
| 328 |
+
"""
|
| 329 |
+
<s>bot
|
| 330 |
+
|
| 331 |
+
```
|
| 332 |
+
|
| 333 |
+
In particular, we also use the programming language tag adopted by the CodeGeeX family of models (for example, ```# language: Python``` for Python).
|
| 334 |
+
|
| 335 |
## Quickstart
|
| 336 |
|
| 337 |
```python
|