Commit
·
b21a48a
1
Parent(s):
81d56cf
Update README.md
Browse files
README.md
CHANGED
|
@@ -671,6 +671,64 @@ model-index:
|
|
| 671 |
- **Point of Contact:** [Niklas Muennighoff](mailto:niklas@hf.co)
|
| 672 |
- **Languages:** Refer to [BLOOM](https://huggingface.co/bigscience/bloom) for pretraining & [xP3](https://huggingface.co/bigscience/xP3) for finetuning language proportions. It understands both pretraining & finetuning languages.
|
| 673 |
- **BLOOMZ & mT0 Model Family:**
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 674 |
|Name|Explanation|
|
| 675 |
|----|-----------|
|
| 676 |
|[bloomz-560m](https://huggingface.co/bigscience/bloomz-560m)| 560M parameter multitask finetuned version of [bloom-560m](https://huggingface.co/bigscience/bloom-560m) on [xP3](https://huggingface.co/datasets/bigscience/xP3)|
|
|
|
|
| 671 |
- **Point of Contact:** [Niklas Muennighoff](mailto:niklas@hf.co)
|
| 672 |
- **Languages:** Refer to [BLOOM](https://huggingface.co/bigscience/bloom) for pretraining & [xP3](https://huggingface.co/bigscience/xP3) for finetuning language proportions. It understands both pretraining & finetuning languages.
|
| 673 |
- **BLOOMZ & mT0 Model Family:**
|
| 674 |
+
|
| 675 |
+
<table>
|
| 676 |
+
<tr>
|
| 677 |
+
<th colspan="11">Multitask finetuned on xP3 - Recommended for prompting in English.
|
| 678 |
+
</tr>
|
| 679 |
+
<tr>
|
| 680 |
+
<th>Parameters</th>
|
| 681 |
+
<td>560M</td>
|
| 682 |
+
<td>560M</td>
|
| 683 |
+
<td>560M</td>
|
| 684 |
+
<td>560M</td>
|
| 685 |
+
<td>560M</td>
|
| 686 |
+
<td>560M</td>
|
| 687 |
+
<td>560M</td>
|
| 688 |
+
<td>560M</td>
|
| 689 |
+
<td>560M</td>
|
| 690 |
+
<td>560M</td>
|
| 691 |
+
</tr>
|
| 692 |
+
<tr>
|
| 693 |
+
<th>Finetuned Model</th>
|
| 694 |
+
<td>560M</td>
|
| 695 |
+
<td>560M</td>
|
| 696 |
+
<td>560M</td>
|
| 697 |
+
<td>560M</td>
|
| 698 |
+
<td>560M</td>
|
| 699 |
+
<td>560M</td>
|
| 700 |
+
<td>560M</td>
|
| 701 |
+
<td>560M</td>
|
| 702 |
+
<td>560M</td>
|
| 703 |
+
<td>560M</td>
|
| 704 |
+
</tr>
|
| 705 |
+
</tr>
|
| 706 |
+
<tr>
|
| 707 |
+
<th>Original pretrained checkpoint</th>
|
| 708 |
+
<td>560M</td>
|
| 709 |
+
<td>560M</td>
|
| 710 |
+
<td>560M</td>
|
| 711 |
+
<td>560M</td>
|
| 712 |
+
<td>560M</td>
|
| 713 |
+
<td>560M</td>
|
| 714 |
+
<td>560M</td>
|
| 715 |
+
<td>560M</td>
|
| 716 |
+
<td>560M</td>
|
| 717 |
+
<td>560M</td>
|
| 718 |
+
</tr>
|
| 719 |
+
</table>
|
| 720 |
+
|
| 721 |
+
<table>
|
| 722 |
+
<tr>
|
| 723 |
+
<td>One</td>
|
| 724 |
+
<td>Two</td>
|
| 725 |
+
</tr>
|
| 726 |
+
<tr>
|
| 727 |
+
<td colspan="2">Three</td>
|
| 728 |
+
</tr>
|
| 729 |
+
</table>
|
| 730 |
+
|
| 731 |
+
|
| 732 |
|Name|Explanation|
|
| 733 |
|----|-----------|
|
| 734 |
|[bloomz-560m](https://huggingface.co/bigscience/bloomz-560m)| 560M parameter multitask finetuned version of [bloom-560m](https://huggingface.co/bigscience/bloom-560m) on [xP3](https://huggingface.co/datasets/bigscience/xP3)|
|