Update README.md
README.md CHANGED
@@ -3,8 +3,6 @@ library_name: transformers
 tags:
 - multimodal
 - multilingual
-- llm
-- vision
 - vlm
 - translation
 language:
@@ -29,6 +27,7 @@ language:
 base_model:
 - Unbabel/Tower-Plus-2B
 pipeline_tag: image-text-to-text
+license: cc-by-nc-sa-4.0
 ---
 
 # Model Card for TowerVision
@@ -41,8 +40,6 @@ TowerVision is a family of open-source multilingual vision-language models with
 
 This model card covers the TowerVision family, including the 2B and 9B parameter versions, both in their instruct-tuned (it) and pretrained (pt) variants, with the latter not undergoing instruction tuning.
 
-- **Point of Contact**: X (add some email here)
-- **License**: Apache 2.0
 - **Model Family**: TowerVision (2B, 9B variants)
 - **Context length**: 8192 tokens
 - **Languages**: 20+ languages including European, Asian, and other language families
@@ -62,6 +59,9 @@ This model card covers the TowerVision family, including the 2B and 9B parameter
 
 ## How to Use TowerVision
 
+When using the model, make sure your prompt is formatted correctly!
+Also, we recommend using **bfloat16** rather than **fp32/fp16**.
+
 ### Quick Start with Transformers
 
 <details open>
@@ -337,12 +337,6 @@ If you find TowerVision useful in your research, please consider citing the foll
 
 For errors or additional questions about details in this model card, contact the research team.
 
-## Terms of Use
-
-We hope that the release of this model will make community-based research efforts more accessible by releasing the weights of highly performant multilingual vision-language models to researchers all over the world.
-
-This model is governed by the Apache 2.0 License.
-
 ## Acknowledgments
 
 TowerVision builds upon the excellent work of: