Update README.md
README.md CHANGED
@@ -37,7 +37,7 @@ demonstrate their fine-tuning potential in various downstream tasks.
 - SightationVQA
 - SightationReasoning
 
-<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/cNshK4QAdiNMqk7x6J6j7.png" width="
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/cNshK4QAdiNMqk7x6J6j7.png" width="100%" height="100%" title="visual_abstract" alt="visual_abstract"></img>
 The key benefit of utilizing sighted user feedback lies in their assessments that are based on solid visual
 grounding. The compiled assessments prove an effective training substance for steering VLMs towards more
 accessible descriptions.
@@ -45,7 +45,7 @@ accessible descriptions.
 The description qualities assessed by their respective evaluator groups.
 
 ## Results
-<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/094e9Hw7lauvT1tshg1Wj.png" width="
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a86f66c6f66e2fa5888b41/094e9Hw7lauvT1tshg1Wj.png" width="90%" height="90%" title="spider_chart" alt="spider_chart"></img>
 Tuning VLMs on Sightation enhanced various qualities of the diagram descriptions, evaluated by BLV educators, and shown here as normalized ratings averaged in each aspect.
 The capability of the dataset is most strongly pronounced with Qwen2-VL-2B model, shown above.
 