Bin12345 commited on
Commit
d9386b6
·
verified ·
1 Parent(s): 5244798

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -0
README.md ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ pipeline_tag: image-text-to-text
4
+ tags:
5
+ - grounding
6
+ - agent
7
+
8
+ GUI_Spotlight is a `think-with-image` GUI visual grounding model. For each step, it first calls tooling to crop the image according to its own predictions, and then returns an exact coordinate location.
9
+
10
+ For evaluation and inference details, please refer to [the GUI_Spotlight repository](https://github.com/bin123apple/GUI_Spotlight)