Chinastark commited on
Commit
035a971
·
verified ·
1 Parent(s): d06e899

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -1
README.md CHANGED
@@ -5,4 +5,12 @@ language:
5
  base_model:
6
  - Qwen/Qwen2.5-Coder-3B-Instruc
7
  pipeline_tag: translation
8
- ---
 
 
 
 
 
 
 
 
 
5
  base_model:
6
  - Qwen/Qwen2.5-Coder-3B-Instruc
7
  pipeline_tag: translation
8
+ ---
9
+
10
+ ### Performance on the BIRD Development Set
11
+
12
+ We further evaluate **DatA-SQL-3B** on the **BIRD** development set using different self-consistency voting sizes.
13
+ Under **Vote@8**, our model attains an **execution accuracy (EX) of 61.05 %**.
14
+ When the voting size increases to **Vote@32**, the EX further improves to **62.58 %**.
15
+ These results confirm that larger voting ensembles enhance semantic robustness and execution stability while maintaining nearly the same inference cost due to our lightweight multi-agent design.
16
+ Overall, **DatA-SQL** achieves competitive or superior accuracy compared with GPT-based pipelines at only a fraction of their computational expense.