Chinastark
/

DatA-SQL-3B

Model card Files Files and versions

Chinastark commited on 19 days ago

Commit

035a971

·

verified ·

1 Parent(s): d06e899

Update README.md

Files changed (1) hide show

README.md +9 -1

README.md CHANGED Viewed

@@ -5,4 +5,12 @@ language:
 base_model:
 - Qwen/Qwen2.5-Coder-3B-Instruc
 pipeline_tag: translation
----

 base_model:
 - Qwen/Qwen2.5-Coder-3B-Instruc
 pipeline_tag: translation
+---
+### Performance on the BIRD Development Set
+We further evaluate **DatA-SQL-3B** on the **BIRD** development set using different self-consistency voting sizes.
+Under **Vote@8**, our model attains an **execution accuracy (EX) of 61.05 %**.
+When the voting size increases to **Vote@32**, the EX further improves to **62.58 %**.
+These results confirm that larger voting ensembles enhance semantic robustness and execution stability while maintaining nearly the same inference cost due to our lightweight multi-agent design.
+Overall, **DatA-SQL** achieves competitive or superior accuracy compared with GPT-based pipelines at only a fraction of their computational expense.