Update README.md
Browse files
README.md
CHANGED
|
@@ -5,4 +5,12 @@ language:
|
|
| 5 |
base_model:
|
| 6 |
- Qwen/Qwen2.5-Coder-3B-Instruc
|
| 7 |
pipeline_tag: translation
|
| 8 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 5 |
base_model:
|
| 6 |
- Qwen/Qwen2.5-Coder-3B-Instruc
|
| 7 |
pipeline_tag: translation
|
| 8 |
+
---
|
| 9 |
+
|
| 10 |
+
### Performance on the BIRD Development Set
|
| 11 |
+
|
| 12 |
+
We further evaluate **DatA-SQL-3B** on the **BIRD** development set using different self-consistency voting sizes.
|
| 13 |
+
Under **Vote@8**, our model attains an **execution accuracy (EX) of 61.05 %**.
|
| 14 |
+
When the voting size increases to **Vote@32**, the EX further improves to **62.58 %**.
|
| 15 |
+
These results confirm that larger voting ensembles enhance semantic robustness and execution stability while maintaining nearly the same inference cost due to our lightweight multi-agent design.
|
| 16 |
+
Overall, **DatA-SQL** achieves competitive or superior accuracy compared with GPT-based pipelines at only a fraction of their computational expense.
|