|
|
--- |
|
|
license: bigscience-openrail-m |
|
|
language: |
|
|
- en |
|
|
base_model: |
|
|
- Qwen/Qwen2.5-Coder-3B-Instruct |
|
|
pipeline_tag: translation |
|
|
--- |
|
|
|
|
|
### Performance on the BIRD Development Set |
|
|
|
|
|
We further evaluate **DatA-SQL-3B** on the **BIRD** development set using different self-consistency voting sizes. |
|
|
Under **Vote@8**, our model attains an **execution accuracy (EX) of 61.05 %**. |
|
|
When the voting size increases to **Vote@32**, the EX further improves to **62.58 %**. |
|
|
These results confirm that larger voting ensembles enhance semantic robustness and execution stability while maintaining nearly the same inference cost due to our lightweight multi-agent design. |
|
|
Overall, **DatA-SQL** achieves competitive or superior accuracy compared with GPT-based pipelines at only a fraction of their computational expense. |