gair-prox/CodeLlama-7B-ProXMath

SinclairWang committed on Sep 17, 2024

Commit d25d9ea · verified · 1 parent: 2936274

Update README.md

Files changed (1)

README.md +4 -5

@@ -24,11 +24,10 @@ base_model:
 ProX models are evaluated on 9 common math reasoning benchmarks.
-| Model                 	| asdiv    	| gsm8k    	| mathqa   	| mawps    	| minerva_math 	| mmlu_stem 	| sat_math 	| svamp    	| tabmwp   	| average  	|
-|-----------------------	|----------	|----------	|----------	|----------	|--------------	|-----------	|----------	|----------	|----------	|----------	|
-| CodeLlama-7B          	| 50.7     	| 11.8     	| 14.3     	| 62.6     	| 5.0          	| 20.4      	| 21.9     	| 44.2     	| 30.6     	| 29.1     	|
-| CodeLlama-7B-ProXMath 	| **67.9** 	| **35.6** 	| **38.9** 	| **82.7** 	| **17.6**     	| **42.6**  	| **62.5** 	| **55.8** 	| **41.3** 	| **49.4** 	|
+| Model                 |   asdiv  |   gsm8k  |  mathqa  |   mawps  | minerva_math | mmlu_stem | sat_math |   svamp  |  tabmwp  |  average |
+|-----------------------|:--------:|:--------:|:--------:|:--------:|:------------:|:---------:|:--------:|:--------:|:--------:|:--------:|
+| CodeLlama-7B          |   50.7   |   11.8   |   14.3   |   62.6   |      5.0     |    20.4   |   21.9   |   44.2   |   30.6   |   29.1   |
+| CodeLlama-7B-ProXMath | **67.9** | **35.6** | **38.9** | **82.7** |   **17.6**   |  **42.6** | **62.5** | **55.8** | **41.3** | **49.4** |
 ### Citation
 ```