update readme
README.md CHANGED

@@ -21,12 +21,12 @@ This metric is used to evaluate code generation on the [APPS benchmark](https://
 You can load the metric and use it with the following commands:
 ```
 from evaluate import load
-
+apps_metric = load('loubnabnl/apps_metric')
 results = apps_metric.compute(predictions=generations)
 ```

 ### Inputs
-**generations** (list(str)): List of code generations, each sub-list corresponds to the
+**generations** list(list(str)): List of code generations, each sub-list corresponds to the generations for a problem in APPS dataset, the order of the samples in the dataset must be kept (with respect to the difficulty level).

 ### Output Values
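The `list(list(str))` shape introduced for **generations** in this diff can be sketched as follows. This is an illustrative example only: the candidate programs are made up, and only the nesting structure matters — one sub-list of candidate solutions per APPS problem, kept in dataset order.

```python
# Sketch of the expected input shape for apps_metric.compute:
# the outer list is indexed by problem (in APPS dataset order,
# grouped by difficulty level), the inner list holds the candidate
# code generations (strings) for that problem.
generations = [
    ["def solve():\n    return 1", "print(1)"],  # candidates for problem 0
    ["print(2)"],                                # candidates for problem 1
]

# Verify the list(list(str)) structure before passing it to compute().
assert all(isinstance(sub, list) for sub in generations)
assert all(isinstance(code, str) for sub in generations for code in sub)
```

Note that even a single generation per problem must still be wrapped in its own sub-list, since the metric distinguishes problems by the outer index.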