Model Card for Model ID

Tasks Version Filter n-shot Metric Value Stderr
arc_challenge 1 none 25 acc 0.2090 ยฑ 0.0119
none 25 acc_norm 0.2389 ยฑ 0.0125
truthfulqa_mc2 2 none 0 acc 0.4297 ยฑ 0.0152
winogrande 1 none 5 acc 0.5217 ยฑ 0.014
hellaswag 1 none 10 acc 0.2923 ยฑ 0.0045
none 10 acc_norm 0.3198 ยฑ 0.0047
gsm8k 3 strict-match 5 exact_match 0.0068 ยฑ 0.0023
flexible-extract 5 exact_match 0.0167 ยฑ 0.0035

MMLU (0.26727368421052633, 0.004481878288705264)

Tasks Version Filter n-shot Metric Value Stderr
world_religions 0 none 5 acc 0.2515 ยฑ 0.0333
virology 0 none 5 acc 0.2470 ยฑ 0.0336
us_foreign_policy 0 none 5 acc 0.2600 ยฑ 0.0441
sociology 0 none 5 acc 0.2090 ยฑ 0.0287
security_studies 0 none 5 acc 0.4041 ยฑ 0.0314
public_relations 0 none 5 acc 0.2182 ยฑ 0.0396
professional_psychology 0 none 5 acc 0.2386 ยฑ 0.0172
professional_medicine 0 none 5 acc 0.4338 ยฑ 0.0301
professional_law 0 none 5 acc 0.2464 ยฑ 0.0110
professional_accounting 0 none 5 acc 0.2482 ยฑ 0.0258
prehistory 0 none 5 acc 0.2284 ยฑ 0.0234
philosophy 0 none 5 acc 0.2733 ยฑ 0.0253
nutrition 0 none 5 acc 0.2810 ยฑ 0.0257
moral_scenarios 0 none 5 acc 0.2268 ยฑ 0.0140
moral_disputes 0 none 5 acc 0.2572 ยฑ 0.0235
miscellaneous 0 none 5 acc 0.2146 ยฑ 0.0147
medical_genetics 0 none 5 acc 0.3300 ยฑ 0.0473
marketing 0 none 5 acc 0.1880 ยฑ 0.0256
management 0 none 5 acc 0.3107 ยฑ 0.0458
machine_learning 0 none 5 acc 0.1339 ยฑ 0.0323
logical_fallacies 0 none 5 acc 0.2638 ยฑ 0.0346
jurisprudence 0 none 5 acc 0.2315 ยฑ 0.0408
international_law 0 none 5 acc 0.3636 ยฑ 0.0439
human_sexuality 0 none 5 acc 0.2290 ยฑ 0.0369
human_aging 0 none 5 acc 0.2242 ยฑ 0.0280
high_school_world_history 0 none 5 acc 0.2700 ยฑ 0.0289
high_school_us_history 0 none 5 acc 0.3039 ยฑ 0.0323
high_school_statistics 0 none 5 acc 0.4259 ยฑ 0.0337
high_school_psychology 0 none 5 acc 0.3138 ยฑ 0.0199
high_school_physics 0 none 5 acc 0.2384 ยฑ 0.0348
high_school_microeconomics 0 none 5 acc 0.2395 ยฑ 0.0277
high_school_mathematics 0 none 5 acc 0.2963 ยฑ 0.0278
high_school_macroeconomics 0 none 5 acc 0.3410 ยฑ 0.0240
high_school_government_and_politics 0 none 5 acc 0.3627 ยฑ 0.0347
high_school_geography 0 none 5 acc 0.3131 ยฑ 0.0330
high_school_european_history 0 none 5 acc 0.2848 ยฑ 0.0352
high_school_computer_science 0 none 5 acc 0.2400 ยฑ 0.0429
high_school_chemistry 0 none 5 acc 0.2611 ยฑ 0.0309
high_school_biology 0 none 5 acc 0.3097 ยฑ 0.0263
global_facts 0 none 5 acc 0.2800 ยฑ 0.0451
formal_logic 0 none 5 acc 0.1825 ยฑ 0.0346
elementary_mathematics 0 none 5 acc 0.2646 ยฑ 0.0227
electrical_engineering 0 none 5 acc 0.2690 ยฑ 0.0370
econometrics 0 none 5 acc 0.2368 ยฑ 0.0400
conceptual_physics 0 none 5 acc 0.2979 ยฑ 0.0299
computer_security 0 none 5 acc 0.1900 ยฑ 0.0394
college_physics 0 none 5 acc 0.2549 ยฑ 0.0434
college_medicine 0 none 5 acc 0.2197 ยฑ 0.0316
college_mathematics 0 none 5 acc 0.2700 ยฑ 0.0446
college_computer_science 0 none 5 acc 0.2200 ยฑ 0.0416
college_chemistry 0 none 5 acc 0.3000 ยฑ 0.0461
college_biology 0 none 5 acc 0.2778 ยฑ 0.0375
clinical_knowledge 0 none 5 acc 0.3094 ยฑ 0.0285
business_ethics 0 none 5 acc 0.1800 ยฑ 0.0386
astronomy 0 none 5 acc 0.2697 ยฑ 0.0361
anatomy 0 none 5 acc 0.2593 ยฑ 0.0379
abstract_algebra 0 none 5 acc 0.2400 ยฑ 0.0429

Model Details

Model Description

This is the model card of a ๐Ÿค— transformers model that has been pushed on the Hub. This model card has been automatically generated.

  • Developed by: [More Information Needed]
  • Funded by [optional]: [More Information Needed]
  • Shared by [optional]: [More Information Needed]
  • Model type: [More Information Needed]
  • Language(s) (NLP): [More Information Needed]
  • License: [More Information Needed]
  • Finetuned from model [optional]: [More Information Needed]

Model Sources [optional]

  • Repository: [More Information Needed]
  • Paper [optional]: [More Information Needed]
  • Demo [optional]: [More Information Needed]

Uses

Direct Use

[More Information Needed]

Downstream Use [optional]

[More Information Needed]

Out-of-Scope Use

[More Information Needed]

Bias, Risks, and Limitations

[More Information Needed]

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]

Training Details

Training Data

[More Information Needed]

Training Procedure

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

  • Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: [More Information Needed]
  • Hours used: [More Information Needed]
  • Cloud Provider: [More Information Needed]
  • Compute Region: [More Information Needed]
  • Carbon Emitted: [More Information Needed]

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Downloads last month
-
Safetensors
Model size
0.3B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support