🛍️ RetailGPT Evaluator - AxionX Digital

Purpose: Evaluate and compare multiple retail QA models on the same dataset.

Includes

  • evaluate.py → runs metrics across multiple models
  • leaderboard.py → aggregates results into a ranking
  • app.py → Streamlit UI with leaderboard + live model chat
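
To illustrate the evaluate-then-rank flow described above, here is a minimal sketch. It is not the code in evaluate.py or leaderboard.py: the dataset, the exact-match metric, the stand-in models, and every function name (`exact_match`, `evaluate_model`, `build_leaderboard`) are assumptions made for this example only.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical toy dataset of (question, reference answer) pairs.
DATASET: List[Tuple[str, str]] = [
    ("What is the return window?", "30 days"),
    ("Do you ship internationally?", "Yes"),
    ("Which payment methods are accepted?", "Credit card and PayPal"),
]


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial differences do not count."""
    return " ".join(text.lower().split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 when the normalized strings are identical, else 0.0."""
    return 1.0 if normalize(prediction) == normalize(reference) else 0.0


def evaluate_model(model: Callable[[str], str],
                   dataset: List[Tuple[str, str]]) -> float:
    """Average exact-match score of one model over the whole dataset."""
    scores = [exact_match(model(question), ref) for question, ref in dataset]
    return sum(scores) / len(scores)


def build_leaderboard(models: Dict[str, Callable[[str], str]],
                      dataset: List[Tuple[str, str]]) -> List[Tuple[str, float]]:
    """Score every model on the same dataset and sort best-first."""
    results = {name: evaluate_model(fn, dataset) for name, fn in models.items()}
    return sorted(results.items(), key=lambda item: item[1], reverse=True)


# Stand-in "models": plain functions used only to make the sketch runnable.
def model_a(question: str) -> str:
    answers = dict(DATASET)  # looks up the reference answer directly
    return answers.get(question, "")


def model_b(question: str) -> str:
    return "yes"  # always gives the same answer


MODELS = {"model_b": model_b, "model_a": model_a}

leaderboard = build_leaderboard(MODELS, DATASET)
for rank, (name, score) in enumerate(leaderboard, start=1):
    print(f"{rank}. {name}: {score:.2f}")
```

Because every model is scored on the same question set with the same metric, the resulting scores are directly comparable. A real evaluator would likely report several metrics per model rather than a single exact-match score.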

Usage

!python retailgpt_evaluator/dataset_loader.py
!python retailgpt_evaluator/evaluate.py