Add historical data visualization features

#7
by badaoui HF Staff - opened

I made some changes to improve the CI dashboard, building on Manuel's work on adding historical data graphs. Here’s the link: https://huggingface.co/spaces/badaoui/Tcid

What I added / improved:

  • First Seen Date of failing test : in order to provide immediate context on which PRs might have caused the failure.
  • “Failing Models” toggle : Shows only failing models. You can also filter by hardware (AMD or NVIDIA) to display failing models for a specific platform.
  • New Regression detection : Compares the set of failed tests from the latest run against the previous run. Any test failing today but not yesterday is flagged as a regression and needs immediate attention.
  • Fixed historical failure rates graph : Now matches the main dashboard calculation by including ERROR tests in failures and excluding SKIPPED tests from the total. Formula is now (failed + error) / (passed + failed + error) instead of failed / (passed + failed + skipped).
  • Navigation fixes : Resolved small issues when browsing pages (e.g., clicking on a model → history view → Summary → history view now correctly shows the summary page).
  • Improved visualizations : Switched to Plotly for better graphs.
  • Added logos
badaoui changed pull request status to open
Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment