Commit History

Use constants from agent-eval for openness and tool usage (#116)
f70f2d7
unverified

Chloe Anastasiades committed on

fix 24h submission rate check (#114)
8b707ca
unverified

Jason committed on

Add redirect script on submit (#115)
b392800
unverified

Amber Tanaka committed on

say .tar.gz (#113)
0438ff0
unverified

Jason committed on

Update submission receipt text (#112)
b47a3c3
unverified

Jason committed on

Disable share button on diagram (#111)
7830648
unverified

Amber Tanaka committed on

Add diagram take 2 (#110)
a0bf2a2
unverified

Amber Tanaka committed on

Jason/inttest and contact record improvements for reviewer (#97)
50aa233
unverified

Jason committed on

Put graph note in graph column (#109)
ba77e6d
unverified

danemery-ai2 committed on

remove spacing in the nav buttons (#107)
3e2a645
unverified

Smita R Smita committed on

Fix spacing for the home page (#108)
d26a835
unverified

Chloe Anastasiades committed on

Remove future diagram text (#106)
1875c3b
unverified

Chloe Anastasiades committed on

Add back brookes change for mobile header (#105)
37bf680
unverified

Chloe Anastasiades committed on

Add back submit page tooltips (#104)
4e01af2
unverified

Chloe Anastasiades committed on

Add hover state to nav items (#100)
9a90879
unverified

danemery-ai2 committed on

Add favicon and title (#96)
198c409
unverified

Cecile Nguyen committed on

Get openness and tool usage names from the same place (#90)
8f044f3
unverified

Chloe Anastasiades committed on

Jason/a few fixes for whatever.hf.space direct url (#95)
3317293
unverified

Jason committed on

Bold overall stat columns on main leaderboards (#94)
ffbe9ce
unverified

danemery-ai2 committed on

Instructions around pushing to the second leaderboard (#93)
0fc962d
unverified

Chloe Anastasiades committed on

minor contact info dataset commit message adjustment, follow-on to #86 (#91)
16b3a18
unverified

Jason committed on

Add terms and conditions modal (#92)
0aeab65
unverified

Amber Tanaka committed on

put submitter info in contact info commits for convenience (#86)
a530bd4
unverified

Jason committed on

bump agent-eval version to pick up reasoning effort model name display thing (#88)
dbeca22
unverified

Chloe Anastasiades committed on

update links (#87)
e31d809
unverified

Amber Tanaka committed on

Change name of LLM Base and adjust hover behavior (#85)
85744c7
unverified

Amber Tanaka committed on

Fix the table legend and tooltips (#84)
268d785
unverified

Amber Tanaka committed on

Default value for cost divider line when no points have costs (#83)
c039999
unverified

Chloe Anastasiades committed on

increase file size to 5GB (#80)
2eee49c
unverified

Jason committed on

Add border to plot legend (#81)
5015459
unverified

danemery-ai2 committed on

Bug Bash Fixes (#79)
8bd1c00
unverified

Amber Tanaka committed on

Disable HF account age requirement; submission fixes (#76)
d60c9d9
unverified

Jason committed on

update styling of links (#77)
011f79c
unverified

Smita R Smita committed on

Fix submission field bug (#78)
ad0be25
unverified

danemery-ai2 committed on

Add Date to table (#75)
941eea2
unverified

Amber Tanaka committed on

Submission page revamp (#73)
20c57a4
unverified

danemery-ai2 committed on

confirmation and error messages post submission (#74)
21516b7
unverified

Smita R Smita committed on

About Page Adjustments (#72)
482c591
unverified

Amber Tanaka committed on

Make tooltips more dynamic (#70)
28bc4e5
unverified

Amber Tanaka committed on

Bump to agenteval version 0.1.38 (#71)
4d6710e
unverified

Chloe Anastasiades committed on

Remove claude preferences from codebase (#68)
4279ed8
unverified

Jason committed on

Fix legend rendering bug (#69)
eeb1750
unverified

danemery-ai2 committed on

Refactoring intro paragraph / layout (#67)
b077021
unverified

Amber Tanaka committed on

Custom plot legends (#62)
17162c9
unverified

danemery-ai2 committed on

Jason/submit only to submissions repo (#65)
8939028
unverified

Jason committed on

Update table legend to use new names + styling (#66)
cdccabc
unverified

Amber Tanaka committed on

benchmark descriptions and styling (#59)
ac15cf4
unverified

Smita R Smita committed on

Paper cuts (#64)
7b52df4
unverified

Amber Tanaka committed on

Jason/dataset cfg (#54)
0dd7833
unverified

Jason committed on

turn on results table filter (#60)
cbcb51a
unverified

Jason committed on