Use constants from agent-eval for openness and tool usage (#116) f70f2d7 Running unverified Chloe Anastasiades commited on Aug 29
Jason/inttest and contact record improvements for reviewer (#97) 50aa233 unverified Jason commited on Aug 26
Add back brookes change for mobile header (#105) 37bf680 unverified Chloe Anastasiades commited on Aug 26
Get openness and tool usage names from the same place (#90) 8f044f3 unverified Chloe Anastasiades commited on Aug 25
Bold overall stat columns on main leaderboards (#94) ffbe9ce unverified danemery-ai2 commited on Aug 25
Instructions around pushing to the second leaderboard (#93) 0fc962d unverified Chloe Anastasiades commited on Aug 25
minor contact info dataset commit message adjustment, follow-on to #86 (#91) 16b3a18 unverified Jason commited on Aug 25
put submitter info in contact info commits for convenience (#86) a530bd4 unverified Jason commited on Aug 25
bump agent-eval version to pick up reasoning effort model name display thing (#88) dbeca22 unverified Chloe Anastasiades commited on Aug 23
Change name of LLM Base and adjust hover behavior (#85) 85744c7 unverified Amber Tanaka commited on Aug 22
Default value for cost divider line when no points have costs (#83) c039999 unverified Chloe Anastasiades commited on Aug 22
Disable HF account age requirement; submission fixes (#76) d60c9d9 unverified Jason commited on Aug 22
confirmation and error messages post submission (#74) 21516b7 unverified Smita R Smita commited on Aug 21
Update table legend to use new names + styling (#66) cdccabc unverified Amber Tanaka commited on Aug 18