OpenEvals

community

AI & ML interests

LLM evaluation

Recent Activity

SaylorTwift updated a Space about 3 hours ago

OpenEvals/open_benchmark_index

SaylorTwift updated a Space about 4 hours ago

OpenEvals/evals

SaylorTwift published a Space about 4 hours ago

OpenEvals/evals

Articles

Gaia2 and ARE: Empowering the community to study agents

OpenEvals's collections (5)