Evaluation Batches

NameStatusProgressCreated
LLM Social Reasoning with Learning Benchmark (Feb/Mar 2026)completed210/21004/03/2026, 22:50:17
LLM Social Reasoning Benchmark (Feb/Mar 2026)completed210/21028/02/2026, 13:31:01