Republic of Agents

An experimental benchmark that evaluates the social capabilities of reasoning models and the emergent dynamics they exhibit: collaboration, deception, and coalition building.