Data

LLM’s play Prisoner’s Dilemma: smaller models achieve higher rating [OC]

August 10, 2025

View 4 Comments

4 Comments

highlyeducated_idiot on August 10, 2025 4:10 pm

Do you have any insight into why smaller models might perform better in this test?
Ok-Commercial-924 on August 10, 2025 4:22 pm

Where is the key? Am I supposed to guess what the colors mean? The labels are illegible. They may be readable on a desktop, but on a mobile device (60% of reddit users), they are a blur.
shiny_thing on August 10, 2025 5:01 pm

Did models retain state between matches? If not, then there’s no point in actually doing a round robin, just get a sample from each model to estimate defect/cooperate rate. That’s enough to let you compute the expected scores.

The nature of the game means that the rating would be a function of the portion of cooperating peers, so it seems like ELO says more about the selection of the pool rather than general “strength” of a model.

I’d be interested in seeing results for an iterated prisoners dilemma.

I’m terms of the presentation itself, the “clustered by variant” isn’t great since it’s unclear how much data is being hidden. I wonder if a scatterplot of model size vs ELO / model size vs cooperation rate would be better. Points colored by model name.
MyPunsSuck on August 10, 2025 5:11 pm

> and also subsequently perform worse

By what definition of “perform”? LLMs are not designed to optimize short-term gains in thought experiments – they are designed to mimic what a *human* would say when given the same prompt. As models get better, they more accurately mimic what a human would say. Evidently, the humans in their training data would choose not to defect

Tags

LLM’s play Prisoner’s Dilemma: smaller models achieve higher rating [OC]

4 Comments