Thursday, January 28, 2010

Prisoner's Dillemma

So I was reading about the Prisoner's Dillema on Wikipedia (yes, I am a nerd and spending my free time reading about economics. but, in my defense, it is very cold in Seoul right now and I really can't be asked to leave my apartment.)

Anyhow, aside from all the game theory jargon and random tables, its actually quite interesting. The basic issue is what makes us fail to cooperate even when it would be in our mutual self-interest (think the arms race or bipartisan bickering on heathcare reform.)

Wiki breaks it down for us:

The prisoner's dilemma is a fundamental problem in game theory that demonstrates why two people might not cooperate even if it is in both their best interests to do so. In its classical form, the prisoner's dilemma ("PD") is presented as follows:

Two suspects are arrested by the police. The police have insufficient evidence for a conviction, and, having separated both prisoners, visit each of them to offer the same deal. If one testifies (defects from the other) for the prosecution against the other and the other remains silent (cooperates with the other), the betrayer goes free and the silent accomplice receives the full 10-year sentence. If both remain silent, both prisoners are sentenced to only six months in jail for a minor charge. If each betrays the other, each receives a five-year sentence. Each prisoner must choose to betray the other or to remain silent. Each one is assured that the other would not know about the betrayal before the end of the investigation. How should the prisoners act?
Where it gets interesting is that some guy had a competition. The challenge was to make a computer program that would do the best in repeated cycles of game play. He matches these up against eachother, and as it turns out sharking everyone you can find is not the best strategy if you want to "win" the game.

The programs that were entered varied widely in algorithmic complexity, initial hostility, capacity for forgiveness, and so forth.

When these encounters were repeated over a long period of time with many players, each with different strategies, greedy strategies tended to do very poorly in the long run while more altruistic strategies did better, as judged purely by self-interest.

The best deterministic strategy was found to be tit for tat, which Anatol Rapoport developed and entered into the tournament. It was the simplest of any program entered, containing only four lines of BASIC, and won the contest. The strategy is simply to cooperate on the first iteration of the game; after that, the player does what his or her opponent did on the previous move. Depending on the situation, a slightly better strategy can be "tit for tat with forgiveness." When the opponent defects, on the next move, the player sometimes cooperates anyway, with a small probability (around 1%-5%). This allows for occasional recovery from getting trapped in a cycle of defections.

By analysing the top-scoring strategies, Axelrod stated several conditions necessary for a strategy to be successful.

Nice
The most important condition is that the strategy must be "nice", that is, it will not defect before its opponent does (this is sometimes referred to as an "optimistic" algorithm). Almost all of the top-scoring strategies were nice; therefore a purely selfish strategy will not "cheat" on its opponent, for purely utilitarian reasons first.

Retaliating
However, Axelrod contended, the successful strategy must not be a blind optimist. It must sometimes retaliate. An example of a non-retaliating strategy is Always Cooperate. This is a very bad choice, as "nasty" strategies will ruthlessly exploit such players.

Forgiving
Successful strategies must also be forgiving. Though players will retaliate, they will once again fall back to cooperating if the opponent does not continue to defect. This stops long runs of revenge and counter-revenge, maximizing points.

Non-envious
The last quality is being non-envious, that is not striving to score more than the opponent (impossible for a ‘nice’ strategy, i.e., a 'nice' strategy can never score more than the opponent).

Actually, sounds like decent life advice. It's not exactly a turn the other cheek approach but it seems fair. Plus, I love when the rules of the game make sure the real assholes lose. =)


Side note: did anyone catch the state of the union? thoughts?

1 comment:

Daan said...

Yes - I saw it. There was lots to like and a few things I felt uneasy about. Generally really liked it. The part about student loan forgiveness was really moving. Sometimes it feels like the gov't forgets us college grads and it made me think of a few people who would really benefit.