Superrationality - Knowledge (XXG)

329:: an eccentric trillionaire contacts 20 people, and tells them that if one and only one of them send him or her a telegram (assumed to cost nothing) by noon the next day, that person will receive a billion dollars. If they receive more than one telegram or none at all, no one will get any money, and communication between players is forbidden. In this situation, the superrational thing to do (if it is known that all 20 are superrational) is to send a telegram with probability p=1/20—that is, each recipient essentially rolls a 269:

other player to cooperate, but rather the same logic leads to the same answer independent of communication or cause and effect. This debate is over whether it is reasonable for human beings to act in a superrational manner, not over what superrationality means, and is similar to arguments about whether it is reasonable for humans to act in a 'rational' manner, as described by game theory (wherein they can figure out what other players will or have done by asking themselves, what would I do if I was them, and applying

719: 36: 297:, it is superrational to cooperate with probability 1 even when mixed strategies are admitted, because the average payoff when one player cooperates and the other defects are the same as when both cooperate and so defecting increases the risk of both defecting, which decreases the expected payout. But in some cases, the superrational strategy is mixed. 261:, which defines a stable strategy as one where no player can improve the payoffs by unilaterally changing course. The superrational equilibrium in a symmetric game is one where all the players' strategies are forced to be the same before the maximization step. (There is no agreed-upon extension of the concept of superrationality to asymmetric games.) 176:

is usually framed in terms of jail sentences for criminals, but it can be stated equally well with cash prizes instead. Two players are each given the choice to cooperate (C) or to defect (D). The players choose without knowing what the other is going to do. If both cooperate, each will get $ 100. If

268:

in which each player supposes that their decision to cooperate will cause the other player to cooperate, even though there is no communication. Hofstadter points out that the concept of "choice" doesn't apply when the player's goal is to figure something out, and that the decision does not cause the

256:

Although standard game theory assumes common knowledge of rationality, it does so in a different way. The game-theoretic analysis maximizes payoffs by allowing each player to change strategies independently of the others, even though in the end, it assumes that the answer in a symmetric game will be

168:

The idea of superrationality is that two logical thinkers analyzing the same problem will think of the same correct answer. For example, if two people are both good at math and both have been given the same complicated problem to do, both will get the same right answer. In math, knowing that the two

246:

knowing what the strategy will be. The strategy is found by maximizing the payoff to each player, assuming that they all use the same strategy. Since the superrational player knows that the other superrational player will do the same thing, whatever that might be, there are only two choices for two

321:

So that defecting has a huge reward, the superrational strategy is defecting with a probability of 499,900/999,899 or a little over 49.995%. As the reward increases to infinity, the probability only approaches 1/2 further, and the losses for adopting the simpler strategy of 1/2 (which are already

116:) but assume that all other players are superrational too and that a superrational individual will always come up with the same strategy as any other superrational thinker when facing the same problem. Applying this definition, a superrational player playing against a superrational opponent in a 247:

superrational players. Both will cooperate or both will defect depending on the value of the superrational answer. Thus the two superrational players will both cooperate since this answer maximizes their payoff. Two superrational players playing this game will each walk away with $ 100.

341:; if an individual player sends telegrams they have a chance of receiving money, but if they send no telegrams they cannot get anything. (If all telegrams were guaranteed to arrive, they would only send one, and no one would expect to get any money). 336:

Notice though that this is not the solution in the conventional game-theoretical analysis. Twenty game-theoretically rational players would each send in telegrams and therefore receive nothing. This is because sending telegrams is the

322:

minimal) approach 0. In a less extreme example, if the payoff for one cooperator and one defector was $ 400 and $ 0, respectively, the superrational mixed strategy world be defecting with probability 100/299 or about 1/3.

237:

The conclusion is that the rational thing to do is to defect. This type of reasoning defines game-theoretic rationality and two game-theoretic rational players playing this game both defect and receive a dollar each.

241:

Superrationality is an alternative method of reasoning. First, it is assumed that the answer to a symmetric problem will be the same for all the superrational players. Thus the sameness is taken into account

252:

A superrational player playing against a player of uncertain superrationality will sometimes defect and sometimes cooperate, based on the probability of the other player being superrational.

149:

one. Hofstadter provided this definition: "Superrational thinkers, by recursive definition, include in their calculations the fact that they are in a group of superrational thinkers."

169:

answers are going to be the same doesn't change the value of the problem, but in the game theory, knowing that the answer will be the same might change the answer itself.

250:

Note that a superrational player playing against a game-theoretic rational player will defect, since the strategy only assumes that the superrational players will agree.

177:

they both defect, they each get $ 1. If one cooperates and the other defects, then the defecting player gets $ 150, while the cooperating player gets nothing.

618: 274: 349:

The question of whether to cooperate in a one-shot Prisoner's Dilemma in some circumstances has also come up in the decision theory literature sparked by

425: 674: 635: 361:

endorses lines of reasoning similar to superrationality and recommends cooperation in a Prisoner's Dilemma against a similar opponent.

453: 79: 46: 611: 325:

In similar situations with more players, using a randomising device can be essential. One example discussed by Hofstadter is the

869: 814: 156:", the superrational thinker will not always play the equilibrium that maximizes the total social utility and is thus not a 864: 289:: the possibility that the best choice could be to flip a coin, or more generally to choose different outcomes with some 553: 604: 333:

and only sends a telegram if it comes up "1". This maximizes the probability that exactly one telegram is received.

61: 376: 358: 57: 752: 381: 704: 644: 354: 718: 396: 294: 173: 117: 859: 809: 684: 386: 350: 820: 757: 694: 581:

Oesterheld, Caspar; Treutlein, Johannes; Grosse, Roger; Conitzer, Vincent; Foerster, Jakob (2023).

560:(2014). "Robust Cooperation in the Prisoner's Dilemma: Program Equilibrium via Provability Logic". 525: 430: 401: 391: 364: 270: 109: 747: 664: 627: 561: 486: 443: 137: 132: 227:

Assuming the other player defects, if I cooperate I get nothing and if I defect I get a dollar.

835: 803: 762: 557: 449: 338: 233:

So whatever the other player does, my payoff is increased by defecting, if only by one dollar.

772: 534: 478: 326: 265: 258: 153: 777: 742: 142: 654: 286: 230:

Assuming the other player cooperates, I get $ 100 if I cooperate and $ 150 if I defect.

146: 853: 124: 782: 157: 482: 330: 290: 128: 97: 17: 767: 93: 737: 120:

will cooperate while a rationally self-interested player would defect.

538: 490: 113: 596: 504:

Brams, Steven J. (1975). "Newcomb's Problem and Prisoners' Dilemma".

426:"Dilemmas for Superrational Thinkers, Leading Up to a Luring Lottery" 64:. Statements consisting only of original research should be removed. 582: 587:

Proceedings of the Neural Information Processing Systems (NeurIPS)

566: 523:

Howard, J.V. (May 1988). "Cooperation in the Prisoner's Dilemma".

285:

For simplicity, the foregoing account of superrationality ignored

180:

The four outcomes and the payoff to each player are listed below.

469:

Lewis, David (1979). "Prisoners' Dilemma is a Newcomb Problem".

600: 367:

has been proposed as a mechanistic model of superrationality.

29: 257:

the same for all. This is the definition of a game-theoretic

223:

One valid way for the players to reason is as follows:

53: 840:

By Hofstadter and the Fluid Analogies Research Group

357:

suggests that superrationality is irrational, while

796: 726: 634: 556:; Fallenstein, B.; Herreshoff, M.; LaVictoire, P.; 264:Some argue that superrationality implies a kind of 300:For example, if the payoffs in are as follows: 612: 8: 275:iterated elimination of dominated strategies 619: 605: 597: 583:"Similarity-based Cooperative Equilibrium" 565: 80:Learn how and when to remove this message 182: 413: 100:, a participant is considered to have 675:Fluid Concepts and Creative Analogies 7: 419: 417: 345:Formalizations and related concepts 145:different from the widely accepted 141:as an alternative type of rational 506:The Journal of Conflict Resolution 25: 448:. Basic Books. pp. 737–755. 424:Hofstadter, Douglas (June 1983). 135:in his article, series, and book 717: 34: 27:Trait of a player in game theory 815:Indiana University Bloomington 1: 471:Philosophy and Public Affairs 127:is not a mainstream model in 442:Hofstadter, Douglas (1985). 483:10.1093/0195036468.003.0011 60:the claims made and adding 886: 377:Evidential decision theory 359:evidential decision theory 834:Edited by Hofstadter and 830: 715: 112:(and thus maximize their 382:Instrumental convergence 281:Probabilistic strategies 106:renormalized rationality 163: 870:Rational choice theory 753:Hofstadter's butterfly 355:Causal decision theory 705:Surfaces and Essences 152:Unlike the supposed " 131:and was suggested by 865:Behavioral economics 810:Egbert B. Gebstadter 685:Le Ton beau de Marot 312:DC – $ 1,000,000/$ 0 309:CD – $ 0/$ 1,000,000 196:Player A cooperates 821:Victim of the Brain 695:I Am a Strange Loop 645:Gödel, Escher, Bach 526:Theory and Decision 431:Scientific American 402:Program equilibrium 392:Perfect rationality 365:Program equilibrium 188:Player B cooperates 154:reciprocating human 110:perfect rationality 665:Metamagical Themas 628:Douglas Hofstadter 539:10.1007/BF00148954 445:Metamagical Themas 397:Prisoner's dilemma 295:prisoner's dilemma 271:backward induction 174:prisoner's dilemma 164:Prisoner's dilemma 138:Metamagical Themas 133:Douglas Hofstadter 118:prisoner's dilemma 45:possibly contains 847: 846: 836:Daniel C. Dennett 804:Robert Hofstadter 763:Hofstadter points 387:Newcomb's problem 351:Newcomb's problem 339:dominant strategy 221: 220: 209:Player A defects 191:Player B defects 90: 89: 82: 47:original research 16:(Redirected from 877: 788:Superrationality 773:Platonia dilemma 758:Hofstadter's law 721: 710: 700: 690: 680: 670: 660: 650: 621: 614: 607: 598: 591: 590: 578: 572: 571: 569: 549: 543: 542: 520: 514: 513: 501: 495: 494: 466: 460: 459: 440:– reprinted in: 439: 421: 327:platonia dilemma 306:CC – $ 100/$ 100 287:mixed strategies 266:magical thinking 259:Nash equilibrium 253: 204:Player B: $ 150 183: 102:superrationality 85: 78: 74: 71: 65: 62:inline citations 38: 37: 30: 21: 885: 884: 880: 879: 878: 876: 875: 874: 850: 849: 848: 843: 826: 792: 778:Six nines in pi 743:BlooP and FlooP 731: 729: 722: 713: 708: 698: 688: 678: 668: 658: 648: 630: 625: 595: 594: 580: 579: 575: 551: 550: 546: 522: 521: 517: 503: 502: 498: 468: 467: 463: 456: 441: 423: 422: 415: 410: 373: 347: 283: 251: 213: 212:Player A: $ 150 203: 166: 143:decision making 108:) if they have 86: 75: 69: 66: 51: 39: 35: 28: 23: 22: 15: 12: 11: 5: 883: 881: 873: 872: 867: 862: 852: 851: 845: 844: 842: 841: 838: 831: 828: 827: 825: 824: 817: 812: 807: 800: 798: 794: 793: 791: 790: 785: 780: 775: 770: 765: 760: 755: 750: 745: 740: 734: 732: 727: 724: 723: 716: 714: 712: 711: 701: 691: 681: 671: 661: 651: 640: 638: 632: 631: 626: 624: 623: 616: 609: 601: 593: 592: 573: 554:Christiano, P. 544: 533:(3): 203–213. 515: 496: 477:(3): 235–240. 461: 454: 412: 411: 409: 406: 405: 404: 399: 394: 389: 384: 379: 372: 369: 346: 343: 319: 318: 317: 316: 313: 310: 307: 282: 279: 235: 234: 231: 228: 219: 218: 215: 210: 206: 205: 202:Player A: $ 0 200: 199:Both get $ 100 197: 193: 192: 189: 186: 165: 162: 158:philanthropist 147:game-theoretic 88: 87: 42: 40: 33: 26: 24: 14: 13: 10: 9: 6: 4: 3: 2: 882: 871: 868: 866: 863: 861: 858: 857: 855: 839: 837: 833: 832: 829: 823: 822: 818: 816: 813: 811: 808: 805: 802: 801: 799: 795: 789: 786: 784: 781: 779: 776: 774: 771: 769: 766: 764: 761: 759: 756: 754: 751: 749: 746: 744: 741: 739: 736: 735: 733: 725: 720: 707: 706: 702: 697: 696: 692: 687: 686: 682: 677: 676: 672: 667: 666: 662: 657: 656: 652: 647: 646: 642: 641: 639: 637: 633: 629: 622: 617: 615: 610: 608: 603: 602: 599: 588: 584: 577: 574: 568: 563: 559: 558:Yudkowsky, E. 555: 548: 545: 540: 536: 532: 528: 527: 519: 516: 512:(4): 596–612. 511: 507: 500: 497: 492: 488: 484: 480: 476: 472: 465: 462: 457: 455:0-465-04566-9 451: 447: 446: 437: 433: 432: 427: 420: 418: 414: 407: 403: 400: 398: 395: 393: 390: 388: 385: 383: 380: 378: 375: 374: 370: 368: 366: 362: 360: 356: 352: 344: 342: 340: 334: 332: 328: 323: 314: 311: 308: 305: 304: 303: 302: 301: 298: 296: 292: 288: 280: 278: 276: 272: 267: 262: 260: 254: 248: 245: 239: 232: 229: 226: 225: 224: 217:Both get $ 1 216: 214:Player B: $ 0 211: 208: 207: 201: 198: 195: 194: 190: 187: 185: 184: 181: 178: 175: 170: 161: 159: 155: 150: 148: 144: 140: 139: 134: 130: 126: 125:decision rule 121: 119: 115: 111: 107: 103: 99: 95: 84: 81: 73: 63: 59: 55: 49: 48: 43:This article 41: 32: 31: 19: 18:Superrational 819: 787: 783:Strange loop 728:Concepts and 703: 693: 683: 673: 663: 655:The Mind's I 653: 643: 586: 576: 552:Barasz, M.; 547: 530: 524: 518: 509: 505: 499: 474: 470: 464: 444: 435: 429: 363: 348: 335: 331:20-sided die 324: 320: 315:DD – $ 1/$ 1 299: 284: 263: 255: 249: 243: 240: 236: 222: 179: 171: 167: 151: 136: 122: 105: 101: 91: 76: 70:January 2019 67: 44: 860:Game theory 291:probability 129:game theory 98:game theory 854:Categories 408:References 54:improve it 768:MU puzzle 567:1401.5577 293:. In the 94:economics 58:verifying 806:(father) 738:Ambigram 730:projects 371:See also 797:Related 748:Copycat 491:2265034 114:utility 52:Please 709:(2013) 699:(2007) 689:(1997) 679:(1995) 669:(1985) 659:(1981) 649:(1979) 489: 452: 244:before 636:Books 562:arXiv 487:JSTOR 123:This 450:ISBN 438:(6). 273:and 172:The 104:(or 96:and 535:doi 479:doi 436:248 277:). 92:In 56:by 856:: 585:. 531:24 529:. 510:19 508:. 485:. 473:. 434:. 428:. 416:^ 353:. 160:. 620:e 613:t 606:v 589:. 570:. 564:: 541:. 537:: 493:. 481:: 475:8 458:. 83:) 77:( 72:) 68:( 50:. 20:)

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.

Index