Knowledge (XXG)

One-shot deviation principle

Source šŸ“

95:(SPE) if and only if there exist no profitable single deviation for each subgame and every player. In simpler terms, if no player can increase their expected payoff by deviating from their original strategy via a single action (in just one stage of the game), then the strategy profile is an SPE. In other words, no player can profit by deviating from the strategy in one period and then reverting to the strategy. 171:
observe the outcome of the first stage, one of four action combinations), and 16 in the third stage (4 times 4 histories of action combinations from the first two stages). The single-deviation condition requires checking each of these information sets, asking in each case whether the expected payoff of the player on the move would strictly increase by deviating at only this information set.
22: 170:
with two players in which each player makes binary choice decisions, A or B, in each of three stages. In each stage, the players observe the choices made in the previous stages (if any). Note that each player has 21 information sets, one in the first stage, four in the second stage (because players
98:
Furthermore, the one-shot deviation principle is very important for infinite horizon games, in which the principle typically does not hold, since it is not plausible to consider an infinite number of strategies and payoffs in order to solve. In an infinite horizon game where the
288: 40: 289:
https://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-254-game-theory-with-engineering-applications-spring-2010/lecture-notes/MIT6_254S10_lec15.pdf
260: 202: 138:
in the subgame. In a finite multi-stage game with observed actions, this analysis is equivalent to looking at single deviations from
58: 103:
is less than 1, a strategy profile is a subgame perfect equilibrium if and only if it satisfies the one-shot deviation principle.
92: 146:
differs from s at only one information set (in a single stage). Note that the choices associated with
306: 88: 80: 256: 198: 231: 154:
are the same at all nodes that are successors of nodes in the information set where s and
100: 167: 300: 84: 276: 236: 219: 275:
Obara, I. (2012). Subgame Perfect Equilibrium . Slide 13. Retrieved from
118:
is a subgame perfect Nash equilibrium, we have to ask every player
111:
The following is the paraphrased definition from Watson (2013).
287:
Ozdaglar, A. (2010). Repeated Games . Slide 13. Retrieved from
15: 87:. It says that a strategy profile of a finite multi-stage 255:(6. printing. ed.). Cambridge, Mass. : MIT Press. 36: 197:. New York: W. W. Norton & Company. p. 194. 31:
may be too technical for most readers to understand
130:that yields a strictly higher payoff for player 8: 277:http://www.econ.ucla.edu/iobara/SPE201B.pdf 235: 59:Learn how and when to remove this message 43:, without removing the technical details. 195:Strategy: An Introduction to Game Theory 180: 251:Tirole, Jean; Fudenberg, Drew (1991). 41:make it understandable to non-experts 7: 188: 186: 184: 79:) is the principle of optimality of 122:and every subgame, if considering 14: 224:Annals of Mathematical Statistics 220:"Discounting Dynamic Programming" 20: 158:prescribe different actions. 1: 73:one-shot deviation principle 93:subgame perfect equilibrium 91:with observed actions is a 323: 114:To check whether strategy 218:Blackwell, David (1965). 77:single-deviation property 237:10.1214/aoms/1177700285 126:, there is a strategy 193:Watson, Joel (2013). 89:extensive-form game 81:dynamic programming 262:978-0-262-06141-4 69: 68: 61: 314: 291: 285: 279: 273: 267: 266: 248: 242: 241: 239: 215: 209: 208: 190: 64: 57: 53: 50: 44: 24: 23: 16: 322: 321: 317: 316: 315: 313: 312: 311: 297: 296: 295: 294: 286: 282: 274: 270: 263: 250: 249: 245: 217: 216: 212: 205: 192: 191: 182: 177: 164: 109: 101:discount factor 75:(also known as 65: 54: 48: 45: 37:help improve it 34: 25: 21: 12: 11: 5: 320: 318: 310: 309: 299: 298: 293: 292: 280: 268: 261: 243: 210: 204:978-0393123876 203: 179: 178: 176: 173: 168:symmetric game 163: 160: 108: 105: 67: 66: 28: 26: 19: 13: 10: 9: 6: 4: 3: 2: 319: 308: 305: 304: 302: 290: 284: 281: 278: 272: 269: 264: 258: 254: 247: 244: 238: 233: 229: 225: 221: 214: 211: 206: 200: 196: 189: 187: 185: 181: 174: 172: 169: 161: 159: 157: 153: 149: 145: 141: 137: 133: 129: 125: 121: 117: 112: 106: 104: 102: 96: 94: 90: 86: 82: 78: 74: 63: 60: 52: 42: 38: 32: 29:This article 27: 18: 17: 283: 271: 252: 246: 227: 223: 213: 194: 165: 155: 151: 147: 143: 139: 135: 131: 127: 123: 119: 115: 113: 110: 97: 76: 72: 70: 55: 46: 30: 307:Game theory 253:Game theory 230:: 226ā€“235. 166:Consider a 107:Definitions 85:game theory 83:applied to 175:References 142:, meaning 134:than does 49:March 2014 301:Category 162:Example 35:Please 259:  201:  257:ISBN 199:ISBN 150:and 71:The 232:doi 39:to 303:: 228:36 226:. 222:. 183:^ 156:sā€™ 152:sā€™ 144:sā€™ 128:sā€™ 265:. 240:. 234:: 207:. 148:s 140:s 136:s 132:i 124:s 120:i 116:s 62:) 56:( 51:) 47:( 33:.

Index

help improve it
make it understandable to non-experts
Learn how and when to remove this message
dynamic programming
game theory
extensive-form game
subgame perfect equilibrium
discount factor
symmetric game



ISBN
978-0393123876
"Discounting Dynamic Programming"
doi
10.1214/aoms/1177700285
ISBN
978-0-262-06141-4
http://www.econ.ucla.edu/iobara/SPE201B.pdf
https://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-254-game-theory-with-engineering-applications-spring-2010/lecture-notes/MIT6_254S10_lec15.pdf
Category
Game theory

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.

ā†‘