Knowledge

Mosaic plot

Source 📝

43:, is a graphical visualization of data from two or more qualitative variables. It is the multidimensional extension of spineplots, which graphically display the same information for only one variable. It gives an overview of the data and makes it possible to recognize relationships between different variables. For example, independence is shown when the boxes across categories all have the same areas. Mosaic plots were introduced by Hartigan and Kleiner in 1981 and expanded on by Friendly in 1994. Mosaic plots are also called 20: 293:, it is not possible for the mosaic plot to plot a confidence interval. However, the tiles can be colored according to the standardized residual from a model of independence, so that cells with excessively large or small deviations are shaded to show those that are 'significant' and the pattern of association can be discerned. 255:
survival probability. The survival probability for females is seen to have been higher than that for men (marginalised over all classes). Similarly, a marginalization over gender identifies first-class passengers as most probable to survive. Overall, about 1/3 of all people survived (proportion of light gray areas).
246:
At the left edge of the first variable we first plot "Gender," meaning that we divide the data vertically in two blocks: the bottom blocks corresponds to females, while the upper (much larger) one to males. One immediately sees that roughly a quarter of the passengers were female and the remaining
242:
The categorical variables are first put in order. Then, each variable is assigned to an axis. In the table to the right, sequence and classification is presented for this data set. Another ordering will result in a different mosaic plot, i.e., the order of the variables is significant as for all
254:
The last variable ("Survived") is finally applied, this time along the left edge with the result highlighted by shade: dark grey rectangles represent people that did not survive the disaster, light grey ones people that did. Women in the first class are immediately seen to have had the highest
250:
One then applies the second variable "Class" to the top edge. The four vertical columns therefore mark the four values of that variable (1st, 2nd, 3rd, and crew). These columns are of variable thickness, because column width indicates the relative proportion of the corresponding value on the
55:
prints. However, in statistical applications, mosaic plots can be colored and shaded according to deviations from independence, whereas Marimekko charts are colored according to the category levels, as in the image.
251:
population. Crew plainly represents the largest male group, whereas third-class passengers are the largest female group. The number of female crew members is also seen to have been marginal.
282:
The areas of the rectangular tiles that are available for a combination of features are proportional to the number of observations that have this combination of features.
23:
Mosaic plot showing cross-sectional distribution through time of different musical themes in the Guardian's list of "1000 songs to hear before you die".
615: 261: 538: 457: 432: 405: 378: 352: 302:
The mosaic plot has been criticised for making the data hard to perceive and to compare visually, because the values correspond to areas.
63:
and spineplots, the area of the tiles, also known as the bin size, is proportional to the number of observations within that category.
276:
The plot is of at least two variables. There is no upper limit, but too many variables may be confusing in graphic form.
502: 450:
Discrete Data Analysis with R: Visualization and Modeling Techniques for Categorical and Count Data
474: 534: 453: 428: 422: 401: 395: 374: 368: 348: 342: 321: 528: 260: 370:
New Techniques and Technologies for Statistics II: Proceedings of the Second Bonn Seminar
609: 589:
Computer Science and Statistics: Proceedings of the 13th Symposium on the Interface
555: 60: 52: 311: 316: 290: 286: 73: 19: 279:
The number of observations is not limited, but not read in the image.
77:. There are 2201 observations and 3 variables. The variables are: 71:
An example of mosaic plots uses data from the passengers on the
530:
Interactive Graphics for Data Analysis: Principles and Examples
273:
The displayed variables are categorical or ordinal scales.
600:Journal of Computational and Graphical Statistics 87:did this person survive the sinking (yes / no)? 527:Martin Theus; Simon Urbanek (23 March 2011). 8: 448:Michael Friendly & David Meyer (2016). 84:the class (1st, 2nd and 3rd class, or crew) 373:. IOS Press. 1 January 1997. p. 254. 193: 497: 495: 90: 81:the gender of the person (male / female) 18: 341:Sandra D. Schlotzhauer (1 April 2007). 333: 596:A Brief History of the Mosaic Display 7: 397:SAS System for Statistical Graphics 394:Michael Friendly (1 January 1991). 421:SAS Institute (6 September 2013). 14: 556:"Design Example: Marimekko Chart" 475:"How to apply Marimekko to data" 473:Smith, Alan (6 September 2017). 427:. SAS Institute. pp. 251–. 400:. SAS Institute. pp. 512–. 259: 616:Statistical charts and diagrams 344:Elementary Statistics Using JMP 585:Mosaics for contingency tables 347:. SAS Institute. p. 407. 1: 583:John Hartigan, Beat Kleiner: 182: 179: 176: 173: 165: 162: 159: 156: 145: 142: 139: 136: 128: 125: 122: 119: 51:because they resemble some 632: 452:. Chapman & Hall/CRC. 285:Unlike, for example, the 150: 113: 190:Mosaic plot construction 41:percent stacked bar plot 24: 424:JMP 11 Basic Analysis 247:three quarters male. 22: 602:, 2002, 11, 89–107. 243:multivariate plots. 591:. 1981, S. 268–273. 594:Michael Friendly: 25: 16:Data visualization 540:978-1-4200-1106-7 503:"Marimekko Chart" 459:978-1-4987-2583-5 434:978-1-61290-684-3 407:978-1-55544-441-9 380:978-90-5199-326-4 354:978-1-59994-428-9 322:Contingency table 240: 239: 187: 186: 623: 571: 570: 568: 566: 551: 545: 544: 524: 518: 517: 515: 513: 499: 490: 489: 487: 485: 470: 464: 463: 445: 439: 438: 418: 412: 411: 391: 385: 384: 365: 359: 358: 338: 263: 194: 91: 631: 630: 626: 625: 624: 622: 621: 620: 606: 605: 580: 578:Further reading 575: 574: 564: 562: 560:Perceptual Edge 553: 552: 548: 541: 526: 525: 521: 511: 509: 501: 500: 493: 483: 481: 479:Financial Times 472: 471: 467: 460: 447: 446: 442: 435: 420: 419: 415: 408: 393: 392: 388: 381: 367: 366: 362: 355: 340: 339: 335: 330: 308: 300: 270: 265: 192: 69: 39:, or sometimes 33:Marimekko chart 17: 12: 11: 5: 629: 627: 619: 618: 608: 607: 604: 603: 592: 579: 576: 573: 572: 554:Few, Stephen. 546: 539: 519: 507:Mekko Graphics 491: 465: 458: 440: 433: 413: 406: 386: 379: 360: 353: 332: 331: 329: 326: 325: 324: 319: 314: 307: 304: 299: 296: 295: 294: 283: 280: 277: 274: 269: 266: 257: 238: 237: 234: 231: 227: 226: 223: 220: 216: 215: 212: 209: 205: 204: 201: 198: 191: 188: 185: 184: 181: 178: 175: 172: 168: 167: 164: 161: 158: 155: 152: 148: 147: 144: 141: 138: 135: 131: 130: 127: 124: 121: 118: 115: 111: 110: 107: 104: 101: 98: 95: 89: 88: 85: 82: 68: 65: 15: 13: 10: 9: 6: 4: 3: 2: 628: 617: 614: 613: 611: 601: 597: 593: 590: 586: 582: 581: 577: 561: 557: 550: 547: 542: 536: 533:. CRC Press. 532: 531: 523: 520: 508: 504: 498: 496: 492: 480: 476: 469: 466: 461: 455: 451: 444: 441: 436: 430: 426: 425: 417: 414: 409: 403: 399: 398: 390: 387: 382: 376: 372: 371: 364: 361: 356: 350: 346: 345: 337: 334: 327: 323: 320: 318: 315: 313: 310: 309: 305: 303: 297: 292: 288: 284: 281: 278: 275: 272: 271: 267: 264: 262: 256: 252: 248: 244: 235: 232: 229: 228: 224: 221: 218: 217: 213: 210: 207: 206: 202: 199: 196: 195: 189: 170: 169: 153: 149: 133: 132: 116: 112: 108: 105: 102: 99: 96: 93: 92: 86: 83: 80: 79: 78: 76: 75: 66: 64: 62: 57: 54: 50: 46: 42: 38: 34: 30: 21: 599: 595: 588: 584: 563:. Retrieved 559: 549: 529: 522: 510:. Retrieved 506: 484:27 September 482:. Retrieved 478: 468: 449: 443: 423: 416: 396: 389: 369: 363: 343: 336: 301: 258: 253: 249: 245: 241: 72: 70: 58: 49:Mekko charts 48: 44: 40: 36: 32: 28: 26: 512:13 December 225:Horizontal 37:Mekko chart 29:mosaic plot 565:30 October 328:References 268:Properties 106:3rd Class 103:2nd Class 100:1st Class 61:bar charts 298:Criticism 236:Vertical 214:Vertical 97:Survived 53:Marimekko 45:Marimekko 610:Category 312:Heat map 306:See also 233:Survived 200:Variable 59:As with 317:Treemap 291:QQ plot 287:boxplot 151:Female 94:Gender 74:Titanic 67:Example 587:. In: 537:  456:  431:  404:  377:  351:  211:Gender 222:Class 203:Axis 197:Order 114:Male 109:Crew 598:In: 567:2011 535:ISBN 514:2017 486:2019 454:ISBN 429:ISBN 402:ISBN 375:ISBN 349:ISBN 174:141 171:Yes 163:106 146:192 134:Yes 129:670 126:422 123:154 120:118 289:or 183:20 180:90 177:93 160:13 154:No 143:88 140:25 137:62 117:No 47:or 612:: 558:. 505:. 494:^ 477:. 230:3. 219:2. 208:1. 166:3 157:4 35:, 31:, 27:A 569:. 543:. 516:. 488:. 462:. 437:. 410:. 383:. 357:.

Index


Marimekko
bar charts
Titanic

boxplot
QQ plot
Heat map
Treemap
Contingency table
Elementary Statistics Using JMP
ISBN
978-1-59994-428-9
New Techniques and Technologies for Statistics II: Proceedings of the Second Bonn Seminar
ISBN
978-90-5199-326-4
SAS System for Statistical Graphics
ISBN
978-1-55544-441-9
JMP 11 Basic Analysis
ISBN
978-1-61290-684-3
ISBN
978-1-4987-2583-5
"How to apply Marimekko to data"


"Marimekko Chart"
Interactive Graphics for Data Analysis: Principles and Examples
ISBN

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.