Knowledge

Talk:Multiple comparisons problem

Source šŸ“

1128:
certain defined conditions"). Also, the first category (methods that work "under any conditions") is so narrowly defined that it excludes basically any standard method, as there are pretty much always statistical assumptions that are required. And the second category (methods that work "except under certain defined conditions") is so vague as to be essentially meaningless. Also, some of the info here is false. For example, Tukey's range test is NOT an omnibus test; and a method can require a significant omnibus test and still provide strong control (e.g., Hayter's procedure). Also, the list of categories ignores procedures that are designed to control the false discovery rate, rather than the familywise error rate. It would be better to link to the familywise error rate article than attempt to cram a bunch of detail in here.
656:, by Westfall, Tobias, and Wolfinger (all respected names in the field), the authors say "the term 'multiple testing' is more common than 'multiple comparisons' when analyzing modern high-dimensional data." They make a fairly loose distinction between "multiple comparisons" and "multiple testing," saying that "multiple comparisons" is "often" used for comparisons of several treatment means, whereas "multiple testing" is used in "a broader class of applications." But certainly if one substituted nonparametric tests (in which case the comparisons wouldn't be of means), there would still be "multiple comparisons" in the literal sense. In fact, even multiple correlation tests can be considered multiple "comparisons," as each test is effectively a comparison of the correlation to zero. 630:
of P-values is the most common method for doing this, but not the only one. Multiple comparisons are the special cases where the tests are comparisons between groups, typically several pairwise comparisons on a set of groups (e.g. all-against-all or one-against-all); although Bonferroni correction is still valid, it is typically not very powerful as it does not make use of the dependencies between different pairwise tests performed on the same groups. I'll see if I can make an update to the multiple testing page at some point which will make this difference more clear. --
645:, Shaffer, 1995). In various instances over the decades, people have sometimes used the term "multiple comparisons" more narrowly (referring to ANOVA-type contexts), but I disagree with Septagon's statement that "multiple comparisons" can only be used in the context of group comparisons. For example, in a within-subjects (repeated measures) design, there aren't separate groups, but there could still be "multiple comparisons" if there are more than one factor or more than two factor levels or more than one outcome variable. 74: 53: 1122:
except under certain defined conditions. Methods which rely on an omnibus test before proceeding to multiple comparisons. Typically these methods require a significant ANOVA, MANOVA, or Tukey's range test. These methods generally provide only "weak" control of Type I error, except for certain numbers of hypotheses. Empirical methods, which control the proportion of Type I errors adaptively, utilizing correlation and distribution characteristics of the observed data.
795:"If the inferences are hypothesis tests rather than confidence intervals, the same issue arises. With just one test performed at the 5% level, there is only a 5% chance of incorrectly rejecting the null hypothesis if the null hypothesis is true. However, for 100 tests where all null hypotheses are false, the expected number of incorrect rejections is 5. If the tests are independent, the probability of at least one incorrect rejection is 99.4%. These errors are called 1178:
distributional assumptions" is vague and likely unclear to many readers. I suggest either being more explicit (e.g., "The Bonferroni procedure does not require any assumptions about the dependence (correlation) of the tests, and does not impose any added assumptions about the distributions of the test statistics") or simply linking to the Bonferroni correction article without going into the issue of assumptions here.
148: 371:", then indeed we are not talking about multiple testing in general (which covers more generally the problems of "using statistical tests repeatedly" as indicated in the intro). Do you agreeĀ ? In this case, most of the discussion could go in a more general article on multiple testing, that I will be happy to start. Unfortunately, all the definitions I have been able to find so far for 22: 1187:
pros and cons, and some procedures are more generalized or specialized than others. Mentioning Bonferroni makes sense (though we don't need to do so multiple times in the article), since that's the simplest method. Why don't we just mention Bonferroni and note that there are other procedures, rather than going into detail about two particular alternatives?
540:
or wilcoxon, it is more likely having false positives (the coin example in the article). In this way, we would be accepting a false hipotesis for example saying that A has the same mean that D. For this reason, we have tests designed to avoid this: ANOVA (parametric), Friedman (nonparametric), others??...
890:
The table is confusing, mostly because there is no clear relation between rows and columns. I believe "Declared significant" means "Researchers believe alternative hypothesis to be true", and "Declared non-significant" means "Researchers believe null hypothesis to be true". This relabeling might make
487:
about multiple testing (starting with Benjamini-Hochberg, as mentioned above). This is probably why I easily believed that multiple comparisons was the special case, but it may be only a bias. In any case, if the consensus is that multiple testing==multiple comparison (hopefully other people will say
1186:
That method (i.e., the Å idĆ”k procedure) is indeed marginally less conservative than Bonferroni, but it's actually marginally liberal when the tests are negatively dependent. Also, focusing on the Å idĆ”k and Holm procedures is abritrary. Numerous proceudres have been developed. Each procedures has its
1078:
In some situations, the Bonferroni correction is substantially conservative, i.e., the actual family-wise error rate is much less than the prescribed level Ī±. This occurs when the test statistics are highly dependent (in the extreme case where the tests are perfectly dependent, the family-wise error
629:
should remain different pages. The difference is not merely UK/US or terminology, but one of essential difference. Multiple testing, or multiple hypotheses testing, is the general problem of testing several null hypotheses while controlling the overall chance of false positive; Bonferroni correction
1121:
Methods where total alpha can be proved to never exceed 0.05 (or some other chosen value) under any conditions. These methods provide "strong" control against Type I error, in all conditions including a partially correct null hypothesis. Methods where total alpha can be proved not to exceed 0.05
543:
After performing ANOVA or Friedman, we only know that for example H0: A = B = C = D is not true. Then we would probably want to know which one is different from the others. For this purpose, we can apply one of the techniques that allow us to _compare_ every pair: Tukey test, Nemenyi, Bonferroni...
539:
With these definitions, I think that making _multiple test_ is repeating a test some times. An example, if we want to test if A is better than B (or equal, or whatever). After that we got C and we want to test A vs C, and B vs C. Then comes D and I want A-D, B-D, C-D... If we do that, with a t-test
482:
conception, I was ok with the distinction and spinned-off the multiple testing article, which no one objected about. This is why I am a bit puzzled about the going back. I wonder if there may be a systematic difference in vocabulary between statisticians working in different fields; the statistical
1177:
Not necessarily. There are procedures that can be more conservative than Bonferroni in some cases (e.g., the Benjaminiā€“Yekutieli procedure and the ScheffĆ© procedure). Also, note that we don't need to mention the Bonferroni procedure both here and in the next section. Also, "free of dependence and
1132:
The advent of computerized resampling methods, such as bootstrapping and Monte Carlo simulations, has given rise to many techniques in the latter category. In some cases where exhaustive permutation resampling is performed, these tests provide exact, strong control of Type I error rates; in other
456:
address the multiple comparisons problem. They alas do not have an entry on "multiple testing". It seems like the article text (parts not by me) and all of the statistical tests linked below that I am familiar with address "multiple comparisons" as the problem is conceived by me and Mathworld. Is
1137:
That may be true (though no source is provided). But again, there is an entire wiki article on the familywise error rate, which covers methods of control. Why not just link to that article, instead of going into detail in this article which is supposed to be about the general problem of multiple
565:
Multiple Hypotheses: the analysis of variance. This chapter introduces the problem of repeating the same test to over different samples to confirm various hypothesis over them (coin example). Then explains ANOVA and their non-paramentric extensions like Kruskal Walls and points to chapter 14 for
434:
really refers to the application of multiple statistical tests, whatever they are. The google searches do not tell us if the two terms have the exact same meaning (some of the links for multiple comparisons point towards the ANOVA question only; some talk about the more general problem). For the
1127:
That entire list is unsourced, and there is no particular reason to divide multiple testing procedures into those particular categories. For one thing, these categories are not mutually exclusive (e.g., methods that provide weak control are a subset of methods that provide control "except under
611:
I'm late to this discussion, but I've never heard of multiple testing, until just now. I use multiple comparisons as a term all the time (especially Bonferroni and friends). Could it be a UK/US thing, or a case where SPSS has dictated the vocabulary to the world? Otherwise, I think the time for
1065:
Not true. For instance, you could use sequential testing or a closed testing procedure. Also, the familywise error rate is only one "total error rate" that has been defined. It would be better to mention the familywise error rate and false discovery rate, and link to the wiki articles on those
886:
The section comes at an important point in the article and risks throwing the reader off-track. The section throws a lot of variables at the reader, forcing him or her to ponder a complicated table just to come up with some pretty intuitive and obvious ideas, such as that the number of false
640:
I'm late to the party here, but "multiple comparisons" and "multiple testing" are typically considered as fairly interchangeable terms these days. One of the most respected experts on the topic is J.P. Shaffer, who said: "The term 'multiple comparisons' has come to be used synonymously with
937:
The article is not structured well, has many holes, and even contains some wrong (or at least misleading) statements. It might be better to scrap it altogether and refer to a well-crafted review paper instead. Sorry about this negative rating but I feel the need to say it as it is.
547:
The previous could clearly split article in two, but probably I have left other ideas, like those about techniques to repeat a test in order to increase power that I do not know of. I think we should clearify which contents do we want here before deciding about one or two articles.
688:
I agree. I don't know what the sentence "The experimentwise Ī± level increases exponentially as the number of comparisons increases." means. What is an Ī± level, or where do I go to look it up? Not really a field I know that much about, so I look forward to a more clear article. --
981:
My issues with this article are: 1) the controlling procedures section is a repetitive since its merge with multiple testing correction so that needs to be condensed and cleaned up 2) the large-scale multiple testing section is largely missing citations for its claims
1108:
Because simple techniques such as the Bonferroni method can be conservative, there has been a great deal of attention paid to developing better techniques, such that the overall rate of false positives can be maintained without excessively inflating the rate of false
346:
use this term (I just added a sentence on this on the microarray page). Given that noone else has answered my suggestion (thanks for jumping inĀ !), I will not move the page, but I will indicate clearly that multiple testing is (also) what we are talking about here.
277:
Any outcome of tossing a fair coin 10 times has probibility of (1/2) ^ 10. There is only one way to get 10 heads and then 10 ways of getting 9 heads (once for each of the 10 throws being a tail e.g. first is a tail or third is a tail) hence (1 + 10) * (1/2).
648:"Multiple comparisons" is an older term than "multiple testing," so it is not surprising that it shows up more in searches. But "multiple testing" is now very much a standard, well accepted term in the field. For instance, there is a well known text called 392:. I have used it myself, and had reviewers of my own research papers claim I need to do "multiple comparisons corrections." Also, google returns 177,000 hits for the search bonferroni+"multiple comparisons", and 48,000 for bonferroni+"multiple testing". 1005:), would still need to have a source brought for it and deleted otherwise. Well here are the sources, you could just google "MCP conference XXX" (when XXX is the year of the confernce) and get these as first results, but I'll list those results anyway: 843: 809:
that is, we reject it when we think is false (based on the alpha level). In this case the problem arise because we reject it even if it is true. Sorry if I misunderstood- non-statistician here.Diego Diez 13:22, 23 September 2010 (UTC)
457:
your conception of multiple comparisons (i.e. the ANOVA f-test) a specific example of multiple testing/my conception of multiple comparisons? I wonder if we aren't just running into a linguistic rather than content-based hurdle here.
1147:
If m independent comparisons are performed, the family-wise error rate (FWER), is given by Ī±ĀÆ = 1 āˆ’ ( 1 āˆ’ Ī±)^m . Hence, unless the tests are perfectly positively dependent (i.e., identical), Ī±ĀÆ increases as the number of comparisons
850: 152: 1056:
Not clear what "recalculating probabilities" means. And "repeated multiple times" doesn't make sense in this context. Multiple testing doesn't mean the same test is repeated; it means multiple different tests are conducted.
1083:
Too much detail about one specific procedure (the Bonferroni procedure) in one implausible situation (perfect dependence). Why not just briefly mention the Bonferroni procedure and link to the wiki article on that topic?
488:
something), then the first priority would be to merge the other article, instead of rewriting it (although it may be too late). As a side note, at least some of the linked articles are indeed specially related to ANOVA.
882:
I suggest a re-thinking of the purpose and goal of this section. The main problem is that nothing in this section appears elsewhere in the article, so it should be deleted if no more work is done to make it useful.
408:
multiple comparison problem; indeed, most of the procedures linked from this page are specifically about "comparing sets of means", as was the lead section. Based on this, the article has been split between
1152:
The second statement, though true, doesn't follow from the first, so why "Hence?" Also, FWER is only one error rate that's been defined; what about the per-family error rate and the false discovery rate?
659:
To avoid confusion, maybe the best approach for this article is treat "multiple comparisons" as indeed equivalent to "multiple testing," but mention that the term has sometimes been defined more narrowly.
959:: Joseph P. Romano & Azeem M. Shaikh & Michael Wolf, 2009. "Hypothesis testing in econometrics," IEW - Working Papers 444, Institute for Empirical Research in Economics - University of Zurich. 426:
the general problem, then the two pages should be merged ā€” and this article should be cleaned up. But I must say that I like the split approach, and it seems logical: with the ANOVA, you are really
124: 1161:"We can still say?" I think what was meant was that, regardless of the dependence of the tests, Ī±ĀÆ ā‰¤ mĪ±percomparison., and the more positively dependent the tests, the more Ī±ĀÆ shrinks toward Ī±. 1169:
That "example" has no explanatory value. What's the point of simply plugging arbitrary values into the formula without providing explanation that connects those values to some useful context?
301:(cf also the Benjamini & Hochberg paper cited at end of the page), so I would be tempted to suggest to move the article to this name, but it may simply be a bias on my side. Any opinionĀ ? 466:
Basically, I first thought it was only a linguistic question when I started this discussion a few months ago. It is only based on the comments above (it was mentioned in particular that
1088:
For example, in fMRI analysis, tests are done on over 100,000 voxels in the brain. The Bonferroni method would require p-values to be smaller than .05/100000 to declare significance.
744:
There is a section in the middle of the article that is repeated word-for-word later in the article. Please fix. Thanx. --Cromwellt 5PM, 16 Feb 2007 (having login problems)
575:
Multiple comparisons. This chapter explains how the comparisons among pairs of the samples tested in an ANOVA test should be done and different test for comparisons like Tukey.
678:
I think that the lead section should be a little more accessable. The big picture in plain language. There is plenty of room for the subtulties of the concept further down.
530:
Disclaimer: I don't know if I am biased by may concrete problem, as I am not statistician, neither I am english native speaker, but I'll try to help. According to dictionary:
478:
the same thing), and the content of the article that I assumed that multiple comparisons (i.e. the ANOVA f-test) was a specific example of multiple testing ā€” while it was not
1061:
In order to retain a prescribed family-wise error rate Ī± in an analysis involving more than one comparison, the error rate for each comparison must be more stringent than Ī±.
256:
Can you add a reference of some kindĀ ? Maybe looks obvious to a statistician, but not so much for me. And I'm a scientistĀ ! Would be very helpful for the general reader.
939: 338:, but I'd be interested to know of any reference that documents the meaning that you indicated above. As I wrote above, I have read mainly papers that use the terminology 775:
may be problematic as they restore claims about Bonferroni that are uncited and have already been called into dispute. Can some of the other editors weigh in on this?
1182:
A marginally less conservative correction can be obtained by solving the equation for the family-wise error rate of m independent comparisons for Ī±percomparison.
702:
I agree as well, and will take a crack at an edit with a more accessible intro, taking into account the current trent in multiple comparisons v testing (above).
536:
Comparison n. 1. The act of comparing; an examination of two or more objects with the view of discovering the resemblances or differences; relative estimate.
1230: 114: 168: 1235: 497:
Yeah, it does seem like we need to rope in some other comments. I'll ask around if anyone has the time to comment on it. Maybe you can do the same?
439:
for the general problem, including in reviews of research papers. Hey, the only paper in the bibliography of the article that mention anything says
747: 342:, but this may be a bias among researchers specialised in a specific area. For example, almost all the literature on the statistical analysis of 517: 356:
Ok, after rereading the article again, it seems to me that it is very confusing in its present stateĀ ! If one reads only the first sentence,
279: 90: 1225: 242: 201: 1202: 940:
http://www.dictionaryofeconomics.com/article?id=pde2010_M000425&edition=current&q=romano%20wolf&topicid=&result_number=1
751: 660: 1070:
Boole's inequality implies that if each of m tests is performed to have type I error rate Ī±/m, the total error rate will not exceed Ī±.
1052:
Multiple testing correction refers to re-calculating probabilities obtained from a statistical test which was repeated multiple times.
826: 263: 582:
In the start of chapter 11: 'The term "multpliple comparisons" was introduced by D. B. Duncan in 1951', according to (David 1995).
1092:
fMRI analysis is often done on over 100,000 voxels, but not always. Also, this is not the best example, since using p-values for
910: 81: 58: 734: 641:'simultaneous inference,' even when the inferences do not deal with comparisons" (from "Multiple hypothesis testing: A review," 946:
We hope Wikipedians on this talk page can take advantage of these comments and improve the quality of the article accordingly.
401:
Sorry, I am lost here; please see the last comment I have written above. It seemed clear to me that this article was about the
594: 151:
This article is or was the subject of a Wiki Education Foundation-supported course assignment. Further details are available
894:
So the question comes down to: what do you want to teach the reader? Currently, the section teaches nothing worthwhile.
33: 1173:
The most conservative method, which is free of dependence and distributional assumptions, is the Bonferroni correction
1096:
in fMRI (as described in the cited sources) isn't quite the same as using p-values in standard null hypothesis tests.
971: 949:
We believe Dr. Wolf has expertise on the topic of this article, since he has published relevant scholarly research:
867: 726:
which I believe is in the public domain, as NIST is a US government agency. In fact I made a template for this:
419:(the general problem). The (good) changes you have made are about the general problem. If the consensus is that 967: 887:
positives and true positives add up to the number of discoveries. Do we need all this quasi-math to know that?
283: 246: 587:
H. A. David First (?) occurrence of common terms in mathematical statistics. Amer. Statist. 49: 121-133, 1995.
205: 1206: 822: 755: 664: 267: 226: 652:, which is authored by and edited by some of the most respected statisticians in the area. In the manual 191:"the likelihood that a fair coin would come up heads at least 9 out of 10 times is 11 * (Ā½)10 = 0.0107." 1157:
If we do not assume that the comparisons are independent, then we can still say: Ī±ĀÆ ā‰¤ m ā‹… Ī±percomparison
987: 859: 39: 818: 364:: if, by multiple comparison, this article means basically "what you do after obtaining a significant 73: 52: 1198: 1001:
I must say it is really ridiculous that something that can be found so easily using a search engine (
906: 902: 898: 814: 420: 259: 197: 21: 963: 840: 724: 435:
record, even though it is not relevant to this particular discussion, I have mostly seen the term
89:
on Knowledge. If you would like to participate, please visit the project page, where you can join
779: 323: 222: 194:
Can someone please explain where the 11 came from? My own understanding is that it should be a 10
176: 612:
merger may be here. I'll plan to do it in a couple of days, if I don't here from anyone else. -
1100:
Since adjacent voxels tend to be highly correlated, this threshold is generally too stringent.
1034: 591: 235:
P(Heads at least 9 out of 10 times)= P(Heads 10 out of 10 times) + P(Heads 9 out of 10 times)
983: 727: 415: 160: 156: 1210: 1038: 991: 914: 872: 782: 759: 706: 693: 690: 682: 668: 634: 616: 602: 552: 524: 501: 492: 461: 447: 396: 379: 351: 326: 305: 287: 271: 250: 230: 209: 180: 1079:
rate with no multiple comparisons adjustment and the per-test error rates are identical).
453: 796: 613: 343: 1219: 776: 703: 631: 498: 458: 393: 172: 1191:
Continuous generalizations of the Bonferroni and Å idĆ”k correction are presented in.
1030: 679: 731: 599: 549: 489: 444: 376: 348: 302: 147: 318:
to me sound like simultaneous testing of multiple null hypotheses, and that is
854:
to publish a user-friendly guide (and also a workbook) on multiple comparisons
521: 86: 297:
I have always heard of the problems explained in the article under the name
221:
And "likelihood" is the wrong word here; I've changed it to "probability".
1133:
cases, such as bootstrap sampling, they provide only approximate control.
1024: 1019: 1104:
It's POV to say the Bonferroni threshold is "generally too stringent."
1113:
It's POV to refer to other techniques being "better" than Bonferroni.
1014: 1009: 930:, and provided us with the following comments to improve its quality: 558:
Revising some bibliografy, in (Zar, 1999) these are chapters 10 an 11:
405: 368: 802:
shouldn't be "However, for 100 tests where all null hypotheses are
590:
Jerrol H. Zar, Biostatistica Analysis, 4th ed. Prentice-Hall 1999,
788:
Multiple comparisons for confidence intervals and hypothesis tests
402: 365: 720: 388:
I have to agree with Michael Hardy that the term often used is
15: 1044:
Multiple comparisons correction section is highly problematic
533:
Testing n. 1. The act of testing or proving; trial; proof.
878:
Unhelpful section: "Classification of m hypothesis tests"
856:. These sources should be authoritative and mainstream. 566:
other techniques with more than one factor ex. Friedman.
927: 772: 806:, the expected number of incorrect rejections is 5."? 650:
Multiple Testing Problems in Pharmaceutical Statistics
142:
Wiki Education Foundation-supported course assignment
1117:
Such methods can be divided into general categories:
85:, a collaborative effort to improve the coverage of 1165:
Example: 0.2649 = 1 āˆ’ ( 1 āˆ’ .05 ) 6 ā‰¤ .05 Ɨ 6 = 0.3
714: 654:
Multiple Comparisons and Multiple Testing Using SAS
1142:Problems with the "controlling procedures" section 849:. They have also worked with the wizards at the 1074:That sentence is grammatically indecipherable. 935: 8: 375:blur the distinction with multiple testing. 216:It says "at least nine", not "exactly nine". 1003:as I mentioned in the original edit summary 715:Tukey's Studentized Range Test/Distribution 443:, and it is about the general problemĀ ;-). 1196: 293:Multiple comparisons or multiple testingĀ ? 47: 169:Template:Dashboard.wikiedu.org assignment 921: 167:Above undated message substituted from 49: 19: 1195:That's not even a complete sentence. 518:Knowledge talk:WikiProject Mathematics 7: 79:This article is within the scope of 1231:High-importance Statistics articles 719:There is a nice summary of this by 38:It is of interest to the following 922:Dr. Wolf's comment on this article 238:= 1/2^{10} + choose(10,1) 1/2^{10} 14: 146: 99:Knowledge:WikiProject Statistics 72: 51: 20: 1236:WikiProject Statistics articles 483:papers I have seen so far were 119:This article has been rated as 102:Template:WikiProject Statistics 972:22:42, 24 September 2016 (UTC) 306:21:00, 21 September 2005 (UTC) 1: 783:15:38, 13 November 2007 (UTC) 760:23:03, 16 February 2007 (UTC) 617:03:59, 6 September 2006 (UTC) 231:05:57, 31 December 2008 (UTC) 210:05:07, 31 December 2008 (UTC) 93:and see a list of open tasks. 1039:17:43, 19 October 2016 (UTC) 992:03:18, 18 October 2016 (UTC) 694:07:13, 15 October 2005 (UTC) 380:00:53, 16 October 2005 (UTC) 352:00:03, 16 October 2005 (UTC) 327:22:09, 15 October 2005 (UTC) 322:. So I oppose such a move. 181:04:36, 17 January 2022 (UTC) 1226:C-Class Statistics articles 873:10:13, 24 August 2011 (UTC) 707:08:10, 3 January 2006 (UTC) 683:01:59, 8 October 2005 (UTC) 643:Annual Review of Psychology 635:23:43, 7 January 2007 (UTC) 525:15:08, 5 January 2006 (UTC) 502:05:20, 4 January 2006 (UTC) 493:17:35, 3 January 2006 (UTC) 462:16:54, 3 January 2006 (UTC) 448:15:30, 3 January 2006 (UTC) 397:08:09, 3 January 2006 (UTC) 314:is what I've always heard; 1252: 915:11:36, 22 April 2014 (UTC) 603:11:35, 23 April 2006 (UTC) 553:18:59, 20 April 2006 (UTC) 360:is indeed not the same as 288:14:40, 24 March 2010 (UTC) 272:09:48, 31 March 2009 (UTC) 251:19:14, 26 March 2024 (UTC) 1211:16:49, 27 June 2021 (UTC) 669:17:37, 27 June 2021 (UTC) 334:is exactly what you call 118: 67: 46: 735:18:57, 15 May 2006 (UTC) 943: 926:Dr. Wolf has reviewed 846:Tamhane & Hochberg 454:Bonferroni corrections 430:a set of means, while 82:WikiProject Statistics 28:This article is rated 750:comment was added by 187:Coin flip calculation 155:. Student editor(s): 792:The paragraph says: 627:multiple comparisons 472:multiple comparisons 452:Mathworld says that 421:multiple comparisons 411:multiple comparisons 390:multiple comparisons 373:multiple comparisons 358:multiple comparisons 336:multiple comparisons 312:Multiple comparisons 928:this Knowledge page 891:the table clearer. 841:multiple comparison 105:Statistics articles 320:not the same topic 159:. Peer reviewers: 153:on the course page 34:content assessment 1213: 1201:comment added by 918: 901:comment added by 871: 831: 817:comment added by 763: 413:(this page), and 262:comment added by 200:comment added by 139: 138: 135: 134: 131: 130: 1243: 997:sources for edit 917: 895: 870: 864: 857: 839:I recommend the 830: 811: 745: 623:Multiple testing 468:multiple testing 441:multiple testing 437:multiple testing 416:multiple testing 362:multiple testing 340:multiple testing 332:multiple testing 316:multiple testing 299:multiple testing 274: 241:= 11 * 1/2^{10} 212: 183: 150: 125:importance scale 107: 106: 103: 100: 97: 76: 69: 68: 63: 55: 48: 31: 25: 24: 16: 1251: 1250: 1246: 1245: 1244: 1242: 1241: 1240: 1216: 1215: 1144: 1125: 1046: 1025:2017 conference 1020:2015 conference 1015:2013 conference 1010:2011 conference 999: 979: 924: 896: 880: 860: 858: 837: 835:Further reading 812: 797:false positives 790: 769: 746:ā€”The preceding 742: 717: 676: 609:Strong Support. 295: 280:213.173.165.162 257: 195: 189: 166: 144: 121:High-importance 104: 101: 98: 95: 94: 62:Highā€‘importance 61: 32:on Knowledge's 29: 12: 11: 5: 1249: 1247: 1239: 1238: 1233: 1228: 1218: 1217: 1143: 1140: 1120: 1045: 1042: 1028: 1027: 1022: 1017: 1012: 998: 995: 978: 975: 964:ExpertIdeasBot 961: 960: 952: 945: 933: 923: 920: 879: 876: 836: 833: 789: 786: 768: 765: 741: 738: 716: 713: 712: 711: 710: 709: 697: 696: 675: 672: 638: 637: 606: 605: 597: 588: 584: 583: 579: 578: 577: 576: 570: 569: 568: 567: 560: 559: 528: 527: 516:Try asking at 513: 512: 511: 510: 509: 508: 507: 506: 505: 504: 386: 385: 384: 383: 382: 344:DNA microarray 294: 291: 254: 253: 243:128.231.234.23 239: 236: 233: 219: 217: 202:67.169.127.132 188: 185: 143: 140: 137: 136: 133: 132: 129: 128: 117: 111: 110: 108: 91:the discussion 77: 65: 64: 56: 44: 43: 37: 26: 13: 10: 9: 6: 4: 3: 2: 1248: 1237: 1234: 1232: 1229: 1227: 1224: 1223: 1221: 1214: 1212: 1208: 1204: 1203:23.242.195.76 1200: 1193: 1192: 1188: 1184: 1183: 1179: 1175: 1174: 1170: 1167: 1166: 1162: 1159: 1158: 1154: 1150: 1149: 1141: 1139: 1138:comparisons? 1135: 1134: 1129: 1124: 1119: 1118: 1114: 1111: 1110: 1105: 1102: 1101: 1097: 1095: 1090: 1089: 1085: 1081: 1080: 1075: 1072: 1071: 1067: 1063: 1062: 1058: 1054: 1053: 1049: 1048:For example: 1043: 1041: 1040: 1036: 1032: 1026: 1023: 1021: 1018: 1016: 1013: 1011: 1008: 1007: 1006: 1004: 996: 994: 993: 989: 985: 976: 974: 973: 969: 965: 958: 955: 954: 953: 950: 947: 942: 941: 934: 931: 929: 919: 916: 912: 908: 904: 900: 892: 888: 884: 877: 875: 874: 869: 865: 863: 855: 853: 852:SAS Institute 848: 847: 842: 834: 832: 828: 824: 820: 816: 807: 805: 800: 798: 793: 787: 785: 784: 781: 778: 774: 766: 764: 761: 757: 753: 752:67.142.130.42 749: 739: 737: 736: 733: 729: 725: 722: 708: 705: 701: 700: 699: 698: 695: 692: 687: 686: 685: 684: 681: 673: 671: 670: 666: 662: 661:23.242.195.76 657: 655: 651: 646: 644: 636: 633: 628: 624: 621: 620: 619: 618: 615: 614:Scott Alberts 610: 604: 601: 598: 596: 593: 589: 586: 585: 581: 580: 574: 573: 572: 571: 564: 563: 562: 561: 557: 556: 555: 554: 551: 545: 541: 537: 534: 531: 526: 523: 519: 515: 514: 503: 500: 496: 495: 494: 491: 486: 481: 477: 473: 469: 465: 464: 463: 460: 455: 451: 450: 449: 446: 442: 438: 433: 429: 425: 422: 418: 417: 412: 407: 404: 400: 399: 398: 395: 391: 387: 381: 378: 374: 370: 367: 363: 359: 355: 354: 353: 350: 345: 341: 337: 333: 330: 329: 328: 325: 324:Michael Hardy 321: 317: 313: 310: 309: 308: 307: 304: 300: 292: 290: 289: 285: 281: 275: 273: 269: 265: 261: 252: 248: 244: 240: 237: 234: 232: 228: 224: 223:Michael Hardy 220: 218: 215: 214: 213: 211: 207: 203: 199: 192: 186: 184: 182: 178: 174: 170: 164: 162: 158: 154: 149: 141: 126: 122: 116: 113: 112: 109: 92: 88: 84: 83: 78: 75: 71: 70: 66: 60: 57: 54: 50: 45: 41: 35: 27: 23: 18: 17: 1197:ā€”Ā Preceding 1194: 1190: 1189: 1185: 1181: 1180: 1176: 1172: 1171: 1168: 1164: 1163: 1160: 1156: 1155: 1151: 1146: 1145: 1136: 1131: 1130: 1126: 1123: 1116: 1115: 1112: 1107: 1106: 1103: 1099: 1098: 1094:thresholding 1093: 1091: 1087: 1086: 1082: 1077: 1076: 1073: 1069: 1068: 1064: 1060: 1059: 1055: 1051: 1050: 1047: 1029: 1002: 1000: 980: 962: 956: 951: 948: 944: 936: 932: 925: 897:ā€”Ā Preceding 893: 889: 885: 881: 861: 851: 845: 838: 819:Kurai yousei 808: 803: 801: 794: 791: 770: 743: 718: 677: 674:Lead section 658: 653: 649: 647: 642: 639: 626: 622: 608: 607: 546: 542: 538: 535: 532: 529: 484: 479: 475: 471: 467: 440: 436: 431: 427: 423: 414: 410: 389: 372: 361: 357: 339: 335: 331: 319: 315: 311: 298: 296: 276: 264:146.50.10.49 255: 193: 190: 165: 145: 120: 80: 40:WikiProjects 984:Jbrowning17 813:ā€”Preceding 258:ā€”Preceding 196:ā€”Preceding 161:Jbrowning17 157:Jbrowning17 1220:Categories 1148:increases. 1109:negatives. 957:Reference 903:AndrewOram 773:reversions 740:Redundancy 595:013081542X 96:Statistics 87:statistics 59:Statistics 977:Criticism 868:Wolfowitz 428:comparing 1199:unsigned 1066:topics. 911:contribs 899:unsigned 844:book by 827:contribs 815:unsigned 748:unsigned 704:Debivort 632:Septagon 499:Debivort 459:Debivort 394:Debivort 260:unsigned 198:unsigned 173:PrimeBOT 1031:Orielno 771:Recent 728:NIST-PD 680:ike9898 432:testing 123:on the 30:C-class 862:Kiefer 780:Bivort 732:Btyner 600:Arauzo 550:Arauzo 490:Schutz 445:Schutz 406:F-test 377:Schutz 369:F-test 349:Schutz 303:Schutz 36:scale. 522:linas 474:were 403:ANOVA 366:ANOVA 1207:talk 1035:talk 988:talk 968:talk 907:talk 823:talk 804:true 756:talk 721:NIST 691:Jake 665:talk 625:and 592:ISBN 470:and 284:talk 268:talk 247:talk 227:talk 206:talk 177:talk 115:High 799:." 767:RVs 723:at 485:all 476:not 171:by 1222:: 1209:) 1037:) 990:) 970:) 913:) 909:ā€¢ 829:) 825:ā€¢ 777:de 758:) 730:. 667:) 520:. 480:my 424:is 286:) 270:) 249:) 229:) 208:) 179:) 163:. 1205:( 1033:( 986:( 966:( 905:( 866:. 821:( 762:. 754:( 663:( 282:( 266:( 245:( 225:( 204:( 175:( 127:. 42::

Index


content assessment
WikiProjects
WikiProject icon
Statistics
WikiProject icon
WikiProject Statistics
statistics
the discussion
High
importance scale

on the course page
Jbrowning17
Jbrowning17
Template:Dashboard.wikiedu.org assignment
PrimeBOT
talk
04:36, 17 January 2022 (UTC)
unsigned
67.169.127.132
talk
05:07, 31 December 2008 (UTC)
Michael Hardy
talk
05:57, 31 December 2008 (UTC)
128.231.234.23
talk
19:14, 26 March 2024 (UTC)
unsigned

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.

ā†‘