Analysis of variance: Difference between revisions

From formulasearchengine
Jump to navigation Jump to search
en>Rjwilmsi
m Journal cites, added 1 DOI using AWB (9887)
en>Michael Hardy
No edit summary
 
(One intermediate revision by one other user not shown)
Line 1: Line 1:
{{Use dmy dates|date=June 2013}}
My name is Minerva Lienhop. I life in Vitrolles (France).<br><br>Here is my site: [http://hemorrhoidtreatmentfix.com/hemorrhoid-symptoms symptoms of hemorrhoids]
'''Analysis of variance''' ('''ANOVA''') is a collection of [[statistical model]]s used to analyze the differences between group means and their associated procedures (such as "variation" among and between groups). In ANOVA setting, the observed [[variance]] in a particular variable is partitioned into components attributable to different sources of variation. In its simplest form, ANOVA provides a [[statistical test]] of whether or not the [[mean]]s of several groups are equal, and therefore generalizes the [[Student's t-test#Independent two-sample t-test|''t''-test]] to more than two groups. Doing multiple two-sample t-tests would result in an increased chance of committing a [[type I error]]. For this reason, ANOVAs are useful in comparing (testing) three or more means (groups or variables) for statistical significance.
 
==Motivating example==
[[File:ANOVA no fit.jpg|thumb|No fit at all]][[File:ANOVA fair fit.jpg|thumb|Fair fit]][[File:ANOVA very good fit.jpg|thumb|Very good fit]]The analysis of variance can be used as an exploratory tool to explain observations.  A dog show provides an example.  A dog show is not a random sampling of the breed.  It is typically limited to dogs that are male, adult, pure-bred and exemplary.  A histogram of dog weights from a show might plausibly be rather complex, like the yellow-orange distribution shown in the illustrations.  An attempt to explain the weight distribution by dividing the dog population into groups (young vs old)(short-haired vs long-haired) would probably be a failure (no fit at all). The groups (shown in blue) have a large variance and the means are very close.  An attempt to explain the weight distribution by (pet vs working breed)(less athletic vs more athletic) would probably be somewhat more successful (fair fit).  The heaviest show dogs are likely to be big strong working breeds.  An attempt to explain weight by breed is likely to produce a very good fit.  All Chihuahuas are light and all St Bernards are heavy.  The difference in weights between Setters and Pointers does not justify separate breeds.  The analysis of variance provides the formal tools to justify these intuitive judgments.  A common use of the method is the analysis of experimental data or the development of models.  The method has some advantages over correlation: not all of the data must be numeric and one result of the method is a judgment in the confidence in an explanatory relationship.
 
==Background and terminology==
ANOVA is a particular form of [[statistical hypothesis testing]] heavily used in the analysis of experimental data.  A statistical hypothesis test is a method of making decisions using data.  A test  result (calculated from the [[null hypothesis]] and the sample) is called statistically significant if it is deemed unlikely to have occurred by chance, ''assuming the truth of the null hypothesis''.  A statistically significant result (when a probability ([[p-value]]) is less than a threshold (significance level)) justifies the rejection of the [[null hypothesis]], but only if the a priori probability of the null hypothesis is not high.
 
In the typical application of ANOVA, the null hypothesis is that all groups are simply random samples of the same population.  This implies that all treatments have the same effect (perhaps none).  Rejecting  the null hypothesis implies that different treatments result in altered effects.
 
By construction, hypothesis testing limits the rate
of Type I errors (false positives leading to false scientific claims)
to a significance level.  Experimenters also wish to limit Type II
errors (false negatives resulting in missed scientific discoveries). 
The Type II error rate is a function of several things including
sample size (positively correlated with experiment cost), significance
level (when the standard of proof is high, the chances of overlooking
a discovery are also high) and effect size (when the effect is
obvious to the casual observer, Type II error rates are low).
 
The terminology of ANOVA is largely from the statistical
[[design of experiments]].  The experimenter adjusts factors and
measures responses in an attempt to determine an effect.  Factors are
assigned to experimental units by a combination of randomization and
[[Randomized block design|blocking]] to ensure the validity of the results.  [[Blind experiment|Blinding]] keeps the
weighing impartial.  Responses show a variability that is partially
the result of the effect and is partially random error.
 
ANOVA is the synthesis of several ideas and it is used for multiple
purposes.  As a consequence, it is difficult to define concisely or precisely.
 
"Classical ANOVA for balanced data does three things at once:
# As exploratory data analysis, an ANOVA is an organization of an additive data decomposition, and its sums of squares indicate the variance of each component of the decomposition (or, equivalently, each set of terms of a linear model).
# Comparisons of mean squares, along with F-tests&nbsp;... allow testing of a nested sequence of models.
# Closely related to the ANOVA is a linear model fit with coefficient estimates and standard errors."<ref>Gelman (2005, p 2)</ref>
 
In short, ANOVA is a statistical tool used in several ways to develop and confirm an explanation for the observed data.
 
Additionally:
#<li value="4"> It is computationally elegant and relatively robust against violations of its assumptions.</li>
# ANOVA provides industrial strength (multiple sample comparison) statistical analysis.
# It has been adapted to the analysis of a variety of experimental designs.
 
As a result:
ANOVA "has long enjoyed the status of being the '''most used''' (some would
say abused) statistical technique in psychological research."<ref>
Howell (2002, p 320)</ref>
ANOVA "is probably the '''most useful''' technique in the field of
statistical inference."<ref>Montgomery (2001, p 63)</ref>
 
ANOVA is difficult to teach, particularly for complex experiments, with [[Restricted randomization|split-plot designs]] being notorious.<ref>Gelman (2005, p 1)</ref>  In some cases the proper
application of the method is best determined by problem pattern recognition
followed by the consultation of a classic authoritative test.<ref>
Gelman (2005, p 5)</ref>
 
===Design-of-experiments terms===
(Condensed from the NIST Engineering Statistics handbook: Section 5.7. A
Glossary of DOE Terminology.)<ref>{{cite web
| title = Section 5.7. A Glossary of DOE Terminology
| work = NIST Engineering Statistics handbook
| publisher = NIST
| url = http://www.itl.nist.gov/div898/handbook/pri/section7/pri7.htm
| accessdate =  5 April 2012}}</ref>
 
; Balanced design: An experimental design where all cells (i.e. treatment combinations) have the same number of observations.
; Blocking: A schedule for conducting treatment combinations in an experimental study such that any effects on the experimental results due to a known change in raw materials, operators, machines, etc., become concentrated in the levels of the blocking variable. The reason for blocking is to isolate a systematic effect and prevent it from obscuring the main effects. Blocking is achieved by restricting randomization.
; Design: A set of experimental runs which allows the fit of a particular model and the estimate of effects.
; DOE: Design of experiments.  An approach to problem solving involving collection of data that will support valid, defensible, and supportable conclusions.<ref>{{cite web
| title = Section 4.3.1 A Glossary of DOE Terminology
| work= NIST Engineering Statistics handbook
| publisher = NIST
| url= http://www.itl.nist.gov/div898/handbook/pmd/section3/pmd31.htm
| accessdate = 14 Aug 2012}}</ref>
; Effect: How changing the settings of a factor changes the response. The effect of a single factor is also called a main effect.
; Error: Unexplained variation in a collection of observations. DOE's typically require understanding of both random error and lack of fit error.
; Experimental unit: The entity to which a specific treatment combination is applied.
; Factors: Process inputs an investigator manipulates to cause a change in the output.
; Lack-of-fit error: Error that occurs when the analysis omits one or more important terms or factors from the process model. Including replication in a DOE allows separation of experimental error into its components: lack of fit and random (pure) error.
; Model: Mathematical relationship which relates changes in a given response to changes in one or more factors.
; Random error: Error that occurs due to natural variation in the process. Random error is typically assumed to be normally distributed with zero mean and a constant variance. Random error is also called experimental error.
; Randomization: A schedule for allocating treatment material and for conducting treatment combinations in a DOE such that the conditions in one run neither depend on the conditions of the previous run nor predict the conditions in the subsequent runs.<ref group="nb">Randomization is a term used in multiple ways in this
material.  "Randomization has three roles in applications: as a device
for eliminating biases, for example from unobserved explanatory
variables and selection effects: as a basis for estimating standard
errors: and as a foundation for formally exact significance tests." 
Cox (2006, page 192)  Hinkelmann and Kempthorne use randomization
both in experimental design and for statistical analysis.</ref>
; Replication: Performing the same treatment combination more than once. Including replication allows an estimate of the random error independent of any lack of fit error.
; Responses: The output(s) of a process. Sometimes called dependent variable(s).
; Treatment: A treatment is a specific combination of factor levels whose effect is to be compared with other treatments.
 
==Classes of models==
There are three classes of models used in the analysis of variance, and these are outlined here.
 
===Fixed-effects models===
{{Main|Fixed effects model}}
The fixed-effects model of analysis of variance applies to situations in which the experimenter applies one or more treatments to the subjects of the experiment to see if the [[response variable]] values change. This allows the experimenter to estimate the ranges of response variable values that the treatment would generate in the population as a whole.
 
===Random-effects models===
{{Main|Random effects model}}
Random effects models are used when the treatments are not fixed. This occurs when the various factor levels are sampled from a larger population. Because the levels themselves are [[random variables]], some assumptions and the method of contrasting the treatments (a multi-variable generalization of simple differences) differ from the fixed-effects model.<ref>Montgomery (2001, Chapter 12: Experiments with random factors)</ref>
 
===Mixed-effects models===
{{Main|Mixed model}}
A mixed-effects model contains experimental factors of both fixed and random-effects types, with appropriately different interpretations and analysis for the two types.
 
Example:
Teaching experiments could be performed by a university department
to find a good introductory textbook, with each text considered a
treatment.  The fixed-effects model would compare a list of candidate
texts.  The random-effects model would determine whether important
differences exist among a list of randomly selected texts.  The
mixed-effects model would compare the (fixed) incumbent texts to
randomly selected alternatives.
 
Defining fixed and random effects has proven elusive, with competing
definitions arguably leading toward a linguistic quagmire.<ref>
Gelman (2005, pp 20–21)</ref>
 
==Assumptions of ANOVA==
The analysis of variance has been studied from several approaches, the most common of which uses a [[linear model]] that relates the response to the treatments and blocks. Note that the model is linear in parameters but may be nonlinear across factor levels. Interpretation is easy when data is balanced across factors but much deeper understanding is needed for unbalanced data.
 
===Textbook analysis using a normal distribution===
The analysis of variance can be presented in terms of a [[linear model]], which makes the following assumptions about the [[probability distribution]] of the responses:<ref>{{cite book |title = Statistical Methods
| last1 = Snedecor | first1 = George W.
| last2 = Cochran | first2 = William G.
| year = 1967 | edition = 6th | page = 321
}}</ref><ref>Cochran & Cox (1992, p 48)</ref><ref>Howell (2002, p 323)</ref><ref>
{{cite book | last1 = Anderson | first1 = David R.
| last2 = Sweeney | first2 = Dennis J.
| last3 = Williams | first3 = Thomas A.
| title = Statistics for business and economics
| publisher = West Pub. Co | location = Minneapolis/St. Paul
| year = 1996 | edition = 6th| isbn = 0-314-06378-1 | pages = 452–453}}
</ref>
* [[Statistical independence|Independence]] of observations &ndash; this is an assumption of the model that simplifies the statistical analysis.
* [[normal distribution|Normality]] &ndash; the distributions of the residuals are [[Normal distribution|normal]].
* Equality (or "homogeneity") of variances, called [[homoscedasticity]] &mdash; the variance of data in groups should be the same.
 
The separate assumptions of the textbook model imply that the [[errors and residuals in statistics|errors]] are independently, identically, and normally distributed for fixed effects models, that is, that the errors (<math>\varepsilon</math>'s) are independent and
 
:<math>\varepsilon \thicksim N(0, \sigma^2).\,</math>
 
===Randomization-based analysis===
{{See also|Random assignment|Randomization test}}
In a [[Randomized controlled trial|randomized controlled experiment]], the treatments are randomly assigned to experimental units, following the experimental protocol. This randomization is objective and declared before the experiment is carried out. The objective random-assignment is used to test the significance of the null hypothesis, following the ideas of [[Charles Sanders Peirce|C. S. Peirce]] and [[Ronald A. Fisher]]. This design-based analysis was discussed and developed by [[Francis J. Anscombe]] at [[Rothamsted Experimental Station]] and by [[Oscar Kempthorne]] at [[Iowa State University]].<ref>Anscombe (1948)</ref> Kempthorne and his students make an assumption of ''unit treatment additivity'', which is discussed in the books of Kempthorne and [[David R. Cox]].{{Citation needed|date=August 2011}}
 
====Unit-treatment additivity====
In its simplest form, the assumption of unit-treatment additivity<ref group="nb">Unit-treatment additivity is simply termed additivity
in most texts.  Hinkelmann and Kempthorne add adjectives and
distinguish between additivity in the strict and broad senses.  This
allows a detailed consideration of multiple error sources (treatment,
state, selection, measurement and sampling) on page 161.</ref> states that the observed response <math>y_{i,j}</math> from experimental unit <math>i</math> when receiving treatment <math>j</math> can be written as the sum of the unit's response <math>y_i</math> and the treatment-effect <math> t_j</math>, that is <ref>Kempthorne (1979, p 30)</ref><ref name="Cox">Cox (1958, Chapter 2: Some Key Assumptions)</ref><ref>Hinkelmann and Kempthorne (2008, Volume 1, Throughout.  Introduced in Section 2.3.3: Principles of experimental design; The linear model; Outline of a model)</ref>
: <math>y_{i,j}=y_i+t_j.</math>
The assumption of unit-treatment additivity implies that, for every treatment <math>j</math>, the <math>j</math>th treatment have  exactly the same effect <math>t_j</math> on every experiment unit.
 
The assumption of unit treatment additivity  usually cannot be directly [[falsificationism|falsified]], according to Cox and Kempthorne. However, many ''consequences'' of treatment-unit additivity can be falsified. For a randomized experiment, the assumption of unit-treatment additivity ''implies'' that the variance is constant for all treatments. Therefore, by [[contraposition]], a necessary condition for unit-treatment additivity is that the variance is constant.
 
The use of unit treatment additivity and randomization is similar to the design-based inference that is standard in finite-population [[survey sampling]].
 
====Derived linear model====
Kempthorne uses the randomization-distribution and the assumption of ''unit treatment additivity'' to produce a ''derived linear model'', very similar to the textbook model discussed previously.<ref>Hinkelmann and Kempthorne (2008, Volume 1, Section 6.3:
Completely Randomized Design; Derived Linear Model)</ref>  The test statistics of this derived linear model are closely approximated by the test statistics of an appropriate normal linear model, according to approximation theorems and simulation studies.<ref name="HinkelmannKempthorne">Hinkelmann and Kempthorne (2008, Volume 1, Section 6.6: Completely randomized design; Approximating the randomization test)</ref> However, there are differences. For example, the randomization-based analysis results in a small but (strictly) negative correlation between the observations.<ref>Bailey (2008, Chapter 2.14 "A More General Model" in Bailey, pp.&nbsp;38–40)</ref><ref>Hinkelmann and Kempthorne (2008, Volume 1, Chapter 7: Comparison of Treatments)</ref> In the randomization-based analysis, there is ''no assumption'' of a ''normal'' distribution and certainly ''no assumption'' of ''independence''. On the contrary, ''the observations are dependent''!
 
The randomization-based analysis has the disadvantage that its exposition involves tedious algebra and extensive time.  Since the randomization-based analysis is complicated and is closely approximated by the approach using a normal linear model, most teachers emphasize the normal linear model approach. Few statisticians object to model-based analysis of balanced randomized experiments.
 
====Statistical models for observational data====
However, when applied to data from non-randomized experiments or [[observational study|observational studies]], model-based analysis lacks the warrant of randomization.<ref>
Kempthorne (1979, pp 125–126,
"The experimenter must decide which of the various causes that he
feels will produce variations in his results must be controlled
experimentally.  Those causes that he does not control experimentally,
because he is not cognizant of them, he must control by the device of
randomization."  "[O]nly when the treatments in the experiment are
applied by the experimenter using the full randomization procedure is
the chain of inductive inference sound.  It is ''only'' under these
circumstances that the experimenter can attribute whatever effects he
observes to the treatment and the treatment only.  Under these
circumstances his conclusions are reliable in the statistical sense.") 
</ref> For observational data, the derivation of confidence intervals must use ''subjective'' models, as emphasized by [[Ronald A. Fisher]] and his followers. In practice, the estimates of treatment-effects from observational studies  generally are often inconsistent.  In practice, "statistical models" and observational data are useful for suggesting hypotheses that should be treated very cautiously by the public.<ref>Freedman {{full|date=November 2012}}</ref>
 
===Summary of assumptions===
The normal-model based ANOVA analysis assumes the independence, normality and
homogeneity of the variances of the residuals. The
randomization-based analysis assumes only the homogeneity of the
variances of the residuals (as a consequence of unit-treatment
additivity) and uses the randomization procedure of the experiment.
Both these analyses require homoscedasticity, as an assumption for the normal-model analysis and as a consequence of randomization and additivity for the randomization-based analysis.
 
However, studies of processes that
change variances rather than means (called dispersion effects) have
been successfully conducted using ANOVA.<ref>Montgomery
(2001, Section 3.8: Discovering dispersion effects)</ref>  There are
''no'' necessary assumptions for ANOVA in its full generality, but the
F-test used for ANOVA hypothesis testing has assumptions and practical
limitations which are of continuing interest.
 
Problems which do not satisfy the assumptions of ANOVA can often be transformed to satisfy the assumptions.
The property of unit-treatment additivity is not invariant under a "change of scale", so statisticians often use transformations to achieve unit-treatment additivity. If the response variable is expected to follow a parametric family of probability distributions, then the statistician may specify (in the protocol for the experiment or observational study) that the responses be transformed to stabilize the variance.<ref>Hinkelmann and Kempthorne (2008, Volume 1, Section 6.10: Completely randomized design; Transformations)</ref> Also, a statistician may specify that logarithmic transforms be applied to the responses, which are believed to follow a multiplicative model.<ref name="Cox" /><ref>Bailey (2008)</ref>
According to Cauchy's [[functional equation]] theorem, the [[logarithm]] is the only continuous transformation that transforms real multiplication to addition{{citation needed|date=October 2013}}.
 
==Characteristics of ANOVA==
ANOVA is used in the analysis of comparative experiments, those in
which only the difference in outcomes is of interest.  The statistical
significance of the experiment is determined by a ratio of two
variances.  This ratio is independent of several possible alterations
to the experimental observations: Adding a constant to all
observations does not alter significance.  Multiplying all
observations by a constant does not alter significance.  So ANOVA
statistical significance results are independent of constant bias and
scaling errors as well as the units used in expressing observations. 
In the era of mechanical calculation it was common to
subtract a constant from all observations (when equivalent to
dropping leading digits) to simplify data entry.<ref>Montgomery
(2001, Section 3-3: Experiments with a single factor: The analysis of
variance; Analysis of the fixed effects model)</ref><ref>
Cochran & Cox (1992, p 2 example)</ref>  This is an example of data
[[Coding (social sciences)|coding]].
 
==Logic of ANOVA==
The calculations of ANOVA can be characterized as computing a number
of means and variances, dividing two variances and comparing the ratio
to a handbook value to determine statistical significance.  Calculating
a treatment effect is then trivial, "the effect of any treatment is
estimated by taking the difference between the mean of the
observations which receive the treatment and the general mean."<ref>
Cochran & Cox (1992, p 49)</ref>
 
===Partitioning of the sum of squares===
{{main|Partition of sums of squares}}
ANOVA uses traditional standardized terminology.  The definitional
equation of sample variance is
<math>s^2=\textstyle\frac{1}{n-1}\sum(y_i-\bar{y})^2</math>, where the
divisor is called the degrees of freedom (DF), the summation is called
the sum of squares (SS), the result is called the mean square (MS) and
the squared terms are deviations from the sample mean.  ANOVA
estimates 3 sample variances: a total variance based on all the
observation deviations from the grand mean, an error variance based on
all the observation deviations from their appropriate
treatment means and a treatment variance.  The treatment variance is
based on the deviations of treatment means from the grand mean, the
result being multiplied by the number of observations in each
treatment to account for the difference between the variance of
observations and the variance of means.
 
The fundamental technique is a partitioning of the total [[sum of squares (statistics)|sum of squares]] ''SS'' into components related to the effects used in the model. For example, the model for a simplified ANOVA with one type of treatment at different levels.
 
:<math>SS_\text{Total} = SS_\text{Error} + SS_\text{Treatments}</math>
 
The number of [[Degrees of freedom (statistics)|degrees of freedom]] ''DF'' can be partitioned in a similar way: one of these components (that for error) specifies a [[chi-squared distribution]] which describes the associated sum of squares, while the same is true for "treatments" if there is no treatment effect.
 
:<math>DF_\text{Total} = DF_\text{Error} + DF_\text{Treatments}</math>
 
See also [[Lack-of-fit sum of squares]].
 
===The F-test===
{{Main|F-test}}
The [[F-test]] is used for comparing the factors of the total deviation. For example, in one-way, or single-factor ANOVA, statistical significance is tested for by comparing the F test statistic
 
:<math>F = \frac{\text{variance between treatments}}{\text{variance within treatments}}</math>
 
:<math>F = \frac{MS_\text{Treatments}}{MS_\text{Error}} = {{SS_\text{Treatments} / (I-1)} \over {SS_\text{Error} / (n_T-I)}}</math>
where  ''MS'' is mean square, <math>I</math>  = number of treatments and
<math>n_T</math> = total number of cases
 
to the [[F-distribution]] with <math>I - 1</math>, <math>n_T - I</math>  degrees of freedom. Using the [[F-distribution]] is a natural candidate because the test statistic is the ratio of two scaled sums of squares each of which follows a scaled [[chi-squared distribution]].
 
The expected value of F is <math>1 + {n \sigma^2_\text{Treatment}} /
{\sigma^2_\text{Error}}</math> (where n is the treatment sample size)
which is 1 for no treatment effect.  As values of F increase above 1
the evidence is increasingly inconsistent with the null hypothesis. 
Two apparent experimental methods of increasing F are increasing the
sample size and reducing the error variance by tight experimental
controls.
 
The textbook method of concluding the hypothesis test is to compare
the observed value of F with the critical value of F determined from
tables.  The critical value of F is a function of the numerator
degrees of freedom, the denominator degrees of freedom and the
significance level (α).  If F ≥ F<sub>Critical</sub> (Numerator DF, Denominator DF, α)
then reject the null hypothesis.
 
The computer method calculates the probability (p-value) of a value of
F greater than or equal to the observed value.  The null hypothesis is
rejected if this probability is less than or equal to the significance
level (α).  The two methods produce the same result.
 
The ANOVA F-test is known to be nearly optimal in the sense of
minimizing false negative errors for a fixed rate of false positive
errors (maximizing power for a fixed significance level).  To test the hypothesis that all treatments have exactly the same effect, the [[F-test]]'s p-values closely approximate the [[permutation test]]'s [[p-value]]s: The approximation is particularly close when the design is balanced.<ref name="HinkelmannKempthorne" /><ref>Hinkelmann and Kempthorne (2008, Volume 1, Section 6.7: Completely randomized design; CRD with unequal numbers of replications)</ref> Such [[permutation test]]s characterize [[uniformly most powerful test|tests with maximum power]] against all [[alternative hypothesis|alternative hypotheses]], as observed by Rosenbaum.<ref group="nb">Rosenbaum (2002, page 40) cites Section 5.7 (Permutation Tests), Theorem 2.3 (actually Theorem 3, page 184) of [[Erich Leo Lehmann|Lehmann]]'s ''Testing Statistical Hypotheses'' (1959).</ref> The ANOVA F–test (of the null-hypothesis that all treatments have exactly the same effect) is recommended as a practical test, because of its robustness against many alternative distributions.<ref>Moore and McCabe (2003, page 763)</ref><ref group="nb">The F-test for the comparison of variances has a mixed reputation.  It
is not recommended as a hypothesis test to determine whether two
''different'' samples have the same variance.  It is recommended for
ANOVA where two estimates of the variance of the ''same''
sample are compared.  While the F-test is not generally robust against
departures from normality, it has been found to be robust in the
special case of ANOVA.  Citations from Moore & McCabe (2003):
"Analysis of variance uses F statistics, but these are not
the same as the F statistic for comparing two population standard
deviations." (page 554) "The F test and other procedures for inference
about variances are so lacking in robustness as to be of little use in
practice." (page 556)  "[The ANOVA F test] is relatively insensitive
to moderate nonnormality and unequal variances, especially when the
sample sizes are similar." (page 763)  ANOVA assumes homoscedasticity,
but it is robust.  The statistical test for homoscedasticity (the
F-test) is not robust.  Moore & McCabe recommend a rule of thumb.</ref>
 
===Extended logic===
ANOVA consists of separable parts; partitioning sources of variance
and hypothesis testing can be used individually.  ANOVA is used to
support other statistical tools.  Regression is first used to fit more
complex models to data, then ANOVA is used to compare models with the
objective of selecting simple(r) models that adequately describe the
data.  "Such models could be fit without any reference to ANOVA, but
ANOVA tools could then be used to make some sense of the fitted models,
and to test hypotheses about batches of coefficients."<ref name="Gelman">Gelman (2008)</ref> 
"[W]e think of the analysis of variance as a way of understanding and structuring
multilevel models—not as an alternative to regression but as a tool
for summarizing complex high-dimensional inferences&nbsp;..."<ref name="Gelman" />
 
==ANOVA for a single factor==
{{Main|One-way analysis of variance}}
The simplest experiment suitable for ANOVA analysis is the completely
randomized experiment with a single factor.  More complex experiments
with a single factor involve constraints on randomization and include
completely randomized blocks and Latin squares (and variants:
Graeco-Latin squares, etc.).  The more complex experiments share many
of the complexities of multiple factors.  A relatively complete
discussion of the analysis (models, data summaries, ANOVA table) of
the completely randomized experiment is
[[One-way analysis of variance|available]].
 
==ANOVA for multiple factors==
{{Main|Two-way analysis of variance}}
ANOVA generalizes to the study of the effects of multiple factors. 
When the experiment includes observations at all combinations of
levels of each factor, it is termed [[Factorial experiment|factorial]]. 
Factorial experiments
are more efficient than a series of single factor experiments and the
efficiency grows as the number of factors increases.<ref name="Montgomery">Montgomery
(2001, Section 5-2: Introduction to factorial designs; The advantages
of factorials)</ref>  Consequently, factorial designs are heavily used.
 
The use of ANOVA to study the effects of multiple factors has a complication.  In a 3-way ANOVA with factors x, y and z, the ANOVA model includes terms for the main effects (x, y, z) and terms for [[Interaction (statistics)|interactions]] (xy, xz, yz, xyz). 
All terms require hypothesis tests.  The proliferation of interaction terms increases the risk that some hypothesis test will produce a false positive by chance. Fortunately, experience says that high order interactions are rare.<ref>Belle (2008, Section 8.4: High-order interactions occur rarely)</ref> 
The ability to detect interactions is a major advantage of multiple
factor ANOVA.  Testing one factor at a time hides interactions, but
produces apparently inconsistent experimental results.<ref name="Montgomery" />
 
Caution is advised when encountering interactions; Test 
interaction terms first and expand the analysis beyond ANOVA if
interactions are found.  Texts vary in their recommendations regarding
the continuation of the ANOVA procedure after encountering an
interaction.  Interactions complicate the interpretation of
experimental data.  Neither the calculations of significance nor the
estimated treatment effects can be taken at face value.  "A
significant interaction will often mask the significance of main effects."<ref>Montgomery (2001, Section 5-1: Introduction to factorial designs; Basic definitions and principles)</ref>  Graphical methods are recommended
to enhance understanding.  Regression is often useful.  A lengthy discussion of interactions is available in Cox (1958).<ref>Cox (1958,
Chapter 6: Basic ideas about factorial experiments)</ref>  Some interactions can be removed (by transformations) while others cannot.
 
A variety of techniques are used with multiple factor ANOVA to reduce expense. One technique used in factorial designs is to minimize replication (possibly no replication with support of [[Tukey's test of additivity|analytical trickery]]) and to combine groups when effects are found to be statistically (or practically) insignificant.  An experiment with many insignificant factors may collapse into one with a few factors supported by many replications.<ref>Montgomery (2001, Section 5-3.7: Introduction to factorial designs; The two-factor factorial design; One observation per cell)</ref>
 
==Worked numeric examples==
Several fully worked numerical examples are available.  A
[[F-test#One-way_ANOVA_example|simple case]] uses one-way (a single factor) analysis.  A [[Two-way analysis of variance|more complex case]] uses two-way (two-factor) analysis.
 
==Associated analysis==
Some analysis is required in support of the ''design'' of the experiment while other analysis is performed after changes in the factors are formally found to produce statistically significant changes in the responses.  Because experimentation is iterative, the results of one experiment alter plans for following experiments.
 
===Preparatory analysis===
 
====The number of experimental units====
 
In the design of an experiment, the number of experimental units is planned to satisfy the goals of the experiment. Experimentation is often sequential.
 
Early experiments are often designed to provide mean-unbiased estimates of treatment effects and of experimental error. Later experiments are often designed to test a hypothesis that a treatment effect has an important magnitude; in this case, the number of experimental units is chosen so that the experiment is within budget and has adequate power, among other goals.
 
Reporting sample size analysis is generally required in psychology. "Provide information on sample size and the process that led to sample size decisions."<ref>Wilkinson (1999, p 596)</ref>  The analysis, which is written in the experimental protocol before the experiment is conducted, is examined in grant applications and administrative review boards.
 
Besides the power analysis, there are less formal methods for selecting the number of experimental units. These include graphical methods based on limiting
the probability of false negative errors, graphical methods based on an expected variation increase (above the residuals) and methods based on achieving a desired confident interval.<ref>Montgomery (2001, Section 3-7: Determining sample size)</ref>
 
====Power analysis====
[[Statistical power|Power analysis]] is often applied in the context of ANOVA in order to assess the probability of successfully rejecting the null hypothesis if we assume a certain ANOVA design, effect size in the population, sample size and significance level. Power analysis can assist in study design by determining what sample size would be required in order to have a reasonable chance of rejecting the null hypothesis when the alternative hypothesis is true.<ref>Howell (2002, Chapter 8: Power)</ref><ref>Howell (2002, Section 11.12: Power (in ANOVA))</ref><ref>Howell (2002, Section 13.7: Power analysis for factorial experiments)</ref><ref>Moore and McCabe (2003, pp 778–780)</ref>
 
====Effect size====
{{Main|Effect size}}
Several standardized measures of effect have been proposed for ANOVA to summarize the strength of the association between a predictor(s) and the dependent variable (e.g., &eta;<sup>2</sup>, &omega;<sup>2</sup>, or &fnof;<sup>2</sup>) or the overall standardized difference (&Psi;) of the complete model. Standardized effect-size estimates facilitate comparison of findings across studies and disciplines.  However, while standardized effect sizes are commonly used in much of the professional literature, a non-standardized measure of effect size that has immediately "meaningful" units may be preferable for reporting purposes.<ref name="Wilkinson">Wilkinson (1999, p 599)</ref>
 
===Followup analysis===
It is always appropriate to carefully consider outliers.  They have a disproportionate impact on statistical conclusions and are often the result of errors.
 
====Model confirmation====
It is prudent to verify that the assumptions of ANOVA have been met. Residuals are examined or analyzed to confirm homoscedasticity and gross normality.<ref>Montgomery (2001, Section 3-4: Model adequacy checking)</ref>  Residuals should have the appearance of (zero mean normal distribution) noise when plotted as a function of anything including time and
modeled data values. Trends hint at interactions among factors or among observations.  One rule of thumb: "If the largest standard deviation is less than twice the smallest standard deviation, we can use methods based on the assumption of equal standard deviations and our results
will still be approximately correct."<ref>Moore and McCabe (2003, p 755, Qualifications to this rule appear in a footnote.)</ref>
 
====Follow-up tests====
A statistically significant effect in ANOVA is often followed up with one or more different follow-up tests. This can be done in order to assess which groups are different from which other groups or to test various other focused hypotheses. Follow-up tests are often distinguished in terms of whether they are planned ([[A priori and a posteriori|a priori]]) or [[Post-hoc analysis|post hoc]]. Planned tests are determined before looking at the data and post hoc tests are performed after looking at the data.
 
Often one of the "treatments" is none, so the treatment group can act as a control.  [[Dunnett's test]] (a modification of the t-test) tests whether each of the other treatment groups has the same
mean as the control.<ref>Montgomery (2001, Section 3-5.8: Experiments with a single factor: The analysis of
variance; Practical interpretation of results; Comparing means with a control)</ref>
 
Post hoc tests such as [[Tukey's range test]] most commonly compare every group mean with every other group mean and typically incorporate some method of controlling for Type I errors. Comparisons, which are most commonly planned, can be either simple or compound. Simple comparisons compare one group mean with one other group mean. Compound comparisons typically compare two sets of groups means where one set has two or more groups (e.g., compare average group means of group A, B and C with group D). Comparisons can also look at tests of trend, such as linear and quadratic relationships, when the independent variable involves ordered levels.
 
Following ANOVA with pair-wise multiple-comparison tests has been criticized on several grounds.<ref name="Wilkinson" /><ref>Hinkelmann and Kempthorne (2008, Volume 1, Section 7.5:
Comparison of Treatments; Multiple Comparison Procedures)</ref> There are many such tests (10 in one table) and recommendations regarding their use are vague or conflicting.<ref>Howell (2002, Chapter 12: Multiple comparisons among treatment means)</ref><ref>Montgomery (2001, Section 3-5: Practical interpretation of results)</ref>
 
==Study designs and ANOVAs==
There are several types of ANOVA. Many statisticians base ANOVA on the [[experimental design|design of the experiment]],<ref>Cochran & Cox (1957, p 9,
"[T]he general rule [is] that the way in which the experiment is conducted determines not only whether inferences can be made, but also the calculations required to make them.")</ref> especially on the protocol that specifies the [[random assignment]] of treatments to subjects; the protocol's description of the assignment mechanism should include a specification of the structure of the treatments and of any [[blocking (statistics)|blocking]]. It is also common to apply ANOVA to observational data using an appropriate statistical model.{{Citation needed|date=May 2011}}
 
Some popular designs use the following types of ANOVA:
*[[One-way ANOVA]] is used to test for differences among two or more [[statistical independence|independent]] groups (means),e.g. different levels of urea application in a crop. Typically, however, the one-way ANOVA is used to test for differences among at least three groups, since the two-group case can be covered by a [[t-test]].<ref>{{cite doi|10.1093/biomet/6.1.1|noedit}}</ref> When there are only two means to compare, the [[t-test]] and the ANOVA [[F-test]] are equivalent; the relation between ANOVA and ''t'' is given by ''F''&nbsp;=&nbsp;''t''<sup>2</sup>.
*[[Factorial experiment|Factorial]] ANOVA is used when the experimenter wants to study the interaction effects among the treatments.
*[[Repeated measures]] ANOVA is used when the same subjects are used for each treatment (e.g., in a [[longitudinal study]]).
*[[Multivariate analysis of variance]] (MANOVA) is used when there is more than one [[dependent variable|response variable]].
 
==ANOVA cautions==
Balanced experiments (those with an equal sample size for each treatment) are relatively easy to interpret; Unbalanced
experiments offer more complexity.  For single factor (one way) ANOVA, the adjustment for unbalanced data is easy, but the unbalanced analysis lacks both robustness and power.<ref>Montgomery (2001, Section 3-3.4: Unbalanced data)</ref>  For more complex designs the lack of balance leads to further complications. "The orthogonality property of main effects and interactions present in balanced data does not carry over to the unbalanced case. This means that the usual analysis of variance techniques do not apply. 
Consequently, the analysis of unbalanced factorials is much more difficult than that for balanced designs."<ref>Montgomery (2001, Section 14-2: Unbalanced data in factorial design)</ref>  In the general case, "The analysis of variance can also be applied to unbalanced data, but then the sums of squares, mean squares, and F-ratios will depend on the order in which the sources of variation
are considered."<ref name="Gelman" />  The simplest techniques for handling unbalanced data restore balance by either throwing out data or by synthesizing missing data.  More complex techniques use regression.
 
ANOVA is (in part) a significance test.  The American Psychological Association holds the view that simply reporting significance is insufficient and that reporting confidence bounds is preferred.<ref name="Wilkinson" />
 
While ANOVA is conservative (in maintaining a significance level) against [[multiple comparisons]] in one dimension, it is not conservative against comparisons in multiple dimensions.<ref>Wilkinson (1999, p 600)</ref>
 
==Generalizations==
ANOVA is considered to be a special case of [[linear regression]]<ref>Gelman (2005, p.1) (with qualification in the later text)</ref><ref>Montgomery (2001, Section 3.9: The Regression Approach to the Analysis of Variance)</ref> which in turn is a special case of the [[general linear model]].<ref>Howell (2002, p 604)</ref> All consider the observations to be the sum of a model (fit) and a residual (error) to be minimized.
 
The [[Kruskal&ndash;Wallis test]] and the [[Friedman test]] are [[nonparametric]] tests, which  do not rely on an assumption of normality.<ref>Howell (2002, Chapter 18: Resampling and nonparametric approaches to data)</ref><ref>Montgomery (2001, Section 3-10: Nonparametric methods in the analysis of variance)</ref>
 
==History==
While the analysis of variance reached fruition in the 20th century,
antecedents extend centuries into the past according to Stigler.<ref>
Stigler (1986)</ref>  These include hypothesis testing, the partitioning of sums of
squares, experimental techniques and the additive model.  [[Pierre-Simon Laplace|Laplace]] was
performing hypothesis testing in the 1770s.<ref>Stigler (1986, p 134)</ref> 
The development of least-squares methods by Laplace and [[Carl Friedrich Gauss|Gauss]] circa
1800 provided an improved method of combining observations (over the
existing practices of astronomy and geodesy).  It also initiated much
study of the contributions to sums of squares.  Laplace soon knew how
to estimate a variance from a residual (rather than a total) sum of
squares.<ref>Stigler (1986, p 153)</ref>  By 1827 Laplace was using least
squares methods to address ANOVA problems regarding measurements of
atmospheric tides.<ref>Stigler (1986, pp&nbsp;154–155)</ref> 
Before 1800 astronomers had isolated observational errors resulting
from reaction times (the "[[personal equation]]") and had developed
methods of reducing the errors.<ref>Stigler (1986, pp&nbsp;240–242)</ref>  The
experimental methods used in the study of the personal equation were
later accepted by the emerging field of psychology <ref>Stigler (1986,
Chapter 7 - Psychophysics as a Counterpoint)</ref> which developed strong
(full factorial) experimental methods to which randomization and
blinding were soon added.<ref>Stigler (1986, p 253)</ref>  An eloquent
non-mathematical explanation of the additive effects model was
available in 1885.<ref>Stigler (1986, pp&nbsp;314–315)</ref>
 
Sir Ronald Fisher introduced the term "variance" and proposed a formal analysis of [[variance]] in a 1918 article ''[[The Correlation Between Relatives on the Supposition of Mendelian Inheritance]]''.<ref>''The Correlation Between Relatives on the Supposition of Mendelian Inheritance''. Ronald A. Fisher. ''Philosophical Transactions of the Royal Society of Edinburgh''. 1918. (volume 52, pages 399–433)</ref>  His first application of the analysis of variance was published in 1921.<ref>On the "Probable Error" of a Coefficient of Correlation Deduced from a Small Sample. Ronald A. Fisher. Metron, 1: 3-32 (1921)</ref> Analysis of variance became widely known after being included in Fisher's 1925 book ''[[Statistical Methods for Research Workers]]''.
 
Randomization models were developed by several researchers.  The first was
published in Polish by [[Neyman]] in 1923.<ref>
Scheffé (1959, p 291, "Randomization models were first formulated by
Neyman (1923) for the completely randomized design, by Neyman (1935)
for randomized blocks, by Welch (1937) and Pitman (1937) for the Latin
square under a certain null hypothesis, and by Kempthorne (1952, 1955)
and Wilk (1955) for many other designs.")</ref>
 
One of the attributes of ANOVA which ensured its early popularity was
computational elegance.  The structure of the additive model allows
solution for the additive coefficients by simple algebra rather than
by matrix calculations.  In the era of mechanical calculators this
simplicity was critical.  The determination of statistical
significance also required access to tables of the F function which
were supplied by early statistics texts.
 
==See also==
{{Commons category|Analysis of variance}}
 
<div style="-moz-column-count:3; column-count:3;">
*[[AMOVA]]
*[[ANCOVA]]
*[[ANORVA]]
*[[ANOVA on ranks]]
*[[ANOVA-simultaneous component analysis]]
*[[Mixed-design analysis of variance]]
*[[Multivariate analysis of variance|MANOVA]]
*[[One-way analysis of variance]]
*[[Two-way analysis of variance]]
</div>
 
==Footnotes==
{{reflist|group="nb"}}
 
==Notes==
{{reflist|30em}}
 
==References==
* {{cite journal|doi=10.2307/2984159|title=The Validity of Comparative Experiments|authorlink=Francis J. Anscombe|first=F. J.|last=Anscombe|journal=[[Journal of the Royal Statistical Society]]. Series A (General)|volume=111|issue=3|year=1948|pages=181–211|jstor=2984159|mr=30181}}
* {{cite book |last=Bailey|first=R. A.|authorlink=Rosemary A. Bailey|title=Design of Comparative Experiments|publisher=Cambridge University Press|year=2008 |isbn=978-0-521-68357-9|url=http://www.maths.qmul.ac.uk/~rab/DOEbook}} Pre-publication chapters are available on-line.
* {{cite book | last = Belle | first = Gerald van
| title = Statistical rules of thumb | publisher = Wiley
| location = Hoboken, N.J | year = 2008 | edition = 2nd
| isbn = 978-0-470-14448-0 }}
* {{cite book | last1 = Cochran | first1 = William G.
| last2 = Cox | first2 = Gertrude M.
| title = Experimental designs | publisher = Wiley | location = New York
| year = 1992 | isbn = 978-0-471-54567-5 | edition = 2nd }}
* Cohen, Jacob (1988). ''Statistical power analysis for the behavior sciences'' (2nd ed.). Routledge ISBN 978-0-8058-0283-2
* {{Cite journal | doi = 10.1037/0033-2909.112.1.155 | author = Cohen, Jacob | year = 1992 | title = Statistics a power primer | url = | journal = Psychology Bulletin | volume = 112 | issue = 1| pages = 155–159 | pmid=19565683}}
*[[David R. Cox|Cox, David R.]] (1958). ''Planning of experiments''.  Reprinted as ISBN 978-0-471-57429-3
*{{cite book | last = Cox | first = D. R.
| title = Principles of statistical inference
| publisher = Cambridge University Press
| location = Cambridge New York | year = 2006
| isbn = 978-0-521-68567-2 }}
* [[David A. Freedman (statistician)|Freedman, David A.]](2005). ''Statistical Models: Theory and Practice'', Cambridge University Press.  ISBN 978-0-521-67105-7
* {{Cite journal
| last1 = Gelman | first1 = Andrew
| doi = 10.1214/009053604000001048
| title = Analysis of variance? Why it is more important than ever
| journal = The Annals of Statistics
| volume = 33
| pages = 1–53
| year = 2005
}}
*{{cite book | last = Gelman | first = Andrew
| title = The new Palgrave dictionary of economics
| publisher = Palgrave Macmillan
| location = Basingstoke, Hampshire New York
| chapter=Variance, analysis of |edition=2nd
| year = 2008 | isbn = 978-0-333-78676-5}}
*{{cite book
|author=Hinkelmann, Klaus & [[Oscar Kempthorne|Kempthorne, Oscar]]|year=2008|title=Design and Analysis of Experiments|volume=I and II|edition=Second|publisher=Wiley|isbn=978-0-470-38551-7}}
* {{cite book | last = Howell | first = David C.
| title = Statistical methods for psychology
| publisher = Duxbury/Thomson Learning | location = Pacific Grove, CA
| year = 2002 | edition = 5th | isbn = 0-534-37770-X}}
*{{cite book
|author=[[Oscar Kempthorne|Kempthorne, Oscar]]
|year=1979
|title=The Design and Analysis of Experiments
|edition=Corrected reprint of (1952) Wiley
|publisher=Robert E. Krieger
|isbn=0-88275-105-0
}}
* Lehmann, E.L. (1959) Testing Statistical Hypotheses.  John Wiley & Sons.
* {{cite book | last = Montgomery | first = Douglas C.
| title = Design and Analysis of Experiments
| publisher =  Wiley | location = New York
| year = 2001 | edition = 5th | isbn = 978-0-471-31649-7}}
* Moore, David S. & McCabe, George P. (2003).  Introduction to the Practice of Statistics (4e).  W H Freeman & Co.  ISBN 0-7167-9657-0
* Rosenbaum, Paul R. (2002). ''Observational Studies'' (2nd ed.). New York: Springer-Verlag.  ISBN 978-0-387-98967-9
* {{cite book |title=The Analysis of Variance
|last=Scheffé |first=Henry |location=New York
|publisher=Wiley |year=1959}}
*{{cite book | last = Stigler | first = Stephen M.
| title = The history of statistics : the measurement of uncertainty before 1900
| publisher = Belknap Press of Harvard University Press
| location = Cambridge, Mass | year = 1986 | isbn = 0-674-40340-1 }}
* {{Cite journal
|author = Wilkinson, Leland
|title = Statistical Methods in Psychology Journals; Guidelines and Explanations
|journal = American Psychologist
|volume = 54 
|issue = 8
|pages = 594–604
|year = 1999
|doi = 10.1037/0003-066X.54.8.594}}
 
==Further reading==
* {{cite journal
| last = Box | first = G. E. P.
| authorlink = George E. P. Box
| title = Non-Normality and Tests on Variances
| journal = Biometrika
| volume = 40
| issue = 3/4
| pages = 318–335
| publisher = Biometrika Trust
| year = 1953
| jstor = 2333350
}}
* {{Cite journal
| last1 = Box | first1 = G. E. P. |authorlink=George E. P. Box
| title = Some Theorems on Quadratic Forms Applied in the Study of Analysis of Variance Problems, I. Effect of Inequality of Variance in the One-Way Classification
| doi = 10.1214/aoms/1177728786
| journal = The Annals of Mathematical Statistics
| volume = 25
| issue = 2
| pages = 290
| year = 1954
| pmid = 
| pmc =
}}
* {{Cite doi|10.1214/aoms/1177728717|noedit}}
* {{cite book
|author=Caliński, Tadeusz & Kageyama, Sanpei|title=Block designs: A Randomization approach, Volume '''I''': Analysis|series=Lecture Notes in Statistics|volume=150|publisher=Springer-Verlag|location=New York|year=2000|isbn=0-387-98578-6
}}
* {{cite book|title=Plane Answers to Complex Questions: The Theory of Linear Models|last=Christensen|first=Ronald|location=New York|publisher=Springer|year=2002| edition=Third|isbn=0-387-95361-2}}
*[[David R. Cox|Cox, David R.]] & [[Nancy M. Reid|Reid, Nancy M.]] (2000). ''The theory of design of experiments''.  (Chapman & Hall/CRC).  ISBN 978-1-58488-195-7
* {{Cite journal | doi = 10.1017/S0021859600003750| author = Fisher, Ronald | year = 1918 | title = Studies in Crop Variation. I. An examination of the yield of dressed grain from Broadbalk | url = http://www.library.adelaide.edu.au/digitised/fisher/15.pdf | journal = Journal of Agricultural Science| volume = 11 | issue = | pages = 107–135}}
* [[David A. Freedman (statistician)|Freedman, David A.]]; Pisani, Robert; Purves, Roger (2007) ''Statistics'', 4th edition. W.W. Norton & Company ISBN 978-0-393-92972-0
* {{cite book|last1=Hettmansperger|first1=T. P.|last2=McKean|first2=J. W.|title=Robust nonparametric statistical methods| edition=First|series=Kendall's Library of Statistics|volume=Volume 5|editor=Edward Arnold|location=New York|publisher=John Wiley & Sons, Inc.|year=1998|pages=xiv+467 pp.|isbn=0-340-54937-8 |mr=1604954 }}
* {{cite book
|first=Marvin
|last=Lentner
|coauthor=Thomas Bishop
|title=Experimental design and analysis
|edition=Second
|publisher=Valley Book Company
|location=P.O. Box 884, Blacksburg, VA 24063
|year=1993
|isbn=0-9616255-2-X
}}
* Tabachnick, Barbara G. & Fidell, Linda S. (2007). ''Using Multivariate Statistics'' (5th ed.). Boston: Pearson International Edition.  ISBN 978-0-205-45938-4
* {{cite book|last=Wichura|first=Michael J.|title=The coordinate-free approach to linear models|series=Cambridge Series in Statistical and Probabilistic Mathematics|publisher=Cambridge University Press|location=Cambridge|year=2006|pages=xiv+199|isbn=978-0-521-86842-6|mr=2283455|ref=harv}}
 
==External links==
{{wikiversity}}
* [[SOCR]] [http://wiki.stat.ucla.edu/socr/index.php/AP_Statistics_Curriculum_2007_ANOVA_1Way ANOVA Activity] and [http://www.socr.ucla.edu/htmls/ana/ANOVA1Way_Analysis.html interactive applet].
* [http://www.southampton.ac.uk/~cpd/anovas/datasets/index.htm  Examples of all ANOVA and ANCOVA models with up to three treatment factors, including randomized block, split plot, repeated measures, and Latin squares, and their analysis in R]
* NIST/SEMATECH e-Handbook of Statistical Methods, [http://www.itl.nist.gov/div898/handbook/prc/section4/prc43.htm section 7.4.3: "Are the means equal?"]
 
{{Statistics|correlation|state=collapsed}}
{{Experimental design|state=collapsed}}
{{Portal bar|Statistics}}
 
{{DEFAULTSORT:Analysis Of Variance}}
[[Category:Analysis of variance| ]]
[[Category:Design of experiments]]
[[Category:Statistical tests]]
[[Category:Parametric statistics]]

Latest revision as of 20:25, 5 January 2015

My name is Minerva Lienhop. I life in Vitrolles (France).

Here is my site: symptoms of hemorrhoids