L-notation: Difference between revisions

From formulasearchengine
Jump to navigation Jump to search
No edit summary
Clarification of subexponentiality
 
Line 1: Line 1:
{{For|loudspeakers|mid-range speaker}}
Alyson Meagher is the name her mothers and fathers gave her but she doesn't like when people use her full name. Alaska is where he's always been residing. Invoicing is my profession. One of the very [http://ltreme.com/index.php?do=/profile-127790/info/ best psychics] issues in the globe for him is performing ballet and he'll be beginning something else alongside with it.
In [[statistics]], the '''mid-range''' or '''mid-extreme''' of a set of statistical data values is the [[arithmetic mean]] of the maximum and minimum values in a [[data set]], defined as:{{sfn|Dodge|2003}}
 
:<math>M=\frac{\max x + \min x}{2}.</math>
 
The mid-range is the midpoint of the [[Range (statistics)|range]]; as such, it is a measure of [[central tendency]].
 
The mid-range is rarely used in practical statistical analysis, as it lacks [[#Efficiency|efficiency]] as an estimator for most distributions of interest,  because it ignores all intermediate points, and lacks [[#Robustness|robustness]], as outliers change it significantly. Indeed, it is one of the least efficient and least robust statistics. However, it finds some use in special cases: it is the maximally efficient estimator for the center of a uniform distribution, trimmed mid-ranges address robustness, and as an [[L-estimator]], it is simple to understand and compute.
 
==Comparison with other measures==
===Robustness===
The midrange is highly sensitive to outliers and ignores all but two data points. It is therefore a very non-[[robust statistic]], having a [[breakdown point]] of 0, meaning that a single observation can change it arbitrarily. Further, it is highly influenced by outliers: increasing the sample maximum or decreasing the sample minimum by ''x'' changes the mid-range by <math>x/2,</math> while it changes the sample mean, which also has breakdown point of 0, by only <math>x/n.</math> It is thus of little use in practical statistics, unless outliers are already handled.
 
A [[trimmed estimator|trimmed]] midrange is known as a '''{{visible anchor|midsummary}}''' – the ''n''% trimmed midrange is the average of the ''n''% and (100&minus;''n'')% percentiles, and is more robust, having a [[breakdown point]] of ''n''%. In the middle of these is the [[midhinge]], which is the 25% midsummary. The [[median]] can be interpreted as the fully trimmed (50%) mid-range; this accords with the convention that the median of an even number of points is the mean of the two middle points.
 
These trimmed midranges are also of interest as [[descriptive statistics]] or as [[L-estimator]]s of central location or [[skewness]]: differences of midsummaries, such as midhinge minus the median, give measures of skewness at different points in the tail.{{sfn|Velleman|Hoaglin|1981}}
 
===Efficiency===
Despite its drawbacks, in some cases it is useful: the midrange is a highly [[Efficiency (statistics)|efficient]] [[estimator]] of μ, given a small sample of a sufficiently [[platykurtic]] distribution, but it is inefficient for mesokurtic distributions, such as the normal.
 
For example, for a [[continuous uniform distribution]] with unknown maximum and minimum, the mid-range is the [[UMVU]] estimator for the mean. The [[sample maximum]] and sample minimum, together with sample size, are a sufficient statistic for the population maximum and minimum – the distribution of other samples, conditional on a given maximum and minimum, is just the uniform distribution between the maximum and minimum and thus add no information. See [[German tank problem]] for further discussion. Thus the mid-range, which is an unbiased and sufficient estimator of the population mean, is in fact the UMVU: using the sample mean just adds noise based on the uninformative distribution of points within this range.
 
Conversely, for the normal distribution, the sample mean is the UMVU estimator of the mean. Thus for platykurtic distributions, which can often be thought of as between a uniform distribution and a normal distribution, the informativeness of the middle sample points versus the extrema values varies from "equal" for normal to "uninformative" for uniform, and for different distributions, one or the other (or some combination thereof) may be most efficient. A robust analog is the [[trimean]], which averages the midhinge (25% trimmed mid-range) and median.
 
====Small samples====
For small sample sizes (''n'' from 4 to 20) drawn from a sufficiently platykurtic distribution (negative [[excess kurtosis]], defined as γ<sub>2</sub> = (μ<sub>4</sub>/(μ<sub>2</sub>)²)&nbsp;&minus;&nbsp;3), the mid-range is an efficient estimator of the mean ''μ''. The following table summarizes empirical data comparing three estimators of the mean for distributions of varied kurtosis; the [[modified mean]] is the [[truncated mean]], where the maximum and minimum are eliminated.<ref>{{Cite thesis |type=Master's |title=An Investigation of Measures of Central Tendency Used in Quality Control |url=http://books.google.co.jp/books/about/An_Investigation_of_Measures_of_Central.html?id=GNtiNwAACAAJ |author= |last=Vinson |first=William Daniel |year=1951 |publisher= University of North Carolina at Chapel Hill |at=Table (4.1), pp. 32–34}}</ref><ref>{{cite book |title=Statistical methods in quality control |first=Dudley Johnstone |last=Cowden |publisher=Prentice-Hall |year=1957 |pages=[http://books.google.co.jp/books?ei=DXFqUf7hFIiiige4lICIAg&id=b-BTAAAAMAAJ&dq=William+D.+Vinson+statistics&q=William+D.+Vinson#search_anchor 67–68]}}</ref>
 
{| class="wikitable"
! Excess kurtosis (γ<sub>2</sub>) !! Most efficient estimator of ''μ''
|-
| &minus;1.2 to &minus;0.8 || Midrange                   
|-
| &minus;0.8 to  2.0 || Mean
|-
|  2.0 to  6.0 || Modified mean
|}
 
For ''n'' = 1 or 2, the midrange and the mean are equal (and coincide with the median), and are most efficient for all distributions. For ''n'' = 3, the modified mean is the median, and instead the mean is the most efficient measure of central tendency for values of ''γ''<sub>2</sub> from 2.0 to 6.0 as well as from &minus;0.8 to 2.0.
 
==Sampling properties==
 
For a sample of size ''n'' from the [[standard normal distribution]], the mid-range ''M'' is unbiased, and has a variance given by:{{sfn|Kendall|Stuart|1969|loc=Example 14.4}}
:<math>\operatorname{var}(M)=\frac{\pi^2}{24 \ln(n)}.</math>
 
For a sample of size ''n'' from the standard [[Laplace distribution]], the mid-range ''M'' is unbiased, and has a variance given by:{{sfn|Kendall|Stuart|1969|loc=Example 14.5}}
:<math>\operatorname{var}(M)=\frac{\pi^2}{12}</math>
and, in particular, the variance does not decrease to zero as the sample size grows.
 
For a sample of size ''n'' from a zero-centred [[Uniform distribution (continuous)|uniform distribution]], the mid-range ''M'' is unbiased, ''nM'' has an [[asymptotic distribution]] which is a [[Laplace distribution]].{{sfn|Kendall|Stuart|1969|loc=Example 14.12}}
 
==Deviation==
While the mean of a set of values minimizes the sum of squares of [[Deviation (statistics)|deviations]] and the [[median]] minimizes the [[average absolute deviation]], the midrange minimizes the [[maximum deviation]] (defined as <math>\max\left|x_i-m\right|</math>): it is a solution to a variational problem.
 
==See also==
* [[Range (statistics)]]
* [[Midhinge]]
 
==References==
{{reflist|2}}
{{refbegin}}
* {{cite isbn|0199206139}}
* {{cite isbn|0852641419}}
* {{cite isbn|087150409X}}
{{refend}}
 
{{DEFAULTSORT:Mid-Range}}
[[Category:Means]]
[[Category:Summary statistics]]

Latest revision as of 12:41, 16 August 2014

Alyson Meagher is the name her mothers and fathers gave her but she doesn't like when people use her full name. Alaska is where he's always been residing. Invoicing is my profession. One of the very best psychics issues in the globe for him is performing ballet and he'll be beginning something else alongside with it.