# Probability interpretations

The word **probability** has been used in a variety of ways since it was first applied to the mathematical study of games of chance. Does probability measure the real, physical tendency of something to occur or is it a measure of how strongly one believes it will occur, or does it draw on both these elements? In answering such questions, mathematicians interpret the probability values of probability theory.

There are two broad categories^{[1]}^{[2]} of **probability interpretations** which can be called "physical" and "evidential" probabilities. Physical probabilities, which are also called objective or frequency probabilities, are associated with random physical systems such as roulette wheels, rolling dice and radioactive atoms. In such systems, a given type of event (such as the dice yielding a six) tends to occur at a persistent rate, or "relative frequency", in a long run of trials. Physical probabilities either explain, or are invoked to explain, these stable frequencies. Thus talking about physical probability makes sense only when dealing with well defined random experiments.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} The two main kinds of theory of physical probability are frequentist accounts (such as those of Venn,^{[3]} Reichenbach^{[4]} and von Mises^{[5]}) and propensity accounts (such as those of Popper, Miller, Giere and Fetzer).{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}}

Evidential probability, also called Bayesian probability (or subjectivist probability), can be assigned to any statement whatsoever, even when no random process is involved, as a way to represent its subjective plausibility, or the degree to which the statement is supported by the available evidence. On most accounts, evidential probabilities are considered to be degrees of belief, defined in terms of dispositions to gamble at certain odds. The four main evidential interpretations are the classical (e.g. Laplace's)^{[6]} interpretation, the subjective interpretation (de Finetti^{[7]} and Savage^{[8]}), the epistemic or inductive interpretation (Ramsey,^{[9]} Cox^{[10]}) and the logical interpretation (Keynes^{[11]} and Carnap^{[12]}).

Some interpretations of probability are associated with approaches to statistical inference, including theories of estimation and hypothesis testing. The physical interpretation, for example, is taken by followers of "frequentist" statistical methods, such as R. A. Fisher, Jerzy Neyman and Egon Pearson.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} Statisticians of the opposing Bayesian school typically accept the existence and importance of physical probabilities, but also consider the calculation of evidential probabilities to be both valid and necessary in statistics. This article, however, focuses on the interpretations of probability rather than theories of statistical inference.

The terminology of this topic is rather confusing, in part because probabilities are studied within a variety of academic fields. The word "frequentist" is especially tricky. To philosophers it refers to a particular theory of physical probability, one that has more or less been abandoned. To scientists, on the other hand, "frequentist probability" is just another name for physical (or objective) probability. Those who promote Bayesian inference view "frequentist statistics" as an approach to statistical inference that recognises only physical probabilities. Also the word "objective", as applied to probability, sometimes means exactly what "physical" means here, but is also used of evidential probabilities that are fixed by rational constraints, such as logical and epistemic probabilities.

It is unanimously agreed that statistics depends somehow on probability. But, as to what probability is and how it is connected with statistics, there has seldom been such complete disagreement and breakdown of communication since the Tower of Babel. Doubtless, much of the disagreement is merely terminological and would disappear under sufficiently sharp analysis.

—(Savage, 1954, p 2)^{[8]}

## Philosophy

The **philosophy of probability** presents problems chiefly in matters of epistemology and the uneasy interface between mathematical concepts and ordinary language as it is used by non-mathematicians.
Probability theory is an established field of study in mathematics. It has its origins in correspondence discussing the mathematics of games of chance between Blaise Pascal and Pierre de Fermat in the seventeenth century,^{[13]} and was formalized and rendered axiomatic as a distinct branch of mathematics by Andrey Kolmogorov in the twentieth century. In its axiomatic form, mathematical statements about probability theory carry the same sort of epistemological confidence shared by other mathematical statements in the philosophy of mathematics.^{[14]}^{[15]}

The mathematical analysis originated in observations of the behaviour of game equipment such as playing cards and dice, which are designed specifically to introduce random and equalized elements; in mathematical terms, they are subjects of indifference. This is not the only way probabilistic statements are used in ordinary human language: when people say that "*it will probably rain*", they typically do not mean that the outcome of rain versus not-rain is a random factor that the odds currently favor; instead, such statements are perhaps better understood as qualifying their expectation of rain with a degree of confidence. Likewise, when it is written that "the most probable explanation" of the name of Ludlow, Massachusetts "is that it was named after Roger Ludlow", what is meant here is not that Roger Ludlow is favored by a random factor, but rather that this is the most plausible explanation of the evidence, which admits other, less likely explanations.

Thomas Bayes attempted to provide a logic that could handle varying degrees of confidence; as such, Bayesian probability is an attempt to recast the representation of probabilistic statements as an expression of the degree of confidence by which the beliefs they express are held.

Though probability initially had somewhat mundane motivations, its modern influence and use is widespread ranging from Evidence based medicine, through Six sigma, all the way to the Probabilistically checkable proof and the String theory landscape.

Classical | Frequentist | Subjective | Propensity | |
---|---|---|---|---|

Main hypothesis | Principle of indifference | Frequency of occurrence | Degree of belief | Degree of causal connection |

Conceptual basis | Hypothetical symmetry | Past data and reference class | Knowledge and intuition | Present state of system |

Conceptual approach | Conjectural | Empirical | Subjective | Metaphysical |

Single case possible | Yes | No | Yes | Yes |

Precise | Yes | No | No | Yes |

Problems | Ambiguity in principle of indifference | Reference class problem | Unchecked opinion | Disputed concept |

^{[2]} (p 1132)

## Classical definition

{{#invoke:main|main}}
The first attempt at mathematical rigour in the field of probability, championed by Pierre-Simon Laplace, is now known as the **classical definition**. Developed from studies of games of chance (such as rolling dice) it states that probability is shared equally between all the possible outcomes, provided these outcomes can be deemed equally likely.^{[1]} (3.1)

The theory of chance consists in reducing all the events of the same kind to a certain number of cases equally possible, that is to say, to such as we may be equally undecided about in regard to their existence, and in determining the number of cases favorable to the event whose probability is sought. The ratio of this number to that of all the cases possible is the measure of this probability, which is thus simply a fraction whose numerator is the number of favorable cases and whose denominator is the number of all the cases possible.

— Pierre-Simon Laplace,

A Philosophical Essay on Probabilities^{[6]}

This can be represented mathematically as follows:
If a random experiment can result in N mutually exclusive and equally likely outcomes and if N_{A} of these outcomes result in the occurrence of the event A, the **probability of A** is defined by

There are two clear limitations to the classical definition.^{[16]} Firstly, it is applicable only to situations in which there is only a 'finite' number of possible outcomes. But some important random experiments, such as tossing a coin until it rises heads, give rise to an infinite set of outcomes. And secondly, you need to determine in advance that all the possible outcomes are equally likely without relying on the notion of probability to avoid circularity—for instance, by symmetry considerations.

## Frequentism

{{#invoke:main|main}}
Frequentists posit that the probability of an event is its relative frequency over time,^{[1]} (3.4) i.e., its relative frequency of occurrence after repeating a process a large number of times under similar conditions. This is also known as aleatory probability. The events are assumed to be governed by some random physical phenomena, which are either phenomena that are predictable, in principle, with sufficient information (see Determinism); or phenomena which are essentially unpredictable. Examples of the first kind include tossing dice or spinning a roulette wheel; an example of the second kind is radioactive decay. In the case of tossing a fair coin, frequentists say that the probability of getting a heads is 1/2, not because there are two equally likely outcomes but because repeated series of large numbers of trials demonstrate that the empirical frequency converges to the limit 1/2 as the number of trials goes to infinity.

If we denote by the number of occurrences of an event in trials, then if we say that *
*

The frequentist view has its own problems. It is of course impossible to actually perform an infinity of repetitions of a random experiment to determine the probability of an event. But if only a finite number of repetitions of the process are performed, different relative frequencies will appear in different series of trials. If these relative frequencies are to define the probability, the probability will be slightly different every time it is measured. But the real probability should be the same every time. If we acknowledge the fact that we only can measure a probability with some error of measurement attached, we still get into problems as the error of measurement can only be expressed as a probability, the very concept we are trying to define. This renders even the frequency definition circular.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}}

## Logical, epistemic, and inductive probability

{{#invoke:main|main}}

It is widely recognised that the term "probability" is sometimes used in contexts where it has nothing to do with physical randomness.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} Consider, for example, the claim that the extinction of the dinosaurs was **probably** caused by a large meteorite hitting the earth. Statements such as "Hypothesis H is probably true" have been interpreted to mean that the (presently available) empirical evidence (E, say) supports H to a high degree. This degree of support of H by E has been called the **logical** probability of H given E, or the **epistemic probability** of H given E, or the **inductive probability** of H given E.

The differences between these interpretations are rather small, and may seem inconsequential. One of the main points of disagreement lies in the relation between probability and belief. Logical probabilities are conceived (for example in Keynes' Treatise on Probability^{[11]}) to be objective, logical relations between propositions (or sentences), and hence not to depend in any way upon belief. They are degrees of (partial) entailment, or degrees of logical consequence, not degrees of belief. (They do, nevertheless, dictate proper degrees of belief, as is discussed below.) Frank P. Ramsey, on the other hand, was skeptical about the existence of such objective logical relations and argued that (evidential) probability is "the logic of partial belief".^{[9]} (p 157) In other words, Ramsey held that epistemic probabilities simply *are* degrees of rational belief, rather than being logical relations that merely *constrain* degrees of rational belief.

Another point of disagreement concerns the *uniqueness* of evidential probability, relative to a given state of knowledge. Rudolf Carnap held, for example, that logical principles always determine a unique logical probability for any statement, relative to any body of evidence.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} Ramsey, by contrast, thought that while degrees of belief are subject to some rational constraints (such as, but not limited to, the axioms of probability) these constraints usually do not determine a unique value.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} Rational people, in other words, may differ somewhat in their degrees of belief, even if they all have the same information.

## Propensity

{{#invoke:main|main}}

Propensity theorists think of probability as a physical propensity, or disposition, or tendency of a given type of physical situation to yield an outcome of a certain kind or to yield a long run relative frequency of such an outcome.^{[17]} This kind of objective probability is sometimes called 'chance'.

Propensities, or chances, are not relative frequencies, but purported causes of the observed stable relative frequencies. Propensities are invoked to explain why repeating a certain kind of experiment will generate given outcome types at persistent rates,{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} which are known as propensities or chances. Frequentists are unable to take this approach,{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} since relative frequencies do not exist for single tosses of a coin, but only for large ensembles or collectives. In contrast, a propensitist is able to use the law of large numbers to explain the behaviour of long-run frequencies. This law, which is a consequence of the axioms of probability, says that if (for example) a coin is tossed repeatedly many times, in such a way that its probability of landing heads is the same on each toss, and the outcomes are probabilistically independent, then the relative frequency of heads will be close to the probability of heads on each single toss. This law allows that stable long-run frequencies are a manifestation of invariant *single-case* probabilities. In addition to explaining the emergence of stable relative frequencies, the idea of propensity is motivated{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} by the desire to make sense of single-case probability attributions in quantum mechanics, such as the probability of decay of a particular atom at a particular time.

The main challenge facing propensity theories is to say exactly what propensity means. (And then, of course, to show that propensity thus defined has the required properties.) At present, unfortunately, none of the well-recognised accounts of propensity comes close to meeting this challenge.

A propensity theory of probability was given by Charles Sanders Peirce.^{[18]}^{[19]}^{[20]}^{[21]} A later propensity theory was proposed by philosopher Karl Popper, who had only slight acquaintance with the writings of C. S. Peirce, however.^{[18]}^{[19]} Popper noted that the outcome of a physical experiment is produced by a certain set of "generating conditions". When we repeat an experiment, as the saying goes, we really perform another experiment with a (more or less) similar set of generating conditions. To say that a set of generating conditions has propensity *p* of producing the outcome *E* means that those exact conditions, if repeated indefinitely, would produce an outcome sequence in which *E* occurred with limiting relative frequency *p*. For Popper then, a deterministic experiment would have propensity 0 or 1 for each outcome, since those generating conditions would have same outcome on each trial.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} In other words, non-trivial propensities (those that differ from 0 and 1) only exist for genuinely indeterministic experiments.

A number of other philosophers, including David Miller and Donald A. Gillies, have proposed propensity theories somewhat similar to Popper's.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}}

Other propensity theorists (e.g. Ronald Giere{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}}) do not explicitly define propensities at all, but rather see propensity as defined by the theoretical role it plays in science. They argue{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}}, for example, that physical magnitudes such as electrical charge cannot be explicitly defined either, in terms of more basic things, but only in terms of what they do (such as attracting and repelling other electrical charges). In a similar way, propensity is whatever fills the various roles that physical probability plays in science.

What roles does physical probability play in science? What are its properties? One central property of chance is that, when known, it constrains rational belief to take the same numerical value. David Lewis called this the *Principal Principle*,^{[1]} (3.3 & 3.5) a term that philosophers have mostly adopted.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} For example, suppose you are certain that a particular biased coin has propensity 0.32 to land heads every time it is tossed. What is then the correct price for a gamble that pays $1 if the coin lands heads, and nothing otherwise? According to the Principal Principle, the fair price is 32 cents.

## Subjectivism

{{#invoke:main|main}}
Subjectivists, also known as **Bayesians** or followers of **epistemic probability**, give the notion of probability a subjective status by regarding it as a measure of the 'degree of belief' of the individual assessing the uncertainty of a particular situation. Epistemic or subjective probability is sometimes called **credence**, as opposed to the term **chance** for a propensity probability.

Some examples of epistemic probability are to assign a probability to the proposition that a proposed law of physics is true, and to determine how probable it is that a suspect committed a crime, based on the evidence presented{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}}.

Gambling odds don't reflect the bookies' belief in a likely winner, so much as the other bettors' belief, because the bettors are actually betting against one another. The odds are set based on how many people have bet on a possible winner, so that even if the high odds players always win, the bookies will always make their percentages anyway.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}}

The use of Bayesian probability raises the philosophical debate as to whether it can contribute valid justifications of belief.

Bayesians point to the work of Ramsey^{[9]} (p 182) and de Finetti^{[7]} (p 103) as proving that subjective beliefs must follow the laws of probability if they are to be coherent.^{[22]} Evidence casts doubt on human coherence.^{[23]}^{[24]}

The use of Bayesian probability involves specifying a prior probability. This may be obtained from consideration of whether the required prior probability is greater or lesser than a reference probabilityTemplate:Clarify associated with an urn model or a thought experiment. The issue is that for a given problem, multiple thought experiments could apply, and choosing one is a matter of judgement: different people may assign different prior probabilities, known as the reference class problem. The "sunrise problem" provides an example.

## Prediction

{{#invoke:main|main}}
An alternative account of probability emphasizes the role of *prediction* – predicting future observations on the basis of past observations, not on unobservable parameters. In its modern form, it is mainly in the Bayesian vein.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}} This was the main function of probability before the 20th century,^{[25]}
but fell out of favor compared to the parametric approach, which modeled phenomena as a physical system that was observed with error, such as in celestial mechanics.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B=
{{#invoke:Category handler|main}}{{#invoke:Category handler|main}}^{[citation needed]}
}}
The modern predictive approach was pioneered by Bruno de Finetti, with the central idea of exchangeability – that future observations should behave like past observations.^{[25]} This view came to the attention of the Anglophone world with the 1974 translation of de Finetti's book,^{[25]} and has
since been propounded by such statisticians as Seymour Geisser.

## Axiomatic probability

The mathematics of probability can be developed on an entirely axiomatic basis that is independent of any interpretation: see the articles on probability theory and probability axioms for a detailed treatment.

## See also

- Philosophy of statistics
- Frequency (statistics)
- Negative probability
- Pignistic probability
- Sunrise problem
- Probability amplitude (quantum mechanics)

## References

- ↑
^{1.0}^{1.1}^{1.2}^{1.3}{{#invoke:citation/CS1|citation |CitationClass=citation }} The taxonomy of probability interpretations given here is similar to that of the longer and more complete Interpretations of Probability article in the online Stanford Encyclopedia of Philosophy. References to that article include a parenthetic section number where appropriate. A partial outline of that article:- Section 2: Criteria of adequacy for the interpretations of probability
- Section 3:
- 3.1 Classical Probability
- 3.2 Logical Probability
- 3.3 Subjective Probability
- 3.4 Frequency Interpretations
- 3.5 Propensity Interpretations

- ↑
^{2.0}^{2.1}{{#invoke:Citation/CS1|citation |CitationClass=journal }} "There are several schools of thought regarding the interpretation of probabilities, none of them without flaws, internal contradictions, or paradoxes." (p 1129) "There are no standard classifications of probability interpretations, and even the more popular ones may suffer subtle variations from text to text." (p 1130) The classification in this article is representative, as are the authors and ideas claimed for each classification. - ↑ {{#invoke:citation/CS1|citation |CitationClass=book }}
- ↑ {{#invoke:citation/CS1|citation |CitationClass=book }} English translation of the original 1935 German. ASIN: B000R0D5MS
- ↑ {{#invoke:citation/CS1|citation |CitationClass=book }} English translation of the third German edition of 1951 which was published 30 years after the first German edition.
- ↑
^{6.0}^{6.1}Laplace, P. S., 1814, English edition 1951, A Philosophical Essay on Probabilities, New York: Dover Publications Inc. - ↑
^{7.0}^{7.1}{{#invoke:citation/CS1|citation |CitationClass=book }} Translation of the 1937 French original with later notes added. - ↑
^{8.0}^{8.1}{{#invoke:citation/CS1|citation |CitationClass=book }} - ↑
^{9.0}^{9.1}^{9.2}{{#invoke:citation/CS1|citation |CitationClass=book }} Contains three chapters (essays) by Ramsey. The electronic version contains only those three. - ↑ {{#invoke:citation/CS1|citation |CitationClass=book }}
- ↑
^{11.0}^{11.1}{{#invoke:citation/CS1|citation |CitationClass=book }} - ↑ {{#invoke:citation/CS1|citation
|CitationClass=book
}} Carnap coined the notion
*"probability*and_{1}"*"probability*for evidential and physical probability, respectively._{2}" - ↑ Fermat and Pascal on Probability (@ socsci.uci.edu)
- ↑ Laszlo E. Szabo,
*A Physicalist Interpretation of Probability*(Talk presented on the Philosophy of Science Seminar, Eötvös, Budapest, 8 October 2001.) - ↑ Laszlo E. Szabo, Objective probability-like things with and without objective indeterminism, Studies in History and Philosophy of Modern Physics 38 (2007) 626–634 (
*Preprint*) - ↑ {{#invoke:citation/CS1|citation |CitationClass=book }}
- ↑ {{#invoke:citation/CS1|citation |CitationClass=book }}
- ↑
^{18.0}^{18.1}{{#invoke:Citation/CS1|citation |CitationClass=journal }} - ↑
^{19.0}^{19.1}{{#invoke:Citation/CS1|citation |CitationClass=journal }} - ↑ {{#invoke:citation/CS1|citation |CitationClass=book }}
- ↑ Peirce, Charles Sanders and Burks, Arthur W., ed. (1958), the
*Collected Papers of Charles Sanders Peirce*Volumes 7 and 8, Harvard University Press, Cambridge, MA, also Belnap Press (of Harvard University Press) edition, vols. 7-8 bound together, 798 pages, online via InteLex, reprinted in 1998 Thoemmes Continuum. - ↑ {{#invoke:citation/CS1|citation |CitationClass=book }}
- ↑ {{#invoke:citation/CS1|citation |CitationClass=book }} The book contains numerous examples of the difference between idealized and actual thought. "[W]hen called upon to judge probability, people actually judge something else and believe they have judged probability." (p 98)
- ↑ {{#invoke:Citation/CS1|citation |CitationClass=journal }} Statistical decisions are consistently superior to the subjective decisions of experts.
- ↑
^{25.0}^{25.1}^{25.2}*Predictive Inference: An Introduction*, Seymour Geisser, CRC Press, 1993 ISBN 0-412-03471-9

## Further reading

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }} A comprehensive monograph covering the four principal current interpretations: logical, subjective, frequency, propensity.

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- Paul Humphreys, ed. (1994)
*Patrick Suppes: Scientific Philosopher*, Synthese Library, Springer-Verlag.- Vol. 1:
*Probability and Probabilistic Causality*. - Vol. 2:
*Philosophy of Physics, Theory Structure and Measurement, and Action Theory*.

- Vol. 1:
- Jackson, Frank, and Robert Pargetter (1982) "Physical Probability as a Propensity,"
*Noûs*16(4): 567–583. - {{#invoke:citation/CS1|citation

|CitationClass=book }} Covers mostly non-Kolmogorov probability models, particularly with respect to quantum physics.

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

- {{#invoke:citation/CS1|citation

|CitationClass=book }}

## External links

{{ safesubst:#invoke:Unsubst||$N=Use dmy dates |date=__DATE__ |$B= }}