# Law of total probability

In probability theory, the law (or formula) of total probability is a fundamental rule relating marginal probabilities to conditional probabilities. It expresses the total probability of an outcome which can be realized via several distinct events - hence the name.

## Statement

The law of total probability is[1] the proposition that if ${\displaystyle \left\{{B_{n}:n=1,2,3,\ldots }\right\}}$ is a finite or countably infinite partition of a sample space (in other words, a set of pairwise disjoint events whose union is the entire sample space) and each event ${\displaystyle B_{n}}$ is measurable, then for any event ${\displaystyle A}$ of the same probability space:

${\displaystyle \Pr(A)=\sum _{n}\Pr(A\cap B_{n})\,}$

or, alternatively,[1]

${\displaystyle \Pr(A)=\sum _{n}\Pr(A\mid B_{n})\Pr(B_{n}),\,}$

where, for any ${\displaystyle n\,}$ for which ${\displaystyle \Pr(B_{n})=0\,}$ these terms are simply omitted from the summation, because ${\displaystyle \Pr(A\mid B_{n})\,}$ is finite.

The summation can be interpreted as a weighted average, and consequently the marginal probability, ${\displaystyle \Pr(A)}$, is sometimes called "average probability";[2] "overall probability" is sometimes used in less formal writings.[3]

The law of total probability can also be stated for conditional probabilities. Taking the ${\displaystyle B_{n}}$ as above, and assuming ${\displaystyle C}$ is an event independent with any of the ${\displaystyle B_{n}}$:

${\displaystyle \Pr(A\mid C)=\sum _{n}\Pr(A\mid C\cap B_{n})\Pr(B_{n}\mid C)=\sum _{n}\Pr(A\mid C\cap B_{n})\Pr(B_{n})}$

## Informal formulation

The above mathematical statement might be interpreted as follows: given an outcome ${\displaystyle A}$, with known conditional probabilities given any of the ${\displaystyle B_{n}}$ events, each with a known probability itself, what is the total probability that ${\displaystyle A}$ will happen?. The answer to this question is given by ${\displaystyle \Pr(A)}$.

## Example

Suppose that two factories supply light bulbs to the market. Factory X's bulbs work for over 5000 hours in 99% of cases, whereas factory Y's bulbs work for over 5000 hours in 95% of cases. It is known that factory X supplies 60% of the total bulbs available. What is the chance that a purchased bulb will work for longer than 5000 hours?

Applying the law of total probability, we have:

where

Thus each purchased light bulb has a 97.4% chance to work for more than 5000 hours.

## Applications

One common application of the law is where the events coincide with a discrete random variable X taking each value in its range, i.e. ${\displaystyle B_{n}}$ is the event ${\displaystyle X=x_{n}}$. It follows that the probability of an event A is equal to the expected value of the conditional probabilities of A given ${\displaystyle X=x_{n}}$.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B= {{#invoke:Category handler|main}}{{#invoke:Category handler|main}}[citation needed] }} That is, ${\displaystyle \Pr(A)=\sum _{n}\Pr(A\mid X=x_{n})\Pr(X=x_{n})=\operatorname {E} [\Pr(A\mid X)],}$ where Pr(A | X) is the conditional probability of A given the value of the random variable X.[3] This conditional probability is a random variable in its own right, whose value depends on that of X. The conditional probability Pr(A | X = x) is simply a conditional probability given an event, [X = x]. It is a function of x, say g(x) = Pr(A | X = x). Then the conditional probability Pr(A | X) is g(X), hence itself a random variable. This version of the law of total probability says that the expected value of this random variable is the same as Pr(A). This result can be generalized to continuous random variables (via continuous conditional density), and the expression becomes ${\displaystyle \Pr(A)=\operatorname {E} [\Pr(A\mid {\mathcal {F}}_{X})],}$ where ${\displaystyle {\mathcal {F}}_{X}}$ denotes the sigma-algebra generated by the random variable X.{{ safesubst:#invoke:Unsubst||date=__DATE__ |$B= {{#invoke:Category handler|main}}{{#invoke:Category handler|main}}[citation needed] }}

## Other names

The term law of total probability is sometimes taken to mean the law of alternatives, which is a special case of the law of total probability applying to discrete random variables.{{ safesubst:#invoke:Unsubst||date=__DATE__ |\$B= {{#invoke:Category handler|main}}{{#invoke:Category handler|main}}[citation needed] }} One author even uses the terminology "continuous law of alternatives" in the continuous case.[4] This result is given by Grimmett and Welsh[5] as the partition theorem, a name that they also give to the related law of total expectation.