Probability, Random Variable, and Distribution
Oh, Hyunzi. (email: wisdom302@naver.com)
Korea University, Graduate School of Economics.
Main References
Here, we briefly introduce more rigorous definitions of the probability theory based on the measure theory. The main goal of this note is to understand the concept of probability measure intuitively, and get familiar with the jargon of measure theory. The detailed theorems and results of the measure theory used in this section will be further analyzed in 📑Note for Probability Theory📑Note for Probability Theory, with mathematical proofs.
The goal of this note is to fully understand the concepts of the key elements consisting the probability space.
A probability space is the triple
In later chapter Probability MeasureProbability Measure, we will discuss about how to understand $\sigma-$field
A set function
Consider an experiment of tossing a coin. The two possible outcomes are head(
For an event
Let a probability space
A random variable
A function
Here, you can simply understand the Borel sets as a collection of subsets of the real line. A detailed explanations follows in Borel MeasureBorel Measure. Now we define a probability (measure) on
In the given sample space
A random variable is simply a mapping that maps each outcome to a real number where we can use well developed mathematical tools.
We now re-define some of the familiar concepts.
A distribution
A (cumulative) distribution function
See Basic Definitions in Probability > ^24cd70Basic Definitions in Probability > Theorem 7 (properties of distribution function) for the proof.
A probability mass function (pmf)
The density function of a continuous variable is more tricky to define.
If a measure
Note that the absolute continuity of
A probability density function (pdf)
First, we follow the previous notions of the sample space and the set of all events denoted
A sigma-algebra is a subset of the poser set
For a sample space
For
Here, note that the term 'sigma-' in mathematics usually implies 'countable union' or 'countably infinite'. Thus,
Furthermore, since countable set is isomorphic to
Let
For some
Let
And a triple
Here, the countable additivity is the key property that makes a measure a generalized length.
Let
Measure Theoretic Preliminaries > ^cdc44bMeasure Theoretic Preliminaries > Theorem 18 (properties of measure) for the proof.
From the definition of measuremeasure, by adding one more property, we finally have the definition of the probability measure.
Let
And the triple
Let
A measurable set
Let
Compared to the empty set, null set refers to the set which is practically non-existing, while empty set denotes the set which is actually empty. Here, the null means meaningless, insignificance, or negligible, rather then the absence.
Let
Intuitively, 'almost surely' refers to the every point except the null set. This concept is similar to the integral where
The sequence of random vectors
Note that in ^4fba4cDefinition 24 (almost surely converges), we do not care about the timing when
Let
Here, the term e.v. and i.o. emphasizes that in terms of convergence in a sequence, the only important thing is the long-term behavior, not the behavior in the first finite horizon of the time.
Define a sequence of sets as
Usually, ^2ff43bRemark 26 (almost sure convergence, and ev and io) is another common way to define ^4fba4cDefinition 24 (almost surely converges). For the brief understanding, suppose
Thus we have
Intuitively, a measurable function is a function between the two measurable spacemeasurable space that preserves the structure of the spaces. Here, the inverse image of any measurable function is measurablemeasurable.
Let
It is shown that it is equivalent to show the following conditions to prove the given function is measurable.
For a function
Proof.As 1 and 2 are complement to each other, similarly, 3 and 4 are complement to each other. By ^27dce2Definition 14 (sigma-algebra and measurable space), 1 and 2, and 3 and 4 are equivalent conditions (since the complement set of the element in
(
(
Remark that ^03f1b6Definition 27 (measurable function) is analogue to Continuous Function > ^1bc20fContinuous Function > Definition 5 (continuous function between topological spaces), which gives the sense of the preserving properties of the measurable function.
For your information, measurable function can be defined in more fundamental way, not directly defining from its inverse-image. In that case, we can also derive the same property of ^03f1b6Definition 27 (measurable function).
Furthermore, measurable function can alternatively obtained from the pointwise convergence of the sequence of simple functions.
Let
In real space, Borel algebra is the intersection of all sigma-algebrasigma-algebra that includes open sets in
Let
While we defined Borel algebra under Euclidean space, it is more general to start on an arbitrary set. However, in probability theory, it is sufficient enough to define on
Based on the closure properties of the
Let
Note that the equivalence in ^f8eea7Definition 31 (random variable) holds by the properties of ^67466eDefinition 29 (Borel sets on Euclidean space). Random variable is a mapping that maps each elements in the sample space
Remark that by the definition,
For your information, to generalize the definition into the multivariate random variable, we can simply define
Let
Let
The set function
Proof.Let
Thus,
Using ^f02a69Definition 33 (probability distribution), we can re-define the independence of the random variable using measure theory.
Let
Note that the independence between
The following conditions are equivalent.
Let
Note that the probability distribution
Consider the case when
Thus we can now define a general form of discrete probability distribution.
Let
Classical examples are:
Note that we can also define a random variable that is neither discrete nor continuous.
Suppose a car leaves city A at random between 1 pm and 2 pm. It travels at 100 km/h towards B which is 50 km from A. What is the probability distribution of the distance between the car and B at 2 pm?
Proof.If the car starts traveling before 1.30 pm, then it can arrive at B before 2 pm. However, if it starts after 1.30, then its distance from the B would follow a uniform distribution. Thus we have
Then, its probability distribution can be expressed as