Skip to Content
Math401Math 401, Summer 2025: Freiwald research project notesMath 401, Topic 1: Probability under language of measure theory

Math401 Topic 1: Probability under language of measure theory

Section 1: Uniform Random Numbers

Basic Definitions

Definition of Random Variables

A random variable is a function f:[0,1]Sf:[0,1]\to S, where [0,1]R[0,1]\subset \mathbb{R} and SS is a set of potential outcomes of a random phenomenon.

Definition of Uniform Distribution

The uniform distribution is defined by the length of function on subsets of [0,1][0,1] as a measure of probability (Lebesgue measure  by default).

Let XX be a random number taken from [0,1][0,1] and having the uniform distribution. The probability that XX should be the probability of the event that XX lies in AA.

Prob(XA)=λ(A)=length of A\operatorname{Prob}(X\in A) =\lambda(A)=\text{length of }A

Definition of Expectation

Let f:[0,1]Rf:[0,1]\to \mathbb{R} be a random variable (with nice properties such that it is integrable). Then the expectation of ff is defined as

E[f]=E[f(X)]=01f(x)dx\mathbb{E}[f]=\mathbb{E}[f(X)]=\int_0^1 f(x)dx

Definition of Indicator Function

The indicator function of an event AA is defined as

IA(x)={1if xA0if xA\mathbb{I}_A(x)=\begin{cases} 1 & \text{if } x\in A \\ 0 & \text{if } x\notin A \end{cases}

Definition of Law of variable X

The law of a random variable XX is the probability distribution of XX.

Let YY be the outcome of f(X)f(X). Then the law of YY is the probability distribution of YY.

μY(A)=λ(f1(A))=λ({x[0,1]:f(x)A})\mu_Y(A)=\lambda(f^{-1}(A))=\lambda(\{x\in [0,1]: f(x)\in A\})

1.1 Mathematical Coin Flip model

A coin flip if a random experiment with two possible outcomes: S={0,1}S=\{0,1\}. with probability pp for 00 and 1p1-p for 11, where p(0,1)Rp\in (0,1)\subset \mathbb{R}.

Definition of Independent Events

Two events AA and BB are independent if

λ(AB)=λ(A)λ(B)\lambda(A\cap B)=\lambda(A)\lambda(B)

or equivalently,

Prob(XAB)=Prob(XA)Prob(XB)\operatorname{Prob}(X\in A\cap B)=\operatorname{Prob}(X\in A)\operatorname{Prob}(X\in B)

Generalization to nn events:

λ(A1A2An)=λ(A1)λ(A2)λ(An)\lambda(A_1\cap A_2\cap \cdots \cap A_n)=\lambda(A_1)\lambda(A_2)\cdots \lambda(A_n)

Definition of Outcome selecting function

Let the set of all possible outcomes represented by a Cartesian product S={0,1}NS=\{0,1\}^{\mathbb{N}}. (a1,a2,a3,)S(a_1,a_2,a_3,\cdots)\subset S is an infinite (or finite) sequence of coin flips.

πi:S{0,1}\pi_i:S\to \{0,1\} is the ii-th projection function defined as πi((a1,a2,a3,))=ai\pi_i((a_1,a_2,a_3,\cdots))=a_i.

Note, this representation is isomorphic to the dyadic rationals (i.e., numbers that can be written as a fraction whose denominator is a power of 2) in the interval [0,1][0,1].

Section 2: Formal definitions

Recall, the σ\sigma-algebra (denoted as A\mathcal{A} in Math4121) is the collection of all subsets of a set SS satisfying the following properties:

  1. A\emptyset\in \mathcal{A} (empty set is in the σ\sigma-algebra)
  2. If AAA\in \mathcal{A}, then AcAA^c\in \mathcal{A} (if a set is in the σ\sigma-algebra, then its complement is in the σ\sigma-algebra)
  3. If A1,A2,A3,AA_1,A_2,A_3,\cdots\in \mathcal{A}, then i=1AiA\bigcup_{i=1}^{\infty}A_i\in \mathcal{A} (if a countable sequence of sets is in the σ\sigma-algebra, then their union is in the σ\sigma-algebra)

Event, probability, and random variable

Let Ω\Omega be a non-empty set.

Let F\mathscr{F} be a σ\sigma-algebra on Ω\Omega (Note, F\mathscr{F} is a collection of subsets of Ω\Omega that satisfies the properties of a σ\sigma-algebra).

Definition of Event

An event is a element of F\mathscr{F}.

Definition of Probability Measure

A probability measure PP is a function P:F[0,1]P:\mathscr{F}\to [0,1] satisfying the following properties:

  1. P(Ω)=1P(\Omega)=1
  2. If A1,A2,A3,FA_1,A_2,A_3,\cdots\in \mathscr{F} are pairwise disjoint (ij,AiAj=\forall i\neq j, A_i\cap A_j=\emptyset), then P(i=1Ai)=i=1P(Ai)P(\bigcup_{i=1}^{\infty}A_i)=\sum_{i=1}^{\infty}P(A_i)

Definition of Probability Space

A probability space is a triple (Ω,F,P)(\Omega, \mathscr{F}, P) defined above.

An event AA is said to occur almost surely (a.s.) if P(A)=1P(A)=1.

Definition of Random Variable

A random variable is a function X:ΩRX:\Omega\to \mathbb{R} that is measurable with respect to the σ\sigma-algebra F\mathscr{F}.

That is, for any Borel set BRB\subset \mathbb{R}, the preimage f1(B)Ff^{-1}(B)\in \mathscr{F}.

f1(B)={xΩ:f(x)B}Ff^{-1}(B)=\{x\in \Omega: f(x)\in B\}\in \mathscr{F}

Definition of sigma-algebra generated by a random variable

Let {fα:ΩR,αI}\{f_\alpha:\Omega\to \mathbb{R},\alpha\in I\} be a family of functions where II is an index set which is not necessarily finite or countable. The σ\sigma-algebra generated by the family of functions {fα:αI}\{f_\alpha:\alpha\in I\}, denoted as σ{fα:αI}\sigma\{f_\alpha:\alpha\in I\}, is the smallest σ\sigma-algebra containing all the subsets of Ω\Omega of the form

fα1(B)={ωΩ:fα(ω)B}Ff_\alpha^{-1}(B)=\{\omega\in \Omega: f_\alpha(\omega)\in B\}\in \mathscr{F}

for all αI\alpha\in I and BB(R)B\in \mathscr{B}(\mathbb{R}).

Equivalently,

σ{fα:αI}=σ(αIfα1(B))\sigma\{f_\alpha:\alpha\in I\}=\sigma\left(\bigcup_{\alpha\in I}f_\alpha^{-1}(B)\right)

the sigma-algebra generated by a random variable XX is the intersection of all σ\sigma-algebras on Ω\Omega containing the sets fα1(B)f_\alpha^{-1}(B) for all αI\alpha\in I and BB(R)B\in \mathscr{B}(\mathbb{R}).

Definition of distribution of random variable

Let f:ΩRf:\Omega\to \mathbb{R} be a random variable. The distribution of ff is the probability measure PfP_f on R\mathbb{R} defined by

Pf(B)=P(f1(B))=P({xΩ:f(x)B})P_f(B)=P(f^{-1}(B))=P(\{x\in \Omega: f(x)\in B\})

also noted as fPf_*P.

Definition of joint distribution of random variables

Let f1,f2,,fn:ΩRf_1,f_2,\cdots,f_n:\Omega\to \mathbb{R} be random variables. The joint distribution of f1,f2,,fnf_1,f_2,\cdots,f_n is the probability measure Pf1,f2,,fnP_{f_1,f_2,\cdots,f_n} on Rn\mathbb{R}^n defined by

Pf1,f2,,fn(B)=P(f11(B1)f21(B2)fn1(Bn))=P(ωΩ:(f1(ω),f2(ω),,fn(ω))B)P_{f_1,f_2,\cdots,f_n}(B)=P(f_1^{-1}(B_1)\cap f_2^{-1}(B_2)\cap \cdots \cap f_n^{-1}(B_n))=P(\omega\in \Omega: (f_1(\omega),f_2(\omega),\cdots,f_n(\omega))\in B)

Expectation of a random variable

Let f:ΩRf:\Omega\to \mathbb{R} be a random variable. The expectation of ff is defined as

E[f]=E[f(X)]=Ωf(x)dP\mathbb{E}[f]=\mathbb{E}[f(X)]=\int_\Omega f(x)dP

Note, PP is the probability measure on Ω\Omega.

Definition of variance

The variance of a random variable ff is defined as

Var(f)=E[(fE[f])2]=E[f2](E[f])2\operatorname{Var}(f)=\mathbb{E}[(f-\mathbb{E}[f])^2]=\mathbb{E}[f^2]-(\mathbb{E}[f])^2

Definition of covariance

The covariance of two random variables f,g:ΩRf,g:\Omega\to \mathbb{R} is defined as

Cov(f,g)=E[(fE[f])(gE[g])]\operatorname{Cov}(f,g)=\mathbb{E}[(f-\mathbb{E}[f])(g-\mathbb{E}[g])]

Point measures

Definition of Dirac measure

The Dirac measure is a probability measure on Ω\Omega defined as

δω(A)={1if ωA0if ωA\delta_\omega(A)=\begin{cases} 1 & \text{if } \omega\in A \\ 0 & \text{if } \omega\notin A \end{cases}

Note that Ωf(x)dδω(x)=f(ω)\int_\Omega f(x)d\delta_\omega(x)=f(\omega).

Infinite sequence of independent coin flips

Side notes from basic topology:

Definition of product topology:

It is a set constructed by the Cartesian product of the sets. Suppose XiX_i is a set for all iIi\in I. The element of the product set is a tuple (xi)iI(x_i)_{i\in I} where xiXix_i\in X_i for all iIi\in I.

For example, if Xi=[0,1]X_i=[0,1] for all iNi\in \mathbb{N}, then the product set is [0,1]N[0,1]^{\mathbb{N}}. An element of such product set is (1,0.5,0.25,)(1,0.5,0.25,\cdots).

The set of outcomes of such infinite sequence of coin flips is the product set of the set of outcomes of each coin flip.

S={0,1}NS=\{0,1\}^{\mathbb{N}}

Conditional probability

Definition of conditional probability

The conditional probability of an event AA given an event BB is defined as

P(AB)=P(AB)P(B)P(A|B)=\frac{P(A\cap B)}{P(B)}

The law of total probability:

P(A)=i=1P(ABi)P(Bi)P(A)=\sum_{i=1}^{\infty}P(A|B_i)P(B_i)

Bayes’ theorem:

P(BiA)=P(ABi)P(Bi)j=1P(ABj)P(Bj)P(B_i|A)=\frac{P(A|B_i)P(B_i)}{\sum_{j=1}^{\infty}P(A|B_j)P(B_j)}

Definition of independence of random variables

Two random variables f,g:ΩRf,g:\Omega\to \mathbb{R} are independent if for any Borel sets A,BB(R)A,B\subset \mathscr{B}(\mathbb{R}) the events

{ωΩ:f(ω)A} and {ωΩ:g(ω)B}\{\omega\in \Omega: f(\omega)\in A\}\text{ and } \{\omega\in \Omega: g(\omega)\in B\}

are independent.

In general, a finite or infinite family of random variables f1,f2,,fn:ΩRf_1,f_2,\cdots,f_n:\Omega\to \mathbb{R} are independent if every finite collection of random variables from this family are independent.

Definition of independence of sigma-algebras

Let G\mathscr{G} and H\mathscr{H} be two σ\sigma-algebras on Ω\Omega. They are independent if for any Borel sets AB(R)A\subset \mathscr{B}(\mathbb{R}) and BB(R)B\subset \mathscr{B}(\mathbb{R}), the finite collection of events are independent.

Section 3: Further definitions in measure theory and integration

L2L^2 space

Definition of L2L^2 space

Let (Ω,F,P)(\Omega, \mathscr{F}, P) be a measure space. The L2L^2 space is the space of all square integrable, complex-valued measurable functions on Ω\Omega.

Denoted by L2(Ω,F,P)L^2(\Omega, \mathscr{F}, P).

The square integrable functions are the functions f:ΩCf:\Omega\to \mathbb{C} such that

Ωf(ω)2dP(ω)<\int_\Omega |f(\omega)|^2 dP(\omega)<\infty

With inner product defined by

f,g=Ωf(ω)g(ω)dP(ω)\langle f,g\rangle=\int_\Omega \overline{f(\omega)}g(\omega)dP(\omega)

The L2(Ω,F,P)L^2(\Omega, \mathscr{F}, P) space is a Hilbert space.

Last updated on