Skip to Content
Math401Math 401, Fall 2025: Thesis notesMath 401, Fall 2025: Thesis notes, R1, Non-commutative probability theory

Math 401, Fall 2025: Thesis notes, R1, Non-commutative probability theory

Progress: 0/NaN=NaN% (denominator and enumerator may change)

Notations and definitions

This part will cover the necessary notations and definitions for the remaining parts of the recollection.

Notations of Linear algebra

Definition of vector space

link to vector space

A vector space over f\mathbb{f} is a set VV along with two operators v+wVv+w\in V for v,wVv,w\in V, and λv\lambda \cdot v for λF\lambda\in \mathbb{F} and vVv\in V satisfying the following properties:

  • Commutativity: v,wV,v+w=w+v\forall v, w\in V,v+w=w+v
  • Associativity: u,v,wV,(u+v)+w=u+(v+w)\forall u,v,w\in V,(u+v)+w=u+(v+w)
  • Existence of additive identity: 0V\exists 0\in V such that vV,0+v=v\forall v\in V, 0+v=v
  • Existence of additive inverse: vV,wV\forall v\in V, \exists w \in V such that v+w=0v+w=0
  • Existence of multiplicative identity: 1F\exists 1 \in \mathbb{F} such that vV,1v=v\forall v\in V,1\cdot v=v
  • Distributive properties: v,wV\forall v, w\in V and a,bF\forall a,b\in \mathbb{F}, a(v+w)=av+awa\cdot(v+w)=a\cdot v+ a\cdot w and (a+b)v=av+bv(a+b)\cdot v=a\cdot v+b\cdot v

Definition of inner product

link to inner product

An inner product is a bilinear function ,:V×VF\langle,\rangle:V\times V\to \mathbb{F} satisfying the following properties:

  • Positivity: v,v0\langle v,v\rangle\geq 0
  • Definiteness: v,v=0    v=0\langle v,v\rangle=0\iff v=0
  • Additivity: u+v,w=u,w+v,w\langle u+v,w\rangle=\langle u,w\rangle+\langle v,w\rangle
  • Homogeneity: λu,v=λu,v\langle \lambda u, v\rangle=\lambda\langle u,v\rangle
  • Conjugate symmetry: u,v=v,u\langle u,v\rangle=\overline{\langle v,u\rangle}

Examples of inner product

Let V=RnV=\mathbb{R}^n.

The dot product is defined by

u,v=u1v1+u2v2++unvn\langle u,v\rangle=u_1v_1+u_2v_2+\cdots+u_nv_n

is an inner product.


Let V=L2(R,λ)V=L^2(\mathbb{R}, \lambda), where λ\lambda is the Lebesgue measure. f,g:RCf,g:\mathbb{R}\to \mathbb{C} are complex-valued square integrable functions.

The Hermitian inner product is defined by

f,g=Rf(x)g(x)dλ(x)\langle f,g\rangle=\int_\mathbb{R} \overline{f(x)}g(x) d\lambda(x)

is an inner product.


Let A,BA,B be two linear transformation on Rn\mathbb{R}^n.

The Hilbert-Schmidt inner product is defined by

A,B=Tr(AB)=i=1nj=1naijbij\langle A,B\rangle=\operatorname{Tr}(A^*B)=\sum_{i=1}^n \sum_{j=1}^n \overline{a_{ij}}b_{ij}

is an inner product.

Definition of inner product space

A inner product space is a vector space equipped with an inner product.

Definition of completeness

link to completeness

Note that every inner product space is a metric space.

Let XX be a metric space. We say XX is complete if every Cauchy sequence (that is, a sequence such that ϵ>0,N\forall \epsilon>0, \exists N such that m,nN,d(pm,pn)<ϵ\forall m,n\geq N, d(p_m,p_n)<\epsilon) in XX converges.

Definition of Hilbert space

A Hilbert space is a complete inner product space.

Motivation of Tensor product

Recall from the traditional notation of product space of two vector spaces VV and WW, that is, V×WV\times W, is the set of all ordered pairs (v,w)(v,w) where vVv\in V and wWw\in W.

The space has dimension dimV+dimW\dim V+\dim W.

We want to define a vector space with notation of multiplication of two vectors from different vector spaces.

That is

(v1+v2)w=(v1w)+(v2w) and v(w1+w2)=(vw1)+(vw2)(v_1+v_2)\otimes w=(v_1\otimes w)+(v_2\otimes w)\text{ and } v\otimes (w_1+w_2)=(v\otimes w_1)+(v\otimes w_2)

and enables scalar multiplication by

λ(vw)=(λv)w=v(λw)\lambda (v\otimes w)=(\lambda v)\otimes w=v\otimes (\lambda w)

And we wish to build a way associates the basis of VV and WW to the basis of VWV\otimes W. That makes the tensor product a vector space with dimension dimV×dimW\dim V\times \dim W.

Definition of linear functional

Tip

Note the difference between a linear functional and a linear map.

A generalized linear map is a function f:VWf:V\to W satisfying the condition

  1. f(u+v)=f(u)+f(v)f(u+v)=f(u)+f(v)
  2. f(λv)=λf(v)f(\lambda v)=\lambda f(v)

A linear functional is a linear map from VV to F\mathbb{F}.

Definition of bilinear functional

A bilinear functional is a bilinear function β:V×WF\beta:V\times W\to \mathbb{F} satisfying the condition that vβ(v,w)v\to \beta(v,w) is a linear functional for all wWw\in W and wβ(v,w)w\to \beta(v,w) is a linear functional for all vVv\in V.

The vector space of all bilinear functionals is denoted by B(V,W)\mathcal{B}(V,W).

Definition of tensor product

Let V,WV,W be two vector spaces.

Let VV' and WW' be the dual spaces of VV and WW, respectively, that is V={ψ:VF}V'=\{\psi:V\to \mathbb{F}\} and W={ϕ:WF}W'=\{\phi:W\to \mathbb{F}\}, ψ,ϕ\psi, \phi are linear functionals.

The tensor product of vectors vVv\in V and wWw\in W is the bilinear functional defined by (ψ,ϕ)V×W\forall (\psi,\phi)\in V'\times W' given by the notation

(vw)(ψ,ϕ)ψ(v)ϕ(w)(v\otimes w)(\psi,\phi)\coloneqq\psi(v)\phi(w)

The tensor product of two vector spaces VV and WW is the vector space B(V,W)\mathcal{B}(V',W')

Notice that the basis of such vector space is the linear combination of the basis of VV' and WW', that is, if {ei}\{e_i\} is the basis of VV' and {fj}\{f_j\} is the basis of WW', then {eifj}\{e_i\otimes f_j\} is the basis of B(V,W)\mathcal{B}(V',W').

That is, every element of B(V,W)\mathcal{B}(V',W') can be written as a linear combination of the basis.

Since {ei}\{e_i\} and {fj}\{f_j\} are bases of VV' and WW', respectively, then we can always find a set of linear functionals {ϕi}\{\phi_i\} and {ψj}\{\psi_j\} such that ϕi(ej)=δij\phi_i(e_j)=\delta_{ij} and ψj(fi)=δij\psi_j(f_i)=\delta_{ij}.

Here δij={1if i=j0otherwise\delta_{ij}=\begin{cases} 1 & \text{if } i=j \\ 0 & \text{otherwise} \end{cases} is the Kronecker delta.

VW={i=1nj=1maijϕi(v)ψj(w):ϕiV,ψjW}V\otimes W=\left\{\sum_{i=1}^n \sum_{j=1}^m a_{ij} \phi_i(v)\psi_j(w): \phi_i\in V', \psi_j\in W'\right\}

Note that i=1nj=1maijϕi(v)ψj(w)\sum_{i=1}^n \sum_{j=1}^m a_{ij} \phi_i(v)\psi_j(w) is a bilinear functional that maps V×WV'\times W' to F\mathbb{F}.

This enables basis free construction of vector spaces with proper multiplication and scalar multiplication.

This vector space is equipped with the unique inner product vw,uxVW\langle v\otimes w, u\otimes x\rangle_{V\otimes W} defined by

vw,ux=v,uVw,xW\langle v\otimes w, u\otimes x\rangle=\langle v,u\rangle_V\langle w,x\rangle_W

In practice, we ignore the subscript of the vector space and just write vw,ux=v,uw,x\langle v\otimes w, u\otimes x\rangle=\langle v,u\rangle\langle w,x\rangle.

Note

All those definitions and proofs can be found in Linear Algebra Done Right by Sheldon Axler.

Notations in measure theory

Definition of Sigma algebra

link to measure theory

A collection of sets A\mathcal{A} is called a sigma-algebra if it satisfies the following properties:

  1. A\emptyset \in \mathcal{A}
  2. If {Aj}j=1A\{A_j\}_{j=1}^\infty \subset \mathcal{A}, then j=1AjA\bigcup_{j=1}^\infty A_j \in \mathcal{A}
  3. If AAA \in \mathcal{A}, then AcAA^c \in \mathcal{A}

Definition of Measure

A measure is a function v:ARv:\mathcal{A}\to \mathbb{R} satisfying the following properties:

  1. v()=0v(\emptyset)=0
  2. If {Aj}j=1A\{A_j\}_{j=1}^\infty \subset \mathcal{A} are pairwise disjoint, then v(j=1Aj)=j=1v(Aj)v(\bigcup_{j=1}^\infty A_j)=\sum_{j=1}^\infty v(A_j) (countable additivity)
  3. If AAA\in \mathcal{A}, then v(A)0v(A)\geq 0 (non-negativity)

Examples of measure

The Borel measure on R\mathbb{R} is the collection of all closed, open, and half-open intervals with m(U)=(U)m(U)=\ell(U) for any open set UU.

The Lebesgue measure on R\mathbb{R} is the collection of all Lebesgue measurable sets with mi=supK closed,KSm(K)m_i=\sup_{K\text{ closed},K\subseteq S}m(K) and me=infU open,SUm(U)m_e=\inf_{U\text{ open},S\subseteq U}m(U). and m(S)=me(S)=mi(S)m(S)=m_e(S)=m_i(S) for any Lebesgue measurable set SS.

Definition of Probability measure

Let F\mathscr{F} be a sigma-algebra on a set Ω\Omega. A probability measure is a function P:F[0,1]P:\mathscr{F}\to [0,1] satisfying the following properties:

  1. P(Ω)=1P(\Omega)=1
  2. PP is a measure on F\mathscr{F}

Definition of Measurable space

A measurable space is a pair (X,B,v)(X, \mathscr{B}, v), where XX is a set and B\mathscr{B} is a sigma-algebra on XX.

In some literatures, B\mathscr{B} is ignored and we only denote it as (X,v)(X, v).

Examples of measurable space

Let Ω\Omega be arbitrary set.

Let B(C)\mathscr{B}(\mathbb{C}) be the Borel sigma-algebra on C\mathbb{C} generated from rectangles over complex plane with real number axes and λ\lambda be the Lebesgue measure associated with it.

Let F\mathscr{F} be the set of square integrable, that is,

Ωf(x)2dλ(x)<\int_\Omega |f(x)|^2 d\lambda(x)<\infty

complex-valued functions on Ω\Omega, that is, f:ΩCf:\Omega\to \mathbb{C}.

Then the measurable space (Ω,B(C),λ)(\Omega, \mathscr{B}(\mathbb{C}), \lambda) is a measurable space. We usually denote this as L2(Ω,B(C),λ)L^2(\Omega, \mathscr{B}(\mathbb{C}), \lambda).

If Ω=R\Omega=\mathbb{R}, then we denote such measurable space as L2(R,λ)L^2(\mathbb{R}, \lambda).

Probability space

A probability space is a triple (Ω,F,P)(\Omega, \mathscr{F}, P), where Ω\Omega is a set, F\mathscr{F} is a sigma-algebra on Ω\Omega, and PP is a probability measure on F\mathscr{F}.

Lipschitz function

η\eta-Lipschitz function

Let (X,distX)(X,\operatorname{dist}_X) and (Y,distY)(Y,\operatorname{dist}_Y) be two metric spaces. A function f:XYf:X\to Y is said to be η\eta-Lipschitz if there exists a constant LRL\in \mathbb{R} such that

distY(f(x),f(y))LdistX(x,y)\operatorname{dist}_Y(f(x),f(y))\leq L\operatorname{dist}_X(x,y)

for all x,yXx,y\in X. And η=fLip=infLRL\eta=\|f\|_{\operatorname{Lip}}=\inf_{L\in \mathbb{R}}L.

That basically means that the function ff should not change the distance between any two pairs of points in XX by more than a factor of LL.

Operations on Hilbert space and Measurements

Basic definitions

SO(n)SO(n)

The special orthogonal group SO(n)SO(n) is the set of all distance preserving linear transformations on Rn\mathbb{R}^n.

It is the group of all n×nn\times n orthogonal matrices (AA=InA^\top A=I_n) on Rn\mathbb{R}^n with determinant 11.

SO(n)={ARn×n:AA=In,det(A)=1}SO(n)=\{A\in \mathbb{R}^{n\times n}: A^\top A=I_n, \det(A)=1\}

Extensions

In The random Matrix Theory of the Classical Compact groups , the author gives a more general definition of the Haar measure on the compact group SO(n)SO(n),

O(n)O(n) (the group of all n×nn\times n orthogonal matrices over R\mathbb{R}),

O(n)={ARn×n:AA=AA=In}O(n)=\{A\in \mathbb{R}^{n\times n}: AA^\top=A^\top A=I_n\}

U(n)U(n) (the group of all n×nn\times n unitary matrices over C\mathbb{C}),

U(n)={ACn×n:AA=AA=In}U(n)=\{A\in \mathbb{C}^{n\times n}: A^*A=AA^*=I_n\}

Recall that AA^* is the complex conjugate transpose of AA.

SU(n)SU(n) (the group of all n×nn\times n unitary matrices over C\mathbb{C} with determinant 11),

SU(n)={ACn×n:AA=AA=In,det(A)=1}SU(n)=\{A\in \mathbb{C}^{n\times n}: A^*A=AA^*=I_n, \det(A)=1\}

Sp(2n)Sp(2n) (the group of all 2n×2n2n\times 2n symplectic matrices over C\mathbb{C}),

Sp(2n)={UU(2n):UJU=UJU=J}Sp(2n)=\{U\in U(2n): U^\top J U=UJU^\top=J\}

where J=(0InIn0)J=\begin{pmatrix} 0 & I_n \\ -I_n & 0 \end{pmatrix} is the standard symplectic matrix.

Haar measure

Let (SO(n),,μ)(SO(n), \| \cdot \|, \mu) be a metric measure space where \| \cdot \| is the Hilbert-Schmidt norm  and μ\mu is the measure function.

The Haar measure on SO(n)SO(n) is the unique probability measure that is invariant under the action of SO(n)SO(n) on itself.

That is also called translation-invariant.

That is, fixing BSO(n)B\in SO(n), ASO(n)\forall A\in SO(n), μ(AB)=μ(BA)=μ(B)\mu(A\cdot B)=\mu(B\cdot A)=\mu(B).

The Haar measure is the unique probability measure that is invariant under the action of SO(n)SO(n) on itself.

The existence and uniqueness of the Haar measure is a theorem in compact lie group theory. For this research topic, we will not prove it.

Random sampling on the CPn\mathbb{C}P^n

Note that the space of pure state in bipartite system

Non-commutative probability theory

Pure state and mixed state

A pure state is a state that is represented by a unit vector in HN\mathscr{H}^{\otimes N}.

As analogy, a pure state is the basis element of the vector space, a mixed state is a linear combination of basis elements.

A mixed state is a state that is represented by a density operator (linear combination of pure states) in HN\mathscr{H}^{\otimes N}.

Partial trace and purification

Partial trace

Recall that the bipartite state of a quantum system is a linear operator on H=AB\mathscr{H}=\mathscr{A}\otimes \mathscr{B}, where A\mathscr{A} and B\mathscr{B} are finite-dimensional Hilbert spaces.

Definition of partial trace for arbitrary linear operators

Let TT be a linear operator on H=AB\mathscr{H}=\mathscr{A}\otimes \mathscr{B}, where A\mathscr{A} and B\mathscr{B} are finite-dimensional Hilbert spaces.

An operator TT on H=AB\mathscr{H}=\mathscr{A}\otimes \mathscr{B} can be written as (by the definition of tensor product of linear operators )

T=i=1naiAiBiT=\sum_{i=1}^n a_i A_i\otimes B_i

where AiA_i is a linear operator on A\mathscr{A} and BiB_i is a linear operator on B\mathscr{B}.

The B\mathscr{B}-partial trace of TT (TrB(T):L(AB)L(A)\operatorname{Tr}_{\mathscr{B}}(T):\mathcal{L}(\mathscr{A}\otimes \mathscr{B})\to \mathcal{L}(\mathscr{A})) is the linear operator on A\mathscr{A} defined by

TrB(T)=i=1naiTr(Bi)Ai\operatorname{Tr}_{\mathscr{B}}(T)=\sum_{i=1}^n a_i \operatorname{Tr}(B_i) A_i

Definition of partial trace for density operators

Let ρ\rho be a density operator in H1H2\mathscr{H}_1\otimes\mathscr{H}_2, the partial trace of ρ\rho over H2\mathscr{H}_2 is the density operator in H1\mathscr{H}_1 (reduced density operator for the subsystem H1\mathscr{H}_1) given by:

ρ1Tr2(ρ)\rho_1\coloneqq\operatorname{Tr}_2(\rho)

Examples

Let ρ=12(01+10)\rho=\frac{1}{\sqrt{2}}(|01\rangle+|10\rangle) be a density operator on H=C2C2\mathscr{H}=\mathbb{C}^2\otimes \mathbb{C}^2.

Expand the expression of ρ\rho in the basis of C2C2\mathbb{C}^2\otimes\mathbb{C}^2 using linear combination of basis vectors:

ρ=12(0101+0110+1001+1010)\rho=\frac{1}{2}(|01\rangle\langle 01|+|01\rangle\langle 10|+|10\rangle\langle 01|+|10\rangle\langle 10|)

Note Tr2(abcd)=acbd\operatorname{Tr}_2(|ab\rangle\langle cd|)=|a\rangle\langle c|\cdot \langle b|d\rangle.

Then the reduced density operator of the subsystem C2\mathbb{C}^2 in first qubit is, note the 00=11=1\langle 0|0\rangle=\langle 1|1\rangle=1 and 01=10=0\langle 0|1\rangle=\langle 1|0\rangle=0:

ρ1=Tr2(ρ)=12(1100+0101+1010+0011)=12(00+11)=12I\begin{aligned} \rho_1&=\operatorname{Tr}_2(\rho)\\ &=\frac{1}{2}(\langle 1|1\rangle |0\rangle\langle 0|+\langle 0|1\rangle |0\rangle\langle 1|+\langle 1|0\rangle |1\rangle\langle 0|+\langle 0|0\rangle |1\rangle\langle 1|)\\ &=\frac{1}{2}(|0\rangle\langle 0|+|1\rangle\langle 1|)\\ &=\frac{1}{2}I \end{aligned}

is a mixed state.

Purification

Let ρ\rho be any state  (may not be pure) on the finite dimensional Hilbert space H\mathscr{H}. then there exists a unit vector wHHw\in \mathscr{H}\otimes \mathscr{H} such that ρ=Tr2(ww)\rho=\operatorname{Tr}_2(|w\rangle\langle w|) is a pure state.

Proof

Let (u1,u2,,un)(u_1,u_2,\cdots,u_n) be an orthonormal basis of H\mathscr{H} consisting of eigenvectors of ρ\rho for the eigenvalues p1,p2,,pnp_1,p_2,\cdots,p_n. As ρ\rho is a states, pi0p_i\geq 0 for all ii and i=1npi=1\sum_{i=1}^n p_i=1.

We can write ρ\rho as

ρ=i=1npiuiui\rho=\sum_{i=1}^n p_i |u_i\rangle\langle u_i|

Let w=i=1npiuiuiw=\sum_{i=1}^n \sqrt{p_i} u_i\otimes u_i, note that ww is a unit vector (pure state). Then

Tr2(ww)=Tr2(i=1nj=1npipjuiuiujuj)=i=1nj=1npipjTr2(uiuiujuj)=i=1nj=1npipjuiujuiui=i=1nj=1npipjδijuiui=i=1npiuiui=ρ\begin{aligned} \operatorname{Tr}_2(|w\rangle\langle w|)&=\operatorname{Tr}_2(\sum_{i=1}^n \sum_{j=1}^n \sqrt{p_ip_j} |u_i\otimes u_i\rangle \langle u_j\otimes u_j|)\\ &=\sum_{i=1}^n \sum_{j=1}^n \sqrt{p_ip_j} \operatorname{Tr}_2(|u_i\otimes u_i\rangle \langle u_j\otimes u_j|)\\ &=\sum_{i=1}^n \sum_{j=1}^n \sqrt{p_ip_j} \langle u_i|u_j\rangle |u_i\rangle\langle u_i|\\ &=\sum_{i=1}^n \sum_{j=1}^n \sqrt{p_ip_j} \delta_{ij} |u_i\rangle\langle u_i|\\ &=\sum_{i=1}^n p_i |u_i\rangle\langle u_i|\\ &=\rho \end{aligned}

is a pure state.

Drawing the connection between the space S2n+1S^{2n+1}, CPn\mathbb{C}P^n, and R\mathbb{R}

A pure quantum state of size NN can be identified with a Hopf circle on the sphere S2N1S^{2N-1}.

A random pure state ψ|\psi\rangle of a bipartite N×KN\times K system such that KN3K\geq N\geq 3.

The partial trace of such system produces a mixed state ρ(ψ)=TrK(ψψ)\rho(\psi)=\operatorname{Tr}_K(|\psi\rangle\langle \psi|), with induced measure μK\mu_K. When K=NK=N, the induced measure μK\mu_K is the Hilbert-Schmidt measure.

Consider the function f:S2N1Rf:S^{2N-1}\to \mathbb{R} defined by f(x)=S(ρ(ψ))f(x)=S(\rho(\psi)), where S()S(\cdot) is the von Neumann entropy. The Lipschitz constant of ff is lnN\sim \ln N.

Last updated on