Math 401 Paper 1, Side note 3: Levy’s concentration theorem
Our goal is to prove the generalized version of Levy’s concentration theorem used in Hayden’s work for $\eta$-Lipschitz functions.
Let $f:S^n\to\mathbb{R}$ be an $\eta$-Lipschitz function. Let $M_f$ denote the median of $f$ and $\langle f\rangle$ denote the mean of $f$. (Note this can be generalized to many other manifolds.)
Select a random point $x\in S^n$ according to the uniform measure (Haar measure). Then the probability of observing a value $f(x)$ that differs from the reference value ($M_f$ or $\langle f\rangle$) by more than $\epsilon$ is exponentially small: it is at most $C\exp(-c\,n\,\epsilon^2/\eta^2)$ for absolute constants $C,c>0$.
This version of Levy’s concentration theorem can be found in Geometry of Quantum States, equations 15.84 and 15.85.
Basic definitions
Lipschitz function
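$\eta$-Lipschitz function
Let $(X,d_X)$ and $(Y,d_Y)$ be two metric spaces. A function $f:X\to Y$ is said to be $\eta$-Lipschitz, for a constant $\eta\geq 0$, if
$$d_Y(f(x),f(y))\leq \eta\, d_X(x,y)$$
for all $x,y\in X$. The smallest such $\eta$ is the Lipschitz norm of $f$, written $\|f\|_{\operatorname{Lip}}$.
In other words, $f$ cannot increase the distance between any two points of $X$ by more than a factor of $\eta$.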
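The definition can be probed numerically. The sketch below is only an illustration: the helper names `random_unit_vector` and `empirical_lipschitz_ratio` are made up for this note, the metric is assumed to be the Euclidean one, and random sampling gives a lower estimate of the Lipschitz constant, not a proof.

```python
import math
import random

def random_unit_vector(dim, rng):
    """Uniform point on the unit sphere in R^dim (a normalized Gaussian vector)."""
    v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]

def empirical_lipschitz_ratio(f, dim, trials=2000, seed=0):
    """Largest observed |f(x) - f(y)| / ||x - y||_2 over random pairs on the sphere."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        x = random_unit_vector(dim, rng)
        y = random_unit_vector(dim, rng)
        dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))
        if dist > 0.0:
            worst = max(worst, abs(f(x) - f(y)) / dist)
    return worst

# f(x) = 3 * x_1 is 3-Lipschitz for the Euclidean metric,
# so the observed ratio should never exceed 3 (up to rounding).
ratio = empirical_lipschitz_ratio(lambda x: 3.0 * x[0], dim=5)
print(ratio)
```

The observed ratio only bounds $\|f\|_{\operatorname{Lip}}$ from below, since the worst pair of points may never be sampled.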
Levy’s concentration theorem in High-dimensional probability by Roman Vershynin
Levy’s concentration theorem (Vershynin’s version)
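Let $X$ be a random point distributed uniformly on $\sqrt{n}S^n$ and let $f:\sqrt{n}S^n\to\mathbb{R}$ be a Lipschitz function. Then
$$\|f(X)-\mathbb{E}f(X)\|_{\psi_2}\leq C\|f\|_{\operatorname{Lip}}$$
for an absolute constant $C$, where $\|\cdot\|_{\psi_2}$ is the sub-Gaussian norm. Equivalently, $\Pr[|f(X)-\mathbb{E}f(X)|\geq t]\leq 2\exp(-ct^2/\|f\|_{\operatorname{Lip}}^2)$ for every $t\geq 0$.
This is Theorem 5.1.4 in High-Dimensional Probability by Roman Vershynin, where the sphere is indexed as $\sqrt{n}S^{n-1}\subset\mathbb{R}^n$.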
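Isoperimetric inequality on $\mathbb{R}^n$
Among all subsets $A\subset\mathbb{R}^n$ with a given volume, Euclidean balls have the minimal surface area.
More generally, for any $\epsilon>0$, Euclidean balls minimize the volume of the $\epsilon$-neighborhood of $A$,
where the $\epsilon$-neighborhood of $A$ is defined as
$$A_\epsilon:=\{x\in\mathbb{R}^n:\exists\,y\in A\text{ such that }\|x-y\|_2\leq\epsilon\}.$$
Here $\|\cdot\|_2$ is the Euclidean norm. (The spherical version below holds for both the geodesic metric on the sphere and the Euclidean metric inherited from the ambient space.)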
Isoperimetric inequality on the sphere
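Let $\sigma_n(A)$ denote the normalized area of $A$ on the $n$-dimensional sphere $S^n$. That is, $\sigma_n(A)=\operatorname{Area}(A)/\operatorname{Area}(S^n)$.
Let $\epsilon>0$. Then among all subsets $A\subset S^n$ with a given area $\sigma_n(A)$, the spherical caps minimize the area $\sigma_n(A_\epsilon)$ of the $\epsilon$-neighborhood of $A$.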
The above two inequalities are not proved in the book High-Dimensional Probability, but a proof can be found in Appendix C of Gromov’s book Metric Structures for Riemannian and Non-Riemannian Spaces.
To continue the proof of the theorem, we use sub-Gaussian concentration (Chapter 3 of High-Dimensional Probability by Roman Vershynin) on the sphere $\sqrt{n}S^n$.
This leads to a constant $c>0$ such that the following lemma holds:
The “Blow-up” lemma
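Let $A$ be a subset of the sphere $\sqrt{n}S^n$, and let $\sigma$ denote the normalized area on that sphere. If $\sigma(A)\geq\frac{1}{2}$, then for every $t\geq 0$,
$$\sigma(A_t)\geq 1-2\exp(-ct^2)$$
where $A_t=\{x\in\sqrt{n}S^n:\exists\,y\in A,\ \|x-y\|_2\leq t\}$ and $c$ is some positive constant.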
Proof of the Levy’s concentration theorem
Proof:
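Without loss of generality, we can assume that $\|f\|_{\operatorname{Lip}}=1$. Let $X$ be uniform on $\sqrt{n}S^n$ and let $M$ denote the median of $f(X)$.
So $\Pr[f(X)\leq M]\geq\frac{1}{2}$, and $\Pr[f(X)\geq M]\geq\frac{1}{2}$.
Consider the sub-level set $A:=\{x\in\sqrt{n}S^n:f(x)\leq M\}$.
Since $\Pr[X\in A]\geq\frac{1}{2}$, by the blow-up lemma, we have
$$\Pr[X\in A_t]\geq 1-2\exp(-ct^2).$$
And since $f$ is 1-Lipschitz, every $x\in A_t$ has some $y\in A$ with $f(x)\leq f(y)+\|x-y\|_2\leq M+t$, so
$$\Pr[X\in A_t]\leq\Pr[f(X)\leq M+t].$$
Combining the above two inequalities, we have
$$\Pr[f(X)\leq M+t]\geq 1-2\exp(-ct^2).$$
Applying the same argument to $-f$ controls the lower tail, so $\Pr[|f(X)-M|>t]\leq 4\exp(-ct^2)$.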
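As a sanity check of this kind of dimension-free tail bound, one can sample the simplest 1-Lipschitz function, the first coordinate $f(x)=x_1$, on $\sqrt{n}S^n$. This is a hedged Monte Carlo sketch, not part of the proof: the function names are invented for this note, the median of $f$ is $0$ by symmetry, and the sample sizes are arbitrary.

```python
import math
import random

def sample_scaled_sphere(n, rng):
    """Uniform point on sqrt(n) * S^n, viewed inside R^(n+1)."""
    g = [rng.gauss(0.0, 1.0) for _ in range(n + 1)]
    norm = math.sqrt(sum(c * c for c in g))
    scale = math.sqrt(n) / norm
    return [c * scale for c in g]

def tail_frequency(n, t, samples=3000, seed=1):
    """Empirical frequency of |f(X) - M| >= t for f(x) = x_1, whose median M is 0."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(samples):
        if abs(sample_scaled_sphere(n, rng)[0]) >= t:
            hits += 1
    return hits / samples

# The tail at a fixed t should stay small as the dimension grows.
for n in (10, 50, 200):
    print(n, tail_frequency(n, t=2.0))
```

The printed frequencies should stay of the same small order for every $n$, which is the qualitative content of the bound $4\exp(-ct^2)$.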
Levy’s concentration theorem in Metric Structures for Riemannian and Non-Riemannian Spaces by M. Gromov
Levy’s concentration theorem (Gromov’s version)
Levy’s lemma can also be found in Metric Structures for Riemannian and Non-Riemannian Spaces by M. Gromov, in the discussion of the Levy concentration theory.
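Theorem (Levy concentration theorem):
An arbitrary 1-Lipschitz function $f:S^n\to\mathbb{R}$ concentrates near a single value $a_0\in\mathbb{R}$ as strongly as the distance function does.
That is,
$$\mu\{x\in S^n:|f(x)-a_0|\geq\epsilon\}<\kappa_n(\epsilon)\leq 2\exp\left(-\frac{(n-1)\epsilon^2}{2}\right)$$
where $\mu$ is the normalized measure on $S^n$ and
$$\kappa_n(\epsilon)=\frac{\int_\epsilon^{\pi/2}\cos^{n-1}(t)\,dt}{\int_0^{\pi/2}\cos^{n-1}(t)\,dt}.$$
$a_0$ is the Levy mean of the function $f$; that is, the level set $f^{-1}(a_0)$ divides the sphere into equal halves, characterized by the following inequalities:
$$\mu(f^{-1}(-\infty,a_0])\geq\frac{1}{2}\quad\text{and}\quad\mu(f^{-1}[a_0,\infty))\geq\frac{1}{2}.$$
A direct computation yields the bound, but M. Gromov does not give a detailed explanation here.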
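The computation can at least be checked numerically. The sketch below assumes that $\kappa_n(\epsilon)$ is the ratio $\int_\epsilon^{\pi/2}\cos^{n-1}(t)\,dt\,/\int_0^{\pi/2}\cos^{n-1}(t)\,dt$ and compares it with $2\exp(-(n-1)\epsilon^2/2)$ at a few sample points. The midpoint rule and the function names are choices made for this illustration only.

```python
import math

def cos_power_integral(n, lower, upper=math.pi / 2, steps=20000):
    """Midpoint-rule approximation of the integral of cos(t)^(n-1) over [lower, upper]."""
    if lower >= upper:
        return 0.0
    h = (upper - lower) / steps
    return h * sum(math.cos(lower + (i + 0.5) * h) ** (n - 1) for i in range(steps))

def kappa(n, eps):
    """Ratio of cosine-power integrals (assumed form of kappa_n)."""
    return cos_power_integral(n, eps) / cos_power_integral(n, 0.0)

def gaussian_bound(n, eps):
    """The claimed upper bound 2 * exp(-(n - 1) * eps^2 / 2)."""
    return 2.0 * math.exp(-(n - 1) * eps ** 2 / 2.0)

for n in (3, 10, 100):
    for eps in (0.1, 0.5, 1.0):
        print(n, eps, kappa(n, eps), gaussian_bound(n, eps))
```

A numerical table of this kind does not replace the analytic estimate, but it makes the rate of decay in $n$ visible.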
Detailed proof by Takashi Shioya.
The central idea is to draw the connection between the given three topological spaces, , and .
First, we need to introduce the following distribution and lemmas/theorems:
OBSERVATION
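Consider the orthogonal projection from $\mathbb{R}^{n+1}$, the space where $S^n(\sqrt{n})$ is embedded, to $\mathbb{R}^k$. We denote the restriction of the projection to the sphere as $\pi_{n,k}:S^n(\sqrt{n})\to\mathbb{R}^k$. Note that $\pi_{n,k}$ is a 1-Lipschitz function (a projection never increases the distance between two points).
We denote the normalized Riemannian volume measure on $S^n(\sqrt{n})$ as $\sigma^n(\cdot)$, so that $\sigma^n(S^n(\sqrt{n}))=1$.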
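Definition of Gaussian measure on $\mathbb{R}^k$
We denote the Gaussian measure on $\mathbb{R}^k$ as $\gamma^k$:
$$d\gamma^k(x):=\frac{1}{(2\pi)^{k/2}}\exp\left(-\frac{1}{2}\|x\|^2\right)dx,$$
where $x\in\mathbb{R}^k$, $\|x\|$ is the Euclidean norm, and $dx$ is the Lebesgue measure on $\mathbb{R}^k$.
Basically, you can consider the Gaussian measure as the standard normal distribution on $\mathbb{R}^k$: a probability measure with mean zero and standard deviation $1$ in every coordinate.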
Maxwell-Boltzmann distribution law
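A wonderful fact, at least for me, is that the projection of the $n$-dimensional sphere of radius $\sqrt{n}$ to $\mathbb{R}^k$ converges to a Gaussian distribution as $n\to\infty$.
For any natural number $k$,
$$\frac{d(\pi_{n,k})_*\sigma^n}{dx}(x)\to\frac{d\gamma^k}{dx}(x)\quad\text{as }n\to\infty,$$
where $(\pi_{n,k})_*\sigma^n$ is the push-forward measure of $\sigma^n$ by $\pi_{n,k}$.
In other words, $(\pi_{n,k})_*\sigma^n\to\gamma^k$ weakly as $n\to\infty$.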
Proof
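We denote the $m$-dimensional volume measure as $\operatorname{vol}_m$.
Observe that the fibre $\pi_{n,k}^{-1}(x)$ is isometric to $S^{n-k}(\sqrt{n-\|x\|^2})$; that is, for any $x\in\mathbb{R}^k$ with $\|x\|<\sqrt{n}$, $\pi_{n,k}^{-1}(x)$ is a sphere of dimension $n-k$ with radius $\sqrt{n-\|x\|^2}$ (by the definition of $\pi_{n,k}$). Hence $\operatorname{vol}_{n-k}\pi_{n,k}^{-1}(x)$ is proportional to $(n-\|x\|^2)^{\frac{n-k}{2}}$, and the change of variables from the sphere to $\mathbb{R}^k$ contributes a further factor proportional to $(n-\|x\|^2)^{-\frac{1}{2}}$.
So,
$$\frac{d(\pi_{n,k})_*\sigma^n}{dx}(x)=\frac{(n-\|x\|^2)^{\frac{n-k-1}{2}}}{\int_{\|y\|<\sqrt{n}}(n-\|y\|^2)^{\frac{n-k-1}{2}}\,dy}\to\frac{e^{-\|x\|^2/2}}{\int_{\mathbb{R}^k}e^{-\|y\|^2/2}\,dy}=\frac{d\gamma^k}{dx}(x)$$
as $n\to\infty$.
To see the limit, note that $\left(1-\frac{a}{n}\right)^n\to e^{-a}$ as $n\to\infty$ for any $a\geq 0$.
So, after dividing the numerator and the denominator by $n^{\frac{n-k-1}{2}}$,
$$\left(1-\frac{\|x\|^2}{n}\right)^{\frac{n-k-1}{2}}\to e^{-\|x\|^2/2},$$
and the denominator converges to $\int_{\mathbb{R}^k}e^{-\|y\|^2/2}\,dy=(2\pi)^{k/2}$ by dominated convergence.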
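The law is easy to observe in simulation for $k=1$. The sketch below samples uniform points on the sphere of radius $\sqrt{n}$ in $\mathbb{R}^{n+1}$ by normalizing Gaussian vectors, keeps the first coordinate, and measures how far its empirical distribution is from the standard normal one. The helper names, sample size, and comparison grid are assumptions made for this illustration.

```python
import math
import random

def first_coordinate_samples(n, samples, seed=2):
    """First coordinate of uniform points on the radius-sqrt(n) sphere in R^(n+1)."""
    rng = random.Random(seed)
    out = []
    for _ in range(samples):
        g = [rng.gauss(0.0, 1.0) for _ in range(n + 1)]
        norm = math.sqrt(sum(c * c for c in g))
        out.append(math.sqrt(n) * g[0] / norm)
    return out

def standard_normal_cdf(t):
    """Distribution function of the one-dimensional Gaussian measure."""
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))

def max_cdf_gap(xs, grid):
    """Largest gap between the empirical CDF of xs and the Gaussian CDF on a grid."""
    m = len(xs)
    return max(abs(sum(1 for x in xs if x <= t) / m - standard_normal_cdf(t))
               for t in grid)

grid = [-2.0 + 0.25 * i for i in range(17)]
for n in (2, 10, 200):
    xs = first_coordinate_samples(n, samples=3000)
    print(n, max_cdf_gap(xs, grid))
```

For small $n$ the projected measure is visibly non-Gaussian (for $n=2$ it is uniform on $[-\sqrt{2},\sqrt{2}]$), while for large $n$ the gap is dominated by sampling noise.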
Proof of the Levy’s concentration theorem via the Maxwell-Boltzmann distribution law
We use the Maxwell-Boltzmann distribution law and Levy’s isoperimetric inequality to prove Levy’s concentration theorem.
The goal is the same as in Gromov’s version. First, we bound the measure of the set where $f$ deviates from its Levy mean by a function $\kappa_n(\epsilon)$, using Levy’s isoperimetric inequality. Then we claim that $\kappa_n(\epsilon)$ is bounded by a Gaussian tail.
Note: this section is not yet mathematically rigorous. Sections on Levy families and observable diameter should be added to make the proof more rigorous and understandable.
Proof
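Let $f:S^n\to\mathbb{R}$ be a 1-Lipschitz function, let $\mu$ be the normalized measure on $S^n$, and let $a_0$ be the Levy mean of $f$.
Consider the two sets of points on the sphere $S^n$ (the rescaling to radius $\sqrt{n}$ enters only in the last step):
$$\Omega_-=\{x\in S^n:f(x)\leq a_0\},\qquad\Omega_+=\{x\in S^n:f(x)\geq a_0\}.$$
Note that $\Omega_-\cup\Omega_+$ is the whole sphere $S^n$, and each set has measure at least $\frac{1}{2}$ by the definition of $a_0$.
By Levy’s isoperimetric inequality, the open $\epsilon$-neighborhoods $U_\epsilon(\Omega_\pm)$ in the geodesic distance are at least as large as the $\epsilon$-neighborhood of a hemisphere $H$, so we have
$$\mu(U_\epsilon(\Omega_\pm))\geq\mu(U_\epsilon(H)).$$
Since $f$ is 1-Lipschitz, $f<a_0+\epsilon$ on $U_\epsilon(\Omega_-)$ and $f>a_0-\epsilon$ on $U_\epsilon(\Omega_+)$.
We define $\kappa_n(\epsilon)$, for $0\leq\epsilon\leq\frac{\pi}{2}$, as the following:
$$\kappa_n(\epsilon)=2\left(1-\mu(U_\epsilon(H))\right)=\mu\{x\in S^n:|x_1|\geq\sin\epsilon\},$$
the measure of the points at geodesic distance at least $\epsilon$ from the equator $\{x_1=0\}$.
By Levy’s isoperimetric inequality and the Maxwell-Boltzmann distribution law, we have
$$\mu\{x\in S^n:|f(x)-a_0|\geq\epsilon\}\leq\kappa_n(\epsilon)=(\pi_{n,1})_*\sigma^n\{t\in\mathbb{R}:|t|\geq\sqrt{n}\sin\epsilon\}\approx\gamma^1\{t\in\mathbb{R}:|t|\geq\sqrt{n}\sin\epsilon\}\leq 2\exp\left(-\frac{n\sin^2\epsilon}{2}\right),$$
where the equality rescales $S^n$ to $S^n(\sqrt{n})$ and the approximation is the Maxwell-Boltzmann distribution law for large $n$. Since $\sin\epsilon\geq\frac{2\epsilon}{\pi}$ on $[0,\frac{\pi}{2}]$, this is a Gaussian-type bound of the form $2\exp(-c\,n\,\epsilon^2)$.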
Levy’s Isoperimetric inequality
This section is from the Appendix of Gromov’s book Metric Structures for Riemannian and Non-Riemannian Spaces.
Not very digestible for undergraduates.
Crash course on Riemannian manifolds
This part might be extended into a separate note. Let’s see how far we can go from here.
Riemannian manifolds
A Riemannian manifold is a smooth manifold equipped with a Riemannian metric, which is a smooth assignment of an inner product to each tangent space of the manifold.
An example of a Riemannian manifold is the sphere $S^n$ with the metric induced from $\mathbb{R}^{n+1}$.
Riemannian metric
A Riemannian metric is a smooth assignment of an inner product to each tangent space of the manifold.
An example of a Riemannian metric is the Euclidean metric on $\mathbb{R}^n$, which uses the standard inner product on every tangent space.
Notion of Connection
A connection is a way to define the directional derivative of a vector field along a curve on a Riemannian manifold.
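For every $p\in M$, where $M$ denotes the manifold, suppose $M=\mathbb{R}^n$, and let $X$ be a vector field on $M$. The directional derivative of $X$ at the point $p$ in the direction $v\in T_pM$ is defined as
$$D_vX(p)=\lim_{h\to 0}\frac{X(p+hv)-X(p)}{h}.$$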
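In the flat case $M=\mathbb{R}^n$, the directional derivative is the limit of $(X(p+hv)-X(p))/h$ as $h\to 0$, and it can be approximated by a finite difference. The sketch below is a minimal illustration under that assumption. It uses a central difference, which has the same limit, and the names `directional_derivative` and `rotation_field` are invented for this note.

```python
def directional_derivative(X, p, v, h=1e-6):
    """Central-difference approximation of D_v X(p) for a vector field X on R^n."""
    forward = X([pi + h * vi for pi, vi in zip(p, v)])
    backward = X([pi - h * vi for pi, vi in zip(p, v)])
    return [(a - b) / (2.0 * h) for a, b in zip(forward, backward)]

def rotation_field(x):
    """The vector field X(x, y) = (-y, x) on R^2."""
    return [-x[1], x[0]]

# For this linear field the exact answer is D_v X = (-v_2, v_1) at every point.
result = directional_derivative(rotation_field, p=[1.0, 2.0], v=[3.0, 4.0])
print(result)
```

On a curved manifold the expression $p+hv$ no longer makes sense, which is exactly the gap a connection is meant to fill.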