CSE5313 Coding and information theory for data science (Lecture 13)
Recap from last lecture
Local recoverable codes:
An code is called -locally recoverable if
- every codeword symbol has a recovering set (),
- such that is computable from for all .
- for every .
Bounds for locally recoverable codes
Bound 1: .
Bound 2: . ( gives singleton bound)
Turan’s Lemma
Let be a graph with vertices. Then there exists an induced directed acyclic subgraph (DAG) of on at least nodes, where is the out-degree of vertex .
Proof using probabilistic method.
Proof via the probabilistic method
Useful for showing the existence of a large acyclic subgraph, but not for finding it.
Show that , and therefore there exists with , using pigeonhole principle.
For a permutation of , define .
Let if each of the outgoing edges from connect to a node with .
In other words, we select a subset of nodes such that each node in has an outgoing edge to a node in with a larger index. All edges going to right.
This graph is clearly acyclic.
Choose at random and Let be a random variable.
Let be the indicator random variable for .
So .
Using linearity of expectation, we have
is the probability that places before any of its out-neighbors.
For each node, there are ways to place the node and its out-neighbors.
For each node, there are ways to place the out-neighbors.
So, .
Since , we have
Recall Arithmetic mean () is greater than or equal to harmonic mean ().
Locally recoverable codes
Bound 1
Proof via Turan's Lemma
Let be a directed graph such that is an edge if . ( required to repair )
for every , so .
By Turan’s Lemma, there exists an induced directed acyclic subgraph (DAG) on with nodes such that .
Since is a DAG, it is acyclic. So there exists a vertex with no outgoing edges inside . (leaf node in )
Note that if we remove from , the remaining graph is still a DAG.
Let be the induced graph on .
We can find a vertex with no outgoing edges inside .
We can continue this process until we have all the vertices in removed.
So all symbols in are redundant.
Note the number of redundant symbols .
So .
So .
Bound 2
Definition for code reduction
For code, let be a set of indices, e.g., .
Let be the code obtained by deleting the symbols that is not in . (only keep the symbols with indices in )
Then is a code of length , is generated by (only keep the rows of with indices in ).
.
Note not necessarily of rank .
Reduction lemma
Let be an code. Then .
Proof for reduction lemma
We proceed from two inequalities.
First .
Let with .
Then there exists such that .
Let and .
Then have at least entries in common.
So .
So for every such that .
So .
Now we show the other inequality.
.
Let such that .
Let be the set on which and identical.
So .
since and have at least entries in common.
So . (by existence of )
So .
We prove bound 2 using the same subgraph from proof of bound 1.
Proof for bound 2
Let be a directed graph such that is an edge if . ( required to repair )
for every , so .
By Turan’s Lemma, there exists an induced directed acyclic subgraph (DAG) on with nodes such that .
Let with size .
exists since by bound 1, and .
So is also acyclic.
Let be the set of neighbors of in .
.
Complete to be of the size by adding elements not in .
Also
Note that all nodes in can be recovered from nodes in .
So .
Therefore, .
Using reduction lemma, we have .
Construction of locally recoverable codes
Recall Reed-Solomon code:
.
- Need .
- To encode , we need to evaluate .
Assume and .
Choose
Partition into subsets of size .
Definition for good polynomial
A polynomial over is called good if
- is constant for every ., i.e., .
To encode , we consider
Let
, where is a good polynomial.
Proof for valid codeword
If , then .
- Maximum degree monomial in :
.
- .
By Bound 1: .
If . then
- and agree on more points than their degree.
- , a contradiction.
Corollary: .
In other words,
For data , the th server stores .
Note is a linear code. and .
Let .
Suppose .
Locality is ensured by recovering failed server by from .
Note is constant on .
Denote the constant by .
Let .
for all .
Constructing good polynomial
Let be the multiplicative group of .
For a subgroup of this group, a multiplicative coset is .
The family of all cosets of .
This is a partition of .
Definition of annihilator
The annihilator of is .
Note that for all .
Annihilator is a good polynomial
Fix , consider the partition and let . Then is a good polynomial.
Proof
Need to show if , then .
Since , there exists such that and .
So .
By Fermat’s Little Theorem, for every .
So for every .
So .
Similarly, .