Skip to Content
CSE5313CSE5313 Coding and information theory for data science (Exam Review)

CSE5313 Computer Vision (Lecture 16: Exam Review)

Exam Review

Information flow graph

Parameters:

  • nn is the number of nodes in the initial system (before any node leaves/crashes).
  • kk is the number of nodes required to reconstruct the file kk.
  • dd is the number of nodes required to repair a failed node.
  • α\alpha is the storage at each node.
  • β\beta is the edge capacity for repair.
  • BB is the file size.

Graph construction

Source: System admin.

Sink: Data collector.

Nodes: Storage servers.

Edges: Represents transmission of information. (Number of Fq\mathbb{F}_q elements is weight.)

Main observation:

  • kk elements (number of servers required to reconstruct the file) The message size is BB. from Fq\mathbb{F}_q must “flow” from the source (system admin) to the sink (data collector).
  • Any cut (U,U)(U,\overline{U}) which separates source from sink must have capacity at least kk.

Bounds for local recoverable codes

Turan’s Lemma

Let GG be a graph with nn vertices. Then there exists an induced directed acyclic subgraph (DAG) of GG on at least n1+avgi(diout)\frac{n}{1+\operatorname{avg}_i(d^{out}_i)} nodes, where dioutd^{out}_i is the out-degree of vertex ii.

Bound 2

Consider the induced acyclic graph GUG_U on UU nodes.

By the definition of rr-locally recoverable code, each leaf node in GUG_U must be determined by other nodes in GGUG\setminus G_U, so we can safely remove all leaf nodes in GUG_U and the remaining graph is still a DAG.

Let N[n]UN\subseteq [n]\setminus U be the set of neighbors of UU in GG.

NrUk1|N|\leq r|U|\leq k-1.

Complete nn to be of the size k1k-1 by adding elements not in UU.

CNqk1|C_N|\leq q^{k-1}

Also NU=k1+k1r|N\cup U'|=k-1+\lfloor\frac{k-1}{r}\rfloor

All nodes in GUG_U can be recovered from nodes in NN.

So CNU=CNqk1|C_{N\cup U'}|=|C_N|\leq q^{k-1}.

Therefore, max{I:CI<qk,I[n]}NU=k1+k1r\max\{|I|:C_I<q^k,I\subseteq [n]\}\geq |N\cup U'|=k-1+\lfloor\frac{k-1}{r}\rfloor.

Using reduction lemma, we have d=nmax{I:CI<qk,I[n]}nk1k1r+2=nkkr+2d= n-\max\{|I|:C_I<q^k,I\subseteq [n]\}\leq n-k-1-\lfloor\frac{k-1}{r}\rfloor+2=n-k-\lceil\frac{k}{r}\rceil +2.

Reed-Solomon code

Last updated on