Created: 2026-03-06 07:53:04
Updated: 2026-03-06 07:53:04

估计CmkC_{m}^k的值:Chernoff bound

对独立同分布的随机变量X0,X1,X2,XnX_{0},X_{1},X_{2},\dots X_{n} Bernoulli distribution: E(X)=p,0x1E(X) = p,0\leq x\leq 1

P(1mi=1mXipϵ)emDB(p+ϵp)e2mϵ2P\left( \frac{1}{m} \sum_{i=1}^m X_{i} - p \geq \epsilon \right) \leq e^{-mD_{B}(p+\epsilon\mid\mid p)}\leq e^{-2m\epsilon^2}

其中

DB(p+ϵp)=(p+ϵ)lnp+ϵp+(1pϵ)ln1pϵ1pD_{B}(p+\epsilon\mid\mid p) = (p+\epsilon) \ln \frac{p+\epsilon}{p} + (1-p-\epsilon) \ln \frac{1-p-\epsilon}{1-p}

对于长为n的编码,为了使d(ci,cj)m4d(c_{i},c_{j})\geq \frac{m}{4},需要m维的空间(长为m编码)得到最小的m=f(n)m=f(n)。可以证明m=O(n)m=O(n).

Basic Definition

A code CC of a block length nn, message length kk and distance dd over an alphabet Σ\Sigma is called a (n,k,d)Σ(n,k,d)|_{\Sigma} code.

Rate of a code CΣnC \subseteq \Sigma^n is defined as R=logΣCn=knR= \frac{{\log_{\mid \Sigma\mid} |C|}}{n}=\frac{k}{n}。It shows the speed of information transmission.

Distance between two linear codes is the number of positions at which the corresponding symbols are different.

Hamming ball in n\sum^n of radius ee about xnx \in \sum^n is defined as

BΣn(x,e)={yΣn:δ(x,y)e}B_{\Sigma^n} (x,e) = \left\{y\in \Sigma^n :\delta(x,y)\leq e\right\}

As in the code space, the volumn of Hamming ball is denoted as

Volq(e,n)=1+Cn1(q1)+Cn2(q1)2++Cne(q1)eVol_{q}(e,n) = 1+C_{n}^1 (q-1)+C_{n}^2 (q-1)^2+\dots+C_{n}^e(q-1)^e

Our main purpose is to find out the erasures or errors during codes' transmission. Generally, a code with distance dd can :

  1. correct d1\leq d-1 erasures
  2. detect d1\leq d-1 errors
  3. correct d12\lfloor \frac{d-1}{2}\rfloor errors

Hamming Bound, GV Bound

They all show the trade-off between rate and distance. Briefly, Hamming Bound is the upper bound on the rate of a code (an impossibility result), and GV bound shows the lower bound of the rate (a possibility result).

1log(Volq(d1,n))noptimal(kn)1logq(Volq(d12,n))n1 - \frac{\log(Vol_{q}(d-1,n))}{n} \leq \text{optimal}\left( \frac{k}{n} \right)\leq 1-\frac{\log_{q}\left( Vol_{q}\left( \lfloor \frac{d-1}{2}\rfloor,n \right) \right)}{n}

Gilvert-Vashamov Bound

使用长m的coding编码长n的符号的纠错码Error Correcting Codes (ECC):{0,1}n{0,1}m for  0<δ<12\{0,1\}^n\to\{0,1\}^m\ for\ \forall\ 0<\delta<\frac{1}{2}
如果m2n1H(δ)m\geq \frac{2n}{1-H(\delta)}那么存在codewords CC使得码元之间距离总不小于δm\delta m,即dH(ci,cj)δmd_{H}(c_{i},c_{j})\geq\delta m for  ij\forall\ i\neq j。其中H(δ)=(δlnδ+(1δ)ln(1δ))H(\delta)=-(\delta \ln\delta + (1-\delta)\ln(1-\delta))

Proof:
IDEA: Probablisitc Method, if one randomly chooses objects from a specified class, the probablity that the result is of the prescribed kind is strictly greater than zero.

Let C be a random subspace of FqF^q of dimension kk , and GG is a random generator matrix of CC (generator matrix is not unique). For any fixed x0,Gxx\neq 0,G\cdot x is uniformly random in Fq{0}F^q-\{0\}
Then,
uniformly and independently draw c1,c2nc_{1},\dots c_{2^n} For fixed i,j,iji,j,i\neq j,

P(dH(ci,cj)δm)P(d_{H}(c_{i},c_{j})\leq\delta m) \leq \dots

那么

P(ij,dH(ci,cj)δm)22m×P(dH(ci,cj)δm)<1P(\exists i\neq j,d_{H}(c_{i},c_{j})\leq\delta m) \leq 2^{2m}\times P(d_{H}(c_{i},c_{j})\leq\delta m)<1

得证

Encoding & decoding is computationally efficient.

Hamming Codes

以3bit为例,构造(3,7)(3,7)矩阵H,使得其从左到右每一列均为从1到7的二进制编码

H=[000111101100111010101]H=\left[\begin{matrix} 0 & 0 & 0 & 1 & 1 & 1 & 1 \\ 0 & 1 & 1 & 0 & 0 & 1 & 1 \\ 1 & 0 & 1 & 0 & 1 & 0 & 1 \end{matrix}\right]

考虑H的Kernel space,dim(Ker(H))=dim(H)rank(H)=4dim(Ker(H))=dim(H)-rank(H)=4也就是空间中有4个独立向量。
从中任意取出两个不同的x1,x2x_{1},x_{2}必然有dH(x1,x2)3d_{H}(x_{1},x_{2})\geq 3
可以得知x1x20x_{1} \oplus x_{2}\neq 0,而且H(x1x2)=0H(x_{1}\oplus x_{2})=0.首先dd不可能为1,否则需要H有一列全0;d也不可能为2,否则需要H有两列全相同;事实上d有可能为3.

Leave a Comment

captcha
Fontsize