Let S,T be two finite sets, called the source and target alphabets. A CodeC:S→T∗ is a total function mapping, and an extension of C is a homomorphism: ext(C):S∗→T∗, which maps each sequence of source symbols to a sequence of target symbols.
Non-singular codes:指映射非单射
A code is non-singular if each source symbol is mapped to a different non-empty bit string, i.e. the mapping from source symbols to bit strings is injective(f(x1)=f(x2)⟹x1=x2).
可唯一解码的编码C(Uniquely decodable codes):
对于有限长的编码T′∈T∗,都存在唯一的解码方式S′∈S使得S′=ext(C)。
A code C is uniquely decodable if its extensionext(C) is non singular. Whether a given code is uniquely decodable can be decided with the Sardinas-Patterson algorithm.
![[294024f145e41457d784606e9b5742a6d7ca7ffe.svg]]:Singular ![[e7077bf9dbc7c975f4d5f510567a34ca5ff0e1d7.svg]]:Non-singular, its extension will generate a lossless coding ,which will be useful fo general data transmission. Note: it is not necessary for the non-singular code to be more compact than the source.
Consider M2 again. It's not uniquely decodable, since the string 011101110011 can be interpreted as cdb or babe. However, such a code is useful when the set of all possiblesource symbols is completely known and finite, or when there are restrictions (for example a formal syntax) that determine if source elements of this extension are acceptable. Such restrictions permit the decoding of the original message by checking which of the possible source symbols mapped the same are valid under the restrictions.
前缀码
C:S→T∗=(c1,c2,…cn):指的是任意i=j,有ci=prefix(cj)性质的编码 设可唯一解码的编码C构成集合A,前缀码/即时码的集合是B。 我们来研究具有最小平均编码长度(min average code length)的集合。我们指出,对任意c∈A−B具有最短编码长度,总能找到对应的c′∈B,使得二者编码长度相等。
Leave a Comment