## Learning is Compression

### A Mathematical Theory of Communication

In 1948, Claude E. Shannon, then working at Bell Labs, published his paper “A Mathematical Theory of Communication”. Shannon was interested in modeling the English language. In the paper, he treated English as having a 27-symbol alphabet: 26 letters and a space. He modeled the language using stochastic processes. The simplest such process samples each symbol independently and with equal probability....
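
The equiprobable, independent process described above can be sketched in a few lines of Python. This is only an illustrative sketch, not Shannon's own procedure: the names `ALPHABET`, `sample_zero_order`, and `bits_per_symbol` are made up for this example, and the choice of uppercase letters and a fixed random seed is an assumption made for readability and reproducibility.

```python
import math
import random
import string

# Assumed encoding of the 27-symbol alphabet: 26 uppercase letters plus a space.
ALPHABET = string.ascii_uppercase + " "


def sample_zero_order(n, rng):
    """Draw n symbols, each chosen independently and with equal probability."""
    return "".join(rng.choice(ALPHABET) for _ in range(n))


def bits_per_symbol(alphabet_size):
    """Information per symbol of a uniform source: log2 of the alphabet size."""
    return math.log2(alphabet_size)


rng = random.Random(0)  # fixed seed so the sketch is reproducible
print(sample_zero_order(40, rng))
print(round(bits_per_symbol(len(ALPHABET)), 3))  # 4.755
```

The sampled text looks nothing like English, which is the point of starting here: a uniform source needs about 4.755 bits per symbol, and any model that captures real structure in the language should need fewer.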