核心概念
An abstract information processing model based on minimal yet realistic assumptions inspired by biological systems can achieve the early visual system's two ultimate objectives: efficient information transmission and accurate sensor probability distribution modeling.
摘要
The paper presents an abstract discrete feedforward information processing unit (IPU) model inspired by biological systems. The model aims to achieve two key objectives: efficient information transmission and accurate modeling of the input probability distribution.
The authors make four key assumptions about the IPU:
- The IPU strives to accomplish the two objectives of efficient information transmission and input probability distribution modeling.
- Both the input and output of the IPU are discrete, with the number of output states N being significantly fewer than the number of input states M (M ≫ N).
- In the limit as M → ∞, the input becomes continuous; in the limit as N → ∞, the IPU transformation becomes a continuous function, resulting in continuous output.
- The IPU transformation is deterministic.
The authors first analyze the single-pixel case and prove that optimizing for information transmission does not guarantee optimal probability distribution modeling in general. They then introduce two strategies to achieve a suitable compromise between the two objectives: even coding and factorial coding.
For image patches, the authors propose an unsupervised learning approach using a multilayer perceptron (MLP) as the IPU. The learned representation exhibits several desirable properties:
- An approximately even distribution of output states, achieved through a novel loss function that encourages repulsion between representations.
- Emergence of local edge detectors and orientation-selective units, similar to the early visual system.
- Encoding of both luminance and color information, in contrast to methods that focus only on contrast.
The authors compare the efficiency of their IPU model to a deep learning model, showing that the IPU model can achieve comparable performance using significantly less memory for the representations.
統計資料
The input to the model consists of discrete pixel intensities, with M possible states.
The output of the model has N discrete states, where M ≫ N.
The authors train their models on 10 million pairs of neighboring pixels from the COCO dataset and an artificial 2D normal distribution dataset.
引述
"Utilizing an abstract information processing model based on minimal yet realistic assumptions inspired by biological systems, we study how to achieve the early visual system's two ultimate objectives: efficient information transmission and accurate sensor probability distribution modeling."
"We prove that optimizing for information transmission does not guarantee optimal probability distribution modeling in general."
"Our model provides novel insights into the computational theory of early visual systems as well as a potential new approach to enhance the efficiency of deep learning models."