Top Banner
Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al., 2012] Integrated model of visual processing [Bullier, 2001] Bert Kappen
14

Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Jun 05, 2018

Download

Documents

vuongdieu
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Visual information processing

How Does the Brain Solve Visual Object Recognition? [DiCarlo et al., 2012]Integrated model of visual processing [Bullier, 2001]

Bert Kappen

Page 2: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Vision as a feed-forward problem.

Vision operates on different time scales and facilitates many functions:- object recognition, object tracking, segmentation, obstacle avoidance, object grasping

Core Object Recognition:- fast first recognition phase. ≈ 250ms in monkey, ≈ 350ms in human and can be presented at 100 ms rate.- in agreement with rapid eye movement during exploration where the eye is fixated 200-500 ms.

Humans and monkeys perform core recognition fast, despite invariance problem.

Bert Kappen Visual information processing 1

Page 3: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Vision as a feed-forward problem.

Early visual areas (retina, LGN, V1) havetangled representation of objects.

Higher areas (IT) untangle into ’linear sep-arable’ representation.

Vast majority of images are ’unlabelled’ buttemporally contiguous, which may facilitateunsupervised learning.

’Shape’ results as only feature that is nottemporally contiguous

Bert Kappen Visual information processing 2

Page 4: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Vision as a feed-forward problem.

Ventral stream houses key circuits that underlie object recognition:- lesions in the posterior ventral stream produce complete blindness in part of the visual field- lesions or inactivation of anterior regions, (IT), can produce selective deficits in the ability to distinguishamong complex objects- retinotopic mappings for V1, V2, V4. less clear in IT.

Bert Kappen Visual information processing 3

Page 5: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Feed-forward model

Dominant model of visual processing is feed-forward [Marr, 1982].- first local information processing in V1, V2- later global information processing in higher layers (V4,IT)- for instance orientation selectivity in V1 is computed from LGN center-surround input (Hubel Wiesel)

Problem:- feed-forward model fails when input is noisy, while humans succeed (Mumford 1993)- reason is recognition of local features fails and cannot be fixed by global integration- weakness is that local ambiguity cannot be resolved from global feed-back.

Bert Kappen Visual information processing 4

Page 6: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Integrated model of visual processing [Bullier, 2001]

This paper argues that:

Cortical processing requires that information be exchanged between neurons coding fordistant regions in the visual field.

It is argued that feedback rather than lateral connections are the best candidates for suchrapid long-distance interconnections.

Bert Kappen Visual information processing 5

Page 7: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Lateral connectivity

Early areas such as V1, V2 have small receptive fields and high magnification. Thereforehorizontal connections only reach short distance:- the axon of a V1 or V2 neuron with a foveal receptive field cannot reach beyond 0.6o (assuming a maximalaxon length of 6 mm).

Higher areas have larger receptive fields and smaller magnification. These areas encodespecialised features (movement direction, depth, color, shape).- they can spatially integrate their particular feature, integration accross features much more difficult

Feedback:- feedback connections have very large convergence regions. can carry information from long distances inthe visual field (Salin 1993)

Large scale spatial integration by- lateral connectivity in higher layers. Requires (fast) computation combining different features.- feed-back from higher layers to lower layers.

Bert Kappen Visual information processing 6

Page 8: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Feed-back connectivity

90 % of the information transferred by a V1, V2 neuron is transmitted during the first 100ms of its response to a visual stimulus. Thus feed-back requires:- rapid activation of higher order neurons- fast feed-back connections

Areas MT, MST, FEF (frontal eye field) and 7a that contain neurons that are activatedsufficiently early to influence neurons in areas V1 and V2.

Speed of feed-back is in order of 1-2 ms because axons between areas are myelinated(for comparison, speed of lateral connections is ≈ 0.1m/s, requiring about 100 ms toreach neurons 1o apart in V1).

Bert Kappen Visual information processing 7

Page 9: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

P and M channels

Magnocellular neurons of the LGN reach area V1 and beyond some 20 ms earlier thanactivity transferred from the parvocellular neurons of the LGN [34]. The early activationof the dorsal stream after visual stimulation (Fig. 1B) most likely results from its almostexclusive drive by the M channel [26] and the high degree of myelination in many areasof the ’fast brain’.

Bert Kappen Visual information processing 8

Page 10: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Empirical evidence: human fMRI

Moving Kanisza rectangles create strong motion percept (left) absent in the control stim-ulus (right).

As expected, stimulus produces strong signal in V5/MT, and LOS/KO responsible forobject motion.Surprisingly, also strong activity in V1, whose neurone have small receptive fields andare not activated by the stimulus. Probable interpretation is feedback from V5/MT to V1,V2 neurons.

Bert Kappen Visual information processing 9

Page 11: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Empirical evidence: neural activity in V3

Moving low contrast bar over background of white and gray rectangles. Conditions:- moving bar on static background (centre)- moving bar on co-moving background (centre + background)- moving background (background)

Area MT is known to be mainly involved in low contrast:- inactivation of MT removes feed-back in centre condition (A left)- active and inactive MT yields suppression in centre + background condition (A centre)

Bert Kappen Visual information processing 10

Page 12: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Empirical evidence: neural activity in V3

Area MT is known to be mainly involved in low contrast:- decrease of centre response due to MT cooling strongest at low saliency (B)- background induced suppression (centre minus centre+background) almost eliminated at low saliencydue to MT cooling (C)

Bert Kappen Visual information processing 11

Page 13: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

Empirical evidence: stimulus flash

Effect of MT cooling on response of 51 V1,V2,V3 neurons shows that feedback is as fastas 10 ms.

Bert Kappen Visual information processing 12

Page 14: Visual information processing - SNNbertk/comp_neurosci/visual_ff_fb_bullier.pdf · Visual information processing How Does the Brain Solve Visual Object Recognition? [DiCarlo et al.,

References

References

[Bullier, 2001] Bullier, J. (2001). Integrated model of visual processing. Brain research reviews, 36:96–107.

[DiCarlo et al., 2012] DiCarlo, J. J., Zoccolan, D., and Rust, N. C. (2012). How does the brain solve visualobject recognition? Neuron, 73(3):415–434.

[Marr, 1982] Marr, D. (1982). Vision: A computational investigation into the human representation andprocesing of visual information. Freeman, San Francisco.

[Olshausen et al., 1996] Olshausen, B. A. et al. (1996). Emergence of simple-cell receptive field propertiesby learning a sparse code for natural images. Nature, 381(6583):607–609.

Bert Kappen Visual information processing 13