Top Banner
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model Junting Pan M Cornia, L Baraldi, G Serra, R Cucchiara. Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model. arXiv preprint arXiv:1611.09571
27

Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

Apr 11, 2017

Download

Data & Analytics

Xavier Giro
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 2: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

1. Introduction

Page 3: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

Outline1. Introduction2. Model Architecture3. Experimental Evaluation4. Implementation Details5. Results 6. Conclusion

Page 4: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

Outline1. Introduction2. Model Architecture

a. Dilated Residual Convolutional Networksb. Attentive Convolutional LSTMc. Learned Priors

3. Experimental Evaluation4. Implementation Details5. Results 6. Conclusion

Page 5: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture

Page 6: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - DRCN

Page 7: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - DRCN

ResNet-50

Page 8: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - DRCN

Page 9: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - DRCN

conv4 use dilated rate = 1conv5 use dilated rate = 3

Page 10: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - Attentive ConvLSTM

Page 11: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - Attentive ConvLSTM

Page 12: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - Attentive ConvLSTM

Page 13: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - Attentive ConvLSTM

Page 14: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - DRCN

Page 15: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - DRCN

Page 16: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

2. Model Architecture - DRCN

Filter size of 5x5 with dilated rate =3, which means a receptive field of 17x17

Page 17: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

Outline1. Introduction2. Model Architecture3. Experimental Evaluation4. Implementation Details5. Results 6. Conclusion

- SALICON 5.000 testing images- MIT1003 1003 images- MIT300 300 testing images- CAT2000 2000 testing images

Page 18: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

Outline1. Introduction2. Model Architecture3. Experimental Evaluation4. Implementation Details5. Results 6. Conclusion

Page 19: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

4. Implementation Details- Batch size : 10- RMSprop optimizer- Residual Network are initialized using ResNet-50 pre trained on ImageNet- Loss Function : KL-Divergence - Learning rate : 10e-4, decay by a factor of 10 every two epochs

Page 20: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

Outline1. Introduction2. Model Architecture3. Experimental Evaluation4. Implementation Details5. Results 6. Conclusion

Page 21: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

5. Results - Model Ablation Analysis

Page 22: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

5. Results - Model Ablation AnalysisDRCN +Conv LSTM +Priors Ground Truth

Page 23: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

5. Results - SALICON MIT300 CAT200

Page 24: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

5. Results

Page 25: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

Outline1. Introduction2. Model Architecture3. Experimental Evaluation4. Implementation Details5. Results 6. Conclusion

Page 26: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

6. Conclusions

- New Saliency Attentive Model for fixation prediction.

- Attentive Convolutional LSTM that is specifically designed to sequentially focus on the most salient regions of input images.

- Residual Architecture with dilated filters that maintains the spatial resolution.

Page 27: Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model (UPC Reading Group 2017)

Thanks for your attention!