Activity Understanding “ProcNets: Learning to Segment Procedures in Untrimmed and Unconstrained Videos” by Zhou, Xu and Corso Thomas Leyh University of Freiburg June 28th, 2017 Seminar on Current Works in Computer Vision Thomas Leyh Activity Understanding June 28th, 2017 1 / 24
38
Embed
Activity Understanding - “ProcNets: Learning to Segment ... · Activity Understanding \ProcNets: Learning to Segment Procedures in Untrimmed and Unconstrained Videos" by Zhou, Xu
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Activity Understanding“ProcNets: Learning to Segment Procedures in Untrimmed and
Unconstrained Videos” by Zhou, Xu and Corso
Thomas Leyh
University of Freiburg
June 28th, 2017Seminar on Current Works in Computer Vision
Thomas Leyh Activity Understanding June 28th, 2017 1 / 24
Outline
1 Introduction
2 Network ArchitectureContext-Aware Video EncodingProcedure Segment ProposalSequential Prediction
3 Performance
4 Conclusion
Thomas Leyh Activity Understanding June 28th, 2017 2 / 24
Outline
1 Introduction
2 Network ArchitectureContext-Aware Video EncodingProcedure Segment ProposalSequential Prediction
3 Performance
4 Conclusion
Thomas Leyh Activity Understanding June 28th, 2017 3 / 24
What is this about?
Thomas Leyh Activity Understanding June 28th, 2017 4 / 24
What is this about?
1 Grill the tomatoes in a pan
2 Add oil to a pan
3 Grill bacon until crispy...
8 Finish with bread
Number of segments and positions are inferred automatically!
Thomas Leyh Activity Understanding June 28th, 2017 5 / 24
What is this about?
1 Grill the tomatoes in a pan
2 Add oil to a pan
3 Grill bacon until crispy...
8 Finish with bread
Number of segments and positions are inferred automatically!
Thomas Leyh Activity Understanding June 28th, 2017 5 / 24
Why is this useful?
Video Description Generation
Activity Recognition
First step towards a self-learningrobot cook?
Figure: Kim Kyung-Hoon/Reuters
Thomas Leyh Activity Understanding June 28th, 2017 6 / 24
Why is this useful?
Video Description Generation
Activity Recognition
First step towards a self-learningrobot cook?
Figure: Kim Kyung-Hoon/Reuters
Thomas Leyh Activity Understanding June 28th, 2017 6 / 24
Outline
1 Introduction
2 Network ArchitectureContext-Aware Video EncodingProcedure Segment ProposalSequential Prediction
3 Performance
4 Conclusion
Thomas Leyh Activity Understanding June 28th, 2017 7 / 24
Network has three stages.
Thomas Leyh Activity Understanding June 28th, 2017 8 / 24
Stage 1
Reduce dimensionality of each frame.
Thomas Leyh Activity Understanding June 28th, 2017 9 / 24
What is ResNet?
What is Bi-LSTM?
Thomas Leyh Activity Understanding June 28th, 2017 10 / 24