Recurrent nets and LSTM Nando de Freitas
Recurrent nets and LSTM - Department of Computer Science ... · Neural Turing Machine (NTM) [Alex Graves, Greg Wayne, ... learning known as autoencoders. ... Forget Gate Output Gate

May 17, 2018

Outline of the lecture

This lecture introduces you to sequence models. The goal is for you to learn about:

Recurrent neural networks
The vanishing and exploding gradients problem
Long short-term memory (LSTM) networks
Applications of LSTM networks
    Language models
    Translation
    Caption generation
    Program execution

A simple recurrent neural network

[Alex Graves]
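
The slide's diagram is not preserved in this transcript. As a rough illustration only, the sketch below assumes the common textbook form of a simple recurrent network, h_t = tanh(W_xh x_t + W_hh h_{t-1} + b_h). The names rnn_step, rnn_forward, W_xh, W_hh and b_h are hypothetical and not taken from the lecture.

```python
import math

def rnn_step(x, h_prev, W_xh, W_hh, b_h):
    """One recurrent step: h_t = tanh(W_xh x_t + W_hh h_{t-1} + b_h).

    Vectors are plain lists; matrices are lists of rows (assumed layout).
    """
    h = []
    for i in range(len(b_h)):
        a = b_h[i]
        a += sum(W_xh[i][j] * x[j] for j in range(len(x)))
        a += sum(W_hh[i][k] * h_prev[k] for k in range(len(h_prev)))
        h.append(math.tanh(a))
    return h

def rnn_forward(xs, h0, W_xh, W_hh, b_h):
    """Run the same step over a sequence, reusing the weights at every time step."""
    h = h0
    states = []
    for x in xs:
        h = rnn_step(x, h, W_xh, W_hh, b_h)
        states.append(h)
    return states
```

The point of the sketch is that the hidden state is the only channel carrying information from earlier inputs to later ones, and that the same weights are applied at every step.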

Vanishing gradient problem

[Yoshua Bengio et al]

Vanishing gradient problem
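
The derivation on these slides is not preserved. As a hedged illustration, the sketch below assumes a scalar recurrence h_t = tanh(w h_{t-1}) and tracks the factor d h_T / d h_0 that backpropagation through time multiplies into the gradient. A purely linear recurrence h_t = w h_{t-1} is included to show the exploding case. The function names are hypothetical.

```python
import math

def tanh_backprop_factor(w, h0, steps):
    """Return d h_T / d h_0 for the scalar recurrence h_t = tanh(w * h_{t-1})."""
    h = h0
    grad = 1.0
    for _ in range(steps):
        h = math.tanh(w * h)
        # Chain rule: d tanh(w * h_prev) / d h_prev = w * (1 - h_t^2)
        grad *= w * (1.0 - h * h)
    return grad

def linear_backprop_factor(w, steps):
    """Return d h_T / d h_0 for the linear recurrence h_t = w * h_{t-1}."""
    return w ** steps

if __name__ == "__main__":
    print(tanh_backprop_factor(0.9, 0.5, 50))   # shrinks toward zero
    print(linear_backprop_factor(1.1, 50))      # grows with the number of steps
```

Each step contributes one multiplicative factor, so the product shrinks geometrically when the factors are below one in magnitude and grows geometrically when they are above one.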

Simple solution

LSTM

[Alex Graves]

LSTM
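
The equations on the LSTM slides are not preserved in this transcript. The sketch below assumes a widely used formulation with input, forget and output gates and no peephole connections, which may differ from the variant shown in the lecture. The parameter names (W_i, b_i and so on) and the helper functions are hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def affine(W, b, v):
    """Compute W v + b, with W given as a list of rows."""
    return [b[i] + sum(W[i][j] * v[j] for j in range(len(v)))
            for i in range(len(b))]

def lstm_step(x, h_prev, c_prev, params):
    """One LSTM step (assumed formulation, no peepholes).

    i, f, o are the input, forget and output gates; g is the candidate update.
    c_t = f * c_{t-1} + i * g   and   h_t = o * tanh(c_t), all entry-wise.
    """
    v = x + h_prev  # concatenate the input and the previous hidden state
    i = [sigmoid(z) for z in affine(params["W_i"], params["b_i"], v)]
    f = [sigmoid(z) for z in affine(params["W_f"], params["b_f"], v)]
    o = [sigmoid(z) for z in affine(params["W_o"], params["b_o"], v)]
    g = [math.tanh(z) for z in affine(params["W_g"], params["b_g"], v)]
    c = [f_k * c_k + i_k * g_k for f_k, c_k, i_k, g_k in zip(f, c_prev, i, g)]
    h = [o_k * math.tanh(c_k) for o_k, c_k in zip(o, c)]
    return h, c
```

The additive cell update c_t = f * c_{t-1} + i * g is the design choice that matters here: when the forget gate stays close to one, the cell state and its gradient can pass through many steps largely unchanged.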

Entry-wise multiplication layer
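
The slide body is not preserved. As a minimal sketch of what an entry-wise (Hadamard) multiplication layer computes, assume two input vectors a and b of equal length: the forward pass returns z = a * b entry by entry, and the backward pass follows from the product rule. The function names are hypothetical.

```python
def mul_forward(a, b):
    """Forward pass: z_k = a_k * b_k."""
    return [a_k * b_k for a_k, b_k in zip(a, b)]

def mul_backward(a, b, grad_out):
    """Backward pass given grad_out = dL/dz.

    dL/da_k = dL/dz_k * b_k   and   dL/db_k = dL/dz_k * a_k.
    """
    grad_a = [g * b_k for g, b_k in zip(grad_out, b)]
    grad_b = [g * a_k for g, a_k in zip(grad_out, a)]
    return grad_a, grad_b
```

This is the operation by which a gate scales a signal: each input receives the upstream gradient scaled by the other input.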

LSTM cell in Torch

LSTM column in Torch

LSTMs for sequence to sequence prediction

[Ilya Sutskever et al]

LSTMs for sequence to sequence prediction

Learning to parse

[Oriol Vinyals et al]

Learning to execute

[Wojciech Zaremba and Ilya Sutskever]

Video prediction

Handwriting recognition and synthesis

[Alex Graves]

Neural Turing Machine (NTM)

[Alex Graves, Greg Wayne, Ivo Danihelka]

Neural Turing Machine (NTM)

Neural Turing Machine (NTM)

Translation with alignment (Bahdanau et al)
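
The slide's figure is not preserved. As a hedged sketch of the soft-alignment idea, the code below turns one score per encoder state into weights with a softmax and returns the weighted sum of encoder states as a context vector. A plain dot product is used as a stand-in scoring function for brevity; Bahdanau et al score with a small feed-forward network instead. All names are hypothetical.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, encoder_states):
    """Return (alignment weights, context vector).

    Scores use a dot product here, which is an assumption made for brevity.
    """
    scores = [sum(q * h for q, h in zip(query, hs)) for hs in encoder_states]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    context = [sum(w * hs[d] for w, hs in zip(weights, encoder_states))
               for d in range(dim)]
    return weights, context
```

Because the weights are a differentiable function of the scores, the alignment can be trained jointly with the rest of the translation model by backpropagation.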

Show, attend and tell

[Kelvin Xu et al, 2015]

Show, attend and tell

Next lecture

In the next lecture, we will look at techniques for unsupervised learning known as autoencoders. We will also learn about sampling and variational methods.

I strongly recommend reading Kevin Murphy’s variational inference book chapter prior to the lecture.