Visual Dynamics: Probabilistic Future Frame Synthesis …vgg/rg/slides/vgg_rg_23_feb_2017... · Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks

VisualDynamics:ProbabilisticFutureFrameSynthesisviaCrossConvolutionalNetworks

TianfanXue* JiajunWu* KatieBouman BillFreeman

NIPS2016

VGGReadingGroup,24Feb2017AnkushGupta

Frame2

Task:futureframeprediction

Frame1 Frame2Deterministicneuralnetwork

Deterministicpredictionsfailtomodeluncertainty

Frame1 Deterministicneuralnetwork


Prediction

Whatistheproblem?

Frame1 Deterministicneuralnetwork


Prediction

Whatistheproblem?

SynthesisnetworkInputframe Sampledfutureframe

Sampledifferentfutureframes

Mainidea NetworkstructureOutline Whatthenetworklearns Result

Inputrandommotionvector𝑧~𝑝$(𝑧)

SynthesisnetworkInputframe

Sampledifferentfutureframes



Sampledfutureframe

Inputframe Anothersampledfutureframe

Segments Transformedsegments


Synthesizeusingdifferenttransformations


Sampledfutureframe

Motionvector𝑧

SynthesisnetworkInputframe

Encodingnetwork

Futureframe(groundtruth)

Training


Motionvector𝑧

Encodingnetwork

Synthesisnetwork Futureframe

(prediction)Trainingsamples

(Label-free)

Training

Inputframe

Futureframe(groundtruth)


Futureframe𝐼()*(prediction)

Motionvector𝑧

Encodingnetwork

Synthesisnetwork

Training

Futureframe𝐼+,(groundtruth)

Inputframe

Objectivefunction:𝐼()* − 𝐼+, + 𝐷01(𝒛||𝑁(𝟎, 𝐈))

Reconstructionloss




Inputframe

Encodingnetwork

Synthesisnetwork

Training Objectivefunction:𝐼()* − 𝐼+, + 𝐷01(𝒛||𝑁(𝟎, 𝐈))

KL-divergenceloss

Motionvector𝑧


Variational Autoencoder[Kingma andWelling,2014]


Synthesisnetwork

Testing


Encodingnetwork

Inputframe

Inputframe


u


Realoutputfromournetwork

Inputframe Futureframe

TransformsegmentsFindsegments

Inputrandommotionvector𝑧

Synthesizebytransformingsegments


Imagesegments Convolution

0 0 0

0 1 0

0 0 0

0 0 1

0 0 0

0 0 0

Movementcanbesynthesizedthroughconvolution


Imagesegments

Applyingmotiontoeachsegment


Motionkernels

Thedecodingnetworkgeneratesamotionkernelforeachcorrespondingsegment

Decodingnet

Motionvector𝑧

[Brabandere etal.2016][Finnetal.2016]

Motionvector𝑧

Inputframe

Futureframe

Synthesisnetwork

Futureframe


Whatisencodedinthemotionvector?

Encodingnetwork

Motionvector𝑧 Upwardmotionwhenchangingthisdimension


Eachdimensionencodesatypeofmotion

Motionvector𝑧 Legmotionwhenchangingthisdimension

Eachdimensionencodesatypeofmotion


• Simulatedshapes

• Trainingsamples

Results:toyexample


Input

Learnedsegments

Networkautomaticallydetectssegments

Triangles

Circles


Input SamplednextframeGroundtruthdistribution

Sampledistribution

Networklearnsthecorrelationbetweenappearanceandmotion


Input Sampledfutureframes

Results:real-worldimages

Mainidea NetworkstructureOutline Whatnetworklearns Result

Challenge:largemotion


Input TwosampledfutureframesArtifactsappearwhenmotionislarge

Baseline:Transferflow 25.5%Ourmethod 31.3%

Labeledasreal

MechanicalTurkstudytoassesssynthesisquality

Idealsynthesisalgorithmachieves50%


• Samplemultiplefutureframesthatareconsistentwiththeinput

• Synthesizeframesbytransformingsegments

• Learnamotionrepresentationwithoutsupervision

…

Contributions

Visual Dynamics: Probabilistic Future Frame Synthesis …vgg/rg/slides/vgg_rg_23_feb_2017... · Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks

Documents

Visual Dynamics: Probabilistic Future Frame Synthesis …vgg/rg/slides/vgg_rg_23_feb_2017... · Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks