Top Banner
1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of Doctor es Science in the subject of Applied Mathematics
60

1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

Mar 26, 2015

Download

Documents

Zachary Byrd
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

1

Geometric Methods for Learning and Memory

A thesis presentedby

Dimitri NowickiTo Universite Paul Sabatier

in partial fulfillmentfor the degree of Doctor es Science

in the subject ofApplied Mathematics

Page 2: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

2

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

Page 3: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

3

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

Page 4: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

4

Models and Algorithms requiring Geometric Approach

• Kalman–like filters

• Blind Signal Separation

• Feed-Forward Neural Networks

• Independent Component Analysis

Page 5: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

5

Introduction

• Riemannian spaces

• Lie groups and homogeneous spaces

• Metric spaces without any Riemannian structure

Spaces emerging in learning problems

Page 6: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

6

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

Page 7: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

7

Outline

• Some facts from Riemannian geometry

• Optimization algorithms– Smooth– Nonsmooth

• Implementation– The case of Submanifolds– Computing exponential maps– Computing Hessian etc.

Page 8: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

8

Some concepts from Riemannian Geometry

• Geodesics

0

dtd

DtD

Page 9: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

9

Exponential map

MMTu xx :)(exp

yu

MTuMxy

x

x

:)(exp

)0(,)0();1(

Page 10: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

10

Parallel transport

• Computing parallel transport using an exponential map

vuv stds

dx

styx

exp)(

0,1,

Where u such that yux

)(exp

Page 11: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

11

Newton Method for Geometric optimization

)(exp)(12 DDxN x

))(exp(~

kkk xDBN

The modified Newton operator

0;)(2 kkkkk BthatsuchIxDB

Page 12: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

12

Wolfe condition for Riemannian manifolds

Page 13: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

13

Global convergence of modified Newton method

Page 14: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

14

Nonsmooth methods

• The subgradient:

MTu

uxgxfuf

x

fx

0

0

allfor

),()())(exp( 00

Page 15: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

15

The r-algorithm

MTxgxgrkkkk xkfxxkfk

)()( 1,*

1B

MTr

rkx

k

kk

)( 211 1,1 kkxxk kkkRBB .

),)(1()( xR

1111*

11~exp),(~

kkkxkkfkk gBhxxgBgk

Here

Page 16: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

16

Problem of constrained optimization

• Equality constraints

0)( tosubject

min

:

:

xF

D

DF mn

Page 17: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

17

Classical (extrinsic) methods

• The Lagrangian

m

kkk xFxL

1

)(),(

Newton-Lagrange method

Sequential quadratic programming

Page 18: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

18

Classical methods

• Penalty functions and the augmented Lagrangian

2

1 21

)(),( FxFxLm

kkk

Page 19: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

19

Advantages of Geometric methods

• Dimension of the manifold is n-m against n+m in the case of Lagrangian-based methods

• We may have convex function in the manifold even if the Lagrangian is non-convex

• Geometric Hessian may be positive-definite even if the classical one is not

Page 20: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

20

Implementation: The case of Submanifolds

surjective;))(();(

:

}0)(:{

2

xDFDCF

DF

xFxM

M

mn

n

Page 21: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

21

Hamilton Equations for the Geodesics

• The Lagrangian:

m

iii xFxL

1

2)(

21

The Hamiltonian:

m

iii xDFxxpH

1

2)(

21

,

Page 22: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

22

Hamilton Equations for the Geodesics

pxDF

pxDFxDFIpx

xFDp

MT

m

iii

x

))((

))()((

)(

*

1

2

Page 23: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

23

Lagrange equation are also constrained Hamiltonian

• We can rewrite Lagrange equations in the form:

px

ppxFDxDFp

),)(()( 2

Page 24: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

24

Symplectic Numerical Integration

• A transformation is called symplectic if it preserves following differential 2-form:

n

iii dxdp

1

2

Page 25: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

25

Implicit Runge-Kutta Integrators

s

jjijki

s

iiikk

YGayY

YGbyy

yy

1

11

0

)(

)(

givenis)0(

The IRK method is called symplectic if associated transformation preserves 2

y=(x,p)

Page 26: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

26

The Gauss method of order 4

6

3

4

1

6

3

4

1

i=1 i=2

j=1 1/4

j=2 1/4

1/2 1/2

ija

ib

6

3

4

1

6

3

4

1

Page 27: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

27

Backward error analysis

Page 28: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

28

Covariant Derivative on the Submanifold

)())()((

)()(ˆ

xfxDFxDFI

xfxf MTx

Page 29: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

29

Computing the constrained Hessian

• Direct computation

))((ˆ 2 DDD MTMT xx

“Mixed” computation

where))()(( xDFxDFIMTx

DDF

FDDDT

TMTx

)(

)(ˆ 222

Page 30: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

30

Example of geometric iterations

Page 31: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

31

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

Page 32: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

32

Neural Associative memory

• Hopfield-type auto-associative memory. Memorized vectors are bipolar: vk{-1, 1} n, k=1…m. Suppose these vectors are columns of nm matrix V. Then synaptic matrix C of the memory is given by:

)(1 tt f Cxx

VVC

Associative recall is performed using following procedure: the input vector x0 is a starting point of the iterations:

where f is a monotonic odd function such that

1)(lim sfs

Page 33: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

33

Attraction radius

• We will call the stable fixed point of this discrete-time dynamical system an attractor. The maximum Hamming distance between x0 and a memorized pattern vk such that the examination procedure still converges to vk is called an attraction radius.

Page 34: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

34

Problem statement

Page 35: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

35

Generalized averaging on the manifold

argmin

argmin

Page 36: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

36

Computing generalized average on the Grassmann manifold

m

N

kk

XXX

CXX

rank;

)(min

2

1

2

Generalized averaging as an optimization problem

N

k

n

jiijkijkijij

N

k

n

jiijkij ccxxcx

1 1,

2,,

2

1 1,

2, 2)()(X

Transforming objective function:

constconst1 2

2

1

CXCX NN

NN

kk

Page 37: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

37

Statistical estimation

Page 38: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

38

Statistical estimation

Page 39: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

39

Experimental results: the simulated data

• n=256– for all experiments

Nature of the data

Page 40: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

40

Experimental results: simulated data

Page 41: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

41

Experimental results: simulated data

1

10

100

1000

0 5 10 15 20 25

Attractors

Fre

qu

en

cy

m = 8

m = 16

m = 24

m = 32

Frequencies of attractors of associative clustering network for different m, p=8

Page 42: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

42

Experimental results: simulated data

1

10

100

1000

0 5 10 15 20 25 30 35

Attractors

Fre

qu

en

cy

p = 8

p = 16

p = 24

p = 32

Frequencies of attractors of associative clustering network for different p, and m=p

Page 43: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

43

Experimental results: simulated data

• Distinction coefficients of attractors of associative clustering network for different p, and m=p

0.0001

0.001

0.01

0.1

1

0 5 10 15 20 25 30 35

Attractors

Dis

tin

cti

on

Co

eff

icie

nt

p = 8

p = 16

p = 24

p = 32

Page 44: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

44

The MNIST database: data description

• Gray-scale images 2828

• 10 classes: digits from “0” to “9”

• Training sample: 60000 images

• Test sample:10000 images

• Before entering to the network images were tresholded to obtain 784-dimensional bipolar vectors

Page 45: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

45

Experimental results: the MNIST database

• Example of handwritten digits from MNIST database

Page 46: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

46

Experimental results: the MNIST database

• Generalized images of digits found by the network

Page 47: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

47

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

Page 48: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

48

Kernel AM

• The main algorithm

Page 49: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

49

Kernel AM• The Basic Algorithm (Continued)

Page 50: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

50

Algorithm Scheme

Page 51: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

51

Experimental Results

• Gaussian Kernel

Gaussian kernel

0

0.5

1

1.5

2

2.5

0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045 0.05

alpha

Att

ract

ion

ra

diu

s

Page 52: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

52

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM Quotient spaces for Signal Processing

• Application: Electronic Nose

Page 53: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

53

Model of Signal

Page 54: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

54

Signal Trajectories in the phase space

Page 55: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

55

The Manifold

Page 56: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

56

Page 57: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

57

Example of Signal Processing

-4 -3 -2 -1 0 1 2 3 4

-0.3

-0.2

-0.1

0

0.1

0.2

0.3

t, msec

-4 -3 -2 -1 0 1 2 3 4-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

0.6

0.8

1

t, msec

-4 -3 -2 -1 0 1 2 3 4

-0.25

-0.2

-0.15

-0.1

-0.05

0

0.05

0.1

0.15

0.2

0.25

t, msec

Page 58: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

58

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

Page 59: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

59

Application for Real-Life Problem

Electronic Nose: QCM Setup overview

Variance Distribution between principal Components

Page 60: 1 Geometric Methods for Learning and Memory A thesis presented by Dimitri Nowicki To Universite Paul Sabatier in partial fulfillment for the degree of.

60

Chemical images in space spanned by first 3 PCs