
1

Geometric Methods for Learning and Memory

A thesis presented by

Dimitri Nowicki

to Université Paul Sabatier

in partial fulfillment for the degree of Doctor ès Sciences

in the subject of Applied Mathematics

2

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

3

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

4

Models and Algorithms requiring Geometric Approach

• Kalman–like filters

• Blind Signal Separation

• Feed-Forward Neural Networks

• Independent Component Analysis

5

Introduction

Spaces emerging in learning problems:

• Riemannian spaces

• Lie groups and homogeneous spaces

• Metric spaces without any Riemannian structure

6

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

7

Outline

• Some facts from Riemannian geometry

• Optimization algorithms
  – Smooth
  – Nonsmooth

• Implementation
  – The case of submanifolds
  – Computing exponential maps
  – Computing the Hessian, etc.

8

Some concepts from Riemannian Geometry

• Geodesics: curves $\gamma(t)$ satisfying

$$\frac{D}{dt}\,\frac{d\gamma}{dt} = 0$$

9

Exponential map

$$\exp_x : T_xM \to M, \qquad \exp_x(u) = y$$

where $y = \gamma(1)$ for the geodesic $\gamma$ with $\gamma(0) = x$ and $\dot\gamma(0) = u$.
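As a concrete illustration (not on the slide), here is a minimal sketch of the exponential map on the unit sphere embedded in R^n, where geodesics are great circles and exp_x(u) has the closed form cos(‖u‖)·x + sin(‖u‖)·u/‖u‖; the function name sphere_exp is hypothetical.

```python
import numpy as np

def sphere_exp(x, u):
    """Exponential map on the unit sphere: follow the geodesic (a great
    circle) starting at x with initial velocity u (u must be orthogonal
    to x, i.e. lie in the tangent space at x) for unit time."""
    norm_u = np.linalg.norm(u)
    if norm_u < 1e-12:                 # zero velocity: stay at x
        return x.copy()
    return np.cos(norm_u) * x + np.sin(norm_u) * (u / norm_u)

# usage: x = gamma(0), u = velocity gamma'(0), y = gamma(1) = exp_x(u)
x = np.array([1.0, 0.0, 0.0])
u = np.array([0.0, np.pi / 2, 0.0])    # tangent vector at x
y = sphere_exp(x, u)                   # -> approximately [0, 1, 0]
print(y)
```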

10

Parallel transport

• Computing parallel transport using an exponential map

$$\tau_{x \to y}\, v = \frac{\partial}{\partial s}\,\exp_x\!\big(t\,(u + s\,v)\big)\Big|_{t=1,\ s=0}$$

where $u$ is such that $\exp_x(u) = y$.
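A rough numerical illustration of this construction, assuming an exponential map such as the sphere_exp sketch above is available; the central difference only approximates the derivative in s, and the construction itself agrees with exact parallel transport only in special cases.

```python
import numpy as np

def transport_via_exp(exp_map, x, u, v, eps=1e-6):
    """Approximate the transport of a tangent vector v from x along the
    geodesic t -> exp_x(t u) by differentiating s -> exp_x(u + s v) at
    s = 0 with a central difference (t = 1 is absorbed into u)."""
    y_plus = exp_map(x, u + eps * v)
    y_minus = exp_map(x, u - eps * v)
    return (y_plus - y_minus) / (2.0 * eps)

# e.g. transport_via_exp(sphere_exp, x, u, v), with sphere_exp taken from
# the previous sketch
```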

11

Newton Method for Geometric optimization

The Newton operator:

$$N(x) = \exp_x\!\big(-[D^{2} f(x)]^{-1}\, D f(x)\big)$$

The modified Newton operator:

$$\tilde N(x_k) = \exp_{x_k}\!\big(-B_k^{-1}\, D f(x_k)\big), \qquad B_k = D^{2} f(x_k) + \epsilon_k I, \ \ \epsilon_k \ge 0 \ \text{ such that } B_k > 0$$
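A minimal sketch of one modified Newton step in these terms; grad_f, hess_f and exp_map are hypothetical user-supplied callables (Riemannian gradient, Hessian as a matrix in the same coordinates, and exponential map), and the regularization schedule for the shift is an arbitrary illustrative choice.

```python
import numpy as np

def modified_newton_step(x, grad_f, hess_f, exp_map, eps0=1e-8):
    """One step of the modified Newton operator
       x_{k+1} = exp_x( -B^{-1} grad f(x) ),  B = D^2 f(x) + eps * I > 0.
    grad_f(x) returns the Riemannian gradient (a tangent vector), hess_f(x)
    the Hessian matrix acting on tangent vectors, exp_map(x, v) the
    exponential map."""
    g = grad_f(x)
    H = hess_f(x)
    eps = 0.0
    while True:
        B = H + eps * np.eye(H.shape[0])
        try:
            np.linalg.cholesky(B)          # succeeds iff B is positive definite
            break
        except np.linalg.LinAlgError:
            eps = max(2.0 * eps, eps0)     # increase the shift until B > 0
    d = -np.linalg.solve(B, g)             # (modified) Newton direction
    return exp_map(x, d)                   # map the direction back to the manifold
```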

12

Wolfe condition for Riemannian manifolds

13

Global convergence of modified Newton method

14

Nonsmooth methods

• The subgradient:

$$g \in \partial f(x_0) \iff f\big(\exp_{x_0}(u)\big) \ \ge\ f(x_0) + \langle g, u\rangle \quad \text{for all } u \in T_{x_0}M$$

15

The r-algorithm

$$\tilde g_k = B_k^{*}\, g_f(x_k), \qquad x_{k+1} = \exp_{x_k}\!\big(-h_k\, B_k\, \tilde g_k\big)$$

$$r_k = \tau_{x_k \to x_{k-1}}\, g_f(x_k) - g_f(x_{k-1}) \ \in\ T_{x_{k-1}}M$$

$$B_{k+1} = \tau_{x_k \to x_{k+1}}\, B_k\, R_{\beta}(r_k), \qquad B_1 = I$$

Here $R_{\beta}(\xi) = I + (\beta - 1)\,\xi\,\xi^{\top}$ is the space-dilation operator in the direction $\xi$, $g_f$ denotes a subgradient of $f$, and $\tau$ denotes parallel transport.
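For orientation, here is a sketch of the classical flat-space r-algorithm that this slide generalizes; on a manifold the update x − h·B·g̃ is replaced by exp_x(−h·B·g̃), and B and the subgradients are parallel-transported between tangent spaces. The step length, dilation coefficient beta and the stopping rule below are illustrative choices.

```python
import numpy as np

def r_algorithm(subgrad, x0, h=0.5, beta=0.5, n_iter=100):
    """Classical (Euclidean) r-algorithm with space dilation along the
    difference of successive subgradients."""
    x = np.array(x0, dtype=float)
    n = x.size
    B = np.eye(n)                          # B_1 = I
    g_prev = subgrad(x)
    for _ in range(n_iter):
        g_tilde = B.T @ subgrad(x)         # subgradient in the dilated space
        norm = np.linalg.norm(g_tilde)
        if norm < 1e-12:
            break
        x = x - h * (B @ g_tilde) / norm   # normalized step (one common variant)
        g = subgrad(x)
        r = B.T @ (g - g_prev)             # dilation direction
        r_norm = np.linalg.norm(r)
        if r_norm > 1e-12:
            xi = r / r_norm
            # space dilation operator R_beta(xi) = I + (beta - 1) xi xi^T
            B = B @ (np.eye(n) + (beta - 1.0) * np.outer(xi, xi))
        g_prev = g
    return x

# usage: minimize the nonsmooth function f(x) = |x_1| + 2 |x_2|
x_min = r_algorithm(lambda x: np.array([np.sign(x[0]), 2 * np.sign(x[1])]),
                    x0=np.array([3.0, -2.0]))
```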

16

Problem of constrained optimization

• Equality constraints

$$\min_{x}\ f(x) \quad \text{subject to } F(x) = 0$$

$$f : D \to \mathbb{R}, \qquad F : D \to \mathbb{R}^{m}, \qquad D \subseteq \mathbb{R}^{n}$$

17

Classical (extrinsic) methods

• The Lagrangian

$$L(x, \lambda) = f(x) + \sum_{k=1}^{m} \lambda_k\, F_k(x)$$

Newton-Lagrange method

Sequential quadratic programming

18

Classical methods

• Penalty functions and the augmented Lagrangian

$$L_{c}(x, \lambda) = f(x) + \sum_{k=1}^{m} \lambda_k\, F_k(x) + \frac{c}{2}\,\|F(x)\|^{2}$$

19

Advantages of Geometric methods

• The dimension of the manifold is n − m, versus n + m for Lagrangian-based methods

• The objective may be convex on the manifold even when the Lagrangian is non-convex

• The geometric Hessian may be positive definite even when the classical one is not

20

Implementation: The case of Submanifolds

$$M = \{\, x \in \mathbb{R}^{n} : F(x) = 0 \,\}$$

$$F : D \to \mathbb{R}^{m}, \qquad F \in C^{2}(D), \qquad DF(x) \ \text{surjective for all } x \in M$$

21

Hamilton Equations for the Geodesics

• The Lagrangian:

$$L(x, \dot x) = \frac{1}{2}\,\|\dot x\|^{2} + \sum_{i=1}^{m} \lambda_i\, F_i(x)$$

The Hamiltonian:

$$H(p, x) = \frac{1}{2}\,\|p\|^{2} + \sum_{i=1}^{m} \lambda_i\, DF_i(x)\, p$$

22

Hamilton Equations for the Geodesics

$$\dot x = \big(I - DF^{+}(x)\, DF(x)\big)\, p \ \in\ T_xM$$

$$\dot p = -\sum_{i=1}^{m} \lambda_i\, D^{2}F_i(x)\, \dot x, \qquad \lambda = \big(DF(x)\, DF^{*}(x)\big)^{-1}\, DF(x)\, p$$

23

Lagrange equations are also constrained Hamiltonian equations

• We can rewrite the Lagrange equations in the form:

$$\dot x = p, \qquad \dot p = -DF^{+}(x)\, D^{2}F(x)(p, p)$$
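As a concrete check of this form, here is a sketch for the unit sphere with F(x) = (‖x‖² − 1)/2, where DF(x) = xᵀ, DF⁺(x) = x/(x·x) and D²F(x)(p, p) = p·p, so the system reduces to ẋ = p, ṗ = −(p·p/x·x)·x; the explicit Euler integrator and step size are arbitrary illustration choices.

```python
import numpy as np

def geodesic_sphere(x0, p0, dt=1e-3, n_steps=2000):
    """Integrate  x' = p,  p' = -DF^+(x) D^2F(x)(p, p)  for the sphere
    constraint F(x) = (|x|^2 - 1)/2, where DF(x) = x^T, DF^+(x) = x/(x.x)
    and D^2F(x)(p, p) = p.p, using a plain explicit Euler scheme."""
    x, p = np.array(x0, dtype=float), np.array(p0, dtype=float)
    traj = [x.copy()]
    for _ in range(n_steps):
        x_new = x + dt * p
        p_new = p - dt * (np.dot(p, p) / np.dot(x, x)) * x
        x, p = x_new, p_new
        traj.append(x.copy())
    return np.array(traj)

# a great-circle geodesic starting at the north pole with unit speed
traj = geodesic_sphere(x0=[0.0, 0.0, 1.0], p0=[1.0, 0.0, 0.0])
print(np.linalg.norm(traj[-1]))   # remains close to 1 for small dt
```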

24

Symplectic Numerical Integration

• A transformation is called symplectic if it preserves the following differential 2-form:

$$\omega^{2} = \sum_{i=1}^{n} dp_i \wedge dx_i$$

25

Implicit Runge-Kutta Integrators

For the system $\dot y = G(y)$ with $y = (x, p)$ and $y(0) = y_0$ given:

$$Y_i = y_k + h \sum_{j=1}^{s} a_{ij}\, G(Y_j), \qquad i = 1, \dots, s$$

$$y_{k+1} = y_k + h \sum_{i=1}^{s} b_i\, G(Y_i)$$

The IRK method is called symplectic if the associated transformation $y_k \mapsto y_{k+1}$ preserves $\omega^{2}$.

26

The Gauss method of order 4

Butcher tableau ($a_{ij}$, $b_i$):

$$a_{11} = \tfrac{1}{4}, \qquad a_{12} = \tfrac{1}{4} - \tfrac{\sqrt{3}}{6}$$

$$a_{21} = \tfrac{1}{4} + \tfrac{\sqrt{3}}{6}, \qquad a_{22} = \tfrac{1}{4}$$

$$b_1 = b_2 = \tfrac{1}{2}$$
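A minimal sketch of one step of this Gauss method for a generic system ẏ = G(y), solving the implicit stage equations by fixed-point iteration (adequate for small step sizes); the iteration count and the harmonic-oscillator test are illustrative choices.

```python
import numpy as np

# Butcher tableau of the order-4 Gauss (symplectic) IRK method
A = np.array([[1/4,                1/4 - np.sqrt(3)/6],
              [1/4 + np.sqrt(3)/6, 1/4               ]])
B = np.array([1/2, 1/2])

def gauss4_step(G, y, h, n_fixed_point=50):
    """One implicit Runge-Kutta step y -> y_next with the Gauss-4 tableau.
    The stage values Y_i = y + h * sum_j a_ij G(Y_j) are found by
    fixed-point iteration."""
    Y = np.array([y, y], dtype=float)           # initial guess for the stages
    for _ in range(n_fixed_point):
        GY = np.array([G(Y[0]), G(Y[1])])
        Y = y + h * (A @ GY)
    GY = np.array([G(Y[0]), G(Y[1])])
    return y + h * (B @ GY)

# usage: harmonic oscillator  y = (x, p),  x' = p,  p' = -x
G = lambda y: np.array([y[1], -y[0]])
y = np.array([1.0, 0.0])
for _ in range(1000):
    y = gauss4_step(G, y, h=0.1)
print(0.5 * (y[0]**2 + y[1]**2))   # energy stays near 0.5 (symplectic behaviour)
```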

27

Backward error analysis

28

Covariant Derivative on the Submanifold

$$\hat\nabla f(x) = \pi_{T_xM}\,\nabla f(x) = \big(I - DF^{+}(x)\, DF(x)\big)\,\nabla f(x)$$

where $DF^{+}$ denotes the pseudo-inverse of $DF$.
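A small sketch of this projection using the Moore-Penrose pseudo-inverse; the sphere constraint in the usage example is only an illustration.

```python
import numpy as np

def tangent_project(grad_f, DF):
    """Project an ambient gradient onto the tangent space of M = {F(x) = 0}:
       grad_hat = (I - DF^+ DF) grad_f,
    where DF is the m x n Jacobian of the constraints at x."""
    DF = np.atleast_2d(DF)
    n = DF.shape[1]
    P = np.eye(n) - np.linalg.pinv(DF) @ DF      # orthogonal projector onto T_x M
    return P @ grad_f

# example: sphere F(x) = |x|^2 - 1, DF(x) = 2 x^T; the projected gradient
# has no radial component
x = np.array([0.0, 0.0, 1.0])
g = np.array([1.0, 2.0, 3.0])
print(tangent_project(g, 2 * x))                 # -> [1, 2, 0]
```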

29

Computing the constrained Hessian

• Direct computation:

$$\hat D^{2} f = \pi_{T_xM}\, D\big(\pi_{T_xM}\, D f\big), \qquad \pi_{T_xM} = I - DF^{+}(x)\, DF(x)$$

• "Mixed" computation:

$$\hat D^{2} f = \pi_{T_xM}\big(D^{2} f - \mu^{\top} D^{2} F\big), \qquad \mu^{\top} = D f(x)\, DF^{+}(x)$$
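A sketch of one standard way to assemble such a constrained Hessian numerically, assuming the multiplier is taken as μ = (DF DFᵀ)⁻¹ DF ∇f (a common choice, not stated on the slide); the height-function-on-the-sphere example is purely illustrative.

```python
import numpy as np

def constrained_hessian(grad_f, hess_f, DF, d2F_list):
    """Constrained Hessian on M = {F(x) = 0}:
       H_hat = P (D^2 f - sum_i mu_i D^2 F_i) P,
    with P = I - DF^+ DF and mu = (DF DF^T)^{-1} DF grad_f."""
    DF = np.atleast_2d(DF)
    n = DF.shape[1]
    P = np.eye(n) - np.linalg.pinv(DF) @ DF
    mu = np.linalg.solve(DF @ DF.T, DF @ grad_f)
    H = hess_f - sum(m_i * D2 for m_i, D2 in zip(mu, d2F_list))
    return P @ H @ P

# example: the height function f(x) = x_2 on the unit sphere
# F(x) = (|x|^2 - 1)/2, evaluated at the north pole
x = np.array([0.0, 0.0, 1.0])
grad_f = np.array([0.0, 0.0, 1.0])
hess_f = np.zeros((3, 3))
DF = x.reshape(1, -1)                  # DF(x) = x^T
d2F = [np.eye(3)]                      # D^2 F(x) = I
print(constrained_hessian(grad_f, hess_f, DF, d2F))
# -> diag(-1, -1, 0): the north pole is a maximum of the height on the sphere
```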

30

Example of geometric iterations

31

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

32

Neural Associative memory

• Hopfield-type auto-associative memory. Memorized vectors are bipolar: $v_k \in \{-1, 1\}^{n}$, $k = 1, \dots, m$. Suppose these vectors are the columns of the $n \times m$ matrix $V$. Then the synaptic matrix $C$ of the memory is given by:

$$C = V V^{\top}$$

Associative recall is performed by the following procedure: the input vector $x_0$ is the starting point of the iterations

$$x_{t+1} = f(C x_t)$$

where $f$ is a monotonic odd function such that $\lim_{s \to \pm\infty} f(s) = \pm 1$.
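A minimal sketch of this memory in NumPy; the choice f(s) = tanh(gain·s), the gain, the iteration count and the zeroed diagonal of C are illustrative conventions not fixed by the slide.

```python
import numpy as np

def train_hopfield(V):
    """Synaptic matrix C = V V^T built from the n x m matrix of bipolar
    memorized patterns (columns v_k in {-1, +1}^n)."""
    C = V @ V.T
    np.fill_diagonal(C, 0.0)       # common convention: no self-connections
    return C

def recall(C, x0, gain=10.0, n_iter=50):
    """Associative recall x_{t+1} = f(C x_t) with f(s) = tanh(gain * s),
    a monotone odd function with limits +/- 1."""
    x = np.array(x0, dtype=float)
    for _ in range(n_iter):
        x = np.tanh(gain * (C @ x))
    return np.sign(x)

# usage: memorize two random bipolar patterns and recall one from a noisy cue
rng = np.random.default_rng(0)
V = rng.choice([-1.0, 1.0], size=(64, 2))        # n = 64, m = 2
C = train_hopfield(V)
cue = V[:, 0].copy()
cue[:5] *= -1                                    # flip 5 bits of the pattern
print(np.array_equal(recall(C, cue), V[:, 0]))   # True if the cue is recovered
```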

33

Attraction radius

• We call a stable fixed point of this discrete-time dynamical system an attractor. The attraction radius is the maximum Hamming distance between x0 and a memorized pattern vk such that the recall procedure still converges to vk.

34

Problem statement

35

Generalized averaging on the manifold

Euclidean mean as a minimizer: $\bar x = \operatorname*{argmin}_{x} \sum_{k=1}^{N} \|x - x_k\|^{2}$

Generalized (Karcher) mean on the manifold: $\bar x = \operatorname*{argmin}_{x \in M} \sum_{k=1}^{N} \rho^{2}(x, x_k)$

36

Computing generalized average on the Grassmann manifold

$$\min_{X}\ \Phi(X) = \sum_{k=1}^{N} \big\| X - C_k \big\|^{2}, \qquad \operatorname{rank} X = m$$

Generalized averaging as an optimization problem

$$\Phi(X) = \sum_{k=1}^{N}\sum_{i,j=1}^{n} \big(x_{ij} - c_{k,ij}\big)^{2} = \sum_{k=1}^{N}\sum_{i,j=1}^{n} \big(x_{ij}^{2} - 2\, x_{ij}\, c_{k,ij} + c_{k,ij}^{2}\big)$$

Transforming the objective function:

$$\Phi(X) = N\,\big\| X - \bar C \big\|^{2} + \mathrm{const}, \qquad \bar C = \frac{1}{N} \sum_{k=1}^{N} C_k$$
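Since the objective reduces to minimizing ‖X − C̄‖² over rank-m matrices, one natural realization (assuming the C_k are symmetric, e.g. projectors representing subspaces) is to truncate the eigendecomposition of C̄ to its m leading eigenpairs; the sketch below follows that idea and is not necessarily the exact procedure used in the thesis.

```python
import numpy as np

def generalized_average(C_list, m):
    """Best rank-m approximation of the mean matrix C_bar = (1/N) sum_k C_k,
    which minimizes  N * ||X - C_bar||^2  over matrices of rank m."""
    C_bar = np.mean(C_list, axis=0)
    # symmetric case: eigendecomposition, keep the m leading eigenpairs
    w, U = np.linalg.eigh(C_bar)
    idx = np.argsort(w)[::-1][:m]
    return (U[:, idx] * w[idx]) @ U[:, idx].T

# usage: average N random rank-m projectors C_k = Q_k Q_k^T
rng = np.random.default_rng(1)
n, m, N = 16, 3, 10
C_list = []
for _ in range(N):
    Q, _ = np.linalg.qr(rng.standard_normal((n, m)))
    C_list.append(Q @ Q.T)
X_bar = generalized_average(C_list, m)
print(np.linalg.matrix_rank(X_bar))   # -> 3
```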

37

Statistical estimation

38

Statistical estimation

39

Experimental results: the simulated data

• n = 256 for all experiments

Nature of the data

40

Experimental results: simulated data

41

Experimental results: simulated data

[Figure: frequencies of attractors of the associative clustering network for different m, p = 8; frequency (1 to 1000, log scale) vs. attractors (0 to 25), curves for m = 8, 16, 24, 32.]

42

Experimental results: simulated data

[Figure: frequencies of attractors of the associative clustering network for different p, with m = p; frequency (1 to 1000, log scale) vs. attractors (0 to 35), curves for p = 8, 16, 24, 32.]

43

Experimental results: simulated data

• Distinction coefficients of attractors of associative clustering network for different p, and m=p

[Figure: distinction coefficient (0.0001 to 1, log scale) vs. attractors (0 to 35), curves for p = 8, 16, 24, 32.]

44

The MNIST database: data description

• Gray-scale images, 28×28 pixels

• 10 classes: digits from "0" to "9"

• Training sample: 60,000 images

• Test sample: 10,000 images

• Before being fed to the network, the images were thresholded to obtain 784-dimensional bipolar vectors
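A minimal sketch of this preprocessing step; the threshold at half the gray-scale range is an assumed choice not stated on the slide.

```python
import numpy as np

def to_bipolar(images, threshold=127):
    """Threshold 28x28 gray-scale images (values 0..255) into
    784-dimensional bipolar vectors in {-1, +1}."""
    flat = images.reshape(len(images), -1).astype(float)   # (N, 784)
    return np.where(flat > threshold, 1.0, -1.0)

# usage with a dummy batch shaped like MNIST
batch = np.random.randint(0, 256, size=(5, 28, 28))
X = to_bipolar(batch)          # shape (5, 784), entries in {-1, +1}
```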

45

Experimental results: the MNIST database

• Examples of handwritten digits from the MNIST database

46

Experimental results: the MNIST database

• Generalized images of digits found by the network

47

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

48

Kernel AM

• The main algorithm

49

Kernel AM

• The basic algorithm (continued)

50

Algorithm Scheme

51

Experimental Results

• Gaussian Kernel

[Figure: Gaussian kernel: attraction radius (0 to 2.5) vs. alpha (0 to 0.05).]

52

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

53

Model of Signal

54

Signal Trajectories in the phase space

55

The Manifold

56

57

Example of Signal Processing

[Figure: three signal waveforms plotted against t (msec), t from −4 to 4 ms.]

58

Outline

• Introduction

• Geodesics, Newton Method and Geometric Optimization

• Generalized averaging over RM and Associative memories

• Kernel Machines and AM

• Quotient spaces for Signal Processing

• Application: Electronic Nose

59

Application to a Real-Life Problem

Electronic Nose: QCM Setup overview

Variance Distribution between principal Components

60

Chemical images in the space spanned by the first 3 PCs
