Top Banner
A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology [email protected]
29

A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology [email protected].

Dec 16, 2015

Download

Documents

Corey Golden
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

A Statistical Mechanical Analysis of Online Learning:

Can Student be more Clever than Teacher ?

Seiji MIYOSHIKobe City College of Technology

[email protected]

Page 2: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

2

Background (1)

• Batch Learning– Examples are used repeatedly– Correct answers for all examples– Long time– Large memory

• Online Learning– Examples used once are discarded– Cannot give correct answers for all examples– Large memory isn't necessary– Time variant teacher

Page 3: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

3

Background (2)

Teacher Student

J1

x1 xN

JNB1

x1 xN

BN

Page 4: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

4

Simple Perceptron

J1

x1 xN

JN

Output

Inputs

Connection weights

)sgn(Output1

N

iiixJ

+1

-1

Page 5: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

5

Background (2)

Teacher Student

J1

x1 xN

JNB1

x1 xN

BN

B

J

BJ

B J

Learnable Case

Page 6: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

6

Background (3)

Teacher Student

J1

x1 xN

JNB1

x1 xN

BN

Unlearnable Case( Inoue & Nishimori, Phys. Rev. E, 1997)( Inoue, Nishimori & Kabashima, TANC-97, cond-mat/9708096, 199

7)

Page 7: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

7

Background (4)B

J

B

J

Hebbian Learning

Perceptron Learning

B

J

Page 8: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

8

Model (1)

BMoving Teacher

JStudent

True Teacher

A

Page 9: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

9

Model (2)

Length of Student

Length of Moving Teacher

A

B J

Page 10: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

10

Model (3)

A

B J

Page 11: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

11

J1

x1 xN

JN

Output

Inputs

Connection weights

)sgn(Output1

N

iiixJ

Simple Perceptron

N

iiixJ

1

Output

Linear Perceptron

Page 12: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

12

Model (3)

Linear Perceptrons with Noises

A

B J

Page 13: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

13fg

Model (4)Squared Errors

Gradient Method

A

B J

Page 14: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

14

ErrorGaussian

Generalization Error

A

B J

Page 15: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

15

Differential equations for order parameters

Page 16: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

16fg

Model (4)Squared Errors

Gradient Method

A

B J

Page 17: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

17

Bm+1 = Bm + gm xm

+

NrBm+1 = NrB

m + gmym

NdtNrB

m+2 = NrBm+1 + gm+1ym+1

NrBm+Ndt = NrB

m+Ndt-1 + gm+Ndt-1ym+Ndt-1

NrBm+Ndt = NrB

m + Ndt <gy>

N(rB+drB) = NrB + Ndt <gy>

drB / dt = <gy>

Page 18: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

18

Differential equations for order parameters

Page 19: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

19

Sample Averages

Page 20: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

20

Differential equations for order parameters

Page 21: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

21

Analytical Solutions of Order Parameters

Page 22: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

22

Differential equations for order parameters

Page 23: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

23

ErrorGaussian

Generalization Error

A

B J

Page 24: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

24

Gen

era

liza

tio

n E

rro

r

t=m/N

J

B

0 5 10 15 20

1

2

1.5

0.5

Dynamical Behaviors of Generalization Errors

ηJ = 1.2

Gen

era

liza

tio

n E

rro

r

t=m/N

J

B

0 5 10 15 20

1

1.5

0.5

ηJ = 0.3

Page 25: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

25

Dynamical Behaviors of R and l

ηJ = 1.2 ηJ = 0.3

t=m/N

lJ

RJ

RB

lB

R, l

0

0

0.2

0.40.6

0.8

1.01.2

1.41.61.8

2.0

5 10 15 20

lJ

RJ

RB

lB

R, l

t=m/N

0

0.2

0.4

0.6

0.8

1.0

1.2

0 5 10 15 20

Page 26: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

26

Analytical Solutions of Order Parameters

Page 27: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

27

Steady State

Gen

era

liza

tio

n E

rro

r

0.00.1

1

10

0.5 1.0 1.5 2.0

J

B

η J

0.0 0.5 1.0 1.5 2.0

η J

lJ

lB

l

1

1.5

2

2.5

3

4

3.5

0.0 0.5 1.0 1.5 2.0

η J

R

0

0.2

0.4

0.6

0.8

1.0

RJ

RB

Page 28: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

28ηJ

20

B J B J AB JB J

Page 29: A Statistical Mechanical Analysis of Online Learning: Can Student be more Clever than Teacher ? Seiji MIYOSHI Kobe City College of Technology miyoshi@kobe-kosen.ac.jp.

29

Conclusions• Generalization errors of a model compose

d of a true teacher, a moving teacher, and a student that are all linear perceptrons with noises have been obtained analytically using statistical mechanics.

• Generalization errors of a student can be smaller than that of a moving teacher, even if the student only uses examples from the moving teacher.