Top Banner
Indian Institute of Science Bangalore, India भारतीय विान संथान बंगलौर, भारत Department of Computational and Data Sciences ©Department of Computational and Data Science, IISc, 2016 This work is licensed under a Creative Commons Attribution 4.0 International License Copyright for external content used with attribution is retained by their original authors CDS Department of Computational and Data Sciences Gaurav Kumar Nayak 1 , Konda Reddy Mopuri 1,2 , Vaisakh Shaj1 1,3 , R. Venkatesh Babu 1 , Anirban Chakraborty 1 1 Indian Institute of Science, 2 University of Edinburgh, 3 University of Lincoln Zero-Shot Knowledge Distillation in Deep Networks
19

भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

Jul 23, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

Indian Institute of Science

Bangalore, India

भारतीय विज्ञान संस्थान

बंगलौर, भारत

Department of Computational and Data Sciences

©Department of Computational and Data Science, IISc, 2016 This work is licensed under a Creative Commons Attribution 4.0 International License Copyright for external content used with attribution is retained by their original authors

CDS Department of Computational and Data Sciences

Gaurav Kumar Nayak1, Konda Reddy Mopuri1,2, Vaisakh Shaj11,3, R. Venkatesh Babu1, Anirban Chakraborty1

1Indian Institute of Science, 2University of Edinburgh, 3University of Lincoln

Zero-Shot Knowledge Distillation

in Deep Networks

Page 2: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Objective

▪ Can we do Knowledge Distillation without (access to)

training data (Zero-Shot)?

‣ Data is precious and sensitive – won’t be shared

‣ E.g. : Medical records, Biometric data, Proprietary data

‣ Federated learning – Only models are available, not data

2

Page 3: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Knowledge Distillation

3

T

S

{X,Y}

Distillation Loss

Cross-entropy Loss

Hinton et al. Distilling the Knowledge in a Neural Network, arXiv:1503.02531, 2015

Dataset

Page 4: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Data Free Knowledge Distillation

4

T

S

{X,Y}

Distillation Loss

Cross-entropy Loss

Dataset

Page 5: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Data Free Knowledge Distillation

5

Can Data be

Synthesised

From T?

T

S

{X,Y}

Distillation Loss

Cross-entropy Loss

Dataset

Page 6: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Pseudo Data Synthesis: Class Impressions (CI)

6 Mopuri et al., Ask, Acquire and Attack: Data-free UAP generation using Class impressions, ECCV’18

T

Pre-softmax

Dog

Page 7: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Pseudo Data Synthesis: Class Impressions (CI)

Mopuri et al., Ask, Acquire and Attack: Data-free UAP generation using Class impressions, ECCV’18

7

Page 8: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Pseudo Data Synthesis: Class Impressions (CI)

Squirrel Monkey Goldfish Cock

Mopuri et al., Ask, Acquire and Attack: Data-free UAP generation using Class impressions, ECCV’18

8

Page 9: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Class Impressions (CI): Limitations

▪ Generated samples are less diverse

▪ Relative probabilities of incorrect classes are not considered

▪ Student does not generalize well when trained on CIs

9

Page 10: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Data Impressions (DI)

10

T

Class similarity matrix

DI Car

Cat

Horse

Truck

Page 11: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Data Free Knowledge Distillation

11

T

S

{X,Y}

Distillation Loss

Cross-entropy Loss

Dataset

Page 12: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Data Free Knowledge Distillation

12

T

S

Distillation Loss

Cross-entropy Loss

Data Impressions (DI)

Page 13: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Data Free Knowledge Distillation

13

T

S

Distillation Loss

Data Impressions (DI)

Page 14: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Results: MNIST and CIFAR-10

14

MNIST T: LeNet S: LeNet-Half CIFAR-10 T: AlexNet S: AlexNet-Half

Page 15: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

CI Vs DI : MNIST and CIFAR-10

15

MNIST CIFAR-10

Page 16: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Results: Comparison

16

MNIST CIFAR-10

Model Performance

Teacher – CE 99.34

Student – CE 98.92

Student–KD

(Hinton et al., 2015)

60K original data

99.25

(Kimura et al., 2018) 200 original data

86.70

(Lopes et al., 2017) (uses meta data)

92.47

ZSKD (Ours) (24000 DIs, and no original data)

98.77

Model Performance

Teacher – CE 83.03

Student – CE 80.04

Student – KD

(Hinton et al., 2015) 50K original data

80.08

ZSKD (Ours) (40000 DIs, and no original data)

69.56

Page 17: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Recent works along this direction

▪ Micaelli P, Storkey A. Zero-shot Knowledge Transfer via Adversarial Belief Matching. arXiv preprint arXiv:1905.09768. 2019 May 23.

▪ Chen H, Wang Y, Xu C, Yang Z, Liu C, Shi B, Xu C, Xu C, Tian Q. Data-Free Learning of Student Networks. arXiv preprint arXiv:1904.01186. 2019 May 29.

17

Page 18: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

CDS.IISc.ac.in | Department of Computational and Data Sciences

Summary

▪ For the first time we have proposed a Zero-Shot KD approach

▪ The effectiveness of the Data Impressions is demonstrated by training a student network from scratch.

▪ Hope our ZSKD can inspire researchers to explore more interesting dimensions and applications in this area.

18

Page 19: भारतीय विज्ञान संस्थान बंगलौरभारत Zero-Shot Knowledge …13-11-00)-13-11-30... · Mopuri et al., Ask, Acquire and Attack:

Indian Institute of Science

Bangalore, India

भारतीय विज्ञान संस्थान

बंगलौर, भारत

Department of Computational and Data Sciences

©Department of Computational and Data Science, IISc, 2016 This work is licensed under a Creative Commons Attribution 4.0 International License Copyright for external content used with attribution is retained by their original authors

CDS Department of Computational and Data Sciences

Thanks! Thanks! Please Visit Our Poster (#74)

19