Effects of change propagation resulting from adaptive preprocessing in multicomponent predictive systems Manuel Martín Salvador, Marcin Budka, Bogdan Gabrys {msalvador,mbudka,bgabrys}@bournemouth.ac.uk Data Science Institute. Bournemouth University KES-2016, York, UK September 7th, 2016
33
Embed
Effects of change propagation resulting from adaptive preprocessing in multicomponent predictive systems
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Effects of change propagation resulting from adaptive preprocessing in multicomponent predictive systems
Manuel Martín Salvador, Marcin Budka, Bogdan Gabrys{msalvador,mbudka,bgabrys}@bournemouth.ac.uk
Data Science Institute. Bournemouth University
KES-2016, York, UKSeptember 7th, 2016
Outline1. Prologue2. Introduction to MCPS3. Motivation4. Reactive adaptation of MCPS5. Experiments6. Conclusion
PROLOGUE
Butterfly effectSmall causes can have large effects
Streaming error (mean over last 10 samples)SYN dataset with GFMM classifier
GFMMZ-Score PCA Min-Max
Wrongly classified
Need of preprocessing adaptation
Streaming error (mean over last 10 samples)SYN dataset with GFMM classifier
GFMMZ-Score PCA Min-Max
Wrongly classified (out of [0,1])
New hyperboxes
Main strategies for MCPS adaptation
Adaptation strategies GLOBAL LOCAL
Re-composition Full Partial
Hyperparameter optimisation (keep components) Full Partial
Parameterisation (keep components and hyperparameters) Full Partial
Main strategies for MCPS adaptation
Adaptation strategies GLOBAL LOCAL
Re-composition Full Partial
Hyperparameter optimisation (keep components) Full Partial
Parameterisation (keep components and hyperparameters) Full Partial
“Adapting Multicomponent Predictive Systems using Hybrid Adaptation Strategies with Auto-WEKA in Process Industry” @ AutoML / ICML 2016 http://bit.ly/adapting-mcps-paper
Streaming error (mean over last 10 samples)SYN dataset with GFMM classifier
GFMMZ-Score PCA Min-Max
Inconsistent hyperboxesdue to a different input space
REACTIVE ADAPTATION OF MCPS
Reactive adaptation of MCPSGFMMZ-Score PCA Min-Max
Time
i p1 p2 p3 o
[-3.1, 2.7]
x1 = 3.6
Reactive adaptation of MCPSGFMMZ-Score PCA Min-Max
Time
i p1 p2 p3 o
data
meta-data
[-3.1, 2.7]
x1 = 3.6
[-3.1, 3.6]
Reactive adaptation of MCPSGFMMZ-Score PCA Min-Max
Time
i p1 p2 p3 o
data
meta-dataprediction
[-3.1, 2.7]
x1 = 3.6
[-3.1, 3.6]
Updating a component: GFMM
0 1
1
0
(-3.1) (2.7)x1
x2
0 1
1
0
(-3.1) (3.6)x1
x2
Hyperboxes are mapped to the new
input space
EXPERIMENTS
Experiments
Name # Attr # Class Type
SYN 2 2 Synthetic
ELEC 7 2 Real
COVERTYPE 54 7 Real
GAS 128 6 Real
Datasets Scenarios
Id Adap.Model
Adap.Prepro.
ChangePropagation
#1 No No No
#2 Yes No No
#3 Yes Yes No
#4 Yes Yes YesFirst 200 samples for initial training, rest 400 for testing and online learning
GFMMZ-Score PCA Min-Max
Results
#3 crashes due to lack of change propagation when changing PCA components
CONCLUSION
Conclusion
Only model adaptation may not be enough to cope with evolving data streams, adaptive preprocessing should be considered.
However, “blind” adaptation of components can result in inconsistent models or even in a system crash.
Local adaptation of a component may require adapting further components. Therefore, a system must be reactive and propagate changes.
The definition of MCPS has been extended to support change propagation using a new token for meta-data in a coloured Petri net (cMCPS).
Future work
Large study to measure the actual cost of adaptation.
Open questions:
● How to handle propagation requiring changes of the Petri net structure?● How to handle transformations in systems with nonlinear components?● How to order components to reduce the cost of adaptation?● Can a meta-data token be removed at an early stage instead of being fully