Top Banner
A Tutorial on Feature Extraction Methods Tianyi Wang GE Global Research Subrat Nanda GE Power & Water September 24, 2012
41

A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

Mar 07, 2018

Download

Documents

ledung
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

A Tutorial on

Feature Extraction Methods

Tianyi Wang GE Global Research Subrat Nanda GE Power & Water September 24, 2012

Page 2: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

2

Outline

• Introduction

• Data characteristics

• Application & domain

• Feature extraction methods

• Feature dimensionality reduction

• Issues in real applications

• Summary

Page 3: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

3

Where Feature Extraction fits in a PHM System

Data Acquisition (DA)

Data Manipulation (DM)

State Detection (SD)

Health Assessment (HA)

Prognostics Assessment (PA)

Advisory Generation (AG)

source: MIMOSA OSA CBM architecture

a.k.a. Feature Extraction in

data-driven PHM solutions

such as normalization,

smoothing, outlier removal,

missing data imputation, ...

Page 4: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

4

Feature extraction: what and why

What:

Feature extraction transforms raw signals into more informative signatures or fingerprints of a system

Why:

• Extract information from data

• Serve the need of follow-up modeling procedures

• Achieve intended objectives

Features

Page 5: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

5

Example of feature extraction

Problem: bearing health assessment

Data: vibration (from accelerometers)

Extract frequency domain features:

• Segment the data with a certain time window

• Transform each segment into frequency spectrum with FFT

• Calculate energy for each frequency band around interested frequency F

where Af is the amplitude of frequency f

• Obtain feature vector [EF1, EF2, ...]

0 500 1000 1500 2000 2500 3000-20

-15

-10

-5

0

5

10

15

Run to failure

vibration data

One segment

Frequency

spectrum

Energy for

selected frequency

band

Feature vector

||

2

Ff

fF AE

Page 6: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

6

Feature extraction process

Feature generation

Feature dimensionality

reduction

• With exhaustive or ad hoc approach

• Dimensionality may reduce or increase

• Incorporate domain knowledge

• Underlying physical phenomenon

Two approaches

• Select a subset of generated features

• Transform the features to another

space with lower dimensions

Feature representation:

• A scalar or vector per feature

• One vector concatenating all features

• One matrix holding all samples of features

Data representation:

• Varying format: 1-D, 2-D time series, events,

...

• Potential heterogeneous

Feature set

Raw data/

preprocessed

data

Page 7: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

7

What features to extract? Factors to consider...

Application & Domain

Algorithm

Data

What data are available and

what are their properties

What feature extraction

algorithms are available

and applicable

What domain the

application is; what

knowledge and

requirements are present

Page 8: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

8

Outline

• Introduction

• Data characteristics

• Application & domain

• Feature extraction methods

• Feature dimensionality reduction

• Issues in real applications

• Summary

Page 9: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

9

Data (signal) properties

• Time variant

• Time invariant (meta data such as asset ID)

Time dependency

• Temperature

• Pressure • Current • Voltage

• Speed • Acceleration

• ...

Physics nature

• Binary

• Discrete nominal (categorical)

• Discrete ordinal (integer)

• Continuous (real number)

Sample value

discretization

• Transaction/event (push)

• Sensor (pull) • Evenly sampled • Unevenly sampled

Sampling (time

discretization)

• Scalar

• Vector • Matrix • ...

Sample dimension (not counting time)

• Stationary

• Cyclic (non periodic) • Waveform (periodic) • Stochastic (non cyclic)

Dynamics (relative

to sampling)

Page 10: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

10

Data sampling (time discretization)

Transaction/event (data are “pushed” by data originator)

• Data records occur only at the specified time stamp.

• Data between the time stamps (interpolation) are undefined.

Sensor (data are “pulled” from data originator)

• Data samples are acquired only at the specified time stamp

• Data between the time stamps are just not observed.

• Sampling rate

• Evenly sampled – controlled (e.g. 100 Hz)

• Unevenly sampled - triggered

Page 11: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

11

Sample value discretization

Binary

• Events status, on/off sensor

Discrete nominal (categorical)

• Event code, operating mode, asset ID

Discrete ordinal (integer)

• If interpolation is meaningful, treat as continuous; otherwise, treat as discrete nominal

Continuous (real number)

• Most sensors

Page 12: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

12

Signal dynamics (relative to sampling) Stationary (constant + white noise)

• Power, speed, temperature in steady state of motors, gas turbines, etc.

Stochastic (non-cyclic)

• Power, speed in wind turbine operation

Cyclic (consider each period individually)

• Power, speed, pressure in manufacturing process, gas turbine startup, etc.

Waveform (consider multiple period together)

• Vibration sensors, acoustic sensors

0 100 200 300 400 500 600 700 800 900 1000-10

-8

-6

-4

-2

0

2

4

6

8

10

0 1000 2000 3000 4000 5000 6000-10

0

10

20

30

40

50

60

0 500 1000 1500 2000 2500 3000-20

-15

-10

-5

0

5

10

15

0 100 200 300 400 500 600 700 800 900 10000

5

10

15

20

25

Page 13: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

13

Outline

• Introduction

• Data characteristics

• Application & domain

• Feature extraction methods

• Feature dimensionality reduction

• Issues in real applications

• Summary

Page 14: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

14

Properties of extracted features

Yes

No

Ease of Explainability

Ease of Maintenance

Handle Missing Data

Properties Presence

Uncertainty Handling

Subjective Opinions

Rule Out Black Box Methods

Spatial Space Features Statistical Moments Bayesian Models /Rules

Remarks

Feature Space Transforms

Pattern analysis methods

Domain Understanding

Sample rate

Data Property

Page 15: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

15

Application domain

Category

• Mechanical, structural, thermal, electrical, chemical, ...

Systems

• Machine tool, vehicle, aircraft, locomotive, wind turbine, construction machinery, ...

Common components

• Bearing, gearbox, motor, pump, engine, gas turbine, battery, ...

Many features extraction methods and data

processing procedures come from domain know-how

Page 16: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

16

Domain specific feature extraction

Failure Mode: depending upon the failure type, certain rations, differences, DFEs, etc. are extracted for tracking over time

Operating Mode: specific sensors can be more/less critical in different operating conditions of machines…

- raw sensors to be used for feature extraction…

- variances under different conditions itself can form basis for further feature extraction

Component Function: Features extracted on basis of knowledge about specific components for which PHM desired…

Known Relations: Certain relation types can be assumed between variables of interest…this can affect features calculated for those relations

Page 17: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

17

Outline

• Introduction

• Data characteristics

• Application & domain

• Feature extraction methods

• Feature dimensionality reduction

• Issues in real applications

• Summary

Page 18: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

18

Feature extraction method overview

• Data descriptive statistics

– For sensors: RMS, variance, kurtosis, crest factor, correlation coefficient, ...

– For events: count, occurrence rate, duration, time delays, ...

• Data descriptive models

– Distribution models: Parametric distributions, histogram, ...

– Information-based models: mutual information, minimal description length, ...

– Regression models (use model parameters or modeling errors): curve fitting, AR models, ...

– Classification/clustering models (use class label as feature), sequence matching likelihood

• Time-independent transforms – Explicit mathematical operations: difference, summation, ratio, logarithm, power n, ...

– Principal component analysis, Independent component analysis, etc.

• Time series transforms (mainly for waveform signal)

– Frequency domain, time-frequency domain, wavelet domain, EMD

• Domain dependent feature extraction

– Physics based features: expected input-output or output-output relations, derived hidden states, etc.

– Special procedures for data processing: operational regime segmentations, envelop analysis, etc.

Page 19: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

19

Data descriptive statistics

For sensors:

• One variable: RMS, mean, variance, kurtosis, crest factor, peak2peak, auto correlation...

• Two variables: cross correlation

For events:

• Count, occurrence rate, duration, time delays, ...

RMS

)0.5(factorcrest minmax xx

Page 20: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

20

Data descriptive models

Distribution models:

• Parametric distributions, histogram, ...

Information-based models:

• mutual information, minimal description length, ...

Regression models (use model parameters or modeling errors):

• Curve fitting (linear, exponential, etc.), AR models, ...

Classification/clustering models (use class label as feature):

• Any pattern classifiers (Fisher discriminant, Bayes, etc.)

• Sequence matching likelihood

Page 21: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

21

Time-independent transforms

Explicit mathematical operations:

• Difference, summation, ratio, logarithm, power n, ...

Data dimension reduction transforms:

• Principal component analysis, Independent component analysis, etc.

Notes: These transforms

• Do not alter the number of samples

• Are usually used to produce feature from features

Page 22: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

22

Frequency domain Time-frequency

• Spectral analysis

• Envelope analysis

• Cepstrum analysis

• Higher order spectrum

• Short-time Fourier Transform (STFT)

• Wigner-Ville distribution (WVD)

• Empirical mode decomposition (EMD)

• Spectral kurtosis

• Cyclostationary analysis

Stationary signals Non-stationary signals

Wavelets

• Continuous wavelet transform (CWT)

• Discrete wavelet transform (DWT)

• Wavelet packet transform

• Morlet wavelet

• Hilbert-Huang transform

Time series transforms

Methods mainly for vibration analysis/waveform data

Page 23: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

23

vibration

analysis

Scalars

Vectors

Matrices

direc

t

domain

specific

statistic

al

image

processing

features

vibration

signals

RMS, kurtosis,

etc

Spectrum, etc

wavelet coef map,

etc

Feature extraction ≠ vibration analysis

Signal

processing

• Time domain

• Freq. domain • Time-Freq domain

Page 24: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

24

Domain dependent feature extraction

Physics based features

• Simple input-output or output-output relations

• Errors between model output and observations

• Estimated unobservable states

• System identification parameters

Special procedures for data preprocessing

• Time synchronous averaging

• Enveloping/demodulation

• Operational regime segmentation

• ...

Model based FDI

approaches

Page 25: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

25

Domain dependent feature extraction: an example for bearing

sh

p

b fD

DNBPFIRaceInner

cos1

2

sh

p

b fD

DNBPFORaceOuter

cos1

2

sh

p

b

b

pf

D

D

D

DBSFRollerBall

2

cos12

/

sh

p

b fD

DFTFCage

cos1

2

1

)(Hzspeedshaftf

anglecontact

elementsrollingofdiameterpitchD

diameterelementrollingD

elementsrotatingofnumberN

sh

p

b

Bearing characteristic frequencies Raw Vibration

signal RMS

Kurtosis

Crest factor

etc.

Envelope

analysis

Energy of

characteristic

frequencies,

Spectral kurtosis

etc.

Discrete

Fourier transform

Discrete

Fourier transform

Demodulated

signal

Page 26: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

26

Domain dependent feature extraction: an example for gearbox

Raw Vibration

signal

Tacho signal

Signal

conditioning

Time Synchronous

Averaging

Remove

fundamental shaft and mesh frequencies and

harmonics

Band-pass around

fundamental mesh frequency including sidebands

Remove first order

side bands

DC offset

removal

Conditioned

raw signal

TSA signal

Residual signal

Difference signal Band pass mesh signal

RMS

Kurtosis

Delta RMS

Crest factor

Enveloping

Demodulation

FM0

NB4 NA4, NA4*

FM4, M6A,

M8A

Lebold, M.; McClintic, K.; Campbell, R.; Byington, C.; Maynard, K., Review of Vibration Analysis

Methods for Gearbox Diagnostics and Prognostics, Proceedings of the 54th Meeting of the

Society for Machinery Failure Prevention Technology, Virginia Beach, VA, May 1-4, 2000, p.

623-634.

Page 27: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

27

Requirements/limitations of algorithms

Examples of what a feature extraction algorithm may care

• Continuous value?

• Evenly sampled data?

• Missing data handled first?

• Waveform? e.g. frequency domain analysis applicable?

• Presence of special signals? e.g. to apply Time Synchronous Averaging (TSA), Tacho & Vibration signals are required

• One, or two, or more sensors together? e.g. to apply correlation, PCA

• Similar measurements? e.g. to apply mathematical difference

Page 28: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

28

Exhaustive feature generation

Application & Domain

Algorithm

Data

Features to extract

Reasoning/predicti

on engine

Page 29: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

29

Outline

• Introduction

• Data characteristics

• Application & domain

• Feature extraction methods

• Feature dimensionality reduction

• Issues in real applications

• Summary

Page 30: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

30

Feature extraction process

Feature generation

Feature dimensionality

reduction

• With exhaustive or ad hoc approach

• Dimensionality may reduce or increase

depending on the situation

Two approaches

• Select a subset of generated features

• Transform the features to another

space with lower dimensions

Feature representation:

• A scalar or vector per feature

• One vector concatenating all features

• One matrix holding all samples of features

Data representation:

• Varying format: 1-D, 2-D time series, events,

...

• Potential heterogeneous

Feature set

Raw data/

preprocessed

data

• With exhaustive or ad hoc approach

• Dimensionality may reduce or increase

• Incorporate domain knowledge

• Underlying physical phenomenon

Page 31: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

31

Feature selection: what are good features

Desired characteristics of features

• High relevance to the objective, e.g., anomaly detection, diagnosis, degradation, PoD/FDR, etc.

• Low redundancy (linearly independent) among the features

Additional characteristic that are frequently overlooked

• Low relevance to non-objective factors, e.g. across assets, environment, usage pattern/ operating conditions, etc.

Page 32: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

32

Feature selection strategies

Filter approach

– Metrics defined using local criteria different from the target models

– Search for ‘Good’ representation of raw data/features

– Computationally less-expensive

Wrapper

– Metrics defined by the performance (accuracy) of the target models

– ‘Application’ specific

– Computationally expensive

Embedded approach

– Feature selection built into the target model

– Regression: sparse regression, LASSO, etc.

– Classification: decision tree, regularized random forest

Page 33: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

33

Filter approaches

Exhaustive

Heuristic

Random

Sequencial

Searc

h m

eth

ods

Evaluation criteria

Examples • mRMR (Minimum-

redundancy-maximum-

relevance )

• Fisher score

• Gini score

• Kruskal Wallis statistics

Page 34: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

34

Feature transformation

Linear

• PCA (Principal Component Analysis)

• ICA (Independent component analysis)

• LDA (Latent Dirichlet Allocation )

• Latent semantic indexing

• Genetic Programming

Non-linear

• NPCA or KPCA

• NLDA or KLDA

• MDS (Multidimensional scaling)

• Principal curves

• Neural networks

• Genetic Programming

Page 35: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

35

Outline

• Introduction

• Data characteristics

• Application & domain

• Feature extraction methods

• Feature dimensionality reduction

• Issues in real applications

• Summary

Page 36: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

36

Issues in real applications

Issues:

• Features have high inconsistent (seemingly noisy) due to

• Varying operating conditions

• Asset-to-asset variations

• Features have low sensitivity to faults or degradation

Handling methods

• Normalization / Standardization

• Feature of features (find generalizable features)

• Operating condition clustering & time series segmentation

• Use of local models for post-feature-extraction processing

Page 37: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

37

Example: aircraft engine

Domain: Aircraft engine

Signals:

• Operational variables: altitude, speed, thrust, ambient temperature

• Measurements: pressure, temperature at multiple location inside the engine

Feature extraction:

• Average of each signal during flight cruise (steady state).

• One feature vector per flight; one scalar per signal channel

Ref: 2008 PHM data challenge

Page 38: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

38

Example: aircraft engine (2)

Ref: 2008 PHM data challenge

0 50 100 150 200 25036

38

40

42

44

46

48

50

0 50 100 150 200 25036

38

40

42

44

46

48

50

Run-to-failure time series of

one feature: line plot

Run-to-failure time series of

the same feature : dot plot

Seemingly random noise when

considering the features time

series as a whole

Trend more clear under each

operating condition

Trend?

Flight cycle Flight cycle

Am

plit

ud

e

Page 39: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

39

Example: aircraft engine (3)

Ref: 2008 PHM data challenge

0 50 100 150 200 25036

38

40

42

44

46

48

50

Run-to-failure time series of

the same feature : dot plot Handling methods: • Feature normalization

• with physics model

• with data-driven model

• Use of local models /multiple

models for follow-up procedures

• Generate feature of features that

is invariant to operating

conditions Flight cycle

Am

plit

ud

e

Trend more clear under each

operating condition

Page 40: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

40

Key takeaways

• Procedure: feature extraction + dimension reduction

• What to extract: data property vs. application domain vs. algorithm requirements

• Feature extraction vs. signal processing

• Feature goodness: relevance and redundancy

• Feature selection: wrapper approach vs. filter approach

• Feature consistency and sensitivity issues

Page 41: A Tutorial on Feature Extraction Methods - PHM Society · PDF file · 2012-10-05A Tutorial on Feature Extraction Methods Tianyi Wang ... Select a subset of generated features ...

A Tutorial on

Feature Extraction Methods

Tianyi Wang GE Global Research Subrat Nanda GE Power & Water September 24, 2012