Top Banner
Definition and overview of chemometrics
46

Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Jan 17, 2016

Download

Documents

Domenic Barker
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Definition and overview of chemometrics

Page 2: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Paul Geladi

Head of Research NIRCEChairperson NIR Nord

Unit of Biomass Technology and ChemistrySwedish University of Agricultural SciencesUmeåTechnobothniaVasa

paul.geladi @ btk.slu.se paul.geladi @ syh.fi

Page 3: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.
Page 4: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.
Page 5: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Project geography

Page 6: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Chemometrics

Mathematics

Statistics

Computer Science

In Chemistry

Page 7: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Similar fields

• Biometrics ±1900

• Psychometrics ±1930

• Econometrics ±1950

• Technometrics ±1960

Page 8: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Chemometrics

• Design of Experiments (DOE)

• Exploratory Data Analysis

• Classification

• Regression and Calibration

Page 9: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Design of Experiments

• Most important where possible

• Uses:

• ANOVA

• F-test

• t-test

• Plots

• Response Surfaces

Page 10: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Design of Experiments

y = b0 + b1x1 + b2x2 +...+bKxK + b11x12 +

b22x22 +...+ bKKxK

2 + b12x1x2 +...+

Factors x1, x2,...xK changed systematically

Response y measured and modeled

Page 11: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Exploratory Data Analysis

• Design not possible• Sampling situations• Find structure• Find groupings• Find outliers

Page 12: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Classification

• Check for groupings = UNSUPERVISED• Existing groupings = SUPERVISED• Visualize groupings• Classify• Test

Page 13: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Regression / Calibration

• Two types of variables X / y

• Relationship linear / nonlinear

• Model

• Diagnostics

• Residual

Page 14: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

x

y

Page 15: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Multivariate Data Analysis

Page 16: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Multivariate Data Analysis

• Sampled data and design with too many reponses:• Mining• Hospitals• Agriculture• Food industry• More

Page 17: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Nomenclature

• Samples are objects

• What is measured on the object is a variable

Page 18: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

34.92 Spectrum

Samples

Vectors

1 K1

I

Page 19: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

123.6

11.15.9340.51.417

A vector is a collectionof numbers.

It is always a columnvector.

Page 20: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

The transpose of a vector is a row vector.

Symbols for transpose are’ and T. a’ or aT.

12 3.6 11.1 5.9 34 0.5 1.4 17

Page 21: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

0 5 10 15 20 250

2

4

6

8

10

12

14

16

18

Particle size, 1 sample

Page 22: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

0 5 10 15 20 25 30 35 400

2

4

6

8

10

12

Small particles, 35 samples

Page 23: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

The Data Matrix

A data matrix is a vector of vectors

I

K

Page 24: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

0 5 10 15 20 250

5

10

15

20

25

30

35

40

Size histograms, all samples

Particle area

Page 25: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

0 200 400 600 800 1000 12000

0.5

1

1.5

2

2.5

3

3.5

4

NIR wavelengths

Times in batch reaction

Page 26: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Geometry of multivariate space

Page 27: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Problem

I and K can be large

Correlation

Univariate statistics does not apply

Page 28: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

I patients

3 variables: blood oxygen,iron, hemoglobin

Page 29: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 30: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 31: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 32: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 33: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 34: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 35: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 36: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 37: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 38: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Properties of multivariate spaceRotation

vectors unchanged / distance unchanged

Translation

vectors changed / distance unchanged

Rescaling / change units

all changes

Page 39: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Consequences

• We can move the coordinate sytem around

• The relative distances between objects do not change

• We can rotate the coordinate system

• Scale changes are important

• Move coordinate system to center of data

• Scale properly

Page 40: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Vectors (physics)

x = [ x1, x2, x3 ]

|| x || = ( x12 + x2

2 + x32 ) 1/2

Page 41: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Geometry

a

b

cc2 = a2 + b2

Page 42: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Vectors (K dimensions)

x = [ x1, x2,..., xK ]

|| x || = ( x12 + x2

2 +...+ xK2 ) 1/2

Page 43: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Problem

We can not see in more than 3 dimensions

Paper, computer screen: 2-2.5 dimensions

Page 44: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 45: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

O2

Fe

Hb

Page 46: Definition and overview of chemometrics. Paul Geladi Head of Research NIRCE Chairperson NIR Nord Unit of Biomass Technology and Chemistry Swedish University.

Projection

2D plane (screen, paper)

Many projections possible

Find a good one

Find a few good ones

What is good?