Planejamento e Otimização de Experimentos · radiofrequency (RF) generator causing plasma to be generated in the gap between the electrodes –RF power setting etch rate •C 2

Post on 18-Mar-2020

8 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

Transcript

Planejamento e Otimização

de Experimentos

The Analysis of Variance

Prof. Dr. Anselmo E de Oliveira

anselmo.quimica.ufg.br

anselmo.disciplinas@gmail.com

An Example

• Integrated circuits – Wafers – Plasma etching process is

employed to remove unwanted material

– Energy is supplied by a radiofrequency (RF) generator causing plasma to be generated in the gap between the electrodes

– RF power setting etch rate • C2F6 and 0.80 cm (gap) • RF power: 160, 180, 200, 220 W

(four levels) • Five wafers at each RF level

• Single-factor experiment

– a = 4 levels of the factor

– n = 5 replicates

– 20 runs

• Random order – How to generate the run order?

» Ex: random numbers using a random function in a spreadsheet; sort by that number

– Prevent the effects of unknown nuisance variables

» Nonrandomized order ( 5x160; 5x180 ...)

• Ex: the etching tool exhibits a warm-up effect: the longer it is on, the lower the observed etch rating readings will be

• It will destroy the validity of the experiment

Power (W)

1 2 3 4 5 Totals Averages

160 575 542 530 539 570 2756 551.2

180 565 593 590 579 610 2937 587.4

200 600 651 610 637 629 3127 625.4

220 725 700 715 685 710 3535 707.0

> load etch.oc

> whos

> etch

> boxplot(etch)

> tics ("x", 1:4, {"160"; "180"; "200";"220"});

> xlabel ("Power (W)"); ylabel("Etch rate (A/min)");

• Etch rate increases as the power setting increases. • There is no strong evidence to suggest that the variability in each rate

around the average depends on the power setting. • Suppose that we wish to test for differences between the mean etch rates:

– t-test for all six possible pairs of means. • It takes a lot of effort • It inflates the type I error (rejecting the null hypothesis when it is in fact true)

– Analysis of variance

The Analysis of Variance

• 𝑎 treatments or different levels of a single factor

𝑦.. 𝑦 ..

Treatment (Level)

Observations Totals Averages

1 𝑦11 𝑦12 ... 𝑦1𝑛 𝑦1. 𝑦 1.

2 𝑦21 𝑦22 ... 𝑦2𝑛 𝑦2. 𝑦 2.

⋮ ⋮ ⋮ ... ⋮ ⋮ ⋮

𝑎 𝑦𝑎1 𝑦𝑎2 ... 𝑦𝑎𝑛 𝑦𝑎. 𝑦 𝑎.

• (Linear Statistical) Models for the Data – Means model

𝑦𝑖𝑗 = 𝜇𝑖 + 𝜖𝑖𝑗 𝑖 = 1,2, … , 𝑎𝑗 = 1,2,… , 𝑛

𝑦𝑖𝑗 is the ijth observation

𝜇𝑖 is the mean of the ith factor level or treatment

𝜖𝑖𝑗 is the random error It is convenient to think of the errors as having mean zero, so that 𝐸 𝑦𝑖𝑗 = 𝜇𝑖

– Effects model 𝜇𝑖 = 𝜇 + 𝜏𝑖, 𝑖 = 1,2, … , 𝑎

𝑦𝑖𝑗 = 𝜇 + 𝜏𝑖 + 𝜖𝑖𝑗 𝑖 = 1,2,… , 𝑎𝑗 = 1,2, … , 𝑛

𝜇 is the overall mean

𝜏𝑖 is the ith treatment effect

one-way or single-factor analysis of variance (ANOVA)

• The experimental design is a completely randomized design

– The experiments must be performed in random order

– The environment in which the treatments are applied is as uniform as possible (experimental units)

– Hypotesis test

• 𝑦𝑖𝑗~𝑁 𝜇 + 𝜏𝑖 , 𝜎2

• Observations mutually independent

The Analysis of Fixed Effects Model

– 𝑎 treatments

– Test hypotheses about the treatment means, and the conclusions will apply only to the factor levels considered in the analysis

𝑦𝑖. = 𝑦𝑖𝑗

𝑛

𝑗=1

𝑦 𝑖. =𝑦𝑖.

𝑛 𝑖 = 1,2, … , 𝑎

𝑦.. = 𝑦𝑖𝑗

𝑛

𝑗=1

𝑎

𝑖=1

𝑦 .. =𝑦..𝑁

𝑁 = 𝑎𝑛

the total number of observations

– Testing the equality of the a treatment means

𝐸 𝑦𝑖𝑗 = 𝜇 + 𝜏𝑖 = 𝜇𝑖, 𝑖 = 1,2, … , 𝑎

𝐻0: 𝜇1 = 𝜇2 = ⋯ = 𝜇𝑎

𝐻1: 𝜇𝑖 ≠ 𝜇𝑗 for at least on pair 𝑖, 𝑗

𝜇 = 𝜇𝑖

𝑎𝑖=1

𝑎⟹ 𝜏𝑖 = 0

𝑎

𝑖=1

𝐻0: 𝜏1 = 𝜏2 = ⋯ = 𝜏𝑎 = 0

𝐻1: 𝜏𝑖 ≠ 0 for at least one 𝑖

– Thus, we speak of testing the equality of treatment means or testing that the treatment effects (the 𝜏𝑖) are zero

the treatment or factor effects can be thought of as deviations from the overall mean

• Decomposition of the Total Sum of Squares

– The name Analysis of variance is derived from a partitioning of total variability into its component parts

𝑆𝑆𝑇 = 𝑦𝑖𝑗 − 𝑦 ..2

𝑛

𝑗=1

𝑎

𝑖=1

= 𝑛 𝑦 𝑖. − 𝑦 ..2

𝑎

𝑖=1

+ 𝑦𝑖𝑗 − 𝑦 𝑖.2

𝑛

𝑗=1

𝑎

𝑖=1

𝑆𝑆𝑇 = 𝑆𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠 + 𝑆𝑆𝐸

sum of squares of the differences between the treatment averages and the grand mean

sum of squares of the differences of observations within treatments from the treatment average

– Degrees of freedom

𝑆𝑆𝑇: 𝑁 − 1

𝑆𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠: 𝑎 − 1

𝑆𝑆𝐸: 𝑎 𝑛 − 1 = 𝑎𝑛 − 𝑎 = 𝑁 − 𝑎

– Mean squares

𝑀𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠 =𝑆𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠

𝑎 − 1

𝑀𝑆𝐸 =𝑆𝑆𝐸

𝑁 − 𝑎

– Expected values 𝐸 𝑀𝑆𝐸 = 𝜎2

𝐸 𝑀𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠 = 𝜎2 +𝑛 𝜏𝑖

2𝑎𝑖=1

𝑎 − 1

if there are no differences in treatment means (𝜏𝑖 = 0), 𝑀𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠 also estimates 𝜎2

• Statistical Analysis

– If the null hypothesis of no difference in treatment means is true, the ratio

𝐹0 =

𝑆𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠𝑎 − 1

𝑆𝑆𝐸𝑁 − 𝑎

=𝑀𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠

𝑀𝑆𝐸

is distributed as F with 𝑎 − 1 and 𝑁 − 𝑎 degrees of freedom.

– Reject 𝐻0 if 𝐹0 > 𝐹𝛼,𝑎−1,𝑁−𝑎

• ANOVA Table

Source of Variation Sum of Squares Degrees of

Freedom Mean Square

𝑭𝟎

Between treatments 𝑆𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠 =

1

𝑛 𝑦𝑖.

2 −𝑦..

2

𝑁

𝑎

𝑖=1

𝑎 − 1 𝑀𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠 𝐹0

=𝑀𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠

𝑀𝑆𝐸

Error (within treatments) 𝑆𝑆𝐸 = 𝑆𝑆𝑇 − 𝑆𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠 𝑁 − 𝑎 𝑀𝑆𝐸

Total 𝑆𝑆𝑇 = 𝑦𝑖𝑗 − 𝑦 ..

2𝑛

𝑗=1

𝑎

𝑖=1

𝑁 − 1

Power (W)

1 2 3 4 5 Totals 𝒚𝒊.

Averages 𝒚 𝒊.

160 575 542 530 539 570 2756 551.2

180 565 593 590 579 610 2937 587.4

200 600 651 610 637 629 3127 625.4

220 725 700 715 685 710 3535 707.0

𝒚.. =12,355 𝒚 .. = 617.75

𝑺𝑺𝑻 = 𝑦𝑖𝑗 − 𝑦 ..2

𝑛

𝑗=1

𝑎

𝑖=1

= 𝑦𝑖𝑗2 −

𝑦..2

𝑁

5

𝑗=1

4

𝑖=1

= 5752 + 5422 + ⋯+ 7102 −12,3552

20= 𝟕𝟐, 𝟐𝟎𝟗. 𝟕𝟓

𝑺𝑺𝑻𝒓𝒆𝒂𝒕𝒎𝒆𝒏𝒕𝒔 =1

𝑛 𝑦𝑖.

2 −𝑦..

2

𝑁

4

𝑖=1

=1

527562 + ⋯+ 35352 −

12,3552

20= 𝟔𝟔, 𝟖𝟕𝟎. 𝟓𝟓

𝑺𝑺𝑬 = 𝑆𝑆𝑇 − 𝑆𝑆𝑇𝑟𝑒𝑎𝑡𝑚𝑒𝑛𝑡𝑠 = 𝟓𝟑𝟑𝟗. 𝟐𝟎

• ANOVA Table

– RF Power (between-treatment mean square) ≫ Error (within-treatment) • it is unlikely that the treatment means are equal

– 𝐹0.05,3,16 = 3.24 (F table) • Because 𝐹0 = 66.80 > 3.24, we reject 𝐻0 and

conclude that the treatment means differ

Source of Variation Sum of Squares

Degrees of Freedom

Mean Square

𝑭𝟎 p-Value

RF Power 66,870.55 3 22,290.18 66.80 <0.01

Error 5339.20 16 333,70

Total 72,209.75 19

The RF power setting significantly affects the mean etch rate

> anova (etch)

• Estimation of the Model Parameters 𝑦𝑖𝑗 = 𝜇 + 𝜏𝑖 + 𝜖𝑖𝑗

𝜇 = 𝑦 .. 𝜏 = 𝑦 𝑖. − 𝑦 .. 𝑦 𝑖𝑗 = 𝑦 𝑖.

A 100(1 − 𝑎) percent confindence interval on the ith treatment mean 𝜇𝑖:

𝑦 𝑖. − 𝑡𝛼 2 ,𝑁−𝑎

𝑀𝑆𝐸

𝑛≤ 𝜇𝑖 ≤ 𝑦 𝑖. + 𝑡𝛼 2 ,𝑁−𝑎

𝑀𝑆𝐸

𝑛

A 100(1 − 𝑎) percent confindence interval on the difference in any two treatment means:

𝑦 𝑖. − 𝑦 𝑗. − 𝑡𝛼 2 ,𝑁−𝑎

2𝑀𝑆𝐸

𝑛≤ 𝜇𝑖 − 𝜇𝑗 ≤ 𝑦 𝑖. − 𝑦 𝑗. + 𝑡𝛼 2 ,𝑁−𝑎

2𝑀𝑆𝐸

𝑛

Model Adequacy Checking

• The Normality Assumption

– Normal probability plot of the residual

cd octavedocs

load etch.oc

etch

m = mean(etch)

one = ones(5,4)

M = one .* m

e = M – etch

normplot(e(:))

𝑦 𝑖.

𝑦 𝑖𝑗 = 𝑦 𝑖.

𝑒𝑖𝑗 = 𝑦 𝑖. − 𝑦 𝑖𝑗

• Standardized residuals

– Outliers

𝑑𝑖𝑗 =𝑒𝑖𝑗

𝑀𝑆𝐸

If the erros are 𝑁 0, 𝜎2 , the standardized residuals should be approximately normal with mean zero and unit variance

• ~68% of 𝑑𝑖𝑗 should fall within the limits 1

• ~95% ... 2

• ~100% ... 3

Using the largest standardized residual:

𝑑 = 1.404 should cause no concern

sort(e(:))

d = 25.6/sqrt(333.7)

• Plot of Residuals in Time Sequence

Plotting the residuals in time order of data collection is hepful in detecting strong correlation between the residuals

– A tendency to have runs of positive and negative residuals indicates positive correlation:

The independence assumption of the errors would have been violated

plot(etch_run(:),e(:),"*")

xlabel ("Run order or time"); ylabel("Residuals");

• Plot of Residuals Versus Fitted Values

If the model is correct and the assumptions are satisfied, the residuals should be structureless; in particular, they should be unrelated to any other variable including the predicted response

– Plasma etching experiment 𝑦 𝑖𝑗 = 𝑦 𝑖.

𝑒𝑖𝑗 = 𝑦 𝑖. − 𝑦 𝑖𝑗

plot(M(:),e(:),"*")

xlabel ("Predicted"); ylabel("Residuals");

There is not inequality of variance: the variance of the observations does not increase as the magnitude of the observation increases

• Statistical Test for Equality of Variance (in each treatment) 𝐻0: 𝜎1

2 = 𝜎22 = ⋯ = 𝜎𝑎

2 = 0

𝐻1: 𝜎𝑖2 ≠ 0 for at least one 𝑖

– Residual plots – Bartlett’s test

• 𝜒2 distribution with 𝑎 − 1 degrees of freedom • 𝑎 random samples from independent normal populations

𝜒02 = 2.3026

𝑞

𝑐

𝑞 = 𝑁 − 𝑎 𝑙𝑜𝑔𝑆𝑝2 − 𝑛𝑖 − 1

𝑎

𝑖=1

𝑙𝑜𝑔𝑆𝑖2

𝑐 = 1 +1

3 𝑎 − 1 𝑛𝑖 − 1 −1 − 𝑁 − 𝑎 −1

𝑎

𝑖=1

𝑆𝑝2 =

𝑛𝑖 − 1 𝑆𝑖2𝑎

𝑖=1

𝑁 − 𝑎

and 𝑆𝑖2 is the sample variance of the ith population.

• The quantity 𝑞 is large when the sample variances 𝑆𝑖2 differ greatly and is equal to zero

when all 𝑆𝑖2 are equal.

• Reject 𝐻0 𝜒0

2 > 𝜒𝛼,𝛼−12

var_p=sum(4/(20-4)*var(etch))

q = (20-4)*log10(var_p)-4*sum(log10(var(etch)))

c = 1+1/(3*(4-1))*(4*1/4-1/(20-4))

chi_0=2.3026*q/c

1 - chi2cdf(chi_0,3)

ans = 0.93324

• The tests return a p-value that describes the outcome of the test. • Assuming that the test hypothesis is true, the p-value is the

probability of obtaining a worse result than the observed one. • Large p-values corresponds to a successful test. • Usually a test hypothesis is accepted if the p-value exceeds 0.05

there is no evidence to counter the claim that all four variances are the same (plot residuals x fitted values)

we cannot reject the null hypothesis

Practical Interpretation of Results

After

conducting the experiment,

performing the statistical analysis, and

investigating the underlying assumptions,

the experimenter is ready to draw practical conclusions about the problem he or she is studying

• A Regression Model

– The factors involved in an experiment can be either quantitative or qualitative

– Empirical model of the process

• An interpolation equation for the response variable in the experiment

• Regression analysis – Method of least squares

linear model

quadratic model

• Comparing Pairs of Treatment Means

– Tukey’s Test

• Following an ANOVA in which the null hypothesis of equal treatment mean was reject

• Test all pairwise mean comparisons (all 𝑖 ≠ 𝑗) 𝐻0: 𝜇𝑖 = 𝜇𝑗

𝐻1: 𝜇𝑖 ≠ 𝜇𝑗

• Equal or unequal (Tukey-Kramer procedure) sample sizes

• Distribution of the studentized range statistics, 𝑞 table

𝑇𝛼 = 𝑞𝛼 𝑎, 𝑓𝑀𝑆𝐸

𝑛

• Plasma etching experiment 𝑎 = 4 𝛼 = 0.05

𝑓 = 16 degrees of freedom 𝑁 − 𝑎 𝑀𝑆𝐸 = 333.7 𝑛 = 5

𝑞0.05 4,16 → 𝑞 table 𝑞0.05 4,16 = 4.046

𝑇0.05 = 4.046333.7

5= 𝟑𝟑. 𝟏

𝑦 1. = 551.2 𝑦 2. = 587.4

𝑦 3. = 625.4 𝑦 4. = 707.0 𝑦 1. − 𝑦 2. = −36.2 𝑦 1. − 𝑦 3. = −74.2 …

the Tukey procedure indicates that all pairs of means differ

any pairs of treatment averages that differ in absolute value by more than 33.1 would imply that the corresponding pair of population means are significantly different

Nonparametric Method in the Analysis of Variance

• The Kruskal-Wallis Test

– In situations were the normality assumption is unjustified

– Reject 𝐻0 𝐻 > 𝜒𝛼,𝛼−1

2

[pval, chisq, df] =

kruskal_wallis_test(etch(:,1),etch(:,2),etch(:,3),etch(:,4))

pval = 7.3856e-004

chisq = 16.907

df = 3

we reject the null hypothesis

This is the same conclusion as given by the usual analysis of variance F test

top related