Top Banner
An Introduction to Hierarchial Linear Modeling Heather Woltman, Andrea Feldstain, J. Christine MacKay, Meredith Rocchi Tutorial in Quantitative Methods for Psychology 2012, Vol 8(1) p. 52-69 Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201
34

Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Dec 28, 2015

Download

Documents

Anne Wilkinson
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

An Introduction to Hierarchial Linear Modeling

Heather Woltman, Andrea Feldstain, J. Christine MacKay, Meredith RocchiTutorial in Quantitative Methods for Psychology 2012, Vol 8(1) p. 52-69

Advanced Business Research MethodIntructor : Prof. Feng-Hui Huang

Agung D. BuchdadiDA21G201

Page 2: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

1. Introduction2. Methods for Dealing with Nested Data3. Equations underlying Hierarchical Linear

Models4. Estimation of Effects5. Hypothesis Testing6. Conclusion

Contents

Page 3: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Hierarchical levels of grouped data are a commonly occurring phenomenon. For example, in the education sector, data are often organized at student, classroom, school, and school district levels.

Hierarchical Linear Modeling (HLM) is a complex form of ordinary least squares (OLS) regression that is used to analyze variance in the outcome variables when the predictor variables are at varying hierarchical levels

Introduction

Page 4: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

The development of this statistical method occurred simultaneously across many fields, it has come to be known by several names, including multilevel-, mixed level-, mixed linear-, mixed effects-, random effects-, random coefficient (regression)-, and (complex) covariance components-modeling .

HLM simultaneously investigates relationships within and between hierarchical levels of grouped data, thereby making it more efficient at accounting for variance among variables at different levels than other existing analyses.

Introduction

Page 5: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Hierrarchical Level Example Variables

Level 3 School Level •School’s geographic•Location•Annual budget

Level 2 Classroom Level •Class size•Homework assignment•Load•Teaching Experience•Teaching Style

Level 1 Student Level •Gender•Intelligence Quotient•Socioeconomic status•Self-esteem rating•Behavioural conduct rating•Breakfast consumption•GPA (dependent)

Introduction

Research Question: Do student breakfast consumption and teaching style influence student GPA?

Page 6: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

DISAGGREGATION Disaggregation of data deals with

hierarchical data issues by ignoring the presence of group differences. It considers all relationships between variables to be context free and situated at level-1 of the hierarchy.

Disaggregation thereby ignores the presence of possible between-group variation

Method for Dealing with Nested Data

Page 7: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

DISAGGREGATION

Method for Dealing with Nested Data

Student ID (Lvl 1)

Classroom ID (Lvl 2)

School ID (Lvl 3)

GPA Score (Lvl 1)

Breakfast Consumption Score (Lvl 1)

1 1 1 5 1

2 1 1 7 3

3 2 1 4 2

4 2 1 6 4

5 3 1 3 3

6 3 1 5 5

7 4 1 2 4

8 4 1 4 6

9 5 1 1 5

10 5 1 3 7

Page 8: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

DISAGGREGATION

Method for Dealing with Nested Data

0 1 2 3 4 5 6 7 80

1

2

3

4

5

6

7

8

Breakfast COnsumption

GP

A

Page 9: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

DISAGREGATION By bringing upper level variables down to

level-1, shared variance is no longer accounted for and the assumption of independence of errors is violated. If teaching style influences student breakfast consumption, for example, the effects of the level-1 (student) and level-2 (classroom) variables on the outcome of interest (GPA) cannot be disentangled.

Method for Dealing with Nested Data

Page 10: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

AGGREGATION Instead of ignoring higher level group

differences, aggregation ignores lower level individual differences. Level-1 variables are raised to higher hierarchical levels (e.g., level-2 or level-3) and information about individual variability is lost.

In aggregated statistical models, within-group variation is ignored and individuals are treated as homogenous entities

Method for Dealing with Nested Data

Page 11: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

AGGREGATION

Method for Dealing with Nested Data

Teacher ID (Lvl 2) Classroom GPA (Lvl 2)

Classroom Breakfast Consumption (Lvl 2)

1 6 2

2 5 3

3 4 4

4 3 5

5 2 6

Page 12: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

AGGREGATION

Method for Dealing with Nested Data

1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.50

1

2

3

4

5

6

7

Classroom Breakfast Consumption

Cla

ssro

om

GP

A

Page 13: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

AGGREGATION Using aggregation, the predictor variable (breakfast

consumption) is again negatively related to the outcome variable (GPA). In this method of analysis, all (X, Y) units are situated on the regression line, indicating that unit increases in a classroom’s mean breakfast consumption perfectly predict a lowering of that classroom’s mean GPA.

Although a negative relationship between breakfast consumption and GPA is found using both disaggregation and aggregation techniques, breakfast consumption is found to impact GPA more unfavourably using aggregation.

Method for Dealing with Nested Data

Page 14: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

HLM

Method for Dealing with Nested Data

Page 15: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

HLM using HLM each level-1 (X,Y) unit (i.e., each student’s GPA and

breakfast consumption) is identified by its level-2 cluster (i.e., that student’s classroom). Each level-2 cluster’s slope (i.e., each classroom’s slope) is also identified and analyzed separately.

Using HLM, both the within- and between-group regressions are taken into account to depict the relationship between breakfast consumption and GPA.

The resulting analysis indicates that breakfast consumption is positively related to GPA at level-1 (i.e., at the student level) but that the intercepts for these slope effects are influenced by level-2 factors [i.e., students’ breakfast consumption and GPA (X, Y) units are also affected by classroom level factors].

Although disaggregation and aggregation methods indicated a negative relationship between breakfast consumption and GPA, HLM indicates that unit increases in breakfast consumption actually positively impact GPA.

Method for Dealing with Nested Data

Page 16: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

HLM HLM can be ideally suited for the analysis of nested data because it identifies the

relationship between predictor and outcome variables, by taking both level-1 and level-2 regression relationships into account.

In addition to HLM’s ability to assess cross-level data relationships and accurately disentangle the effects of between- and within-group variance, it is also a preferred method for nested data because it requires fewer assumptions to be met than other statistical methods .

HLM can accommodate non independence of observations, a lack of sphericity, missing data, small and/or discrepant group sample sizes, and heterogeneity of variance across repeated measures.

HLM Disadvantage A disadvantage of HLM is that it requires large sample sizes for adequate power.

This is especially true when detecting effects at level-1. As well, HLM can only handle missing data at level-1 and removes groups with

missing data if they are at level-2 or above. For both of these reasons, it is advantageous to increase the number of groups as

opposed to the number of observations per group. A study with thirty groups with thirty observations each (n = 900) an have the same power as one hundred and fifty groups with five observations each (n = 750; Hoffman, 1997).

Method for Dealing with Nested Data

Page 17: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

This study will explain two level of hierarchical variable on the previous example. In this two-level hierarchical models, separate level 1 (students) are developed for each level 2 unit (classrooms)

These models are also called within-unit models. First, It take the simple regression form:

Equations underlying Hierarchical Linear Models

Page 18: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

These models are also called within-unit models. First, It take the simple regression form:

Equations underlying Hierarchical Linear Models

Page 19: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

In the level-2 models, the level-1 regression coefficients are used as outcome variables and are related to each of the level-2 predictors

Equations underlying Hierarchical Linear Models

Page 20: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

The model developed would depend on the pattern of variance in the level-1 intercepts and slopes.If there was no variation in slopes then the equation 4 would be deleted.

The assumption in level-2 models (equations(5)):

Equations underlying Hierarchical Linear Models

Page 21: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Then, combined model (eq.6) : (substituting equation 3 and equation 4 to equation 1)

Eq. 6 is often termed a mixed model, combination of fixed effect and random effect.

Equations underlying Hierarchical Linear Models

Page 22: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Two-level hierarchical models involve the estimation of three types of parameters.

The first type of parameter is fixed effects = (γ00, γ01,γ11,γ10) in eq. 3 and eq. 4, and these do not vary across groups.

While level-2 fixed effect could be estimated using OLS approach, it is not good strategy since the homoscedasticity assumption could not be met. The techniques used, then, is a Generalized Least Square (GLS) estimate which allocate more weight on the level-2 regression equation. (further reading : Raudenbush & Bryk (2002))

Estimation of Effects

Page 23: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

The second type of parameter is the random level-1 coefficients (β0j amdβ1j) which are permitted to vary across groups

Hierarchical models provide two estimates:1. Computing OLS regression for level-1 2. Predicted the value of the parameters in the

level-2 models (eq.3 & eq.4)Software HLM could provide the best estimation

which provides smaller mean square error term.

Further reading ( Carlin and Louis 1996)

Estimation of Effects

Page 24: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

The final type of parameter estimation concerns the variance-covariance components which include:

1. The covariance between level-2 error term2. The variance in the level-1 error term3. The variance in the level-2 error term If data balance, closed-form formulas can be used to

estimate variance and covariance components If data not balance (most probable in reality): full

maximum likelihood, restricted maximum likelihood, and Bayes estimation.

Further reading (Raudenbush and Bryk (2002))

Estimation of Effects

Page 25: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Table 4 Hypothesis and necessary condition: Does student breakfast consumption and teaching style influence student GPA

Hypotheses

1 Breakfast consumption is related to GPA

2 Teaching style is related to GPA, after controlling for breakfast consumption

3 Teaching style moderates the breakfast consumption-GPA relationship

Conditions

1 There is systematic within- and between-group variance in GPA

2 There is significant variance at the level-1 intercept

3 There is significant variance in the level-1 slope

4 The variance in the level-1 intercept is predicted by teaching style

5 The variance in the level-1 slope is predicted by teaching style

Hypothesis Testing

Page 26: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Equation upon the case:

Condition 1:There is a systematic within- and between-group variance in GPA

The first condition provides useful preliminary information and assures that there is appropriate variance to investigate the hypotheses

Hypothesis Testing

Page 27: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

The relevant sub models for condition 1:Level-1: GPAij = β0j + rij ............(10)

Level-2: β0j = γ00 + U0j ..............(11)

Where:β0j = mean GPA for classroom j

γ00 = grand mean GPA

Variance (rij) = σ2 =within group variance in GPA

Variance (U0j) = τ00 = between groups variance in GPA

HLM test for significance of the between-groups variance (τ00) but not within groups

Variance (GPAij)= τ00 +σ2

Then ICC = τ00 / (τ00 +σ2)

Once this condition is satisfied, HLM can examine the next two condition to determine whether there are significant differences in intercepts and slopes across clasrooms

Hypotheses Testing

Page 28: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Condition 2 and 3: There is significant variance in the level-1 intercept and slope

The relevant sub model for this condition:Level-1: GPAij = β0j + βij (Breakfast) +rij ....(13)

Level-2: β0j = γ00 + U0j ..............(14)

Level-2: β0j = γ10 + U1j ..............(15)

Where:γ00 = mean of the intercepts across classrooms

γ10 = mean of the Slope across classrooms (H1)

Variance (rij) = σ2 =Level-1 residual variance

Variance (U0j) = τ00 = variance in intercepts

Variance (U1j) = τ11 = variance in slopes

Hypothesis Testing

Page 29: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

HLM runs t-test to asses whether γ00 and γ10 differ significantly from zero.

The χ2 test is used to asses whether the variance in intercept and slopes differs significantly from zero.

Using both test (condition 1 and condition 2) HLM calculate the percent of variance in GPA that is accounted fro by breakfast consumption wtih the formula :

Hypothesis Testing

Page 30: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Condition 4: The variance in the level-1 intercept is predicted by teaching style

It could be tested only the condition 2 and condition 3 are fulfilled

Equations:Level-1: GPAij = β0j + βij (Breakfast) +rij ....(16)

Level-2: β0j = γ00 + γ01(Teaching Style)+U0j(17)

Level-2: β0j = γ10 + U1j ..............(18)

Hypothesis Testing

Page 31: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Where:γ00 = level-2 intercept

γ01 = level-2 slope (H2)

γ10 = mean (pooled) slopes

Variance (rij) = σ2 =Level-1 residual variance

Variance (U0j) = τ00 = residual variance in intercepts

Variance (U1j) = τ11 = variance in slopes

The t-test for intercept and slope conducted is similar to test in condition 2 and 3.

The residual variance (τ00 ) is assessed using another χ2 test.

The variance of GPA is accounted for by teaching style is compared to the total intercept variance :

Hypothesis Testing

Page 32: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

Condition 5: The variance in the Level-1 slope is predicted by teaching style

Equations:Level-1: GPAij = β0j + βij (Breakfast) +rij ....(16)

Level-2: β0j = γ00 + γ01(Teaching Style)+U0j(17)

Level-2: β0j = γ10 + γ11(Teaching Style)+U1j(18)

Where:γ00 = level-2 intercept

γ01 = level-2 slope (H2)

γ10 = level-2 intercept

γ11 = level-2 slope (H3)

Variance (rij) = σ2 =Level-1 residual variance

Variance (U0j) = τ00 = residual variance in intercepts

Variance (U1j) = τ11 = residual variance in slopes

Hypothesis Testing

Page 33: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

With Teaching style as a predictor of the level-1 slope, U1j becomes a measure of the residual variance in the averaged level-1 slopes across groups. If a χ2 test on U1j is significant, it indicates that there is systematic variance in the level-1 slopes, therefore other level-2 predictor can be added to the model.

The percent of variance attributable to teaching style can be computed as a moderator in the breakfast-GPA relationship by the formula:

Hypothesis Testing

Page 34: Advanced Business Research Method Intructor : Prof. Feng-Hui Huang Agung D. Buchdadi DA21G201.

1. HLM has risen in popularity as the method of choice for analyzing nested data

2. HLM is a multistep, time-consuming process. Prior to conducting an HLM analysis, background interaction effects between predictor variables should be accounted for, and sufficient amounts of within- and between-groups variance. HLM presumes that data is normally distributed: any violations the output could be biased.

Conclusion