Top Banner

of 73

MultinomialLogisticRegression_BasicRelationships

Feb 19, 2018

Download

Documents

MarZiYa
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    1/73

    SW388R7

    Data Analysis &

    Computers II

    Slide 1

    Multinomial Logistic Regressionasic Relations!ips

    Multinomial Logistic Regression

    Descri"ing Relations!ips

    Classi#ication Accuracy

    Sample $ro"lems

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    2/73

    ters II

    Slide 2Multinomial logistic regression

    Multinomial logistic regression is used to analy%e relations!ips"eteen a non'metric dependent (aria"le and metric or

    dic!otomous independent (aria"les)

    Multinomial logistic regression compares multiple groups

    t!roug! a com"ination o# "inary logistic regressions)

    *!e group comparisons are e+ui(alent to t!e comparisons #or a

    dummy'coded dependent (aria"le, it! t!e group it! t!e

    !ig!est numeric score used as t!e re#erence group)

    -or e.ample, i# e anted to study di##erences in SW, MSW,

    and $!D students using multinomial logistic regression, t!e

    analysis ould compare SW students to $!D students and MSW

    students to $!D students) -or eac! independent (aria"le, t!ere

    ould "e to comparisons)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    3/73

    ters II

    Slide 3W!at multinomial logistic regression predicts

    Multinomial logistic regression pro(ides a set o# coe##icients #or

    eac! o# t!e to comparisons) *!e coe##icients #or t!e

    re#erence group are all %eros, similar to t!e coe##icients #or t!e

    re#erence group #or a dummy'coded (aria"le)

    *!us, t!ere are t!ree e+uations, one #or eac! o# t!e groups

    de#ined "y t!e dependent (aria"le)

    *!e t!ree e+uations can "e used to compute t!e pro"a"ility

    t!at a su"/ect is a mem"er o# eac! o# t!e t!ree groups) A caseis predicted to "elong to t!e group associated it! t!e !ig!est

    pro"a"ility)

    $redicted group mem"ers!ip can "e compared to actual group

    mem"ers!ip to o"tain a measure o# classi#ication accuracy)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    4/73

    ters II

    Slide 4Le(el o# measurement re+uirements

    Multinomial logistic regression analysis re+uires t!at t!e

    dependent (aria"le "e non'metric) Dic!otomous, nominal, and

    ordinal (aria"les satis#y t!e le(el o# measurement re+uirement)

    Multinomial logistic regression analysis re+uires t!at t!eindependent (aria"les "e metric or dic!otomous) Since S$SS ill

    automatically dummy'code nominal le(el (aria"les, t!ey can "e

    included since t!ey ill "e dic!otomi%ed in t!e analysis)

    In S$SS, non'metric independent (aria"les are included as0#actors) S$SS ill dummy'code non'metric I2s)

    In S$SS, metric independent (aria"les are included as

    0co(ariates) I# an independent (aria"le is ordinal, e ill

    attac! t!e usual caution)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    5/73

    ters II

    Slide 5Assumptions and outliers

    Multinomial logistic regression does not mae any assumptions

    o# normality, linearity, and !omogeneity o# (ariance #or t!e

    independent (aria"les)

    ecause it does not impose t!ese re+uirements, it is pre#erredto discriminant analysis !en t!e data does not satis#y t!ese

    assumptions)

    S$SS does not compute any diagnostic statistics #or outliers) *o

    e(aluate outliers, t!e ad(ice is to run multiple "inary logisticregressions and use t!ose results to test t!e e.clusion o#

    outliers or in#luential cases)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    6/73

    ters II

    Slide 6Sample si%e re+uirements

    *!e minimum num"er o# cases per independent (aria"le is 14,

    using a guideline pro(ided "y 5osmer and Lemes!o, aut!ors o#

    Applied Logistic Regression, one o# t!e main resources #or

    Logistic Regression)

    -or pre#erred case'to'(aria"le ratios, e ill use 64 to 1)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    7/73

    ters II

    Slide 7Met!ods #or including (aria"les

    *!e only met!od #or selecting independent (aria"les in S$SS is

    simultaneous or direct entry)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    8/73

    ters II

    Slide 8(erall test o# relations!ip ' 1

    *!e o(erall test o# relations!ip among t!e independent

    (aria"les and groups de#ined "y t!e dependent is "ased on t!e

    reduction in t!e lieli!ood (alues #or a model !ic! does not

    contain any independent (aria"les and t!e model t!at contains

    t!e independent (aria"les)

    *!is di##erence in lieli!ood #ollos a c!i's+uare distri"ution,

    and is re#erred to as t!e model c!i's+uare)

    *!e signi#icance test #or t!e #inal model c!i's+uare a#ter t!eindependent (aria"les !a(e "een added9 is our statistical

    e(idence o# t!e presence o# a relations!ip "eteen t!e

    dependent (aria"le and t!e com"ination o# t!e independent

    (aria"les)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    9/73

    ters II

    Slide 9(erall test o# relations!ip ' 6

    Model Fitting Information

    284.429

    265.972 18.457 6 .005

    Model

    Intercept Only

    Final

    -2 Log

    Likeliood !i-"#$are d% "ig.

    The presence of a relationship between the dependentvariable and combination of independent variables isbased on the statistical significance of the final modelchi-square in the SPSS table titled "Model FittingInformation".

    In this analsis! the probabilit of the model chi-square#$.%&'( was ).))&! less than or equal to the level ofsignificance of ).)&. The null hpothesis that there wasno difference between the model without independentvariables and the model with independent variables wasre*ected. The e+istence of a relationship between theindependent variables and the dependent variable wassupported.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    10/73

    Slide

    !

    Strengt! o# multinomial logistic regressionrelations!ip

    W!ile multinomial logistic regression does compute correlationmeasures to estimate t!e strengt! o# t!e relations!ip pseudo Rs+uare measures, suc! as :agelere;s R

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    11/73

    Slide

    =(aluating use#ulness #or logistic models

    *!e "enc!mar t!at e ill use to c!aracteri%e a multinomiallogistic regression model as use#ul is a 6>? impro(ement o(ert!e rate o# accuracy ac!ie(a"le "y c!ance alone)

    =(en i# t!e independent (aria"les !ad no relations!ip to t!egroups de#ined "y t!e dependent (aria"le, e ould stille.pect to "e correct in our predictions o# group mem"ers!ipsome percentage o# t!e time) *!is is re#erred to as "y c!anceaccuracy)

    *!e estimate o# "y c!ance accuracy t!at e ill use is t!eproportional "y c!ance accuracy rate, computed "y summingt!e s+uared percentage o# cases in eac! group) *!e onlydi##erence "eteen "y c!ance accuracy #or "inary logisticmodels and "y c!ance accuracy #or multinomial logistic modelsis t!e num"er o# groups de#ined "y t!e dependent (aria"le)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    12/73

    Slide

    2Computing "y c!ance accuracy

    *!e percentage o# cases in eac! group de#ined "y t!e dependent(aria"le is #ound in t!e @Case $rocessing Summary ta"le)

    Case Processing Summary

    62 &7.1'

    9& 55.7'12 7.2'

    167 100.0'

    10&

    270

    15&a

    1

    2&

    (I)(*+,"

    + /I)"

    alid

    Mi33ing

    otal

    "$pop$lation

    Marginal

    ercentage

    e dependent ariale a3 only one al$e o3ered

    in 146 95.4' 3$pop$lation3.

    a.

    The proportional b chance accurac rate wascomputed b calculating the proportion of cases foreach group based on the number of cases in eachgroup in the ,ase Processing Summar,! and thensquaring and summing the proportion of cases in eachgroup ).'#/ 0 ).&&'/ 0 ).)'1/ 2 ).%&(.

    The proportional b chance accurac criteria is &3.34#.1& + %&.4 2 &3.34(.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    13/73

    Slide

    3Comparing accuracy rates

    *o c!aracteri%e our model as use#ul, e compare t!e o(erall

    percentage accuracy rate produced "y S$SS at t!e last step in !ic!

    (aria"les are entered to 6>? more t!an t!e proportional "y c!ance

    accuracy) :oteB S$SS does not compute a cross'(alidated accuracy

    rate #or multinomial logistic regression )9

    Classification

    15 47 0 24.2'

    7 86 0 92.5'

    5 7 0 .0'

    16.2' 8&.8' .0' 60.5'

    O3ered

    1

    2

    &

    Oerall ercentage

    1 2 &

    ercent

    !orrect

    redicted

    The classification accurac rate was 3).&4which was greater than or equal to theproportional b chance accurac criteria of&3.34 #.1& + %&.4 2 &3.34(.

    The criteria for classification accurac is

    satisfied in this e+ample.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    14/73

    Slide

    4:umerical pro"lems

    *!e ma.imum lieli!ood met!od used to calculate multinomiallogistic regression is an iterati(e #itting process t!at attemptsto cycle t!roug! repetitions to #ind an anser)

    Sometimes, t!e met!od ill "rea don and not "e a"le tocon(erge or #ind an anser)

    Sometimes t!e met!od ill produce ildly impro"a"le results,reporting t!at a one'unit c!ange in an independent (aria"leincreases t!e odds o# t!e modeled e(ent "y !undreds o#t!ousands or millions) *!ese implausi"le results can "eproduced "y multicollinearity, categories o# predictors !a(ing

    no cases or %ero cells, and complete separation !ere"y t!eto groups are per#ectly separated "y t!e scores on one ormore independent (aria"les)

    *!e clue t!at e !a(e numerical pro"lems and s!ould notinterpret t!e results are standard errors #or some independent(aria"les t!at are larger t!an 6)4)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    15/73

    Slide

    5

    Relations!ip o# indi(idual independent(aria"les and t!e dependent (aria"le

    *!ere are to types o# tests #or indi(idual independent

    (aria"lesB

    *!e lieli!ood ratio test e(aluates t!e o(erall relations!ip

    "eteen an independent (aria"le and t!e dependent

    (aria"le

    *!e Wald test e(aluates !et!er or not t!e independent

    (aria"le is statistically signi#icant in di##erentiating "eteen

    t!e to groups in eac! o# t!e em"edded "inary logistic

    comparisons)

    I# an independent (aria"le !as an o(erall relations!ip to t!e

    dependent (aria"le, it mig!t or mig!t not "e statistically

    signi#icant in di##erentiating "eteen pairs o# groups de#ined "y

    t!e dependent (aria"le)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    16/73

    Slide

    6

    Relations!ip o# indi(idual independent(aria"les and t!e dependent (aria"le

    *!e interpretation #or an independent (aria"le #ocuses on itsa"ility to distinguis! "eteen pairs o# groups and t!e

    contri"ution !ic! it maes to c!anging t!e odds o# "eing in

    one dependent (aria"le group rat!er t!an t!e ot!er)

    We s!ould not interpret t!e signi#icance o# an independent(aria"les role in distinguis!ing "eteen pairs o# groups unless

    t!e independent (aria"le also !as an o(erall relations!ip to t!e

    dependent (aria"le in t!e lieli!ood ratio test)

    *!e interpretation o# an independent (aria"les role indi##erentiating dependent (aria"le groups is t!e same as e

    used in "inary logistic regression) *!e di##erence in

    multinomial logistic regression is t!at e can !a(e multiple

    interpretations #or an independent (aria"le in relation to

    di##erent pairs o# groups)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    17/73

    Slide

    7

    Relations!ip o# indi(idual independent(aria"les and t!e dependent (aria"le

    Parameter Estimates

    &.240 2.478 1.709 1 .191

    .019 .020 .906 1 .&41 1.019

    .071 .108 .427 1 .514 1.07&

    -1.&7& .620 4.91& 1 .027 .25&

    &.6&9 2.456 2.195 1 .1&8

    .00& .020 .017 1 .897 1.00&

    .172 .110 2.46& 1 .117 1.188

    -1.657 .61& 7.298 1 .007 .191

    Intercept

    +)

    :!

    !OL)I"

    Intercept

    +)

    :!

    !OL)I"

    (I)(*+,"

    + /I)"

    a

    OO LIL

    +/O: I)(

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= OO M:!(.a.

    Parameter Estimates

    &.240 2.478 1.709 1 .191

    .019 .020 .906 1 .&41 1.019

    .071 .108 .427 1 .514 1.07&

    -1.&7& .620 4.91& 1 .027 .25&

    &.6&9 2.456 2.195 1 .1&8.00& .020 .017 1 .897 1.00&

    .172 .110 2.46& 1 .117 1.188

    -1.657 .61& 7.298 1 .007 .191

    Intercept

    +)

    :!

    !OL)I"

    Intercept

    +)

    :!

    !OL)I"

    (I)(*+,"

    + /I)"a

    1

    2

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= &.a.

    SPSS identifies the comparisons it ma5es forgroups defined b the dependent variable inthe table of 6Parameter 7stimates!8 using eitherthe value codes or the value labels! dependingon the options settings for pivot table labeling.

    The reference categor is identified in the

    footnote to the table.

    In this analsis! two comparisons will bemade9

    :the T;; T group coded 1 !shaded orange(( will be compared to theT;; M=> group coded ! shaded

    purple(.

    The reference categor plas the same role inmultinomial logistic regression that it plas inthe dumm-coding of a nominal variable9 it isthe categor that would be coded with Cerosfor all of the dumm-coded variables that allother categories are interpreted against.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    18/73

    Slide

    8

    Relations!ip o# indi(idual independent(aria"les and t!e dependent (aria"le

    Likelihood Ratio Tests

    268.&2& 2.&50 2 .&09

    268.625 2.652 2 .265

    270.&95 4.42& 2 .110

    275.194 9.221 2 .010

    %%ect

    Intercept

    +)

    :!

    !OL)I"

    -2 Log

    Likeliood o%

    ed$ced

    Model !i-"#$are d% "ig.

    e ci-3#$are 3tat i3tic i3 te di%%erence in -2 log-likeliood3

    etodel and a red$ced >odel. e red$ced >odel i3

    %or>ed y o>itting an e%%ect %ro> te %inal >odel. e n$ll ypote3i3

    i3 tat all para>eter3 o% tat e%%ect are 0.

    Parameter Estimates

    &.240 2.478 1.709 1 .191

    .019 .020 .906 1 .&41 1.019

    .071 .108 .427 1 .514 1.07&-1.&7& .620 4.91& 1 .027 .25&

    &.6&9 2.456 2.195 1 .1&8

    .00& .020 .017 1 .897 1.00&

    .172 .110 2.46& 1 .117 1.188

    -1.657 .61& 7.298 1 .007 .191

    Intercept

    +)

    :!!OL)I"

    Intercept

    +)

    :!

    !OL)I"

    (I)(*+,"

    + /I)"a

    1

    2

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= &.a.

    In this e+ample! there is astatisticall significantrelationship between theindependent variable;D

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    19/73

    Slide

    9

    Interpreting relations!ip o# indi(idual independent(aria"les to t!e dependent (aria"le

    Likelihood Ratio Tests

    268.&2& 2.&50 2 .&09

    268.625 2.652 2 .265

    270.&95 4.42& 2 .110

    275.194 9.221 2 .010

    %%ect

    Intercept

    +)

    :!

    !OL)I"

    -2 Log

    Likeliood o%

    ed$ced

    Model !i-"#$are d% "ig.

    e ci-3#$are 3tat i3tic i3 te di%%erence in -2 log-likeliood3

    etodel and a red$ced >odel. e red$ced >odel i3

    %or>ed y o>itting an e%%ect %ro> te %inal >odel. e n$ll ypote3i3

    i3 tat all para>eter3 o% tat e%%ect are 0.

    Parameter Estimates

    &.240 2.478 1.709 1 .191

    .019 .020 .906 1 .&41 1.019

    .071 .108 .427 1 .514 1.07&-1.&7& .620 4.91& 1 .027 .25&

    &.6&9 2.456 2.195 1 .1&8

    .00& .020 .017 1 .897 1.00&

    .172 .110 2.46& 1 .117 1.188

    -1.657 .61& 7.298 1 .007 .191

    Intercept

    +)

    :!!OL)I"

    Intercept

    +)

    :!

    !OL)I"

    (I)(*+,"

    + /I)"a

    1

    2

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= &.a.

    Surve respondents who had less confidence in congress highervalues correspond to lower confidence( were less li5el to be in thegroup of surve respondents who thought we spend too little moneon highwas and bridges G categor #(! rather than the group ofsurve respondents who thought we spend too much mone onhighwas and bridges G categor (.

    For each unit increase in confidence in ongress! the odds of being

    in the group of surve respondents who thought we spend too littlemone on highwas and bridges decreased b '%.'4. ).1& H #.)2 -).'%'(

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    20/73

    Slide

    2!

    Interpreting relations!ip o# indi(idual independent(aria"les to t!e dependent (aria"le

    Likelihood Ratio Tests

    268.&2& 2.&50 2 .&09

    268.625 2.652 2 .265

    270.&95 4.42& 2 .110

    275.194 9.221 2 .010

    %%ect

    Intercept

    +)

    :!

    !OL)I"

    -2 Log

    Likeliood o%

    ed$ced

    Model !i-"#$are d% "ig.

    e ci-3#$are 3tat i3tic i3 te di%%erence in -2 log-likeliood3etodel and a red$ced >odel. e red$ced >odel i3

    %or>ed y o>itting an e%%ect %ro> te %inal >odel. e n$ll ypote3i3

    i3 tat all para>eter3 o% tat e%%ect are 0.

    Parameter Estimates

    &.240 2.478 1.709 1 .191

    .019 .020 .906 1 .&41 1.019

    .071 .108 .427 1 .514 1.07&-1.&7& .620 4.91& 1 .027 .25&

    &.6&9 2.456 2.195 1 .1&8

    .00& .020 .017 1 .897 1.00&

    .172 .110 2.46& 1 .117 1.188

    -1.657 .61& 7.298 1 .007 .191

    Intercept

    +)

    :!!OL)I"

    Intercept

    +)

    :!

    !OL)I"

    (I)(*+,"

    + /I)"a

    1

    2

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= &.a.

    Surve respondents who had less confidence in congress highervalues correspond to lower confidence( were less li5el to be in thegroup of surve respondents who thought we spend about the rightamount of mone on highwas and bridges G categor 1(! ratherthan the group of surve respondents who thought we spend toomuch mone on highwas and bridges G ategor (.

    For each unit increase in confidence in ongress! the odds of beingin the group of surve respondents who thought we spend about theright amount of mone on highwas and bridges decreased b$).4. ).## H #.) 2 ).$)(

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    21/73

    Slide

    2

    Relations!ip o# indi(idual independent(aria"les and t!e dependent (aria"le

    Likelihood Ratio Tests

    &27.46&a .000 0 .

    &&&.440 5.976 2 .050

    &29.606 2.14& 2 .&4&

    &&4.6&6 7.17& 2 .028

    &&8.985 11.521 2 .00&

    %%ect

    Intercept

    +)

    :!

    OLI*"

    "?

    -2 LogLikeliood o%

    ed$ced

    Model !i-"#$are d% "ig.

    e ci-3#$are 3tati3 tic i3 te di%%erence in -2 log-likeliood3

    etodel and a red$ced >odel. e red$ced >odel

    i3 %or>ed y o>itting an e%%ect %ro> te %inal >odel. e n$ll

    ypote3i3 i3 tat all para>eter3 o% tat e%%ect are 0.

    i3 red$ced >odel i3 e#$ialent to te %inal >odel eca$3e

    o>itting te e%%ect doe3 not increa3e te degree3 o% %reedo>.

    a.

    Parameter Estimates

    8.4&4 2.2&& 14.261 1 .000

    -.02& .017 1.756 1 .185 .977

    -.066 .102 .414 1 .520 .9&6

    -.575 .251 5.2&4 1 .022 .56&

    -2.167 .805 7.242 1 .007 .115

    05 . . 0 . .

    4.485 2.255 &.955 1 .047

    -.001 .018 .00& 1 .955 .999

    .011 .104 .011 1 .916 1.011

    -.&97 .257 2.&75 1 .12& .67&

    -1.606 .824 &.800 1 .051 .201

    05 . . 0 . .

    Intercept

    +)

    :!

    OLI*"

    @"?A1B

    @"?A2B

    Intercept

    +)

    :!

    OLI*"

    @"?A1B

    @"?A2B

    +!(La

    OO LIL

    +/O: I)(

    / "td. rror *ald d% "ig. ;p/

    e re%erence cate or i3= OO M:!(.a.

    In this e+ample! there isa statisticall significantrelationship between S7Jand the dependentvariable! spending onchildcare assistance.

    ?s well! S7J plas astatisticall significant rolein differentiating the T;;

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    22/73

    Slide

    22

    Interpreting relations!ip o# indi(idual independent(aria"les and t!e dependent (aria"le

    Likelihood Ratio Tests

    &27.46&a .000 0 .

    &&&.440 5.976 2 .050

    &29.606 2.14& 2 .&4&

    &&4.6&6 7.17& 2 .028

    &&8.985 11.521 2 .00&

    %%ect

    Intercept

    +)

    :!

    OLI*"

    "?

    -2 LogLikeliood o%

    ed$ced

    Model !i-"#$are d% "ig.

    e ci-3#$are 3tati3 tic i3 te di%%erence in -2 log-likeliood3

    etodel and a red$ced >odel. e red$ced >odel

    i3 %or>ed y o>itting an e%%ect %ro> te %inal >odel. e n$ll

    ypote3i3 i3 tat all para>eter3 o% tat e%%ect are 0.

    i3 red$ced >odel i3 e#$ialent to te %inal >odel eca$3e

    o>itting te e%%ect doe3 not increa3e te degree3 o% %reedo>.

    a.

    Parameter Estimates

    8.4&4 2.2&& 14.261 1 .000

    -.02& .017 1.756 1 .185 .977

    -.066 .102 .414 1 .520 .9&6

    -.575 .251 5.2&4 1 .022 .56&

    -2.167 .805 7.242 1 .007 .115

    05 . . 0 . .

    4.485 2.255 &.955 1 .047

    -.001 .018 .00& 1 .955 .999

    .011 .104 .011 1 .916 1.011

    -.&97 .257 2.&75 1 .12& .67&

    -1.606 .824 &.800 1 .051 .201

    05 . . 0 . .

    Intercept

    +)

    :!

    OLI*"

    @"?A1B

    @"?A2B

    Intercept

    +)

    :!

    OLI*"

    @"?A1B

    @"?A2B

    +!(La

    OO LIL

    +/O: I)(

    / "td. rror *ald d% "ig. ;p/

    e re%erence cate or i3= OO M:!(.a.

    Surve respondents who were male code # for se+( were less li5elto be in the group of surve respondents who thought we spend toolittle mone on childcare assistance G categor #(! rather than thegroup of surve respondents who thought we spend too muchmone on childcare assistance G categor (.

    Surve respondents who were male were $$.&4 less li5el ).##& H#.) 2 -).$$&( to be in the group of surve respondents who thoughtwe spend too little mone on childcare assistance.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    23/73

    Slide

    23

    Interpreting relations!ips #or independent(aria"le in pro"lems

    In t!e multinomial logistic regression pro"lems, t!e pro"lem

    statement ill as a"out only one o# t!e independent (aria"les)

    *!e anser ill "e true or #alse "ased on only t!e relations!ip

    "eteen t!e speci#ied independent (aria"le and t!e dependent

    (aria"le) *!e indi(idual relations!ips "eteen ot!erindependent (aria"les are t!e dependent (aria"le are not used

    in determining !et!er or not t!e anser is true or #alse)

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    24/73

    Slide

    24$ro"lem 1

    11) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o#a statisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, andt!at t!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o#signi#icance o# 4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les GageG Hage, G!ig!est year o# sc!ool completedG Heduc and Gcon#idence in CongressGHconlegis ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses toGopinion a"out spending on !ig!ays and "ridgesG Hnatroad) *!ese predictors di##erentiatesur(ey respondents !o t!oug!t e spend too little money on !ig!ays and "ridges #rom sur(eyrespondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges and sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on !ig!ays and "ridges#rom sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges)

    Among t!is set o# predictors, con#idence in Congress as !elp#ul in distinguis!ing among t!egroups de#ined "y responses to opinion a"out spending on !ig!ays and "ridges) Sur(eyrespondents !o !ad less con#idence in congress ere less liely to "e in t!e group o# sur(eyrespondents !o t!oug!t e spend too little money on !ig!ays and "ridges, rat!er t!an t!egroup o# sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges)-or eac! unit increase in con#idence in Congress, t!e odds o# "eing in t!e group o# sur(eyrespondents !o t!oug!t e spend too little money on !ig!ays and "ridges decreased "y

    7J)7?) Sur(ey respondents !o !ad less con#idence in congress ere less liely to "e in t!egroup o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on!ig!ays and "ridges, rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spend toomuc! money on !ig!ays and "ridges) -or eac! unit increase in con#idence in Congress, t!e oddso# "eing in t!e group o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o#money on !ig!ays and "ridges decreased "y 84)K?)

    1) *rue

    6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    25/73

    Slide

    25Dissecting pro"lem 1 ' 1

    11) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o# astatisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, andt!at t!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o#signi#icance o# 4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les GageG Hage, G!ig!est year o# sc!ool completedG Heduc and Gcon#idence in CongressGHconlegis ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses to Gopiniona"out spending on !ig!ays and "ridgesG Hnatroad) *!ese predictors di##erentiate sur(eyrespondents !o t!oug!t e spend too little money on !ig!ays and "ridges #rom sur(eyrespondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges and sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on !ig!ays and "ridges#rom sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges)

    Among t!is set o# predictors, con#idence in Congress as !elp#ul in distinguis!ing among t!egroups de#ined "y responses to opinion a"out spending on !ig!ays and "ridges) Sur(eyrespondents !o !ad less con#idence in congress ere less liely to "e in t!e group o# sur(eyrespondents !o t!oug!t e spend too little money on !ig!ays and "ridges, rat!er t!an t!egroup o# sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges) -oreac! unit increase in con#idence in Congress, t!e odds o# "eing in t!e group o# sur(ey respondents!o t!oug!t e spend too little money on !ig!ays and "ridges decreased "y 7J)7?) Sur(eyrespondents !o !ad less con#idence in congress ere less liely to "e in t!e group o# sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on !ig!ays and "ridges,rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spend too muc! money on !ig!aysand "ridges) -or eac! unit increase in con#idence in Congress, t!e odds o# "eing in t!e group o#sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on !ig!ays and"ridges decreased "y 84)K?)

    1) *rue

    6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

    For these problems! we willassume that there is no problem

    with missing data! outliers! orinfluential cases! and that thevalidation analsis will confirmthe generaliCabilit of theresults

    In this problem! we are told touse ).)& as alpha for themultinomial logistic regression.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    26/73

    Slide

    26Dissecting pro"lem 1 ' 6

    11) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o#a statisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, andt!at t!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o#signi#icance o# 4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les "age" [age], "highest year of school completed" [educ] and "confidence in

    Congress" [conlegis]ere use#ul predictors #or distinguis!ing "eteen groups "ased onresponses to "opinion about spending on highways and bridges" [natroad].*!ese predictorsdi##erentiate sur(ey respondents !o t!oug!t e spend too little money on !ig!ays and"ridges #rom sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and"ridges and sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on!ig!ays and "ridges #rom sur(ey respondents !o t!oug!t e spend too muc! money on!ig!ays and "ridges)

    Among t!is set o# predictors, con#idence in Congress as !elp#ul in distinguis!ing among t!egroups de#ined "y responses to opinion a"out spending on !ig!ays and "ridges) Sur(ey

    respondents !o !ad less con#idence in congress ere less liely to "e in t!e group o# sur(eyrespondents !o t!oug!t e spend too little money on !ig!ays and "ridges, rat!er t!an t!egroup o# sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges)-or eac! unit increase in con#idence in Congress, t!e odds o# "eing in t!e group o# sur(eyrespondents !o t!oug!t e spend too little money on !ig!ays and "ridges decreased "y7J)7?) Sur(ey respondents !o !ad less con#idence in congress ere less liely to "e in t!egroup o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on!ig!ays and "ridges, rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spend toomuc! money on !ig!ays and "ridges) -or eac! unit increase in con#idence in Congress, t!e odds

    o# "eing in t!e group o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o#money on !ig!ays and "ridges decreased "y 84)K?)

    SPSS onl supports direct orsimultaneous entr of independentvariables in multinomial logisticregression! so we have no choice ofmethod for entering variables.

    The variables listed first in the problemstatement are the independent variablesIGs(9 "age" Lage! "highest ear of schoolcompleted" Leduc and "confidence inongress" Lconlegis.

    The variable used to definegroups is the dependentvariable G(9 "opinion aboutspending on highwas and

    bridges" Lnatroad.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    27/73

    Slide

    27Dissecting pro"lem 1 ' 3

    11) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o# astatisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, and t!att!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o# signi#icance o#4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les GageG Hage, G!ig!est year o# sc!ool completedG Heduc and Gcon#idence in CongressG

    Hconlegis ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses to Gopiniona"out spending on !ig!ays and "ridgesG Hnatroad) *!ese predictors di##erentiate surveyrespondents who thought we spend too little money on highways and bridges from surveyrespondents who thought we spend too much money on highways and bridgesand surveyrespondents who thought we spend about the right amount of money on highways and bridgesfrom survey respondents who thought we spend too much money on highways and bridges)

    Among t!is set o# predictors, con#idence in Congress as !elp#ul in distinguis!ing among t!e groupsde#ined "y responses to opinion a"out spending on !ig!ays and "ridges) Sur(ey respondents !o !adless con#idence in congress ere less liely to "e in t!e group o# sur(ey respondents !o t!oug!t e

    spend too little money on !ig!ays and "ridges, rat!er t!an t!e group o# sur(ey respondents !ot!oug!t e spend too muc! money on !ig!ays and "ridges) -or eac! unit increase in con#idence inCongress, t!e odds o# "eing in t!e group o# sur(ey respondents !o t!oug!t e spend too little moneyon !ig!ays and "ridges decreased "y 7J)7?) Sur(ey respondents !o !ad less con#idence in congressere less liely to "e in t!e group o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!tamount o# money on !ig!ays and "ridges, rat!er t!an t!e group o# sur(ey respondents !o t!oug!te spend too muc! money on !ig!ays and "ridges) -or eac! unit increase in con#idence in Congress,t!e odds o# "eing in t!e group o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o#money on !ig!ays and "ridges decreased "y 84)K?)

    SPSS multinomial logistic regression models the relationship bcomparing each of the groups defined b the dependent variable to thegroup with the highest code value.

    The responses to opinion about spending on highwas and bridges were9

    #2 Too little! 1 2 ?bout right! and 2 Too much.

    The analsis will result in two comparisons9: surve respondents who thought we spend too little mone

    versus surve respondents who thought we spend too muchmone on highwas and bridges

    : surve respondents who thought we spend about the rightamount of mone versus surve respondents who thought wespend too much mone on highwas and bridges.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    28/73

    Slide

    28Dissecting pro"lem 1 ' J

    *!e (aria"les GageG Hage, G!ig!est year o# sc!ool completedG Heduc and Gcon#idence inCongressG Hconlegis ere use#ul predictors #or distinguis!ing "eteen groups "ased onresponses to Gopinion a"out spending on !ig!ays and "ridgesG Hnatroad) *!ese predictors

    di##erentiate sur(ey respondents !o t!oug!t e spend too little money on !ig!ays and"ridges #rom sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and"ridges and sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on!ig!ays and "ridges #rom sur(ey respondents !o t!oug!t e spend too muc! money on!ig!ays and "ridges)

    Among t!is set o# predictors, con#idence in Congress as !elp#ul in distinguis!ing among t!egroups de#ined "y responses to opinion a"out spending on !ig!ays and "ridges) Surveyrespondents who had less confidence in congress were less likely to be in the group ofsurvey respondents who thought we spend too little money on highways and bridges, rather

    than the group of survey respondents who thought we spend too much money on highwaysand bridges. For each unit increase in confidence in Congress, the odds of being in thegroup of survey respondents who thought we spend too little money on highways andbridges decreased by !..Survey respondents who had less confidence in congress wereless likely to be in the group of survey respondents who thought we spend about the rightamount of money on highways and bridges, rather than the group of survey respondentswho thought we spend too much money on highways and bridges. For each unit increase inconfidence in Congress, the odds of being in the group of survey respondents who thoughtwe spend about the right amount of money on highways and bridges decreased by #$.%.

    7ach problem includes a statement about the relationship betweenone independent variable and the dependent variable. The answerto the problem is based on the stated relationship! ignoring therelationships between the other independent variables and thedependent variable.

    This problem identifies a difference for both of the comparisonsamong groups modeled b the multinomial logistic regression.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    29/73

    Slide

    29Dissecting pro"lem 1 ' >

    11) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o# astatisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, and t!at t!e(alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o# signi#icance o# 4)4> #ore(aluating t!e statistical relations!ips)

    *!e (aria"les GageG Hage, G!ig!est year o# sc!ool completedG Heduc and Gcon#idence in CongressGHconlegis ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses to Gopinion a"outspending on !ig!ays and "ridgesG Hnatroad) *!ese predictors di##erentiate sur(ey respondents !ot!oug!t e spend too little money on !ig!ays and "ridges #rom sur(ey respondents !o t!oug!t espend too muc! money on !ig!ays and "ridges and sur(ey respondents !o t!oug!t e spend a"out t!erig!t amount o# money on !ig!ays and "ridges #rom sur(ey respondents !o t!oug!t e spend too muc!money on !ig!ays and "ridges)

    Among t!is set o# predictors, con#idence in Congress as !elp#ul in distinguis!ing among t!e groupsde#ined "y responses to opinion a"out spending on !ig!ays and "ridges) Sur(ey respondents !o !ad lesscon#idence in congress ere less liely to "e in t!e group o# sur(ey respondents !o t!oug!t e spend toolittle money on !ig!ays and "ridges, rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spendtoo muc! money on !ig!ays and "ridges) -or eac! unit increase in con#idence in Congress, t!e odds o#"eing in t!e group o# sur(ey respondents !o t!oug!t e spend too little money on !ig!ays and "ridgesdecreased "y 7J)7?) Sur(ey respondents !o !ad less con#idence in congress ere less liely to "e in t!e

    group o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on !ig!ays and"ridges, rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spend too muc! money on !ig!aysand "ridges) -or eac! unit increase in con#idence in Congress, t!e odds o# "eing in t!e group o# sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on !ig!ays and "ridges decreased"y 84)K?)

    In order for the multinomial logistic regressionquestion to be true! the overall relationship mustbe statisticall significant! there must be noevidence of numerical problems! the classificationaccurac rate must be substantiall better thancould be obtained b chance alone! and thestated individual relationship must be statisticallsignificant and interpreted correctl.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    30/73

    Slide

    3!Re+uest multinomial logistic regression

    Select theRegression |Multinomial Logisticcommand from the

    Analyze menu.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    31/73

    Slide

    3Selecting t!e dependent (aria"le

    Second! clic5 on the rightarrow button to move thedependent variable to theDependentte+t bo+.

    First! highlight thedependent variablenatroadin the listof variables.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    32/73

    Slide

    32Selecting metric independent (aria"les

    Move the metricindependent variables!age, educand conlegistothe Covariate(s)list bo+.

    Metric independent variables are specified as covariatesin multinomial logistic regression. Metric variables canbe either interval or! b convention! ordinal.

    In this analsis! there are no non-metric independent variables. Don-metric independent variables would be

    moved to the actor(s)list bo+.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    33/73

    Slide

    33Speci#ying statistics to include in t!e output

    Nhile we will accept most ofthe SPSS defaults for theanalsis! we need to specificallrequest the classification table.

    lic5 on the !tatisticsO buttonto ma5e a request.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    34/73

    Slide

    34Re+uesting t!e classi#ication ta"le

    First! 5eep the SPSSdefaults for !ummarystatistics! Li"eli#oodratio test! and$arameter estimates.

    Second! mar5 thechec5bo+ for the

    Classi%ication ta&le.

    Third! clic5on theContinuebutton tocomplete therequest.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    35/73

    Slide

    35

    Completing t!e multinomiallogistic regression re+uest

    lic5 on the '

    button to requestthe output for themultinomial logisticregression.

    The multinomial logistic procedure supportsadditional commands to specif the modelcomputed for the relationships we will use thedefault main effects model(! additionalspecifications for computing the regression!and saving classification results. Ne will notma5e use of these options.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    36/73

    Slide

    36L=2=L - M=ASFR=M=:* ' 1

    11) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o#a statisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, andt!at t!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o#signi#icance o# 4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les GageG Hage, G!ig!est year o# sc!ool completedG Heduc and Gcon#idence in CongressGHconlegis ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses to"opinion about spending on highways and bridges" [natroad]) *!ese predictors di##erentiatesur(ey respondents !o t!oug!t e spend too little money on !ig!ays and "ridges #rom sur(eyrespondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges and sur(ey

    respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on !ig!ays and "ridges#rom sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges)

    Among t!is set o# predictors, con#idence in Congress as !elp#ul in distinguis!ing among t!egroups de#ined "y responses to opinion a"out spending on !ig!ays and "ridges) Sur(eyrespondents !o !ad less con#idence in congress ere less liely to "e in t!e group o# sur(eyrespondents !o t!oug!t e spend too little money on !ig!ays and "ridges, rat!er t!an t!egroup o# sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges)-or eac! unit increase in con#idence in Congress, t!e odds o# "eing in t!e group o# sur(eyrespondents !o t!oug!t e spend too little money on !ig!ays and "ridges decreased "y

    7J)7?) Sur(ey respondents !o !ad less con#idence in congress ere less liely to "e in t!egroup o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on!ig!ays and "ridges, rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spend toomuc! money on !ig!ays and "ridges) -or eac! unit increase in con#idence in Congress, t!e oddso# "eing in t!e group o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o#money on !ig!ays and "ridges decreased "y 84)K?)

    1) *rue

    6) *rue it! caution

    Multinomial logistic regression requires that thedependent variable be non-metric and theindependent variables be metric or dichotomous.

    ";pinion about spending on highwas andbridges" Lnatroad is ordinal! satisfing the non-metric level of measurement requirement for thedependent variable.

    It contains three categories9 surve respondentswho thought we spend too little mone! aboutthe right amount of mone! and too much moneon highwas and bridges.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    37/73

    Slide

    37L=2=L - M=ASFR=M=:* ' 6

    11) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o# astatisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, and t!att!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o# signi#icance o#4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les "age" [age], "highest year of school completed" [educ]and "confidence in

    Congress" [conlegis]ere use#ul predictors #or distinguis!ing "eteen groups "ased on responsesto Gopinion a"out spending on !ig!ays and "ridgesG Hnatroad) *!ese predictors di##erentiatesur(ey respondents !o t!oug!t e spend too little money on !ig!ays and "ridges #rom sur(eyrespondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges and sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on !ig!ays and "ridges #romsur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges)

    Among t!is set o# predictors, con#idence in Congress as !elp#ul in distinguis!ing among t!e groupsde#ined "y responses to opinion a"out spending on !ig!ays and "ridges) Sur(ey respondents !o!ad less con#idence in congress ere less liely to "e in t!e group o# sur(ey respondents !ot!oug!t e spend too little money on !ig!ays and "ridges, rat!er t!an t!e group o# sur(eyrespondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges) -or eac! unitincrease in con#idence in Congress, t!e odds o# "eing in t!e group o# sur(ey respondents !ot!oug!t e spend too little money on !ig!ays and "ridges decreased "y 7J)7?) Sur(ey respondents!o !ad less con#idence in congress ere less liely to "e in t!e group o# sur(ey respondents !ot!oug!t e spend a"out t!e rig!t amount o# money on !ig!ays and "ridges, rat!er t!an t!e groupo# sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges) -or eac!unit increase in con#idence in Congress, t!e odds o# "eing in t!e group o# sur(ey respondents !ot!oug!t e spend a"out t!e rig!t amount o# money on !ig!ays and "ridges decreased "y 84)K?)

    "?ge" Lage and "highest ear ofschool completed" Leduc are interval!satisfing the metric or dichotomouslevel of measurement requirement forindependent variables.

    "onfidence in ongress" Lconlegis is ordinal!satisfing the metric or dichotomous level ofmeasurement requirement for independentvariables. If we follow the convention of treatingordinal level variables as metric variables! the level

    of measurement requirement for the analsis issatisfied. Since some data analsts do not agreewith this convention! a note of caution should beincluded in our interpretation.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    38/73

    Slide

    38Sample si%e ratio o# cases to (aria"les

    Case Processing Summary

    62 &7.1'

    9& 55.7'

    12 7.2'

    167 100.0'

    10&

    270

    15&a

    1

    2

    &

    (I)(*+,"

    + /I)"

    alid

    Mi33ing

    otal

    "$pop$lation

    Marginal

    ercentage

    e dependent ariale a3 only one al$e o3ered

    in 146 95.4' 3$pop$lation3.

    a.

    Multinomial logistic regression requires that the minimum ratioof valid cases to independent variables be at least #) to #. Theratio of valid cases #3'( to number of independent variables( was &&.' to #! which was equal to or greater than the

    minimum ratio. The requirement for a minimum ratio of casesto independent variables was satisfied.

    The preferred ratio of valid cases to independent variables is1) to #. The ratio of &&.' to # was equal to or greater than thepreferred ratio. The preferred ratio of cases to independentvariables was satisfied.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    39/73

    Slide

    39

    2=RALL R=LA*I:S5I$ =*W==:I:D=$=:D=:* A:D D=$=:D=:* 2ARIAL=S

    Model Fitting Information

    284.429

    265.972 18.457 6 .005

    Model

    Intercept Only

    Final

    -2 Log

    Likeliood !i-"#$are d% "ig.

    The presence of a relationship between the dependentvariable and combination of independent variables isbased on the statistical significance of the final modelchi-square in the SPSS table titled "Model FittingInformation".

    In this analsis! the probabilit of the model chi-square

    #$.%&'( was ).))&! less than or equal to the level ofsignificance of ).)&. The null hpothesis that there wasno difference between the model without independentvariables and the model with independent variables wasre*ected. The e+istence of a relationship between theindependent variables and the dependent variable wassupported.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    40/73

    Slide

    4!

    Parameter Estimates

    &.240 2.478 1.709 1 .191

    .019 .020 .906 1 .&41 1.019

    .071 .108 .427 1 .514 1.07&

    -1.&7& .620 4.91& 1 .027 .25&

    &.6&9 2.456 2.195 1 .1&8

    .00& .020 .017 1 .897 1.00&

    .172 .110 2.46& 1 .117 1.188

    -1.657 .61& 7.298 1 .007 .191

    Intercept

    +)

    :!

    !OL)I"

    Intercept

    +)

    :!

    !OL)I"

    (I)(*+,"

    + /I)"a

    1

    2

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= &.a.

    :FM=RICAL $RL=MS

    Multicollinearit in the multinomiallogistic regression solution isdetected b e+amining the standarderrors for the b coefficients. ?standard error larger than 1.)indicates numerical problems! such

    as multicollinearit among theindependent variables! Cero cells fora dumm-coded independentvariable because all of the sub*ectshave the same value for thevariable! and ,complete separation,whereb the two groups in thedependent event variable can beperfectl separated b scores onone of the independent variables.?nalses that indicate numericalproblems should not be interpreted.

    Done of the independent variables inthis analsis had a standard errorlarger than 1.). Ne are notinterested in the standard errorsassociated with the intercept.(

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    41/73

    Slide

    4

    Likelihood Ratio Tests

    268.&2& 2.&50 2 .&09

    268.625 2.652 2 .265

    270.&95 4.42& 2 .110

    275.194 9.221 2 .010

    %%ect

    Intercept

    +)

    :!

    !OL)I"

    -2 Log

    Likeliood o%

    ed$ced

    Model !i-"#$are d% "ig.

    e ci-3#$are 3tati3tic i3 te di%%erence in -2 log-likeliood3

    etodel and a red$ced >odel. e red$ced >odel i3

    %or>ed y o>itting an e%%ect %ro> te %inal >odel. e n$ll ypote3i3

    i3 tat all para>eter3 o% tat e%%ect are 0.

    R=LA*I:S5I$ - I:DI2IDFAL I:D=$=:D=:*2ARIAL=S * D=$=:D=:* 2ARIAL= ' 1

    The statistical significance of the relationship betweenconfidence in ongress and opinion about spending onhighwas and bridges is based on the statistical significance ofthe chi-square statistic in the SPSS table titled "

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    42/73

    Slide

    42

    Parameter Estimates

    &.240 2.478 1.709 1 .191

    .019 .020 .906 1 .&41 1.019

    .071 .108 .427 1 .514 1.07&

    -1.&7& .620 4.91& 1 .027 .25&&.6&9 2.456 2.195 1 .1&8

    .00& .020 .017 1 .897 1.00&

    .172 .110 2.46& 1 .117 1.188

    -1.657 .61& 7.298 1 .007 .191

    Intercept

    +)

    :!

    !OL)I"Intercept

    +)

    :!

    !OL)I"

    (I)(*+,"

    + /I)"a

    1

    2

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= &.a.

    R=LA*I:S5I$ - I:DI2IDFAL I:D=$=:D=:*2ARIAL=S * D=$=:D=:* 2ARIAL= ' 6

    In the comparison of surve respondents who thought we spend

    too little mone on highwas and bridges to surve respondentswho thought we spend too much mone on highwas andbridges! the probabilit of the Nald statistic %.#( for thevariable confidence in ongress Lconlegis was ).)1'. Since theprobabilit was less than or equal to the level of significance of).)&! the null hpothesis that the b coefficient for confidence inongress was equal to Cero for this comparison was re*ected.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    43/73

    Slide

    43

    Parameter Estimates

    &.240 2.478 1.709 1 .191

    .019 .020 .906 1 .&41 1.019

    .071 .108 .427 1 .514 1.07&

    -1.&7& .620 4.91& 1 .027 .25&

    &.6&9 2.456 2.195 1 .1&8

    .00& .020 .017 1 .897 1.00&

    .172 .110 2.46& 1 .117 1.188

    -1.657 .61& 7.298 1 .007 .191

    Intercept

    +)

    :!

    !OL)I"

    Intercept+)

    :!

    !OL)I"

    (I)(*+,"

    + /I)"a

    1

    2

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= &.a.

    R=LA*I:S5I$ - I:DI2IDFAL I:D=$=:D=:*2ARIAL=S * D=$=:D=:* 2ARIAL= ' 3

    The value of 7+p@( was ).1& which implies that for each unitincrease in confidence in ongress the odds decreased b '%.'4).1& - #.) 2 -).'%'(.

    The relationship stated in the problem is supported. Surverespondents who had less confidence in congress were less li5elto be in the group of surve respondents who thought we spendtoo little mone on highwas and bridges! rather than the group ofsurve respondents who thought we spend too much mone onhighwas and bridges. For each unit increase in confidence inongress! the odds of being in the group of surve respondentswho thought we spend too little mone on highwas and bridgesdecreased b '%.'4.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    44/73

    Slide

    44

    Parameter Estimates

    &.240 2.478 1.709 1 .191

    .019 .020 .906 1 .&41 1.019

    .071 .108 .427 1 .514 1.07&

    -1.&7& .620 4.91& 1 .027 .25&&.6&9 2.456 2.195 1 .1&8

    .00& .020 .017 1 .897 1.00&

    .172 .110 2.46& 1 .117 1.188

    -1.657 .61& 7.298 1 .007 .191

    Intercept

    +)

    :!

    !OL)I"Intercept

    +)

    :!

    !OL)I"

    (I)(*+,"

    + /I)"a

    1

    2

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= &.a.

    R=LA*I:S5I$ - I:DI2IDFAL I:D=$=:D=:*2ARIAL=S * D=$=:D=:* 2ARIAL= ' J

    In the comparison of surve respondents who thought we spendabout the right amount of mone on highwas and bridges tosurve respondents who thought we spend too much mone onhighwas and bridges! the probabilit of the Nald statistic'.1$( for the variable confidence in ongress Lconlegis was).))'. Since the probabilit was less than or equal to the levelof significance of ).)&! the null hpothesis that the b coefficientfor confidence in ongress was equal to Cero for this comparisonwas re*ected.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    45/73

    Slide

    45

    Parameter Estimates

    &.240 2.478 1.709 1 .191

    .019 .020 .906 1 .&41 1.019

    .071 .108 .427 1 .514 1.07&

    -1.&7& .620 4.91& 1 .027 .25&

    &.6&9 2.456 2.195 1 .1&8

    .00& .020 .017 1 .897 1.00&

    .172 .110 2.46& 1 .117 1.188

    -1.657 .61& 7.298 1 .007 .191

    Intercept

    +)

    :!

    !OL)I"

    Intercept

    +)

    :!

    !OL)I"

    (I)(*+,"

    + /I)"a

    1

    2

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= &.a.

    R=LA*I:S5I$ - I:DI2IDFAL I:D=$=:D=:*2ARIAL=S * D=$=:D=:* 2ARIAL= ' >

    The value of 7+p@( was ).## which implies that for each unit increase in

    confidence in ongress the odds decreased b $).4 ).##-#.)2-).$)(.The relationship stated in the problem is supported. Surve respondentswho had less confidence in congress were less li5el to be in the group ofsurve respondents who thought we spend about the right amount ofmone on highwas and bridges! rather than the group of surverespondents who thought we spend too much mone on highwas andbridges. For each unit increase in confidence in ongress! the odds ofbeing in the group of surve respondents who thought we spend about theright amount of mone on highwas and bridges decreased b $).4.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    46/73

    Slide

    46

    Case Processing Summary

    62 &7.1'

    9& 55.7'

    12 7.2'

    167 100.0'

    10&

    270

    15&a

    1

    2

    &

    (I)(*+,"

    + /I)"

    alid

    Mi33ing

    otal

    "$pop$lation

    Marginal

    ercentage

    e dependent ariale a3 only one al$e o3ered

    in 146 95.4' 3$pop$lation3.

    a.

    CLASSI-ICA*I: FSI: *5= MFL*I:MIAL LIS*ICR=R=SSI: MD=LB C5A:C= ACCFRAC RA*=

    The proportional b chance accurac rate was computed bcalculating the proportion of cases for each group based onthe number of cases in each group in the ,ase ProcessingSummar,! and then squaring and summing the proportion ofcases in each group ).'#/ 0 ).&&'/ 0 ).)'1/ 2 ).%&(.

    The independent variables could be characteriCed as usefulpredictors distinguishing surve respondents who thought wespend too little mone on highwas and bridges! surverespondents who thought we spend about the right amountof mone on highwas and bridges and surve respondentswho thought we spend too much mone on highwas andbridges if the classification accurac rate was substantiallhigher than the accurac attainable b chance alone.;perationall! the classification accurac rate should be 1&4or more higher than the proportional b chance accuracrate.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    47/73

    Slide

    47

    Classification

    15 47 0 24.2'

    7 86 0 92.5'

    5 7 0 .0'

    16.2' 8&.8' .0' 60.5'

    O3ered

    1

    2

    &

    Oerall ercentage

    1 2 &

    ercent

    !orrect

    redicted

    CLASSI-ICA*I: FSI: *5= MFL*I:MIAL LIS*ICR=R=SSI: MD=LB CLASSI-ICA*I: ACCFRAC

    The classification accurac rate was 3).&4which was greater than or equal to the

    proportional b chance accurac criteria of&3.34 #.1& + %&.4 2 &3.34(.

    The criteria for classification accurac issatisfied.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    48/73

    Slide

    48Ansering t!e +uestion in pro"lem 1 ' 1

    11) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o# astatisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, and t!att!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o# signi#icance o#4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les GageG Hage, G!ig!est year o# sc!ool completedG Heduc and Gcon#idence in CongressGHconlegis ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses to Gopiniona"out spending on !ig!ays and "ridgesG Hnatroad) *!ese predictors di##erentiate sur(ey respondents!o t!oug!t e spend too little money on !ig!ays and "ridges #rom sur(ey respondents !ot!oug!t e spend too muc! money on !ig!ays and "ridges and sur(ey respondents !o t!oug!t e

    spend a"out t!e rig!t amount o# money on !ig!ays and "ridges #rom sur(ey respondents !ot!oug!t e spend too muc! money on !ig!ays and "ridges)

    Among t!is set o# predictors, con#idence in Congress as !elp#ul in distinguis!ing among t!e groupsde#ined "y responses to opinion a"out spending on !ig!ays and "ridges) Sur(ey respondents !o !adless con#idence in congress ere less liely to "e in t!e group o# sur(ey respondents !o t!oug!t espend too little money on !ig!ays and "ridges, rat!er t!an t!e group o# sur(ey respondents !ot!oug!t e spend too muc! money on !ig!ays and "ridges) -or eac! unit increase in con#idence inCongress, t!e odds o# "eing in t!e group o# sur(ey respondents !o t!oug!t e spend too littlemoney on !ig!ays and "ridges decreased "y 7J)7?) Sur(ey respondents !o !ad less con#idence incongress ere less liely to "e in t!e group o# sur(ey respondents !o t!oug!t e spend a"out t!erig!t amount o# money on !ig!ays and "ridges, rat!er t!an t!e group o# sur(ey respondents !ot!oug!t e spend too muc! money on !ig!ays and "ridges) -or eac! unit increase in con#idence inCongress, t!e odds o# "eing in t!e group o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!tamount o# money on !ig!ays and "ridges decreased "y 84)K?)

    1) *rue

    6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

    Ne found a statisticall significant overallrelationship between the combination ofindependent variables and the dependentvariable.

    There was no evidence of numerical problems inthe solution.

    Moreover! the classification accurac surpassedthe proportional b chance accurac criteria!supporting the utilit of the model.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    49/73

    Slide

    49Ansering t!e +uestion in pro"lem 1 ' 6

    *!e (aria"les GageG Hage, G!ig!est year o# sc!ool completedG Heduc and Gcon#idence inCongressG Hconlegis ere use#ul predictors #or distinguis!ing "eteen groups "ased onresponses to Gopinion a"out spending on !ig!ays and "ridgesG Hnatroad) *!ese predictorsdi##erentiate sur(ey respondents !o t!oug!t e spend too little money on !ig!ays and"ridges #rom sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and"ridges and sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on!ig!ays and "ridges #rom sur(ey respondents !o t!oug!t e spend too muc! money on!ig!ays and "ridges)

    Among t!is set o# predictors, con#idence in Congress as !elp#ul in distinguis!ing among t!egroups de#ined "y responses to opinion a"out spending on !ig!ays and "ridges) Sur(eyrespondents !o !ad less con#idence in congress ere less liely to "e in t!e group o# sur(eyrespondents !o t!oug!t e spend too little money on !ig!ays and "ridges, rat!er t!an t!egroup o# sur(ey respondents !o t!oug!t e spend too muc! money on !ig!ays and "ridges)-or eac! unit increase in con#idence in Congress, t!e odds o# "eing in t!e group o# sur(eyrespondents !o t!oug!t e spend too little money on !ig!ays and "ridges decreased "y7J)7?) Sur(ey respondents !o !ad less con#idence in congress ere less liely to "e in t!egroup o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on

    !ig!ays and "ridges, rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spend toomuc! money on !ig!ays and "ridges) -or eac! unit increase in con#idence in Congress, t!eodds o# "eing in t!e group o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amounto# money on !ig!ays and "ridges decreased "y 84)K?)

    1) *rue

    6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

    Ne verified that each statement about the relationshipbetween an independent variable and the dependentvariable was correct in both direction of the relationshipand the change in li5elihood associated with a one-unitchange of the independent variable! for both of thecomparisons between groups stated in the problem.

    The answer to the question is truewith caution.

    ? caution is added because of theinclusion of ordinal level variables.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    50/73

    Slide

    5!$ro"lem 6

    1) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o#a statisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases,and t!at t!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o#signi#icance o# 4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les G!ig!est year o# sc!ool completedG Heduc, Gse.G Hse. and Gtotal #amily incomeGHincomeK8 ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses toGopinion a"out spending on space e.plorationG Hnatspac) *!ese predictors di##erentiate sur(eyrespondents !o t!oug!t e spend too little money on space e.ploration #rom sur(ey

    respondents !o t!oug!t e spend too muc! money on space e.ploration and sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration #romsur(ey respondents !o t!oug!t e spend too muc! money on space e.ploration)

    Among t!is set o# predictors, total #amily income as !elp#ul in distinguis!ing among t!egroups de#ined "y responses to opinion a"out spending on space e.ploration) Sur(eyrespondents !o !ad !ig!er total #amily incomes ere more liely to "e in t!e group o# sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration,rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spend too muc! money on space

    e.ploration) -or eac! unit increase in total #amily income, t!e odds o# "eing in t!e group o#sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on spacee.ploration increased "y N)4?)

    1) *rue

    6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    51/73

    Slide

    5Dissecting pro"lem 6 ' 1

    &. 'n the dataset (SS)$$$, is the following statement true, false, or an incorrect

    application of a statistic* +ssume that there is no problem with missing data, outliers, orinfluential cases, and that the validation analysis will confirm the generaliability of theresults. -se a level of significance of $.$ for evaluating the statistical relationships.

    *!e (aria"les G!ig!est year o# sc!ool completedG Heduc, Gse.G Hse. and Gtotal #amily incomeGHincomeK8 ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses toGopinion a"out spending on space e.plorationG Hnatspac) *!ese predictors di##erentiate sur(eyrespondents !o t!oug!t e spend too little money on space e.ploration #rom sur(eyrespondents !o t!oug!t e spend too muc! money on space e.ploration and sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration #romsur(ey respondents !o t!oug!t e spend too muc! money on space e.ploration)

    Among t!is set o# predictors, total #amily income as !elp#ul in distinguis!ing among t!egroups de#ined "y responses to opinion a"out spending on space e.ploration) Sur(eyrespondents !o !ad !ig!er total #amily incomes ere more liely to "e in t!e group o# sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration,rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spend too muc! money on space

    e.ploration) -or eac! unit increase in total #amily income, t!e odds o# "eing in t!e group o#sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on spacee.ploration increased "y N)4?)

    1) *rue

    6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

    For these problems! we willassume that there is no problemwith missing data! outliers! or

    influential cases! and that thevalidation analsis will confirmthe generaliCabilit of theresults

    In this problem! we are told touse ).)& as alpha for themultinomial logistic regression.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    52/73

    Slide

    52Dissecting pro"lem 6 ' 6

    1) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o# astatisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, and t!att!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o# signi#icance o#4)4> #or e(aluating t!e statistical relations!ips)

    /he variables "highest year of school completed" [educ], "se0" [se0] and "total family income"[income%#]ere use#ul predictors #or distinguis!ing "eteen groups based on responses to"opinion about spending on space e0ploration" [natspac].*!ese predictors di##erentiate sur(eyrespondents !o t!oug!t e spend too little money on space e.ploration #rom sur(ey respondents!o t!oug!t e spend too muc! money on space e.ploration and sur(ey respondents !o t!oug!t espend a"out t!e rig!t amount o# money on space e.ploration #rom sur(ey respondents !o t!oug!te spend too muc! money on space e.ploration)

    Among t!is set o# predictors, total #amily income as !elp#ul in distinguis!ing among t!e groupsde#ined "y responses to opinion a"out spending on space e.ploration) Sur(ey respondents !o !ad!ig!er total #amily incomes ere more liely to "e in t!e group o# sur(ey respondents !o t!oug!t

    e spend a"out t!e rig!t amount o# money on space e.ploration, rat!er t!an t!e group o# sur(eyrespondents !o t!oug!t e spend too muc! money on space e.ploration) -or eac! unit increase intotal #amily income, t!e odds o# "eing in t!e group o# sur(ey respondents !o t!oug!t e spenda"out t!e rig!t amount o# money on space e.ploration increased "y N)4?)

    1) *rue

    6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

    SPSS onl supports direct orsimultaneous entr of independentvariables in multinomial logisticregression! so we have no choice ofmethod for entering variables.

    The variables listed first in the problemstatement are the independent variablesIGs(9 "highest ear of school completed"Leduc! "se+" Lse+ and "total familincome" Lincome$.

    The variable used to definegroups is the dependentvariable G(9 "opinion aboutspending on space

    e+ploration" Lnatspac.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    53/73

    Slide

    53Dissecting pro"lem 6 ' 3

    1) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o# a statisticEAssume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, and t!at t!e (alidationanalysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o# signi#icance o# 4)4> #or e(aluating t!estatistical relations!ips)

    *!e (aria"les G!ig!est year o# sc!ool completedG Heduc, Gse.G Hse. and Gtotal #amily incomeG HincomeK8ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses to Gopinion a"out spending onspace e.plorationG Hnatspac) /hese predictors differentiate survey respondents who thought we spendtoo little money on space e0ploration from survey respondents who thought we spend too much moneyon space e0ploration and survey respondents who thought we spend about the right amount of money onspace e0ploration from survey respondents who thought we spend too much money on spacee0ploration.

    Among t!is set o# predictors, total #amily income as !elp#ul in distinguis!ing among t!e groups de#ined "yresponses to opinion a"out spending on space e.ploration) Sur(ey respondents !o !ad !ig!er total #amily

    incomes ere more liely to "e in t!e group o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!tamount o# money on space e.ploration, rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spendtoo muc! money on space e.ploration) -or eac! unit increase in total #amily income, t!e odds o# "eing in t!egroup o# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.plorationincreased "y N)4?)

    1) *rue

    6) *rue it! caution

    3) -alse

    SPSS multinomial logistic regression models the relationshipb comparing each of the groups defined b the dependentvariable to the group with the highest code value.

    The responses to opinion about spending on the spaceprogram were9#2 Too little! 1 2 ?bout right! and 2 Too much.

    The analsis will result in two comparisons9: surve respondents who thought we spend too little mone

    versus surve respondents who thought we spend too muchmone on space e+ploration

    : surve respondents who thought we spend about the rightamount of mone versus surve respondents who thought wespend too much mone on space e+ploration.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    54/73

    Slide

    54Dissecting pro"lem 6 ' J

    *!e (aria"les G!ig!est year o# sc!ool completedG Heduc, Gse.G Hse. and Gtotal #amily incomeGHincomeK8 ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses toGopinion a"out spending on space e.plorationG Hnatspac) *!ese predictors di##erentiate sur(eyrespondents !o t!oug!t e spend too little money on space e.ploration #rom sur(eyrespondents !o t!oug!t e spend too muc! money on space e.ploration and sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration #romsur(ey respondents !o t!oug!t e spend too muc! money on space e.ploration)

    +mong this set of predictors, total family income was helpful in distinguishing among thegroups defined by responses to opinion about spending on space e0ploration. Surveyrespondents who had higher total family incomes were more likely to be in the group ofsurvey respondents who thought we spend about the right amount of money on spacee0ploration, rather than the group of survey respondents who thought we spend too muchmoney on space e0ploration. For each unit increase in total family income, the odds ofbeing in the group of survey respondents who thought we spend about the right amount ofmoney on space e0ploration increased by 1.$.

    1) *rue

    6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

    7ach problem includes a statement about therelationship between one independent variable andthe dependent variable. The answer to theproblem is based on the stated relationship!ignoring the relationships between the otherindependent variables and the dependent variable.

    This problem identifies a difference for onl oneof the two comparisons based on the three valuesof the dependent variable.

    ;ther problems will specif both of the possiblecomparisons.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    55/73

    Slide

    55Dissecting pro"lem 6 ' >

    *!e (aria"les G!ig!est year o# sc!ool completedG Heduc, Gse.G Hse. and Gtotal #amily incomeGHincomeK8 ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses toGopinion a"out spending on space e.plorationG Hnatspac) *!ese predictors di##erentiate sur(eyrespondents !o t!oug!t e spend too little money on space e.ploration #rom sur(eyrespondents !o t!oug!t e spend too muc! money on space e.ploration and sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration #romsur(ey respondents !o t!oug!t e spend too muc! money on space e.ploration)

    Among t!is set o# predictors, total #amily income as !elp#ul in distinguis!ing among t!egroups de#ined "y responses to opinion a"out spending on space e.ploration) Sur(eyrespondents !o !ad !ig!er total #amily incomes ere more liely to "e in t!e group o# sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration,rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spend too muc! money on spacee.ploration) -or eac! unit increase in total #amily income, t!e odds o# "eing in t!e group o#sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on spacee.ploration increased "y N)4?)

    1) *rue 6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

    In order for the multinomial logistic regressionquestion to be true! the overall relationship mustbe statisticall significant! there must be noevidence of numerical problems! the classificationaccurac rate must be substantiall better thancould be obtained b chance alone! and thestated individual relationship must be statisticallsignificant and interpreted correctl.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    56/73

    Slide

    56L=2=L - M=ASFR=M=:* ' 1

    1) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o#a statisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases,and t!at t!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o#signi#icance o# 4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les G!ig!est year o# sc!ool completedG Heduc, Gse.G Hse. and Gtotal #amily incomeGHincomeK8 ere use#ul predictors #or distinguis!ing "eteen groups based on responses to"opinion about spending on space e0ploration" [natspac]. /hese predictors differentiatesurvey respondents who thought we spend too little money on space e0ploration from

    survey respondents who thought we spend too much money on space e0ploration andsurvey respondents who thought we spend about the right amount of money on spacee0ploration from survey respondents who thought we spend too much money on spacee0ploration.

    Among t!is set o# predictors, total #amily income as !elp#ul in distinguis!ing among t!egroups de#ined "y responses to opinion a"out spending on space e.ploration) Sur(eyrespondents !o !ad !ig!er total #amily incomes ere more liely to "e in t!e group o# sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration,

    rat!er t!an t!e group o# sur(ey respondents !o t!oug!t e spend too muc! money on spacee.ploration) -or eac! unit increase in total #amily income, t!e odds o# "eing in t!e group o#sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on spacee.ploration increased "y N)4?)

    1) *rue

    6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

    Multinomial logistic regression requires that thedependent variable be non-metric and theindependent variables be metric or dichotomous.

    ";pinion about spending on space e+ploration"Lnatspac is ordinal! satisfing the non-metriclevel of measurement requirement for thedependent variable.

    It contains three categories9 surve respondentswho thought we spend too little mone! aboutthe right amount of mone! and too much moneon space e+ploration.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    57/73

    Slide

    57L=2=L - M=ASFR=M=:* ' 6

    1) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o# astatisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, andt!at t!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o#signi#icance o# 4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les "highest year of school completed" [educ], "se0" [se0] and "total family income"

    [income%#]ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses toGopinion a"out spending on space e.plorationG Hnatspac) *!ese predictors di##erentiate sur(eyrespondents !o t!oug!t e spend too little money on space e.ploration #rom sur(eyrespondents !o t!oug!t e spend too muc! money on space e.ploration and sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration #romsur(ey respondents !o t!oug!t e spend too muc! money on space e.ploration)

    Among t!is set o# predictors, total #amily income as !elp#ul in distinguis!ing among t!e groupsde#ined "y responses to opinion a"out spending on space e.ploration) Sur(ey respondents !o!ad !ig!er total #amily incomes ere more liely to "e in t!e group o# sur(ey respondents !ot!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration, rat!er t!an t!e groupo# sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on spacee.ploration) -or eac! unit increase in total #amily income, t!e odds o# "eing in t!e group o#sur(ey respondents !o t!oug!t e spend a"out t!e rig!t amount o# money on spacee.ploration increased "y N)4?)

    1) *rue

    6) *rue it! caution

    "Se+" Lse+ is dichotomous!satisfing the metric ordichotomous level of measurementrequirement for independentvariables.

    ">ighest ear of schoolcompleted" Leduc is interval!satisfing the metric ordichotomous level ofmeasurement requirement forindependent variables.

    "Total famil income" Lincome$ is ordinal!satisfing the metric or dichotomous level ofmeasurement requirement for independentvariables. If we follow the convention of treatingordinal level variables as metric variables! the level

    of measurement requirement for the analsis issatisfied. Since some data analsts do not agreewith this convention! a note of caution should beincluded in our interpretation.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    58/73

    Slide

    58Re+uest multinomial logistic regression

    Select theRegression |Multinomial Logisticcommand from the

    Analyze menu.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    59/73

    Slide

    59Selecting t!e dependent (aria"le

    Second! clic5 on the rightarrow button to move thedependent variable to theDependentte+t bo+.

    First! highlight thedependent variablenatspacin the listof variables.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    60/73

    Slide

    6!Selecting non'metric independent (aria"les

    Move the non-metricindependent variableslisted in the problem tothe actor(s)list bo+.

    Select thedichotomousvariable se.

    Don-metric independent variables are specified asfactors in multinomial logistic regression. Don-metricvariables can be either dichotomous! nominal! or ordinal.

    These variables will be dumm coded as needed andeach value will be listed separatel in the output.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    61/73

    Slide

    6Selecting metric independent (aria"les

    Move the metricindependent variables!educand income*+! tothe Covariate(s)list bo+.

    Metric independent variables are specified as covariatesin multinomial logistic regression. Metric variables canbe either interval or! b convention! ordinal.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    62/73

    Slide

    62Speci#ying statistics to include in t!e output

    Nhile we will accept most ofthe SPSS defaults for theanalsis! we need to specificallrequest the classification table.

    lic5 on the !tatisticsO buttonto ma5e a request.

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    63/73

    Slide

    63Re+uesting t!e classi#ication ta"le

    First! 5eep the SPSSdefaults for !ummarystatistics! Li"eli#ood

    ratio test! and$arameter estimates.

    Second! mar5 thechec5bo+ for the

    Classi%ication ta&le.

    Third! clic5on theContinuebutton to

    complete therequest.

    C l ti g t! lti i l

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    64/73

    Slide

    64

    Completing t!e multinomiallogistic regression re+uest

    lic5 on the '

    button to requestthe output for themultinomial logisticregression.

    The multinomial logistic procedure supportsadditional commands to specif the modelcomputed for the relationships we will use thedefault main effects model(! additionalspecifications for computing the regression!and saving classification results. Ne will notma5e use of these options.

    Slid

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    65/73

    Slide

    65

    Case Processing Summary

    && 15.9'

    90 4&.&'

    85 40.9'

    94 45.2'

    114 54.8'

    208 100.0'

    62

    270

    1&8a

    1

    2

    &

    "+! ?LO+IO

    O)+M

    1

    2

    "O" "?

    alid

    Mi33ingotal

    "$pop$lation

    Marginal

    ercentage

    e dependent ariale a3 only one al$e o3ered in 112

    81.2' 3$pop$lation3.

    a.

    Sample si%e ratio o# cases to (aria"les

    Multinomial logistic regression requires that the minimum ratioof valid cases to independent variables be at least #) to #. Theratio of valid cases 1)$( to number of independentvariables ( was 3. to #! which was equal to or greater than

    the minimum ratio. The requirement for a minimum ratio ofcases to independent variables was satisfied.

    The preferred ratio of valid cases to independent variables is1) to #. The ratio of 3. to # was equal to or greater than thepreferred ratio. The preferred ratio of cases to independentvariables was satisfied.

    Slid 2=RALL R=LA*I:S5I$ =*W==:

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    66/73

    Slide

    66

    Model Fitting Information

    &54.268

    &&4.967 19.&01 6 .004

    Model

    Intercept Only

    Final

    -2 Log

    Likeliood !i-"#$are d% "ig.

    2=RALL R=LA*I:S5I$ =*W==:I:D=$=:D=:* A:D D=$=:D=:* 2ARIAL=S

    The presence of a relationship between the dependentvariable and combination of independent variables isbased on the statistical significance of the final modelchi-square in the SPSS table titled "Model FittingInformation".

    In this analsis! the probabilit of the model chi-square#.)#( was ).))%! less than or equal to the level ofsignificance of ).)&. The null hpothesis that there wasno difference between the model without independentvariables and the model with independent variables wasre*ected. The e+istence of a relationship between theindependent variables and the dependent variable wassupported.

    Slid

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    67/73

    Slide

    67

    Parameter Estimates

    -4.1&6 1.157 12.779 1 .000

    .101 .089 1.276 1 .259 1.106

    .097 .050 &.701 1 .054 1.102

    .672 .426 2.488 1 .115 1.959

    0 . . 0 . .

    -2.487 .840 8.774 1 .00&

    .108 .068 2.521 1 .112 1.114

    .058 .0&4 2.9&2 1 .087 1.060

    .501 .&17 2.492 1 .114 1.650

    0 . . 0 . .

    Intercept

    :!

    I!OM98

    @"?A1B

    @"?A2B

    Intercept

    :!

    I!OM98

    @"?A1B

    @"?A2B

    "+! ?LO+IO

    O)+Ma

    1

    2

    / "td. rror *ald d% "ig. ;p/

    e re%erence category i3= &.a.

    i3 para>eter i3 3et to Cero eca$3e it i3 red$ndant..

    :FM=RICAL $RL=MS

    Multicollinearit in the multinomiallogistic regression solution isdetected b e+amining thestandard errors for the bcoefficients. ? standard errorlarger than 1.) indicates numerical

    problems! such as multicollinearitamong the independent variables!Cero cells for a dumm-codedindependent variable because all ofthe sub*ects have the same valuefor the variable! and ,completeseparation, whereb the twogroups in the dependent eventvariable can be perfectl separatedb scores on one of the

    independent variables. ?nalsesthat indicate numerical problemsshould not be interpreted.

    Done of the independent variablesin this analsis had a standarderror larger than 1.).

    Slid R=LA*I:S5I$ - I:DI2IDFAL I:D=$=:D=:*

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    68/73

    Slide

    68

    R=LA*I:S5I$ - I:DI2IDFAL I:D=$=:D=:*2ARIAL=S * D=$=:D=:* 2ARIAL= ' 1

    Likelihood Ratio Tests

    &&4.967a .000 0 .

    &&7.788 2.821 2 .244

    &40.154 5.187 2 .075

    &&8.511 &.544 2 .170

    %%ect

    Intercept

    :!

    I!OM98

    "?

    -2 Log

    Likeliood o%

    ed$ced

    Model !i-"#$are d% "ig.

    e ci-3#$are 3tati3tic i3 te di%%erence in -2 log-likeliood3

    etodel and a red$ced >odel. e red$ced >odel

    i3 %or>ed y o>itting an e%%ect %ro> te %inal >odel. e n$ll

    ypote3i3 i3 tat all para>eter3 o% tat e%%ect are 0.

    i3 red$ced >odel i3 e#$ialent to te %inal >odel eca$3e

    o>itting te e%%ect doe3 not increa3e te degree3 o% %reedo>.

    a.The statistical significance of the relationship betweentotal famil income and opinion about spending on spacee+ploration is based on the statistical significance of thechi-square statistic in the SPSS table titled "

  • 7/23/2019 MultinomialLogisticRegression_BasicRelationships

    69/73

    Slide

    69Ansering t!e +uestion in pro"lem 6

    1) In t!e dataset SS6444, is t!e #olloing statement true, #alse, or an incorrect application o#a statisticE Assume t!at t!ere is no pro"lem it! missing data, outliers, or in#luential cases, andt!at t!e (alidation analysis ill con#irm t!e generali%a"ility o# t!e results) Fse a le(el o#signi#icance o# 4)4> #or e(aluating t!e statistical relations!ips)

    *!e (aria"les G!ig!est year o# sc!ool completedG Heduc, Gse.G Hse. and Gtotal #amily incomeGHincomeK8 ere use#ul predictors #or distinguis!ing "eteen groups "ased on responses toGopinion a"out spending on space e.plorationG Hnatspac) *!ese predictors di##erentiate sur(eyrespondents !o t!oug!t e spend too little money on space e.ploration #rom sur(ey

    respondents !o t!oug!t e spend too muc! money on space e.ploration and sur(eyrespondents !o t!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration #romsur(ey respondents !o t!oug!t e spend too muc! money on space e.ploration)

    Among t!is set o# predictors, total #amily income as !elp#ul in distinguis!ing among t!e groupsde#ined "y responses to opinion a"out spending on space e.ploration) Sur(ey respondents !o!ad !ig!er total #amily incomes ere more liely to "e in t!e group o# sur(ey respondents !ot!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration, rat!er t!an t!e groupo# sur(ey respondents !o t!oug!t e spend too muc! money on space e.ploration) -or eac!

    unit increase in total #amily income, t!e odds o# "eing in t!e group o# sur(ey respondents !ot!oug!t e spend a"out t!e rig!t amount o# money on space e.ploration increased "y N)4?)

    1) *rue

    6) *rue it! caution

    3) -alse

    J) Inappropriate application o# a statistic

    Ne found a statisticall significant overallrelationship between the combination ofindependent variables and the dependentvariable.

    There was no evidence of numerical problems inthe solution.

    >owever! the individual relationship betweentotal famil income a