Top Banner
DOCUMENT RESUME -: D 173 TITLE Proceedis s of the Itivit tional cinference '-ProbXems (10th. NSW York' tifw York tictobP 1954). aisTITanolLestinServias%Pzj-'ldetanf PUB DATE . 30 Oct 5L -- NOTE 30p. EURS PRICE DESCRIPTORS MFOI/PC06 Plus Postag-e. *Academically Gifted; ,iAcbelera ed.RrogramS; Advanded. Placement PrograMs; AnalysiS..of'VariAnc,;- Depision. Making ;; Educational l.eitittn. ElementaiySee-A7ndary. Education; Factor Analy$1s; Group Dynamics; ?*Group Pelationsg_Higher.EApcdtion; Individual Differences;, *information Theory.; Job- Placement;.tfanpOier Predictor Variables,; PsyChological- TestinsyChometrics; Social, 'Relations.; *Testing Problems IDENTIFIERS. Latent-Structure Analysis; Russell RelationsTast ABSTRACT Four-Malor'topiC-s .were-,iscussed -the-conferente:- . application of informatio,n theory to *testIng;- recemtadvances.in psychometric .methods;-evaluatingq.ro;np-intraCtion;and new 4). deVelopments in the education.. of abler students.Papers delivered- :SUltiple Assignment-of P,-IrsOns ta,JObs, by Paul S Dwyer.; N. Light on Test Strategy-frbm Dcision.The4ry,. by J. Cro.nbach; Thy Relation Between Uncertainty and variance, by William J. McGill; Sote ---Reent Results-in Latent Structure Analysis, by. T. W. Anderson; The Moderator Variable" as a- Useful Tool In Prediction, by David R. Saunders; A Meth0dof factoring Withbut Communalities, by 'T*arstopeL A Ne4 Techhique for Ih4Vidhal Differences in Conformity t© Group Judgment, by Richard S.,Iirutchfield; The Russell Sage Social Relations Test: A riasure of G'roup problemSolving Skills in Elementary SchOol Childr4n, by Dora m. Damrin;,DescriOtion of GrOup:Characteristic. by John K. Hemphill;. Acceleration: Basic Principles- and Recent Research, by Sid1-1,;y L. Pressey;.College AdmiSSion with Advanced Standing, by William A Cornol; and Special Treatment for Abler Students and Its ,eiation to National Manpower, L. Wolf-le., (MI eproductions supplied by EDRS a,-: the best th ,_ can be made from th-e origihal'documer ******** *********.**************** *- ******
65

TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar¬ well own to.inost workers in educational and`psYchological measurement.

May 08, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

DOCUMENT RESUME -:

D 173

TITLE Proceedis s of the Itivit tional cinference'-ProbXems (10th. NSW York' tifw York tictobP

1954).aisTITanolLestinServias%Pzj-'ldetanfPUB DATE . 30 Oct 5L --

NOTE 30p.

EURS PRICEDESCRIPTORS

MFOI/PC06 Plus Postag-e.*Academically Gifted; ,iAcbelera ed.RrogramS; Advanded.Placement PrograMs; AnalysiS..of'VariAnc,;- Depision.Making ;; Educational l.eitittn. ElementaiySee-A7ndary.Education; Factor Analy$1s; Group Dynamics; ?*GroupPelationsg_Higher.EApcdtion; Individual Differences;,*information Theory.; Job- Placement;.tfanpOier

Predictor Variables,; PsyChological-TestinsyChometrics; Social, 'Relations.; *TestingProblems . .

IDENTIFIERS. Latent-Structure Analysis; RussellRelationsTast

ABSTRACTFour-Malor'topiC-s .were-,iscussed -the-conferente:-

.

application of informatio,n theory to *testIng;- recemtadvances.inpsychometric .methods;-evaluatingq.ro;np-intraCtion;and new 4).

deVelopments in the education.. of abler students.Papers delivered-:SUltiple Assignment-of P,-IrsOns ta,JObs, by Paul S Dwyer.; N.

Light on Test Strategy-frbm Dcision.The4ry,. by J. Cro.nbach; ThyRelation Between Uncertainty and variance, by William J. McGill; Sote

---Reent Results-in Latent Structure Analysis, by. T. W. Anderson; TheModerator Variable" as a- Useful Tool In Prediction, by David R.Saunders; A Meth0dof factoring Withbut Communalities, by'T*arstopeL A Ne4 Techhique for Ih4Vidhal Differences inConformity t© Group Judgment, by Richard S.,Iirutchfield; The RussellSage Social Relations Test: A riasure of G'roup problemSolving Skillsin Elementary SchOol Childr4n, by Dora m. Damrin;,DescriOtion ofGrOup:Characteristic. by John K. Hemphill;. Acceleration: BasicPrinciples- and Recent Research, by Sid1-1,;y L. Pressey;.CollegeAdmiSSion with Advanced Standing, by William A Cornol; and SpecialTreatment for Abler Students and Its ,eiation to National Manpower,

L. Wolf-le., (MI

eproductions supplied by EDRS a,-: the best th ,_ can be madefrom th-e origihal'documer

******** *********.*****************-

******

Page 2: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

U De FAR TtA Epp oF HE AL TH.DUCATION A WILFARE

NATIONAL INSTITUTE OT

THIS DOCUMENT HAS BEEN REPRO.DUCE° EXACTLY AS RECEIVE.ti FROMTHE PERSON OR ORGANIZATIONATING IPOliaTS Or VIEW OW oiDINIoNssTATEco DO NOT NECESSARILY REPRE-SENT OFFICIAL NATIONAL INSTITUTE OFEDUCATION POSITION OR POLICY

:TERMISSION TO REPi-1013L4pE THISIfiEHIAL HA6 BrEpol GRANTErf BY

TO THE EOuCATIONAL RESOURCESINFORMATION CENTER (ERIC1-

Page 3: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

BOARD- OF . TRUSTEES, 14544955 .

.Lewis W. Jades, Chairman

Samuel. T. Arnold ,Katharine E. McBride,

Frank IMF. Bowles Thomas R. McConnell

Doak S. Cdmpbell Ed;Arard S. Noyes

Charles W. Cole: William G. Sal onst

Donaccl.K: David-

Benjamin

OFFICERS

Henry Chauncey, President

Stillivan, Vice Prisident a'nd:Trea

iam W. Turnbull, Vice President

Henry Dyer, Vice ?residentK. Rimalover, Seci-etary

Cathirin G. Sharp, Assistant Speretary

Robert F. Kolkebeck, Assistant Treasurer

, '1

COPYRIGHT, 1055; EDUCATIONAL TESTING SERVICE20 NASSAU STREET, PRINCETON, N. J.

PRINTED' IN THE UNITED STATES OF AMERICA

Library of Congress Catalog Number: 47-11220

Page 4: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TAno L CONFEREN E

p

'E ING PR

OCTOBER 1954,

EDWARD E. E. CURCTON, Chairman

Application of Information Theory to Testing

Recent Advances in Psychometric Methods

Evaluating Group Interaotion..4

New Developments in fhe Education of Abler Students

EDVCATIONAL TESTING 'SERVICE

11, NEW JERSEY LOS ANGELES, CALIFORNIA

Page 5: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

The 1954 Invitational Conference on T'estin foplems, the tenth ofesdimeetings and the seyent4;.spollsoied- ducational Testing

ervice;'wal one of the most successful. S and invited guestsfrent t*ar. and far participated in a program o 6 scope and significance.'This ptiblished record of the proceedings, *ill, We hope, carrythe Words and wisdom of the participants to n 'even greater audience.

To Edward E/cureton, Chairman of the1954 Conference, goes_fullcredo fox:a 'job well donein spite of an artluoI professional schedule.His imagination, energy, and attention to d ail were rimarily rc

aside ed and well conducted progtam. Hi- su,cessful efforts are deeply appreciated by the many who enjoye1954 Conference.

.FIENTIY CHAUNCEY

President

Page 6: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

Invitational on Testing obi s, sponsoredTesthig Service, "was held in Now York City at 'the

I in pa* years, on October,00. 114;yolurne providesof . e pa and discussions .' i .

I . i event prpgrittn, it was,' necessary to try to

.. ....,y2.; eialinrecent ears 'the at-._ , r. - 4," i I. it: IL ±,J I ; 1 I '

on proportions. More anof ,those invited are ustially

try. to preserve, something ofavitational conference, and at the same time

g e I which would to-, some 500-odd peopleof diverse interests d ba unds..

At the technical eirel measurementtheory is undergoing rapid re-f

the impact coMmunibation.theory, inlorniation the?ry,ion th ry, latent structure anal' sis, factor analysis, and

e theory, to name only few. These theor are all highly mathe-,atic 1, ,and the branches of mathematics which they use ar&not wellown to.inost workers in educational and`psYchological measurement.

In the broader field of eval ation,' moreover, clinical psychologists addsocial psychologists/ _are d vetoing new methods of assessing suchthings as creative talents, personality traits, the dimensions of grouinteraction, the nature and quality of leadership in various settings, anthe processes of human judgrnent. These methods are in many casesquite different from those employed in assessing cognitive aptitudesand school achievement. It was felt that the new theories and, methods .t.,should be brought to the attention of measurement workers despitetherather considerable obstacles to effective communication. The secondproblem was to try to reduce these obstacles as much as possible, andto rind speakep who could present some of the new theories and methods in terms- which measurement workers could understand.

We werei, fortunate in securing as our luncheon speaker Dr. DanielStarch, .whose address, ". . . And Have Not Wisdom, recalled forciblythe need to teach students how to make ethical value judgments, andthe need to develop methods for measuring- the attainment of -this vital ,

educational objective. , ..

The rest of the program' consist of a,first Morning session on someapplications of information theo 'to _testin problems, two parallelsessions laterAri the morning, one on recent dvances in psychometric ,_methods and one on the evaluation of ou interaction, and an after=noon session on new developments in the education of abler students.

4We hope that this program achieved the barances. implied by its obitives. . i. ..".-..

The Chairman welcomes this opportunity to express his sinceappreciation to all the speakers for thejr contribliti ons, toiEducationakT sp onsoring`Testing Service for sponsong the Conference, to-' Jack K. Rimaloverfor his unfailing- support, assistance, and counsel, and to Mrs. Cath-erine p. Sharp for her assistance in making such exacellent local ar-

erangements.EDWARD E. CURETON, Chairman1954 Conference

Page 7: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

-A ipli t of

LE

ul S:

GENE FETING

o atian Theo to Testing"

oNM OF oNs To Jo_ 11

a+vyer amity of Michigan

TEST STRATEGY FROM fiECISION TI-LEORY 30b Leh, Lininersa IllinoisLee

RELATION Biri"WT:EN,U

William J. McGill:Technology/

1113 VARIANGS

achusetts Institute of sr

6

SECTIONA4

MEETINGS

ION -Recent Advances in Psychometric MelhodsChaiwan and Discussion Leader

Phillip J. Rulafi, Hartard University

SOME RECENT R IN LATENT STRUCTURE ANkl. rts 49T. W. And _pryColumbia Linke

THE ''114,0DERA OR VARIABLE" AS USE ©OL IN

David R. Saunders, Educe at Teiting 9

ETHOU OF FACTORING WITHOUT COMMUNA

L. L. Thurstone, Univ y 'of North Carolina

Page 8: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

SECTION II: ng Group Interaction"Chairman and DinliSit er

rirving:Lorge, Columbia University

ANENvToranmxin OR MEASURING INDIVMUAk DIfFERENCONFORMITY TO ROUF jUDGMiNT .. . . . .

Richard S. C, &field, University of California,I -i

THE RUSSELL OGUL RELATIONS TEST_ A MEASURE' OF

GROUP. PROS- xSOLVINOStiLLS IN 'ELEMENTARY SCHOOL

CHILDRENDora E. Damrin, Educational Testing Service

..,

DESCRIVIION OF CROUP CIULRACTERISI1CS

John K. tlemphill, phio State University-11

GHEON ADDRESS,.i ".. . And ace Not Wisdom".Daniel Starch, resident

Daniel Starc dr Staff

.GENERAL MEETING

"New Developmenti in the Education o f Abler S nts"

ACOEIErATION:Sidney L; P

ASIC PRINCIPI AND AECENT RESEARCH . .

ssey, Ohio State nivqrsity

Amdissiorg WITH ADVANCED STANDINqilliam H.,Cornog, Philcglelppla coviral High School

SPEC TREATMENT FOR AALER STUDENTS AND ITS RELATIO

TO NATIONAL MANPOVV1M

Dad Wolge, American Association for the 4tivekace-i agent 'of Scienc

107,

Page 9: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

GENERAL T1

Appliegtioofn or ation Ttleory to Testing

Page 10: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

Multiple Assignments of Nrsons to Jobs

DWYER

I. Introduction. I want to talk to you about a problem arise;when men are to be,assigned to jobs in the most efficient manner. Inthe tine at my disposal I can give you only a brief outline of the natureof the prtiblem, of the general methods proposed for its solution, andof my recent work on the solution of the problem with the use of trans-formations. However, I am giving yonupplementary material whiThyou may examine lat& in'more detail if yop 'desire to Obtain a morecomprehensive Ifiqw of the problemand the rnefirds of solution.

2. The Nature of The ProblemA simple illustratjon may serve t6 give Iyou -some idea- df the nature of the problem. A corporation hires 4college graduates to fill 4 vacancies without detirmining which indi-vidual is to be placed on which job. These graduates, though they havediffering abilitie's as indicated by their records, are all hired for thesame. salary. The problem of the corporation is then to plate these 4men on the 4 jobs An such a way that the corporation.will obtain maxi-mum value from their services.

The corporation may do this by estimating the worth to the corpora-tion in thousands of dollars per year of each individual if he were to beplaced in each one of the 4 jobs. Such a set of estimates is shown inTable I. The entries in the table show the values, denoted by cii,swhichindicate the estimated contribution to the total effort (in units ofWOO) which individual i will make if he is placed on job j. Thus,iridividual 1 is most valuble on job 1 but so is individu'al 2 and indi-vidual 4. The problem is to,.place all 4 individuals on all 4 jobs in sucha way thatithe sum of the assigned cii values is as large as possible.

Now since each individual can fill one and but one job and sinceeach job must be 'filW by some one individual, it folldws that anyassignment of the 4 individuals to the 4 jobs must involve one andone selection from each row .and from each column. Hence the problem

,4 This research was sup rted in part by the United States Mr Force under Con-tract No. AF18(800)-1I t monitored by Director, Detachment 4 (Crew Re-search Laboratory), Air Force Personnel and Trairang Research Center, RandolphAir Force Base, Randolph Field, Texas. Permission is granted for reproduction,translation, publication, use and disposal ircw,hole and in part by or for the UnitedStates Government.

11'

Page 11: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

12 1954 UNIVITATIONAL CONFERENCE

becomes one of -making selections of eij values, one from each rowand one from each column, so that the sum is as large as possible.

A more general statement of the problem results from thAisplayof Table II where the ell values of N jobs .and N individuals are indi-cated.- The problem is to select values of cf., So that the sum of the

----err-values-so-selected is-as-hu-ge as possible-See references A. '

a Alternate Forms of the Problems: The form of Table I and Table IIis that of a square array: of cii values featuring N rows and N columns.As a result of grouping the number of columns may be reduted, insome problems, so that the array takes on rectangular form with heightgrpater `than width.' For example the problem-of Table I may be so.reduced since the values of cii hf column 3 are identical with. the clivalues of column 4. Inoo far as the solution is concerned, there is nodifferencebetween job 3 and job 4 so that the two jobs may be exo?together in a -common job cate f .,11 we denote the nUmber4 h,5f suchcataories by in, and the vim of jobs in job category j by qj, wehave m = 3, qi = 1, cis = 2 for the problem of Table I. Thevalues th, which indicate the numbers of individuals to be assignedto ihe- respective job categories, are called quotas. This form of thepreblem which features these job categories and quotas is sometimes

known as the quota form.The quota form of the problem of Table I is shown in Table III.

The quota form of Lthe general problem of Table II is shown inTable IV. .

Bows may also be grouped to form personnel categories when the euvalues in different rows are identical or approximately so. When personnel categories and job categories are both used, we have a two-waygrouped distribution which takes on the forItt sometimes calledthe fre-quency form. The number of personnel Categories is taken as n, andthe frequency in personnel category i is indicated by ft. bf course thesum of the ff values equals the sum of the qj values which Thisfrequency form of the problem is illustrated in Table V.

The form of Table I and Table II, Since it features the nongroupedvalues for both individual -and jobs, is sometimes called the ini dual

form.

4. Equivalent Prohlems. This prOblem is essentially the equivalent ofproblems in other -fields. For example the Hitchcock transportationproblem is the mathematical equivalent of the personnel classificatien

problem tlioug,h it Is for the,selection of the el, values from each rowand column so as minimize, rather than maxhnize, the sum. There

1

Page 12: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS- 13

is ion of these equivalent problems but fefeences B are provided for those who are interested.

S Illythods of Solution Previously Used Many methodsproposes oar the solution of this problem. Monographs giving survey

es .1.1 - tmeithodx are- mentlenedMreffrences, C. OP thesemethods- I, sho call your attention to the method p1 all possiblEt

i cub, the simplex Method, and*the method of optimal regiorks.met of. an possible assignments, every possible alter ve

meat is made Now there are-N1 Alternative assignments thosome of the tissignment sums may be equal. Thus with NTable T, the computation of the 41 = 24 poSsible sums shows us theMaximum sum of $23,000. But the-method of all possible assignments isimpractical for li.ger values of N since NI increase§ very rapidly.

The simplex method was designed for more general problems mlinear programming and game theory. Dantzig and Vbtaw have ap-plied it to this problem (see D). It is any thesis that the machinery ofthe simplex method is unnecessarily complex for this problem and thatsimpler methods, described below, are preferable.

The method of optimEd.region.s L yery -useful ixt solving the classifica-tion problem especially when the number of job categories is small(see E). But the method IyMh to discuss with you today-is the methodof transformations byeh the array of c1 .values can be transformedto a new array from which the solution can by obtained by the selectionof zero terms.

6. Method of Transformations. A solution of problem, and theremay be more than one: consists in the assignment of each individual ito some job j so that the sum of the cu.valuegis to be maximized. Thismaximum sum is not the solution, though it can be calc6lated once thesolution is known:A solution consists in the assignments, i.e., the pairsof values of 1 and j. Thus in Table I a solution consists in the assignmentof man 1 to job 1, man 2 to job 3, man 3 to job 4, and man 4 to job 2.If we indicate the job assigned top individUal i by Ji and consider themen in the order 1, 2, 3, 4 we can write this solution compactly by

1, 3, 4, 2. The solution sum b 6 + 6 -I- 5 ± 6 -7-' 23 units but thisis not the solution. The solution is simply the set of elements (i, J1).

I can new state an important relation which seeves as the basis. of themethod of transformations. Any Constant 'may be 'subtracted frdievery element in any row or column without changing the solution.Thy solution sum is decreased by the amount subtracted but the solu-tion is not changed. Hence we may subtract sLmultaneously constants

Page 13: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

ci.frorn every row i and constants cj from every row j without changingthe solution. The puipose of the method of transformations is to makeuse of successive subtractions- from rows and coltimils until an arrayresults from which the solution is immediately obtainable. Specificdirecti4ns follow.

The first step in Ole solution of a maximization problem is the sub-%

,,traction of the largest, element in each rah, from each element in thatrow. The results of these subtractions eijW, are either 0 or negative.The process is illustrated 'in Table-V.1 where the maximum values forthe rows of Table ,I are shown at the right of the first array and thevalues of cijw in the second array. Now the cijw array, since it con-tains only non-positive termsr cannpt have. a solution with a sumgreater than zero. We cannot locate a solution with sum 0 as long as'tiny column has all nob-4ero tears. So we subtraCt the largest element"in each eoldmn as indicated at the bottom of the array. The resultingcum array. has at least one zero in-each row and each coluiliti: See thethird array of Table VI. Sometimes a solution can be obtained fromthis array by using only 0 elements. This is not possible in Table VI.An additional tr=ansformation is in order.

Before indicating the nature of this transformation, we note that thesecond and third arrays of Table VI feature negative signs. Thesecould be eliminated if, in the first array, the elements were `subtractedfrom the maximum values rather than vice-versa. Then the subtractionof the, smallest element in each column of the second array is indi-cated: This process is illustrated in the first three arrays of TableVII. The positive elements' of the second and third of these arrays areidentical, aside from sign, with those of Table VI,

IVe are unable to find a solution using the 0 elements of the thirdarray of Table VII since the 0 terms in columns 2, 3, 4 are all in row 3.However if we' subtract 1 from this row, we can then also subtract1 (the smallest non-zero value in columns 2, 3 and 4) from each ele-ment of columns 2, 3 and 4 to form the c,," array. The net sum of thesubtracted values is 1 + 1 -f- 1 = 1 2 which is placed in the lowerright corner. In general any such transformation in which the sumof the subtracted constants is positive and which does not result innegative terms, may constitute the next step of the solution.

In this problem the solution can be obtained from the 0 terms of the,c,,' " array. The 0 terms indicating the solution are marked with aster-isks in the ci]''' array. The corresponding terms in the el, array are alsomarked with asterisks." The sum of these terms is the solution sum,23 units. This may be checked by subtracting from 26, the lower rightentry in the first array, the sum of the lower right entries in the two

Page 14: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS 15

arrays beldw it. In more complex probleins additional transformationsof this type may be necessary. But we can prove that it is always pos-sible to make transformations of this type so that the solution maybebased eventually on 0 elements. These transforrations are fairly easilydiscovered in practice since they are based o pans of zeros.

The method is applied to a problem in the frequency form, previ-Ously used by Votaw to illustrate the simplex method, in Table VIII.The cli(2' array results from the subtraction from the maximurnrowvalues followed by the subtraction of the minimum column values.But, considgring frequencies, there are not enough 0 terms in row 2and row 3 of the third array to satisfy the frequencies. However, thesubtraction of 1 from row 2 and row 3, with the subtraction of 1

from column 3 and column 4 (and a net sum of 30 ± 35 25 2812) leads to the ci,"' array with very many 0's. One of the many solu-tions immediately evident from this array is indicated by superscripts.

A final illustration '(Table IX) uses a modification of Brogden'squota form problem with 109 men and 4 job categories. The solutionfollows the steps outlined above. Subtractions are made from the ci,")so as to meet the quotas for column-3 and corresponding subtractionsare made from the rows so as to keep one 0 term in each row. Acorresponding treatment of thq first column of the ei,(2) matrix leadsto the ci,"' matrix with enough 0 terms to reveal the solution indi-cated in the column headed ji

The solution sum 708 units can be obtained by adding the valuesof the first array indicated by the solution. It can he checked by form-ing 723 (6 ± ).

7. Concluding Remarks. The method' of transformations just de-scribed gives a solution to the classification problem which is as simpleas one can expect. It can he programmed for machines but, exceptfor the most complex problems_ methods are quite satisfactory.

We now appear to have a nod solution for problems with true el,cvalues. When true c values are unavailable, as they commonly arc,

questions arise as to the estimation of the values, as to the validity andsampling errors of the estimates, as to the resulting effect on the for-mulation of the problem, etc. The study of this general area, involvingpossible alternative procedures using the information available, is veryimportant.

A narrower and more immediate practical problem also commonlyconfron v can we, with our present knowledge and informa-tion lculate useful estimates of the ci, to which we can applythe available techniques? (See F.) We might use standard scores

Page 15: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

1954 INVITATIONAL CONFERENCE

of variables correlated with success on thQ job, as in the Brogdenillustration, but, as the vocational counselor knows, such single predic-,tors are not commonly imrnediately observable. And even combina-tions of observable predictors, such as those obtained by regression?are not enough for this problem, even if valid; since they must betransformed to estimates of the contribution to the common effort.Some sort of a weight must be given to each particular job since ameasure of the importance of the -job, as well as the proficiency ofthe individual on the job, is needed for estimating the contribution ofthe individual to the common effort. How are we to determine thesejob weights? Aside from the matter of the validity of the predictorswe are forced, for the most part, to rely on the estimates of experts orto use hypothetical weights. In conclusion, I would like to pose= this

question for future research: How can we use available informationin obtaining practical objective measures of those job weights whosedetermination is prerequisite to any useful solution of the personnelclassification problem?

Table I A Simple Problem

c6 in $1000 units

2 3

1

234

673

8

344

6

4li

54

4

4

Table II: The General Problem

2 3 j N

1

23

. . .

NS

C11

C21

C3i

. . .

C II

CNi

C

C22

el/ 1. . ,

C i2

CN2

Cis

C23

Cas

C t3'

CN3

C2j

Cs]

. . .

C il

CN1

. . .

.

CiN

C2N

C3N

. . .

CD;

CNN'

Page 16: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS 17

Table I I Quota for Problem of Table I

cii in $1 units

(I; 2

1 6 3 4

2 7 4 6

3 3 4 5'4 8

Table IV: Quota Forth for General Problem

Cijl -

- --,

q2 q3 9J

1 C11 C12 C13 C1J C1n3

2 C21 C22 C23 C2j C2m

3 en en C33 C3j Cam

. . . . .

CH - C12 ei3 . . . C ij Cim

CM CN2 CN3 CPO Cl'im

Table V: Frequency Form for. General Problem

co

I,qi q2 q3

i_qi q

fif2

f3

. . ..-

f i

.

f.

C11

C21

C31

Cil

.

Cal

C12

C22

C32

.

C12

#

Cn2

A.

C,3

C23

C33

. . .

Cia

a @

CO3

Cu

C2j

C3j

. . .

Cii

a .

CO

CIM

C2m

C30

Cjm

@

elm

Page 17: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

954 INVITAT FERENCE

blc VI: Successive Transformations for Problem of Table

cis

4 Max.

6 3 4 4 67 4 6 6 7

3 4 5 5

8 6 4 4 S

ell,

2 3 4

1 0 3 2 22 0 3 1 13 2 1 0 04 0 2 t 4

Max. 0 1

cii(2)

2 3 4

1, 0 2 2 22 0 2 1 13 2 0 0 04 0 1 4 4

1 7

Page 18: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

2

2

34

2'34

TESTING PROBLEMS

Table VII: Solution of the Problem of Table .I

19

Max.C j

3446*

46* s5 _

4*

4G

5*4

67

5

8

26

c Li( I)

0 3.3 ,-

2 21

0

1

2

20

201

21

04

0

1 1 1

A

Ji

0*

30

1

00*

1

0*03 .

1

00*3

1

3 .

4

2

23

Page 19: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

20 1954 INVITATIONAL,. CONFERENCE'

Table VII:Soiution of Vcitaw

C ij

1230

_ 3523

15 20 25 28 12_.4.------1 0 2 1 02. 5 0. .2 L

1,

22 3 1

0 1 2 0

12303523

fi15 20 25 28 12

2 2 1

30 1 3 0 2

35 1 "1 3 2 0

23 0 0 1 2 01

1 1

15

12.

Page 20: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTINC PlItOBIAMS

abie VM (Coptinued)

21

ution of Broaden Problem

d C i i

25 25 25 25 Max. i 25 '25 25 25 Miax.

,8 24 1 4' 1 6 -0 1 -L- 16 62 3- 19 3 7 7 52' 0 0 2 5 _

3 -4 18 1 3 3 53 3 1 94 4 411 17 6 11 11 54 5 1 1. 8 S1 16 8 10 1 55 7 1

5 15 -7 3 7 56 3 47 7 14 1 13 1 57 3 2 ¥ate 118 6 13 2 4 4 55 6 ,7 3- 29 1 13 6 4,- 1 t9 . 1 2 8 7 7

10 =2 12 9 11 2 4 2 5.-12 4

11 7 12 3 1 3 61 5 3 4..10 1012 11 7 14 7 X62 -1 3 3 313 .11 1 9 1 63 3 3 4 _ 314 -L10 3 5 5 64 1 3 2 9 315 10 6 0 6 65 0 4 4 2 4.

Table IX -dontinued pages 22-8

I

Page 21: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

4

22 1954 INVITgIONAL CONFERENCE

L,16 10, 5 3 '66 6 4 1, 117 9 ° -10 4 18 8

.18 -19' i 8 4' 4 0 0

19 2 9 6 5 -77 8 6

20 2 12 12 ' 70! 4-' 5 11 13-

21. 8 : , n ,....2 5 1 88 72. 0. 5 0 6 "- '

3 3_ 7.A 1 -1 vd /0 12

-77 , .6 7 74 0 6 9 19, 19

25 - 7 3. 6 1 6s

-4,

'26` _._ 2 8 8 76 8 7 6

27 1 -9 -3 . 1 7 1 7, -4 .28 1 6 2 5 =1 . 78 0 7 1

29 6 5 11 17 5,,' 79 1 8 3 10

30 5 5 9 5 80 0 ig 6 8

31 2 1. 2 #.°'1.. 81 1 0

32 2 0 10 10 82 i8 9 0

33 3 7 3 83 2 9 2 ''.12

34 -- 4 4 15 15 84 3 9 6 5 9

'35 1. 4 5 3 _ 85 0 10 6 13 10

j

36 4., --(1 7 1 86 -2 10 4 5 10

.37 5 3 --6 47 5 10 1 2 10

38 4 3 0. 4 4 88 =4 11 2 11

29 7 3 , 1 6 ,,--'7 89 3 11 -73 -11

40 1 . $ 3 10, 11

*---

90 0 12, .

8 13 13

'41 2 2 4 17 17 91 9 12 0 11 12

.- 42 3 2 3 8 8 92 0 13 2 15 13

43 10 2 2 2 2 93 11 1'3 1 1 13

44 1 2 '7 7 7I' 94 1 14 3 19 14

4'45 1..----..

1 3 2 1 95 4 15 3 3 15

46 2 1 .-9 6 2 96 2 16 -2 8 16

47 1_ 1 5 10 5 97 0 17 -4 2 17'

4 --4 1 10 =5 1,0 98 3 18 1 14 18''

49 2 , 0 2 7 2 99 0 19 1 -1 19',

50 3 0 16 16 100 524 0. 1 24

2 1 723 .

Page 22: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TE TING ftROBLEMS

Ta le IX *( Continued)'

cum cuul

..25 2 25 q' 25 25 25

-----° 9- 25 0 5- , 2 51, 6 111 :22

2 10 26 -_ 0 52 2 2 6' 7 2, 7 - 21

X22'

(:) 53- 1 3 0 8 2"-_ 28 l5 0 54 3' 7 7 , 0

.0 15 -- 55 0 6 7 1 .

-1

12 22 0 10 56 4 6 3 07 6'. /3 0 12 2' 57 `5 0 58 10 17 2 58 8 0 9 59 0 12 . 6 0 59 8 5 15

10 0 '10. 7 60 0 2 9 _

11 10 15. 0 --2 611

15 7 612 2 -4 0 2 62' 4 0 .0 a. 213 4 12 0 1 2 63 6 0 7 2114 13-..d15 ' $ 64 2 0 1 12, 115 lb 16 0 6 2 0 8 6

-

16 6--I3. 8 0 66 0 2 5 223 33 -.34 0 `-61 - $. 14 18 0

1 14 188 .1 0 -1 68 0 0 4 419 3 .14 0 10 2 69 0 1 13 1.4-

20 14 20 10 0 70.' 9 8 2 0 .

16 17 3 0 71 7 0 4 133- 17 . 11 0 72.. 0 4 9 15

23 0 10 1 4 ` -1 73' 13 6 12 0° 24 0 14 13 5 '74 19 13 10 0

,I258. 13 '3 0 75 5 0 2 2

Page 23: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

1954 INVITATIONAL CONFE ENCE

Table IX ( Continued )

2627

2

,4.--...r._

6

100'

0

14.`7

50

19

6

, 01

610

044

1214.

2176777879

1 2., 326 0 11 87 0 6 29 2 7 0$ 0 13

._

o 3 0 -81 7 0 9 8,10 0 . 1 0 9 10

0 10 2 (.8283 11 0.11 7

34 -,..21 19 11 =0 84'' 6 0 15 4

35 4 0, 0 2 2 -85 10 0 16 23

36 3 11 0 6 .2 86 8 0 14 5

37 4 2 0 5 ,-2 :.87` 5 0 11 12

38 8 7 4 0 -8 47 0 13 23

39 0- 10 8 1- 89 ra8 0 14 15

40 4 4 11 po 13 1 5 0

41 19 19 13 ' 0 91 3 0 12 1

42 92 13 0

4 43 12 4 1 3 4 2 93 2 0 14 - 14

44 8 9 0 14 2 94 13 0 17 33

45 0 0 2 1 95 19 0 18 12

46 0 3 11 8 96 14 0 18 24

47 6 0 15 2 97 17 0 21 la48 14 11 0 15 2 98 15 0 17 4

49 0 2 4 9 99 19 0 18 18

50 13 1. 11 0 100 19 0 24 23

2 2 6

Page 24: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING ROBLEMS

Table IX Continued)

c .(2) cii(2

25 25 25 25 q 25 25 25 25--,

1. 11 27 0 51 O 6 5 22 3-2 10 26 2 O 52 4 0 9

7 21 0 0 E3 3 5 0 10

4 22 28 3 0 54 3 7 <5 05 0 15 5 9 & 55 .. 0 6 5 1

--a-6 14 24 O 12 56 4 1 07 8 15 14 57 5 0 , 3 13

8 10 17 0. 0 58 S 0 7 59 0 12 3 3 3 59 8 5 dig 0

10. 0 10 5 9 a. 60 0 2. 7 16 73

11 12 17 0 4 ,61 15 7 4 012 , 4 6 0 9 62 6 2 '0;13 6 14 0 12 63 0 0 5 2114 13 15 6 0 64 A 1 0,, 1315 12 18 0 8 65 4 0 6

16 6 13 6 0 16 0 2 3 2 317 23 33 2 0 67 8 14 10 0

®18 15 19 0 1 68 0 0 2 4 319 5 10 0 12 69 0 1 11 14 320 14 20 8 0 70 9 S 0 0

21. 16 17 1 0 71 0 2 1322 3 17 9 72 0 4 7 15 323 w1 11° 0 5 2 73 13 6 0. 024 0 14 11 5 3 f74 19 13 8 025 8 13 1 0 75 5 0 0 2

Page 25: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

1954 INVITATIONAL ONFERENCE

Table IX (Continited.

2627282930

612

1

149

.60

4...41

048

0' 6

5

12

14

223

767778'

. 7980

0 1 ,06 0 9

- 7 0 49 .2 5_

8 0 11

3282'

011

r3 '

.

X 31 0 7 0 3 81 7 0' 7 ..

8

32 12 15 0- 82 1 0 7 10

33 8 9 0 12 83 11 0 9 7

34 21 19 9 0 84 6 0 13 4

35 6 11 0 .4 ' 85 10 O. 14 23

36 5 1,43. 0 8 86 8 0 12 5

37- 0 4 0 7 87 5 0 9 12.

38 8 7 2 0 88 0 11 23

39 0 10 ,6 1 3 89- 8 0 12 15

40 0 4 2 11 3 90 13 1 3 0

41 19 19 11 0 91 3 0 110 1

42 L1 10 3 0 92 13' 0 13 28

43 14 6 0 6 93 2 0 12 14 144 10 11 0 16 94 0 15 33

45 0 0 0 1 =3 Oa 19 0 16 12

46 0 3 9 S 3 96 14 0 16 24

47 8 8 0 17 97 17 0 19 15

48 16 13 0 17 98 15 0 15 4

49 0 2 2 93 99 19 0 16 18

50 13 16 9 0 100 19 0 22 23

3B

3 9

Page 26: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS

Table IX

27

_ - _

q1.

25 25 25 25 j i -1- qi 25 25 25 25

1

2345

8 27 6,. 7 A7 26 2 it4- 21 a 0

19.28 3 00- 18 8 12

43,441

.4341

1

52'53'5465

0 9 8.25 ,-.1 4 0 9

. 0 5 0 100 7 5- 0

. 0 9 8 . 4

1

31,31,41

3.3, 4

6 11 24 0 12 3' 3 56 1 6 1 0 4' 47 5 45 0 14 3 57 -2 01.3 13 2. 2

8 7 17 0 0 3,4 3 58 5 .0- 7- 5 2 2

9 0 15 6 6 . 1 1 .59 5 - 5 13 0 4 4

10 0 13 .8 12 1 1 60 0 6 10 19 11,1 1 9 17 0 4 3 3 61 .12 7 4 0 4

12 1 6 0 9 _ 3 62 3 2 0 5 3 3.13 3 14 0 12 3 3 63 3 0 521 2

14 10 15 6 0 4. 4 64 0 1 0 13 -1,315 9 18 0 8 3 3 65 1 0 6 6 2

16 3 13 6 0 66 0 5 6 5 117 20 33 32 0 4 4 67 5 14 16 018 12 19 0 1 3 3 68 0 3 5 7

19 2 16 0 12 3 69 0 4 14 17 1

20. 1 20 8 0 41 4 -70 6 8 0 0 3,4

21 13 17 1 0 4 4 71 4 0 2 13 222 0 17 9 0 1,4 4 72 0.7-10 18 . 1. 1

23 0 13 2 7 1 1 73 10 6 10 0 4 4

24 0 17 14 8 1 1 74 16 13 8 0 4" 4

25 .5 13 1 0 4 4 75 2 0 0 2 2,3 2

26 3 14 4 0 4 4. 76 6 4 3 35 1 1

27 9 9 0 6 _ 3 77 3 0 9 S 2 2

28 0 8 2 7 1 1 78 '. 4 0 4 2 .2 2

29 0 26.14.14 79 6 2 5 0 4 4

30 0 13 11 17 1 1 80 5 0 11 11 2 2

2 6

Page 27: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

4 INVITATIONAL CONFERENCE

TABLE IX (Continued

3132 '

3334,_,35..

0 109.155 9

1819.3 11.

4 38 00 129 00 4

4343

4343

81828384

, 85

,4 0 7 80 2 /9 128 0' 9 7

3 0 13 47 0 14, 23

222

2;2

36 2 13 0 8 3, 3 86 5 0 12 5 2 2

37 3 4 0 7' 3 3 87 2 0 9 12 2 2

38 ,- 5 7 2 0 4 4_ 88 4- 0 11 23 2 2

39 0 13 9 4 1 1 89 5 0 12 15 2 '240 0 7 5.14 1 1 =90 10 -1 3 0 4 4

41 16 19 11 0 4 4 `91 0 0 10 1 1,2 1

42 8 10 3 0 4 4 92 ., 10 . 0 13 28 2

43 11 6 0' "6 3 3 93 0 1 13 15 1 1

44 7 il 0 16 3 3 94 10 4 15 33 . 2 2

45 0 3 3 4 1 1 95 - 16 01,6 12 2 2

46 0 6 12 11 1 1 '96 ' 11 0 16 24 2

. 47 . 5 8 0 17 3 3 97 14 0 19 15 2

48 .."13 13 .0 17 03 3' 98 12 '0 15 4 2 2

49 0 5 5.12 1 1 99 16.01618 2 2

50 10 16 9 0, 4 4 100 0 22 23 2 2

7

REFERENCES

A. To not NATUTtE OF THE Tinorm.m.r.1. Brogden, Hubert E. An approach ° the problem of differential prediction.

Paw 1946, 11, 139-154.2. Thomdike, Robert-L. The problem of -classification of personnel. -Psycho-

nsetrika, 1950, 15, 215.235.3. Votaw, D. F., Jr. Methods of solving some personuel.elassificatIon problems.

FsvchontafrOca, 1952, 17, 255-266.4. Dwyer, P. S. Basic theory and methods in optimal classification of personnel.

Fersormel Research,Branch, Deparunent of the Army, 1953.

Page 28: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

A

TESTING PROBLEMS

B. To EQUIVALENT mama.5. Dan

lem.B. A`rplitLnn of the plea

XXIII Ao mats ofMonograph No. 13 New York: Wiley, 1951.

B. Rao, C. R. Advance& statistical methods in biometric research. New York:1952.

n Neumann, johik. A certain zero-sum two-pereon game equivalent to theassignment problem. In H. W. Kuhn and

*I the theory of games, Vol. H. Annals obon: rrincetan Univer. Press; 1953.

rntonA., Prerequisites for pair-scoresRBIused for assemblingtale' Research Bulletin QC--TR-54-13 Mr Force Person-

n and Training iesearch CentereLackland Mr Force Base, December 1953.

C. To SWOMT. OF METHODS OF SOLUTION.

9.. Smith, Robert. B. Hand gutational meMr. Training Command, _human Resourc Research Center, September1951.

10. Votaw, D. F., jr. and Dailey, T. ent ofersonn to lobs. Res_ Bulletin 5-24,_ Air Training mman , Human Re urces Re

Laddand Air Force Base, August, 1952.11. Dwyer P. S. Selected computational methods related theory in the

optimal classification of personnel. Personnel Research Branch, Departmentof the Army, 1954.See also (4).

cker (

for the classification problem.

D. To =a SruPswit Mattson.See (3), (4), (5), (10).

E. To ..rnE MA 1-HOD OF OFTLMAX, REGIONS AND RE.LATEM THEORY.

12. P. S. Solution of the personnel classification problem with themethod of optimal regions. Psychometraca, 1954, 19, 11-26.

13. Lord, Frederick Notes on a problem of multiple classification. Psycho-metrlica, 1952, 17, 297-304.See also (1), (2.), (6), (11). /

F. To TIE DETERNONATION OF c, VALUES.

Dwyer, P. S. Selection and linear combination ople criteria and differential classification. PRBwent of the Army, January 1953.See also (1), (2).

relation to multi-Note 7, Depart-

Page 29: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

30 1954 INVITATIONAL CONFERENCE

New Light. on Test Strategy fromDecision 'Theory*

LEE I. -CRONBACH

In every practical use of tests, our aim is to make decisions. This

is obvious in personnel selection and in Dr. Dwyer's assignment prob-

lem, but it is also true of testing in the classroom and in the clinic. The

teacher uses tests because he has to make decisions about appropriateinstructional methods. The clinician uses tests as an aid in deciding

on therapeutic tactics. Sometimes, as in vocational guidance, the de-

cisions are made not by the tester but by the person tested. Test theory

should indicate how to reach the best possible decision in any of these

situations.-We use the word strategy to refer to the process by which an in-

dividual arrives at a decision. A strategy may be very simple: "I shall

examine+the applicant's grade average, and if it is B or better I shall

accept him." The strategy may- instead be complex, stating what tests

if any will be given, what decision will be made for any particular

pattern of results, and what further steps will be taken to decide on

borderline cases. Choosing among alternative strategies is the essen-

tial problem of test theory.There are two questions in choosing a strategy. First, with any given

procedure for gathering information, what is the best procedure for

slating this information into final decisions? Dr, Dwyer has just

shown us the solution for one problem of that type. The second, but

logically prior, questiorils: Among several alternative procklures for

gathering inforniation, which is most profitable?

In order to compare, two strategies, we have to determine howmuch benefit we gain from either one. Most of the problems of de-

cision theory therefore reduce to determining just 'how much benefit

is gained from a particular decision-making procedure,Since this morning's program is intended to deal primarily with

insights from some of these newer points of view, I shall not dwell

on the mathematics of decision problems. There is available a large

amount of relevant theory its the work, of economists on utility, in the

theory of games, and in tlfe statistical decision theory Abraham

* Based on work conducted under Contract Mori -07146 with the Office of

Naval Research.

Page 30: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS,

Joh pro s can be attacked in many ways, but wehave confined ourselves to strategies which maximize expected utility.This is reasonable only if we are dealing with a stable and familiarsituation_

The -decision model requires us to specify three aspects of anydecision. One is the proposed strategy or decision rule. For example,

strategy-might die to give two tests, combine scores by a regres-sion formula, and accept everyone above a given cutoff. Second, weconsider the adequacy of the information to be used. The usual con-tingency matrix or scatter diagram relating test scores and criterionscores deals with this question. Dr. McGill's work with informationanalysis is primarily concerned with -studying validity matrices. Thethird nectraly element is an evaluation matrix. This, sometimes calleda payoff matrix, states specifically just what benefit or detriment accompanies each possible decision: Dr.pwyer's Tables I and II are,evaluation matrices - (but his problem is so stated that the validityof the predictors also affects his entries). Once these three aspects ofa pro-bk.in hive- been described, we arc ready to compute the payoff aperson can expect if he bases decisions upon this information and thisstrategy.

Dr. Dwyer properlt drew attention to the fact that it will be diffi-cult and at times impossible to write the evaluation matrix for a par-ticular situation. To -let this difficulty deter us from using decisiontheory, however, would be to deny the possibility of sound test theory.Test effectiveness simply cannot be evaluated without an evaluationmatrix. Even the conventional procedures of test analysis assumecertain payoffs covertly, and the reasonableness of some of these hid-den assumptions is. open to question. In the future, testers may wishto determine sortutilities' by a so of cost accounting in any specific prac-

cal situation, in order to arrive at the best decisions. Our projectis proceeding along different lines. We are working with hypothetical(but we hope realistic) decision problems. By assuming that the evalu-ation matrix has some characteristic form, we are able to judge theutility of different types of tests and strategies. Such an approach canbe no better than our assumptions. We hope nonetheless to Arrive atgeneral principles of testing which will illuminate many real situations.

Let me turn to some of the concepts a decision approach brings toour attention. I shall cover four such points.

L Our model suggests that the value of test information should bejudged by -how much it improves decisions over the best possible

ions made-without the test, whereas the conventional validity co-evident reports how much better test decisions are an cluince de-

0

Page 31: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

1954 INVITATIONAL CONFERENCE

cisions. In the majority of situations w uldbeusect-asubstantial amount of information is already avaiiable,-and if no test

were given the decision would still be considerably better than Chance.

Our most valid tests. -are essentially work samples of their criferia.

Where such a work sample might be used, evidence of past perform-

ance is also a valid basis, for decisions, and such evidence is often

readily available. In predicting school marks, for exam. ple, a scholastic

aptitude test is not greatly more valid than past school records. The

contribution of this test to decision making is much smaller than its

zero-order validity coefficient would indicate, because better-than--

chance decisions could be made without it.A similar conception applies to classroom testing. The basic know1-

-edge and skill objectives can be assessed with considerable accuracy

from. day - today assignments% a test can add only a small increment

to the, soundness of decisions. On the other hand, a teacher has rather

little basis for judging, which pupils have problems of adjustment. The

teacher may therefore gain more useful knowledge from a test of

adjustment which has limited validity, than from an achAvement

test which largely duplicates data already available. There are serious

weaknesses in our tests for .such educational outcomes as creativity,

`1,- reasoning habits, attitudes, and application of knowledge to problem

situations. They are markedly inferior in validity to tests of general ,

intelligence or factual knowledge. But the factors that make testing

difficult also prevent valid non-test decisions about these objectives. It

may therefore be wiser to use imperfect tests of important objectives

that are hard to measure, than to use highly valid tests that merely

supplement non-test data.Utility analysis leads us to examine the value of adapting .to in

di ual differences in either selection or placement. This can best be

considered in terms of a placement problem, such as assigning students

to sections of freshman English according to their initial ability. We

might think of the various levels as predetermined, and of the test

as assigning persons to each category using fixed cutting scores. It is

sounder to see the test, the okrricula, and the cutting scores as inter-

locked. We can increase or decrease the demands instruction in any

section makes, to fit it to the ability of the persons assigned. Under

this procedure we benefit more from testing than when we leave the

treatment fixed.Certain simple assumptions lead, to interesting conclusions. If a

sample is divided into groups, using fixed cutting scores, the extent to

which treatment for the groups should be differentiated depends on

the validity of the placement test. If the information has zero validity,

Page 32: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PR BLEMS

utility 4 -maximized when we teach all sections in the manner suitedto the average the population As validity increases, th'ei treabnentsgiven the sections may differ more; but,no matter how veld the test,there is an optimum degree of differentiation of treatment. If treat-ment is differentiated beyond WS point, the benefit from sectioningdeclines. Ind ssible to differentiate treatments so radicallythnt from sectioning every though the test .usedhas considerable valiclity.

This analysis raises serious quesU n as to whether we are right whenwe urge teachers to adapt to individual differences. If the teacher has astandard plan, Well fitted to the average of the group, he should hesi-tate to depart from it. Marked alteration of the plan to fit individualsappears to be advisable only when individual differences are validlyassessed and their implicationS for treatment clear.

3. We turn, now, t& another Sugg9stion encountered in decisiontheory. It is customary to look at a tetas a unit, and to use u

-one -terminal _decision. At ,any point in testing, however,make a terminal decision or can continue Co gather informsfrontiers open for us when we view testing as a multi-stage,tial, operation.

Suppose, in a simple selection problem, we have several short apti-tude tests, which together might constitute a selection battery. We givethe first short test; some men can be rejected or accepted at once, butless clear-cut cases are retained for further testing. After the secondtest is given to these men, we can make more final decisions, and only.a borderline group goes on to the third test. This process terminateswhen the benefit from information to be gained at any stage is out-weighed by the cost of testing. Considering cost of testing, the se-quential method is more profitable than giving the same test toeveryone: If testing is expensive, one reaches the final decision fora surprisingly large proportion of men, after only the first short test.One paper on this line of attack has been published by .Arbous. andSichel (1), but our detailed results will differ from theirs in importantrespects.

A sequential plan would require new ways of organizing testing.I shall discuss one procedure for possible use, in vocational guidance,clinical diagnosis, or evaluation ef classroom learning. Here, it can bedescribed in terms of the job assignment problem. For, different jobs,

1

so many abilities `are relevant that we cannot hope to measure themall_accurately in_aresionable period. With" brief tests, however, onecould crudely measure as many as 50 variables in a half day. Such asurvey will indicate some jobs for which the man is an unlikely pros-

Page 33: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

1954 INVITATIONAL CONFERENCE

and a second group of jobs for which the tests show possibleaptitude. Pm the_man's qecond test session, perhaps only an haulthe ,first test, we would assemble a set of booklets to test him

or thoroughly ?n those promising aptitudes. This progressivero g would be continued. When the final job assignment is made, wewould have athigMyjeliable measure of rthe man's aptitude for thatjob, and alsoaa,good measure for the other jobs serious4, consideredas alternatives. But we would have wasted little time in getting anaccurate measure of 'his sce ability or his dexterity, if these areaswere not among his' better aptitudes on the first survey.

We might actually develop different sorts of tests for the earlierand later screens. The Strong Vocational Interest Blank might be re-placed by a brief questionnaire, perhaps one page long. This seemslikely to identify the important 'interest groups for a given man. Theremight then be a separate," onger interest blank for each of these in-terest groups, to provia more precise differentiation between relatedoccupations .than is now possible.

4. Perhaps our most far-reaching coneldiion is that we should takea more favorable view of tests with low validity. Traditionally, if ascore has low validity, we conelude tIlat it should not be used. But suchtests become valuable- when selection ratios are low (as Taylor andRussell noted ), when they give even a little new evidence on an im-portant decision, and particularly when they are used as a preliminary,survey.

The survey is especially important when many decisions are to bemade. Sometimes, as in vocational choice, the decisions are interrelatedand lead to one final course of action. The decisions may, on the other

-hand, be quite independent, as when one diagnosesmany persons.The prOblem in testing is ordinaterselect information-getting de-vices %Vlach will yield greatest benefit for the time available. If wespend a lot of time to get an accurate answer to one question, we mustanswer other questions witlijout added information. In this situation itmay be much wiser to use several tests of limited validity, so that everydecisiod is made with some wisdom, than to get highly accurate in-formation for just one of the decisions. '-

The difference between validity and utility is clear when we com-pare group and individual tests. An individual mental test measuresone person, with essentially the same expenditure of effort by thetester as the group test measuring one hundred. If the two tests havethe same validity, the group test gives us 100 times as much information, and bears on 100 decisions while the individual test bears on oneHence the improvement of decisions is vastly greater when the group

Page 34: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING pRong

mu r vse would depend on theapecific decision

s a favorable view of interviews and others which cover many aspects of the personality. These

m undetiendable, are well suited to a wide-casting sur----1,eyi-gathering-st little information on each of hundreds of questions.

Such a prel iminary scanning draws attention to the critical areas wherefurther information should be gathered pri6r to any decision. he tra-clitioha.1, narrowly focussed measuring device is ideal when we knowin advance exactly what question needs_to be answered. But in decid-ing Whether a man will make a goctd executive, br in locating a pa-tient's chief conflict areas; no such focus is possible. The first taskof the assessor is to discover which critical variables will dictate theproper decision about the individual; in different cases, different vari-ables will be critical.

Personnel workers regard the interview as indispensable, and clini-cians have considerable faith in ;projective methods and qualitativeanalysis of intelligence test protocols. In my opinion, this faith hadeveloped largely-because of regarding experience with these tec -niques in their survey function, i.e., as the first stage in a sequentialassessment. If it is true that these multi-dimensional techniqUes havea unique place in assessment, we should judge hog, well they do thatjob, and should not demand `that they be good measuring instrumentswhich they are not. On the other side of the picture, if their properfiniction is to make preliminary surveys so that more intensive exam-ination can follow, one should not rest final judgments on these fallible

'instruments.Taken as a whole, decision theory is a mathematical system which

permits us to examine the problems we face in deireloping tests, choosing between tests, and interpreting tests. Whenever we can specifyany particular decision problem in the detail Professor Dwyer's prob-lem required, then decision theory can tell us just.what to do. Bystudying common type-problems, decision theory can also offer gen-eral recommendations regarding testing strategy..,

Conventional test theory assumes That we use tests to obtain nu-merical measures on an interval scale, as in the physical sciences. Thatis rarely or never true. The ftinction of psychological and educationaltests is to aid in making discrete decisions. The greatest contributionof decision theory is'to help testers see this function more clearly.

See next pica for i-eferinces)

Page 35: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

71964 INVITATIONAL CONFERENCE

REFERENCESA C Bain Sienxi H. S. On the economies oT a pre-screening tech-atitude test batteries. Psuchomeirike, 1952, 17, 331-3443.

aris-wzm, awn) Awn Ciaturms, M. A. Theory of games and statistical de-. New York:` flay, 1954.

3. 13noonser, H. E.; AND TA .on, E. K. The dollar criterion--applyin the costaccounting concept to criterion construction. Personnel Psycho 1950,133454.

4. Baoss,,,Invrirs D. J. Dari,gn for decision. New York: MacMillan, 1953.5. GRONRAOR UN J. The counselors problems from the perspective of communi-

cation theOry. In Vivian H Hewer (Ed.), New perspectives in counseling,11Lifinesota Studies in Personnel Work, No. 7. Mtmiespolis; U_ niver. of MinnesotaPress, 1955.

CI. EnwAnre, WARD. The theory of decision making. Paychol. Bull., 1954, 51,380-418.

7. GOODMAN, Lao A. The use and validity of a prediction instrument. ASociol. 1953, 58,'503 -512.

8. Tom, R. M., Ccosss, C H:, AW bAVL9, R. L., Editors, Decision PNew York; W

4,

Page 36: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTI-N0 PROBLEMS 37

The Relation Between Uncertainiand Variance

There has been a great deal of hue and cry about information the-'ory in piyehologicil circles. In the midst of this hue and cry, it is easyto become confused about where information theory is supposed tomake contact with psychology. What, if anything, should information

.2 theory contribute to psychology?The theory deals with transmitting` symbols from one system to an-

other. That certainly sounds familiar. Many questions that come up inlearning, in perception, psycho-physics, testing (just to mention afew areas), take on new -signiRcanc6 in the terms of information theory.Consequently when we read presentations of infomiation theory andrun across words like "message," "noise," "channel capacity" our nos-trils begin to quiver and we sniff a familiar scent in the air. At oncewe try to find analogues for the classical problems of psychology inthe theorems presented to us by the communication engineers. nisis one large zone of contact between information theory and psy-chology. Information' theory is a source of .analogies and ideas thatmight not have occurred to us, if we thought about our problems inanother way. Perhaps the analogies are helpful, perhaps not. I wouldrather not discuss the merits of using information theory for this pur-pose. I bring it up only because I want to hammer at the distinctionbetween information theory and information measures. The theoryis concerned with transmitting symbols despite noise. Informationmeasures are concerned with "the arithmetic of mean-log-prObability.

Mean-log-probability is something like variance.' It measures theamount of spread in a discrete probabilff distribution. The forniulafor mean-log-probability is usually written as follows:

krIf(y) s p(k) loge p(k)

k-=.1

In this formula, y is a variable that can assume any one of r discretevalueS. Each of these values has some probability, p(k). The negativesign before the-summation insures that -U(y) is positive. The interest-ing thing about mean-log-probability is that any numbers or meas-

' urements which the various= categories of y might represent do notappear in the formula for U(y). In other words U(y) is non-Metric.

3 (3

Page 37: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

1054 INVITATIONAL CONFERENCE

Mean-log-probabili -.-t.an be calculated for a variable like "methodsof parental discipline" (just to choose an example), whereyno num-bers can be attached to any of the methods. One method is as good

or as bad as another. All you can say about them is that they are dif-

, ferent. Obviously, you ca'n also compute mean-log-probability forvariables_ like IQ, but in so doing you sacrifice whatever you

it gain from knbWing that an IQ of 100 is very close to an IQ of

1p5 and very far from an IQ of 150.Inthe formula for mean-log-probability, logs are taken to the base

2. This is done in order to provide a sLmple unitthe "bit." When'U(y)is measured in bits4 it turns out that U(y) is the average number ofbinary, or two-category decisions required in order to identify one ofthe values of y exactly. It is not really necessary to measure mean-log-probability in bits, but almost everyone does.

You can apply mean:log-probability in many situations where ,in-formatioir theory has nothing whatever to say, and where in fact an

attempt to apply information theory might even look a little ridiculous.For example, suppose you want to express the relation between

anxiety level in children and several methods of parental discipline.The relation might be demonstrated very easily by using mean-log-probability. But our understanding of the relation' would not be en-hanced if.we considered the disciplinary methods as messages and the

anxiety levels as the versions of these messages received at the end of

the channel. The "capacity" of such a "channel" is of little interest tous. The fact that we can with effort construct information- theory -typeinterpretations of relations like this one, merely, demonstrates thatcommunicatign engineers and psychologists measure things withroughly the same kind of arithmetic. It does not mean that informationtheortjThas anything significant to contribute to our understanding ofthe relation between discipline and anxiety. Consequently we can be

interested-in the information measures entirely apart from their sig-nificance in information theory. To fortify thii distinction, let us nowcall these information measures by the niiime "uncertainty" measures.Uncertainty means mean-log-probability. It has no necessary connec-tion with information theory.

In this paper I want to show that uncertainty is essentially non-metric variance. This can be a very hollow claim revealing-little morethan a superficial similarity unless we are prepared to outline in de-tail the properties of variance that uncertainty possesses.

What are the principal properties of variance?1. Variance is a measure of spread or variabilityso is uncertainty.2. Variances that-are independent are additiveuncertainties that

are ,independent are also additive.

Page 38: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

3., Va4ances can be partitioned into components that reflect contri-butions to the total variance from a number of predicting variables

takeone at a time and in combination; uncertainty has precisely theproperty:

4. Variances are seriously affected by the metric or scale propertiesof the data from which they are derived. on the other hand uncertaintyis unaffected bY the metric or scale properties of the data. You canchop a distribution into parts_ and rearrange all the parts withoutchangirig uncertainty but the variance will change radically.

It is- obvious that uncertainty and variance are closely related butit is equally obvious that the two measi.u-e. are not identical. There isno simple equation that takes you from uncertainty to variance. In-stead, the parallel is in terms of structure or operations. To make thispoint clearer I have prepared two tables, Tables 1 and IA in thehandout. Table 1 shows the symbols, forMula% and definitions used ina double-classification analysis of variance. Table IA shows the samearrangerneht for a double-classification uncertainty-analysis. You cansee by Warming across the tables that the parallelism is very complete.In both ,cases the predictor variables do not have to be metric. Theycan be pure classifications like our example of methods of parentaldiscipline. In tlTe\wariance analysis the dependent variable is metric.It is something that can be measured numerically. I have pictured thedependent variable also as having discrete claSsifications but this isjust a convenience of notation.

You can see the relation between the two analyses very clearly whenyou look at this pair of equations:

U(y) U.(y)* U(y:w,x) (I)V(y) V.,i(y) V(y:w,x) (la)

The definitions of the terms in these equations are explain0 in Tables1 and 1A. The equations state that the. variability of thvcriterion ordependent variable can be analyzed into predictable aneunpredict-able parts. Furthermore the predictable variability may be decom-posed as follows:

_U(y:w,x) U(y:w) U(y:x) U(y:wx) (2)V(y:w,x ) V(y:w) V(y:x) V(y:wx) (2a)

where again U stands for uncertainty and V stands for" variance. Theseequations state that the total predictable variability can biokendown into a part predictable from w, a, part predietableIrom x,",and apark predictable from unique combinations of w and x.

Sometimes the uncertainty interaction term, U(y:wx), can be nega-rive. What is not generally realized is that f( y:wx ), a component of

Page 39: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

1954 INVITATIONAL, cODIFERENCE

variance, can also be negative if it is defined as it is in Table 1. Thereason is that this term contains a hidden cross product, a correlationterm, as well as a sum if squares. Furthermore the correlation termhas a negative sign and can sornetirri6s be greater in magnitude than

-.Interaction sum of squares. This means that the interaction termin the variance-analysis cian sometimes be pushed negative by thenegative correlation term.

When does this happen? It happens when the predictor variables.arc not independent. In ordinary analysis .of variance this situationnever comes up because we are careful to construct the analysis sothat there are equal numbers of observations in every cell, This auto-matically insures that the predictor variables are independent. We capguarantee that things behave properly in both uncertainty-analysisand valiance-analysis by making. the...predicting variables orthogonal,i.e. independent, When the predictor variables have some sort of,me c t its is equivalent to tequiring:that the correlation between thepredictors is zero. In fact; all the arguments I have been making forthe. 'similarity between uncertainty analysis and analysis of Aarianceare equally true of multiple regression analysis, i.e. the case ffi whichthe variances in Table 1 are computed around a regression plane. InMultiple regression any interaction computed by the rule shown inthe last line of Table 1 is bound to be due to non-orthogonality.

We see that multiple correlation, analysis of variance and..uncer-tainty analysis are all very closely related. The different pfedictivemethods are necessitated becausesometUnes the predicting variableshave metrics and sometimes nOt;sometimes the dependent variablehas a metric and sometimes np

What remains stable in each e is the structure of the statistical,process of prediction, the operations intoll,Ted ?in making the pre-

diction.One important wpraetical consequence of this invariance of struc-

ture in statistical prediction is that we can -he* analyze non-metricdata in the manner of analysis of variance without resorting to the dis-tortion of giving the data an artificial metric. The non-metric analysisis carried out with uncertainty measures and mean-log-probability.Another very, important consequence is that all the literature on ex-

rirn ental design in variance analysis is dirWtly applicable to the uncertainty analysis.

Page 40: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

11.4.2:1;

.,Table 1

formula, : and 'defions, usod in double cation, au4ysis .,.9f vaiance The 14 on variabIe y is

assumedio be metric, The predictor variables, w and x.arecitigaTiied but O'citneceisWiltialau n1/11j. ,7114.7

Mince: The variance of the criterion v4ria0e y.i4 4t.

. '

!." t!

! .

r3 A

i V (Yk

ni., lli.kyi.,

in k ni.. '

tiioriai. Variante: The variance: of y when one ,piedictor

variable w held constant

,

rrii:Varianze: The variance reraiiningiin y when both predictor ,

yariabIe, w and xi are held constant,

1:11

1 il

Prliat. gain gif The variance y due to the "predictor (I)

variable ,

.

.When'the:prdictoi variable w, is held- constant

"!

C

v(yq) v(y) vw(y)

. 4

(tin ffeet: The variance of ydue to the predictor van

:(y _ V(y)

V(Y:YAYwi(Y)

a, rir 4

Total pridietlibtelthrialice; variance of y to the joint

-infifimce of the prediotOf vatiables w and x. 'i

: . . .. . , , , ,,. !. :. ' :' ... ,i ':,

, .

..i. Ly.' ivic. ', IV') + V(y) +:v).(y) :77. IT,i' (y) InieraftiOrtfilitrige.1 Ai Variance of y due to

., -, ',- . ,z Vir(yia) -- V :0. . , '.,, of the predictor .variableat, w andx,, ., H - ,

: : ,. .

. ,S. -. ..e I' :-2 ..= * :

.

iqueeprollinati,ons

. :

The ,ctitsion viriable Is talented by It an asninn'any value, yk.! The twOtt dot vatiablei are wand x,. tete two vari-ablen can asSme values iv! or x Thus;thodati arrays are three'amensionalOn three dimensional data tali', nok refers to the

'timber of 'cases occurring lb a single ',ce11; The dot notation kdientes that-al:A.113s been added over the .nlissin-' i.subsoipts,

1

Page 41: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

Table 111'

Symbols, formulas, and definitions used in three-variable uncertainty analysis. The criterion variable y is assumed

to be nonletTic, The predictor variables, w and x, are categdrized and may or may not be medic variables.*

DefinitionSymbol Formula

1. 'U(y)1'..k n.,k

,1

lop11 n nl.. ni

14

Told uncriainty., amount of uncertainty in the criterion Et

Variable y:

z1.4

',Conditionfil.imetiainty: The amount of uncertainty y 1,v4en one

predictor variable, w, is held constant.

nil; not ; niik10

n k nil, nil,

Error uncertalitly: The amount of ,uncertainty remaining when0

bothprediapr variables, wand x, ardeld constant. ,

2

,

U(y) uw

Contingektinedlainty: The uncertainty y due to the predictor

2

.

Itt4,1 9.011.00 Uncertai'nt'y: The uncertainty in y due to the PI

tri

joint influence of'00)predictor variables w and

variaple,

6 U U(y) 15,(y),,Multiple ofitingeittUncerlainly; The uncertainty in y due to the

joint influence oithe.predictor variables, w and

7.11(y) 13,(y) Ux(y) m U(y) interaction tinartaiti(y: The uncertainty, of y due to unique

_ w(yx) U(y x) combinations of the predictor variables, w and jx.

Page 42: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS 43

PARTICIPANTS

LSE J. CRONRACH, EDWARD E. CURETON, PAUL S. DWYER, DOROTHEA EWERS,

M. M. KOSTICK, WILLIAM J. MCGILL, VICTOR H. NOLL, JOSEPH ZUBIN

Dn. NOLL: This is by way of comment rather than criticism on apoint that Dr. Cronbach made in his paper. I have reference.to thesecond point in relation to use of tests in providing for individual dif-ferences.

It seems to me that the implication there was a little unrealisticin that we do not ordinarily, in the classroom, attempt to provide forindividual differences on such a mechanical basis. We do not say,"Your IQ is 117; consequently, you do this," and "Your IQ is 90, andyou do this."

It seems to me it is more a matter of,providtrig a variety ofmaterialsfrom "which the student then, by a process of selection, determineswhat is suitable for him. In other words, it is more a curricular prob-lem than it is a problem of measurement. Perhaps I am putting an im-plication' on Dr. Cronbach's statement which he did not intend, butI think it is important that we think of it in terms of providing a va-riety of experiences from which students of different types' of abilityand interest then choose rather than a mechanical process of deter-mining, on the basiS of a test, just what a student is going to do.

Dn. enoNnAc-H: I agree with Dr. Noll's views on the curriculum.The generalization of the paper is this: our system of analysis forcesus to reconsider much of the doctrine we have ad regarding indi-vidual differences. This doctrine takes a variety of forms. For instance,experiments on homogeneousigtoups have not la to conclusive results.In these studies, pupils have been given different treatments, treat-ments varied rather mechanically, plus some additional flexibilitiessuch as Dr. Noll mentioned.' Thre results to be expected from X-Y-Zsectioning will differ greatly; Oepericling upon the extent to which Xpupils truly differ from Y's and Y's differ from Z's. I know of nothingdone in these research programs to make sure that the adaptation toindividual differences applied in each section was optimal..

42

Page 43: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

44 1954 INVITATIONAL CONFERENCE

If _treatment = is. overdifferentiated or underdfferentiated, they ctainly do not get the hilt value of homogeneous grouping.

Our result also bears on the assessment literature. Thus it has beensaid that there must be something wrong with counselors because theirstandard deviation of estimates of ability,is lower than the measuredstandard deviation. But the evidence is that counselors are makingmistakes chiefly because they are overdifferentiating, taking the tests

too seriously.Obviously we have been encouraging teachers to look at the students'

personalities and to handle some of' them_differently from others, onthe basis of-judgment that one nods encouragement and another needs

stern treatment.The answer lies in this direction: it is fine to differentiate treatment,

so Inng as you do it on a tentative basis, and allow for trial and error.If you make irreversible decisions, such as "you cannot go to college,"then your information has to be extremely good.

Da. CiAlgrON: It seems to me we might consider one illustrationwhile we are on this topic, because it happens to be fairly common inAmericas universities and to rest on pretty definite information of anon metric type. This is the matter of freshman English. I think prob.ably m colleges and universities you will find more specific sectioningin English than in almost any other area. The reason is that there seemto be some fairly definite cutoffs which imply specifically differenttypes of instruction. For example, if a student can write good EnglishSentences quite regularly, and can also-handle them in paragraphs andlarger units with acceptable style, many universities will excuse themfrom freshman English at- the outset. Secondly, if he can write accept-able grammatical sentences, but does not do too well in organizingthem into paragraphs and larger units and produces poor style, he islikely to be assigned to a regular section. And finally, if he is unable

to write grammatical sentences and handle properly the basic elementsgrammatiCal structure, 'then he is likely to be assigned to a special

section in English in which he will get the type of drill that is notneeded by those whose secondary' preparation is somewhat better,

Dn. CRONBAM: Our analysis suggests research 'on the performanceof students of differing initial ability under various treatments alongthis continuum, from very routine English training to the highest levelof English. The highest level might involve going into matters of style,for instance, well above the usual considerations of clarity.

Da. Kamm: This is for Dr. Cronbach: I was wondering about us-ing the sequential method in entrance examinations.

I carried out a study of the effectiveness of entrance examinations in

3

Page 44: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS 45

our Massachusetts State Teachers Colleges. We found that by the endof the sophomore year, of those people who were admitted by certifi-cation, only 6% "flunked our or dropped out with low grades. We alsofound that for those people who were admitted with an enhance exam-ination mark of over 160, only 18% were dropped. Of course, the markof 160 or any other mark doesn't have meaning unless you know whatit tands for We used a combination of the score on the psychologicalexamination and the score on the English Examination.

What I am, particularly interested in, is the lowest group. We havea range between 130 and 140 which is questionable. From that group,48% "flunked out."

I was wondering A.:it would be a good suggestion to see whether wecould use some sort of sequential testing of these people, instead- ofgoing to the expense of putting them through the freshman or sopho-more year and then having practically 50% of them drop out.

Dn. CRONBACEI: In sequential testing you often can get more thor-ough measures than you could afford to give to everyone. There is nocompelling reason why borderline cases could not sit for a two-dayexamination even though, you wouldn't think of doing this with everyentrant. However, you are likely to make a great deal more gain if youfind some new sources of tests that will supplement your predictionrather than just extending the old one.

The statement that was made focuses on the importance of Davinvery clearly in mind your value assumptions before you try to mdecisions about test strategy. We can each make our own value asstion. I would. be hesitant in this particular example to accept the r phcation that ff a person flunks out after two years, this result has ega-

five utility. There may, be positive utility in those two years of liege,particularly when judged from the viewpoint of the individual, atherthan from the values of the institution. I think we fall too east y intoseeing the problem only from the institutional side. '

Dn. EWERS: I simply want to add to Dr. Cureton's statement wedo the same thing- for freshmen in high. school, except we h ve onemore category, that is, if they cannot read, we put them in a fourthgroup. But I wonder really how well we do teach them in these variouslevels. If we could do a better job there, we might not have so muchtrouble at the college level.

Dn. Zuznkr: Dr. McGill, I am, a little 'puzzled by your analysis, espe-cially by the terms you used They evoke memories of previous knowl-edge and I wonder whether you would like to connect up for us theprevious meaning attached to these times with the meaning you havegiven them. Analysis of variance gives the components of the total

Page 45: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

1954 INVITATIONAL CONFERENCE

variance in a systematic fashion, and, of course, leaves a little bit for

the uncontrolled variance as an error term. Ordinarily we refer to it as

the standard error, something left with which to measure the signifi-

cance of the single components. As a matter of fact, what is the rela-

stionship betAveen uncertainty and thoe older ideas. of standard error?

Are they at all related, or are they two completely independent things?

Dr. Dwyer's method, of course, applies only to situations where you

already know the value of each job and -you know the value of the

person, and so on. Now, could that method be applied to situations

where we still not know these values? In the clinical field, for exam-

ple, we do not know the amount of money we could save by a certain

procedure. Would it- be possible to set up a contingency table with

unknown cy, and then solve for them under certain conditions? Rather

than assume you know your cy., could you put in specified cii.. and

see what happens?DR. McGill.: The answer is that it was my intention to by to evoke

associations from as many people as possible, because I believe that if

you have the old associations firmly in hand, it is extremely easy to

manipulate these so-called new concepts. They are not new at all.

The analogy of uncertainty with error variance is an example. The

"error" uncertainty contains error plus everything else you forgot to

analyze. If you have constructed an experiment 'properly (which almost

nobody ever does), the error will be what the model claims it is.Theiteresting thing about uncertainty analysis is its technique for

testing null hypotheses./In this analysis you do not test the predictable

components against error; you test the error against zero and the pre-

dictable domponents against zero. It is a tricky little switch, but the

interpretation of the components is identical with variance analysis.

DR. DWYER: You can't work the problem unless you have some

(values of There isn't any reason why you can't have a whole series

of hypothetical ci and maybe the collective problem would be some

sort of answer. This formulation of the problem demands it, but we

could have a whole series of hypothetical values, which might be

interesting.

Page 46: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

SECTION I

Recent Advances

in Psychometric Methods

47

Page 47: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS 49

Some Recent Results in LatentStructure Analysis

T. W. ANDERSON

Latent structure analysis may be thought of as an analysis of dis-Crete data that is analogous to factor analysis of continuous data. Theusual model for factor analysis is

(I) xi = X,,f, + Ili

where xi is the i-th test score, ft, is the , -th factor score, p.i is a con-stant and ui is the sum of the i-th specific factor and the error of meas-urement. The test scores are observed; the quantities on the right bandside are unobserved or latent.

Fdr convenience of discussion, we shall assume that the f. and titare normally distributed. The key assumptions of this model are thatthe set ui are statistically independent of (uncorrelated with) the setft, and the ui are statistically mutually independent (mutually uncor-related). This means that if we takelhe subpopulation defined by therequirement that the factor scores are given values, then in this sub-population the test scores are statistically independent; that is, giventhe factor scores the predictability of one test score from another iszero. More specifically, given the values of the test scores are nor-malty and independently distributed with means 'ilt f 4- /..Li andvariance ail. These assumptions are reflected in the formulas for thevariances and covariances of the test scores. If we assume the factorsare uncorrelated and have unit variances and let the mean of ut bezero and the variance of ut be E ui° = cri2, then the variance of the i-thtest score is

(2) E(xl mi)2

and the covariance of two test scores is

E(xi Ai) Pi)o=1

Page 48: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

50 1054 INVITATIONAL CONFERENCE

We can say that the common factors "explain" the interdependence or

correlation of the test scores; the effect of the ul appear only in the

variances.In considering latent structure analysis, we shall assume that each

item is a dichotomy; that is, an item score takes on the value one for

a positive response and zero,for a negative response. Thus the con-

tinuous test scores x, are replaced by the discrete item scores y,. The

continuous factor scores G are replaced by the latent attributes gv,

which may be discrete or continuous. In the subpopulation defined by

given latent attribute cores, the responses to the p items are statis-

tically independent; in other words, given the latent attribute scorethe predictability of one item from another is zero; In such a sub-

population let 7ri(g) be the probability of a positive response to the

1-th item; for ease of expositiowe shall assume there is only one latent

attribute score g. Then, for example, the probability of a positive

response on every item is Y

(4) Pr yr 1, y, --- I, . . . I g t 7ri(g)7r2(g) . . 7r (g).

For given g, the y, area set 'of independent binomial variables. Next

we assume that there is a distribution of the latent attribute, say f ( g)

over the whole population. The probabilities of y...,yr, or the rela-

tive frequencies of various resporise patterns for the entire population

are obtained from those for the subpopulations by averaging with

respect to f (g) .To make these as more concrete we shall consider two special

cases of the latent- structure model. One of these can be derived

from the factor analysis infidel. For simplicity we shall assume that

there is one common factor f and 0; that is,

(5) NJ. ± ,P

Suppose that the co -ore xi is replaced by a dichotomous var-

iable y, with y, and y, 0 if x, 4 a,. From -(5) weknow that x, are normally distributed with means zero, vari-

ances A.,2 and coy. an is A, A, and from this fact we can corn-

pute the probability o pattern of the dichotomous respones. For

example, . r

dx, _

Page 49: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING .PROBLEMS 51

where h (xi , x, ) ,s the density of the normal distribution. Whenf is fixed, the conditional distribution of x, is normal with mean ALf andvariance ai2; in this sul3population the probability of a positive re-sponse on the i-th item is

(7) Pr { yi 1 I f } = f _ e 2/'71 dxi

ai2r 01

where 0 (b) is the probability of a unit normal variable being lessthan b. From the fact that the u, are statistically independent it fol-lows that in the subpopulatiop of a given f the x, are independentand therefore the y, are independent. For example,

(8) P y . YP 1 I f = 71()

I

The case of one latent attribute (that is, one factor sore) has beenof particular interest to sociologists and social psychologists. The itemsmay have to do with opinions on various questions, all of which arerelated to a given atti-Wde. For instance, the items may be questionssuch as "Would you like to work With a negro?" "Would you like tolime in a nonsegregated community?", etc. The underlying attitude isthat of racial prejudice. The end purpose of the investigation may beto rate the respondents on a scale of racial prejudice.

El. a p

In (7) vi (f) is given as a particular function of f. Other functionsmay be postulated. Lazarsfeld has been studying mathematical andstatistical problems that arise when 1(f ) a, ± bif cif2 or 7T, ( f )= a, + b,f" (0 < f < 1).

Perhaps the 'simplest of ,the latent structure models arises whenthe latent attribute is discrete. Suppose that the attribute can haveone of q values, say f q. Then ri(f) can be designated as ria,where'a q. The distribution of the latent attribute is specifiedby Pr i f = a IP,. The probability of thawing an individual from thea-th latent class is va; the probability of drawiqg such an individual andgetting a positive response on the i-th item is,var, a; and the probabilityof drawing some individual and getting a positive response on the i-thitem is

4 Ll

Page 50: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

52 1,954 INVITATIONAL CONFERENCE

VI = Z

Similarly, the over-all probability of getting positive responses onitems i andj is

(10)q

11-11 Z Itia7T1 ffY4,

and the probability of getting positive responses on i, j, and k is

1k Z ria 'rim k, i k.

There are similar expressions for the probabilities of other sets of

responses.The ter,, vii, and rip, are known or are estimated front the data; the

7ria and Va are to be inferred. When the structure is known a respon-

dent can be classified into one of the q latent classes on the basis of his

responses to the items.A method for inferring vi4 and v., suggested by Bert Green [2],

is based on factor analysis methods. Equation (10) can be written

(12) rii X (7,-;* va)

are identical with (3) if Ala. ria V Va. The matrix (wij ) can be

ored to determine =the matrix (A10) =-; (Tri°41/ va), but there is

left 'the Mdeterminacy of rotation. To eliniinate this, Green suggests

also factoring ..1:kvijk. Generally there will be only one solution that

' factors both matrices; in the case of fallible or statistical data there

will be one factor matrix that gives the best fit. (This description of the

method should be taken only as approximate.)

An important difficulty with this method is that 7,1, rill, etc. are not

defined and not observed and these must be approximated in some

fashion; this causes particular trouble with the second matrix (Xkviik)

that is factored. Another method [1] which bypasses this difficulty as

well as the need for factoring large matrices is to use only part of equa-

tions (9), (10), and (11). The part of (10) that is used is a set that is

entirely below the main diagonal of the, matrix (r,1). In this method

2q' of the observed rt, Fr,1 and R,& are used to infer the same number

of the latent, parameters. Tile algebra that is involved is the standard

Page 51: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS

theory of characteristic roots and vectors, but we cannot go into detailshere. This method has been developed further to -use more of the data,and a large sample theory of tests and confidence regions has been, drived. Neither of the methods mentioned is in principle efficientusing all of the information in the data but either or makes it possibleto analyse data in ferrns of this model. However, further developmentsin statistical inference are needed even for the simple case.

REFERENCES1. ANDUSaN, T. W. On eitirnation of parameters in latent structure analysis.

Psychometrika, 1954, 19, 1-10.2. Cartrx, BtatT, JR. A general solution for the latent class model of latent structure

analysis, Psychometric, 1951, 10,,151-100.3. L.AzArtarmx, P. F. A cane tual introduction to latent structure anal sis. In

P. F. Lazargeld (Ed.), Mat emtitical thinking in the social sciences. FreePress, .4954, Pp. 349-387.STOWFX-8, SAM Me.,, et al. Measurement and prediction. Vol,, IV. Studies insocial psychology in World War II. Princeton Univer. Press, 1950. Ch. 10 and 11.

Page 52: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

54 954 INVITATIONAL CONFERENCE

The Mod.ra t ariable" as a Useful'Tool in Prediction

DAVID R. SAUNDERS

paper is intended to be partly informative and partly per-

uasive. From the information standpoint, I hope to supply you with

nswers to three main questions: What is a moderator variable? How

do you use a moderator variable? and, thirdly, Why should you, any-

Way? The persuasive aspect of paper is ob.rious hi tli third ques-

lion, just as in the titleAt is an important aspect, because much of the

information I can give you is neither very new nor very complicated..

Moderator variables have been used'for many years by our friends

hi economics and agriculture to help in fitting regression surfaces to

their data. In these applications.* moderator effect is typically just

one of many that are possible WO 4 multivariate curvilinear regression

based on a polynomial exPansion. Our economic and agricultural

friends don't need and dou t seem to have any special- name for it.

Our biological A-iendsmight be tempted to stiAge,,st the name -syner-

gistic vailtil31e;" but this is a term that already has a lot of additional

scientific corinoptions that we want to remain neutral on.

So far as I knew, Gaylord and Carroll werepsychology.

t .._first to use a moder-

ator variable in ychology. They called lek". _Pulation control vari-

able," and presented "a paper on it dpring the 1.48 APA meetings. The

term "population control variable" 'is a good one, because it 'Suggests

a very important application. But it is a bad term to the extent that it

tends to blind up to a number of other equally important applications,

which I will touch on. The term "moderator variable" seems to be gen-

eral enough in its meaning_ , and still not to be loaded with too many

undesirable connotations.Wellwe have just christened this thing, but have only hinted at

what it is. Let's look aVsome examples. By now I've managed to think

of dozens of attractivehypothetical examples, but I'll spare you these

and emphasize two examples that have been fully Worked out and

cross-validated.The first example is the one that originally led me to think a moder-

ator variable might be an important Concept. Frederiksen and Melville,

at ETS, had just shown that interests were less predictive of academic

success for "compulsive" people than for non people (2).

Page 53: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TEStINg - ,PROBLEMS

For Ills discussion, we can igricir6, the theasitted'that Were used to de-fine interests, success, and 'Compulsiveness, except tti note.that they allwere regarded as . continuous variables. Frederiksen's end Melville'sexperimental design, rieyeighele'ss, had to be the analysis of covariance;the total experimental simple was. arbitrarily divided near the median,compulsiveness score to produce two groupscalled "Compirlsiv'es.:.and -nontcompulsives." The relations between interest and successwere compared for these groups, and were found to differ significantlyin the slopes of the regression lines. This was done separately for-tendifferent interests as predictori.

This example is typical of nq'any sitiptions in which the use of amoderator variable should be Lnsidered.: In' this particular kind ofsituation the term "population control 'variable" would also be apt.Clearly, what, e are after is a means of treating compulsiveness as die'continuous variable which it means of 'avoiding the arbitrarinessof dichotomizing or otherwise dividing the population into smallerpieces-4 means of maintaining the integrity of the total populationwhile still maintaining a statistical control on each individual's mem-bership in one of a continuous, infinite series of sub-populations sle-fined by his compulsiveness score. In short, we will allow the meaningof his interest score to be "moderated.' by hiscompulsiveness score.6.

a this turns out to be extremely, simple to do. 1,iriposestatt with> an ordinary _linear regression using several variables.

_ Oefiing .eVeilithing in stiindard scores, we would. write ;the equation

1=1800,

where y is our criterion, and /3, is the beta-weight for predictor x,. Nowsuppose that pi, instead of being a constant, is itself a linear functioof a series of moderator variables, zf's. If we plug this into our equdtion, and do a litttercarranging, we immediately find that we can write

k 1

y aixi bizi c

This would be just another linear regression if it were not for the lestterm, involyidg the products of the 'es and the z's. If we want to, weaft always chooe origins of measurement for The x's and z's that yrill

e everything drop out of this equatekr, &cept the products, and aconstant term.

It is evidently these product terms whit, are inextricably tied up

Page 54: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

58 E '1854 'INVITATIONAL CONFERENCE

with the useol, a Moderator variable. And that is all that has happened.

So long as-there- is a clear separation of the x's and z's, we cannot have

any use for sqUared.variables, let alone terms of higher power. All we

have to do to fit ihepodel to data is to.find the appropriate xz product

(or productt) for eag,li' subject, and treat it (or them) as new 'rode=

pendent predietors-in any standard multiple correlation technique. Of

'Oon-rse, we cannot introduce a product variable into a battery unless

both-ef its factors-are already there.-There are many interesting mathematical sidelights to this thing, but

if want to persuade you that moderator variables are useful and

PtaCtica]#*.d better 'go back to our examples.

We' took the data that Frederiksen and Melville.had collected, and

computed :the products of the interest predictors -with Compulsiveness

for each -individual subject. Then we ran multiple correlations, first

adding compulsiveness to the interest predictor and then adding their

product to The battery ( 3 ).For three- of,the ten interest predictors studied, the simple addition

of compulsivenes.to the battery 'gave a- significant increase to the

multiple B. In these' three eases the compulsiveness, score happened

to act as a suppressor variable. A moderator variable is dfferent from

a suppressor variable, though they both typically have zero zero-order

relation to the criterion; a moderator variable does not have to have

zero.order correlations with, -even the' predictor.Back to the -example: In five of the remaining -seven insuinees, addi-

tion of the appropriate, product score to each battery of two Measures.

resulted in a further significant increase in the multiple: R. The sign

k--the beta weight for the third term was correctly predicted from the

'hypothesis in :.=i1rten-of the ten instances. You will recall that the

hypethesis -told us.at which end of the compulsiveness scale to look for

.god predictions.These results looked Promising. So Nye moved'thekene of operations

from Princeton to Rochester; from: a .group of self-referred counselees

to: 'a larger grdup tested routinely (hiring Freshman Week; from Strong

Interest .Blanki scored with weighted responses to scoring with unitweights,.The criterion was still freshman grade average for engineer-

ing majors, and the moderator waSistill the Accountant Interest score

of the Strong, as a measure of -compulsiveness. Insofar as the same or

similar interest scores were. available'to try as predictors, we were able

to cross-validate all but one of the statistically significant findings from

Princeton. t. _

1;In this example we have observed that in the More significant

stances, the predictive contribution of the moderator effect is just as

Page 55: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS

contribution of the :interest:'Variable predictor.pea seem like saying much, but note that Frecieriksen and,

latiens .ranging all the way from zero to oversame population, divided ping a pal

# I I, S'S

does not lead to much se in the multiple R, it deei lead to quited erent predictions for some individuals, ankshould be used if itseffect is even statistic fly significant These Predictions may differ instandard error as well ap in the expected value itself.

In the tecond main-example that I wanttto discuss 'here, neither theodepito4 tfor the pncliptar has been shown to have a significant zero-

order ciirre4tion- iath the criterion) In this situation, the predictorand modezator lose their separate identity, and a name like "populationcontiotMuiable- becomes more awkward toese. IiJ this example, thereis no significant multiple R until the prodivt terrO inti-oduced; thenit jumps to values like 45: This example is basedion some of Fiedler's

.recently reported work on the influence of leadelleyman relatiens onsmall group effectiveness. Many of you may have sees this written upin Time magazine recently, even if you missed Fiedler's SPA presenta-tion and haVen't yet seen his report.to the Office of Naval Research (1).

The gist of it is this. Suppose you are the formally designated leader'of a small grouto or team. There is probablyfomeone in the group whois your right-hand manyour principal stiherd&iafe or keyman. If youare the kind of person who gerieially has Warn ,feelings for most peo-ple; for whom shnilaritief areopg other people are more important than

ffer'ences, it will pay yoti and your group for you to maintain a rela-tive aloofness from your group, and especially from your keyrnan. Onthe other hand, if you tend to.think of others you like as being differentfrom those you dislike, -it will p .0 do the opposite, namely for you tocultivate strong sociometric ties ith your group, and especially withyour keYrnani

Here, then, are two variables youparticular leader's effectiveness in a

ed to measure, to predict aicular group. While neither

variable is related to the criterion, their product has aaebstantialega-live correlation with it The results iteppen to be psyelfologically verysensible, and they can prolicibry be used to counsel leaders towards amore effective style of leadership, and tem-edict what groups will re-spond best to given:styles of leadership.

These two examples have been very different in many respects, butthey do have two things in common in addition to their featuring ofimportant moderator effitets.. For one thing', they both feature normcognitive variables as predictors---and as moderator if y still care

Page 56: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

58 1954 INVITATIONAL CONFERENCE.

which is which: It seems to me that we can expect to fuld examples of

this kind relatively more easily, but cognitive examples are still a real

possibility. For instance, if we could obtain a meaningful coe l cient

of reliability for an individual's' test score, ind.ePendentry of the group

w e apPe --terbe-iestedrittested;-probably.anocleilteany predictions made from the lest score itself.,

In the second place, both these examples were initially studied by

breaking a total sample up into sub,groups. There are many studies`

that have been carried this far, and then ceported in tedious detailffor

lack of an organizing concept such as the moderator variab ides.

For the past three years I have made it apoint at the APA eet gc to

seek out studies of thiskind; I have always found my fill wit going

;very far afield, For some reason, studies involving-the "Authoritarian

4 Personality Syndrome" seem to lie m particular need of something like

a moderator variable. But ho more so, I would say, than people who.do

co ural scoring or make clinical judgments on the basis of person-

There is one last topic on my agenda. Assuming=' you have decided

to look for moderator effects, whlit is the best way to go about it?

There are at least four methods to consider. 1. You can start with a

good hypothesis. This is always a good idea, 2. If you don't hasie one

handy, you can look for one by studying'a few eases intensively, and

seeking out variables whose interpretation seems to depend on other

variables. 3. You can do the same thing with larger groups of cases by

looking for sets of sub-groups within which correlations are sign&

candy different. 4. You can do the same thing with items linstead'Of

'variables, by testing the interaction variance of a pair of items against

a criterion..This fourth approach is a whole technique in itself, and I

wish4here were time to tell .you about it. It capitalizes on the electronic

"cornphteri; it can be generalized beyond pairs of items; and it bringsy

the old idea of keying patterns of response to several items into a new

perspective. /REFERENCES

FIEDLER, FRED E. The influences cit leader-keyman relations. on combat crew

effectiveness. Urbana: Croup Effectiveness Research Laboratory, University of

Illinois, June 1954. (Technical Report No. 9, Contrac No. N6-or1.07135.)

2. PREDERIEEN, N., AND MELVILLE, D. Differential predictability in the use of

test:scores. Educ. psychol. Nfeasrut., 14, in press.

3. Airivinns, 13. R. Moderator variables in prediction. Princeton: ETS Research

Bulletin .53-23, O&tober 1953.

Page 57: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

Method of Factoring WithoutCommunafities

59

Ever sin* multiple factor analysis was developed over twenty yearsagh eunknown diagonal cells of the correlation Matrix have been aserious Mem; These-qtre called the corlimunalities and they repre

,

se tha = art of the total variance of a test which it shares withsone'tor more of the other tejts in the battery. For each method of factoringit had heSixf necessary to estimate these diagonal values id the correla-

,....; tic& matrix and various methods have been devised ,for doing so.In a theoretical case one can construct a correlation matrix of orderan rank r where rtk-n. The diagonal entries of such a correlation 144

matrix are ifitep so determined that all minors of order (r ± 1) vaniWhen the order is is larger than, say, 8 or 8, the diagonal values areusually unique-An dealing with experimentally observed correlation

Ificients, the necessarily have variable errors so that aow rank cannot be found in apy exact manner. However, one can

usually write another correlation matrix. with side entries djat are:nearly the same as the experimentallY observed values and which is ofa rank -mtich lower than n so that, for example, r < n/2. This is the

'situation often found in multiple factor studies. -w

Various methods- of factoring the correlation matrix have been de-vised in which the diagonal elements are first estimated. One or thesimple c assign to each diagonal cell a kalue equal to the absolutevalue of e highest entry in the corresponding column or row. Theestimate is revised at ter each factor has been extracted, This rough

'method of estimation is quite successful when the order iirlrIarge,say, about 15 or 20 or higher. Although this procedure is, quite success-ft& for many scientific problems with correlation mOtiices of highorder, it is far fro_ m satisfactory for theoretical formulations of the far-toring problem.

There is an important relation between the commonalities and-thenumber of factors that are used for describing a correlation matrixThe large the number of factors, the higher are the commonalities.The communality of any test variable j is the sum of squares in thecorresponding row of the igi!tnr matrix F. If it is decided that the first

Page 58: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

60 1954 INVITATIONAL CONFERENCE

inadequate for the description of the correlationthen the next principal axis may be determined. This gives

1) columns of the factor matrix. The communality of any test jen augmented by the square of the entry in column (r + 1) in the

Uritirneverknowrrbefoie. hand-how-many-principal-axes-neeetermined in order to reduce the residual correlation coefficients

to values that can be ignored in terms of their sampling errors. Hencewe have the anomalous situation of not knowing at the start of thecomputations how many factors need to be postulated to account forthe correlations. The communality estimate should rise with the num-ber of factors that are extracted but the relation depends entirely onthe unknown configuration of test vectors in each given problem:

In practice the problem is resolved by adjusting the communalityfor each factor that is extracted from the correlations or by rep_ eating

the whole factoring process until the estimated commonalities at thestart agree with the sum of squares of the factor loadings in the rowsof the resulting factor matrix. But this process depends on a certainnumber of factors as determined by the first cycle. The adjustmentshould be done over again if the investigator decides to increase thenumber of factors used When the number of factors is quite large,such as 10 or 15, then the adjustments in the diagonals are ordinarilyquite small for each additional factor.

In trying to relate the theory of multiple factor analysis to practicalscientific work with large correlation matrices, this situation is evi-dently quite unsatisfactory, even though the practical compromiseshave been adequate to resolve most of the scientific problems so far in

the isolation of the components of human intelligence which have been

called primary mental abilities.Several months ago I was sitting in an airplane in Helsinki in Fin-

land, waiting for the take-off for Stockholm. It occurred to me suddenlythat this awkward situation that we have fought and compromised withfor a long time could be resolved in a ridiculously simple way. I shalldescribelhe idea here although we have not yet developed the bestcomputing methods, and I hope that some other students of the multi-ple factor problem will consider the new method and how it might bereduced to the simplest possible computing procedures.

In the experimentally given correlation matrix, the diagonal cellsare Unknowii. They are certainly not experimentally given. Let us de-termine the first column of the factor matrix by the method of leastsquares so that the first factor residuals are a minimum. This is pre-cisely what we do in determining the first principal component byHotelling's iterative method with one important e:cception. This excep-

Page 59: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

. tTESTING. PROBLEMS 61.

t we ignore the =known diagonal cells completely. Sinceunknown, they are not represented in any observation equa-.

only =the experimentally known correlation coe dentsate in the observation equations for the least squares solution. When

oter-matrix--has-beenv-determined-scv-atvtO"or the-side correlationp, then we find the first

slduals for the side entries. The diagonal cells are ignoredy. Then we proceed as before for each additional columnnor matrix ,until the residuals ti the side correlations can

The communalities for this given lumber of factors aresimply the sums of squares of the rows of the factor matrix F.

In this iii-ocidure 'we deterinine the cornmunalities after the factoringjob has been completed. If, for any reason, we decide to reduce theresiduals still fuither, that can be done by extracting another factor.The new commonalities are then merely the sums of the squares of therows of F with another column added. None of the factoring' need berepeated because communality estimates are not involved in thefactoring of the correlation matrix.

It was several weeks until I had the opportunity to try the newmethod iin Frankfurt;, Germany, .where I discussed this method withthree of my former students in cago. They were Dr. Hans Anger,Dr.. Sten Ilenrysson, and Rolf Borgmann. Bergmann set up three testcases and reported that the solution gets dose to the principal axessolution and Dr. Sten Henrysson has reported sknilar findings with themethod in Uppsala. When I return to my laboratory in Chapel Hill,I expect to Investigate the method blither.

The computational procedure can be tried a manner analogousto Hotelling's iterative solution but it is likely that one of several otheralternatives will be more effective. In one manner of writing the prob-km we get third degree normal equations which can be solved bysuccessive apprpximations with additive corrections to the assumedfactor loadingi:

There may be an interesting geometric twist to this problem ni thatthe customary 'geometric model may have to be revised but I am notprepared yet to elaborate on the geometric implications of this prob-lem. The slight disturbance of -the geometrical model for the correla-tion coefficient and the factor matrix can be seed with a theoreticalcase of exact ranresiduals do not vfactoring matrix ob.interpretation ancUthe cas scalar products of the

. When this new method is used, the T th factorh identically.._ are small, however. The

d can still be given the usual geometricalelation coefficients.. are closely representedst vector§ that are defined by the factor

Page 60: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

62 1954 INVITATIONAL CONFERENCE

matrix. It is a question to be investigated whether the disturbance of

the true geometrical model is less important than the disturbance im-

plied in all the adjustments of the conimunalitjes in practical com-

puting procedures.s-to-min _formula for the best fitting

single factor. However, the application of such a formula may lead to

trouble if applied to the residuals for subsequent factors after the first

factor. The, theory of this problem. should also be investigated with

reference to the possible appearance of the Heywood case, So far we

have not encountered it. In several trials we find factorial -solutions

that are close to the principal axes.We hope that other students of fact 'al analysis may be.interesed

to explore this factoring method.

o' 0

Page 61: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS 63

PARTICIPANTS

T. W. ANDERSON, EDWARD E. CURETON, FREDERIC LORD, FRANK ROSENBLATT,

--,P.1.-RuLON,-DAVHaL SAUNIARS, ROBERT L. -THORNDIKE, L. L.THURSTONE, LEDYARD R TUCKER, JOHN W. TUKEY,

JOSEPH ZUBIN

DR. &mix: Dr. Anderson, did you make each probability of responseindependent, or would you allow for interaction, for instance, betweensayulg yes to orte item and no to another? Is the probability of thatPattern based on the product of the original probabilities, or might416e s interaction? .

:-The assumption here is that given the latent structure'score, then t. responses are independent. If one estimates the latent

et fs;-- .thpf writes down the right-hand sides of the equationsndk- they do not correspond so well to the left-hand sides, that

d i cl etc thatIhe assumption is not borne out veryy well Thenone can SitensMerModel so that essentially, instead of basing this onone latent score, your have several latent scores. For a particular

:, ,couplet, let us say, you may have to throw in an extra latent score toaccount for the kind of interaction you are talking about.

Da. Amor.% Are there any other questions? I do not know whetherDr. Anderson made himself so clear that `there are no questions, or soobscure that there a a no questions.

DR. CURE'rON: I h ve a comment that might have a bearing on Dr.Zubin's qbestion. It s to be noted that the number of latent, classesso determined that the responses will be independent within suclasses as far as that can be determined statistically. In other wordi,you simply postulate enough latent classes so that you -do have inde-pendence within each one.of them. ,.

DR. ZITRIN: Is there any way of telling how many subgroups youhave in the overall sample? r

.- ANDERSON: One technique for doing that is to carry over 'whatyou do in factor analysis, again using the analogy between these equa-tions. In factor analysis, there are techni 'ques to determine how many

Page 62: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

1954 iNVi'fiii;

factors there are, and themine how many latentview of the theoretical sworked out, or they doan ys

Dn. Clawrolq: Dr. Sauride*.a"moderator variable," tl fenjud.,p=

to put in the squares of the,stretched from linear topir4blip

Da. SAIINDERS: In gen.done. Again, in gener4.1,ber of parameters that-akgoinmental data In the absenep Of ,SO

these square terms areproach to steer clear of`

Da. THOENDICE;

sed here to deter-orn the point of

flues are not as wellthe ordinary factor

e that you put in theit be equally easy

e direct regressions are

why this cannot beincreasing the num-ed from the experi-

evidence to suggest that'would be a cautious ap-

inded, be sure.a fifth possible way pf

mexploring theseoderat0:varigdpi!iiist.:ri)titinely to compute all the

products a.nd run. thein'.0.4s-41441°01 Variables in your basic predic-

tion enterprise?s. SAtiNninit:Yilliiii.s.ariatitiroach,-and it is a, lot of work.

DE. CuRE-ToNtlotly0x% talking abont electronic "coinputers..

Dit'Roser.ratif&::-If a matrix of variables you could

compute an ordirwbrieat mulftp,10-r for this inaimx, and take the

coOfficient of detertninatiOn, (or r2).. If you. compare this .;cbe dent of

determination with that of a;'Correlation which iicludei the. product

terns, have you any, thought how the difference between these

orreltitions ,stands 'kith respect( tettlie usual troth iii of statistic.

er-

acti8n, coming'qut Of rui'.anarysis of varinnce?It .germs to me a -,the

'eoriesprinclenee'beci5rods-:qiiite -Close- ( i.e., the iriereirrekital .Coirelrition

tine tci thy'. frincierator :Varialiki-,CoiTesponcls cle)selyAn -interaetiMi

Val-brimDli. SA.triii?OtS would say. that it is quite clese..Iric, entelly,

spells out approach that you can take in testing. the sigr4eanee

the producflem You tined to-cornpare these coefficients.pf deterMina-'

thin withandWitlimit'thepfoduct tenns.Do.tiumsfewim This-ise.kaedy the same thing OS we: get in:,1-roteb

ling's'proeedire;:0*Ceptitiat the sum of the squares of theicoetorflOad,.

logs in Xietelling's -*thud is the sum of the.squares in all Oflilie factor.

loadings and the ,CoiTeSponding relations here would include the dia

onals. -ente we-dealOnly with die known values. But if you wri

,.zero in the :diagonals, .:then:this term can be interpreted in an 'aiding.

,kay, This um.Of the squares of all but one of the factor loadingi:

Page 63: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

cTESTING PROBLEMS 133

Psrax-awr, Mildred, Deparnent of Per-` sonnel, New York CityParrt.oiT, -.Robert, Personnel Research

&with; ACOPEru.orr, Evelyn, Board of. Education,

MarylandPERM, William D., University of North

CarolinaPEnursorr, Donald A., Life Ironrance

Agency Management Associatioq.PHELAN, Robert F., Prudential Insur-

ance Company- `

Purason, George A., Queens CollegePrime, Charles F., Educational Test-

Mg ServicePrrouge, Barbara, Educational Testing

ServicePLUM L», L. B., Educationd .Tes4ing

ServicePoinr, A. -Terrence, U. 5; Air Force,

Mitchel AFBPoLLAm, Norman C., New York State

Department of Civil ServicePBEDIDET, Sidney_ L., Ohio State Univer-

sityParsszv, Mrs., Sidney L., Ohio State

UniversityRADDrowrrz, William, Bank Street Col-

lege of Education, New York CityRADASCH, John, Department of Edoca-

tion, Concord, New HampshireRAPPADLLE, John H., ,OwenFlAssnr, Evelyn, Brooklyn CollegeFlAssukr, Judith, Queens CollegeRentoun,`Thomas J., HarvanbUniver-

sityRAT-MOND, Vincent R., Newark College

of EngineeringREAD, Thomas, Peddle School, JlIghis-

town, New JerseyREGAN, James J., Special Devices Cen-

ter, PNR&WILLER, Melvin, Queens Collegeliziows, H. H., Purdue UniversityRironm, W. H., Educational Testing

ServIceRte, Warren A., U. S. Armed Forces

Institute

Riquomoson, M. W., Richardson,lows, Henovand Company, Incorpo-rated 6

RICHARDSON, Ruth P. NYSPA, NewYork City

Rim, J. H., Jr., Psychologiad Coritoration

Jack K, Educational Test-in S

ROBBINS, g, . ueons CollegeROCA, Pab of Education,

Puerto orFlocs, Robert T., Jr., Fordham Univer-

Ros LATT, Frank, Cornell UniversityRosNan, Benjamin, Columbia UniversityAlmon, P. J., Harvard UniversitySADACCA, Robert, Educational Testing

ServiceSA a, David, R., Educational Test

ing ServiceSAIT, Eward, Rensselaer Polytechnic

InstituteSAWIN, E. I., Air Force ROTC Hai-

quarters, MontgomerySCATES, Alice. Y., U. S. Office of Edu-

cation.SCATES, Douglas E., American Social

Rygiene AssociationScnApuro, Harold B., Great Neck, New

YorkScrmAnea, William B., Educational

Testing ServiceScore, C. Winfield, Vocational Counsel-

ing Service, Inc.SEASHORE, Harold G., Psychological

CorporationSEBALD, Dorothy D., Hunter College'SEBALD, J. F., Worthington Corporation''SEIDEL, Dean W., Harvard Univs.rsitySFORZA, Richard F.,, New York State ".

Department of Civil ServiceSHARP, Catherine C., Educational Test-

ing ServiceSuAycorr, Marion F., American Insti-

tute for ResearchSHIMBERG, Benjamin, Educational Test-

ing ServiceSmorwrin, P. T., American Telephone

& Telegraph

1:28

Page 64: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

134

StumnidAri, Harry, City College ofNew York.

SIMPSON, Mrs. Eliz)liseth A., Illinois In-stitute of. Technology

Slant, Allan' B., University of Con-necticut ,

Stun-n . Marjorie B., flunter CollegeSMIT, Jo Anne, Washington, D. C.Snarl, rienzel D., Office of Naval Re-

searchSaari, Louise, Edugationd Testing

ServiceSmint, Muriel, Educational Testing.

ServiceSOLOMON, Robert, Educational Testing

Service s

SORENSON, Garth, University of Cali-fornia

Soupur.n, Mary T., Tower Hill School,WilnOngton

SPANEY, Emma, Queens CollegeSpAuLnwo, Geraldine, Educational Rec-

ords BureauSPEER, George' S., Illinois Institute of

TechnologySPENCER, Douglas, New York CitySTARCH, Daniel, Daniel Starch & &adSTArickt, Mrs. Daniel, Mamaroneck.

New YorkSTEPHAN, Frederick F., Princeton Uni-

versitySTERNBERG, Carl, Queens CollegeS-mwAnT, Mary, New York UniversitySTEwART, Naon, Educational Testing

ServiceSTicz, Glen, Educational Testing Sery

iceS-rozns, Thomas M., Metropolitan Life

Insurance CompanyS-roblz, Paul T. Huntingdon CollegeSTUART, William A., Educational Test-

ing ServiceS-riximum, Harold, Metropolitan Life

Insurance Co.SULISVAN, Daniel S., New York State

Department of Civil ServiceStn.i.rwar, Richard H., Educational

Testing SeniCeSurer, Donald E., Teachers College,

Columbia University

4 INVITATIONAL CONFERENCE

I

Swregnironn, Frances, Educational Test-ing Service

SYMONDS, Percival M., Teachers Col-' lege, Columbia UniversityTASSON Charles A., American CyanamidTATSUOEA, Maurice, Harvard. Univer-

sityTAYLOR, John F., Woodmere AcademyTAYLOR, Justine N., Educational Test-

ing ServiceTENtri.rron, Hugh, New York State

Department of EducationTERRAL, J. E., Educational Testihg

ServiceThORMEZ, Robert L., Teachers Col-

lege, Columbia UniversityTriunsroNp, L. L., University of North

CarolinaTwaxe, Arthur-E., Educatibn Rec-

ords BureauTruces, Frances, Commission on Diag-

nostic Reading TestsTucr.zit, Anthony C., Department of

DefenseTumral, Ledyard R, Educational Test-

ing ServiceTUKEY, PAM W., Princeton UniversityTOXEY, Mrs. John W., Princeton, New

JerseyTuTornm.1., William W, Educational

Testing ServiceTwrrono, Loran C., Special Devices

Center, ONR.ViCEERY., Verna L., Southeastern Lou-

isiana CollegeVoss, Harold A., Special Devices Cen-

ter, ONAWADELL., Blandena C., World Book

Company .w

WAHLCnEN, Hardy L., State Teach_ ersCollege, Geneseo, New York

WALKen.- Helen M., Teachers College,Columbia University

WALLACE, Wimburn, L., PsychologicalCorporation

WALsu, B. Thomas, Personnel Depart-ment, Philadelphia

W_Ai sir; John J., Boston. CollegeWALToN, Wesley, W.-, Educational

Testing Service

Page 65: TITLE tifw York aisTITanolLestinServias%Pzj-'ldetanf · atic 1, ,and the branches of mathematics which they use ar&not well own to.inost workers in educational and`psYchological measurement.

TESTING PROBLEMS 135

WALizislor, Hal, New York State De-parbnunt of Civil Service -

WATS, ON, Walter S.,, Cooper UnionWiu.s..cx, A. A., University of New

MerdcoWarsa, Cecilia T., University of Con-

neeticutWzrazi., Bernice isf., Barnard CollegeWismax,- A. G., Psychological Corpo-.

rationWnl-rx, George L., Silver Burdett

CompanyWirrna, Dean IC., Harvard UniversityWrxramv: Alfred G., Life -Insurance

Agency Management AssociationWrrax," Marverite M., Board 6f Edu-

cation, Greenwich e

Wn rya, Walter H., Nor York Univsity

Wixxs, S. S., P _ceto UniversityWILLAnn, Richard W., rird Univer-

sity

Wn.r.var. ., Columbia Univer-sity ,

Wn.LiAms, Roger 'K., Morgan StateCollege, Baltimore

WnsoN, Kenneth M., Princeton Uni-,varsity

VVnaorr, Phyllis C., Queens CollegeWismts, S. David, New Jersey Depart-

ment of. EducationWnrco, Alfred L., Virginia State De-

partment of EducationWir,immuncrrrora, John A, Educational

Testing ServiceWrrrEranortm, J. R., Rutgers UniversityWourra, Dael, American Association

for the Advancement of ScienceWoon, Ray C., Ohio State Department

of EducationWrucifr, Wilbur H., Teachers College,

Ganeseo, New YorkYocikraN, Harvey, University of DenverZaranin,-Shelclon S., Brooklyn CollegeZurim, Joseph, Columbia Univers10

I

BI5111.5 .