VITA
NAME: Ronald K. Hambleton
HOME ADDRESS:
268 Iduna Lane
Amherst, MA 01002
(413) 253-5344
OFFICE ADDRESS:
Center for Educational Assessment
Hills South/Room 154
University of Massachusetts
Amherst, MA 01003
(413) 545-0262
FAX: (413) 545-4181
e-mail: [email protected]
MARITAL STATUS: Married, two sons
BIRTH DATE: June 27, 1943
BIRTHPLACE: Hamilton, Ontario, Canada
EDUCATION:
B.A. Honors, University of Waterloo, 1966
Major: Mathematics; Minor: Psychology
M.A. University of Toronto, 1967
Major: Psychometric Methods; Minor: Statistics
Ph.D. University of Toronto, 1969
Major: Psychometric Methods; Minor: Computer Science, Statistics
AWARDS AND HONORS:
• Graduate Fellowship, University of Toronto, 1966-1969.
• American College Testing Summer Postdoctoral Fellowship, 1971.
• Research Fellowship, Educational Research Institute of British Columbia, Vancouver, Canada, 1982.
• President, National Council on Measurement in Education, 1989-1990.
• President, International Test Commission, 1990-1994.
• Psychometric Fellowship, University of Twente, The Netherlands, 1991.
• National Council on Measurement in Education Career Achievement Award, 1993.
• University of Massachusetts Chancellor's Medal, 1994.
• Honorary Doctorate, University of Umea, Faculty of Social Sciences, 1994.
• President, Division II, International Association of Applied Psychology, 1998-2002.
• President, Division 5, American Psychological Association, 1996-1997.
• College Outstanding Teacher Award, University of Massachusetts, 1996-1997.
• Appointed Distinguished University Professor, University of Massachusetts, 1998.
• 2003 Association of Test Publishers’ Career Achievement Award.
• Honorary Doctorate, University of Oviedo, Oviedo, Spain, 2003.
• International Test Commission Award for Distinguished Service, 2003.
• E. F. Lindquist Award for Outstanding Research in Assessment (AERA and ACT), 2005.
• University of Massachusetts Award for Outstanding Accomplishments in Research and Creative Activity, 2005.
• Samuel J. Messick Award for Scientific Contributions to the Field of Measurement, Division 5 of APA, 2006.
PROFESSIONAL EXPERIENCE:
Appointments
• Lecturer, Ontario College of Education, University of Toronto, Summers 1968-1972.
• Graduate Assistant, Department of Measurement and Evaluation, The Ontario Institute for Studies in Education, 1966-1969.
• Assistant Professor (1969-1973), Associate Professor (1973-1980), and Professor (1980-1998), Distinguished University Professor (1998-present), University of Massachusetts at Amherst.
• Visiting Professor, School of Business Administration, United States International University, Summer 1976.
• Adjunct Professor, Graduate School of Applied Behavioral Sciences, California American University, 1976-1980.
• Chairperson, Laboratory of Psychometric and Evaluative Research, University of Massachusetts at Amherst, 1973-present.
• Lecturer, George Washington University, Summer 1980.
• Visiting Professor, University of Leiden, The Netherlands, Fall 1981.
• Visiting Scholar, UCLA, Fall 1982.
• Visiting Professor, Technical Teachers' Training Institute, Bhopal, India, Summer 1987.
• Member, National Faculty, Center for the Study of Evaluation, UCLA, 1987-1991.
• Visiting Professor, University of Umea, Sweden, September, 1990, June, 2004.
• Visiting Professor, University of Ottawa, Spring, 1992.
• Executive Director, Center for Educational Assessment, University of Massachusetts, 2004-present.
National/International Committee Work
• Joint AERA-NCME-APA Committee on Test Standards, 1977-1978.
• AERA Publications Committee, 1979-1981.
• APA Psychological Tests and Assessment Committee, 1980-1982.
• APA Division 5 Public Affairs Committee, 1982-1984.
• APA representative to the International Test Commission, 1982-1986.
• NCME Board of Directors, 1983-1986.
• NCME representative to the Joint Committee on Standards for Educational Evaluation, 1984-1987.
• NCME Publications Committee, 1984-1986.
• ETS Blue Ribbon Committee to Evaluate the Mantel-Haenszel Statistic, Spring 1986.
• ETS Advisory Panel on Design of Assessment Services Relating to the Educational Equality Project, Member, 1985.
• International Test Commission, Vice-President, 1986-1990; President, 1990-1994; Past-President, 1994-1998.
• New Jersey High School Proficiency Test Technical Advisory Committee, Chairperson, 1986-present.
• NCME Committee on the Recruitment of Measurement Professionals, Member, 1987.
• NCME Vice-President, 1988-1989; President, 1989-1990; Past-President, 1990-1991.
• NCME Awards Committee, Chairperson, 1989.
• NCME Membership Committee, Chairperson, 1989.
• National Research Advisory Committee to the National Board of Medical Examiners, Member, 1989-1991.
• Technical Review Committee for the National Adult Literacy Project, Member, 1990-1993.
• Division 5, APA Workshops Committee, Member, 1990-1991.
• National Assessment of Educational Progress (NAEP) Technical Advisory Committee, Member, 1990-1994.
• NAGB-ACT Technical Advisory Committee to the NAEP Achievement-Level Setting in Mathematics, Reading, and Writing, Member, 1991-2000.
• European Conference on Educational Research, Research Methodology, and Evaluation Research, Program Co-Chairperson, 1992.
• National Board for Professional Teaching Standards, Technical Analysis Group, Member, 1992-1996.
• International Association of Applied Psychology, Division 2, Executive Committee, Member, 1992-1996.
• NCME International Measurement Issues Committee, Member, 1992-1994.
• National Board of Medical Examiners John P. Hubbard Award Committee, Member, 1993, 1994.
• International Committee to Develop Guidelines for Adapting Instruments and Establishing Score Equivalence, Chairperson, 1992-2000.
• Professional Examination Service, Board of Directors, 1994-1999.
• NCME Instructional Modules Committee, Member, 1994-1998.
• Massachusetts Assessment Advisory Committee, Member, 1994-1997.
• European Association of Psychological Assessment Awards Committee, Member, 1994-1995.
• KIRIS National Technical Review Committee, Chairperson, 1994-1995.
• Technical Advisory Committee, Graduate Record Examinations Program, Member, 1995-1997.
• Board on International Comparative Studies in Education, National Research Council, Member, 1995-1998.
• Technical Advisory Panel, Department of Defense Education Activity, Member, 1995-1996.
• NAEP Design and Feasibility Committee, National Assessment Governing Board, Member, 1996.
• National Council on Measurement in Education Student Dissertation Awards Committee, Chair, 1997-1998.
• National Council on Measurement in Education Nominations Committee, Member, 1997.
• International Advisory Committee to the Swedish Scholastic Aptitude Testing Program, Member, 1992-present.
• Technical Advisory Committee on Computer-Based Exams, British Columbia Department of Education, Member, 1996-1998.
• Technical Advisory Committee to the Early Childhood Longitudinal Study, U.S. Department of Education, Member, 1996-1999.
• Committee to Develop International Guidelines on Core Standards for Test Use, International Test Commission, Member, 1996-1999.
• Technical Review Panel for the Computerization of the USMLE, National Board of Medical Examiners, Member, 1996-2000.
• Scientific Advisory Board to the National Institute for Testing and Evaluation, Israel, Member, 1996-present.
• IAAP Division 2 1998 Program Committee, Member, 1996-1998.
• Technical Review Panel for the Standardized Patient Project, National Board of Medical Examiners, Member, 1996-2001.
• AIR Technical Advisory Committee for the Volunteer National Test, Member, 1997-2000.
• National Research Council Committee on Embedding Common Test Items in State and District Assessments, Member, 1999.
• Massachusetts Department of Education Technical Advisory Committee, 1997-2003.
• Virginia Department of Education Technical Advisory Committee, Chairperson, 1999-present.
• Florida Department of Education Technical Advisory Committee, 1998-1999.
• Wisconsin Department of Education Technical Advisory Committee, 1998-2002.
• New York Department of Education Blue Ribbon Committee on English Language Arts, 2002-2005.
• Program Committee of the Joint European Conference of the IACCP and the ITC, Graz, Austria, 1996-1999.
• Cultural Review Panel, OECD/PISA 2000 Project to Assess School Achievement in 30 Countries, Chairperson, 1999.
• GMAT Research Policy Task Force, Member, 1999-2000.
• New York State Career and Technical Education Advisory Group, Member, 1999-present.
• NIMH Project to Develop and Validate a Consumer Mental Health Outcome Measure, Consultant, 1999-present.
• Virginia Technical Advisory Committee, Chairperson, 1999-present.
• Technical Review Committee for the Maryland Testing Program, Chairperson, 1999-2000.
• National Technical Analysis Group (TAG-2), National Board for Professional Teaching Standards, Member, 1996-2003.
• Psychometric Oversight Committee, American Institute of Certified Public Accountants, Chairperson, 1999-present.
• Assessment Advisory Committee, South Africa, Member, 2000-present.
• National Research Council Committee on Embedding Items in Assessments, Member, 1999.
• Pennsylvania Department of Education Technical Advisory Committee, Member, 1996-present.
• Selection Committee for the Medical College of Canada’s Outstanding Achievement in the Evaluation of Clinical Competence Award, Member, 2001-2003.
• National Cancer Institute, Cancer Outcomes Measurement Working Group, Member, 2001-2002.
• Delaware Department of Education, Technical Advisory Committee, Member, 2001-2003.
• Advisory Committee to the West Virginia Department of Education, Member, 2000-2001.
• Advisor to the Connecticut Department of Education on Standard Setting, 2001.
• AERA International Relations Committee, Member, 2002-2005.
• Department of Health and Human Services Project to Develop a Consumer Mental Health Outcomes Measure, Consultant, 1997-2003.
• SRI International Project to Evaluate the Performance Standards in Washington State, 2003-2005.
• National Council on Measurement in Education Career Award Committee, 2002-2004, 2005-present.
• HEM National Technical Advisory Committee, Member, 2003-2007.
• SHL Scientific Advisory Board, Member, 2003-present.
• Educational Quality and Accountability Office, Ontario Department of Education Technical Advisory Committee Member, 2003-2004.
• Alaska Department of Education, Technical Advisory Committee Member, 2004-present.
• National Board of Medical Examiners, Center for Innovation Advisory Committee Member, 2005-2007.
• Medical Council of Canada Award for Outstanding Achievement Committee, Member, 2003-2005.
• Center for Applied Linguistics Test Design Committee Member, 2005-2006.
• 9th European Congress of Psychology, International Advisory Board, 2004-2005.
• Center on Outcomes, Research and Education, Northwestern University, Project to Refine and Standardize Health Literacy Assessment, Consultant, 2005-2008.
• Technical Advisory Committee, PISA, Chairperson, 2005.
• National Board of Osteopathic Medical Examiners, Consultant, 2004-2005.
• NIH Statistical Co-ordinating Center for PROMIS, Consultant, 2005-present.
• Ordinate Corporation, Consultant, 2005.
• Harcourt Education Measurement Project with EQAO, Ontario, Consultant, 2004-2005.
• NCEO/University of Minnesota Technical Work Group, Member, 2006-2010.
• APA Divisions 5 and 52 Task Force to Improve Quantitative Skills Training in Cross-Cultural Psychology, 2006-present.
• Medical Council of Canada’s Examination Development Advisory Committee, Member, 2006-present.
• Assessment Strategies Inc., Consultant, 2006-present.
• IAAP Division 2, Secretary-Treasurer, 2007-present.
• Institute of Education Sciences Statistics and Modeling Scientific Review Panel, Member, 2007-2009.
• Pearson Advisory Board, 2007-present.
• American Psychological Association Psychological Tests and Assessment Committee, Member, 2008-2010.
• Washington Advisory Group on Assessment of English Language Learners, 2007.
• Puerto Rico NAEP Technical Panel, 2008-present.
Consulting Activities
- School Districts
Cincinnati, Cleveland, OH; Amherst, Barre, Billerica, Concord, Holyoke, Lowell, Westfield, Worcester, MA; Providence, RI; Baltimore, Hagerstown, Montgomery County, MD; Kamehameha Schools, Honolulu, HI; Manhasset, New York, Rochester, Port Washington, NY; Houston, Dallas, TX; Glendale, AZ; Newark, DE; New York City; Warren Hills, NJ; Los Angeles, CA; Atlanta, GA; Baton Rouge, LA; Suffield, CT; Hampton, ME; Charleston, SC; Philadelphia, PA; Washington, DC; Tulsa, OK
- State and Provincial Departments of Education
Alabama, Alaska, California, Connecticut, Delaware, Florida, Georgia, Hawaii, Kentucky, Louisiana, Maryland, Massachusetts, Michigan, New Jersey, New Mexico, New York, Pennsylvania, Rhode Island, Texas, Virginia, West Virginia, Wisconsin, British Columbia, Ontario, Quebec, Alberta
- International
Australia, Canada, England, France, Germany, India, Indonesia, Israel, Italy, Japan, The Netherlands, Saudi Arabia, Scotland, Singapore, Spain, Swaziland, Sweden, Taiwan
- Professional Exams
Federation of State Boards of Physical Therapy; Institute of Banking, Saudi Arabia; Municipal Securities Rulemaking Board; National Association of Security Dealers; New York Stock Exchange; American Institute of Certified Public Accountants; National Board of Medical Examiners; American Board of Family Practice; American Board of Internal Medicine; Law School Admissions Council; National Association of Purchasing Management; National Center for Health Education; Canadian Nursing Association; National Commission for Health Certifying Agencies; Educational Services for the Professions; American Dental Association; Professional Examination Service; Certified Systems Professionals; IOX Associates; Graduate Management Admission Council; The Medical Council of Canada; Educational Commission for Foreign Medical Graduates; National Board of Chiropractic Examiners
- Industry
Xerox; Polaroid Corporation; American Telephone & Telegraph; GM/UAW; Hewlett-Packard; Hoffman-Roche; Microsoft; RAND; Simplex Time Recorder; Westat
- Other
Abt Associates; American College Testing Program; American Council of Learned Societies; American Council on Education; American Institutes for Research; Antioch University; Brown University; Buros Institute; Educational Collaborative for Greater Boston, Inc.; Educational Testing Service; Educational Development Corporation; Educational Quality and Accountability Office, Province of Ontario; Erlbaum Publishers; Foreign Service Institute; Harcourt Educational Measurement; Harper and Row; HumRRO; Institute for International Research; International Education Associates; Kluwer Academic Publishers; Manpower Demonstration Research Corporation; Mathematica Policy Research; Mediax; National Assessment Governing Board;
National Center for Education Statistics; National Institute of Education; National Opinion Research Center; New England Research Institute; Northwest Regional Educational Laboratory; Nuclear Power; Office of Educational Research and Improvement, U.S. Dept. of Education; Office of Technology Assessment - U.S. Congress; Pelavin Associates; Riverside Publishing Company; RMC; Sage Publications; SHL Group, Inc.; Springer-Verlag Publishers; SRI International; Teaching Resources; UNESCO; University of Indiana Medical School; U.S. Army; U.S. Air Force; WICAT Systems
Reviewing Activities
• Reviewer to the AERA Division D Program Committee. (1972, 1975, 1979-present)
• Reviewer to the APA Division 5 Program Committee. (1991-present)
• Occasional Reviewer for Psychometrika, Review of Educational Research; Curriculum
Theory Network; Educational Psychologist; American Educational Research Journal; Canadian Journal of Education; Psychological Bulletin; Social Science Research; Educational Researcher; Educational Evaluation and Policy Analysis; Journal of Applied Psychology; Journal of Cross-Cultural Psychology; American Psychologist; Journal of Experimental Psychology; Educational Measurement: Issues and Practice; Research Quarterly for Exercise and Sport; Linguistics and Education; European Journal of Psychological Assessment, Educational Assessment; Archives of Clinical Neuropsychology.
• Advisory Editor to the Journal of Educational Measurement. (1972-1980)
• Co-Chairperson of the NERA-NCME Program Committee. (1972, 1973)
• Editorial Consultant to Review of Research in Education. (1982)
• Advisory Editor to Applied Psychological Measurement. (November 1976-present)
• Associate Editor to Journal of Educational Statistics. (1981-1989)
• Book Review Editor to Journal of Educational Measurement. (1984-1986)
• Advisory Editor to Evaluation and the Health Professions. (1987-1997)
• Advisory Editor to Educational and Psychological Measurement. (1988-present)
• Advisory Editor to Revista Portuguesa Educacao. (1986-1998)
• Advisory Editor to Psicothema. (1989-present)
• Editorial Consultant to Educational Measurement. (1989, 3rd edition)
• Advisory Editor to Sage's Measurement Methods for the Social Sciences. (1988-2002)
• Advisory Editor to the Journal of Educational Measurement. (1988-1992)
• APA Division 15, National Advisory Committee to the Handbook of Educational Psychology. (1989-1994)
• Editor to Instructional Topics in Educational Measurement Series, NCME. (1990-1991)
• Consulting Editor to Multivariate Behavioral Research. (1990-present)
• Advisory Editor to Applied Measurement in Education. (1990-present)
• Associate Editor to European Journal of Psychological Assessment. (1993-present)
• Advisory Editor to Educational Research Quarterly. (1993-present)
• Advisory Editor to Instructional Topics in Educational Measurement Series. (1997-1999)
• Advisory Editor to Current Issues in Education. (1999-present)
• Consulting Editor to the International Journal of Testing. (1999-present)
• Advisory Editor to Indian Journal of Vocational Education. (2001-present)
• Advisory Editor to Metodología de las Ciencias del Comportamiento. (2002-present)
• Advisory Editor to European Journal of Methodology. (2004-present)
• Advisory Editor to Psychology Science. (2006-present)
Miscellaneous Professional Activities
• Invited speaker at Educational Testing Service, University of Alberta, University of Delaware, National Institute of Education, University of Stirling, University of Montreal, North Texas State University, Tulsa Reading Council, University of Connecticut, University of Giessen, University of Ottawa, Miami-Dade Community College, Michigan Educational Research Association, Ontario Institute for Studies in Education, Scottish Council for Educational Technology, University of Leiden, UCLA, Scottish Council of Educational Research, London University, University of Maryland School of Nursing, U.S. Army (20 workshops), Congressional Hearings on Uses of Achievement Scores, Plymouth University, British Post Office, University of Wisconsin, National Board of Medical Examiners, University of Hawaii, University of Amsterdam, University of Twente, Free University of Amsterdam, Florida Educational Research Association.
• Instructor, 1977, 1978, 1979, 1980, and 1981 Two-Day AERA Training Programs
entitled, "Introduction to Criterion-Referenced Testing and Measurement."
• Member, Advisory Board for the Johns Hopkins University Symposium on Educational Research, 1977-1982.
• Instructor, Invitational Seminar on Methods of Mental Measurement, Plymouth, England,
September, 1987.
• Instructor, UNESCO sponsored psychometric methods course, Bhopal, India, July, 1987.
• Instructor, Invitational Seminar on Advanced Psychometric Methods, National Institute for Testing and Evaluation, Jerusalem, Israel, January, 1989.
• Participant and reviewer, U.S. Department of Education's Assessment of Student
Learning in Post-Secondary Education Workshop, November 15-17, 1991.
• Consultant to the Cross-European Longitudinal Study of Aging, 1995-2000.
• University Human Subjects Review Committee, 1972-1974.
• University Research Council, 1972-1974.
• School of Education Personnel Committee, Co-Chairperson, 1974.
• School of Education Dean Search Committee, 1975-1976.
• EPRA Division Personnel Committee, Chairperson, 1983.
• University Committee to Evaluate Teaching, 1984.
• University Graduate Fellowship Awards Committee, 1987, 1988.
• School of Education Dean Search Committee, 1987.
• School of Education Task Force on Governance, 1989.
• Laboratory of Psychometric and Evaluative Research Program, Chairperson, 1973-present.
• School of Education Dean Search Committee, 1994.
• School of Education Dean Evaluation Committee, 1998.
• Provost’s Distinguished Professor Committee, Chairperson, 1999-2003.
• EPRA Department Academic Matters Committee, Chairperson, September 2002-present.
• Center for Educational Assessment, Co-Director, 2000-2004.
• Center for Educational Assessment, Executive Director, 2004-present.
RESEARCH AND EVALUATION CONTRACT AND GRANT AWARDS:
• University of Massachusetts Faculty Research Grant (Comparative Study of Test Administration Procedures and Scoring Methods with Achievement Tests), 1970.
• Massachusetts Division of Special Education Grant (An Evaluative Study of In-Service Teacher Training), 1976.
• National Institute of Education Basic Skills Research Grant (Psychometric and Statistical Contributions to the Theory and Practice of Criterion-Referenced Testing), 1976-1977.
• Air Force Contract (Applications of Latent Trait Theory to the Development of Norm-Referenced and Criterion-Referenced Tests), 1977-1978.
• Air Force Contract (Latent Trait Model Contributions to Criterion-Referenced Testing Technology), 1979-1980.
• National Assessment of Educational Progress (Utilization of Latent Trait Models with NAEP Exercise Results), 1982.
• Air Force Contract (Construction and Validation of Air Force Specialty Diagnostic Achievement Tests), 1984-1988.
• Massachusetts Department of Education Contract (Programs to Assist School Districts in Collecting and Using Achievement Test Data), 1987-1988.
• Chapter 636 Program Evaluation Contract (Evaluation of the Worcester Chapter 636 Programs), 1988.
• Chapter 636 Program Evaluation Contract (Evaluation of the Worcester Chapter 636 Programs), 1988-1989.
• NY DOE Contract (Mantel-Haenszel Item Bias and IRT Analyses), 1989.
• Institute for International Research (Development of Criterion-Referenced Tests in Swaziland), 1990-1994.
• Graduate Management Admission Council (Solving GMAT Technical Problems with IRT Models), 1990-1994.
• Indonesian Ministry of Education (Four-Month Psychometric Training Program for Educators), 1991.
• National Science Foundation (Methods of Setting Standards on Performance Assessments in State Wide Assessment Contexts), 1995-1998.
• Law School Admissions Council (Assessing Item Difficulty with Anchor-Based Methods and Bayesian Statistics), 1996-1998.
• National Assessment of Educational Progress (Enhancing Score Reporting), 1996-1997.
• Massachusetts Department of Education (Psychometric Analyses of the MCAS), 1998-1999.
• Microsoft, Inc. (Computer-Based Test Examinations), 1998-present.
• Harcourt Educational Measurement (Psychometric Analyses on State Assessment Data), 2000-2003.
• Massachusetts Department of Education (MCAS Validity Studies), 2002-2004.
• Measured Progress (MCAS Research and Validity Studies), 2004-present.
• College Board (Enhancements in Score Reporting), 2006-2008.
• Pearson Educational Measurement (Validity Studies), 2007-present.
COMPUTER PROGRAMMING EXPERIENCE:
• Many years of experience writing computer programs. Programs written include:
Hambleton, R. K. Computation of Swineford's tendency to gamble scores, Fortran IV program for the IBM 7094 Computer. Department of Measurement and Evaluation, the Ontario Institute for Studies in Education, 1969.
Hambleton, R. K. Computation of information curves and efficiency of three logistic test
models, Fortran IV program for the CDC 3600 Computer. Center for Educational Research, School of Education, University of Massachusetts at Amherst, 1970.
Hambleton, R. K. Estimating observed-score distributions using logistic test models,
Fortran IV program for the CDC 3600 Computer. Center for Educational Research, School of Education, University of Massachusetts at Amherst, 1970.
Hambleton, R. K., & Barbuto, P. F. (1971). A computer program for optimal scaling.
Behavioral Science, 16, 413.
Hambleton, R. K., & Rovinelli, R. (1973). A Fortran IV program for generating examinee response data from logistic test models. Behavioral Science, 17, 73-74. (Revised, September 1990)
Hambleton, R. K., & Rovinelli, R. A computer simulation program for item-examinee
sampling. Center for Educational Research, School of Education, University of Massachusetts at Amherst, 1971.
Hambleton, R. K., & Traub, R. E. An individual differences model for multi-dimensional
scaling, Fortran IV program for the IBM 7094 Computer. Department of Measurement and Evaluation, The Ontario Institute for Studies in Education, 1969.
Liang, T., Han, K. T., & Hambleton, R. K. (in press). ResidPlots-2: Computer software
for IRT graphical residual analyses. Applied Psychological Measurement.
Murray, L., Hambleton, R. K., & Simon, R. A Fortran IV program to carry out residual analyses for logistic test models. Laboratory of Psychometric and Evaluative Research, School of Education, University of Massachusetts at Amherst, 1982. (Revised, June 1988)
Rogers, H. J., & Hambleton, R. K. A program to conduct IRT item bias investigations.
Laboratory of Psychometric and Evaluative Research, School of Education, University of Massachusetts at Amherst, 1987.
Rogers, H. J., & Hambleton, R. K. (1994). MH: A Fortran V program to compute the
Mantel-Haenszel statistic for detecting differential item functioning. Educational and Psychological Measurement, 54(1), 101-104.
Rovinelli, R., & Hambleton, R. K. (1972). A general Fortran IV program for the
analysis of semantic differential data. Behavioral Science, 17, 74.
Sheehan, D. S., & Hambleton, R. K. (1974). A general Fortran IV test-scoring program.
Educational and Psychological Measurement, 34, 169-171.
TEACHING INTERESTS:
• Principles of Educational and Psychological Testing, Modern Assessment Practices, Classical Test Theory and Practices, Item Response Theory and Applications, Educational Research Methods, Advanced Measurement Seminar.
PROFESSIONAL AFFILIATIONS:
• American Educational Research Association
• American Psychological Association (Fellow of Divisions 5 and 15)
• International Association of Applied Psychology
• National Council on Measurement in Education
• Northeastern Educational Research Association
• Psychometric Society
• Canadian Educational Research Association
• British Psychological Society
COMPLETED STUDIES:
(a) Dissertations
The effects of item order and anxiety on test performance and stress. Unpublished masters thesis, University of Toronto, 1968.
Empirical investigation of the Rasch test-theory model. Unpublished doctoral dissertation, University of Toronto, 1969.
(b) Publications
Allalouf, A., Hambleton, R. K., & Sireci, S. (1999). Identifying the causes of DIF in translated verbal items. Journal of Educational Measurement, 36(3), 185-198.
Avis, N. E., Smith, K. W., Hambleton, R. K., et al. (1996). Development of the
multidimensional index of life quality: a quality of life measure for cardiovascular disease. Medical Care, 34(11), 1102-1120.
Avis, N. E., Smith, K. W., Mayer, K. H., Swislow, L., & Hambleton, R. K. (2001). The
multidimensional quality of life questionnaire for persons with HIV/AIDS: Development and evaluation (Final Report). Newton, MA: NERI.
Bartram, D., & Hambleton, R. K. (Eds.). (2006). Computer-based testing and the
internet: Issues and advances. New York: Wiley.
Boulet, J., Friedman, M., Hambleton, R. K., Burdick, W., & Ziv, A. (1997). Assessing the adequacy of the post-encounter written scores in standardized patient exams. In A. Scherpbier, C. van der Vleuten, & J. Rethans (Eds.), Proceedings of the Seventh Ottawa Conference on Medical Education (pp. 410-412). Dordrecht, The Netherlands: Kluwer Academic Publishers.
Boulet, J. R., Friedman Ben-David, M., Hambleton, R. K., Burdick, W., Ziv, A., & Gary,
N. E. (1998). An investigation of the sources of measurement error in the post-encounter written scores from standardized patient examinations. Advances in Health Sciences Education, 3, 89-100.
Boulet, J. R., McKinley, D. W., Whelan, G. P., & Hambleton, R. K. (2003). Quality
assurance methods for performance-based assessments. Advances in Health Sciences Education, 8, 27-47.
Boulet, J. R., McKinley, D. W., Whelan, G. P., & Hambleton, R. K. (2003). The effect
of task exposure on repeat candidate scores in a high stakes performance assessment. Teaching and Learning in Medicine, 15, 227-232.
Boulet, J. R., McKinley, D. W., Whelan, G. P., van Zanten, M., & Hambleton, R. K.
(2002). Clinical skills deficiencies among first-year residents: Utility of the ECFMG clinical skills assessment. Academic Medicine, 77, S33-S35.
Bourque, M. L., & Hambleton, R. K. (1993). Measurement issues in setting standards
on NAEP. Measurement and Evaluation in Counselling and Development, 26(1), 41-47.
Caban, J. P., Hambleton, R. K., Coffing, D. G., Conway, M. T., & Swaminathan, H.
(1978). Mental imagery as an approach to spelling instruction. Journal of Experimental Education, 46, 15-21.
Clauser, B., Mazor, K., & Hambleton, R. K. (1993). The effects of purification of the
matching criterion on the identification of DIF using the Mantel-Haenszel procedure. Applied Measurement in Education, 6, 269-280.
Clauser, B., Mazor, K. M., & Hambleton, R. K. (1994). The effects of score group width
on the Mantel-Haenszel procedure. Journal of Educational Measurement, 31(1), 67-78.
Clauser, B. E., Mazor, K., & Hambleton, R. K. (1991). The influence of test
homogeneity on the identification of DIF test items using the Mantel-Haenszel procedure. Applied Psychological Measurement, 15(4), 353-359.
de Gruijter, D. N. M., & Hambleton, R. K. (1983). Using logistic test models in
criterion-referenced test item selection. In R. K. Hambleton (Ed.), Applications of item response theory. Vancouver, BC: Educational Research Institute of British Columbia.
de Gruijter, D. N. M., & Hambleton, R. K. (1984). On problems encountered using
decision theory to set cut-off scores. Applied Psychological Measurement, 8, 1-8.
de Gruijter, D. N. M., & Hambleton, R. K. (1984). Reply to van der Linden's "Thoughts
on the Use of Decision Theory to Set Cut-off Scores." Applied Psychological Measurement, 8, 19-20.
Fernandez-Ballesteros, R., Hambleton, R. K., & van de Vijver, F. (1999). EXCELSA
protocol adaptation procedures. In J. J. F. Schroots, R. Fernandez-Ballesteros, & G. Rudinger (Eds.), Aging in Europe (pp. 169-184). Amsterdam: IOS Press.
Friedman, M., Boulet, J. R., Burdick, W. P., Ziv, A., Hambleton, R. K., & Gary, N. E.
(1997). Issues of validity and reliability concerning who scores the post-encounter patient progress note. Academic Medicine, 72(10), 579-581.
Gifford, J. A., & Hambleton, R. K. (1981). Construction and use of criterion-referenced
tests in program evaluation studies. Academic Psychology Bulletin, 3, 411-436.
Goodman, D., & Hambleton, R. K. (2004). Student test score reports and interpretive
guides: Review of current practices for future research. Applied Measurement in Education, 17, 145-220.
Goodman, D., & Hambleton, R. K. (2005). Some misconceptions about large-scale
educational assessments. In R. Phelps (Ed.), Defending standardized testing (pp. 91-110). Mahwah, NJ: Erlbaum.
Gorth, W. P., & Hambleton, R. K. (1972). Measurement considerations for criterion-
referenced testing and special education. Journal of Special Education, 6, 303-314.
Green, L. W., Cook, T., Doster, M. E., Fors, S. W., Hambleton, R. K., Smith, A., &
Walberg, H. J. (1985). Thoughts from the School Health Education Evaluation Advisory Panel. Journal of School Health, 55, 300.
Gumpert, R., & Hambleton, R. K. (1979). Situational leadership: How Xerox managers
fine tune managerial styles to employee maturity and task needs. Management Review, 6, 303-314.
Haley, S. M., Ni, P., Hambleton, R. K., Slavin, M. D., & Jette, A. M. (2006). Computer-
adaptive testing improves accuracy and precision of scores over random item selection in a physical functioning item bank. Journal of Clinical Epidemiology, 59, 1174-1182.
Hambleton, R. K. (1973). Collection of various psychometric and technological area
bibliographies. JSAS Catalog of Selected Documents in Psychology, 3, 93. (240 pages)
Hambleton, R. K. (1974). Assessing student progress: A criterion-referenced
measurement approach. In D. W. Allen & J. Hecht (Eds.), Controversies in education (pp. 370-376). New York: Saunders.
Hambleton, R. K. (1977). Some comments on Aikenhead's "New Methodology for Test
Construction." Journal of Research in Science Teaching, 14, 473-474.
Hambleton, R. K. (1978). Development and validation of criterion-referenced tests and
using and reporting of test score information for classroom teachers. Proceedings of the Fifth Annual Conference on Measurement and Evaluation. Los Angeles: Los Angeles County Public Schools.
Hambleton, R. K. (1978). On the use of cut-off scores with criterion-referenced tests in
instructional settings. Journal of Educational Measurement, 15, 277-290.
Hambleton, R. K. (1979). Latent trait models and applications. In R. E. Traub (Ed.), New directions for testing and measurement: Analysis of test data (pp. 13-32). San Francisco: Jossey-Bass.
Hambleton, R. K. (1980). Test score validity and standard-setting. In R. Berk (Ed.),
Criterion-referenced testing: State of the art. Baltimore: Johns Hopkins University Press.
Hambleton, R. K. (1980). Latent ability scales: Interpretations and uses. In S. Mayo
(Ed.), New directions for testing and measurement: Interpreting test scores (pp. 73-97). San Francisco: Jossey-Bass.
Hambleton, R. K. (Ed.). (1980). Contributions to criterion-referenced testing
Hambleton, R. K. (1982). Latent trait model contributions to criterion-referenced testing technology (Final Report F33615-79-C-0020). Lowry AFB: Air Force Human Resources Laboratory.
Hambleton, R. K. (1982). Utilization of item response models with NAEP exercise
results (Final Report). Washington, DC: National Institute of Education.
Hambleton, R. K. (1982). Competency-based education. The World Book Encyclopedia. Chicago: World Book-Childcraft International, Inc.
Hambleton, R. K. (1982). Advances in criterion-referenced testing technology. In C.
Reynolds & T. Gutkin (Eds.), Handbook of school psychology. New York: Wiley.
Hambleton, R. K. (1983). Application of item response models to criterion-referenced
Hambleton, R. K. (Ed.). (1983). Applications of item response theory. Vancouver, BC: Educational Research Institute of British Columbia.
Hambleton, R. K. (1984). Criterion-referenced measurement. In T. Husen & T. N.
Postlethwaite (Eds.), International encyclopedia of education: Research and studies. New York: Pergamon Press. (Reprinted in M. Eraut [Ed.], The international encyclopedia of educational technology. New York: Pergamon Press. Reprinted in J. P. Keeves [Ed.], Educational research, methodology, & measurement: An international handbook. New York: Pergamon Press, 1988.)
Hambleton, R. K. (1984). Validating the test scores. In R. Berk (Ed.), A guide to criterion-referenced test construction (pp. 199-230). Baltimore, MD: The Johns Hopkins University Press.
Hambleton, R. K. (1984). Determining suitable test lengths. In R. Berk (Ed.), A guide
to criterion-referenced test construction (pp. 144-168). Baltimore, MD: The Johns Hopkins University Press.
Hambleton, R. K. (1984). Using microcomputers to develop tests. In M. Hiscox, & E.
Bryzezinski (Eds.), Educational measurement: Issues and practice, 3, 10-14.
Hambleton, R. K. (1984). Item response theory. Professional Examination Service Quarterly Newsletter. New York: Professional Examination Service.
Hambleton, R. K. (1984). Commentary. Professions Education Researcher Notes, 6, 9-10.
Hambleton, R. K. (1985). New technical advances in measurement for certification exams. In Proceedings of the National Conference on Continuing Competence Assurance in the Health Professions (pp. 102-110). Washington, DC: The National Commission for Health Certifying Agencies.
Hambleton, R. K. (1985). A review of the Nelson-Denny Reading Test. In R. C.
Sweetland & D. N. Keyser (Eds.), Test critiques: Volume III. Kansas City: Test Corporation of America. (Reprinted in R. C. Sweetland and D. N. Keyser [Eds.], Test Critiques Applied Topics. Kansas City: Test Corporation of America, 1988.)
Hambleton, R. K. (1985). Criterion-referenced assessment of individual differences. In
C. Reynolds & V. L. Willson (Eds.), Methodological and statistical advances in the study of individual differences (pp. 393-424). New York: Plenum Press.
Hambleton, R. K. (1986). The validity of NAPM's Certified Purchasing Management
process. Journal of Purchasing and Materials Management, 2-10.
Hambleton, R. K. (1986). The changing conception of measurement: A commentary.
Applied Psychological Measurement, 10, 415-421.
Hambleton, R. K. (Ed.). (1986). Standards for educational and psychological testing: Six reviews. Journal of Educational Measurement, 23(1), 83-98.
Hambleton, R. K. (1987). Computerized adaptive testing: Theory, applications, and
standards. Bulletin of the International Test Commission, 14, 5-18.
Hambleton, R. K. (1987). The three-parameter logistic model. In D. L. McArthur (Ed.), Alternative approaches to the assessment of achievement (pp. 129-158). Boston: Kluwer Academic Publishers.
Hambleton, R. K. (1987). Evaluating criterion-referenced tests. ERIC Digest Series.
Princeton, NJ: ERIC Clearinghouse of Tests, Measurement, and Evaluation.
Hambleton, R. K. (1987). Determining optimal test lengths with a fixed total testing time. Educational and Psychological Measurement, 47, 339-347.
Hambleton, R. K. (1988). A review of Iowa Tests of Basic Skills, Forms G and H. In D.
J. Keyser & R. C. Sweetland (Eds.), Test critiques: Volume VI. Kansas City: Test Corporation of America. (Reprinted in D. J. Keyser and R. C. Sweetland [Eds.], Test Critiques Applied Topics. Kansas City: Test Corporation of America, 1988.)
Hambleton, R. K. (1989). Principles and applications of item response theory. In R. L.
Linn (Ed.), Educational measurement (3rd edition, pp. 147-200). New York: Macmillan.
Hambleton, R. K. (Ed.). (1989). Applications of item response theory. International
Journal of Educational Research, 13, 121-220.
Hambleton, R. K. (1991). Issues to be considered in the content validity portions of RFPs for large-scale assessment programs. In P. Aschbacher & E. L. Baker (Eds.), Improving large-scale assessment. Los Angeles, CA: Center for Research on Evaluation, Standards and Student Testing, UCLA.
Hambleton, R. K. (1989). Item response theory models and methods for measurement in
exercise science and sport. In M. J. Safrit (Ed.), Measurement theory and practice in exercise science and sport (pp. 1-29). Madison, WI: University of Wisconsin Press.
Hambleton, R. K. (1989). Constructing tests with item response models: A discussion of
methods and two problems. Bulletin of the International Test Commission, 16, 96-106.
Hambleton, R. K. (1989). Preparation of exam items for the Uniform CPA Examination
(Final Report). New York: American Institute of Certified Public Accountants.
Hambleton, R. K. (1989). Portrait, notice biographique et bibliographique. Revue de Psychologie Appliquée, 39(4), 309-323.
Hambleton, R. K. (1990). Other objective formats. In AICPA, Uniform CPA
examination item writer's guide (Chapter 3, pp. 22-43). New York: American Institute of Certified Public Accountants.
Hambleton, R. K. (1990). Setting achievement levels for the 1990 NAEP mathematics
assessment: Handbook for judges. Washington, DC: National Assessment Governing Board.
Hambleton, R. K. (1990). Criterion-referenced testing methods and practices. In T.
Gutkin & C. Reynolds (Eds.), Handbook of school psychology (2nd ed.; pp. 388-414). New York: Wiley.
Hambleton, R. K. (1990). Item response theory: Introduction and bibliography.
Psicothema, 2(1), 97-107.
Hambleton, R. K. (1990). Criterion-referenced measurement in student and curriculum evaluation. In A. Lewy (Ed.), International Encyclopedia of Curriculum. New York: Pergamon Press.
Hambleton, R. K. (1990). Criterion-referenced assessment in evaluation. In H. J.
Walberg and G. D. Haertel (Eds.), The International Encyclopedia of Educational Evaluation. New York: Pergamon Press.
Hambleton, R. K. (Ed.). (1991). Test translations for cross-cultural studies. Bulletin of
the International Test Commission, 18, 1-101.
Hambleton, R. K. (1991). Individualized criterion-referenced testing (Technical Manual). Tulsa, OK: Educational Development Corporation.
Hambleton, R. K. (1992). What skills do teachers need in educational testing? In D.
Bateson (Ed.), Classroom testing in Canada, Proceedings of the Second Invitational Conference on Classroom Testing (pp. 91-96). Vancouver, BC: University of British Columbia.
Hambleton, R. K. (1992). Measurement advances to address educational policy
questions. In T. J. Plomp, J. M. Pieters, & A. Feteris (Eds.), Book of summaries: European Conference on Educational Research (pp. 681-684). Enschede, The Netherlands: University of Twente.
Hambleton, R. K. (1992). Setting standards on national tests. International Journal of
Psychology, 27, 570. (Abstract).
Hambleton, R. K. (1992). Test translations for cross-cultural studies. In B. Wilpert, H. Motoaki, & J. Misumi (Eds.), Proceedings of the 22nd International Congress of Applied Psychology (pp. 271-275). Hillsdale, NJ: Erlbaum.
Hambleton, R. K. (1992). The uses of international data in setting achievement levels
(Final Report). Washington, DC: National Center for Educational Statistics.
Hambleton, R. K. (1992). Item response theory: Measurement for the 1990s. CLEAR Exam Review, Winter, 18-20.
Hambleton, R. K. (1992). Fitting item response models to the Series 7 Examination and
equating test scores. Amherst, MA: Psychometric and Evaluative Research Services, Inc.
Hambleton, R. K. (1993). International Test Commission: Organization, goals, and
current projects. European Journal of Psychological Assessment, 9(1), 54-56.
Hambleton, R. K. (1993). Translating achievement tests for use in cross-national studies. European Journal of Psychological Assessment, 9(1), 57-68.
Hambleton, R. K. (1993). Summary of conference on test use with children and youth.
European Review of Applied Psychology, 43, 261-262.
Hambleton, R. K. (1994). Municipal Securities Rulemaking Board guide to item writing and review. Washington, DC: MSRB. (65 pages.)
Hambleton, R. K. (1994). Rise and fall of criterion-referenced measurement?
Educational Measurement: Issues and Practice, 13(4), 21-26.
Hambleton, R. K. (1994). Item response theory: A broad psychometric framework for
measurement advances. Psicothema, 6(3), 535-556.
Hambleton, R. K. (1994). Guidelines for adapting educational and psychological tests: A progress report. European Journal of Psychological Assessment, 10(3), 229-244.
Hambleton, R. K. (1995). Meeting the measurement challenges of the 1990s and
beyond: New assessment models and methods. In T. Oakland & R. K. Hambleton (Eds.), International perspectives on academic assessment (pp. 83-104). Boston, MA: Kluwer Academic Publishers.
Hambleton, R. K. (1995). Criterion-referenced measurement. In T. Husen & T. N.
Postlethwaite (Eds.), International Encyclopedia of Education (2nd ed.; pp. 1183-1189). New York: Pergamon Press.
Hambleton, R. K. (1995). Setting standards on criterion-referenced tests. In T. Husen &
T. N. Postlethwaite (Eds.), International Encyclopedia of Education (2nd ed.; pp. 5721-5726). New York: Pergamon Press.
Hambleton, R. K. (1996). Adapting psychological tests: technical guidelines for
improving practices. International Journal of Psychology, 31(3), 439. (Abstract)
Hambleton, R. K. (1996). Advances in assessment models, methods, and practices. In
D. Berliner & R. Calfee (Eds.), Handbook of educational psychology (pp. 899-925). New York: Macmillan.
Hambleton, R. K. (1996). New models and methods for psychological tests.
Contemporary Group Care Practice Research and Evaluation, 6(1), 34-41.
Hambleton, R. K. (1996). Adapting tests for use in multiple languages and cultures. In J. Muñiz (Ed.), Psicometria (pp. 207-238). Madrid: Editorial Universitas, S.A.
Hambleton, R. K. (1997). The future of educational assessment: likely directions and
technical problems to overcome. NERA Researcher, 35(3), 6-9.
Hambleton, R. K. (1997). Measurement quality of the Kentucky Instructional Results
Information System (KIRIS), 1991-1994. In J. Millman (Ed.), Grading teachers, grading schools (pp. 210-218). Newbury Park, CA: Corwin Press.
Hambleton, R. K. (1998). Future directions in item response modeling and applications. In J. Muñiz (Ed.), Introducción a la Teoría de respuesta a los ítems. Madrid: Ediciones Pirámide, S.A.
Hambleton, R. K. (1998). Setting performance standards on achievement tests. In L. H. Hansche (Ed.), Handbook for the development of performance standards: Meeting the requirements of Title I. Washington, DC: U.S. Department of Education. Netherlands: IEA.
Hambleton, R. K. (1998). Criterion-referenced testing principles, technical advances,
and evaluation guidelines. In C. Reynolds & T. Gutkin (Eds.), Handbook of school psychology (3rd ed., pp. 409-434). New York: Wiley.
Hambleton, R. K. (1998). Enhancing the validity of NAEP achievement level score
reporting. In M. L. Bourque (Ed.), Proceedings of the Achievement Level Workshop (pp. 77-98). Washington, DC: National Assessment Governing Board.
Hambleton, R. K. (1999). Politicians fail, not the teachers. Education Connection,
Winter Issue, 19-22.
Hambleton, R. K. (2000). International Test Commission. In A. E. Kazdin (Ed.), Encyclopedia of Psychology. New York: Oxford University Press.
Hambleton, R. K. (2000). Emergence of item response modeling in instrument
development and data analysis. Medical Care, 38(9), II 60-65.
Hambleton, R. K. (Ed.). (2000). Advances in performance assessment methodology. Applied Psychological Measurement, 24(4), 291-378.
Hambleton, R. K. (2001). Growing problems in applied psychology: Limited training in
assessment. IAAP Newsletter, 13(1), 11-12.
Hambleton, R. K. (2001). Setting performance standards on educational assessments and criteria for evaluating the process. In G. Cizek (Ed.), Setting performance standards: Concepts, methods, and perspectives (pp. 89-116). Hillsdale, NJ: Lawrence Erlbaum Associates.
Hambleton, R. K. (2001). The next generation of the ITC test translation and adaptation
guidelines. European Journal of Psychological Assessment, 17(3), 164-172.
Hambleton, R. K. (2002). How will we understand and use test score information? In R. W. Lissitz & W. D. Schafer (Eds.), Assessments in Educational Reform (pp. 192-205). Boston: Allyn and Bacon.
Hambleton, R. K. (2002). New computer-based technical issues: Developing items,
pretesting, test security, and item exposure. In C. Mills et al. (Eds.), Computer-based testing: Building the foundation for future assessments (pp. 193-203). Mahwah, NJ: Lawrence Erlbaum Publishers.
Hambleton, R. K. (2002). Adapting achievement tests into multiple languages for
international assessments. In A. Porter & A. Gamoran (Eds.), Methodological advances in large-scale cross-national education surveys (pp. 58-79). Washington: National Academy of Sciences.
Hambleton, R. K. (2003). Criterion-referenced testing: Methods and procedures. In R. Fernandez-Ballesteros (Ed.), Encyclopedia of psychological assessment (pp. 280-283). London: Sage.
Hambleton, R. K. (2003). Setting passing scores on tests . . . not too high . . . not too low
. . . but just about right. Education Connection, pp. 11-14.
Hambleton, R. K. (2004). Theory, methods, and practices in testing for the 21st century. Psicothema, 16, 696-701.
Hambleton, R. K. (2005). Issues, designs, and technical guidelines for adapting tests in
multiple languages. In R. K. Hambleton, P. Merenda, & C. Spielberger (Eds.), Adapting educational and psychological tests for cross-cultural assessment (pp. 3-38). Hillsdale, NJ: Lawrence Erlbaum Associates.
Hambleton, R. K. (2005). Applications of item response theory. In J. Lipscomb, C. C.
Gotay, & C. Snyder (Eds.), Outcomes of assessment in cancer (pp. 445-464). Cambridge, UK: Cambridge University Press.
Hambleton, R. K. (2005). Foreword. In W. J. van der Linden, Models for optimal test
design (pp. i-v). New York: Springer-Verlag.
Hambleton, R. K. (2005). Biography of Frederic Lord. In B. Everitt & D. Howell
(Eds.), Encyclopedia of Statistics in Behavioral Science (pp. 1104-1106). West Sussex, UK: John Wiley & Sons.
Hambleton, R. K. (2006). Psychometric models, test designs and item types for the next
generation of educational and psychological tests. In D. Bartram & R. K. Hambleton (Eds.), Computer-based testing and the internet: Issues and advances (pp. 77-90). New York: Wiley.
Hambleton, R. K. (2006). Good practices for identifying differential item functioning.
Medical Care, 44(11), 182-188.
Hambleton, R. K. (2006, winter). An interview with Ronald Hambleton. People and
Organizations@Work, 1-2, 13.
Hambleton, R. K., Anderson, G. E., & Murray, L. (1983). Applying micro-computers to classroom testing practices. In W. Hathaway (Ed.), New directions for testing and measurement: Testing in the schools. San Francisco: Jossey-Bass.
Hambleton, R. K., & Bollwark, J. (1991). Adapting tests for use in different cultures:
Technical issues and methods. Bulletin of the International Test Commission, 18, 3-32.
Hambleton, R. K., Bollwark, J., & Traub, R. E. (1990). NCME Publication Survey
Results. Educational Measurement: Issues and Practice, 9(1), 17-18.
Hambleton, R. K., & Bourque, M. L. (1991). Initial performance standards for the 1990
Hambleton, R. K., Brennan, R. L., Brown, W., Dodd, B., Forsythe, R. A., Mehrens, W. A., Nellhaus, J., Reckase, M., Rindone, D., van der Linden, W. J., & Zwick, R. (2000). A response to “Setting Reasonable and Useful Performance Standards” in the National Academy of Sciences’ Grading the Nation’s Report Card. Educational Measurement: Issues and Practice, 19, 5-13.
Hambleton, R. K., Clauser, B. E., Mazor, K. M., & Jones, R. W. (1993). Advances in
the detection of differentially functioning test items. European Journal of Psychological Assessment, 9(1), 1-18.
Hambleton, R. K., & Cook, L. L. (1977). Latent trait models and their use in analyzing
educational test data. Journal of Educational Measurement, 14, 75-96.
Hambleton, R. K., & Cook, L. L. (1983). The robustness of item response models and effects of test length and sample size on the precision of ability estimates. In D. Weiss (Ed.), New horizons in testing (pp. 33-49). New York: Academic Press.
Hambleton, R. K., & Cook, L. L. (1984). The robustness of latent trait models. In D.
Weiss (Ed.), Proceedings of the 1979 Computerized Adaptive Testing Conference. Minneapolis, MN: University of Minnesota.
Hambleton, R. K., & de Gruijter, D. N. M. (1983). Application of item response models
to criterion-referenced test item selection. Journal of Educational Measurement, 20, 355-367.
Hambleton, R. K., & de Jong, J. (Eds.). (2003). Advances in translating and adapting
educational and psychological tests: A special issue. Language Testing, 20(2), 127-134.
Hambleton, R. K., & Dirir, M. (2003). Classical and modern item analysis. In R.
Fernandez-Ballesteros (Ed.), Encyclopedia of psychological assessment (pp. 188-192). London: Sage.
Hambleton, R. K., Dirir, M., & De Brisay, M. (1993). New measurement models and
methods for constructing language tests. Carleton Papers in Applied Language Studies, 10, 63-81.
Hambleton, R. K., & Eignor, D. R. (1977). Adaptive testing applied to hierarchically
structured objectives-based curricula. In D. Weiss (Ed.), Proceedings of the Second Computerized Adaptive Testing Conference. Minneapolis, MN: University of Minnesota.
Hambleton, R. K., & Eignor, D. R. (1978). Guidelines for evaluating criterion-
referenced tests and test manuals. Journal of Educational Measurement, 15, 321-327.
Hambleton, R. K., & Eignor, D. R. (1979). Competency test development, validation,
and standard-setting. In R. M. Jaeger & C. Tittle (Eds.), Minimum competency achievement testing. Berkeley, CA: McCutchan Publishing Co.
Hambleton, R. K., Eignor, D. R., & Rovinelli, R. (1979). Toward better achievement tests and test score interpretations in PSI courses. Journal of Personalized Instruction, 3, 180-186.
Hambleton, R. K., & Fennessy, L. (1994). Progrès techniques dans le développement
d'examens d'accréditation. Mesure et Évaluation en Éducation, 17(2), 83-106.
Hambleton, R. K., & Fennessy, L. M. (1995). Technical advances in credentialing examination development. In D. Laveault, B. D. Zumbo, M. E. Gessaroli, & M. W. Boss (Eds.), Modern theories of measurement: Problems and issues (pp. 279-303). Ottawa, Canada: University of Ottawa Press.
Hambleton, R. K., Gorth, W. P., & O'Reilly, R. P. (1973). An application of an
evaluation model for classroom instruction. Journal of Educational Systems, 2, 117-131. (In T. T. Liao & D. C. Miller [Eds.], [1978]. Systems approach to instructional design. Farmingdale, NY: Baywood Publishing Co.)
Hambleton, R. K., Gower, C., & Bollwark, J. (1988). Assessing higher order thinking
skills. Proceedings of the 29th Annual Conference of the Military Testing Association (pp. 628-633). Ottawa, Canada.
Hambleton, R. K., & Gumpert, R. (1982). Validity of Hersey-Blanchard's theory of
leader effectiveness. Group and Organizational Studies, 7, 225-242.
Hambleton, R. K., & Han, N. (2005). Assessing the fit of IRT models to educational and
psychological test data: A five step plan and several graphical displays. In W. R. Lenderking & D. Revicki (Eds.), Advances in health outcomes research methods, measurement, statistical analysis, and clinical applications (pp. 57-78). Washington: Degnon Associates.
Hambleton, R. K., Hutten, L., & Swaminathan, H. (1976). A comparison of several
methods for assessing student mastery in objectives-based instructional programs. Journal of Experimental Education, 45, 57-64.
Hambleton, R. K., Impara, J., Mehrens, W., Plake, B. S., Pitoniak, M. J., Zenisky, A. L.,
& Smith, L. F. (2000). Psychometric review of the Maryland School Performance Assessment Program (Final Report). Baltimore, MD: Abell Foundation.
Hambleton, R. K., Jaeger, R., Koretz, D., Linn, R. L., Millman, J., & Phillips, S. (1995,
June). A review of the measurement quality of the Kentucky Instructional Results Information System (Final Report). Frankfort, KY: Office of Educational Accountability.
Hambleton, R. K., Jaeger, R., Plake, B. S., & Mills, C. N. (2000). Setting performance
standards on complex educational assessments. Applied Psychological Measurement, 24(4), 355-366.
Hambleton, R. K., & Jirka, S. (2004). How to do your best on standardized tests: Some
suggestions for adult learners. Adventures in Assessment, 16, 5-12.
Hambleton, R. K., & Jirka, S. (2006). Anchor-based methods for judgmentally estimating item statistics. In S. Downing & T. Haladyna (Eds.), Handbook of test development (pp. 399-420). Mahwah, NJ: Lawrence Erlbaum Publishers.
Hambleton, R. K., & Jodoin, M. (2003). Item response theory: Models and features. In
R. Fernandez-Ballesteros (Ed.), Encyclopedia of psychological assessment (pp. 509-514). London: Sage.
Hambleton, R. K., & Jones, R. W. (1992). International impact of IRT models
on testing practices. (Abstract). International Journal of Psychology, 27, 371.
Hambleton, R. K., & Jones, R. W. (1993). Comparison of classical test theory and item
response theory and their applications to test development. Educational Measurement: Issues and Practice, 12(3), 38-47.
Hambleton, R. K., & Jones, R. W. (1994). Item parameter estimation errors and their
influence on test information functions. Applied Measurement in Education, 7(3), 171-186.
Hambleton, R. K., & Jones, R. W. (1994). Comparison of empirical and judgmental
methods for detecting differential item functioning. Educational Research Quarterly, 18(1), 21-36.
Hambleton, R. K., Jones, R. W., & Rogers, H. J. (1993). Influence of item parameter
estimation errors in test development. Journal of Educational Measurement, 30(2), 143-155.
Hambleton, R. K., & Jurgensen, C. (1990). Criterion-referenced assessment of school
achievement. In C. R. Reynolds & T. W. Kamphaus (Eds.), Handbook of psychological and educational assessment of children: Volume 1, intelligence and achievement (pp. 456-476). New York: The Guilford Press.
Hambleton, R. K., & Kanjee, A. (1995). Translating tests and attitude scales. In T.
Husen & T. N. Postlethwaite (Eds.), International Encyclopedia of Education (2nd ed.; pp. 6328-6334). New York: Pergamon Press.
Hambleton, R. K., & Kanjee, A. (1995). Increasing the validity of cross-cultural
assessments: use of improved methods for test adaptations. European Journal of Psychological Assessment, 11(3), 147-157.
Hambleton, R. K., & Li, S. (2005). Statistical audit of the ABCTE professional teaching
knowledge, elementary education, English/language arts and secondary mathematics tests. Leesburg, VA: Mid-Atlantic Psychometric Services.
Hambleton, R. K., & Li, S. (2005). Translation and adaptation issues and methods for
educational and psychological tests. In C. Frisby & C. Reynolds (Eds.), Handbook of multicultural school psychology (pp. 881-903). New York: Wiley.
Hambleton, R. K., & Li, S. (2005). Criterion-referenced testing: Purposes, technical issues and advances. In B. Everitt & D. Howell (Eds.), Encyclopedia of Statistics in Behavioral Science (pp. 435-440). West Sussex, UK: John Wiley & Sons.
Hambleton, R. K., & Ma, X. (2003). Investigation of IRT model fit and equating for the
National Board of Chiropractic Examiners (Final Report). Greeley, CO: NBCE.
Hambleton, R. K., Malaka, M., & Jones, R. W. (1994). Teachers' handbook on achievement testing. Arlington, VA: Institute for International Research.
Hambleton, R. K., & Martois, J. (1983). Evaluation of a test score prediction system
based upon item response model principles and procedures. In R. K. Hambleton (Ed.), Applications of item response theory (pp. 196-211). Vancouver, BC: Educational Research Institute of British Columbia.
Hambleton, R. K., & Meara, K. (2000). Newspaper coverage of NAEP results - 1990 to
1998. In M. L. Bourque & S. Byrd (Eds.), Student performance standards on the National Assessment of Educational Progress (pp. 133-155). Washington, DC: National Assessment Governing Board.
Hambleton, R. K., Merenda, P., & Spielberger, C. (Eds.). (2005). Adapting educational
and psychological tests for cross-cultural assessment. Mahwah, NJ: Lawrence Erlbaum.
Hambleton, R. K., Mills, C. N., & Simon, R. (1983). Determining the lengths for
criterion-referenced tests. Journal of Educational Measurement, 20, 27-38.
Hambleton, R. K., & Murphy, E. (1991). Changes in educational testing practices. The Kamehameha Journal of Education, 2(2), 17-26.
Hambleton, R. K., & Murphy, E. (1992). A psychometric perspective on authentic
measurement. Applied Measurement in Education, 5(1), 1-16.
Hambleton, R. K., & Murray, L. N. (1983). Goodness-of-fit investigations with item response models. In R. K. Hambleton (Ed.), Applications of item response theory (pp. 71-94). Vancouver, BC: Educational Research Institute of British Columbia.
Hambleton, R. K., & Murray, L. N. (1984). Testing in the United States with
microcomputers. Bulletin of the International Test Commission, 11, 17-24.
Hambleton, R. K., & Novick, M. R. (1973). Toward an integration of theory and method for criterion-referenced tests. Journal of Educational Measurement, 10, 159-170. (Also published as ACT Research Report No. 53. Iowa City, IA: American College Testing Program, 1972.)
Hambleton, R. K., & Oakland, T. (1993). International Test Commission: Goals,
activities, and membership. Psychology International, 4(2), 8-9.
Hambleton, R. K., & Oakland, T. (Eds.). (2004). Advances in assessment testing and practices. Applied Psychology: International Review, 53(2), 155-259.
Hambleton, R. K., & Patsula, L. (1996). Test adaptations: review of methods and
suggestions for additional research. International Journal of Psychology, 31(3), 84. (Abstract)
Hambleton, R. K., & Patsula, L. (1998). Adapting tests and questionnaires for use in
multiple languages and cultures. Social Indicators Research, 45, 153-171.
Hambleton, R. K., & Patsula, L. (1999). Increasing the validity of adapted tests: Myths to be avoided and guidelines for improving test adaptation practices. Journal of Applied Testing Technology, 1, 1-16.
Hambleton, R. K., Peele, H. A., Swaminathan, H., & Sawyer, J. (1973). The Jencks-saw
puzzle: Sorting out relationships among schooling, cognitive skills, and income. Meforum, 1, 23-33.
Hambleton, R. K., & Pitoniak, M. J. (2002). Testing and measurement. In J. Wixted
(Ed.), Stevens’ handbook of experimental psychology (3rd ed., pp. 517-561). New York: John Wiley and Sons.
Hambleton, R. K., & Pitoniak, M. J. (2006). Setting performance standards. In R. L.
Brennan (Ed.), Educational measurement (4th ed.). Westport, CT: American Council on Education/Praeger.
Hambleton, R. K., & Plake, B. (1995). Using an extended Angoff procedure to set
standards on complex performance assessments. Applied Measurement in Education, 8(1), 41-55.
Hambleton, R. K., & Powell, S. (1983). A framework for viewing the process of
standard-setting. Evaluation and the Health Professions, 6, 3-24.
Hambleton, R. K., Roberts, D. M., & Traub, R. E. (1970). A comparison of the reliability and validity of two methods for assessing partial knowledge of a multiple-choice test. Journal of Educational Measurement, 7, 75-82.
Hambleton, R. K., Robin, R., & Xing, D. (2000). Item response models for the analysis
of educational and psychological data. In H. E. A. Tinsley & S. Brown (Eds.), Handbook of applied multivariate statistics and mathematical modeling (pp. 553-581). New York: Academic Press.
Hambleton, R. K., & Rogers, H. J. (1986). Advances in preparing certification and
licensure examinations. Evaluation and the Health Professions, 9, 205-229.
Hambleton, R. K., & Rogers, H. J. (1989). Design of an item bias review form: Issues and questions (Final Report). Albany, NY: Department of Education. (ERIC Clearinghouse on Tests, Measurements, and Evaluation: TM012649)
Hambleton, R. K., & Rogers, H. J. (1989). Detecting biased test items: Comparison of
the IRT area and Mantel-Haenszel methods. Applied Measurement in Education, 2, 313-334.
Hambleton, R. K., & Rogers, H. J. (1989). Solving criterion-referenced measurement problems with item response models. International Journal of Educational Research, 13, 145-160.
Hambleton, R. K., & Rogers, H. J. (1989). Die anwendung von item-response-modellen
in nationalen lernerfolgsmessungen. In J. K. Ingekamp & W. H. Schreiber (Eds.), Was wissen unsere Schüler? (pp. 267-310). Weinheim: Deutscher Studien Verlag.
Hambleton, R. K., & Rogers, H. J. (1990). Using item response models in educational
assessments. In W. H. Schreiber & K. Ingekamp (Eds.), International developments in large-scale assessment (pp. 155-184). Windsor, UK: NFER-Nelson.
Hambleton, R. K., & Rogers, H. J. (1990). Approaches for identifying and
understanding bias in test items. (Abstract). In S. E. Newstead, S. H. Irvine, & P. D. Dann (Eds.), Cognition and motivation: Lectures and seminars. Dordrecht, The Netherlands: Kluwer Academic Publishers.
Hambleton, R. K., & Rogers, H. J. (1991). Evaluation of the plot method for identifying
potentially biased test items. In P. L. Dann, S. H. Irvine, & J. M. Collis (Eds.), Computer-based human assessment (pp. 307-330). Boston, MA: Kluwer Academic Publishers.
Hambleton, R. K., & Rogers, H. J. (1991). Advances in criterion-referenced
measurement. In R. K. Hambleton & J. Zaal (Eds.), Advances in educational and psychological testing: Theory and applications (pp. 3-41). Boston: Kluwer Academic Publishers.
Hambleton, R. K., & Rogers, H. J. (1995). Item bias review (EDO-TM-95-9).
Washington, DC: ERIC.
Hambleton, R. K., & Rogers, H. J. (2002). A differential item functioning analysis of
the National Health Survey (Laboratory of Psychometric and Evaluative Research Report No. 418). Amherst, MA: University of Massachusetts, School of Education.
Hambleton, R. K., & Rovinelli, R. (1975). Toward better college grading practices: A
framework for research and development. In D. W. Allen, M. A. Melnick, & C. C. Peelle (Eds.), Reform, renewal, and reward: Improving university teaching. Amherst, MA: Clinic to Improve University Teaching, University of Massachusetts.
Hambleton, R. K., & Rovinelli, R. (1986). Assessing the dimensionality of a set of test
items. Applied Psychological Measurement, 10, 287-302.
Hambleton, R. K., Rovinelli, R., & Gorth, W. P. (1971). Efficiency of various
item-examinee sampling designs for estimating test parameters. Proceedings of the 79th Annual Convention of the American Psychological Association, 5, 121-122. (Summary)
Hambleton, R. K., Rovinelli, R., Sheehan, D., & Newby, J. (1975). A comparative study of middle school students in different instructional programs. JSAS Catalog of Selected Documents in Psychology, 5, 199-200. (130 pages)
Hambleton, R. K., & Scarpati, S. (2002). Reform of vocational education and new
testing practices in the United States. Indian Journal of Vocational Education, 4, 1-10.
Hambleton, R. K., & Sheehan, D. S. (1971). On the evaluation of higher-order science
objectives. Science Education, 61, 307-315.
Hambleton, R. K., & Simon, R. (1980). National Assessment of Educational Progress
social studies and citizenship exercises and their usefulness for improving instruction. In P. L. Williams & J. R. Moore (Eds.), Criterion-referenced testing for the social studies (Bulletin 64). Washington, DC: National Council for the Social Studies.
Hambleton, R. K., & Sireci, S. G. (1997). Future directions for norm-referenced and
criterion-referenced achievement testing. International Journal of Educational Research, 21, 379-393.
Hambleton, R. K., Sireci, S. G., & Robin, F. (1999). Adapting credentialing exams for
use in multiple languages. CLEAR Exam Review, 10(1), 24-28.
Hambleton, R. K., & Slater, S. C. (1994). NAEP state reports in mathematics: Valuable
information for policy-makers. New England Journal of Public Policy, 10(1), 209-222.
Hambleton, R. K., & Slater, S. C. (1995, October). Are NAEP executive summary
reports understandable to policy-makers and educators? Los Angeles, CA: CRESST, UCLA.
Hambleton, R. K., & Slater, S. C. (1997). Item response theory models and testing
practices: Current international status and future directions. European Journal of Psychological Assessment, 13(1), 21-28.
Hambleton, R. K., & Slater, S. C. (1997). Reliability of credentialing examinations and
the impact of scoring models and standard-setting policies. Applied Measurement in Education, 10(1), 19-38.
Hambleton, R. K., Slater, S. C., Narayanan, P., & Setiadi, H. (1996). Automated test
construction: concepts, technical advances, and applications. In J. Muñiz (Ed.), Psicometria (pp. 705-728). Madrid: Editorial Universitas, S. A.
Hambleton, R. K., & Stetz, F. P. (1979). The development of objectives-based
instructional programs in career education. Journal of Career Education, 5, 220-225.
Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: Principles and
Hambleton, R. K., & Swaminathan, H. (1985). A look at psychometrics in the Netherlands. Dutch Journal of Psychology, 40, 446-451.
Hambleton, R. K., Swaminathan, H., & Algina, J. (1976). Some contributions to the
theory and practice of criterion-referenced testing. In D. N. M. de Gruijter & L. J. Th. van der Kamp (Eds.), Advances in psychological and educational measurement (pp. 51-62). New York: Wiley.
Hambleton, R. K., Swaminathan, H., Algina, J., & Coulson, D. (1978). Criterion-
referenced testing and measurement: A review of technical issues and developments. Review of Educational Research, 48, 1-47.
Hambleton, R. K., et al. (1976). Evaluation of student progress and school environment
in the Anisa early childhood educational program. Research Relating to Children Bulletin 36 (Abstract). Urbana-Champaign, IL: Educational Resources Information Center/Early Childhood Education, University of Illinois.
Hambleton, R. K., Swaminathan, H., & Cook, L. L. (1981). Program evaluation
methods and techniques for day care and early childhood program personnel. In D. Streets (Ed.), Administrative handbook for day care and preschool administration. Boston: Allyn and Bacon, Inc.
Hambleton, R. K., Swaminathan, H., Cook, L. L., Eignor, D., & Gifford, J. A. (1978).
Developments in latent trait theory: A review of models, technical issues, and applications. Review of Educational Research, 48, 467-510.
Hambleton, R. K., Swaminathan, H., Gifford, J. A., & Mills, C. (1981). Individualized
criterion-referenced testing technical manual. Tulsa, OK: Educational Development Corporation.
Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item
response theory. Newbury Park, CA: Sage Publications, Inc.
Hambleton, R. K., & Traub, R. E. (1971). Information curves and efficiency of three
logistic test models. British Journal of Mathematical and Statistical Psychology, 24, 273-281. (Summary published in the Proceedings of the 78th Annual Convention of the American Psychological Association, 1970, 4, 121-122.)
Hambleton, R. K., & Traub, R. E. (1973). Analysis of empirical data using two logistic
latent trait models. British Journal of Mathematical and Statistical Psychology, 26, 195-211.
Hambleton, R. K., & Traub, R. E. (1974). The effects of item order on test performance
and stress. Journal of Experimental Education, 43, 40-46.
Hambleton, R. K., & van der Linden, W. (Eds.). (1982). Technical contributions to item
Hambleton, R. K., & Wedman, I. (Eds.). (1997). Advances in assessment practices
[special issue]. European Journal of Psychological Assessment, 13(1), 1-58.
Hambleton, R. K., & Xing, D. (2006). Optimal and nonoptimal computer-based test
designs for making pass-fail decisions. Applied Measurement in Education, 19(3), 221-239.
Hambleton, R. K., Yu, J., & Slater, S. C. (1999). Field test of the ITC guidelines for
adapting educational and psychological tests. European Journal of Psychological Assessment, 15(3), 270-276.
Hambleton, R. K., & Zaal, J. (Eds.). (1991). Advances in educational and psychological
testing: Theory and applications. Boston, MA: Kluwer Academic Publishers.
Hambleton, R. K., Zaal, J., & Pieters, J. P. M. (1991). Computerized adaptive testing:
Theory, applications, and standards. In R. K. Hambleton & J. Zaal (Eds.), Advances in educational and psychological testing: Theory and applications (pp. 341-366). Boston: Kluwer Academic Publishers.
Hambleton, R. K., & Zenisky, A. (2003). Issues and practices of performance
assessment. In C. R. Reynolds & T. W. Kamphaus (Eds.), Handbook of psychological and educational assessment of children (2nd ed., pp. 377-404). New York: The Guilford Press.
Hambleton, R. K., & Zhao, Y. (2005). Item response theory models for the analysis of
dichotomously scored data. In B. Everitt & D. Howell (Eds.), Encyclopedia of Statistics in Behavioral Science (pp. 982-990). West Sussex, UK: John Wiley & Sons.
Hersey, P., Blanchard, K. H., & Hambleton, R. K. (1978). Contracting for leadership
style: A process and instrumentation for building effective work relationships. In W. W. Burke (Ed.), The cutting edge: Current theory and practice in organization development. La Jolla, CA: University Associates.
Jodoin, M., Zenisky, A., & Hambleton, R. K. (2006). Comparison of the psychometric
properties of several computer-based test designs for credentialing exams with multiple purposes. Applied Measurement in Education, 19(3), 203-220.
Jones, R. W., & Hambleton, R. K. (1992). Recent advances in psychometric methods.
Revista Portuguesa de Educacao, 5(2), 1-13.
Linn, R. L., Drasgow, F., Camara, W., Crocker, L., Hambleton, R. K., Plake, B. S., Stout,
W., & van der Linden, W. J. (2002). Computer-based testing: A research agenda. In C. N. Mills, M. T. Potenza, J. J. Fremer, & W. C. Ward (Eds.), Computer-based testing: Building the foundation for future assessments (pp. 289-300). Mahwah, NJ: Lawrence Erlbaum Publishers.
Linn, R. L., & Hambleton, R. K. (1991). Customized tests and customized test norms.
Applied Measurement in Education, 4(3), 185-207.
Lu, Y., & Hambleton, R. K. (2004). Statistics for detecting disclosed items in a CAT
environment. Metodología de las Ciencias del Comportamiento, 5(2), 225-242.
Madaus, G., Airasian, P., & Hambleton, R. K. (1982). Development and application of criteria for screening commercial standardized tests. Educational Evaluation and Policy Analysis, 4, 401-415.
Mazor, K., Clauser, B., & Hambleton, R. K. (1992). The effect of sample size on the
functioning of the Mantel-Haenszel statistic. Educational and Psychological Measurement, 52, 443-451.
Mazor, K., Clauser, B., & Hambleton, R. K. (1994). Identification of non-uniform
differential item functioning using a variation of the Mantel-Haenszel procedure. Educational and Psychological Measurement, 54(2), 284-291.
Mazor, K., Hambleton, R. K., & Clauser, B. (1998). Effects of conditioning on two
Oakland, T., Poortinga, Y., Schlegel, J., & Hambleton, R. K. (2001). International Test Commission: Its history, current status, and future directions. International Journal of Testing, 1(1), 3-32.
Olsen, L. K., Hambleton, R. K., & others. (1985). Development and application of the
student test used in the School Health Education Evaluation. Journal of School Health, 55, 309-315.
Phillips, G. W., Mullis, I. V. S., Bourque, M. L., Williams, P. L., Hambleton, R. K.,
Owen, E. H., & Barton, P. E. (1993). Interpreting NAEP scales. Washington, DC: National Center for Education Statistics.
Pitoniak, M. J., Hambleton, R. K., & Biskin, B. H. (2003). Setting standards on tests
containing computerized performance tasks (Center for Educational Assessment Research Report No. 488). Amherst, MA: University of Massachusetts, School of Education.
Plake, B. S., & Hambleton, R. K. (2000). A standard-setting method designed for
Plake, B. S., & Hambleton, R. K. (2001). The analytic judgment method for setting
standards on complex performance assessments. In G. Cizek (Ed.), Setting performance standards: Concepts, methods, and perspectives. Hillsdale, NJ: Lawrence Erlbaum Associates.
Plake, B. S., Hambleton, R. K., & Jaeger, R. M. (1997). A new standard-setting method
for performance assessments: The dominant profile judgment method and some field-test results. Educational and Psychological Measurement, 57(3), 400-411.
Popham, W. J., & Hambleton, R. K. (1990). Can you pass the test on testing? Principal,
38-39.
Ranney, P., & Hambleton, R. K. (2006). It’s time to consider a new test model in
clinical licensure programs. Journal of the American Dental Association, 137, 30-42.
Robin, F., Sireci, S. G., & Hambleton, R. K. (2003). Evaluating the equivalence of
different language versions of a credentialing exam. International Journal of Testing, 3(1), 1-20.
Robin, R., Xing, D., & Hambleton, R. K. (1999). Review of the software package,
Rasch Scaling Program (R.S.P.). Applied Psychological Measurement, 23(1), 90-94.
Rogers, H. J., & Hambleton, R. K. (1989). Evaluating computer-simulated baseline
statistics for interpreting item bias statistics. Educational and Psychological Measurement, 49, 355-369.
Rovinelli, R., & Hambleton, R. K. (1977). On the use of content specialists in the assessments of criterion-referenced test item validity. Dutch Journal of Educational Research, 2, 49-60.
Royer, M., Hambleton, R. K., & Cadorette, L. (1978). Individual differences in
memory: Theory, data and educational implications. Contemporary Educational Psychology, 3, 182-203.
Royer, J. M., Lynch, D. J., Hambleton, R. K., & Bulgareli, C. (1984). Using the
sentence verification technique to assess the comprehension of technical text. American Educational Research Journal, 21, 839-870.
Sheehan, D. S., & Hambleton, R. K. (1977). A predictive study of success in an
individualized science program. Journal of School Science and Mathematics, 77, 13-20.
Sheehan, D. S., & Hambleton, R. K. (1977). Adapting instruction to student differences
in an individualized science program. Journal of Research in Science Teaching, 14, 27-32.
Sireci, S. G., Hambleton, R. K., Huff, K. L., & Jodoin, M. G. (2000). Setting standards
on licensure exams using direct consensus (Laboratory of Psychometric and Evaluative Research Report No. 395). Amherst, MA: University of Massachusetts, School of Education.
Sireci, S. G., Hambleton, R. K., & Pitoniak, M. J. (2004). Setting passing scores on
licensure exams using direct consensus. CLEAR Exam Review, 15, 21-25.
Sireci, S. G., Patsula, L., & Hambleton, R. K. (2005). Statistical methods for identifying
flawed items in the test adaptation process. In R. K. Hambleton, P. Merenda, & C. Spielberger (Eds.), Adapting educational and psychological tests for cross-cultural assessment (pp. 93-115). Hillsdale, NJ: Lawrence Erlbaum Associates.
Skorupski, W., & Hambleton, R. K. (2005). What are panelists thinking when they
participate in standard-setting studies? Applied Measurement in Education, 18(3), 233-255.
Smith, I. L., & Hambleton, R. K. (1991). Content validity studies of licensing
examinations. Educational Measurement: Issues and Practice, 9, 7-10.
Smith, I. L., Hambleton, R. K., & Rosen, G. A. (1988). Content validity studies of the
Examination for Professional Practice in Psychology. Professional Practice of Psychology, 9(1), 43-80.
Spineti, J., & Hambleton, R. K. (1977). A computer simulation study of tailored testing
strategies for objectives-based instructional programs. Educational and Psychological Measurement, 37, 139-158.
Stufflebeam, D. L., & Hambleton, R. K. (1988). Improving personnel evaluations
through professional standards. Bulletin of the International Test Commission, 15, 3-24.
Stufflebeam, D. L., Hambleton, R. K., & others. (1989). Professional standards for educational evaluation systems. Beverly Hills, CA: Sage Publications.
Swaminathan, H., Hambleton, R. K., & Algina, J. (1974). Reliability of criterion-
referenced tests: A decision-theoretic formulation. Journal of Educational Measurement, 11, 263-267.
Swaminathan, H., Hambleton, R. K., & Algina, J. (1975). A Bayesian decision-theoretic
procedure for use with criterion-referenced tests. Journal of Educational Measurement, 12, 87-98.
Swaminathan, H., Hambleton, R. K., Sireci, S., Xing, D., & Rizavi, S. (2003). Small
sample estimation in dichotomous item response models: Effects of priors based on judgmental information on the accuracy of item parameter estimates. Applied Psychological Measurement, 27, 27-51.
Traub, R. E., & Hambleton, R. K. (1972). The effect of scoring instructions and degree
of speededness on the validity and reliability of multiple-choice tests. Educational and Psychological Measurement, 32, 737-758.
Traub, R. E., & Hambleton, R. K. (1972). The effect of instruction on the cognitive
structure of statistical and psychometric concepts. Canadian Journal of Behavioral Science, 6, 30-44.
Traub, R. E., Hambleton, R. K., & Singh, B. (1969). Effects of promised reward and
threatened penalty on performance of a multiple-choice vocabulary test. Educational and Psychological Measurement, 29, 847-861.
van der Linden, W. J., & Hambleton, R. K. (1997). Item response theory: brief history,
common models, and extensions. In W. J. van der Linden & R. K. Hambleton (Eds.), Handbook of modern item response theory (pp. 1-28). New York: Springer-Verlag.
van der Linden, W. J., & Hambleton, R. K. (Eds.). (1997). Handbook of modern item
response theory. New York: Springer-Verlag Publishers.
van de Vijver, F., & Hambleton, R. K. (1996). Translating tests: some practical
guidelines. European Psychologist, 1, 89-99.
Wainer, H., Hambleton, R. K., & Meara, K. (1999). Alternative displays for
communicating NAEP results: A redesign and validity study. Journal of Educational Measurement, 36(4), 301-335.
Watts, J., Brown, W., Hambleton, R. K., & Mora, L. (2001). West Virginia
accountability study (Final Report). Atlanta, GA: Southern Regional Education Board.
Welsh, W., & Hambleton, R. K. (1976). On the use of goals in evaluation: A review of
Whelan, G. P., Boulet, J. R., McKinley, D. W., Norcini, J. J., van Zanten, M., Hambleton, R. K., Burdick, W. P., & Peitzman, M. D. (2005). Scoring standardized patient examinations: Lessons learned from the development and administration of the ECFMG Clinical Skills Assessment. Medical Teacher, 27, 200-206.
Xing, D., & Hambleton, R. K. (2004). Impact of test design, item quality, and item bank
size on the psychometric properties of computer-based credentialing examinations. Educational and Psychological Measurement, 64(1), 5-21.
Yu, J., & Hambleton, R. K. (1996). Field test of the ITC guidelines for adapting
psychological tests. International Journal of Psychology, 31(3), 439. (Abstract)
Zenisky, A. L., & Hambleton, R. K. (2003). Formats for assessments. In
R. Fernandez-Ballesteros (Ed.), Encyclopedia of psychological assessment (pp. 420-424). London: Sage.
Zenisky, A. L., Hambleton, R. K., & Robin, F. (2003). Detection of differential item
functioning in large-scale assessments: A study of evaluating a two-stage approach. Educational and Psychological Measurement, 63, 51-64.
Zenisky, A. L., Hambleton, R. K., & Robin, F. (2004). DIF detection and interpretation
Zenisky, A. L., Hambleton, R. K., & Sireci, S. G. (2002). Identification and evaluation
of local item dependencies in the Medical College Admissions Test. Journal of Educational Measurement, 39(4), 291-309.
(c) Reviews
Clauser, B., & Hambleton, R. K. (1994). A review of Holland and Wainer's Differential Item Functioning. Journal of Educational Measurement, 31(1), 88-92.
Eignor, D. E., & Hambleton, R. K. (1977). A review of H. W. Collins, J. H. Johansen, &
J. A. Johnson's Educational Measurement and Evaluation. Educational and Psychological Measurement, 37, 273-276.
Eignor, D. E., & Hambleton, R. K. (1979). A review of Gronlund's Constructing
Achievement Tests. Educational and Psychological Measurement, 39, 246-249.
Fitzpatrick, A., & Hambleton, R. K. (1979). A review of Thorndike and Hagen's
Measurement and Evaluation in Psychology and Education. Educational and Psychological Measurement, 39, 249-251.
Hambleton, R. K. (1972). A review of the new forms S and T of the Bennett Mechanical Comprehension Test. Journal of Educational Measurement, 1971, 8, 55-56. Reprinted in Buros, O. (Ed.), The Seventh Mental Measurements Yearbook. Highland Park, NJ: Gryphon Press, pp. 1486-1487.
Hambleton, R. K. (1978). A review of the CGP Self-Scoring Placement Tests in English
and Mathematics. In O. Buros (Ed.), The Eighth Mental Measurements Yearbook. Highland Park, NJ: Gryphon Press.
Hambleton, R. K. (1978). A review of the Everyday Skills Tests. In O. Buros (Ed.), The
Eighth Mental Measurements Yearbook. Highland Park, NJ: Gryphon Press.
Hambleton, R. K. (1985). A review of the Differential Aptitude Test. In J. Mitchell
(Ed.), The Ninth Mental Measurements Yearbook (pp. 504-505). Lincoln, NE: Buros Institute.
Hambleton, R. K. (1985). A review of the Steenburgen Diagnostic-Prescriptive
Program. In J. Mitchell (Ed.), The Ninth Mental Measurements Yearbook (pp. 1477-1478). Lincoln, NE: Buros Institute.
Hambleton, R. K. (1992). A review of Hudson Education Skills Inventory. In J. C.
Conoley & J. J. Kramer (Eds.), The Eleventh Mental Measurements Yearbook (pp. 390-392). Lincoln, NE: Buros Institute of Mental Measurements, University of Nebraska.
Hambleton, R. K. (1992). A review of Survey of Problem-Solving and Educational
Skills. In J. C. Conoley & J. J. Kramer (Eds.), The Eleventh Mental Measurements Yearbook (pp. 908-910). Lincoln, NE: Buros Institute of Mental Measurements, University of Nebraska.
Hambleton, R. K. (1995). A review of The Seventh Edition of the Metropolitan
Achievement Tests. In J. C. Conoley & J. Impara (Eds.), The Twelfth Mental Measurements Yearbook (pp. 606-610). Lincoln, NE: The Buros Institute.
Hambleton, R. K. (2003). Tribute to Ross E. Traub. Alberta Journal of Educational
Research, 49(3), 208-210.
Hambleton, R. K. (2005). Review of the Iowa Tests of Basic Skills, Forms K, L, M. In
D. J. Keyser & R. C. Sweetland (Eds.), Test critiques (volume 11) (pp. 138-150). Kansas City: Test Corporation of America.
Hambleton, R. K. (2005). A review of the Academic Competence Evaluation Scales. In
R. A. Spies, & B. S. Plake (Eds.), The 16th Mental Measurements Yearbook (pp. 1-4). Lincoln, NE: Buros Institute of Mental Measurements, University of Nebraska.
Hambleton, R. K. (2005). A review of the Wechsler Memory Tests. In R. A. Spies, &
B. S. Plake (Eds.), The 16th Mental Measurements Yearbook (pp. 1097-1099). Lincoln, NE: Buros Institute of Mental Measurements, University of Nebraska.
Hambleton, R. K. (2006). National Council on Measurement in Education. In N. Salkind (Ed.), Encyclopedia of Measurement and Statistics. Newbury Park, CA: Sage.
Hambleton, R. K., & Carter, W. (1977). A review of D. P. Warwick & C. A. Lininger's,
The Sample Survey: Theory and Practice. Educational and Psychological Measurement, 37, 568-569.
Hambleton, R. K., & Cook, L. L. (1977). A review of D. G. Lewis' Assessment in
Education. Educational and Psychological Measurement, 37, 559-560.
Hambleton, R. K., & Kaplan-deVries, D. (1985). A review of the Basic Achievement
Skills Individual Screener (BASIS). Journal of Counseling and Development, 63, 383-384.
Hambleton, R. K., & Murray, L. (1983). A review of Thorndike's Applied
Psychometrics. Applied Psychological Measurement, 7, 243-245.
Hambleton, R. K., & Narayanan, P. (1992). Review of RASCAL. Rasch Measurement,
6(3), 236.
Hambleton, R. K., & Powers, T. (1973). A review of G. H. Bracht, K. D. Hopkins, and
J. C. Stanley's Perspectives in Educational and Psychological Measurement. Educational and Psychological Measurement, 33, 512-513.
Hambleton, R. K., & Rovinelli, R. (1972). A review of W. Clemans' Educational Uses
of the Computer: An Introduction. Educational and Psychological Measurement, 32, 526-529.
Hambleton, R. K., & Swaminathan, H. (1981). A review of Lord's Applications of Item
Response Theory to Practical Testing Problems. Journal of Educational Measurement, 18, 178-180.
Jones, R. W., & Hambleton, R. K. (1992). A review of Osterlind's Constructing Test
Items. Journal of Educational Measurement, 29, 195-197.
Sheehan, D. S., & Hambleton, R. K. (1975). A review of D. M. Shoemaker's Principles
and Procedures of Multiple Matrix Sampling. Educational and Psychological Measurement, 35, 1059-1061.
Swaminathan, H., & Hambleton, R. K. (1972). A review of Van der Geer's Introduction
to Multivariate Analysis for the Social Sciences. Educational and Psychological Measurement, 32, 1152-1156.
(d) Technical Reports (Reports Published in Books or Journals Are Not Included)
Algina, J., Bourque, M. L., Hambleton, R. K., & Larrivee, B. An evaluative study of selected outcomes of the Hampton Maine Anisa Program (1973-1974) (Final Report). Hampden, ME: Hampden School Department. (130 pages)
Arrasmith, D., & Hambleton, R. K. (1987). Steps for setting standards with the Angoff method (Final Report). New York: Professional Examination Service.
Avis, N. E., Smith, K. W., Mayer, K. H., Swislow, L., & Hambleton, R. K. (1997). The
multidimensional quality of life questionnaire for persons with HIV/AIDS: development and evaluation (Final Report). Watertown, MA: New England Research Institute.
Bourque, M. L., Goodman, G., Hambleton, R. K., & Han, N. (2004). Reliability
estimates for the ABTE tests in elementary education, professional teaching knowledge, secondary mathematics and English/language arts (Final Report). Leesburg, VA: Mid-Atlantic Psychometric Services.
Clauser, B., Mazor, K., & Hambleton, R. K. (1991). Examination of various influences
on the Mantel-Haenszel statistic (Laboratory of Psychometric and Evaluative Research Report No. 210). Amherst, MA: School of Education, University of Massachusetts.
Cook, L. L., Eignor, D., Fitzpatrick, A., Gifford, J. A., Hambleton, R. K., Swaminathan,
H., & Wroble, L. An evaluative study of the Social Literacy Project, 1977. (120 pages)
Coulson, D., & Hambleton, R. K. (1974). Some validation methods for domain-
referenced tests (Laboratory of Psychometric and Evaluative Research Report No. 7). Amherst, MA: School of Education, University of Massachusetts.
Eignor, D. R., & Hambleton, R. K. (1979). Effects of test length and advancement score
on several criterion-referenced test reliability and validity indices (Laboratory of Psychometric and Evaluative Research Report No. 86). Amherst, MA: School of Education, University of Massachusetts.
Eignor, D. R., Hambleton, R. K., & Blanchard, K. (1976). Improving leadership
effectiveness: Situational leadership theory, instrumentation, and applications (Laboratory of Psychometric and Evaluative Research Report No. 41). Amherst, MA: School of Education, University of Massachusetts.
Ertel, K., Hambleton, R. K., & Schiff, R. (1973). Career education potential and
alternatives in the Southern Berkshire Region: A study of schools with limited resources (Final Report). Boston: Massachusetts Commission for Occupational Education. (158 pages)
Fitzpatrick, A. R., & Hambleton, R. K. (1983). Similarity between the skills covered by
the Louisiana Basic Skills Tests and the skills covered by commonly used standardized achievement tests (Grades 2, 3, 4) (Final Report). Amherst, MA: Psychometric and Evaluative Research Services, Inc.
Friedman, M., van Zanten, M., White, D., Hambleton, R. K., & Whelan, G. P. A survey
of clinical skills of foreign medical graduates in their first year of residency (Research Report). Philadelphia, PA: Educational Commission for Foreign Medical Graduates.
Gifford, J. A., Cook, L. L., & Hambleton, R. K. (1976). Alternative schools: Rationale, descriptions, and problems of evaluation (Laboratory of Psychometric and Evaluative Research Report No. 32). Amherst, MA: School of Education, University of Massachusetts.
Gimpel, J. R., Boulet, J. R., Weidner, Al., Dowling, D. J., Hambleton, R. K., Kerns, L.,
Solomon, M., & LaMarra, D. (2005). Standard setting summary report: COMLEX-USA Level 2-PE (Final Report). Philadelphia: National Board of Osteopathic Medical Examiners.
Hambleton, R. K. (1970). Evaluation and research model for METEP (Final Report).
Washington: Office of Education.
Hambleton, R. K. (1971). A report on the research and evaluation activities in the
Jamesville-Dewitt individualized instruction program in ninth grade science (Final Report). Albany, NY: Bureau of School and Cultural Research, New York State Education Department (122 pages).
Hambleton, R. K. (1972). An evaluative study of the Educational Project to Implement
Conservation (Final Report). Westfield, MA: Westfield Public Schools. (80 pages)
Hambleton, R. K. (1974). A comment on Crehan's techniques for validating criterion-
referenced testing (Laboratory of Psychometric and Evaluative Research Report No. 14). Amherst, MA: School of Education, University of Massachusetts.
Hambleton, R. K. (1976). An assessment of School of Education grading practices and
preferences (Laboratory of Psychometric and Evaluative Research Report No. 21). Amherst, MA: School of Education, University of Massachusetts.
Hambleton, R. K. (1977). What classroom teachers need to know about criterion-
referenced testing (Laboratory of Psychometric and Evaluative Research Report No. 50). Amherst, MA: School of Education, University of Massachusetts.
Hambleton, R. K. (1977). Contributions to criterion-referenced test theory: On the uses
of item characteristic curves and related concepts (Laboratory of Psychometric and Evaluative Research Report No. 51). Amherst, MA: School of Education, University of Massachusetts.
Hambleton, R. K. (1977). Worcester Title I reading program evaluation (1976-1977)
(Final Report). Providence, RI: International Educational Associates.
Hambleton, R. K. (1978). An evaluative study of Project Support (1977-1978) (Final
Report). Billerica, MA: Billerica School Department. (75 pages)
Hambleton, R. K. (1978). Assessment of second level manager competence (Final
Report). Basking Ridge, NJ: American Telephone and Telegraph. (62 pages)
Hambleton, R. K. (1979). A field study of the validity of Hersey-Blanchard's model of
Hambleton, R. K. (1984). Standard-setting: State of the art, future prospectus (Laboratory of Psychometric and Evaluative Research Report No. 142). Amherst, MA: School of Education, University of Massachusetts.
Hambleton, R. K. (1985). Validity investigation for the certification examination of the
National Association of Purchasing Management (Final Report). Amherst, MA: Psychometric and Evaluative Research Services, Inc. (93 pages)
Hambleton, R. K. (1991). Follow-up evaluation study of the 1989 to 1991 workshops of
the Consortium for the Improvement of Math and Science Teaching (Final Report). North Adams, MA: North Adams State College.
Hambleton, R. K. (1995). Setting achievement levels on the NAEP mathematics
assessment: Response to technical criticisms (Laboratory of Psychometric and Evaluative Research Report No. 250). Amherst, MA: University of Massachusetts, School of Education.
Hambleton, R. K. (2004). 2002-2003 MCAS research and validity studies (Final
Report). Amherst, MA: University of Massachusetts, Center for Educational Assessment.
Hambleton, R. K. (2004). Review of the translation/adaptation process for the Child
Assessment Battery for the Head Start National Reporting System (Final Report). Washington: Government Accounting Office.
Hambleton, R. K., & Berberoglu, G. (1997, May). TIMSS instruments adaptation
process: A formative evaluation (Final Report). Amsterdam, The Netherlands.
Hambleton, R. K., & Bourque, M. L. (1975). An evaluation of the Providence Title I
Mathematics Remediation Laboratory Program (Final Report). Providence, RI: Providence School Department.
Hambleton, R. K., & Eignor, D. (1978). Comments on selected questions raised in
connection with the home environment study (Final Report). Princeton, NJ: Mathematica Policy Research.
Hambleton, R. K., & Eignor, D. (1979). Comments on the Alaska instructional
diagnostic system (Final Report). Portland, OR: Northwest Regional Educational Laboratory.
Hambleton, R. K., & Eignor, D. (1979). A practitioner's guidebook to criterion-
referenced test development, validation, and test score usage (Laboratory of Psychometric and Evaluative Research Report No. 70). Amherst, MA: School of Education, University of Massachusetts. (2nd ed.)
Hambleton, R. K., & Gifford, J. A. (1977). An evaluative study of the CIP Screening
Device and related instruments in Project CHILD FIND (Final Report). Providence, RI: Providence School Department.
Hambleton, R. K., & Gorth, W. P. (1971). Criterion-referenced testing: Issues and applications (Center for Educational Research Technical Report No. 13). Amherst, MA: School of Education, University of Massachusetts. (ERIC: ED 060 025)
Hambleton, R. K., Gower, C., Bollwark, J., Mazor, K., & Donovan, C. (1989).
Evaluation of the 1988-1989 Worcester Chapter 636 Magnet School Program (Final Report). Amherst, MA: School of Education, University of Massachusetts. (215 pages)
Hambleton, R. K., Jones, R. W., & Cadman, S. (1993). Innovations in testing and
evaluation of student competencies in technical and vocational education (Final Report). Paris: UNESCO.
Hambleton, R. K., & Meara, K. (2000). Newspaper coverage of NAEP results - 1990 to
1998 (Laboratory of Psychometric and Evaluative Research Report No. 366). Amherst, MA: University of Massachusetts, School of Education.
Hambleton, R. K., & Murray, J. (1977). A comparative study of faculty and student
attitudes toward a variety of college grading purposes and practices (Laboratory of Psychometric and Evaluative Research Report No. 48). Amherst, MA: University of Massachusetts, School of Education.
Hambleton, R. K., Murray, L., & Anderson, J. (1983). Uses of item statistics in item
evaluation and test development (Laboratory of Psychometric and Evaluative Research Report No. 131). Amherst, MA: University of Massachusetts, School of Education.
Hambleton, R. K., Murray, L., & Williams, P. (1983). Fitting item response models to
the Maryland Functional Reading Test results (Laboratory of Psychometric and Evaluative Research Report No. 139). Amherst, MA: University of Massachusetts, School of Education.
Hambleton, R. K., & Olszewski, F. (1972). Woodworking objective and test item bank
(Final ESCOE Report). Boston, MA: Massachusetts Department of Education.
Hambleton, R. K., & Pauker, R. (1976). Coordination and delivery of in-service
education in Massachusetts project: Year one evaluation report (Final Report). Boston, MA: Department of Education.
Hambleton, R. K., & Pauker, R. (1976). An evaluation plan for the project to coordinate
and deliver in-service education in Massachusetts (Final Report). Boston, MA: Department of Education.
Hambleton, R. K., & Rovinelli, R. (1971). Efficiency of various item-examinee
sampling designs for estimating test parameters (Center for Educational Research Technical Report No. 12). Amherst, MA: School of Education, University of Massachusetts.
Hambleton, R. K., Sireci, S. G., Swaminathan, H., Xing, D., & Rizavi, S. (2003, October). Anchor-based methods for judgmentally estimating item difficulty parameters (Law School Admission Council Computerized Testing Report 98-05). Newtown, PA: LSAC.
Hambleton, R. K., & Smith, I. L. (1988). Content validity and fairness review of the
1987 forms of the Examination for Professional Practice of Psychology (Final Report). Washington, DC: American Association of State Psychology Boards, Inc. (132 pages)
Hambleton, R. K., & Smith, T. (1999). An evaluation of the general/public 1996 NAEP
Science Reports (Laboratory of Psychometric and Evaluative Research Report No. 361). Amherst, MA: University of Massachusetts, School of Education.
Hambleton, R. K., Stetz, F. P., & Newby, J. F. (1973). An assessment of selected
components of the Baltimore Model Cities Project (Final Report). Baltimore, MD: Baltimore Model Cities Staff. (88 pages)
Hambleton, R. K., Swaminathan, H., Arrasmith, D., Gower, C., & Rogers, H. J. (1986).
Proposed steps for constructing and validating Air Force Specialty Diagnostic Achievement Tests (Laboratory of Psychometric and Evaluative Research Report No. 164). Amherst, MA: School of Education, University of Massachusetts.
Hambleton, R. K., Swaminathan, H., Arrasmith, D., Gower, C., Rogers, H. J., & Zhou, A.
(1986). Development of an integrated system to assess and enhance basic job skills: Research plan, personnel measurement subsystem (Laboratory of Psychometric and Evaluative Research Report No. 163). Amherst, MA: School of Education, University of Massachusetts.
Hambleton, R. K., Swaminathan, H., Bollwark, J., Gower, C., Reshetar, R., Rogers, H. J.,
& Zhou, A. (1986). Program to assist school districts in collecting and using achievement test data (Final Report). Holyoke and Lowell, MA: Holyoke and Lowell Public School Systems. (39 pages)
Hambleton, R. K., Swaminathan, H., & Eignor, D. (1976). An evaluative study of the
leadership development and team building laboratory for administrative personnel of the Baltimore City Public School System (Final Report). Baltimore, MD: Baltimore Public Schools.
Hambleton, R. K., et al. (1976). An evaluative study of the third year of the Anisa
program in the Hampden, Maine School System (Final Report). Hampden, ME: Hampden School Department.
Hambleton, R. K., & Zhao, Y. (2004). Alignment of MCAS grade 10 English Language
Arts and Mathematics Assessments with the curricula frameworks and the test specifications (Center for Educational Assessment Research Report No. 538). Amherst, MA: University of Massachusetts, Center for Educational Assessment.
MacCormack, J., Miller, C., Hambleton, R. K., & Eignor, D. (1976). Goal setting ability in young children: Theory, instrumentation, and measurement (Laboratory of Psychometric and Evaluative Research Report No. 25). Amherst, MA: School of Education, University of Massachusetts.
Madaus, G., Airasian, P., & Hambleton, R. K. (1979). Development and application of
criteria for screening commercial standardized tests (Final Report). Boston, MA: Massachusetts Department of Education.
Malaka, M., & Hambleton, R. K. (1991). Formative evaluation of the first two criterion-
referenced testing workshops for Swaziland teachers (Final Report). Amherst, MA: School of Education, University of Massachusetts. (37 pages)
Mazor, K., Miller, T., & Hambleton, R. K. (1992). Predicting the academic success of
minority students (Laboratory of Psychometric and Evaluative Research Report No. 248). Amherst, MA: University of Massachusetts, School of Education.
Meara, K., Hambleton, R. K., & Sireci, S. G. (2000). A survey of standard-setting
practices in the credentialing/licensing field (Laboratory of Psychometric and Evaluative Research Report No. 387). Amherst, MA: University of Massachusetts, School of Education.
Mills, C. N., & Hambleton, R. K. (1980). Guidelines for reporting criterion-referenced
test score information (Laboratory of Psychometric and Evaluative Research Report No. 100). Amherst, MA: School of Education, University of Massachusetts.
Mills, C. N., Hambleton, R. K., Biskin, B., Kobrin, J., Evans, J., & Pfeffer, M. (2000). A
comparison of the standard-setting methods for the Uniform CPA Examination (Technical Report). Jersey City, NJ: American Institute of Certified Public Accountants.
Newby, J., Hambleton, R. K., Rovinelli, R., & Sheehan, D. (1972). A comparative study
of creative behavior of middle school students in different instructional programs (Supplemental Report No. 1). Concord, MA: Concord School Department.
Olsen, J., Hambleton, R. K., & Reckase, M. D. (1998). Tekcheck psychometric review
(Final Report). Orem, UT: Alpine Media.
O'Reilly, R. P., & Hambleton, R. K. (1971). A CMI model for an individualized
learning program in ninth grade science (Center for Educational Research Technical Report No. 14). Amherst, MA: School of Education, University of Massachusetts.
Patsula, L., & Hambleton, R. K. (1999). A comparative study of ability estimates
obtained from computer-adaptive and multi-stage testing (Laboratory of Psychometric and Evaluative Research Report No. 348). Amherst, MA: University of Massachusetts, School of Education.
Pauker, R., & Hambleton, R. K. (1976). Matching students and teachers to maximize learning: What do students think? (Laboratory of Psychometric and Evaluative Research Report No. 46). Amherst, MA: School of Education, University of Massachusetts.
Rollins, L., & Hambleton, R. K. (1997). Job analysis study of municipal securities sales
representatives, public finance professionals, and traders and underwriters (Final Report). Washington, DC: Municipal Securities Rulemaking Board.
Rollins, L., & Hambleton, R. K. (2000). Job analysis study for the Series 53
Examination (Final Report). Washington, DC: Municipal Securities Rulemaking Board.
Roman, J., & Hambleton, R. K. (1979). Screening tests for primary school children
(Laboratory of Psychometric and Evaluative Research Report No. 101). Amherst, MA: School of Education, University of Massachusetts.
Rovinelli, R., & Hambleton, R. K. (1973). Some procedures for the validation of
criterion-referenced test items (Final Report). Albany, NY: Bureau of School and Cultural Research, New York State Education Department. (96 pages)
Setiadi, H., & Hambleton, R. K. (1996, June). Item banks to improve assessment
practices (Final Report). Jakarta: Indonesian Department of Education.
Setiadi, H., & Hambleton, R. K. (1996, June). Item selection using IRT models (Final
Report). Jakarta: Indonesian Department of Education.
Sheehan, D. S., & Hambleton, R. K. (1972). An evaluative study of the Jamesville-DeWitt individualized science program (1971-1972) (Final Report). Albany, NY: Bureau of School and Cultural Research, New York State Education Department. (191 pages)
Sheehan, D. S., & Hambleton, R. K. (1972). An evaluative study of the Jamesville-
DeWitt individualized science program (1971-1972) (Supplemental Report No. 1). Albany, NY: Bureau of School and Cultural Research, New York State Education Department. (228 pages)
Sheehan, D. S., & Hambleton, R. K. (1976). A review of selected factors affecting
questionnaire and interview results (Laboratory of Psychometric and Evaluative Research Report No. 29). Amherst, MA: School of Education, University of Massachusetts.
Stetz, F. P., & Hambleton, R. K. (1973). An assessment of the Berkshire Hills Schools
readiness program (Final Report). Pittsfield, MA: Berkshire Hills School System.
Swaminathan, H., Hambleton, R. K., & Pauker, R. (1976). An evaluative study of
Project Self (Final Report). Rocky Hill, CT: Rocky Hill Board of Education.
Traub, R. E., Gundlack, L., Wolfe, C., Hambleton, R. K., & Winslow, I. (1968). Technical Report for the Canadian Scholastic Aptitude Test Pretest: May-June 1968. Toronto: Ontario Institute for Studies in Education.
Traub, R. E., Tuppen, C. J., & Hambleton, R. K. (1966). Validity and reliability of the
Dominion Group Tests of Learning Capacity (Test Development Papers). Toronto: Ontario Institute for Studies in Education.
Xing, D., & Hambleton, R. K. (1998). Documentation for running Bilog 3.11 in
Windows 95 (Laboratory of Psychometric and Evaluative Research Report No. 342). Amherst, MA: University of Massachusetts, School of Education.
Zenisky, A. L., Hambleton, R. K., & Sireci, S. G. (2000). Effects of local item
dependencies on the validity of IRT item, test, and ability statistics (Laboratory of Psychometric and Evaluative Research Report No. 363). Amherst, MA: University of Massachusetts, School of Education.
(e) Published Tests
Blanchard, K. H., Hambleton, R. K., Zigarmi, D., & Forsyth, D. (1981). Leader Behavior Analysis, Self and Other (Form A). Escondido, CA: Blanchard Training and Development.
Hambleton, R. K. (1974). Diagnostic tests of selected reading skills. Providence, RI:
International Educational Associates.
Hambleton, R. K. (1975). Reading skills inventory: A criterion-referenced assessment (three editions). Materials produced included:
(1) Reading skills inventory description and technical manual.
(2) Indicators of prereading skills test. (Two forms)
(3) Indicators of word-attack skills test. (Two forms)
(4) Indicators of dictionary skills test. (Two forms)
(5) Indicators of reading comprehension test. (Nine levels, two forms)
Providence, RI: International Educational Associates.
Hambleton, R. K. (1983). Blueprint for Learning. A comprehensive K-12 criterion-referenced reading and mathematics testing system. Tulsa, OK: Educational Development Corporation.
Hambleton, R. K., Blanchard, K. H., & Hersey, P. (1977). Professional Maturity Scale.
La Jolla, CA: University Associates.
Hersey, P., Blanchard, K. H., & Hambleton, R. K. (1980). Leadership Scale. La Jolla, CA: University Associates.
PAPERS PRESENTED AT PROFESSIONAL MEETINGS:
Allalouf, A., Bastari, Sireci, S., & Hambleton, R. K. (1997, October). Comparing the
dimensionality of a test administered in two languages. Paper presented at the meeting of NERA, Ellenville, NY.
Allalouf, A., Hambleton, R. K., & Sireci, S. (1998, April). Detecting the causes of differential
item functioning in translated verbal items. Paper presented at the meeting of NCME, San Diego.
Avis, N. E., Smith, K. W., Hambleton, R. K., Feldman, H. A., Selwyn, A., & Jacobs, A. (1994,
October). Development of the multidimensional index of life quality: A quality of life measure for cardiovascular disease. Paper presented at the Drug Information Association Second Symposium on Contributed Papers in Quality of Life Evaluation, Charleston, SC.
Baldwin, P., Keller, L. A., & Hambleton, R. K. (2004, April). Using auxiliary information for
small sample estimation with the Medical College Admission Test. Paper presented at the meeting of NCME, San Diego.
Berberoglu, G., & Hambleton, R. K. (2004, July). Translating tests across languages for
different uses: Issues, problems, and possible solutions. Paper presented at the JURE Conference, Istanbul.
Berberoglu, G., Sireci, S. G., & Hambleton, R. K. (1997, March). Comparing translated items
using bilingual and monolingual items. Paper presented at the meeting of NCME, Chicago.
Berberoglu, G., & Hambleton, R. K. (2005, July). Test translation for intra-cultural and cross-
cultural purposes: Issues, problems, techniques, and solutions. Paper presented at the 9th European Congress of Psychology, Granada, Spain.
Berberoglu, G., Sireci, S. G., & Hambleton, R. K. (1997, July). A comparison of the graded
response model and the Mantel-Haenszel method for detecting DIF across different language groups. Paper presented at the Fifth European Congress of Psychology, Dublin, Ireland.
Bollwark, J., & Hambleton, R. K. (1990, May). Using the Mantel-Haenszel method in item bias
studies. Paper presented at the meeting of the New England Educational Research Organization, Rockport, Maine.
Boulet, J., Friedman, M., Hambleton, R. K., Burdick, R., & Ziv, A. (1996, June). Assessing the
adequacy of the post-encounter written scores in simulated patient exams. Paper presented at the 7th Ottawa Medical Testing Conference, Maastricht, The Netherlands.
Boulet, J., Hambleton, R. K., Burdick, W. B., & Friedman, M. (1998, September). The use of
case performance data to improve the technical quality of standardized patient examinations. Paper presented at the meeting of the Association of Medical Educators in Europe, Prague.
Boulet, J., Hambleton, R. K., Friedman, M., & Whelan, G. (1998, April). A comprehensive holistic approach for setting standards on performance assessments. Paper presented at the meeting of NCME, San Diego.
Boulet, J., McKinley, D., Hambleton, R. K., & Whelan, G. P. (1999, September). Quality
control measures to monitor the accuracy and consistency of scores from standardized patient assessments. Paper presented at the meeting of the AMEE, Linkoping, Sweden.
Boulet, J. R., McKinley, D., Whelan, G. P., van Zanten, M., & Hambleton, R. K. (2002,
November). Clinical skills deficiencies among first-year residents. Paper presented at the annual meeting of the Association of American Medical Colleges, San Francisco.
Boulet, J. R., McKinley, D. W., Whelan, G., & Hambleton, R. K. (2002, April). The effect of
task exposure on repeat candidate scores in a high-stakes performance assessment. Paper presented at the meeting of AERA, New Orleans.
Clauser, B., Mazor, K., & Hambleton, R. K. (1990, April). The influence of test homogeneity on
item bias results using the Mantel-Haenszel procedure. Paper presented at the meeting of AERA, Boston.
Clauser, B., Mazor, K., & Hambleton, R. K. (1991, April). Examination of various influences on
the Mantel-Haenszel statistic. Paper presented at the meeting of AERA, Chicago.
Clauser, B., Mazor, K., & Hambleton, R. K. (1992, April). Effects of score group width on DIF
with the MH procedure. Paper presented at the meeting of AERA, San Francisco.
Cook, L. L., & Hambleton, R. K. (1978, April). Application of latent trait theory to the
development of norm-referenced and criterion-referenced tests. Paper presented at the meeting of NCME, Toronto.
Cook, L. L., & Hambleton, R. K. (1979, April). Effects of test length and sample size on the
estimates of precision of latent ability scores. Paper presented at the meeting of AERA, San Francisco.
Cook, L. L., & Hambleton, R. K. (1979, April). A comparative study of item selection methods
utilizing latent trait theoretic models and concepts. Paper presented at the meeting of AERA, San Francisco.
Coulson, D., & Hambleton, R. K. (1974, August). On the validation of criterion-referenced tests
designed to measure individual mastery. Paper presented at the meeting of APA, New Orleans.
Eignor, D. R., & Hambleton, R. K. (1974, April). Effects of test length and advancement score
on several criterion-referenced test reliability and validity indices. Paper presented at the meeting of AERA, San Francisco.
Elosua, P., Hambleton, R. K., & Zenisky, A. (2006, July). Improving the methodology for
detecting biased test items. Paper presented at the 5th ITC Conference on Adapting Tests, Brussels.
Fernández-Ballesteros, R., Hambleton, R. K., & O’Neil, T. (2001, July). The European Survey on Aging Protocol (ESAP): Translation and adaptation to seven European countries. Paper presented at the International Congress of Gerontology, Vancouver, BC.
Friedman, M., Boulet, J., Burdick, B., Ziv, A., Hambleton, R. K., & Gary, N. (1997, October).
Who should score the post-encounter patient progress note? Paper presented at the annual meeting of the Association of American Medical Colleges, Washington, DC.
Friedman, M., Hambleton, R. K., Boulet, J., Ziv, A., Peitzman, S., Burdick, W. B., & Whelan, G.
(1998, September). The learning curve in implementing standard-setting procedures in the health profession. Paper presented at the meeting of the Association of Medical Educators in Europe, Prague.
Gifford, J. A., & Hambleton, R. K. (1979, October). Construction and use of criterion-
referenced tests in program evaluation studies. Paper presented at the meeting of NERA, Ellenville, New York.
Gifford, J. A., & Hambleton, R. K. (1980, April). Construction and use of criterion-referenced
tests in program evaluation studies. Paper presented at the meeting of AERA, Boston.
Goodman, D., & Hambleton, R. K. (2003, April). Reporting student results on state assessments:
Current practice, problems, and possibilities. Invited paper presented at the meeting of NCME, Chicago.
Hambleton, R. K. (1968, April). The effects of item order and anxiety on test performance and
stress. Paper presented at the meeting of AERA, Chicago.
Hambleton, R. K. (1969, May). The role of computers in education. An invited address at the
meeting of the Ontario Vocational Educational Association, London, Ontario.
Hambleton, R. K. (1972, March). Applications of Bayesian statistical methods to individually
prescribed instruction programs. Paper presented at the meeting of NCME, Chicago.
Hambleton, R. K. (1973, April). A decision-theoretic approach to criterion-referenced testing
and measurement. Paper presented at the meeting of AERA, New Orleans.
Hambleton, R. K. (1973, April). A review of several testing models for individualized
instruction. Paper presented at the meeting of AERA, New Orleans.
Hambleton, R. K. (1973, October). Objectives-based instruction, testing, and measurement.
Paper presented at the meeting of NERA, Ellenville, New York.
Hambleton, R. K. (1974, August). Recent developments in criterion-referenced assessment.
Paper presented at the meeting of APA, New Orleans.
Hambleton, R. K. (1974, August). Criterion-referenced testing: A review of recent
developments. Invited paper presented at the meeting of NERA, Ellenville, New York.
Hambleton, R. K. (1974). College grading practices: A review of the issues. Paper presented at
the First International Conference on Improving University Teaching, University of Massachusetts at Amherst.
Hambleton, R. K. (1975, April). Toward a theory and practice of criterion-referenced testing. Paper presented at an invited symposium at the meeting of AERA, Washington.
Hambleton, R. K. (1976, October). A survey of evaluative methods and program results of the
three-year Anisa field project. Paper presented at the meeting of NERA, Ellenville, New York.
Hambleton, R. K. (1977, April). Contributions to criterion-referenced test theory: On the uses of
item characteristic curves and related concepts. Paper presented at the meeting of AERA, New York.
Hambleton, R. K. (1977, May). Guidelines for more effective objectives-based reading
programs. Paper presented at the meeting of the International Reading Association, Miami Beach.
Hambleton, R. K. (1977, June). The validity of criterion-referenced tests. Paper presented at the
Third International Symposium on Educational Testing, University of Leyden, The Netherlands.
Hambleton, R. K. (1978, April). Standards for educational and psychological tests. Paper
presented at the meeting of AERA, Toronto.
Hambleton, R. K. (1978, May). Constructing criterion-referenced reading tests: What are the
steps? Paper presented at the International Reading Association, Houston.
Hambleton, R. K. (1978, October). Validation of criterion-referenced test score interpretations
and standard setting methods. Invited paper presented at the First Annual Johns Hopkins University National Symposium on Educational Research, Washington.
Hambleton, R. K. (1979, March). Advances in testing technology. Presentation at the Learning
Tomorrow for Today's Generations Conference at the University of Massachusetts at Amherst.
Hambleton, R. K. (1979, April). Testing assumptions and determining the goodness of fit of
latent trait models. Paper presented at the meeting of AERA, San Francisco.
Hambleton, R. K. (1979, April). Applications of latent trait theory to the development and use of
criterion-referenced tests. Paper presented at the meeting of AERA, San Francisco.
Hambleton, R. K. (1979, May). Setting standards on criterion-referenced reading tests: What are
the steps? Paper presented at the meeting of the International Reading Association, Atlanta.
Hambleton, R. K. (1979, June). Competency testing: Setting educational performance standards
for the individual. Invited paper presented at the 9th Annual Conference on Large-Scale Assessment, Denver.
Hambleton, R. K. (1979, June). Determining the validity of competency tests. Invited paper
presented at the 9th Annual Conference on Large-Scale Assessment, Denver.
Hambleton, R. K. (1979, October). Will the real competency test please stand up? Keynote address at the meeting of NERA, Ellenville, New York.
Hambleton, R. K. (1980, April). Review methods for criterion-referenced test items. Paper
presented at the meeting of AERA, Boston.
Hambleton, R. K. (1980, May). Guidelines for selecting criterion-referenced tests. Invited paper
at the meeting of the International Reading Association, St. Louis.
Hambleton, R. K. (1980, June). Ability estimation with three logistic test models. Paper
presented at the Fourth International Symposium of Educational Testing, Antwerp, Belgium.
Hambleton, R. K. (1980, June). Putting the Rasch model into perspective: Its advantages and
disadvantages for district and state assessment applications. Invited paper presented at the 10th Annual Conference on Large-Scale Assessment, Denver.
Hambleton, R. K. (1981, April). Latent ability scales, interpretations, and uses. Paper presented
at the meeting of AERA, Los Angeles.
Hambleton, R. K. (1981, April). Advances in criterion-referenced measurement in reading.
Invited presentation at the meeting of the International Reading Association, New Orleans.
Hambleton, R. K. (1981, June). Goodness of fit studies for latent trait models. Invited paper presented at the 11th Annual Conference on Large-Scale Assessment, Boulder, Colorado.
Hambleton, R. K. (1981, December). Measures of goodness of fit for item response models.
Invited paper presented at the meeting of the Netherlands Psychometric Society, Amsterdam.
Hambleton, R. K. (1982, March). Recent advances in competency test development, standard-
setting, and validity assessment. Invited presentation at the Fourth Annual Northern New England Educational Tests, Measurement, and Evaluation Conference, Plymouth, New Hampshire.
Hambleton, R. K. (1982, June). The utilization of item response models with NAEP
mathematics exercises. Invited presentation at the 12th Annual Large-Scale Assessment Conference, Boulder, Colorado.
Hambleton, R. K. (1982, August). Some pitfalls in applying item response models. Paper
presented at the meeting of APA, Washington, DC.
Hambleton, R. K. (1983, April). Standard-setting: State of the art, future prospectus. Paper
presented at the meeting of AERA, Montreal.
Hambleton, R. K. (1983, June). Applications of item response theory. Invited presentation at the
meeting of the Canadian Society for the Study of Education, Vancouver.
Hambleton, R. K. (1984, April). Promising solutions to several problems that arise in applying
IRT. Paper presented at the meeting of AERA, New Orleans.
Hambleton, R. K. (1984, July). Applications of item response theory. Invited paper presented at the 23rd International Congress of Psychology, Acapulco.
Hambleton, R. K. (1984, December). New technical advances in measurement for certification
and licensure exams. Invited address at the NCHCA National Conference on Continuing Competence Assurance, Miami Beach.
Hambleton, R. K. (1985, April). A competency test program evaluation from a
psychometrician's viewpoint. Paper presented at the meeting of AERA, Chicago.
Hambleton, R. K. (1986, March). Objectives-based testing. Invited presentation at the Orlando
Conference, Lake Buena Vista, Florida.
Hambleton, R. K. (1987, May). Uses of computers in school testing programs. Invited
presentation at the Conference on Measurement and Evaluation, Los Angeles.
Hambleton, R. K. (1987, May). Future of item response theory. Invited presentation at the
Conference on Measurement and Evaluation, Los Angeles.
Hambleton, R. K. (1988, August). Some pitfalls in current educational testing practices. Invited
paper presented at the 24th International Congress of Psychology, Sydney, Australia.
Hambleton, R. K. (1989, June). Educational testing practices: Trends, problems, and future
directions. President's invited address at the meeting of the Canadian Educational Research Association, Quebec City.
Hambleton, R. K. (1989, October). Item response models in physical education. Keynote
address at the Sixth Measurement and Evaluation Symposium, University of Wisconsin, Madison.
Hambleton, R. K. (1990, April). Future directions for educational assessment. President's
address presented at the meeting of NCME, Boston.
Hambleton, R. K. (1990, June). What do teachers need to know about testing? Invited
presentation at a national conference on classroom testing practices, Victoria, BC.
Hambleton, R. K. (1990, November). Future directions for educational assessment. Keynote
address at the meeting of the Florida Educational Research Association, Deerfield Beach, FL.
Hambleton, R. K. (1991, August). Meeting the measurement challenges of the 1990s: New
psychometric models, methods, and tests. Invited address presented at the meeting of APA, San Francisco.
Hambleton, R. K. (1991, September). Advances in item bias research. Invited presentation at the First European Congress on Psychological Assessment, Barcelona, Spain.
Hambleton, R. K. (1991, November). Setting standards and choosing testing methods for
national and international assessments. Invited presentation at the Assessing Learning and Educational Achievement Conference, Johnson Foundation Conference Center, Racine, Wisconsin.
Hambleton, R. K. (1992, April). Item response theory: A broad psychometric framework for measurement advances. Invited presentation at the meeting of NCME, San Francisco.
Hambleton, R. K. (1992, April). The case for item response theory. Invited presentation at the
meeting of AERA, San Francisco.
Hambleton, R. K. (1992, April). Uses of international data in setting American educational
standards. Invited presentation at a joint meeting of NCES/NAGB, Washington, DC.
Hambleton, R. K. (1992, June). Measurement advances to address educational policy questions.
Keynote address at the European Conference of Educational Research, Enschede, The Netherlands.
Hambleton, R. K. (1992, June). Translating tests and establishing test score equivalence. Invited
paper at the meeting of the Canadian Educational Research Association, Charlottetown, Prince Edward Island.
Hambleton, R. K. (1992, July). Setting standards on national tests. Paper presented at the 25th
International Congress of Psychology, Brussels, Belgium.
Hambleton, R. K. (1993, April). Rise and fall of criterion-referenced measurement? Invited
paper presented at the meetings of AERA and NCME, Atlanta.
Hambleton, R. K. (1993, June). New measurement models, methods, and tests for the 1990s and
beyond. Paper presented at the meeting of CERA, Ottawa.
Hambleton, R. K. (1993, August). Guidelines for translating tests. Presentation at the meeting
of APA, Toronto.
Hambleton, R. K. (1994, February). Methodological issues arising in cross-national comparative
studies. Invited paper presented at the American Association for the Advancement of Science, San Francisco.
Hambleton, R. K. (1994, April). Setting performance standards: Essential research studies.
Paper presented at the meeting of NCME, New Orleans.
Hambleton, R. K. (1994, April). Scales, scores, and reporting forms to enhance the utility of
educational testing. Invited paper presented at the meeting of NCME, New Orleans.
Hambleton, R. K. (1994, April). International perspectives on assessment: International Test
Commission. Paper presented at the meeting of NCME, New Orleans.
Hambleton, R. K. (1994, June). Setting performance standards: New methods and essential
research studies. Invited presentation at the Medical Council of Canada's "Post Ottawa Conference," Toronto.
Hambleton, R. K. (1994, July). Developing guidelines for adapting instruments. Invited paper
presented at the 23rd Congress of Applied Psychology, Madrid.
Hambleton, R. K. (1994, November). Standard-setting methods for performance assessments in clinical problem-solving. Invited presentation at the meeting of the Research in Medical Education Conference, Boston.
Hambleton, R. K. (1994, December). Translating tests: Issues and methods. Invited presentation
at the NCES Limited English Proficiency Conference, Washington.
Hambleton, R. K. (1995, January). Standard-setting in state assessments: current status and
future research directions. Invited presentation at the CCSSO-SCASS meeting, New Orleans.
Hambleton, R. K. (1995, May). New directions for college admissions testing and research in
the United States. Invited presentation at the Third International SweSAT Conference, Umea, Sweden.
Hambleton, R. K. (1995, June). Psychological testing in the 21st century. Keynote address at
the Congress on Psychometrics, Pretoria, South Africa.
Hambleton, R. K. (1995, June). The detection of item bias: methods, research findings, and
applications. Invited presentation at the Congress on Psychometrics, Pretoria, South Africa.
Hambleton, R. K. (1995, June). Adapting tests for use in multiple languages and cultures: issues,
methods, and guidelines. Invited presentation at the Congress on Psychometrics, Pretoria, South Africa.
Hambleton, R. K. (1995, July). Guidelines for adapting psychological tests for use in multiple
languages and cultures. Paper presented at the Fourth European Congress of Psychology, Athens.
Hambleton, R. K. (1995, August). Setting standards on performance assessments: technical
issues and promising methods. Paper presented at the meeting of APA, New York.
Hambleton, R. K. (1995, August). Psychological assessment advances for the 21st century: New
psychometric models, methods, and technology. Keynote address presented at the Third European Congress of Psychological Assessment, Trier, Germany.
Hambleton, R. K. (1995, October). Translating psychological tests and medical examinations:
Main issues, methods, and technical guidelines. Invited paper presented at the Medical Selection Conference, Fribourg, Switzerland.
Hambleton, R. K. (1995, December). Assessing student progress in Massachusetts: Radical
changes for the 21st century. Invited presentation at the Academy for Legislators: An Educational Forum, University of Massachusetts Amherst.
Hambleton, R. K. (1996, February). Reactions to "Domain scores: A new concept in reporting
NAEP results". Presentation at the NAGB Work Group on Planning Meeting, Washington, DC.
Hambleton, R. K. (1996, February). Producing comparable scores on non-equivalent
examinations. Presentation at a meeting of the NASBA Users' Panel, Orlando, FL.
Hambleton, R. K. (1996, April). Guidelines for adapting educational and psychological tests. Paper presented at the meeting of NCME, New York.
Hambleton, R. K. (1996, April). Assessing medical competence: some promising solutions.
Keynote address presented at the annual meeting of the Northeast Group on Educational Affairs in Medicine, Philadelphia.
Hambleton, R. K. (1996, May). Reporting of state assessment results: issues, methods, and
essential research. Presentation at the CCSSO State Collaborative on Assessment and Student Standards Meeting, St. Louis.
Hambleton, R. K. (1996, May). Setting standards on performance assessments: progress report.
Presentation at the CCSSO State Collaborative on Assessment and Student Standards Meeting, St. Louis.
Hambleton, R. K. (1996, June). Innovations in large scale assessment: psychometric lessons
learned from Kentucky. Paper presented at the National Conference on Large Scale Assessment, Phoenix, Arizona.
Hambleton, R. K. (1996, August). Adapting psychological tests: technical guidelines for
improving practices. Paper presented at the 26th International Congress of Psychology, Montreal.
Hambleton, R. K. (1996, August). Development of guidelines for adapting psychological and
educational tests for use in multiple languages and cultures. Invited paper presented at the 13th Congress of the International Association for Cross-Cultural Psychology, Montreal, Canada.
Hambleton, R. K. (1996, August). Application of the Joint Committee's Program Evaluation
Standards to education. Paper presented at the meeting of APA, Toronto.
Hambleton, R. K. (1996, October). The future of educational assessment: Likely directions and
technical problems to overcome. Keynote address presented at the annual meeting of NERA, Ellenville, NY.
Hambleton, R. K. (1996, December). Setting performance standards on achievement tests in
Title I. Presentation at the meeting of SCASS, Washington.
Hambleton, R. K. (1997, March). Issues and methods in setting standards on performance
assessments. Invited presentation at the meeting of the Northeast Group on Educational Affairs, Washington, DC.
Hambleton, R. K. (1997, March). NAEP redesign: technical committee report and some
personal observations. Invited paper presented at the meeting of AERA, Chicago.
Hambleton, R. K. (1997, March). Some notes on item response theory. Invited graduate student
seminar at the AERA meeting, Chicago.
Hambleton, R. K. (1997, May). Judgmental estimates of item difficulty. Presentation at the
Annual Swedish Scholastic Aptitude Conference, Umea, Sweden.
Hambleton, R. K. (1997, July). Issues, methods, and guidelines for adapting tests from one language and culture to another. Paper presented at the Fifth European Congress of Psychology, Dublin, Ireland.
Hambleton, R. K. (1997, July). Establishing cross-cultural validity: a discussion. Paper
presented at the Fifth European Congress of Psychology, Dublin.
Hambleton, R. K. (1997, July). Future directions in educational assessment. Invited presentation
at the Scientific Council of the National Institute for Testing and Evaluation, Jerusalem.
Hambleton, R. K. (1997, August). Increasing the validity of NAEP scores and score reporting
with achievement levels. Invited paper presented at the NAEP Achievement Levels Workshop, Boulder, Colorado.
Hambleton, R. K. (1997, August). Changing measurement models and methods for the 21st
century. Invited Division 5 Presidential Address at the meeting of the American Psychological Association, Chicago.
Hambleton, R. K. (1997, October). Promising GMAT item formats for the 21st century. Invited
presentation at the international workshop on the GMAT, Paris, France.
Hambleton, R. K. (1997, December). Setting performance standards on national and state
educational assessments. Invited presentation at the Title I-CCSSO Conference, Washington.
Hambleton, R. K. (1998, April). Setting standards on multi-format assessments: a review of
methods and a program of research. Paper presented at the meetings of AERA and NCME, San Diego.
Hambleton, R. K. (1998, May). Computer-based testing: The promises and the problems to
overcome. Paper presented at the 26th annual meeting of the Canadian Society for the Study of Education.
Hambleton, R. K. (1998, June). Setting standards on complex performance assessments. Paper
presented at the Large-Scale Assessment Conference, Colorado Springs, CO.
Hambleton, R. K. (1998, August). Translation and adaptation of psychological tests: Issues,
research designs, statistical approaches, and practical steps. Invited paper presented at the 24th International Congress of Applied Psychology, San Francisco.
Hambleton, R. K. (1998, September). Translating and adapting credentialing exams into
multiple languages: Issues, steps, and guidelines. Invited paper at the 18th annual meeting of CLEAR, Denver.
Hambleton, R. K. (1998, October). Advances in standard-setting methodology. Invited
presentation at the Measurement and Evaluation: Current and Future Research Directions Conference, Banff, Alberta, Canada.
Hambleton, R. K. (1998, October). Educational assessment for the 21st century. Keynote
address at the 3rd National Forum on Educational Evaluation, Veracruz, Mexico.
Hambleton, R. K. (1998, December). Are the Massachusetts teacher tests valid? Invited presentation at Westfield State College, Westfield, MA.
Hambleton, R. K. (1999, April). Guidelines for adapting and translating educational and
psychological tests. Invited paper presented at the meeting of NCME, Montreal.
Hambleton, R. K. (1999, April). Performance assessment: A synthesis of current research and
future directions. Invited paper presented at the meeting of NCME, Montreal.
Hambleton, R. K. (1999, May). Issues, designs and technical guidelines for adapting tests in
multiple languages and cultures. Invited address at the International Conference on Adapting Tests for Use in Multiple Languages and Cultures. Washington, DC.
Hambleton, R. K. (1999, June). Setting standards on complex performance assessments. Invited
paper presented at the 19th annual National Conference on Large-Scale Assessment, Snowbird, Utah.
Hambleton, R. K. (1999, July). Issues, designs, and guidelines for adapting tests. Invited
address at the Joint European Conference of the IACCP and the ITC, Graz, Austria.
Hambleton, R. K. (1999, July). Advances in test adaptation methodology. Invited presenter in a
symposium at the Joint European Conference of the IACCP and the ITC, Graz, Austria.
Hambleton, R. K. (1999, August). Advances in testing methods. Invited presentation at the
Sweden Department of Education, Stockholm.
Hambleton, R. K. (1999, September). Advances in item response modeling of educational and
psychological test data. Invited presentation at the 6th Congress of Social Science Methodology, University of Oviedo, Oviedo, Spain.
Hambleton, R. K. (1999, September). Computer-based testing: Ten promises, ten problems to
overcome. Keynote address at the 6th Congress of Social Science Methodology, University of Oviedo, Oviedo, Spain.
Hambleton, R. K. (1999, October). Evaluative criteria and methods for setting performance
standards. Invited presentation at the Edward F. Reidy, Jr., First Interactive Lecture Series. Dover, NH: The National Center for the Improvement of Educational Assessment.
Hambleton, R. K. (2000, February). Computer-enhanced assessment: Great promise and
problems to overcome. Keynote address at the American Test Publishers Conference, Carmel, CA.
Hambleton, R. K. (2000, April). Test and scoring models for the new generation of assessments.
Invited paper presented at the meeting of NCME, New Orleans.
Hambleton, R. K. (2000, April). Evaluation of NAEP standard-setting: Let’s see both sides.
Paper presented at the meeting of NCME, New Orleans, LA.
Hambleton, R. K. (2000, April). Enhancing the validity of the test adaptation process: Improving
the judgmental process. Paper presented at the meeting of NCME, New Orleans, LA.
Hambleton, R. K. (2000, April). Setting standards on complex performance assessments: A summary of an NSF-CCSSO-NCME project. Paper presented at the meeting of NCME, New Orleans, 2000.
Hambleton, R. K. (2000, April). Advances in standard-setting methods. Paper presented at the
NCME meeting, New Orleans, LA.
Hambleton, R. K. (2000, June). Improving the ways we report test scores to policy-makers and
the public. Invited presentation at the University of Maryland Invitational Conference on Measurement, College Park, MD.
Hambleton, R. K. (2000, June). Possible methods for setting performance standards on NAEP.
Invited presentation to the National Assessment Governing Board Achievement Levels Committee, Snowbird, Utah.
Hambleton, R. K. (2000, June). A look at NAEP score reporting: Progress, the press, and
Popham’s proposals. Invited presentation to the National Assessment Governing Board Achievement Levels Committee, Snowbird, Utah.
Hambleton, R. K. (2000, July). Computer-based exams: Current issues, advances, and essential
research. Invited paper presented at the 27th International Congress of Psychology, Stockholm.
Hambleton, R. K. (2000, September). Translation of NAEP achievement levels to the Voluntary
National Tests. Invited paper presented to a meeting of AIR and NAGB, Washington.
Hambleton, R. K. (2000, November). New advances in assessment practices. Keynote address
presented at the meeting of the Association for Educational Assessment, Prague.
Hambleton, R. K. (2001, April). What we know about standards-based score reporting. Paper
presented at the meeting of AERA, Seattle.
Hambleton, R. K. (2001, July). New approaches for improving the ways test scores are reported.
Invited paper presented at the 7th European Congress of Psychology, London.
Hambleton, R. K. (2001, December). Future directions for adult education assessment.
Presentation at the National Academies Board on Testing and Assessment Meeting on Performance Assessments for Adult Education, Washington, DC.
Hambleton, R. K. (2002, February). A new challenge: Making results from large-scale
assessments understandable and useful. Invited presentation at the Provincial Testing in Canadian Schools: Research, Policy, and Practice Conference, Victoria, British Columbia.
Hambleton, R. K. (2002, February). Adapting credentialing exams for use in multiple languages.
Invited presentation at ATP’s Conference on Computer-Based Testing, Carlsbad, CA.
Hambleton, R. K. (2002, February). A non-technical introduction to item response theory for
credentialing exams: Models, applications, and issues. Invited presentation at ATP’s Conference on Computer-Based Testing, Carlsbad, CA.
Hambleton, R. K. (2002, April). Test designs for the next generation of large-scale assessments. Invited presentation at the NCME meeting, New Orleans.
Hambleton, R. K. (2002, April). Misconceptions about the technical aspects of large scale state
assessments. Keynote address at the meeting of the New England Educational Research Organization, Northampton, Massachusetts.
Hambleton, R. K. (2002, June). Test designs and item formats for the next generation of
assessments. Invited discussant remarks at the International Conference on Computer-Based Testing and the Internet, Winchester, England.
Hambleton, R. K. (2002, June). Testing in the 21st century: What’s new and what measurement
problems need to be solved? Keynote address at the GITP Conference, “Psychological Research: Luxury or Necessity,” Amsterdam, the Netherlands.
Hambleton, R. K. (2002, June). Adding meaning to test scores, finally! Presentation at the 32nd
Annual National Conference on Large-Scale Assessment, Palm Desert, California.
Hambleton, R. K. (2002, July). Progress in large-scale medical testing: Methodological
advances and new challenges. Keynote address at the Tenth Ottawa Conference for Medical Education, Ottawa.
Hambleton, R. K. (2002, July). The promises and challenges of computer-based testing
[Abstract]. Proceedings of the 25th International Congress of Applied Psychology, Singapore.
Hambleton, R. K. (2002, November). Setting performance standards on state assessments.
Invited presentation at the Harcourt Midwest Assessment Forum, Chicago.
Hambleton, R. K. (2002, December). Psychometric developments, 1966 to 2002, and challenges
for the future. Invited presentation at the International Conference on Measurement for the Social Sciences (Festschrift to Honour Ross Traub), Toronto.
Hambleton, R. K. (2003, January). Theory, methods, and practices in testing for the 21st
century. Presentation at the Honoris Causa Ceremony, University of Oviedo, Spain.
Hambleton, R. K. (2003, February). Advances in testing practices in the 21st century . . . not so
fast. Keynote address at the annual meeting of the Association of Test Publishers, Amelia Island, Florida.
Hambleton, R. K. (2003, April). Evaluation of new computer-based test designs for
credentialing exams. Paper presented at the meeting of NCME, Chicago.
Hambleton, R. K. (2003, July). Computer-based testing: Great concept but many statistical
problems to overcome. Invited address at the IX Seminar on Applied Statistics, Rio de Janeiro, Brazil.
Hambleton, R. K. (2003, July). Applying item response theory models in educational testing.
Keynote address at the IX Seminar on Applied Statistics, Rio de Janeiro, Brazil.
Hambleton, R. K. (2004, February). ITC guidelines for adapting exams into multiple languages and cultures. Invited presentation at the ATP Conference on Computer-Based Testing, Palm Springs, 2004.
Hambleton, R. K. (2004, February). Setting AICPA passing scores: So how much is good
enough? Invited presentation at the ATP Conference on Computer-Based Testing, Palm Springs, 2004.
Hambleton, R. K. (2004, June). Comparing IRT models for the analysis of quality of life
research data. Invited address at the 2004 International Society for Quality of Life Research Symposium, Boston.
Hambleton, R. K. (2004, June). Consistency of performance standards over grades and subjects.
Presentation at the annual CCSSO Conference, Boston.
Hambleton, R. K. (2004, June). Traditional and modern approaches to outcomes measurement.
Invited presentation at the Advances in Health Outcomes Measurement Conference, Bethesda, MD.
Hambleton, R. K. (2004, October). Guidelines and methodology for adapting educational and
psychological tests. An invited presentation at the 4th International Test Commission Conference on Equitable Assessment Practices, Williamsburg, VA.
Hambleton, R. K. (2005, February). A new challenge in testing: Making test scores more
understandable. An invited presentation at ATP’s Innovations in Testing Conference, Scottsdale, AZ.
Hambleton, R. K. (2005, May). Educational assessment in the 21st century: Two stories to tell
so far. Keynote presentation at the CERA Meeting, London, Ontario.
Hambleton, R. K. (2005, July). Item response theory: Recent advances and technical challenges.
Invited presentation at the 9th European Congress of Psychology, Granada, Spain.
Hambleton, R. K. (2005, November). Advances in assessment for the 21st century. Invited
presentation at the meeting of the Center for Innovation, National Board of Medical Examiners, Philadelphia.
Hambleton, R. K. (2006, February). Making diagnostic score reports more clear and meaningful
for candidates. An invited presentation at the ATP Conference, Orlando, FL.
Hambleton, R. K. (2006, February). Using item response theory (IRT) models to equate test
scores. An invited presentation at the ATP Conference, Orlando, FL.
Hambleton, R. K. (2006, March). Six big problems to overcome in educational and
psychological measurement. An invited presentation at the University of Oviedo, Spain.
Hambleton, R. K. (2006, May). Applying IRT models to health science data. An invited
presentation at Northwestern University, Evanston.
Hambleton, R. K. (2006, June). Automated test assembly with item response theory. An invited
presentation at the CCSSO meeting, San Francisco.
Hambleton, R. K. (2006, June). Multiple languages in large-scale assessments. An invited
presentation at the CCSSO meeting, San Francisco.
Hambleton, R. K. (2006, July). Recent developments in educational assessment. Invited
presentation at the 26th International Congress of Applied Psychology, Athens, Greece.
Hambleton, R. K. (2006, August). Issues in test adaptation methodology. Invited paper
presented at the meeting of APA, New Orleans.
Hambleton, R. K. (2006, August). Five big challenges in educational and psychological
assessment. Invited presentation at the meeting of APA, New Orleans.
Hambleton, R. K. (2006, October). Item response theory and models for the next generation of
educational and psychological tests. An invited presentation at the Winemiller 2006 Conference on Methodological Development of Statistics in the Social Sciences, Columbia, Missouri.
Hambleton, R. K. (2006, October). Applications of item response theory to improve health
outcomes assessment. An invited presentation at the Conference on New Methods for the Analysis of Family and Dyadic Processes, University of Massachusetts, Amherst.
Hambleton, R. K., Arrasmith, D., & Smith, I. L. (1986, April). Optimal selection of test items.
Paper presented at the meeting of NCME, Washington, DC.
Hambleton, R. K., Arrasmith, D., & Smith, I. L. (1986, June). Optimal item selection for
credentialing examinations. Paper presented at the meeting of the Psychometric Society, Toronto.
Hambleton, R. K., & Artes-Ferragud, M. (1990, June). New directions in item response theory:
Applications of multichotomous response models. Paper presented at the meeting of the Canadian Educational Research Association, Victoria, BC.
Hambleton, R. K., & Berberoglu, G. (1997, March). Third International Mathematics and
Science Study: test adaptation methods and results. Paper presented at the meeting of NCME, Chicago.
Hambleton, R. K., Blanchard, K. H., & Hersey, P. (1978, June). Validity of situational
leadership theory and applications. Paper presented at the 19th International Congress of Applied Psychology, Munich.
Hambleton, R. K., & Bollwark, J. (1990, July). Test translations in cross-cultural studies.
Invited paper presented at the meeting of the International Congress of Applied Psychology, Kyoto, Japan.
Hambleton, R. K., Bollwark, J., & Rogers, H. J. (1990, April). Detecting potentially biased test
items. Paper presented at the meeting of AERA, Boston.
Hambleton, R. K., & Boulet, J. (1996, September). Psychometric methods for medical
examinations. Presentation at the annual meeting of the Association for Medical Education in Europe, Copenhagen.
Hambleton, R. K., & Bourque, M. L. (1992, April). Methodological considerations in setting standards on national examinations. Invited paper presented at the meeting of AERA, San Francisco.
Hambleton, R. K., & Cadman, S. (1994, July). Item response theory models and applications:
Current status and future directions. Invited paper presented at the 23rd Congress of Applied Psychology, Madrid.
Hambleton, R. K., & Cook, L. L. (1976, April). Introduction to latent trait models and their use
in analyzing educational test data. Paper presented at the meeting of NCME, San Francisco.
Hambleton, R. K., & Cook, L. L. (1978, April). Robustness of latent trait models. Paper
presented at the meeting of AERA, Toronto.
Hambleton, R. K., Dirir, M., & Lam, P. (1992, April). Effects of optimal test designs on
measurement precision and decision accuracy. Paper presented at the meeting of AERA, San Francisco.
Hambleton, R. K., & Eignor, D. R. (1977, July). Adaptive testing applied to hierarchically
structured objectives-based curricula. Invited paper presented at the Second Conference on Computerized Adaptive Testing, University of Minnesota.
Hambleton, R. K., & Eignor, D. R. (1978, April). Criteria for evaluating criterion-referenced
tests and test manuals. Paper presented at the meeting of NCME, Toronto.
Hambleton, R. K., & Eignor, D. R. (1978, February). Minimum competency level identification:
A review of selected issues, methods, and implementation strategies. Paper presented at the AERA Conference on Minimum Competency Testing, Washington.
Hambleton, R. K., & Eignor, D. R. (1978, April). Allocating testing time in objectives-based
instructional programs. Paper presented at the meeting of APA, Toronto.
Hambleton, R. K., & Fennessy, L. (1991, November). Advances in credentialing examination
methods. Invited paper presented at the International Symposium on Modern Theories in Measurement: Problems and Issues. Chateau Montebello, Montebello, Quebec, Canada.
Hambleton, R. K., & Friedman, M. (1996, September). Advances in assessment using
standardized patient methodology: a psychometrician's perspective. Keynote address presented at the annual meeting of the Association for Medical Education in Europe, Copenhagen.
Hambleton, R. K., & Gifford, J. A. (1979, July). Robustness of latent trait models. Invited paper
presented at the 1979 Computerized Adaptive Testing Conference, Minneapolis.
Hambleton, R. K., & Gorth, W. P. (1970, October). Item analysis for criterion-referenced tests.
Paper presented at the meeting of NERA, Liberty, New York.
Hambleton, R. K., Gorth, W. P., & O'Reilly, R. P. (1971, October). A formative evaluative
model for classroom instruction. Paper presented at the meeting of NERA, Liberty, New York.
Hambleton, R. K., Gower, C., & Bollwark, J. (1987, October). Assessing problem-solving ability with computer-adaptive testing procedures. Paper presented at the 29th meeting of the Military Testing Association, Ottawa, Canada.
Hambleton, R. K., Gower, C., & Bollwark, J. (1988, April). New testing methods to assess
technical problem solving. Paper presented at the meeting of AERA, New Orleans.
Hambleton, R. K., Gower, C., & Bollwark, J. (1988, August). Computer-administered tests to
assess troubleshooting skills. Paper presented at the meeting of APA, Atlanta.
Hambleton, R. K., Gower, C., & Rogers, H. J. (1989, April). Customized testing: Review of
issues and methods. Paper presented at the meeting of NCME, San Francisco.
Hambleton, R. K., & Han, N. (2004, April). Assessing the fit of IRT models. Paper presented at
the meeting of NCME, San Diego.
Hambleton, R. K., & Han, N. (2006, April). Have my test items been stolen? Item statistics to
find out. Invited paper presented at the meeting of NCME, San Francisco.
Hambleton, R. K., Han, N., & Ying, L. (2004, February). Detecting disclosed test items in a
computer-based testing environment. Invited presentation at the ATP Conference on Computer-Based Testing, Palm Springs, CA.
Hambleton, R. K., Hutten, L., & Swaminathan, H. (1974, August). A comparison of several
methods for assessing student mastery in objectives-based instructional programs. Paper presented at the meeting of APA, New Orleans.
Hambleton, R. K., Jaeger, R., & Plake, B. (1994, October). Performance standard setting on the
EAG assessment package: What was done? What was learned? Presentation at the first NBPTS-ADL-TAG colloquium on measurement and methodology, Washington.
Hambleton, R. K., Jaeger, R. M., Plake, B. S., & Mills, C. (1997, March). Issues and methods
for setting standards on performance assessments. Paper presented at the meeting of AERA, Chicago.
Hambleton, R. K., & Jodoin, M. (2001, February). Applying item response models to
credentialing exams: Answers to the 10 most important questions. Invited presentation at the ATP Conference on Computer-Based Testing, Tucson, Arizona.
Hambleton, R. K., & Jones, R. W. (1991, April). Influence of various factors on the accuracy of
test information functions. Paper presented at the meeting of NCME, Chicago.
Hambleton, R. K., & Jones, R. W. (1992, April). Comparison of statistical and judgmental
methods for assessing DIF. Paper presented at the meeting of NCME, San Francisco.
Hambleton, R. K., & Jones, R. W. (1992, July). International impact of item response theory on
testing practices. Invited paper presented at the 25th International Congress of Psychology, Brussels, Belgium.
Hambleton, R. K., & Jones, R. W. (1993, April). Item parameter estimation errors and their influence on test information functions. Paper presented at the meeting of NCME, Atlanta.
Hambleton, R. K., Jones, R. W., & Rogers, H. J. (1990, May). Comparison of empirical and
judgmental methods for detecting potentially biased test items. Paper presented at the meeting of the New England Educational Research Organization, Rockport, Maine.
Hambleton, R. K., & Kanjee, A. (1992, October). Methodological issues in large scale
assessment. Invited paper presented at the International Symposium in China's Higher Education Examinations, Nanjing, China.
Hambleton, R. K., & Kanjee, A. (1993, April). Enhancing the validity of cross-national validity
studies: Solving the test translation problem. Paper presented at the meeting of AERA, Atlanta.
Hambleton, R. K., & Kanjee, A. (1994, July). Enhancing the validity of cross-cultural testing
issues, research designs, and psychometric methods. Paper presented at the 23rd Congress of Applied Psychology, Madrid.
Hambleton, R. K., & Li, S. (2004, August). Effective implementation of the International Test
Commission Guidelines for Adapting Tests. Invited presentation at the 28th International Congress of Psychology, Beijing, China.
Hambleton, R. K., Li, S., & Sireci, S. G. (2003, April). Identifying common problems in item
translation: A meta analysis. Paper presented at the meeting of NCME, Chicago.
Hambleton, R. K., & Martois, J. S. (1982, April). Validity of a derived score prediction system
based on item response theory principles and procedures. Paper presented at the meeting of AERA, New York.
Hambleton, R. K., Martois, J. S., & Williams, C. (1983, April). Detection of biased items with
item response models. Paper presented at the meeting of AERA, Montreal.
Hambleton, R. K., & Meara, K. (1998, August). The Graduate Record Examination: What is the
validity evidence? Invited paper presented at the meeting of the American Psychological Association, San Francisco.
Hambleton, R. K., & Meara, K. (1999, November). Newspaper coverage of NAEP results:
1990-1998. Presentation at the meeting of the National Assessment Governing Board, Washington, DC.
Hambleton, R. K., & Mills, C. N. (1981, April). Ability estimation with three logistic test
models. Paper presented at the meeting of NCME, Los Angeles.
Hambleton, R. K., Mills, C. N., & Simon, R. (1981, April). Determining the optimal length of a
criterion-referenced test. Paper presented at the meeting of NCME, Los Angeles.
Hambleton, R. K., & Murray, J. (1977, April). A comparative study of faculty and student
attitudes toward a variety of college grading purposes and practices. Paper presented at the meeting of NCME, New York.
Hambleton, R. K., & Murray, L. N. (1984, April). Assessing the dimensionality of NAEP reading items: A look at several approaches. Paper presented at the meeting of AERA, New Orleans.
Hambleton, R. K., Murray, L. N., & Williams, P. (1983, April). Fitting item response models to
test data: Approaches and examples. Paper presented at the meeting of AERA, New York.
Hambleton, R. K., & Patsula, L. (1996, August). Adaptation/translation of tests: issues, technical
advances, and practical steps. Paper presented at the meeting of APA, Toronto.
Hambleton, R. K., & Patsula, L. (1996, August). Test adaptations: review of methods and
suggestions for additional research. Paper presented at the 26th International Congress of Psychology, Montreal.
Hambleton, R. K., & Patsula, L. (1997, September). Adapting tests for use in multiple languages
and cultures: sources of error, possible solutions, and practical guidelines. Invited paper presented at the Fourth European Conference on Psychological Testing, Lisbon.
Hambleton, R. K., & Patsula, L. (1998, April). Increasing the validity of adapted tests: Problems
to overcome and guidelines to follow for improving test adaptation practices. Paper presented at the meeting of AERA, San Diego.
Hambleton, R. K., & Plake, B. S. (1994, April). Using an extended Angoff procedure to set
standards on complex performance assessments. Paper presented at a joint meeting of AERA and NCME, New Orleans.
Hambleton, R. K., & Plake, B. S. (1997, March). An anchor-based approach to setting standards
on complex performance assessments. Paper presented at the meeting of AERA, Chicago.
Hambleton, R. K., Plake, B. S., & Engelhard, G. (2001, April). Richard M. Jaeger’s
contributions to standard-setting methods. Invited symposium at the meeting of AERA, Seattle.
Hambleton, R. K., & Powell, S. (1978, May). Future directions in testing. Paper presented at the
National Future Studies Conference, University of Massachusetts at Amherst.
Hambleton, R. K., Powell, S., & Eignor, D. R. (1979, April). Issues and methods for standard-setting.
Paper presented at the meeting of NCME, San Francisco.
Hambleton, R. K., Powers, T., & Rovinelli, R. (1972, April). An investigation of the effects of
test administration procedures and scoring on the reliability and validity of achievement tests. Paper presented at the meeting of AERA, Chicago.
Hambleton, R. K., Roberts, D. M., & Traub, R. E. (1969, February). Comparison of two
methods for assessing partial knowledge. Paper presented at the meeting of the Canadian Conference for Research in Education, Victoria, British Columbia.
Hambleton, R. K., & Rogers, H. J. (1985, April). Evaluation of the plot method for identifying
biased test items. Paper presented at the meeting of AERA, Chicago.
Hambleton, R. K., & Rogers, H. J. (1985, April). Advances in developing certification and licensure tests. Paper presented at the meeting of AERA, Chicago.
Hambleton, R. K., & Rogers, H. J. (1986, April). Promising advances in assessing the fit of item response models. Paper presented at the meetings of AERA and NCME, San Francisco.
Hambleton, R. K., & Rogers, H. J. (1987, June). Solving criterion-referenced testing problems with item response models. Paper presented at the biannual meeting of the European Psychometric Society, Enschede, The Netherlands.
Hambleton, R. K., & Rogers, H. J. (1988, April). Applications of IRT models to criterion-
referenced measurement problems. Invited paper presented at the meetings of AERA and NCME, New Orleans.
Hambleton, R. K., & Rogers, H. J. (1988, April). Detecting biased test items: Comparison of the
IRT area and Mantel-Haenszel methods. Paper presented at the meeting of AERA, New Orleans.
Hambleton, R. K., & Rogers, H. J. (1988, June). Applying IRT models to large-scale assessment
data. Invited paper presented at the International Symposium on Large-Scale Assessments in an International Perspective, Deidesheim, Federal Republic of Germany.
Hambleton, R. K., & Rogers, H. J. (1989, April). Detecting potentially biased test items:
Comparison of empirical and judgmental methods. Paper presented at the meeting of AERA, San Francisco.
Hambleton, R. K., & Rogers, H. J. (1990, April). Solving some practical problems that arise in using IRT models. Invited one-day training session at the meeting of NCME, Boston.
Hambleton, R. K., Rogers, H. J., & Arrasmith, D. (1986, April). A comparison of the Mantel-Haenszel statistic and item response methods of identifying differential item performance. Paper presented at the meeting of AERA, San Francisco.
Hambleton, R. K., Rogers, H. J., & Arrasmith, D. (1986, August). Identifying potentially biased
test items: A comparison of Mantel-Haenszel statistic and several item response theory methods. Paper presented at the meeting of APA, Washington, DC.
Hambleton, R. K., Rogers, H. J., & Jones, R. W. (1990, August). Influence of item parameter estimation errors in test development. Paper presented at the meeting of APA, Boston.
Hambleton, R. K., & Rovinelli, R. J. (1983, April). Assessing the dimensionality of a set of test items. Paper presented at the meeting of AERA, Montreal.
Hambleton, R. K., Rovinelli, R. J., & Gorth, W. P. (1971, April). Efficiency of various item-examinee sampling designs for estimating test parameters. Paper presented at the meeting of APA, Washington, DC.
Hambleton, R. K., & Simon, R. (1979, October). A comprehensive model for building criterion-
referenced tests. Paper presented at the meeting of NERA, Ellenville, New York.
Hambleton, R. K., & Simon, R. (1980, April). Steps for constructing criterion-referenced tests. Paper presented at the meeting of AERA, Boston.
Hambleton, R. K., & Slater, S. (1994, October). Using performance standards to report national
and state assessment data: Are the reports understandable and how can they be improved? Invited paper presented at the Joint Conference on Standard-Setting for Large-Scale Assessments, Washington.
Hambleton, R. K., & Slater, S. (1995, April). Reliability issues and methods for credentialing exams. Paper presented at the meetings of AERA and NCME, San Francisco.
Hambleton, R. K., & Slater, S. (1995, July). Item response theory: Models and applications. Paper presented at the Fourth European Congress of Psychology, Athens.
Hambleton, R. K., & Slater, S. C. (1996, April). Are NAEP executive summary reports understandable to policy-makers and educators? Invited paper presented at the meeting of NCME, New York.
Hambleton, R. K., Stetz, R., & Rios, A. (1983, April). The development of objectives-based
programs in occupational education. Paper presented at the meeting of NERA, Ellenville, New York.
Hambleton, R. K., Sutnick, A. I., & Friedman, M. (1995, September). New methods for setting
standards on performance assessments. Paper presented at the meeting of the Association for Medical Education in Europe, Zaragoza, Spain.
Hambleton, R. K., Swaminathan, H., & Algina, J. (1975, June). Toward a theory and practice of
criterion-referenced testing. Paper presented at the Second International Symposium of Educational Testing, Montreux, Switzerland.
Hambleton, R. K., Swaminathan, H., Sireci, S., Xing, D., & Rizavi, S. (1998, April). Estimating
item statistics with judgmental data and Bayesian statistical procedures. Paper presented at the meeting of AERA, San Diego.
Hambleton, R. K., & Traub, R. E. (1970, February). Analysis of empirical data using the Rasch
model and two- and three-parameter logistic models. Paper presented at the meeting of AERA, Minneapolis.
Hambleton, R. K., & Traub, R. E. (1970, May). Some preliminary results on the robustness of
the Rasch test theory model. Paper presented at the meeting of the New England Educational Research Organization (NEERO), Boston.
Hambleton, R. K., & Traub, R. E. (1970, August). Information curves and efficiency of three
logistic test models. Paper presented at the meeting of the American Psychological Association, Miami.
Hambleton, R. K., & Traub, R. E. (1971, April). Some results on the robustness of the Rasch test
theory model. Paper presented at the meeting of AERA, New York.
Hambleton, R. K., et al. (1977, April). Measurement models for the future: A review of latent trait models, technical developments, and applications. Symposium presented at the meeting of AERA and NCME, New York.
Hambleton, R. K., & van der Linden, W. (1993, June). Advances in measurement models,
methods, and practices. Invited paper presented at the ITC Conference on Test Use with Children and Youth, Oxford, England.
Hambleton, R. K., & Xing, D. (2002, January). Maximizing the usefulness of computer-based
test designs for making pass-fail decisions. Paper presented at the meeting of the Canadian Educational Research Association, Toronto.
Hambleton, R. K., & Yu, J. (1991, December). Impact of item response theory models on testing
practices. Invited paper presented at the International Symposium on Psychological Measurement, Nanjing, P.R.C.
Hambleton, R. K., & Zaal, J. (1986, July). Computerized adaptive testing: Theory, applications,
and standards. Paper presented at the 21st meeting of the International Congress of Applied Psychology, Jerusalem.
Hambleton, R. K., & Zenisky, A. (2001, April). Increasing the meaningfulness of score scales and reports. Paper presented at the meeting of NCME, Seattle.
Hambleton, R. K., Zenisky, A., & Jodoin, M. (2001, July). Computer-based test designs and item formats for the next generation of tests. Invited paper presented at the 7th European Congress on Psychology, London.
Han, N., & Hambleton, R. K. (2004, April). Detecting exposed test items in a computer-based testing environment. Paper presented at the NCME meeting, San Diego.
Han, N., Li, S., & Hambleton, R. K. (2005, April). Kernel versus IRT equating. Paper presented at the meeting of NCME, Montreal.
Jaeger, R. M., Hambleton, R. K., & Plake, B. S. (1995, April). Eliciting configural performance standards through a sequenced application of complementary methods. Paper presented at the meetings of AERA and NCME, San Francisco.
Jaeger, R. M., Plake, B., & Hambleton, R. K. (1993, January). Designs for setting standards on
multidimensional performance assessments. Paper presented at the meeting of the North Carolina Association for Research in Education, Greensboro, NC.
Jaeger, R., Plake, B. S., & Hambleton, R. K. (1993, April). Integrating multi-dimensional performances and setting standards. Paper presented at the meeting of NCME, Atlanta.
Jirka, S. J., Baldwin, S. G., Karantonis, A. M., Wells, C. S., & Hambleton, R. K. (2006, October). Population invariance: Comparison of converted scores for a national testing program. Paper presented at the Northeastern Educational Research Association, Kerhonkson, New York.
Jodoin, M., Zenisky, A., & Hambleton, R. K. (2002, April). Comparison of the psychometric properties of several computer-based test designs for credentialing exams. Paper presented at the meeting of NCME, New Orleans.
Jones, R. W., & Hambleton, R. K. (1991, April). Fitting IRT models to the Graduate
Management Admissions Test. Paper presented at the meeting of NEERO, Portsmouth, NH.
Karantonis, A. M., Baldwin, S. G., Jirka, S. J., Wells, C. S., & Hambleton, R. K. (2006,
October). Item parameter invariance across states in a national assessment program. Paper presented at the Northeastern Educational Research Association, Kerhonkson, New York.
Karantonis, A. M., Wells, C., & Hambleton, R. K. (2007, April). Defining performance
categories: Using an IRT-based approach to identify exemplar items. Paper presented at the NCME meeting, Chicago.
Lam, P., Swaminathan, H., & Hambleton, R. K. (1992, April). Use of binary programming in
test designs to address content balancing in adaptive tests. Paper presented at the meeting of AERA, San Francisco.
Ma, X., Klauck, S., Ying, L., & Hambleton, R. K. (2001, October). DIF analyses on a state assessment. Paper presented at the meeting of NERA, Ellenville, NY.
Mazor, K., Clauser, B., & Hambleton, R. K. (1991, April). The effect of sample size on the functioning of the Mantel-Haenszel statistic. Paper presented at the meeting of NCME, Chicago.
Mazor, K., Clauser, B., & Hambleton, R. K. (1992, April). Detection methods for non-uniform bias. Paper presented at the meeting of NCME, San Francisco.
Mazor, K., Hambleton, R. K., & Clauser, B. (1994, April). The effects of conditioning on two internally derived ability estimates in multidimensional DIF analysis. Paper presented at the meeting of AERA, New Orleans.
McCormack, J., Miller, C., Hambleton, R. K., & Eignor, D. R. (1976, May). Goal-setting ability
in young children: Theory, instrumentation, and measurement. Paper presented at the annual meeting of NEERO, Provincetown, Massachusetts.
McKinley, D. W., Boulet, J. R., & Hambleton, R. K. (2000, April). Standard setting for
performance based assessment: A pilot study using an empirically defined, multi-faceted approach. Paper presented at the meeting of AERA, New Orleans.
McKinley, D. W., Boulet, J. R., Hambleton, R. K., & Burdick, W. P. (1999, September).
Statistical procedures for improving standardized patient assessments. Paper presented at the meeting of the AMEE, Linkoping, Sweden.
McKinley, D. W., Boulet, J., & Hambleton, R. K. (2003, September). Psychometric challenges
associated with standardized patient assessments. Paper presented at the meeting of the Association for Medical Education in Europe, Bern, Switzerland.
McKinley, D. W., Boulet, J. R., & Hambleton, R. K. (2004, July). An examinee-centered approach to setting passing scores for standardized patient examinations. Paper presented at the Ottawa Conference for Medical Education, Barcelona, Spain.
Melican, G., Breithaupt, K., Mills, C. N., & Hambleton, R. K. (2005, April). Multi-stage testing and case studies in a functioning licensing examination. Paper presented at the meeting of NCME, Montreal.
Mills, C. N., & Hambleton, R. K. (1979, October). Issues and methods of reporting criterion-referenced test scores. Paper presented at the meeting of NERA, Ellenville, New York.
Mills, C. N., & Hambleton, R. K. (1980, April). Guidelines for reporting criterion-referenced test score information. Paper presented at the meeting of AERA, Boston.
Mills, C. N., & Hambleton, R. K. (1982, April). Developing norms for a vertically equated item bank. Paper presented at the meeting of AERA, New York.
Mills, C., Jaeger, R. M., Plake, B. S., & Hambleton, R. K. (1998, April). An investigation of several new methods for establishing standards on complex performance assessments. Paper presented at the meeting of AERA, San Diego.
Mills, C. N., Plake, B. S., Jaeger, R. M., & Hambleton, R. K. (1997, March). Lessons learned: a
comparison of two methods for establishing performance standards on complex performance assessments. Paper presented at the meeting of AERA, Chicago.
Monahan, P. O., Stump, T. E., Finch, H., & Hambleton, R. K. (2005, April). Bias of exploratory
and cross-validated DETECT index under null hypothesis of unidimensionality. Paper presented at the meeting of NCME, Montreal.
Muñiz, J., & Hambleton, R. K. (1991, April). Medio siglo de teoría de respuesta a los ítems.
Invited paper presented at the Second Congress of Behavioral Sciences Methodology, Canary Islands, Spain.
Muñiz, J., Hambleton, R. K., & Xing, D. (1997, July). Small sample empirical procedures for
detecting poorly translated or adapted test items. Paper presented at the Fifth European Congress of Psychology, Dublin, Ireland.
Muñiz, J., Hambleton, R. K., & Xing, D. (1997, September). Evaluation of differential item
functioning in small samples. Paper presented at the Congress of Methodology for the Social Sciences, Seville, Spain.
Muñiz, J., Hambleton, R. K., & Xing, D. (1998, April). Small sample studies to detect flaws in test translation. Paper presented at the meeting of NCME, San Diego.
Muñiz, J., Hambleton, R. K., & Xing, D. (1998, August). Small sample statistical approaches for identifying poorly adapted test items. Invited paper presented at the 24th International Congress of Applied Psychology, San Francisco.
Muñiz, J., Hambleton, R. K., & Xing, D. (1999, May). Small sample detection of poorly
translated test items. Paper presented at the International Conference on Adapting Tests for Use in Multiple Languages and Cultures, Washington, DC.
Murray, L. N., & Hambleton, R. K. (1981, April). Building item banks. Paper presented at the meeting of NEERO, Lenox, Massachusetts.
Murray, L. N., & Hambleton, R. K. (1983, April). Compiling evidence to address item response model-test data fit. Paper presented at the meeting of AERA, Montreal.
Narayanan, P., Hambleton, R. K., & Plake, B. S. (1994, April). Two-stage testing as an approximation to computerized adaptive testing. Paper presented at the meeting of AERA, New Orleans.
Oakland, T., & Hambleton, R. K. (1999, April). Improving testing practices around the world. Invited paper presented at the meeting of NCME, Montreal.
O'Reilly, R. P., & Hambleton, R. K. (1981, April). A CMI model for an individualized learning program in ninth grade science. Paper presented at the meeting of AERA, New York.
O'Reilly, R. P., & Hambleton, R. K. (1971, April). Applied CMI models for groups and individually prescribed instruction in New York State. Paper presented at the meeting of NCME, New York.
Patsula, L., & Hambleton, R. K. (1999, April). Accuracy of ability estimates obtained from
computerized adaptive, paper and pencil, and multi-stage tests. Paper presented at the meeting of NCME, Montreal.
Pauker, R., & Hambleton, R. K. (1976, April). Matching students and teachers to maximize
learning: What do students think? Paper presented at the meeting of the International Congress for Individualized Instruction, Boston.
Pitoniak, M. J., Hambleton, R. K., & Biskin, B. H. (2003, April). Setting standards on tests
containing computerized performance tasks. Paper presented at the meeting of NCME, Chicago.
Pitoniak, M., Hambleton, R. K., & Sireci, S. (2002, April). Comparative analysis of two methods for setting standards. Paper presented at the meeting of NCME, New Orleans.
Plake, B. S., Hambleton, R. K., & Jaeger, R. M. (1995, April). Score profile method for setting standards for complex performance assessments. Paper presented at the meeting of AERA, San Francisco.
Plake, B. S., & Hambleton, R. K. (1998, April). Categorical assignments of student work: an
analytical standard-setting method designed for complex performance assessments with multiple performance categories. Paper presented at the meetings of AERA and NCME, San Diego.
Rogers, H. J., & Hambleton, R. K. (1987, April). Evaluation of computer-simulated baseline
statistics for use in item bias studies. Paper presented at the meeting of AERA, Washington, DC.
Rovinelli, R., & Hambleton, R. K. (1973, October). Some procedures for the validation of
criterion-referenced test items. Paper presented at the meeting of NERA, Ellenville, New York.
Rovinelli, R., & Hambleton, R. K. (1976, April). On the use of content specialists in the assessment of criterion-referenced test item validity. Paper presented at the meeting of AERA, San Francisco.
Rovinelli, R., & Hambleton, R. K. (1976, May). Improving the quality of achievement tests used
in PSI programs. Paper presented at the Third National Conference on Personalized Instruction, Washington, DC.
Royer, M., Hambleton, R. K., & Cadorette, L. (1976, April). Individual differences in the long-
term retention of meaningful materials. Paper presented at the meeting of AERA, San Francisco.
Skorupski, W. P., & Hambleton, R. K. (2003, April). What are panelists really thinking when they set performance standards? Paper presented at the meeting of NCME, Chicago.
Sheehan, D. S., & Hambleton, R. K. (1972, October). An application of latent partition analysis to the evaluation of instruction. Paper presented at the joint meeting of NERA-NCME, Boston.
Sheehan, D. S., & Hambleton, R. K. (1976, April). A review of selected factors affecting
questionnaire and interview results. Paper presented at the meeting of AERA, San Francisco.
Slawson, D. A., Novak, J., & Hambleton, R. K. (1988, April). A qualitative approach to the
evaluation of expert system shells. Paper presented at the meeting of AERA, New Orleans.
Smith, I. L., Hambleton, R. K., & Rosen, G. (1988, August). Content validity studies of the
Examination for Professional Practice of Psychology. Paper presented at an invited symposium at the meeting of APA, Atlanta.
Spineti, R., & Hambleton, R. K. (1973, October). A computer simulation study of tailored
testing strategies for objectives-based instructional programs. Paper presented at the meeting of NERA, Ellenville, New York.
Swaminathan, H., Hambleton, R. K., & Algina, J. (1973, October). A decision-theoretic
approach to issues in criterion-referenced assessment. Paper presented at the meeting of NERA, Ellenville, New York.
Swaminathan, H., Hambleton, R. K., & Algina, J. (1974, April). Reliability of criterion-referenced tests. Paper presented at the meeting of APA, New Orleans.
Traub, R. E., & Hambleton, R. K. (1970, February). Effect of scoring instructions and degree of speededness on validity and reliability of multiple-choice tests. Paper presented at the meeting of AERA, Minneapolis.
Traub, R. E., & Hambleton, R. K. (1971, April). The effect of instruction upon the semantic
space defined by measurement concepts. Paper presented at the meeting of AERA, New York.
Traub, R. E., Hambleton, R. K., & Singh, B. (1968, February). Effects of promised reward and threatened penalty on performance in a multiple-choice vocabulary test. Paper presented at the meeting of AERA, Chicago.
van de Vijver, F. J. R., & Hambleton, R. K. (1996, August). Translating tests: Some practical guidelines. Paper presented at the meeting of APA, Toronto.
Wainer, H., Hambleton, R. K., & Meara, K. (1999, April). Alternative displays for communicating NAEP results: A redesign and validity study. Paper presented at the meeting of NCME, Montreal.
Welsh, W., & Hambleton, R. K. (1975, April). On the use of goals in evaluation: A review of selected issues. Paper presented at the meeting of AERA, Washington, DC.
Xing, D., & Hambleton, R. K. (2002, April). Impact of test design, item quality, and item bank size on the psychometric properties of computer-based credentialing exams. Paper presented at the meeting of NCME, New Orleans.
Ying, L., & Hambleton, R. K. (2004, April). Statistics for detecting disclosed items in a CAT environment. Paper presented at the meeting of NCME, San Diego.
Yu, J., & Hambleton, R. K. (1996, August). Field test of the ITC guidelines for adapting psychological tests. Paper presented at the 26th International Congress of Psychology, Montreal.
Zenisky, A. L., & Hambleton, R. K. (2004, April). Investigating the effects of selected
multistage test design alternatives on credentialing outcomes. Paper presented at the NCME meeting, San Diego.
Zenisky, A. L., Hambleton, R. K., & Robin, F. (2001, August). Two-stage large sample DIF procedures for state assessments. Paper presented at the meeting of APA, San Francisco.
Zenisky, A. L., Hambleton, R. K., & Sireci, S. G. (2000, April). Effects of item dependencies among MCAT items on the validity of IRT item, test, and ability statistics. Paper presented at the meeting of NCME, New Orleans.
Zhao, Y., & Hambleton, R. K. (2006, April). Impact of IRT model misfit on score precision and performance classifications. Paper presented at the meeting of NCME, San Francisco.
Zhao, Y., & Hambleton, R. K. (2006, October). Consequences of IRT model fit in equating. Paper presented at the Northeastern Educational Research Association, Kerhonkson, New York.
Zumbo, B. D., Sireci, S. G., & Hambleton, R. K. (2003, April). Revisiting exploratory methods
for construct comparability: Is there something to be gained for the ways of the old? Paper presented at the meeting of NCME, Chicago.
INVITED DISCUSSANT AT PROFESSIONAL MEETINGS:
• Applications of criterion-referencing to the testing of language. Symposium presented at the meeting of the Eastern Psychological Association, Washington, DC, 1973.
• Criterion-referenced testing. Symposium presented at the meeting of AERA, Chicago,
1974.
• Perspectives on criterion-referenced testing. Paper-reading session at the meeting of NCME, San Francisco, 1976.
• Evaluation of student progress and school environment in the Anisa early childhood
educational program. Symposium presented at the meeting of NEERO, Provincetown, Massachusetts, 1976.
• Mastery teaching and mastery testing: The integration of instruction and measurement.
Symposium presented at the meeting of AERA, Toronto, 1978.
• What's happening in measurement? The use of Rasch and other latent trait models. Symposium presented at the meeting of the Eastern Educational Research Association, Williamsburg, Virginia, 1978.
• Practical uses of item response theory. Symposium presented at the meeting of AERA, San Francisco, 1979.
• Applications of the Rasch test model. Symposium presented at the meeting of AERA,
San Francisco, 1979.
• Latent trait applications. Symposium presented at the meeting of the NERA, Ellenville, New York, 1979.
• Issues in setting performance standards. Symposium at the 10th Annual Conference on
Large-Scale Assessment, Denver, 1980.
• Competency testing in Detroit. Symposium presented at the meeting of AERA, Boston, 1980.
• Comparison and evaluation of standard-setting methods. Symposium presented at the
meeting of AERA, Boston, 1980.
• Local and state competency testing. Symposium presented at the meeting of AERA, Boston, 1980.
• Methods and issues in setting standards for minimum proficiency tests. Symposium
presented at the meeting of NCME, Los Angeles, 1981.
• Measurement challenges of basic skills assessment programs. Symposium presented at the meeting of AERA, Los Angeles, 1981.
• A multidisciplinary review of criterion-referenced measurement. Symposium presented
at the meeting of AERA, Los Angeles, 1981.
• Impact of test disclosure legislation on national testing programs. Symposium presented at the 11th Annual Conference on Large-Scale Assessment, Boulder, Colorado, 1981.
• The use of item response theory for the development of tests and the interpretation of test
scores. Symposium presented at the meeting of NCME, New York, 1982.
• Measurement models for assessment data. Symposium presented at the meeting of AERA, New York, 1982.
• Using statewide basic skills tests to make promotion decisions: Political and
psychometric issues. Symposium presented at the meeting of AERA, New York, 1982.
• Practically induced expansions in measurement technology. Symposium presented at the meeting of AERA, New York, 1982.
• Latent trait models: How useful are they to professional education? Symposium
presented at the meeting of AERA, New York, 1982.
• Comparing the one- and three-parameter latent trait models: Point, counterpoint, and discussion. Symposium presented at the meeting of AERA, New York, 1982.
• State testing programs and testing policies: How they influence schools. Symposium
presented at the meeting of AERA, Montreal, 1983.
• Framework for problem identification in test projects. Symposium presented at the meeting of AERA, Montreal, 1983.
• Issues and developments in item response theory. Symposium presented at the meeting
of AERA, New Orleans, 1984.
• The criterion problem in professional evaluation: Ministry, medicine, and law. Symposium presented at the meeting of AERA, New Orleans, 1984.
• Critical measurement issues in learning disabilities. Invited symposium presented at the
meeting of APA, Toronto, 1984.
• Fitting item response models to multidimensional data. Symposium presented at the meeting of AERA, Chicago, 1985.
• NAEP: An educational indicator. Symposium presented at the meeting of NCME,
Chicago, 1985.
• Setting standards for high-stakes tests. Symposium presented at the meetings of AERA and NCME, San Francisco, 1986.
• Promising item response model applications. Critique session presented at the meetings
of AERA and NCME, San Francisco, 1986.
• Building tests with item response models. Symposium presented at the meeting of APA, Washington, DC, 1986.
• Item response theory. Symposium presented at the meeting of AERA, Washington, DC, 1987.
• Multidimensional item response models: Models and data. Symposium presented at the
meeting of AERA, Washington, DC, 1987.
• Research on differential item functioning. Papers presented at the meeting of NCME, New Orleans, 1988.
• Customization of a national standardized achievement test. Papers presented at the
meeting of NCME, New Orleans, 1988.
• Assessing dimensionality of test data. Papers presented at the meeting of AERA, New Orleans, 1988.
• Techniques for detecting differential item performance. Papers presented at the meeting
of AERA, New Orleans, 1988.
• Criterion-referenced passing points: New applications, adjustments, and alternatives. Papers presented at the meeting of AERA, New Orleans, 1988.
• Frontiers of assessment in the teaching profession. Papers presented at the meeting of
AERA, New Orleans, 1988.
• Personnel evaluation standards. Symposium presented at the meeting of AERA, San Francisco, 1989.
• Setting standards of performance. Papers presented at the meeting of NCME, San
Francisco, 1989.
• Assessing the utility of IRT models. Papers presented at the meeting of NCME, Boston, 1990.
• Strong modeling approaches to problems in measuring learning and change. Symposium
presented at the meeting of NCME, Boston, 1990.
• Research design methodology. Papers presented at the NEERO meeting, Rockport, Maine, 1990.
• Methodological and practical issues in the normative application of criterion-referenced
assessments. Papers presented at the meeting of NCME, Chicago, 1991.
• Data-based development of licensure tests for teachers. Papers presented at the meeting of NCME, Chicago, 1991.
• Application of performance-based assessment for a whole literacy program. Symposium
presented at the meeting of AERA, San Francisco, 1992.
• Multidimensional IRT models. Papers presented at the meeting of AERA, Atlanta, 1993.
• Equating computer adaptive and paper-and-pencil tests: experiences and lessons learned. Symposium presented at the meeting of AERA, San Francisco, 1995.
• Applied dimensionality. Symposium presented at the meeting of NCME, San Francisco,
1995.
• Assessment in Kentucky: Things are going quite nicely, thank you. Symposium presented at the meeting of NCME, San Francisco, 1995.
• Content validity: An important construct in measurement. Symposium presented at the
meeting of NCME, San Francisco, 1995.
• CATucopia: Measurement issues faced by a large-scale computer adaptive testing program. Symposium presented at the meeting of NCME, New York, April 1996.
• Perspectives on reporting scaling results to students and teachers. Symposium presented
at the meeting of NCME, New York, April, 1996.
• Validity considerations for automated scoring of open-ended responses. Symposium presented at the meeting of NCME, Chicago, 1997.
• The 1997 USMLE Step 1 CBT field-test: Examinee performance, perceptions and
pacing. Symposium presented at the meeting of the NCME, San Diego, 1998.
• Linking complex performance-based assessments: A comparison of novel procedures. Symposium presented at the meeting of the AERA, San Diego, 1998.
• Test-taker rights and responsibilities: Issues and perspectives. Symposium presented at
the meeting of the American Psychological Association, San Francisco, 1998.
• An international perspective on the development of test standards. Invited symposium at the meeting of the 24th International Congress of Applied Psychology, San Francisco, August, 1998.
• Methodological advances in test adaptations for cross-cultural and cross-lingual
assessment. Invited symposium at the meeting of the 24th International Congress of Applied Psychology, San Francisco, August, 1998.
• Translations DIF research: Advances and applications. Symposium presented at the meeting of NCME, Montreal, April, 1999.
• Latent trait and latent class modeling. Symposium presented at the meeting of the AERA, Montreal, April, 1999.
• What have we learned about the test accommodation strategies for English language learners? Symposium presented at the meeting of the NCME, Montreal, April, 1999.
• Understanding fairness in a CAT environment. Symposium presented at the meeting of
NCME, Montreal, April, 1999.
• Issues in grading essays and passages. Symposium presented at the AERA meeting, New Orleans, April, 2000.
• Advances in automated scoring of performance assessments. Symposium presented at
the NCME meeting, New Orleans, April, 2000.
• A comparison of methods for setting standards on NAEP. Symposium presented at the CCSSO Large-Scale Assessment Conference, Snowbird, Utah, June, 2000.
• Technical issues in item response theory. Paper presentation session at the meeting of the
AERA, Seattle, April, 2001.
• Advances in test adaptation methodology. Symposium presented at the meeting of NCME, New Orleans, April, 2002.
• Advances in measurement: Improving measurement by using IRT and MCMC methods.
Paper presentation session at the meeting of NCME, New Orleans, 2002.
• School assessment and evaluation. Submitted paper session at the meeting of AERA, Chicago, 2003.
• International perspectives: Issues of achievement and reform. Submitted papers session
at the meeting of AERA, Chicago, 2003.
• Making test results more useful and understandable. Invited symposium at the meeting of NCME, Chicago, 2003.
• Science and mathematics in an international perspective. Submitted papers session at the
meeting of AERA, San Diego, 2004.
• Standard setting methods: Studying sources of complexity. Invited symposium at the meeting of NCME, Montreal, 2005.
• Test translation methodology: New approaches, practical examples. Symposium
presented at the 9th European Congress of Psychology, Granada, Spain, 2005.
• Methodological developments in international educational research: Experiences from the OECD PISA study. Symposium presented at the meeting of the AERA, San Francisco, 2006.
• Administration mode effects in computer-based large-scale assessments. Symposium
presented at the meeting of the AERA, San Francisco, 2006.
• Topics in IRT modeling. Submitted papers session at the meeting of NCME, San Francisco, 2006.
• Response-time modeling and applications. Discussant for this invited presentation at the
meeting of NCME, San Francisco, April, 2006.
• Designing accessible large-scale reading assessments for students with disabilities: Research and practice. Discussant for this session at the meeting of the CCSSO, San Francisco, June, 2006.
• Setting performance standards under NCLB: Approaches, issues, and implications.
Discussant for this session at the meeting of the CCSSO, San Francisco, June, 2006.
• Is your definition of proficiency limited by the standard setting method you use? Discussant for this session at the meeting of the CCSSO, San Francisco, June, 2006.
• Theoretical and practical aspects of vertically-articulated standards. Discussant for this
session at the meeting of the CCSSO, San Francisco, June, 2006.
• Exploration of personality across 19 countries. Discussant for this session at the 5th International Test Commission Conference on Test Adaptation, Brussels, July, 2006.
• Psychometric lessons learned in a large-scale medical licensure performance assessment.
Discussant for this invited session at the meeting of NCME, Chicago, 2007.
• Standard-setters: Stand up and take a stand. Discussant for this invited session at the meeting of NCME, Chicago, 2007.
• Comparability of adapted versions of multilingual tests: Implications of incomparability
on score interpretations in international assessments. Discussant for this session at the meeting of NCME, Chicago, 2007.
• Innovations in standard setting. Discussant for this session at the meeting of NCME,
Chicago, 2007.
• Making NAEP scores more meaningful. Panel member for this session at the NSSC 2008 Winter Assessment Literacy Workshop, Washington.
• The role of user-centered design in building better assessments. Discussant for this
session at the meeting of AERA, New York, 2008.
• The big challenges and research opportunities in testing and measurement. Discussant and chairperson for this session at the meeting of AERA, New York, 2008.
• Dissecting the bookmark standard setting procedure. Discussant for this session at the
meeting of NCME, New York, 2008.
• Technical advances in international assessments such as TIMSS and PISA. Discussant for this session at the meeting of NCME, New York, 2008.
Recent Activities (Since September, 2007)
STUDIES IN PROGRESS/NEW COMPLETED STUDIES:
In Preparation
Hambleton, R. K. (in preparation). National Assessment of Educational Progress. In C. C. Clauss-Ehlers (Ed.), Encyclopedia. Heidelberg, Germany: Springer.
Hambleton, R. K. (in preparation). Five big challenges for educational and psychological assessment. Measurement: Interdisciplinary Research and Perspectives. (invited)
Hambleton, R. K., Plake, B. S., & Mills, C. N. (in preparation). Handbook on setting performance standards.
Hambleton, R. K., & Swaminathan, H. (in preparation). Item response theory: Principles and applications (2nd ed.). Boston, MA: Kluwer Academic Publishers.
Hambleton, R. K., & van der Linden, W. J. (in preparation). Polytomous response IRT models:
Brief history of model building advances. In M. Nering & R. Ostini (Eds.), Development and applications of polytomous item response theory models. Mahwah, NJ: Lawrence Erlbaum Associates, Inc., Publishers.
Hambleton, R. K., & Zenisky, A. (in preparation). Adapting tests for cross-cultural assessment.
In D. Matsumoto & F. van de Vijver (Eds.), Cross-cultural research methods. Oxford, England: Oxford University Press.
Hambleton, R. K., & Zenisky, A. (in preparation). Improving score reporting practices. CLEAR.
Hambleton, R. K., Zumbo, B., & Sireci, S. G. (in preparation). Psychometric methods and practices. Mahwah, NJ: Erlbaum Publishers.
Jette, A. M., McDonough, C. M., Haley, S. M., Ni, P., Olarsch, S., Latham, N., Hambleton, R. K., Felson, D., Kim, Y. J., & Hunter, D. (in press). A computer-adaptive disability instrument for lower extremity osteoarthritis research demonstrated promising breadth, precision, and reliability. Journal of Clinical Epidemiology.
Jette, A. M., McDonough, C. M., Ni, P., Haley, S. M., Hambleton, R. K., Olarsch, S., Hunter, D., Kim, Y., & Felson, D. (in review). A functional difficulty and functional pain instrument for lower extremity.
Lyren, P. E., & Hambleton, R. K. (in preparation). Systematic equating error with randomly-equivalent groups designs: An examination of the equal ability distribution assumption.
Ni, P., Haley, S. M., Hambleton, R. K., & Jette, A. M. (in preparation). IRT model selection using Markov Chain Monte Carlo estimation in a functional difficulty item bank for persons with osteoarthritis.
In Press
Byrne, B. M., Oakland, T., Leong, F. T. L., van de Vijver, F. J. R., Hambleton, R. K., Cheung, F. M., & Bartram, D. (in press). A critical analysis of cross-cultural research and testing practices: Implications for improved education and training in psychology. Training and Education in Professional Psychology.
Gregoire, J., & Hambleton, R. K. (Eds.). (in press). Advances in test adaptation research [Special Issue]. International Journal of Testing.
Haley, S. M., Fragala-Pinkham, M. A., Dumas, H. M., Ni, P., Gorton, G., Watson, K., Montpetit, K., Bilodeau, N., Hambleton, R. K., & Tucker, C. A. (in press). Evaluation of an item bank for a computerized adaptive test of activity in children with cerebral palsy. Physical Therapy.
Haley, S. M., Ni, P., Dumas, H. M., Fragala-Pinkham, M. A., Hambleton, R. K., Montpetit, K.,
Bilodeau, N., Gorton, G. E., Watson, K., & Tucker, C. A. (in press). Measuring global physical health in children with cerebral palsy: Illustration of a multidimensional bi-factor model and computerized adaptive testing. Quality of Life Research.
Hambleton, R. K. (in press). Criterion-referenced testing. In E. Anderman (Ed.), Psychology of classroom learning: An encyclopedia. Detroit: Macmillan Reference.
Hambleton, R. K., Sireci, S. G., & Smith, Z. R. (in press). How do other countries measure up to the mathematics achievement levels on the National Assessment of Educational Progress? Applied Measurement in Education.
Han, N., & Hambleton, R. K. (in press). Using moving averages to detect exposed test items in
computer-based testing. In S. Sawilowsky (Ed.), Real data analysis. Greenwich, CT: Information Age Publishers.
Tucker, C., Gorton, G., Watson, K., Fragala-Pinkham, M., Dumas, H., Montpetit, K., Bilodeau,
N., Ni, P., Hambleton, R., & Haley, S. (in press). Development of a parent-report computer adaptive test to assess physical functioning in children with cerebral palsy—lower extremity and mobility skills. Developmental Medicine & Child Neurology.
Tucker, C., Montpetit, K., Bilodeau, N., Dumas, H., Fragala-Pinkham, M., Watson, K., Gorton,
G., Ni, P., Hambleton, R., Mulcahey, M., & Haley, S. (in press). Development of a parent-report computer adaptive test to assess physical functioning in children with cerebral palsy II. Developmental Medicine & Child Neurology.
van de Vijver, F. J. R., & Hambleton, R. K. (in press). Adapting educational tests for multicultural assessment. Educational Measurement: Issues and Practice.
Wells, C. S., Baldwin, S., Hambleton, R. K., Sireci, S. G., Karatonis, A., & Jirka, S. (in press). Evaluating score equity assessment for state NAEP. Applied Measurement in Education.
Zenisky, A., Hambleton, R. K., & Luecht, R. (in press). Multi-stage testing. In W. J. van der Linden & C. Glas (Eds.), Computerized adaptive testing. New York: Springer.
Zenisky, A., Hambleton, R. K., & Sireci, S. G. (in press). Getting the message out: An evaluation of NAEP score reporting practices with implications for disseminating test results. Applied Measurement in Education.
Completed
Hambleton, R. K. (2008). Criterion-referenced tests—norm-referenced tests. In G. McCulloch & D. Crook (Eds.), International Encyclopedia of Education. London: Routledge.
Hambleton, R. K. (2008). Measurement specialists look to the future. NCME Newsletter, 16(2), 2-3.
Hambleton, R. K., & Sireci, S. (2008). Development and validation of enhanced SAT score scales using item mapping and performance category descriptions (Final Report). New York: College Board.
Han, N., & Hambleton, R. K. (2008). Detecting the unintended exposure of test items in
operational testing programs. In C. L. Wild & R. Ramaswamy (Eds.), Improving testing: Applying quality tools and techniques (pp. 323-348). Mahwah, NJ: Lawrence Erlbaum Associates, Inc., Publishers.
Keller, L. A., Hambleton, R. K., Parker, P., & Copella, J. (2008). MCAS equating research: An
investigation of FCIP-1, FCIP-2, and Stocking and Lord equating methods (Center for Educational Assessment Research Report No. 690). Amherst, MA: University of Massachusetts, Center for Educational Assessment.
Liang, T., Han, K., & Hambleton, R. K. (2008). User’s guide for ResidPlots-2: Computer software for IRT graphical residual analyses, Version 2.0 (Center for Educational Assessment Research Report No. 688). Amherst, MA: University of Massachusetts, Center for Educational Assessment.
Lyrén, P.-E., & Hambleton, R. K. (2008). Systematic equating error with the randomly-equivalent groups design: An examination of the equal ability distribution assumption (EM Report No. 61). Umeå, Sweden: Umeå University, Department of Educational Measurement.
Monahan, P. O., Stump, T. E., Finch, H., & Hambleton, R. K. (2007). Bias of exploratory and cross-validated DETECT index under null hypothesis of unidimensionality. Applied Psychological Measurement, 31(6), 483-503.
Reeve, B. B., Hays, R. D., Bjorner, J. B., Cook, K. F., Crane, P. K., Teresi, J., Thissen, D.,
Revicki, D. A., Weiss, D. J., Hambleton, R. K., & others. (2007). Psychometric evaluation and calibration of health-related quality of life item banks. Medical Care, 45(5), 22-31.
Sireci, S. G., & Hambleton, R. K. (2009). Mission--Protect the public: Licensure and
certification testing in the 21st century. In R. P. Phelps (Ed.), Correcting fallacies about educational and psychological testing (pp. 199-218). Washington, DC: American Psychological Association.
Swaminathan, H., Hambleton, R. K., & Rogers, H. J. (2007). Assessing the fit of item response
theory models. In C. R. Rao & S. Sinharay (Eds.), Handbooks of statistics: Psychometrics (Volume 27; pp. 683-718). Amsterdam: North Holland.
PAPERS PRESENTED/TO BE PRESENTED AT PROFESSIONAL MEETINGS:
Deng, N., & Hambleton, R. K. (2008, March). Assessment dimensionality of multi-stage tests. Paper presented at the meeting of NCME, New York.
Deng, N., Wells, C. S., & Hambleton, R. K. (2008, October). A confirmatory factor analytic study examining the dimensionality of an educational achievement test. A paper presented at the meeting of the NERA, Hartford. (Published in the NERA Proceedings, 2008.)
Elosua, P., & Hambleton, R. K. (2008, July). DIF detection methods and consequences.
Presentation at the 6th Conference of the International Test Commission, Liverpool, England.
Elosua, P., & Hambleton, R. K. (2008, July). Test score comparability across language and
cultural groups in the presence of item bias. An invited paper presented at the Third European Congress of Methodology, Oviedo, Spain.
Hambleton, R. K. (2007, February). Methods and guidelines for translating and adapting
educational and psychological tests into multiple languages and cultures. An invited presentation at the 2007 ATP Innovations in Testing Conference, Palm Springs, CA.
Hambleton, R. K. (2007, June). A new challenge: Making test scores more understandable and useful. A presentation at the annual CCSSO meeting, Nashville.
Hambleton, R. K. (2007, June). Making diagnostic score reports more clear and meaningful for users. A presentation at the annual CCSSO meeting, Nashville.
Hambleton, R. K. (2007, July). What are the psychometric skills needed in cross-cultural psychology today? Invited presentation at the meeting of the 10th European Congress of Psychology, Prague.
Hambleton, R. K. (2007, July). International Test Commission guidelines for adapting
educational and psychological tests. Invited presentation at the meeting of the 10th European Congress of Psychology, Prague.
Hambleton, R. K. (2007, August). Major challenges for educational and psychological testing
practices. Invited presentation at the National Authority for Measurement and Evaluation in Education Conference, Jerusalem, Israel.
Hambleton, R. K. (2007, October). Cross-cultural instrument translation and instrumentation.
An invited presentation at the Cooper Institute Diversity in Physical Activity and Health: Measurement and Research Issues and Challenges Conference, Dallas, TX.
Hambleton, R. K. (2008, January). On-going challenge for NAEP: Making score reports
understandable and useful. Keynote address at the NSSC 2008 Winter Assessment Literacy Workshop, Washington.
Hambleton, R. K. (2008, March). A non-technical introduction to item response theory for credentialing exams and achievement tests. An invited presentation at the ATP Innovations in Testing Conference, Dallas, Texas.
Hambleton, R. K. (2008, March). Reporting candidate scores in more understandable and
meaningful ways: A review of the recent literature and promising research. An invited presentation at the ATP Innovations in Testing Conference, Dallas, Texas.
Hambleton, R. K. (2008, March). Comparative perspectives on classical psychometrics and item response theory. Invited presentation at the meeting of AERA, New York.
Hambleton, R. K. (2008, March). Guidelines for translating and adapting educational and psychological tests. Paper presented at the meeting of AERA, New York.
Hambleton, R. K. (2008, June). CAT…from an educational testing perspective. A presentation at the Promis Psychometric Summit-2, Northwestern University, Evanston.
Hambleton, R. K. (2008, July). The next great challenges for psychological and educational measurement. Keynote address delivered at the Third European Congress of Methodology, Oviedo, Spain.
Hambleton, R. K. (2008, July). The International Test Commission Guidelines for Adapting
Tests, 2nd edition: A progress report. Invited presentation at the 29th International Congress of Psychology, Berlin.
Hambleton, R. K. (2008, September). A personal history of computer-adaptive testing. An
invited address at the International Conference on Outcomes Measurement, Bethesda, MD.
Hambleton, R. K. (2009, February). Problems to overcome in globalizing testing. A keynote address at the Association of Test Publishers Conference, Palm Springs, CA.
Hambleton, R. K. (2009, February). Predicting future directions for testing. Invited presentation at the Association of Test Publishers Conference, Palm Springs, CA.
Hambleton, R. K., Deng, N., & Lozano, L. (2009, February). Customized test score norms using item response theory: A new example. Paper presented at the Association of Test Publishers Conference, Palm Springs, CA.
Hambleton, R. K., & Han, N. (2008, July). Detecting exposed test items in a computerized
adaptive testing environment. Paper presented at the 6th Conference of the International Test Commission, Liverpool, England.
Hambleton, R. K., & Han, N. (2008, July). Catching exposed test items with IRT-based statistics
in computer-based testing. Paper presented at the 29th International Congress of Psychology, Berlin.
Hambleton, R. K., & Lozano, L. (2008, July). Customized test score norms with item response
theory. A presentation at the 6th Conference of the International Test Commission, Liverpool, England.
Hambleton, R. K., Sireci, S., & Smith, Z. (2008, March). Are the NAEP achievement levels in mathematics set too high? Paper presented at the meeting of NCME, New York.
Hambleton, R. K., & Wells, C. (2008, July). Using IRT models to construct tests and equate and
report scores. A workshop at the 6th Conference of the International Test Commission, Liverpool, England.
Hambleton, R. K., & Zenisky, A. (2008, July). A key for valid uses of tests: Making test score
reports more understandable and user-friendly. Keynote address presented at the 6th Conference of the International Test Commission, Liverpool, England.
Hambleton, R. K., & Zenisky, A. (2008, October). Reporting test scores in more meaningful ways: Some new findings, research methods, and guidelines for score report design. A presentation at the NERA meeting, Hartford.
Lozano, L., & Hambleton, R. K. (2008, July). Constructing and evaluating customized test score
norms. An invited paper presented at the Third European Congress of Methodology, Oviedo, Spain.
Lyrén, P.-E., & Hambleton, R. K. (2007, April). Systematic equating error with randomly-
equivalent groups designs: An examination of the equal ability distribution assumption. Paper presented at the meeting of NCME, Chicago.
Meng, Y., Wells, C. S., & Hambleton, R. K. (2008, October). A comparison of methods for
handling missing data when assessing dimensionality via linear factor analysis. Paper presented at the meeting of NERA, Hartford.
Ni, P., Jette, A. M., Haley, S. M., & Hambleton, R. K. (2008, March). IRT model selection
using Markov Chain Monte Carlo estimation in a physical functioning item bank. Paper presented at the Patient-Reported Outcomes Measurement Information System meeting, Washington.
Pitoniak, M., & Hambleton, R. K. (2007, April). Setting performance standards. Paper presented at the meeting of NCME, Chicago.
Sireci, S., & Hambleton, R. K. (2008, July). Communicating results of comparisons of international assessments to NAEP. A paper presented at the 6th International Test Commission Conference, Liverpool, England.
Sireci, S., Hambleton, R. K., & Huff, K. (2008, July). Enhancing the meaningfulness of score
scales using item response theory. An invited paper presented at the Third European Congress of Methodology, Oviedo, Spain.
Wells, C. S., Hambleton, R. K., & Liang, T. (2008, July). A nonparametric approach for
investigating model fit in item response theory. An invited paper presented at the Third European Congress of Methodology, Oviedo, Spain.
Yoo, H., & Hambleton, R. K. (2008, October). Item exposure control for computerized-adaptive testing: A review of methods. Paper presented at the meeting of NERA, Hartford.
Zenisky, A., Hambleton, R. K., & Sireci, S. (2008, July). Communicating the utility of NAEP score reports. A paper presented at the 6th International Test Commission Conference, Liverpool, England.
Zhao, Y., & Hambleton, R. K. (2008, October). Graphical approaches for assessing differential item functioning in polytomously-scored items. Paper presented at the meeting of the NERA, Hartford.