SURVEY OF COMMONALITY WITH OTHER DISCIPLINES WORKSHOP 2 – JULY 25, 2013 INDIANAPOLIS, INDIANA MICAH ALTMAN DIRECTOR OF RESEARCH, MIT LIBRARIES MASSACHUSETTS INSTITUTE OF TECHNOLOGY [email protected]PRIMARY RESEARCH OR PRACTICE AREA(S) • INFORMATION SCIENCE • SOCIAL SCIENCE PREVIOUS EXPERIENCE • DIGITAL LIBRARIES • DIGITAL PRESERVATION • STATISTICAL COMPUTING RELATED WORK • PUBLICMAPPING.ORG • INFORMATICS.MIT.EDU CONTACT INFORMATION E25-131, 77 MASSACHUSETTS AVE, MIT, CAMBRIDGE, MA, 02139
18
Embed
Characterizing Data and Software for Social Science Research
This presentation describes the landscape of data and software use across the social sciences in terms of the abstract dimensions of data and data use. It then examines three use cases.
Presentation for DASPOS < https://daspos.crc.nd.edu/index.php/workshops/workshop-2 > Workshop at JCDL.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
SURVEY OF COMMONALITY WITH OTHER DISCIPLINESWORKSHOP 2 – JULY 25, 2013
INDIANAPOLIS, INDIANA
MICAH ALTMANDIRECTOR OF RESEARCH, MIT LIBRARIESMASSACHUSETTS INSTITUTE OF [email protected]
PRIMARY RESEARCH OR PRACTICE AREA(S)• INFORMATION SCIENCE• SOCIAL SCIENCE
PREVIOUS EXPERIENCE• DIGITAL LIBRARIES• DIGITAL PRESERVATION• STATISTICAL COMPUTING
RELATED WORK• PUBLICMAPPING.ORG • INFORMATICS.MIT.EDU
Director of Research, MIT LibrariesNon-Resident Senior Fellow, Brookings Institution
Data and Software in Social Science Research
DISCLAIMERThese opinions are my own, they are not the opinions of MIT, Brookings, any of the project funders, nor (with the exception of co-authored previously published work) my collaborators
Secondary disclaimer:
“It’s tough to make predictions, especially about the future!”
-- Attributed to Woody Allen, Yogi Berra, Niels Bohr, Vint Cerf, Winston Churchill, Confucius, Disreali [sic], Freeman Dyson, Cecil B. Demille, Albert Einstein, Enrico Fermi, Edgar R.
Fiedler, Bob Fourer, Sam Goldwyn, Allan Lamport, Groucho Marx, Dan Quayle, George Bernard Shaw, Casey Stengel, Will Rogers, M. Taub, Mark Twain, Kerr L. White, etc.
Data and Software in Social Science Research
Collaborators & Co-Conspirators
• Jonathan Crabtree, Nancy McGovern• National Digital Stewardship Coordination
Committee & Working Group Chairs• Privacy Tools for Sharing Research Data
Team (Salil Vadhan, P.I.)http://privacytools.seas.harvard.edu/people
• Research Support– Supported in part by NSF grant CNS-1237235– Thanks to the Library of Congress, & the
Related Work• CoData Task Group on Data Citations, 2013 (Forthcoming) Out of Cite, Out of Mind:
The Current State of Practice, Policy, and Technology for the Citation of Data, Co-Data Journal (Special Volume).
• Altman & Jackman, 2012, 19 Ways of Looking at Statistical Software, Journal of Statistical Software• National Digital Stewardship Alliance, 2013, 2014 National Agenda for Digital
Stewardship.• Novak, K., Altman, M., Broch, E., Carroll, J. M., Clemins, P. J., Fournier, D.,
Laevart, C., et al. 201.. Communicating Science and Engineering Data in the Information Age. Computer Science and Telecommunications. National Academies Press
• Altman, M., Rogerson, K., & U, D. (2008). Open Research Questions on Information and Technology in Global and Domestic Politics – Beyond “E-.i, 41(4), 1-8. Retrieved from http://www.journals.cambridge.org/abstract_S104909650824093X
• Altman, Gill & McDonald. 2003. Numerical Issues in Statistical Computing for the Social Scientist
Some Characteristics of Research Data UseAttribute Type Examples
Analysis methods - Counting- GLM model family- MLE model family- (Constrained) continuous nonlinear
optimization - Blind global optimization- Discrete optimization - Bayesian Methods (MCMC)- Heuristically/algorithmically defined - Text mining- Clustering- Coding and qualitative analysis- Exploratory Data Analysis
Desired Outputs - Summary scalars- Summary table- Data subset - Static data publication- Static visualization- Dynamic Visualization
Data and Software in Social Science Research
Some Characteristics of Use ConstraintsContract Intellectual Property
More Information• Grimmer, Justin, and Gary King. "General purpose computer-
assisted clustering and conceptualization." Proceedings of the National Academy of Sciences 108.7 (2011): 2643-2650.
• King, Gary, Jennifer Pan, and Molly Roberts. "How censorship in China allows government criticism but silences collective expression." APSA 2012 Annual Meeting Paper. 2012.
• Lazer, David, et al. "Life in the network: the coming age of computational social science." Science (New York, NY) 323.5915 (2009): 721.
Data and Software in Social Science Research
Trends: MoreMore Types of Evidence More CollaborationMore Data
More Publications, More Filters
More Learners
More Open
More Replication
Data and Software in Social Science Research
Some Challenges for Long-Term Replication/Access
• “messy” human sensors• Mix of data types, structures, sparsity• Complex constraints: confidentiality, licensing,