Integrated Public Use Microdata Series Integrated Public Use Microdata Series International: census microdata for International: census microdata for research and policy research and policy * * * * * * Robert McCaa Robert McCaa Albert Albert Esteve Palós Esteve Palós Minnesota Population Center Minnesota Population Center Centre Centre d’Estudis Demogràfics d’Estudis Demogràfics “Only used statistics are useful statistics.”
22
Embed
Integrated Public Use Microdata Series International: census microdata for research and policy * * * Robert McCaa Albert Esteve Palós Minnesota Population.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Integrated Public Use Microdata SeriesIntegrated Public Use Microdata SeriesInternational: census microdata for research and policyInternational: census microdata for research and policy
* * ** * *Robert McCaa Robert McCaa Albert Esteve PalósAlbert Esteve Palós
Minnesota Population Center Minnesota Population Center Centre d’Estudis DemogràficsCentre d’Estudis Demogràfics
“Only used statistics are useful statistics.”
1. IPUMS international: goals and benefits1. IPUMS international: goals and benefits
“…“…best practice for a data repository of international statistical data”best practice for a data repository of international statistical data”--Dennis Trewin --Dennis Trewin
chair UNECE task force on Statistical Confidentiality & Microdata Accesschair UNECE task force on Statistical Confidentiality & Microdata Access
1.1. Preserve census microdata and documentation for all the Preserve census microdata and documentation for all the countries in the worldcountries in the world
2.2. Integrate microdata and metadataIntegrate microdata and metadata--a CD with source data and codebook is not sufficient--a CD with source data and codebook is not sufficient
3.3. Disseminate--without cost--extracts of samples to bona-fide Disseminate--without cost--extracts of samples to bona-fide researchers worldwide, regardless of country of birth, researchers worldwide, regardless of country of birth, citizenship or residence.citizenship or residence.
» Sustained, major funding since 1999 through 2014 by:Sustained, major funding since 1999 through 2014 by:» National Science Foundation (USA)National Science Foundation (USA)
» National Institutes of Health (USA)National Institutes of Health (USA)
» University of MinnesotaUniversity of Minnesota
3
Preservation: 1973 census tapes of Sudan at risk!
Benefits of IPUMS-InternationalBenefits of IPUMS-International» Preservation – IPUMS provides material and technical resourcesPreservation – IPUMS provides material and technical resources
» Recover Recover historical census data and documentationhistorical census data and documentation» ArchiveArchive data and documentation to the highest international standards data and documentation to the highest international standards
» Integration – IPUMS does the workIntegration – IPUMS does the work» DrawDraw high-precision samples to uniform specifications high-precision samples to uniform specifications» AnonymizeAnonymize microdata to highest international standards microdata to highest international standards» IntegrateIntegrate samples according to national practices samples according to national practices and and international international
principlesprinciples» Dissemination – IPUMS manages the riskDissemination – IPUMS manages the risk
» License License samples and documentation in a global initiative (US$5,000 per samples and documentation in a global initiative (US$5,000 per census of 1 million or more person records)census of 1 million or more person records)
» Disseminate Disseminate microdata with minimal risk and maximum benefit, at no microdata with minimal risk and maximum benefit, at no costcost
5
Microdata
Integrated into IPUMS
Entrusted to IPUMS None entrusted
None inventoried
IPUMS-International IPUMS-International dark greendark green = integrated and disseminating = integrated and disseminating
(55 countries, 159 censuses, 325 millon person records)(55 countries, 159 censuses, 325 millon person records)green = to be integrated (35 countries, 90 censuses, 150 mill.)green = to be integrated (35 countries, 90 censuses, 150 mill.)
Mollweide projection
IPUMS-InternationalIPUMS-International
2011:2011:Cambodia 2008Cambodia 2008Egypt 2006 Egypt 2006 France 2006France 2006GermanyGermanyIrelandIrelandNicaraguaNicaraguaSierra Leone Sierra Leone etc.etc.
2. Integrating Census Microdata and 2. Integrating Census Microdata and MetadataMetadata
See also: See also: 2009: “Timely dissemination of integrated census microdata and metadata: The IPUMS-
International approach.” ASSD V: “Information and communication technology in data dissemination: bridging closer producers and users during the 2010 round of Population and Housing Censuses” (19-21 November 2009, Dakar, Senegal)
Constructing the IPUMS-International integrated Constructing the IPUMS-International integrated metadata and microdata systemmetadata and microdata system
» IPUMS-International NEVER IPUMS-International NEVER disseminates source microdata!disseminates source microdata!
» 5 step process of integration—5 step process of integration—2+ years to integrate2+ years to integrate metadata and microdata: metadata and microdata:
1.1. Confirm the integrity and validity of source Confirm the integrity and validity of source microdata and metadatamicrodata and metadata
2.2. Draw and anonymize high precision samples Draw and anonymize high precision samples 3.3. Integrate microdata sample (next slide)Integrate microdata sample (next slide)4.4. Integrate metadata (following slide)Integrate metadata (following slide)5.5. Confirm the integrity and validity of the Confirm the integrity and validity of the
integrated microdata sample and metadata integrated microdata sample and metadata 11
Step 3 of integration in the IPUMS systemStep 3 of integration in the IPUMS system• Composite coding scheme:Composite coding scheme:
1)1) preserve every significant detail and preserve every significant detail and 2)2) harmonize every code harmonize every code
4.4. Integrate metadata (XML): Document Integrate metadata (XML): Document every census, sample, variable and code:every census, sample, variable and code:
• Source documents (pdf) in official language Source documents (pdf) in official language and English and English
• Dynamic metadata system—compare any Dynamic metadata system—compare any combination of countries and samples:combination of countries and samples:
• wording of any census question and instructions wording of any census question and instructions to field workers to field workers
• Characteristics of each census and sampleCharacteristics of each census and sample• Describe each variable: “universe”, Describe each variable: “universe”,
Research Topics—extraordinarily diverseResearch Topics—extraordinarily diverse
» Economists:Economists:» Comparative study of labor force participationComparative study of labor force participation» Demand and supply of public services (water, electricity, sewage, etc.)Demand and supply of public services (water, electricity, sewage, etc.)» Economic impact of family planning and fertility declineEconomic impact of family planning and fertility decline» Discrimination in credit marketsDiscrimination in credit markets» Econometric analysis of labor force and incomeEconometric analysis of labor force and income» Effect of long-term youth unemploymentEffect of long-term youth unemployment» Effects of volume of human capital on returns to educationEffects of volume of human capital on returns to education» Human capital and agingHuman capital and aging» Impact of trade policies on growth, development, immigration, labor Impact of trade policies on growth, development, immigration, labor
markets, and inequalitymarkets, and inequality» Etc.Etc.
For uses, see http://bibliography.ipums.orgFor uses, see http://bibliography.ipums.org
Better: scholar.google.com Better: scholar.google.com IPUMS & key-word: subject, name of country, etc.IPUMS & key-word: subject, name of country, etc.
Conclusion: Invitation to continued cooperationConclusion: Invitation to continued cooperation
» In 1999, our dream: integrate samples of 21 countries in 10 In 1999, our dream: integrate samples of 21 countries in 10 yearsyears
» Thanks to generous cooperation of 55 National Statistical OfficesThanks to generous cooperation of 55 National Statistical Offices» Undreamed technological innovationsUndreamed technological innovations
» By 2009, integrated samples for 44 countriesBy 2009, integrated samples for 44 countries» Number of users and usage far exceeded expectationsNumber of users and usage far exceeded expectations
» For the 2010 decade, our dream: For the 2010 decade, our dream: » Double (2x) the number of integrated samplesDouble (2x) the number of integrated samples» Triple (3x) the number of usersTriple (3x) the number of users» Quadruple (4x) research output from census microdataQuadruple (4x) research output from census microdata