Top Banner
IPUMS workshop IPUMS workshop https://international.ipums.org * * * * * * Robert McCaa, Professor of Population Robert McCaa, Professor of Population History History University of Minnesota University of Minnesota [email protected] additional information at: additional information at: www.hist.umn.edu/~rmccaa/ipums-europe www.hist.umn.edu/~rmccaa/ipums-europe
15

IPUMS workshop * * * Robert McCaa, Professor of Population History University of Minnesota [email protected] additional information.

Dec 19, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

IPUMS workshopIPUMS workshophttps://international.ipums.org

* * ** * *Robert McCaa, Professor of Population HistoryRobert McCaa, Professor of Population History

University of MinnesotaUniversity of [email protected]

additional information at:additional information at:www.hist.umn.edu/~rmccaa/ipums-europewww.hist.umn.edu/~rmccaa/ipums-europe

Page 2: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

The IPUMS team The IPUMS team (lack computer gurus, some researchers, & 3 PIs were away or too busy to pose!)(lack computer gurus, some researchers, & 3 PIs were away or too busy to pose!)

Steven Ruggles, Inventor of IPUMS, Professor of History and Director, Minnesota Population Center

Page 3: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

IPUMS-Greece TimelineIPUMS-Greece Timeline

• May, 2003: Memorandum of Understanding signedMay, 2003: Memorandum of Understanding signed• July, 2005: Microdata samples entrusted for censuses July, 2005: Microdata samples entrusted for censuses

of 1971, 1981, 1991 and 2001of 1971, 1981, 1991 and 2001• April, 2006: Translations of documentation completedApril, 2006: Translations of documentation completed• Dec., 2006: Integration completed; dissemination Dec., 2006: Integration completed; dissemination

beginsbegins

Page 4: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

Outline (65 slides, yikes!!)Outline (65 slides, yikes!!)

» 1. IPUMS goals1. IPUMS goals and milestones and milestones 11 slides 11 slides

» 2. Applying for Access2. Applying for Access 8 slides8 slides

» 3. Studying documentation3. Studying documentation 15 slides 15 slides

» 4. Creating an extract4. Creating an extract 9 slides9 slides» a. Selecting samplesa. Selecting samples

» b. Selecting variablesb. Selecting variables

» c. Selecting sub-populations c. Selecting sub-populations

» 5. Integrating microdata5. Integrating microdata 9 slides9 slides

» 6. Managing access: Users and Uses6. Managing access: Users and Uses 13 slides 13 slides

Page 5: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

Project goalsProject goals

• IPUMS is a global partnership to IPUMS is a global partnership to (1) preserve census microdata and documentation,(1) preserve census microdata and documentation,(2) integrate census microdata samples, and(2) integrate census microdata samples, and(3) manage access to anonymized sample extracts for (3) manage access to anonymized sample extracts for researchers and policy makers, at no cost—regardless researchers and policy makers, at no cost—regardless of country of birth, residence or citizenshipof country of birth, residence or citizenship

Page 6: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

Preservation: 1973 census tapes of Sudan at risk!

Page 7: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

A Solution: Data recovery

Page 8: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

Data recovery. Example: Bangladesh Bureau of Data recovery. Example: Bangladesh Bureau of Statistics (1981 census, 276 tapes to recover)Statistics (1981 census, 276 tapes to recover)

>3,000 tapes >3,000 tapes recovered: 1971 Germanyrecovered: 1971 Germany

1980 Mexico, 1980 Mexico, Mali 76, Sudan 73Mali 76, Sudan 73

and many moreand many more

MicrodataMicrodataon this tape on this tape

were recovered!!were recovered!!

Page 9: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

Integration, Dissemination: March 2008Integration, Dissemination: March 2008

dark greendark green = = disseminating (26 countries, 80 disseminating (26 countries, 80 censuses, 200mpr)censuses, 200mpr)

green = harmonizing (34 countries, 95 censuses, green = harmonizing (34 countries, 95 censuses, 190mpr)190mpr)

lightest greenlightest green = negotiating = negotiating (see handout)(see handout)

Mollweide projection

Page 10: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

MilestonesMilestones

» 1999: Founded by Steven Ruggles and Bob McCaa,1999: Founded by Steven Ruggles and Bob McCaa,––restrict access to trusted users, and apply corresponding restrict access to trusted users, and apply corresponding confidentiality techniquesconfidentiality techniques» 2002: 12002: 1stst release of integrated samples for 7 countries; >200 release of integrated samples for 7 countries; >200

users in first yearusers in first year

» 2008: Big hit! 79 countries signed; 70 entrusted data to 2008: Big hit! 79 countries signed; 70 entrusted data to IPUMS, datasets for more than 230 censuses, >150 entire IPUMS, datasets for more than 230 censuses, >150 entire datasetsdatasets

Page 11: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

MilestonesMilestones

» 2007, 42007, 4thth release: release: » data for 26 countries, samples for 80 censuses, data for 26 countries, samples for 80 censuses,

» 202 million person records, 202 million person records,

» ~2,000 users~2,000 users

» 2009, 62009, 6thth release: release: » data for 40 countries, samples for ~130 censusesdata for 40 countries, samples for ~130 censuses

» >300 million person records>300 million person records

» thousands of usersthousands of users

» Note: data extracts are provided only to licensed users.Note: data extracts are provided only to licensed users.

Page 12: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

2. 2. UsingUsing https://www.ipums.org/international:https://www.ipums.org/international:

1. Logon 1. Logon w/ passwordw/ password

2a. Study documentation2a. Study documentation2b. Design extract2b. Design extract

3. Receive email; 3. Receive email; logon with p/wordlogon with p/word

4. Download 4. Download extract (SSL extract (SSL encrypted)encrypted)

5. UnZip data5. UnZip data

(also SAS, (also SAS, STATA) STATA)

6. Analyze6. Analyze

Page 13: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

- Warning -- Warning -

• IPUMS microdata are anonymized samples.IPUMS microdata are anonymized samples.– They are for advanced analysis and research. They are for advanced analysis and research. – Use of a statistical software is required.Use of a statistical software is required.– Statistical software provides great power.Statistical software provides great power.– “ “With great power, comes great responsibility.”With great power, comes great responsibility.”

• IPUMS samples are for analysis.IPUMS samples are for analysis.• IPUMS samples are IPUMS samples are not not official statistics.official statistics.

Page 14: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

http://international.ipums.orghttp://international.ipums.org

Apply for access (see form and conditions of use)Apply for access (see form and conditions of use)

Study integrated documentation (variables)Study integrated documentation (variables)

Link to Official Statistical Agency home pagesLink to Official Statistical Agency home pages

Examine integrated metadata (samples)Examine integrated metadata (samples)

Construct a custom-tailored request: select Construct a custom-tailored request: select countries, years, sub-populations, & variablescountries, years, sub-populations, & variables

Page 15: IPUMS workshop  * * * Robert McCaa, Professor of Population History University of Minnesota rmccaa@umn.edu additional information.

1.1. Respect “restricted-access” conditions of use: protect Respect “restricted-access” conditions of use: protect confidentiality; “share” data with only with registered usersconfidentiality; “share” data with only with registered users

2.2. Study both source documentation and metadata: Study both source documentation and metadata:

» source: census forms and instructions to enumerators source: census forms and instructions to enumerators

» metadata: samples, variables, comparability discussionsmetadata: samples, variables, comparability discussions

3.3. Construct extracts judiciously:Construct extracts judiciously:use “subsamp” (1% sample for testing) use “subsamp” (1% sample for testing) extract only needed countries, censuses, variables, sub-pops extract only needed countries, censuses, variables, sub-pops

4.4. Use weights:Use weights:either households or individuals (geographical strata = power)either households or individuals (geographical strata = power)

5.5. Analyze carefully:Analyze carefully:proper statistical techniques, keeping in mind data qualityproper statistical techniques, keeping in mind data quality

6.6. Cite properly: IPUMSCite properly: IPUMSii and National Statistical Agencies and National Statistical Agencies

7.7. Share publications: IPUMSShare publications: IPUMSii and National Statistical Agenciesand National Statistical Agencies

Dr. Bob’s 7 rules for using IPUMSDr. Bob’s 7 rules for using IPUMSii microdata microdata