Top Banner
Big but personal (meta)data How Human Behavior Bounds Privacy and What We Can We Do About It Yves-Alexandre de Montjoye @yvesalexandre MIT Media Lab
47

Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Aug 05, 2015

Download

Data & Analytics

freshdatabos
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Big but personal (meta)data

How Human Behavior Bounds Privacy and What We Can We Do About It

Yves-Alexandre de Montjoye @yvesalexandre MIT Media Lab

Page 2: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 3: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 4: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 5: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 6: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 7: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

12 points

Page 8: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 9: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 10: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 11: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 12: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 13: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Is the way you move around

as unique as your fingerprint

Page 14: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

We can use points to identify a fingerprint

Page 15: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Scott

Page 16: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

From 10 to 11am

1 km²

1 point for mobility data

~

Page 17: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

2 points

Around 11:30am

Page 18: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

3 points

For lunch

Page 20: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

How many points do I need to uniquely identify a

mobility traces?

Page 22: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 23: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Entire country of 1.5 millions people

Our behavior is unique enough

4 points

Identify 95% of people

de Montjoye, Y. A., Hidalgo, C. A., Verleysen, M., & Blondel, V. D. (2013). Unique in the Crowd: The privacy bounds of human mobility. Nature SRep, 3.

Page 24: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

What it means

1. It is possible to re-identify mobile phone metadata (even if there is no name or phone number)

Page 25: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Resolution: 800 pixels

Page 26: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Resolution: 300 pixels

Page 27: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Resolution: 150 pixels

Page 28: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Resolution: 75 pixels

Page 29: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Resolution: 30 pixels

Page 30: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Where’s Thierry ?

Page 31: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

?

Page 32: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

4pm – 10pm 7pm-8pm

Page 33: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Estimating Privacy

Spatial resolution Temporal resolution

Number of points

de Montjoye, Y. A., Hidalgo, C. A., Verleysen, M., & Blondel, V. D. (2013). Unique in the Crowd: The privacy bounds of human mobility. Nature SRep, 3.

Page 34: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Harder to find people

Much easier to find people

Harder to find people

Page 35: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It
Page 36: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

What it means

1. It is possible to re-identify mobile phone metadata (even if there is no name or phone number)

2. It is not simply a question of coarsening the data (we’d just need a few more points)

Page 37: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

BFI: Personality test

Page 38: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

BFI: Personality test

Page 39: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Behavioral indicators derived from metadata using the Bandicoot toolbox

Page 40: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Predicting personality using metadata

de Montjoye, Y. A., Quoidbach, J., Robic, F., & Pentland, A. S. (2013). Predicting personality using novel mobile phone-based metrics. In Social Computing, Behavioral-Cultural Modeling and Prediction (pp. 48-55). Springer Berlin Heidelberg.

Page 41: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

What it means

1. It is possible to re-identify mobile phone metadata (even if there is no name or phone number)

2. It is not simply a question of coarsening the data (we’d just need a few more points)

3. It is not “just” metadata or what is directly visible in the data (e.g. one might use it to predict your personality)

Page 42: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Eagle, N., de Montjoye, Y-A.., & Bettencourt, L. M. (2009). Community computing: Comparisons between rural and urban societies using mobile phone data. IEEE Computational Science and Engineering

We should use this data

Deville, P. et al. (2014). Dynamic population mapping using mobile phone data. Proceedings of the National Academy of Sciences, 201408439.

Wesolowski, A., Eagle, N., Tatem, A. J., Smith, D. L., Noor, A. M., Snow, R. W., & Buckee, C. O. (2012). Quantifying the impact of human mobility on malaria. Science, 338(6104), 267-270.

Page 43: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

(but in a privacy-conscientious way)

We should use this data

by: understanding what the real risks are

and designing solutions

Page 44: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Privacy-conscientious anonymization

de Montjoye, Y. A., Smoreda, Z., Trinquart, R., Ziemlicki, C., & Blondel, V. D. (2014). D4D-Senegal: The Second Mobile Phone Data for Development Challenge. arXiv preprint arXiv:1407.4885.

e.g. 2-week mobility traces of 27 x 300.000 individuals + Bandicoot’s behavioral indicators

Page 45: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Online systems: from privacy to security

openPDS/SafeAnswers: - Only shares answers, not raw data - Security mechanisms

Page 46: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

openPDS/SafeAnswers

de Montjoye Y.-A., Wang S., Pentland A., On the Trusted Use of Large-Scale Personal Data. IEEE Data Engineering Bulletin, 35-4 (2012). de Montjoye, Y. A., Shmueli, E., Wang, S. S., & Pentland, A. S. (2014). openPDS: Protecting the Privacy of Metadata through SafeAnswers. PLoS ONE, 9(7), e98790.

Page 47: Big But Personal Data: How Human Behavior Bounds Privacy and What We Can We Do About It

Yves-Alexandre de Montjoye MIT Media Lab

@yvesalexandre http://deMontjoye.com

In collaboration with Alex “Sandy” Pentland, César Hidalgo, Vincent Blondel, Cameron Kerry, Jake Kendall, Michel Verleysen, Erez Shmueli, Arek Stopczynski, Sune Lehmann, Eaman Jahani