Top Banner
Risks and mitigations of releasing data Risk analysis and complexity in de-identifying and releasing data. Sara-Jayne Terp RDF Discussion
20

Sjt risks and mitigations of releasing data

Jan 22, 2018

Download

Data & Analytics

kjantin
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Sjt risks and mitigations of releasing data

Risks and mitigations of releasing data

Risk analysis and

complexity in de-identifying

and releasing data.

Sara-Jayne Terp

RDF Discussion

Page 2: Sjt risks and mitigations of releasing data

First, Do No Harm

“If you make a dataset public, you

have a responsibility, to the best of your knowledge, skills, and advice, to

do no harm to the people connected to that dataset. You balance making data

available to people who can do

good with it and protecting the

data subjects, sources, and

managers.”

2

Page 3: Sjt risks and mitigations of releasing data

What is risk?What is the risk here?

3

Page 4: Sjt risks and mitigations of releasing data

RISK

“The probability of something happening multiplied by the resulting cost or benefit if it does” (Oxford English Dictionary)

Three parts:

•Cost/benefit

•Probability

•Subject (to what/whom)4

Page 5: Sjt risks and mitigations of releasing data

Subjects: Physical

5

“Witnesses told us that

a helicopter had been

circling around the

area for hours by the

time the bakery opened

in the afternoon. It

had, perhaps, 200

people lined up to get

bread. Suddenly, the

helicopter dropped a

bomb that hit a building

on the opposite side [of

the street] from the

bakery, spraying

shrapnel and debris

over the breadline”

- FirstMileGeo report on Aleppo

Page 6: Sjt risks and mitigations of releasing data

Subjects: Reputational

6

Page 7: Sjt risks and mitigations of releasing data

Subjects: Physical

7

Page 8: Sjt risks and mitigations of releasing data

Collectors: Physical

8

Page 9: Sjt risks and mitigations of releasing data

Processors: Legal

9

Page 10: Sjt risks and mitigations of releasing data

Risk OF What?

• Physical harm

• Legal harm (e.g. jail, IP disputes)

• Reputational harm

• Privacy breach

10

Page 11: Sjt risks and mitigations of releasing data

Risk to Whom?

• Data subjects (elections example)

• Data collectors (conflict example)

• Data processing team (military equipment example)

• Person releasing the data (corruption example)

• Person using the data

11

Page 12: Sjt risks and mitigations of releasing data

Likelihood of Risk

Low

Medium

High

12

Page 13: Sjt risks and mitigations of releasing data

piIHow I handle it

13

Page 14: Sjt risks and mitigations of releasing data

PII

“Personally identifiable information (PII) is any data that could potentially identify a specific individual. Any information that can be used to distinguish one person from another and can be used for de-anonymizing anonymous data can be considered PII.”

14

Page 15: Sjt risks and mitigations of releasing data

Learn to spot Red Flags

• Names, addresses, phone numbers

• Locations: lat/long, GIS traces, locality (e.g. home + work as an identifier)

• Members of small populations

• Untranslated text

• Codes (e.g. “41”)

• Slang terms

• Can be combined with other datasets to produce PII

15

Page 16: Sjt risks and mitigations of releasing data

Consider Partial Release

Release to only some groups

• Academics

• People in your organisation

• Data subjects

Release at lower granularity

• Town/district level, not street

• Subset or sample of data ‘rows’

• Subset of data ‘columns’

16

Page 17: Sjt risks and mitigations of releasing data

Include locals

Locals can spot:

•Local languages

•Local slang

•Innocent-looking phrases

Locals might also choose the risk

17

Page 18: Sjt risks and mitigations of releasing data

Consider Interactions Between Datasets

18

Page 19: Sjt risks and mitigations of releasing data

Learn From Experts

Over to you…

19

Page 20: Sjt risks and mitigations of releasing data

THANK YOU

For questions or

suggestions:

Responsible Data Forum

For questions or

suggestions:

Responsible Data Forum