Top Banner
Connecting castles securely with safe share Simon Thompson Director of IT for Informatics @ Swansea University Medical School
46

Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Oct 17, 2018

Download

Documents

vunguyet
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Connecting castles securely with safe share

Simon Thompson

Director of IT for Informatics

@ Swansea University Medical School

Page 2: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

How are we ? Swansea University medical School – 3rd Best in UKhttp://www.walesonline.co.uk/news/health/swansea-universitys-medical-school-uks-12986635

Wales is the best part of the united kingdom sitting proudly on the west side of the country. Devolved government in which Health is devolvedPopulation 3.5m humans and 10m sheep (with excellent healthcare)

Somewhere called Oxford / Cambridge are better apparently

Page 3: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

SAIL Databank (https://saildatabank.com/)

• Over 9 billion records for >5 million people• Much data goes back 10-20 years• All pre-linked data• 300+ feeder systems across Wales• >£5 million investment in high performance IT• Industrial strength, reusable infrastructure. • >300 users, • >£90m projects from UKRCs• 140+ approved SAIL projects, with 79 active today• 100 staff in Swansea working on Health Informatics-

related projects

Background

Page 4: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

SAIL Databank Reach

Page 5: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

• MRC-funded institute of health informatics

• £9.3 million investment at Swansea

• Four Centres across the UK

• Centre for Improvement in Population Health through E-records Research (CIPHER)

• (Swansea, Cardiff, Bristol, Uni of W Australia, Curtin, Ottawa)

• Focus on large scale studies

• Aim: provide the physical and electronic infrastructure to facilitate collaboration across the four nodes

• UK Secure e-Research Platform (UKSeRP)

• National Research Data Appliances (NRDA)

• New methods, public engagement, innovative governance, capacity building

Page 6: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

• ADRN is a UK-wide partnership:

• Universities• Government • National statistics authorities• Third sector• Funders • Researchers

• ADRC-W one of four centres

• £8m investment from ESRC

• Part of the focus on governmental data sharing

• Information assurance and privacy protection

• Secure environment for research

• Using SAIL infrastructure in Wales, with UK SeRP

• Aimed at UK social researchers

• Help accredited researchers carry out social and economic research

• Help to using linked, de-identified administrative data – information which is routinely collected by government organisations.

Page 7: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

• Capacity >1000 virtual

research servers

• Compute cloud for

academics

• 2,880 CPU cores

• 4PB storage (2.8PB

usable)

• £4m investment from

MRC

• Swansea, Cardiff,

Birmingham, Warwick

Universities

• Large in CPU or Memory

servers as host servers

• OpenStack VM Stack

Page 8: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

UK Secure e-Research Platform (UKSeRP)

Funded by: MRC & ESRC

AIM: to provide best in class informatics research platform for national research programmes

Just gone live: Medical Imaging

Next to go live: Include Biomedical / Genomics

Page 9: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Research Access Portal

• End User Experience – Remote desktop to Windows 10 / Ubuntu shared/dedicated system

Loaded with scientific software, access DB, Filestore, all network services

Vmware ViewSecurity Server

(VPN)(x3)

Vmware view Connection Broker

Pooled standard config

Dedicated configurable

Specialist / Custom configTwo Factor

Authentication Server

Data Warehouse

Active Directory

c

New Super-Size Desktops

64 Cores, 4TB memory6TB local SSD raid array

Window (UKSeRP) / Linux (CLIMB)

Page 10: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

UKSeRP / National Research Data Appliance (NRDA)

DATASET Access Control Data storage Documentation Schema Editor ER Diagram Metrics and

Validation Artefacts / Files

Web Front End

FTP / ETL

DATASET Access Control Data storage Documentation Schema Editor ER Diagram Metrics and

Validation Artefacts / Files

DATASET Access Control Data storage Documentation Schema Editor ER Diagram Metrics and

Validation Artefacts / Files

Security, Configuration & Capability Model

Pu

blis

hin

g

Local Data Catalogue

Linkage & Matching

Database Loader

(File

Sp

litte

r)

Sharing & IG

Data Quality and Metrics

MS SQL

PostgreSQL

External

MS SQL

IBM DB2

HADOOP

PostgreSQL

Trusted Third PartyLinkage & Matching

Other Appliance

Regional / Global Data Catalogue

User interface for dataset managementMatching and LinkageData LoaderData QualityData CataloguePluggable architecture

Simplistic ViewpointNRDA1

NRDA2

NRDA3

Page 11: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

A Dataset

Specific version & Date

All section attach files

Contact

Request

VIMO

Theme / Type / Level

Tags

Page 12: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

A Dataset (cont.)

DDI, SPSS, SAS, STATA

Page 13: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

•D

ata

“sch

ema”

au

tom

atic

ally

co

mp

ute

d

bas

ed o

n d

ata

con

tain

ed in

up

load

ed f

ile

Page 14: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Data Catalogue – a specific table

Page 15: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

New ALF Process

Page 16: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

RALF – Residential ALF. Relevant as now we can pipeline address cleaning to improve matching

Page 17: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

• Assessment of all pairs to decide if they belong to the same person

• Identify all pairs of records for each individual

• Combine ‘true positive’ pairs together into Groups

• Group output provides the linkage map

NRDA brings world leading linkage

Page 18: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

• Hashed and Bloom Filtered Linkage Capability

• Demographics Hashed/Encrypted at source

• Deterministic and Probabilistic matching strategies

• Very small % drop in matching viability

NRDA brings world leading linkage

Technology productised, integrated and available now

Wales – Australia –Germany – Canada collaboration.

Page 19: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

UKSeRP uses NRDA

+Postgres, +MySQL, +MongoDB, +ElasticSearch

Page 20: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Recent Expansion

• Image storage, HPC Cluster, (Transmart), EMIF tooling

Storage360TB

Storage360TB

XNAT XNATIncoming Dataset

Data Appliance

UKD

P Sp

eci

ficIn

fras

truc

ture

TransMart TransMartSymantec

Harmonisation

*

Data Model Transformation

*

Load balancer

PostgreSQLPostgreSQL

Load balancer

PG ClusterStorage Server

Storage360TB

Storage360TB

Storage Backup

Job Scheduler

Compute Node

Compute Node

Compute Node

Compute Node

Compute Node

Compute Node

Compute Node

Compute Node

Backup Shared Infrastructure

XNAT DICOM

XNATfs

Open Stack10 to 15 server,

Intel 40 core, 96GB+ each

Page 21: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Multi centred model – central hub providing central location of cross site analysis and external data processing/contributions.

Page 22: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

UKSeRP

• Better ROI for funders

• Researchers can focus on doing research

• IT specialist run the IT

• Better performance from combined infrastructure

File

PG SQL

HAD.

Infrastructure

Data Infrastructure

DB2

Page 23: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

File

PG SQL

HAD.

Infrastructure

Data Infrastructure

Data

DB2

Page 24: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

File

PG SQL

HAD.

Infrastructure

Data Infrastructure

Data

Remote Access

Appliance

Desktop Images / Software

Portal

DB2

Page 25: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

File

PG SQL

HAD.

Infrastructure

Data Infrastructure

Data

Remote Access

Appliance

Desktop Images / Software

HPC / Bespoke InfrastructurePortal

DB2

Page 26: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

File

PG SQL

HAD.

Infrastructure

Data Infrastructure

Data

Remote Access

Appliance

Desktop Images / Software

HPC / Bespoke InfrastructurePortal

DB2

Data

Remote Access

Appliance

Portal

Page 27: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

UK Secure e-Research Platform (UKSeRP)

• Large scale data and compute platform• Performance and scale

• A remote access analytics platform• Best practice: data management, security, governance• Suite of standard and bespoke data analytical tools • Accessible across UK and internationally

• Leaves data ownership with the cohorts/programme• Each ‘controls’ slice of UKSeRP• Devolved account and access control • Information governance remains with cohorts/programme• Brings together data for DPUK across cohorts

• Enables researchers to focus on the science

Reduce Costs, Reduce Risk

Page 28: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Connection ServerSecurity Server

Float Pool

Dedicated Pool

STATA Pool

2FA Server

S3

Maxing at 33

Maxing at 16

Maxing at 10

Last 2 days 21st and 22nd March 2017

Page 29: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Connection ServerSecurity Server

6 months – peek 100 desktops

Page 30: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Security Server Connection Server

Security Server Connection Server

Security Server Connection Server

Page 31: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Different Security requirements

• SAIL Databank – No geo restrictions / NHS connections

• ADRC – Locked down environments / Safe-Rooms

• FARR – Also Identifiable datasets, vary by project

• DPUK – Multi model / integration, complex

• Biobank – Restricted to project members

• Alspac – Local secure direct access / Users UK

• (more not public yet) – like Perth, Australia

Protecting the data and ensuring only authorised access is keyTwice weekly vulnerability / pen testing – fully aware of our risk exposure (slept well this weekend)

Patching, compliance and Hardening core components

Page 32: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

We built a castle !

Wales loves Castles – have 600 of them. Conway Castle

Page 33: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Castles are Great – Big, Strong, Expensive

• Many Layers of security – restricted building, data centres, 2 inch steel door, independent alarms, devolved control of access control system, CCTV, facial detection, multi vender perimeter firewalls, internal firewalls 3rd vender, network segmentation and isolation.

• Garrison of solders to protect – team of DevOps staff, security officers, compliance and governance people

• Controlled Entry points

Page 34: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Everything outside is vulnerable

• End users / access – have to let people in

• External systems / data suppliers

Page 35: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Creating Silos

• How do you work together ?

• How do you share data if you never data out ?

We need to trust each other and create tunnels for safe passage of data/access/people

Page 36: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

How / Who do you trust ?

• Equivalence, hard to measure so subjective

• Build a relationship up

• Standards and accreditation helpful. ISO 27001 is great but does not cover governance

• Never fully trust, also put a portcullis at the end the tunnel just incase

Page 37: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

The safe share project

The safe share project 38

Example “service slice”: FarrInstitution LAN

Farr trusted environments

Janet, internet or other network

safe share router at edge

Safe sharecore

IT MAKES TUNNELS !!!

Page 38: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

The safe share project: HAN design overview

22/05/2017 The safe share project 39

TWO main designs1. All Nodes interconnected2. All Nodes connected hub-spoke

Page 39: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

How we have used it

FARR

ADRC

FARR

ADRC ADRC

Page 40: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Why?, could do it ourselves ?

FARR

ADRC

FARR

ADRC ADRC

Already connecting 3 organisations – so independent trusted name like JISC advantage, accreditations, service management and governance strong “selling point”Same strong standards, guaranteed interconnect / compatibility

Less workLess resistance

High assuranceHigh design standards

Page 41: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Could / going to - FARR

FARR FARR

FARR FARR

End users from each site could connect to any other infrastructure Other infrastructure / HPC could securely access datasets in remote data centres

Swansea

Manchester

Edinburgh

London

FARR v2 – 4 centres to 10 centres

Page 42: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Could / going to - ADRC

ADRC

ADRC ADRC

ADRC

ADRC

Swansea Southampton

UCLCardiffSwansea

ADRC

Belfast

Page 43: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Could / going to - ADRC

ADRC

ADRC ADRC

ADRC

ADRC

Swansea Southampton

UCLCardiffSwansea

Edinburgh

ADRC

All safe room can connect to all data centres – significantly increasing access

ADRCBelfast

Page 44: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Could / going to - ADRC

ADRC

ADRC ADRC

ADRC

ADRC

Swansea Southampton

Edinburgh

ADRCADRC

Belfast

Secure transfer / access of sensitive large datasets

Page 45: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

JISC Safe Share

• We need secure network inter connections

• High assurance / data governance

• Multiple over lays based on use case / requirements

• Trusted third party is the ideal solution

• The key will be for the academia community to buy into this vision, only at scale does it make sense.

• If we create silo’s then in the end things will be worseits started….

Page 46: Connecting castles securely with safe share - Jisc community Thompson... · Connecting castles securely with safe share ... PostgreSQL PostgreSQL Load balancer PG Cluster Storage

Questions?

Simon Thompson

[email protected]