
DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

May 07, 2015



Andrew Hoppin

Presentation about DKAN, the open data distribution of Drupal, given at San Diego Drupalcamp in January 2014.
Transcript
Page 1: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

2013 US Federal Project Open Data

Page 2: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

2009 Data.gov

Page 3: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

2009 Data.gov.uk

Page 4: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Facilities

2012 Local Laws....

Page 5: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Facilities

Resulting in New Open Data Portals

Page 6: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Open Data: Government Operations Innovation

Facilities

Page 7: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Open Data: Civic Empowerment

Facilities

Page 8: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Open Data: Economic Development

Facilities

(Shout-Out: Portland CivicApps <--> CitySync)

Page 9: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

2013 Civic Open Data Ecosystem: 100s of Companies, 1000s of Governments, 10,000s of Apps, 1,000,000s of Data Sets...

Page 10: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

San Diego is Next!

Page 11: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Proprietary SaaS Open Data Portal Market Leaders

Page 12: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

CKAN: Open Source Market Leader

Page 13: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

More About CKAN

Python

JavaScript

ORM: Pylons web framework & SQLAlchemy

PostgreSQL

Jinja2 Template Engine

SOLR

Modular; ~40 extensions

Page 14: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

In 2012 CKAN Broke Out...

Page 15: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

CKAN In the US

Page 16: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

CKAN Worldwide

Page 17: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Why Open-Source Matters...

• No vendor lock-in / choice of consultants / ability to build in-house capacity

• Collaborate w/ our peers (White House)

• Security transparency (US DoD is a major consumer for this reason)

• Open-source platforms often pay more heed to open formats and standards (e.g.: DCAT, RDFa, OData, JSON vs Shapefiles, PDF, etc.)

• Innovation: healthy open-source projects can aggregate more engineering effort than proprietary alternatives, propagate great new extensions faster

• Freedom of hosting options: consume as a cloud-hosted service today, change our mind and host in-house tomorrow, etc.

Page 18: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

http://flickr.com/photos/rocketqueen/1573565705/

But Data.gov.uk, Data.gov, HealthData.gov, OpenGovPlatform, etc. all added Drupal to CKAN

Page 19: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

If You Already Use Drupal, Do You Really Need CKAN?

Page 20: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

With DKAN Distro, Drupal Itself Is Now Also Becoming a Public Sector Data Management System (“DMS”)

Page 21: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Why DKAN Instead of Drupal+CKAN?

• Manage content, data, permissions through same platform

• Single software stack to maintain

• Single site to design & theme

• Easy to extend with social features

• Transparent, well-governed upgrade path of Drupal

• Extensive Drupal ecosystem of civil service talent, consultants, hosting, support

Page 22: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Why Drupal?

• MATURE: >1 million sites (2% of all sites), 3,718 code commits/wk, 6,388 issue comments/wk

• IN-HOUSE SKILLS: 24% of .gov sites

• EXTENSIBLE: 18,489 Modules, 1,512 Themes, 21,009 Contributors

• FISMA-Certified Cloud Hosting Options

• INTEGRATES easily w/ public websites; lots of de facto data is already published as content

Page 23: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Why Drupal? Open Content since 2009...

Page 24: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

WhiteHouse.gov

Page 25: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

NYSenate.gov

Page 26: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

http://flickr.com/photos/rocketqueen/1573565705/

Content = Data (Legislative Committees)

Page 27: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Open Data Sets Catalog

Page 28: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Custom Search for Special Data Types

Page 29: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

http://flickr.com/photos/rocketqueen/1573565705/

Allow Data to be Augmented

Page 30: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Make it Easily Discoverable...

Page 31: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

And Accessible Everywhere...

Page 32: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

http://flickr.com/photos/rocketqueen/1573565705/

...And Easily Shared

Page 33: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Drupal “DKAN” Distribution

Open-Source Data Management System (DMS) + Content Management System (CMS) = Streamlined Data Collaboration

Page 34: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Open Data is Just “Sharing Your Files”

• Datasets are collections of resources, with some descriptive metadata

• Resources are just files. They can be any kind of file, but often they are CSV files, spreadsheets or some other kind of tabular data file.

• Organizations create datasets and upload resources.

• Data consumers can browse datasets and sometimes see visualizations of resources.
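The dataset/resource model in the bullets above can be sketched in a few lines of Python. This is only an illustration of the concepts on the slide, not DKAN's actual (Drupal-based) implementation; the class names, field names, and example data are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Resource:
    """A single file attached to a dataset (often CSV or another tabular file)."""
    name: str
    url: str
    file_format: str


@dataclass
class Dataset:
    """A collection of resources plus some descriptive metadata."""
    title: str
    organization: str  # the organization that created the dataset
    metadata: Dict[str, str] = field(default_factory=dict)
    resources: List[Resource] = field(default_factory=list)

    def add_resource(self, resource: Resource) -> None:
        """Attach an uploaded file to this dataset."""
        self.resources.append(resource)

    def tabular_resources(self) -> List[Resource]:
        """Resources a portal could plausibly preview or visualize as a table."""
        tabular = {"csv", "tsv", "xls", "xlsx"}
        return [r for r in self.resources if r.file_format.lower() in tabular]


# Hypothetical example data, for illustration only.
permits = Dataset(
    title="Building Permits",
    organization="Example City",
    metadata={"license": "Public Domain"},
)
permits.add_resource(
    Resource("permits-2013.csv", "https://example.org/permits-2013.csv", "CSV")
)
permits.add_resource(
    Resource("permits-readme.pdf", "https://example.org/permits-readme.pdf", "PDF")
)
```

Here `permits.tabular_resources()` returns only the CSV file, which mirrors the point that consumers can "sometimes" see visualizations: only some resource types lend themselves to a preview.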

Page 35: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

DKAN

• Fully functional data portal housing datasets, Solr search, accessible via JSON and RDF; CSV or XML files uploaded through Drupal, stored in *SQL, visualized through Recline.js

• Seeks to replicate CKAN 2.0 functionality, design, standards, & API

• Reuses CKAN components wherever possible (e.g.: Recline.js)

• Built with support and input from the Open Knowledge Foundation

• Fully open project, with code on Drupal.org/project/DKAN
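Since the slide says DKAN seeks to replicate the CKAN API, a client written against CKAN's action API might work against a DKAN portal too. The sketch below assumes the CKAN v3 URL layout (`/api/3/action/<action>`) and the CKAN `package_show` response shape; whether a given DKAN site matches both exactly is an assumption to verify. The portal URL, dataset id, and sample response are hypothetical, and no network request is made.

```python
import json
from urllib.parse import urlencode, urljoin


def action_url(portal_base: str, action: str, **params: str) -> str:
    """Build a CKAN-style action URL, e.g. .../api/3/action/package_show?id=..."""
    url = urljoin(portal_base.rstrip("/") + "/", f"api/3/action/{action}")
    return f"{url}?{urlencode(params)}" if params else url


def resource_urls(response_text: str) -> list:
    """Pull resource URLs out of a CKAN-style package_show response body."""
    payload = json.loads(response_text)
    if not payload.get("success"):
        raise ValueError("portal reported an unsuccessful request")
    # Assumes CKAN's shape: "result" is an object with a "resources" list.
    return [res["url"] for res in payload["result"].get("resources", [])]


# Hypothetical portal and dataset id.
show_url = action_url("https://data.example.org", "package_show", id="building-permits")

# Hand-written sample body in the CKAN response shape (not real portal output).
sample_response = json.dumps({
    "success": True,
    "result": {
        "name": "building-permits",
        "resources": [
            {"url": "https://data.example.org/permits-2013.csv", "format": "CSV"},
        ],
    },
})
urls = resource_urls(sample_response)
```

Keeping URL construction and response parsing separate from the HTTP call makes it easy to point the same code at a CKAN or DKAN site and to adjust the parsing if the response shape differs.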

Page 36: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

DKAN Components

• DKAN Distribution

• DKAN Dataset module

• DKAN Datastore module

Page 37: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

DKAN Walkthrough

Page 38: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

DKAN Walkthrough

Page 39: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

DKAN Walkthrough

Page 40: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

DKAN Walkthrough

Page 41: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

DKAN Walkthrough

Page 42: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

DKAN Walkthrough

Page 43: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

DKAN Walkthrough

Page 44: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

DKAN Walkthrough

Page 45: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Early Returns: Cologne, Germany

Page 46: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Early Returns: Puerto Rico

Page 47: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Early Returns: CivicInfo British Columbia

Page 48: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Early Returns: Data Wisconsin

Page 49: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Ongoing Development

• Adding feedback on datasets, other social features

• Support for additional file types

• Adding DKAN_DataSet & DKAN_DataStore modules to other Distros like OpenCivic

• Offering enterprise support & hosted OpenSaaS DKAN

Page 50: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

“NuData DKAN” OpenSaaS Offering

• NuData = our DKAN as a turnkey, hosted, 24/7 supported software-as-a-service

• Governments like SaaS (e.g.: Socrata) because it’s quick, affordable, and puts no technology burden on existing staff

• Governments like open-source (e.g.: CKAN) because they’re in control: no vendor lock-in, ability to customize, innovate

• OpenSaaS = the best of both worlds; SaaS but truly open: you can take your app and your data with you with minimal switching cost

• Drupal is exceptionally well positioned to enable OpenSaaS businesses

Page 51: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Drupal Open Data Policy Compliance Recipes

• Add /data.html & /data.json pages to existing Drupal site with new Open Data Module? (sandbox project)

• Add data management & publishing features to a Drupal site with DKAN Data Set & DKAN Data Store Modules

• Deploy new Open Data Catalog / Portal with the DKAN Distribution, on your own, or as SaaS
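To make the /data.json recipe concrete, here is a rough Python sketch of what generating such a catalog file could involve. It is not the Open Data module's or DKAN's code. The field names follow the 2013 Project Open Data "common core" metadata as best recalled (title, description, keyword, modified, publisher, contactPoint, mbox, identifier, accessLevel), and the flat-list layout is likewise an assumption; check the current schema before relying on either. The example values are hypothetical.

```python
import json


def catalog_entry(title, description, keywords, modified, publisher,
                  contact_name, contact_email, identifier, access_level="public"):
    """Build one dataset entry for a /data.json catalog (assumed field names)."""
    return {
        "title": title,
        "description": description,
        "keyword": list(keywords),
        "modified": modified,          # ISO 8601 date string
        "publisher": publisher,
        "contactPoint": contact_name,
        "mbox": contact_email,
        "identifier": identifier,
        "accessLevel": access_level,
    }


def build_data_json(entries):
    """Serialize entries as a JSON array, the layout assumed for /data.json here."""
    return json.dumps(list(entries), indent=2)


# Hypothetical dataset, for illustration only.
entry = catalog_entry(
    title="Building Permits",
    description="Permits issued by the city, updated monthly.",
    keywords=["permits", "construction"],
    modified="2014-01-15",
    publisher="Example City",
    contact_name="Open Data Team",
    contact_email="opendata@example.org",
    identifier="building-permits",
)
data_json = build_data_json([entry])
```

On a Drupal site the equivalent output would be produced from dataset content already stored in the CMS, which is the point of the recipe: the catalog file is a view over existing content rather than a separate system.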

Page 52: DKAN: The Drupal Open Data Distribution (presented at SANDCamp San Diego Drupalcamp)

Followup

Project Page: http://drupal.org/project/DKAN

“Fully Made” DKAN w/ .js libraries: https://github.com/nuams/dkan

Marketing Site: http://nucivic.com/dkan/

[email protected] | @ahoppin | nuams.com | nucivic.com