Top Banner
GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Dr Vishwas Chavan Senior Programme Officer for Senior Programme Officer for DIGIT DIGIT [email protected] [email protected] WWW.GBIF.ORG Towards Data Publishing Framework for primary biodiversity data Building the Biodiversity Informatics Commons Building the Biodiversity Informatics Commons DataCite Summer Meeting 7-8 June 2010, Hannover
25

GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT [email protected] Towards Data Publishing Framework.

Mar 27, 2015

Download

Documents

Dylan McKenna
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

GLOBALBIODIVERSITYGLOBALBIODIVERSITYINFORMATIONFACILITYINFORMATIONFACILITY

Dr Vishwas ChavanDr Vishwas ChavanSenior Programme Officer for Senior Programme Officer for [email protected]@gbif.org WWW.GBIF.O

RGWWW.GBIF.O

RG

Towards Data Publishing Framework for primary

biodiversity data

Towards Data Publishing Framework for primary

biodiversity data

Building the Biodiversity Informatics CommonsBuilding the Biodiversity Informatics Commons

DataCite Summer Meeting7-8 June 2010, Hannover

Page 2: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

GBIF: an intergovernmental initiative to share biodiversity information

GBIF: an intergovernmental initiative to share biodiversity information

Currently 54 countries; 44 International Organisations…

Page 3: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

GBIF’s Mandate ”To facilitate free and open access to biodiversity data worldwide, via the Internet, to underpin scientific research, conservation and sustainable development.”

GBIF is govt-initiated, and govt. funded, in response to government agency needs in biodiversity information access and management;

GBIF is in service to science, as a global ‘public good’

Page 4: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Data shared online via GBIF Data shared online via GBIF

(>201 m biodiversity records mapped to a 1 X 1 degree grid)

Data Publishers: 316Data Resources: 9900

Page 5: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

GBIF facilitates access/exchange of dataGBIF facilitates access/exchange of data

GBIF-mediated data on the ‘India’

Page 6: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

GBIF, Global Information Infrastructure for Biodiversity

GBIF, Global Information Infrastructure for Biodiversity

Global InfrastructureTools, Standards, and ProcessesStrategies and Policy

FrameworkOutreach and Capacity Building

Page 7: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Global Biodiversity Research

Infrastructure

Global Biodiversity Research

Infrastructure

Page 8: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

In summary…GBIF’s InformaticsIn summary…GBIF’s Informatics

Improved accessto Names, Metadata and Primary Biodiversity Data

Distributed GBIF informatics architecture

Faster and easier publishing of data

Page 9: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Primary biodiversity data and information effectively available

Data and information that have been produced but are not easy to find, access, and use (i.e not effectively available!) - a gigantic task of mobilising billions of data is still needed, as well as integrating new data.

Biological collections

Scientific publications

Observations

Reports

Gray literature

Data Bases

Geography

Page 10: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Improving mobilisation and Cultural ChangesImproving mobilisation and Cultural Changes

Broadening Data Types

Data Resources Discovery

Innovative Approaches to Data Mobilisation

Data Mobilisation Strategy Discussions

Data Publishing Framework

Page 11: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

What is there for

me?

Recognition

Opportunities

Investment

Why should I publish data?Why should I publish data?

Page 12: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Data Publishing FrameworkData Publishing Framework

Cultural change towards ‘free and open access’ to biodiversity data

Addresses social, technical, and policy concerns

Answer ‘What is there for me?’ for ALL

Page 13: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Chavan and Ingwersen (2009) , BMC Bioinformatics, 10 (Suppl. 14): S2

Page 14: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

DPF: Core Technical ComponentsDPF: Core Technical Components

Chavan and Ingwersen (2009) , BMC Bioinformatics, 10 (Suppl. 14): S2

Page 15: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Occurrence Data

Occurrence Data

KML file

Data Publication together with scholarly publication: ZooKeys

experience

Penev, et.al. (2009). ZooKeys, 11: 1-8.

Page 16: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

PersistentIdentifiers

Journal System

SubmissionSubmission

AcceptanceAcceptance

RevisionRevision

Peer ReviewPeer Review

PublicationPublication

Registry

GBRDS

DoI

DistributedMetadata Catalogues

Metadata Authors

auto conversion to manuscriptauto conversion to manuscript

GBIF Metadata Repository

Current

Biology

PhytoKeys

Indian J. Mar. Sci.

Data Paper:Recognising Data Discovery

Page 17: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Data Citation Mechanism & ServiceData Citation Mechanism & Service

Deep data citation mechanismRecognise ALL with their rolesMultilayer citation – producer, publisher,

aggregatorCitations within citations

Data Citation ServiceResolve citation any timeDiscover the underlined data

Under development

Page 18: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Data Usage Index (DUI): Why?Data Usage Index (DUI): Why?

To demonstrate to data publishers that their biodiversity efforts do have impact

• To encourage …– Increase of high quality data discovery and

mobilisation– Further usage of biodiversity data and information in

scientific work– Formal citation behavior in research papers of dataset– Standardisation of dataset information

Page 19: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

GBIF Indicators 19

Data Usage Index (DUI): What is it?Data Usage Index (DUI): What is it?

As set of indicators operating on data concerned with: Unique Visits Loyal Visits (repeated visits by same IP address) Download of datasets & dataset records Volume and (rank) distributions of dataset records

per visit, visitor, dataset provider (institution, country, region, world, theme) & period

Indicators to be normalised (by records or MB), relative (to world, theme) and weighted (according to provider profile of species/taxa/themes)

Chavan, June 2009

Page 20: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Data Flow type

DigitisationBottom – TopTop – Bottom

GlobalDUIs

Natl.,Regional,ThematicDUIs

Local DUIs

UN

IVE

RS

AL

D

UI

Mirror MirrorGDUI

GDUIGDUI

Aggregator AggregatorAggregator

RDUITDUI

TDUI

AggregatorAggregator

AggregatorNDUINDUI

LDUI

Publishin

g Toolkit

Publishin

g Toolkit

Publishin

g Toolkit

Publishin

g Toolkit

LDUILDUI

Implementation of DUI

Page 21: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Data Usage Index (DUI) implementation

Data Usage Index (DUI)

Phase I Phase IIIPhase II

Access UseManagement

Data Life Cycle

Improving the relevance of Data Usage Index

Page 22: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

DPF: ChallengesDPF: Challenges

Chavan and Ingwersen (2009) , BMC Bioinformatics, 10 (Suppl. 14): S2

Policy & Political Uptake

Cultural & Social Acceptance

Individual Researcher

Scientific and Academic Institutions

Funding and Donor Agencies

Traditional Publishing Industry

Page 23: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Funding Agencies

Project

Data Creation, Collection

Analysis,Interpretation

ScholarlyPublishing

Data Management,& Archival

DataPublishing

Increased Data Usage

Knowledge Dissemination

support

results in Inspires another

results in

requires

provide feedback on gaps

and strategies fo

r

leads t

o

Metadata

facilitate

facilitate

results in

Impr

oves

dat

a qu

ality

and

fitne

ss

facil

itate

enco

urag

es

Existing cycle

Complementary Expected cycleImpact Factor

Data Usage Index

DataDiscovery

Incentivisation

through Data Paper

leads to

leads to

leads to results

in

Source: BMC Bioinformatics 2009, 10(Suppl 14):S2, doi:10.1186/1471-2105-10-S14-S2

Impact of Data Publishing FrameworkImpact of Data Publishing Framework

Page 24: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Primary Data and Scholarly PublicationsPrimary Data and Scholarly Publications

Seamless, embedded interconnections between data & paper

• Unconventional use of data

• Improving reliability & credibility

Page 25: GLOBAL BIODIVERSITY INFORMATION FACILITY Dr Vishwas Chavan Senior Programme Officer for DIGIT vchavan@gbif.org  Towards Data Publishing Framework.

Email: [email protected]

Data Publishing together with Scholarly Publishing!

Data Publishing together with Scholarly Publishing!