2004-09-15 NVO Summer School, Aspen Center for Physics 1 Publishing and Resource Discovery with Registries Ray Plante Gretchen Greene

Mar 27, 2015


Amelia Kerr

Publishing and Resource Discovery with Registries

Ray Plante, Gretchen Greene


All about Registries

• Overview of the Registry Framework

• Publishing to the NVO

• Visits to Registries (publishing)

• VOResource: Resource Metadata in XML

• Visits to Registries (searching)

• IVOA Standard Registry Interface

• Exercise: query registry in an application


The role of Resource Registries

• Used to discover and locate resources—data and services—that can be used in a VO application

• Resource: anything that is describable and identifiable.
  – Besides data and services: organizations, projects, software, …
  – Presently concerned with a simple set of resource types

• Registry: a list of resource descriptions
  – Expressed as structured metadata to enable automated processing and searching


An Overview of Data Discovery

• You can search the main NVO registry to find resources based on descriptive criteria

• NVO Registries are “coarse-grained”
  – You can find organizations, archives, catalogs
  – Won’t find images, celestial objects, table records

• Registry framework contains multiple registries:
  – searchable registries
  – publishing registries

Registry Framework

[Figure: registry framework diagram. Labels: Local Publishing Registry (two), Local Searchable Registry, Full Searchable Registry (two), Data Centers, VO Projects, Specialized Portals & Services.]

Registry Framework

[Figure: registry framework diagram (Local Publishing Registries, Local Searchable Registry, Full Searchable Registries, Data Centers, VO Projects, Specialized Portals & Services), with links labeled "harvest (pull)".]

Registry Framework

[Figure: registry framework diagram (Local Publishing Registries, Local Searchable Registry, Full Searchable Registries, Data Centers, VO Projects, Specialized Portals & Services), with links labeled "harvest (pull)" and "cross-harvest".]

Registry Framework

[Figure: registry framework diagram (Local Publishing Registries, Local Searchable Registry, Full Searchable Registries, Data Centers, VO Projects, Specialized Portals & Services), with links labeled "harvest (pull)", "selective harvesting", and "cross-harvest".]
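The three relationships named in the framework diagrams (pull harvesting, cross-harvesting, and selective harvesting) can be illustrated with a small in-memory model. This is only a sketch of the idea, not the IVOA harvesting protocol: the `Registry` class, its method names, the resource-type strings, and the `ivo://example.*` identifiers are all hypothetical.

```python
class Registry:
    """Toy registry: a named collection of resource descriptions."""

    def __init__(self, name):
        self.name = name
        self.records = {}  # identifier -> description

    def publish(self, identifier, resource_type, title):
        """Register one resource description locally."""
        self.records[identifier] = {"type": resource_type, "title": title}

    def list_records(self, resource_type=None):
        """Answer a harvest request, optionally limited to one resource type."""
        return {
            ident: rec
            for ident, rec in self.records.items()
            if resource_type is None or rec["type"] == resource_type
        }

    def harvest(self, source, resource_type=None):
        """Pull records from another registry; the harvester starts the transfer."""
        self.records.update(source.list_records(resource_type))


# Two publishing registries, each holding its own publisher's descriptions.
pub_a = Registry("publishing A")
pub_a.publish("ivo://example.a/org", "Organisation", "Example organization A")
pub_a.publish("ivo://example.a/images", "SimpleImageAccess", "Example image service")
pub_b = Registry("publishing B")
pub_b.publish("ivo://example.b/catalog", "ConeSearch", "Example catalog service")

# Full searchable registries pull from publishers, then from each other.
full_1 = Registry("full searchable 1")
full_2 = Registry("full searchable 2")
full_1.harvest(pub_a)
full_2.harvest(pub_b)
full_1.harvest(full_2)  # cross-harvest
full_2.harvest(full_1)  # cross-harvest

# A local searchable registry pulls only the subset it cares about.
local = Registry("local searchable")
local.harvest(full_1, resource_type="ConeSearch")  # selective harvesting
```

After the cross-harvest both full registries hold the same three records, while the local searchable registry holds only the cone search entry.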

Registry Framework

[Figure: registry framework diagram (Local Publishing Registries, Local Searchable Registry, Full Searchable Registries, Data Centers, VO Projects, Specialized Portals & Services), with Client Applications sending search queries.]



NVO Public Registries

Registry                    URL                                             Searchable?   Publishing?
STScI/JHU NVO Registry      http://nvo.stsci.edu/voregistry/                Yes           Yes
Caltech Carnivore           http://nvo.caltech.edu:8080/carnivore/          Yes           Yes
NCSA Registration Portal    http://nvo.ncsa.uiuc.edu/nvoregistration.html   No            Yes

Private Publishing Registries
• HEASARC
• CDS

Only support harvesting protocol


Overview of Publishing

• Resources are published if one can use NVO facilities to find them.

• How to Publish to the NVO: http://us-vo.org/pub/files/PublishHowto.html
  – Multiple layers of publishing
    • Starts with registry description of resource
    • Data Access Services
    Incremental exposure for incremental effort
  – Who are you? How you publish depends on what you want to publish.
    • An individual with a small data collection
    • An archive center
    • Someone with a cool service


Small collections: VO-ready Repositories

• Repositories that allow users to deposit data to share with the community
  – Guarantee long-term storage, availability

• Automatic support for VO publishing mechanisms
  – Entries into NVO Registry
  – Support for standard services: Cone Search, SIA, SSA, SkyNode

• Currently available Repositories
  – Images: NCSA Astronomy Digital Image Library, http://adil.ncsa.uiuc.edu/
  – Spectra: Spectrum Service for the VO, http://voservices.net/spectrum/

• More public repositories are expected to emerge. Check the NVO website (http://us-vo.org/) for the latest.


Persistent Archives: Tools for Federation

• Registering your resources with a VO publishing registry
  – Enter description into registration form at one of the available NVO registries:
    • STScI/JHU Registry: http://nvo.stsci.edu/voregistry/
    • NCSA Registration Portal: http://nvo.ncsa.uiuc.edu/nvoregistration.html
    • Caltech Carnivore: http://nvo.caltech.edu:8080/carnivore/
  – If you have a large number of resources to register, you can run your own registry on your own site
    • Caltech Carnivore: http://nvo.caltech.edu:8080/carnivore/
    • NCSA VORegistry-in-a-Box: http://nvo.ncsa.uiuc.edu/VO/software/


Caution: Construction Ahead

• Which registry should I register with?
  – Behavior is slightly different
  – Different strategies for making it “easier” to register multiple resources
  – More uniform support is underway

• Support for standard services is strongest
  – Cone Search, SIA, SkyNode
  – SkyNode: go to STScI or Carnivore
  – SSA: go to NCSA

• Your feedback is valuable!


Persistent Archives: Tools for Federation

• What can/should you register?
  – Should: your Organization
    • Declares you as a publisher with an ID
  – Should: your Data Collection
    • Users at least know how to access it via a Browser
  – Can: your existing services
    • Browser-based services: e.g. search page
    • Traditional CGI services
    • Web Services

The next level…
• Implement and register one or more standard services
  – Cone Search
  – Simple Image Access
  – SkyNode
  – Simple Spectral Access*

*standard still in development


Cool Services: Integrating with the VO

1. Register your service at a registry
   • Currently as a generic resource
   • Improved support for non-standard services coming
   • (immediate future: contact NVO project)

2. Integrate support for standard VO formats, schemas
   • FITS and VOTable
   • Standard Data Model schemas (emerging)
     • VOResource, Space-time Coordinates, Spectra

3. Implement Standard Support Interface
   • a standard in development for: self-description, tracking health and usage


A word about Identifiers…

• IVOA Identifier: a globally-unique URI identifying a resource

Ex: ivo://adil.ncsa/targeted/SIA

• Required as part of a registered resource description

• As publisher, you control what it looks like

• Two components:
  – Authority ID: e.g. adil.ncsa
    Defines a namespace for identifiers
    Owned by a single publishing organization
  – Resource Key: e.g. targeted/SIA
    Name for the resource unique within the namespace
    Encourage re-use of local identifiers
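As a minimal sketch of the two-component structure, the example identifier from this slide can be split with Python's standard `urllib.parse`. The function name `split_ivoa_identifier` is hypothetical, and the sketch only separates the parts; it does not check them against any IVOA syntax rules.

```python
from urllib.parse import urlsplit


def split_ivoa_identifier(identifier):
    """Split an ivo:// URI into (authority ID, resource key)."""
    parts = urlsplit(identifier)
    if parts.scheme != "ivo" or not parts.netloc:
        raise ValueError(f"not an IVOA identifier: {identifier!r}")
    authority_id = parts.netloc            # namespace owned by one publisher
    resource_key = parts.path.lstrip("/")  # name unique within that namespace
    return authority_id, resource_key


authority_id, resource_key = split_ivoa_identifier("ivo://adil.ncsa/targeted/SIA")
print(authority_id)   # adil.ncsa
print(resource_key)   # targeted/SIA
```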


Visits to the Publishing Registries

• Publishing Registries
  – STScI/JHU Registry: http://nvo.stsci.edu/voregistry/
  – NCSA Registration Portal: http://nvo.ncsa.uiuc.edu/nvoregistration.html
  – Caltech Carnivore: http://mercury.cacr.caltech.edu:8080/carnivore/

• Recommend you stick with one registry
  – Authority IDs are currently “stuck” to a single registry

• Can’t do something? Contact us!


Resource Metadata: XML Schema

• Classes of Resources
  – Generic Resource
  – Extensions: e.g. DataCollection, Service, ConeSearch, SimpleImageAccess, SkyNode

• Organized into separate schemas:
  – Core resource metadata: VOResource
  – Various extension schemas containing specific types

• Capable of describing…
  – Data centers, research organizations, missions, observatories
  – Data collections, archives
  – VO standard services: Cone Search, Simple Image Access, Simple Spectral Access, SkyNode
  – Existing Browser/CGI-based services
  – Web Services


Searching the Registry

Available Interfaces:
  – Keyword Search
  – Intermediate Interfaces
    • Form-based
    • Under development
    • e.g. http://nvo.caltech.edu:8080/carnivore/search
  – Advanced SQL/ADQL/XQuery interfaces

Registry                    URL                                             Searchable?   Publishing?
STScI/JHU NVO Registry      http://nvo.stsci.edu/voregistry/                Yes           Yes
Caltech Carnivore           http://nvo.caltech.edu:8080/carnivore/          Yes           Yes
NCSA Registration Portal    http://nvo.ncsa.uiuc.edu/nvoregistration.html   No            Yes


IVOA Standard Registry Interface

• IVOA Working Draft: 2 parts
  – Harvesting: sending descriptions from publishers to searchable registry
  – Searching

• Searching
  – Methods
    • keywordSearch(string words, boolean combineByOr)
    • search(ADQLWhere constraints)
  – Returns a list of VOResource descriptions
  – Advanced searching with ADQL:
    • Just the “where” part (i.e. search constraints) of ADQL
    • In place of column names, use XPath to VOResource element
      – Curation/Publisher like ‘%NASA%’
      – Query extensible to any VOResource XML extension
      – Maps readily to registry implementations based on RDBMS or XML-DB.
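The two search methods can be mimicked locally to show what they ask of a registry. The sketch below is an assumption-laden illustration, not the Working Draft interface: `keyword_search` and `search_like` are hypothetical stand-ins, the records are simplified XML in which only the Curation/Publisher path comes from the slide, and the LIKE matching is case-sensitive by choice.

```python
import re
import xml.etree.ElementTree as ET

# Simplified stand-ins for VOResource records (not schema-valid).
RECORDS = [
    "<Resource><Title>Example X-ray archive</Title>"
    "<Curation><Publisher>NASA/GSFC example</Publisher></Curation></Resource>",
    "<Resource><Title>Example image library</Title>"
    "<Curation><Publisher>Example University</Publisher></Curation></Resource>",
]


def like_to_regex(pattern):
    """Translate an SQL LIKE pattern (% and _ wildcards) into a regex."""
    pieces = []
    for ch in pattern:
        if ch == "%":
            pieces.append(".*")
        elif ch == "_":
            pieces.append(".")
        else:
            pieces.append(re.escape(ch))
    return re.compile("".join(pieces), re.DOTALL)


def search_like(records, xpath, pattern):
    """Return records where some element at `xpath` matches the LIKE pattern."""
    regex = like_to_regex(pattern)
    hits = []
    for xml_text in records:
        root = ET.fromstring(xml_text)
        if any(regex.fullmatch(el.text or "") for el in root.findall(xpath)):
            hits.append(root)
    return hits


def keyword_search(records, words, combine_by_or):
    """Match keywords against all text in a record, combined by OR or AND."""
    terms = words.lower().split()
    combine = any if combine_by_or else all
    hits = []
    for xml_text in records:
        root = ET.fromstring(xml_text)
        text = " ".join(root.itertext()).lower()
        if combine(term in text for term in terms):
            hits.append(root)
    return hits


# The constraint from the slide: Curation/Publisher like '%NASA%'
nasa_hits = search_like(RECORDS, "Curation/Publisher", "%NASA%")
and_hits = keyword_search(RECORDS, "example archive", combine_by_or=False)
or_hits = keyword_search(RECORDS, "example archive", combine_by_or=True)
```

With these two records, the publisher constraint and the AND keyword search each match only the first record, while the OR keyword search matches both.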


Why might a client use the standard search interface?

• Uniform interface to all VO Registries

– Not dependent on a single registry

• Direct relationship between information you are querying and information you get back.

• Extensible to any type of resource description

• Re-use of ADQL

Custom interfaces to Registries:

• Registries can provide extended functionality

– More advanced capabilities

– Simpler interfaces for specialized purposes

• Client toolkits can provide simplifying interfaces

– Support for ADQL/s

– XPath aliases
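One way a client toolkit might offer XPath aliases is to expand short names into full paths before a constraint is sent to the registry. This is a hedged sketch only: the alias table, the `expand_aliases` function, and the `Title` path are hypothetical, and only Curation/Publisher appears in these slides.

```python
import re

# Hypothetical alias table: short name -> XPath into the resource description.
ALIASES = {
    "publisher": "Curation/Publisher",
    "title": "Title",
}

# Match either a quoted string literal or a bare word.
_TOKEN = re.compile(r"'[^']*'|[A-Za-z_]\w*")


def expand_aliases(constraint, aliases=ALIASES):
    """Replace alias words with XPaths, leaving string literals untouched."""

    def replace(match):
        token = match.group(0)
        if token.startswith("'"):
            return token  # string literal: keep as written
        return aliases.get(token, token)

    return _TOKEN.sub(replace, constraint)


expanded = expand_aliases("publisher like '%NASA%' and title like '%publisher%'")
print(expanded)
# Curation/Publisher like '%NASA%' and Title like '%publisher%'
```

Matching quoted literals first keeps an alias name that happens to appear inside a search string from being rewritten.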


What you might use this week

• STScI or Carnivore Registry portals to search for resources

• Any of the registration portals to register resources

• Software for querying registry from an application
  – Java: see java/dev/nvoregistry
  – IDL: see idl/VOlib_0.1 (call_registry.pro)
  – Python: see python/PythonVOClientLibrary.html (VORegistry.py)
  – PHP: see in php/web: kwdSearch.php, phpVOlib