AGRIS Multi-Host Search System: Using Dublin Core to homogenise distributed databases Frehiwot Fisseha FAO/WAICENT AGRIS/CARIS and Documentation Unit

Dec 26, 2015

Page 1

AGRIS Multi-Host Search System: Using Dublin Core to homogenise distributed databases

Frehiwot Fisseha

FAO/WAICENT

AGRIS/CARIS and Documentation Unit

Page 2

Overview

* Background - AGRIS
* Motivation for the AGRIS MHS
* Vision
* Challenges
* Way forward
* Components and architecture of the MHS

Page 3

Background - AGRIS

AGRIS is the International Information System for the Agricultural Sciences and Technology.

AGRIS is a bibliographic database created by FAO in 1974 to facilitate information exchange and to identify world literature dealing with all aspects of agriculture.

AGRIS is a co-operative system: 199 national, international and intergovernmental centres participate and submit about 14,000 records per month.

The system collects bibliographic references (to date, about 3 million) to both conventional materials (journal articles, books) and non-conventional materials (sometimes called "grey literature", e.g. theses and reports) that are not available through ordinary commercial channels.

Http://www.fao.org/agris

Page 4

Motivation for the AGRIS MHS

A wide range of bibliographic information should be included in the AGRIS database.

Large components of the AGRIS database are existing national bibliographic databases like AGRICOLA and ELFIS.

• Some of the databases are searchable on the internet and need not be copied or reprocessed by FAO's AGRIS processing group.

Page 5

Vision

AGRIS as a focal point of access for agricultural information (bibliographic as well as full text) on distributed databases and information systems

uniform presentation of meta information

easily accessible information content

Page 6

Challenges

Distributed databases

Databases are heterogeneous

• different data structures

• different metadata information

Page 7

Way Forward

Develop a search engine that searches the distributed databases in parallel (development outsourced to ZADI)

• develop an XML wrapper for each co-operating database

Develop common meta elements to homogenise the search result set (in-house)
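
The parallel search described above can be pictured with a minimal Python sketch. It is an illustration only, not the ZADI implementation: the database names `db_a` and `db_b`, the `make_wrapper` helper and the in-memory title lists are hypothetical stand-ins for real XML wrappers running on remote hosts.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for per-database wrappers; a real wrapper would
# query a remote host instead of filtering an in-memory list.
def make_wrapper(name, titles):
    def search(term):
        return [{"source": name, "title": t}
                for t in titles if term.lower() in t.lower()]
    return search

WRAPPERS = {
    "db_a": make_wrapper("db_a", ["Rice cultivation in Asia", "Soil erosion"]),
    "db_b": make_wrapper("db_b", ["Rice pests", "Dairy farming"]),
}

def parallel_search(term, wrappers=WRAPPERS):
    """Send the same query to every wrapper concurrently and merge the hits."""
    with ThreadPoolExecutor(max_workers=len(wrappers)) as pool:
        futures = {name: pool.submit(search, term)
                   for name, search in wrappers.items()}
        merged = []
        for name in sorted(futures):  # deterministic order by database name
            merged.extend(futures[name].result())
    return merged
```

Every hit carries a `source` key, so the merged list can still be traced back to the database that returned it.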

Page 8

Way forward (contd.): Related activities

Develop and propose a DC-based metadata description for agricultural resources, for the purpose of resource discovery and interoperability

– organise a working group to propose DC-based standards (agricultural Dublin Core elements and qualifiers)

– draft a specification for a DC-based standard for describing document-like agricultural resources

– crosswalk for the existing databases (AGRIS, CARIS, FAO online Catalogue)

– posted on the Agstandard mailing list for discussion and feedback

– work in progress to encode the standard with an XML framework (DTD, RDF)
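
A crosswalk of the kind listed above is essentially a per-database table that renames local fields to common elements. The sketch below assumes hypothetical local field names (`TI`, `AU`, `doc_title` and so on) and hypothetical database keys; only the target names `title`, `creator`, `subject` and `date` are taken from the Dublin Core element set. It does not reproduce the actual AGRIS, CARIS or FAO catalogue crosswalks.

```python
# Hypothetical local field names on the left; the element names on the
# right (title, creator, subject, date) belong to the Dublin Core element set.
CROSSWALKS = {
    "local_db_1": {"TI": "title", "AU": "creator", "KW": "subject", "PY": "date"},
    "local_db_2": {"doc_title": "title", "author": "creator", "year": "date"},
}

def to_dublin_core(record, database):
    """Rename a record's local fields to DC elements, dropping unmapped fields."""
    mapping = CROSSWALKS[database]
    return {mapping[field]: value
            for field, value in record.items() if field in mapping}
```

Records from differently structured databases come out with the same keys, which is what makes a single result display possible.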

Page 9

The AGRIS MHS System

[Diagram: Client <-> AGRIS-Server <-> Distributed Databases. Requests pass from the client through the AGRIS server to the distributed databases; search results pass back the same way.]

Page 10

Components of the AGRIS MHS

[Diagram labels: Client; AGRIS-WWW-Server (static pages, dynamic pages); Transaction Manager; Wrapper Manager; Distributed Databases]

Page 11

[Diagram labels: Client; AGRIS-Server; AGRIS-WWW-Server (static pages, dynamic pages); Transaction Manager]

1. Login (AGRIS homepage) and request for the search form (HTTP)

2. Transmission of a static search form (HTML) including intelligent checks (JavaScript)

Page 12

[Diagram labels: Client; AGRIS-Server; AGRIS-WWW-Server (static pages, dynamic pages); Transaction Manager]

3. Input of search criteria, including the databases to be searched, and sending of the request (HTTP)

4. Request transmission to dynamically generate the search result page
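
Step 3 amounts to turning an HTTP query string into structured search criteria. The sketch below shows one way this could look; the parameter names `db`, `field` and `q` are assumptions made for illustration and are not taken from the AGRIS search form.

```python
from urllib.parse import parse_qs

def parse_search_request(query_string):
    """Extract the search criteria from an HTTP query string."""
    params = parse_qs(query_string)
    return {
        "databases": params.get("db", []),         # one entry per selected database
        "field": params.get("field", ["any"])[0],  # default: search all fields
        "pattern": params.get("q", [""])[0],
    }
```

Repeating the `db` parameter is how this sketch lets a user select several databases in one request.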

Page 13

[Diagram labels: AGRIS-Server; Transaction Manager; Wrapper Manager; Distributed Databases]

5. Kernel activation and request transmission, including request syntax and search criteria (databases, fields, search pattern)

6. Distributed requests to the distributed databases over IP (XML)

7. Search result delivery (XML)

8. Compilation of one homogeneous result set and delivery (using AG-DC)
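
Step 8 can be sketched as merging the XML documents returned by the individual databases into a single result set. The element names `results`, `record` and `resultset`, and the `source` and `count` attributes, are hypothetical; the slides do not show the actual AGRIS XML format.

```python
import xml.etree.ElementTree as ET

def compose_result_set(xml_documents):
    """Merge per-database <results> documents into one <resultset> element."""
    merged = ET.Element("resultset")
    for source, xml_text in xml_documents.items():
        for record in ET.fromstring(xml_text).iter("record"):
            record.set("source", source)  # remember where each hit came from
            merged.append(record)
    merged.set("count", str(len(merged)))  # number of merged records
    return merged
```

The merge only works cleanly because every database is assumed to deliver records with the same element names, which is the role the common AG-DC elements play in the slides.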

Page 14

[Diagram labels: Client; AGRIS-Server; AGRIS-WWW-Server (static pages, dynamic pages); Transaction Manager]

9. Result set conversion (XML -> HTML) and transmission

10. Request answering (HTML)

Search result in AG-DC format
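
The XML -> HTML conversion in step 9 could look roughly like the sketch below. The input element names (`resultset`, `record`, `title`, `creator`) and the HTML layout are assumptions; the slides do not say how the conversion is actually implemented.

```python
import xml.etree.ElementTree as ET
from html import escape

def result_set_to_html(xml_text):
    """Render each <record> of an XML result set as an HTML list item."""
    root = ET.fromstring(xml_text)
    items = []
    for record in root.iter("record"):
        # Escape text so that characters such as & and < stay valid in HTML.
        title = escape(record.findtext("title", default="(untitled)"))
        creator = escape(record.findtext("creator", default="unknown"))
        items.append(f"<li><b>{title}</b> ({creator})</li>")
    return "<ul>\n" + "\n".join(items) + "\n</ul>"
```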

Page 15

[Diagram labels: Client; AGRIS-Server; AGRIS-WWW-Server (static pages, dynamic pages); Transaction Manager; Wrapper Manager; Distributed Databases]

Page 16

[Diagram labels: AGRIS-Kernel; XML Query Generator; XML Result Set Composer; Knowledge Base (Database - URL); Transaction Manager; Distributed Databases]

5. Kernel activation and request transmission, including request syntax and search criteria (databases, fields, search pattern)

6. Translation of the HTTP request into a standardised XML query and delivery to the databases

7. Search result delivery (XML)

8. Compilation and transmission of one homogeneous result set
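
The XML Query Generator and its knowledge base (database -> URL) can be approximated as follows. This is a sketch under stated assumptions: the `example.org` URLs are placeholders, and the `query`, `field` and `pattern` element names are invented for illustration rather than taken from the AGRIS query format.

```python
import xml.etree.ElementTree as ET

# Hypothetical knowledge base: database name -> wrapper URL (placeholders).
KNOWLEDGE_BASE = {
    "db_a": "http://db-a.example.org/wrapper",
    "db_b": "http://db-b.example.org/wrapper",
}

def build_xml_queries(databases, field, pattern):
    """Return {wrapper_url: xml_query_string} for each known database."""
    queries = {}
    for name in databases:
        if name not in KNOWLEDGE_BASE:
            continue  # silently skip databases the kernel does not know
        query = ET.Element("query", database=name)
        ET.SubElement(query, "field").text = field
        ET.SubElement(query, "pattern").text = pattern
        queries[KNOWLEDGE_BASE[name]] = ET.tostring(query, encoding="unicode")
    return queries
```

Every database receives the same query structure; only the target URL differs, so adding a database means adding one knowledge-base entry.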

Page 17

[Diagram: Client <-> AGRIS-Server <-> Distributed Databases, with requests flowing towards the databases and search results flowing back]

Page 18

[Diagram labels: AGRIS-Server; Distributed Databases; XML-Wrapper; Database-Kernel; Data, Indexes; Knowledge Base (AGRIS-XML <-> local schemes)]

6. Translation of the HTTP request into a standardised XML query and delivery to the databases

6.1. Translation of the XML request into the database-specific syntax, opening of a retrieval session and activation of the database kernel

6.2. Database search

7. Search result delivery (XML) with AG-DC elements
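
Steps 6.1 and 6.2 can be illustrated with a wrapper that translates the XML request into a local query language. The sketch uses SQL on SQLite purely as an example of a "database-specific syntax"; the table `records`, its columns `ti` and `au`, and the `LOCAL_SCHEME` mapping are all hypothetical, and the real co-operating databases may use entirely different retrieval systems.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical wrapper "knowledge base": common field name -> local column.
LOCAL_SCHEME = {"title": "ti", "creator": "au"}

def translate_query(xml_query):
    """Translate an XML query into a parameterised SQL statement (step 6.1)."""
    query = ET.fromstring(xml_query)
    # The column name comes from a fixed whitelist, never from user input;
    # an unknown field raises KeyError.
    column = LOCAL_SCHEME[query.findtext("field")]
    sql = f"SELECT ti, au FROM records WHERE {column} LIKE ? ORDER BY ti"
    return sql, (f"%{query.findtext('pattern')}%",)

def run_search(connection, xml_query):
    """Run the translated query against the local database (step 6.2)."""
    sql, params = translate_query(xml_query)
    return connection.execute(sql, params).fetchall()
```

To try it, open an in-memory database with `sqlite3.connect(":memory:")`, create a `records` table with `ti` and `au` columns, and pass the connection to `run_search`.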

Page 19

[Diagram labels: AGRIS-Server; Distributed Databases; XML-Wrapper; Database-Kernel; Data, Indexes; Knowledge Base (AGRIS-XML <-> local schemes)]

6. Translation of the HTTP request into a standardised XML query and delivery to the databases

6.3. Search result in database-specific syntax

6.4. Conversion into XML

7. Search result delivery (XML)

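Steps 6.3 and 6.4 describe the wrapper turning native search results back into XML. A minimal sketch follows, assuming the native results arrive as tuples of (title, creator); the `results` and `record` element names and the `source` attribute are hypothetical, while `title` and `creator` are borrowed from Dublin Core.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping from tuple position to a Dublin Core element name.
COLUMNS = ("title", "creator")

def rows_to_xml(rows, source):
    """Serialise database-specific result rows as one XML <results> document."""
    results = ET.Element("results", source=source)
    for row in rows:
        record = ET.SubElement(results, "record")
        for element, value in zip(COLUMNS, row):
            ET.SubElement(record, element).text = value
    return ET.tostring(results, encoding="unicode")
```
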
Page 20

[Diagram: Client <-> AGRIS-Server <-> Distributed Databases, with requests flowing towards the databases and search results flowing back]

http:/cfs.zadi.de

Page 21

Thank you for your attention!