DC 2006 Mexico | 03-06/10/2006 | 1 MENGER | Federal Environment Agency The Semantic Network Service Supporting Heterogeneous Environmental Information Systems Federal Environment Agency Matthias Menger / Maria Rüther {matthias.menger|maria.ruether}@uba.de
25
Embed
DC 2006 Mexico | 03-06/10/2006 | 1MENGER | Federal Environment Agency The Semantic Network Service Supporting Heterogeneous Environmental Information Systems.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
DC 2006 Mexico | 03-06/10/2006 | 1MENGER | Federal Environment Agency
The Semantic Network Service
Supporting Heterogeneous Environmental Information
Systems
Federal Environment AgencyMatthias Menger / Maria Rüther
{matthias.menger|maria.ruether}@uba.de
DC 2006 Mexico | 03-06/10/2006 | 2MENGER | Federal Environment Agency
Background
environmental community• cover many disciplines -> many topics, terms,
DC 2006 Mexico | 03-06/10/2006 | 12MENGER | Federal Environment Agency
Graphical View1 Level of Associations
DC 2006 Mexico | 03-06/10/2006 | 13MENGER | Federal Environment Agency
Graphical View2 Levels of Associations
DC 2006 Mexico | 03-06/10/2006 | 14MENGER | Federal Environment Agency
Services Make Use of Semantic Structure (TopicMap)
• findTopics- search topics by names and topic types
• getPSI- reference of topic characteristics and its associations (Published Subject Identifier) - navigating along the relations of a specific term (tree of related topics)
• autoClassify- automatic classification indexing (html, xhtml, pdf)- resource can be a document or just an URL- result list with significant topics (ranking mechanism)
DC 2006 Mexico | 03-06/10/2006 | 15MENGER | Federal Environment Agency
• getSimilarTerms- returns ‘somehow’ similar terms for a given search term
• findEvents- events matching the given search term
• anniversary- events in chronicle happened x years ago by reference date as a reminder
Services Make Use of Semantic Structure (TopicMap)
DC 2006 Mexico | 03-06/10/2006 | 16MENGER | Federal Environment Agency
autoClassify1.
read document
discover terms
find matching topics
recognise term positions
3.
relevance by frequency
… by term positions
… by clustering
2.
understand composite terms
resolve ambiguities
replace non-descriptors
significant topics of a document index
DC 2006 Mexico | 03-06/10/2006 | 17MENGER | Federal Environment Agency
Topic Clusters
`topic space`documedocumentnt
topics grouped around addressable information objects
primary topic cluster
secondary topic cluster
loner
DC 2006 Mexico | 03-06/10/2006 | 20MENGER | Federal Environment Agency
SNS-Metadata
• metadata is stored with the URL – at application site (e.g. PortalU) – not at in the original document
• use of same algorithm for – analysing and indexing of documents…
– analysing user`s search request
DC 2006 Mexico | 03-06/10/2006 | 21MENGER | Federal Environment Agency
Integrate DC Metadata• currently not used – because there are not
enough DC metadata available
• concept allows to integrate DC metadata in the classification process
• currently used meta tags:– title, keywords (and headers h1-h3) with higher
priority for ranking– terms in the body (text)– parser allows to analyse HTML, XHTML, and PDF
documents
DC 2006 Mexico | 03-06/10/2006 | 22MENGER | Federal Environment Agency
Used in…
UmweltinformationsnetzDeutschland2003
Geodaten Infrastruktur2004
Geodaten InfrastrukturThüringen 2004
Umwelt-PortalBaden-Württemberg,in Entwicklung 2006
SNSsemantic
Web Services
SNSsemantic
Web Services
Umweltdaten-katalog,in Planung 2006
Geodaten InfrastrukturRheinland-Pfalz 2005
Seit Juni 2006
Geodaten InfrastrukturMecklenburg-Vorpommern 2006
…environmental portals + Spatial Data Information brokers
DC 2006 Mexico | 03-06/10/2006 | 23MENGER | Federal Environment Agency
www.PortalU.de
• German environmental portal
• 100 different information providers
• SNS analyse documents, create an index, and harvest the content of each provider matching to one topic
• SNS currently handle each document seperately one-by one
DC 2006 Mexico | 03-06/10/2006 | 24MENGER | Federal Environment Agency
User
• IT professionals– integrating the services in their
applications
• scientific user– searching and indexing (their) web objects
• public– searching relevant information more easily
DC 2006 Mexico | 03-06/10/2006 | 25MENGER | Federal Environment Agency
Outlook
• make use of available data servicesgazetteer of Federal Agency for Cartographyno double efforts in maintainance
• OWL instead of TopicMap interoperability
• integrate additional semantics if needed!
• develop additional services if needed!
DC 2006 Mexico | 03-06/10/2006 | 26MENGER | Federal Environment Agency
Outlook (2)
• integrate SNS in further applications if central service is not desired
• consider the context of document currently documents handled one-by-one
• derive Ontologies automatically avoid manual maintenance of vocabularies
• integrate more metadataif available! Educate and convince people + offer more automated approaches
DC 2006 Mexico | 03-06/10/2006 | 27MENGER | Federal Environment Agency