Top Banner
Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl
42

Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Dec 19, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Fusing Corporate Thesaurus Management with

Linked Data using PoolParty

Thomas Schandl

Page 2: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

PoolParty at a glance

• Developed by punkt. netServicesCurrent release: PoolParty 2.8

• Main focus on three applicationareas:

– SKOS Thesaurus Management

– Linked Data (publishing & consuming)

– Semantic Search & Semantic Indexing

2

Page 3: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Challenge for Content Management

3

1.Annotation: Add meaning to the content

2.Link content: Bring content together in a meaningful way

3.Make content searchable: Add background knowledge to the content

Page 4: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Traditional approach to annotate content with metadata

4

Apple is in the process of launching an application to allow iPhone, iPad and iPod Touch users to purchase Apple merchandise straight from their devices.

Apple

application

merchandise

iPod touch

iPadiPhone

Page 5: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Semantic Web approach: Concepts & Relations instead of simple text

5

Apple is in the process of launching an application to allow iPhone, iPad and iPod Touch users to purchase Apple merchandise straight from their devices.

http://my.com/AppleApple

Apple Inc.

http://my.com/iPhone

http://my.com/iPhone3G

iPhone

iPhone 3GS

iPhone 3G

http://my.com/smartphone

Page 6: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

in a nutshell

• W3C Semantic Web standards: Management of multi-lingual (corporate) thesauri & taxonomies on top of Semantic Web standards (SKOS, RDF, OWL & SPARQL)

• Usability: easy-to-use, web-based AJAX user interface

• Scalable Semantic Technologies: RDF Triple Store (SAIL), (Lucene) index engine and a phrase-extraction component

• Service oriented: PoolParty Server offers a Java-API & several interfaces: HTTP web services, SPARQL endpoint, Linked Data

6

Page 7: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

PoolParty GUI

7

Page 8: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Full compatibility with SKOS/RDF

8

Page 9: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Some highlights: PoolParty thesaurus management

• Drag & drop , Auto-Complete

• Document analysis: phrase extraction

• Enrich concepts by using linked data

• Publish thesauri as linked data

• Advanced reporting functionality

• Import and validation of thesauriand CSV files

• Thesauris quality checker

• Wiki style collaborative editing of thesauri

• Visual browsing and map navigation

9

Page 10: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Built-in automatic phrase extraction

10

• Supports different formats (html, doc,pdf, ppt, …)

• Thesaurus basedextraction

• Integrable withCMS, CRM etc.

• Supports different formats (html, doc,pdf, ppt, …)

• Thesaurus basedextraction

• Integrable withCMS, CRM etc.

Page 11: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Some Applications on top of PoolParty

• Tag recommendation: support users and content managers when annotating text

• Semantic Indexing: PoolParty TagEvent Store as a basis for a semantic index ( IndexBuilder)

• Similarity search: „Similarity“ is configurable: Certain features of a document can be „boosted“ (example: persons, places / user tags etc.)

• Semantic Search and Navigation: Thesaurus can be used for facetted and moderated search (examples: emteba.at, ecoi.net)

• Search Engine Dictionaries: provide company or domain specific terms for search engine dictionary

11

Page 12: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Similarity search: finding the unexpected…

12

Expert #4532

Senior Product Manager Enterprise Wiki

at MitchelLake Consulting

in Sydney Area………

Project #AZ67

Integration of Confluence which is a web-based

corporate wiki. It is developed and

marketed by Atlassian, Australia.

…..

same topic

near location

Page 13: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

PoolParty DemoZone

• compare thesaurus based approach with traditional approach

• tag recommender

• similar documents

• find images which fit to your document

• browser bookmarklet

13

Page 14: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Wordpress Glossary Plugin

14

• automatic generation of glossaries for Wordpress blogs

• SKOS compatibility

• automatic link detection and linkage with glossary term

Page 15: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Programmatic access via Web Services

• getProposedTagsForDocument

• addTaggingEvent

• getTagFrequencies

• addDocumentToSimilarityIndex

• findSimilarDocuments

• getConceptSuggestions

• …..

15

Page 16: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Programmatic access – Example: emteba.at

16

Page 17: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

PoolParty

Linked DataFeatures in Detail

Page 18: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

SKOS Thesauri + Linked Data

18

Page 19: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Linked Data – Benefits & Application Scenarios

19

Thesaurus Management•Automatic population ofthesauri•(Semi) Automatic categorization of new concepts End User

•Content augmentation•Improved recommender services•Improved navigation elements, e.g. in web-shopsContent Provider

•Improved SEO•Reduced costs of content management•New services and mashups

Page 20: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Publishing Linked Data with PoolParty

20

• using linked data patterns and „Cool URIs“

• Linked Data front-end

Additionally:

• Wiki front-end

• SPARQL-endpoint

Page 21: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Linked Data frontend

21

Page 22: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Consuming Linked Data

22

• advanced linked data look-up services

• expandable number of linked data sources already integrated

• linked data synchronisation mechanisms (beta)

Page 23: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Linked Data Screencast

• Here comes a screencast

23

Page 24: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Using SKOS context to link concepts to LD resources and semi-automatic population of thesaurus

Example: Thesaurus about arts and artists Concept „Painters“ with NT:

Kandinsky, Rembrandt and Berners-Lee

• Using broader and sibling concepts to help disambiguate and suggest the painter Berners-Lee

• Finding mutual categories from Dbpedia or Freebase

• Suggesting more NTs for Painters using LD categories

24

Page 25: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

PoolParty

Semantic Search

Page 26: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

More background knowledge from thesauri and linked data can improve semantic search

• better disambiguation of search terms

• background knowledge of search terms help to „expand queries“

• better similarity search because of more metadata

• content augmentation through linked data

26

Page 27: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Semantic Services provided by PoolParty

27

Search assistants(Auto-Complete, faceted search)

Improve user´s search experience

Moderated Search

Creating complex queries

Tag Recommendation

Identifying the meaning of a document

Similarity Search(Recommender Systems)

Understanding relations

1

2

3

4

Page 28: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Search Assistants

28

• clever auto-complete

• query expansion

• faceted search

• visual search

• Google synonyms

Page 29: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Moderated Search

29

• thesaurus helps to create complex queries

• supports multi-linguality

• helps to explore a domain without deep knowledge

Page 30: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Tag Recommendation

30

• annotation of documents with low effort

• motivation for people to annotate documents

• basis for building a semantic index

Page 31: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Similarity Search

31

• improved similarity detection on top of additional background knowledge

• build recommender systems for web-shops or knowledge management systems

• help people to skim large document collections

• detect hidden relations between documents

Page 32: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Integration of thesauri with Enterprise Search

32

PoolParty ReportingExport parts of thesauri intoindividual XML-formats and synchronize with search engine

Possible integrations with enterprise search engine:•Autocomplete-Server•Entity dictionary•Query rewriting•Moderated search•Enrich semantic index

PoolParty Web-ServicesIntegrate thesauriinto search enginewith real-timequeries

• improved semantic enterprise search

• all metadata can be administrated at one single place

• expandable via linked data mechanisms

Page 33: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

PoolParty

Thesaurus ManagementAdvanced Features

Page 34: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Multilinguality

34

Page 35: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Concept mapping

• skos:exactMatch

• skos:closeMatch

used for linked data mapping

used for concept mapping, e.g. after having imported a thesaurus

35

Page 36: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Associating notes with concepts

36

• skos:historyNote

• skos:changeNote

• skos:editorialNote

used to trace meanings of a concept

used to discuss meanings of a concept

Page 37: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Introduce individual relations between concepts

37

Create your own individual inverse or symmetric relations between concepts

Page 38: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Import / export / reporting

38

• import & export of SKOS using various RDF serializations

• import of CSV

• import of Zthes

• import/export of sub-trees

• custom reports and XML exports based on PoolParty´s template engine

Page 39: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Quality checks and validation service

39

Check thesauri to….

• be complete

• be non-cyclic (e.g. no circularity in the broader/narrower hierarchy).

• have no disjoints between related and hierarchical paths.

Page 40: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Visual browsing

40

Page 41: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Use your favourite theme!

41

Page 42: Fusing Corporate Thesaurus Management with Linked Data using PoolParty Thomas Schandl.

Contact

Apply for a PoolParty demo accounthttp://poolparty.punkt.at/

Thomas [email protected]+43-1-8974122-27

punkt. netServices GmbHLerchenfelder Guertel 43A—1160 Wien / Austriahttp://www.punkt.at/

42