Top Banner
Exploratory Search upon Semantically Described Web Data Sources SSW workshop @ VLDB 2012, Istanbul, Turkey Marco Brambilla Politecnico di Milano marco.brambilla@polim i.it marcobrambi
29

Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Jan 15, 2015

Download

Technology

Marco Brambilla

This presentation was given at the SSW workshop, collocated with VLDB 2012.

Exploratory search applications upon structured Web content is becoming one of the main information seeking paradigms for users. This is mainly due to the move towards mobile and pervasive Web access and to the more and more tight intertwining between everyday life and information seeking.

Structured data is typically distributed on the Web and accessible through a service-oriented paradigm. This paper proposes a vision on: (1) a semantically-enabled service registration framework for describing in a Web data services in a convenient way; and (2) a design method for applications that exploit such model using a design pattern -based method.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Exploratory Search upon Semantically Described

Web Data Sources

SSW workshop @ VLDB 2012, Istanbul, Turkey

Marco BrambillaPolitecnico di Milano

[email protected]

Page 2: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Outline

• Context

• Search Services Specification• Description• Semantic Annotation

• Exploratory Search

• Design patterns

• Demo

• Outlook

Page 3: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Context

Web is a huge, heterogeneous data source:

Structured, unstructured and semi-structured data Known problems of trust, reputation, consistency

User needs to solve real-life problems, not to find a web site

Page 4: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Context

Google? Well, yes… an “interesting” system

Page 5: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Context

User needs to solve real-life problems, not to find a web site

Web queries get increasingly complex and specialized Exploratory search From document search to object search

Search as a service

Viability of systems based upon search service orchestration

Page 6: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

What are search services?

• APIs over Web data sources • Structured data• Domain-specific

• Wrapping of information utility sites

Page 7: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

How can we use them?

“… search for upcoming concerts close to an attractive location (like a beach, lake, mountain, natural park, and so on), considering also availability of good, close-by hotels …”

• Applying complex queries (also with “joins”)

Page 8: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Background: semantic multi-domain search

“… expand the search to get information about available restaurants near the candidate concert locations, news associated to the event and possible options to combine further events …”

Page 9: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Liquid Query: Query Submission

Concert query conditions

Hotelsquery conditions

Example Scenario 1: Trip planner for events

Page 10: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Liquid Query: Query Execution

Page 11: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Liquid Query: alternative visualizations and domain-independent platform

Example Scenario 2: Scientific Publication search

Page 12: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Problem 1: Service specification

• No service description per se

• Focused on search• Ranking aware

• Description• Bottom-up• Based on the service interface

• Annotation• Relying on an external reference knowledge base

Page 13: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

SDF and SAF

Service Description (SDF) vs. Service Annotation (SAF)

Page 14: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Example of SDF instances

Page 15: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

The registration of services

Bottom-up approach from the service signatures

Registration process fully specified and implemented

See demo video at: http://search-computing.it/registration_demo

• Starts from SI details (name, type of service, etc.) and SI field details, i.e. name, data type and I/O directionality

• the name and I/O fields of the SI are scanned with NLP and Semantic techniques in order to identify the most suitable Domain Diagram items to represent them

• The expert user's intervention is required to provide a feedback concerning system-hypothesized mappings

• When all mappings have been validated, a newly created Access Pattern and its corresponding Service Mart are committed

Page 16: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Example of resulting service mart

• A set of predefined combinations of services, to be reused for specific cases

Page 17: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Problem 2: Reduce flexibility

Maximum flexibility over huge amounts of search services is not always the best solution

People want straightforward paths and want to be quick Commercial implementations are likely to be on fixed sets of domains and

fixed exploration directions

Page 18: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Design Patterns

• A set of blueprint combinations of services, to be reused for different cases

• Very much like UML design patterns or datamart patterns

Page 19: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Design Patterns – some examples

• Sequence

Page 20: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Design Patterns – some examples

• Star

Page 21: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Design Patterns – integrated examples

• Join

Page 22: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Design Patterns – integrated examples

• Join

Page 23: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Exploration implementation

• Not just a matter of data sources

• Also: data visualization, user interface specification, usability, ..

See demo videos at:http://demo.search-computing.net/night_planner_demo/seco/seco.htmlhttp://demo.search-computing.net/new_job_demo/seco/seco.html

Page 24: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Problem 3 - Outlook

When dealing with real-life problems, people do not trust the web completely

Want to go back to discussion with people Expect insights, opinions, reassurance

Exploratory search must be blended with social-network based recommendations and inputs

Page 25: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Social Search: increasing quality in search

• From exploratory search to friends and experts feedback

Exploratory Search System

Human Search System

Initial query

Exploration step

Exploration step

System API Social API

Database / IR index

Crowd / Community

Page 26: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Example: Find your job (social invitation)

Selected data items can be transferred to the crowd question

Page 27: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Find your job (response submission)

Page 28: Exploratory Search upon Semantically Described Web Data Sources: Service registration and methodology. At vldb2012

Conclusions and future work

Well, I’ve shown everything..

See our papers at WWW 2010 (Liquid Query) and WWW 2012 (CrowdSearcher)

Future work

• More experiments (e.g., vs. sociality of users, vs. crowds, …)

• Not only search: active integration of web structured data and social sensors

Some ads

• Search Computing book series (Springer LNCS)

• Workshop Very Large Data Search at VLDB

• VLDB Journal special issue (deadline Sept 2012)

Third book

coming

next Sept.