12/2/2013 1 T ouristic Intelligence Tirol (TiTi) Dieter Fensel, Andreas Lackner, Christian Maurer, and Bernhard Rieder with the help of ... STI Innsbruck With the help of … Birgit Juen Zaenal Akbar Dr. José María García Dr. Anna Fensel 2 Dr. Nelia Lasierra Ioannis Stavrakantonakis Serge Tymaniuk Dr. Ioan Toma
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
12/2/2013
1
Touristic Intelligence Tirol (TiTi)
Dieter Fensel, Andreas Lackner, Christian Maurer, and Bernhard Rieder
with the help of ...
1STI Innsbruck
With the help of …
Birgit JuenZaenal Akbar Dr. José María GarcíaDr. Anna Fensel
2
Dr. Nelia Lasierra Ioannis Stavrakantonakis Serge TymaniukDr. Ioan Toma
12/2/2013
2
What is the talk about?
• The Challenge: Successful value generation through Tourisms in the 21st century
• An essential means: A shared IT infrastructure for effective and efficient marketing and sales.
• Follow-Up opportunities
• Summary
3
y
The Challenge
Successful value generation through Tourisms in the 21st century
4STI Innsbruck
12/2/2013
3
The Challenge
The Hotelier of today has to deal with many different communication channels:
5
HOTEL RECEPTION
The Challenge
- walk-in customerThe Hotelier of today has to deal with many different communication channels:
6
HOTEL RECEPTION
12/2/2013
4
- walk-in customer- phone
The Hotelier of today has to deal with many different communication channels:
The Challenge
7
HOTEL RECEPTION
The Challenge
- walk-in customer- phone-email
The Hotelier of today has to deal with many different communication channels:
8
HOTEL RECEPTION
12/2/2013
5
The Challenge
- walk-in customer- phone- email- fax
The Hotelier of today has to deal with many different communication channels:
9
HOTEL RECEPTION
The Challenge
- walk-in customer- phone- email- fax- hotel website
The Hotelier of today has to deal with many different communication channels:
hotel website
10
HOTEL RECEPTION
12/2/2013
6
The Challenge
- walk-in customer- phone- email- fax- hotel website
The Hotelier of today has to deal with many different communication channels:
- review sites
11
HOTEL RECEPTION
The Challenge
- walk-in customer- phone- email- fax- hotel website
The Hotelier of today has to deal with many different communication channels:
- review sites- booking sites
12
HOTEL RECEPTION
12/2/2013
7
The Challenge
- walk-in customer- phone- email- fax- hotel website
The Hotelier of today has to deal with many different communication channels:
hotel website- review sites- booking sites- social network sites
13
HOTEL RECEPTION
The Challenge
The Hotelier of today has to deal with many different communication channels: - walk-in customer- phone- email- fax- hotel websitehotel website- review sites- booking sites- social network sites- blogs
14
HOTEL RECEPTION
12/2/2013
8
The Challenge
The Hotelier of today has to deal with many different communication channels: - walk-in customer- phone- email- fax- hotel websitehotel website- review sites- booking sites- social network sites- blogs- fora & destination sites
15
HOTEL RECEPTION
The Challenge
The Hotelier of today has to deal with many different communication channels: - walk-in customer- phone- email- fax- hotel websitehotel website- review sites- booking sites- social network sites- blogs- fora & destination sites- chat
16
HOTEL RECEPTION
12/2/2013
9
The Challenge
The Hotelier of today has to deal with many different communication channels: - walk-in customer- phone- email- fax- hotel websitehotel website- review sites- booking sites- social network sites- blogs- fora & destination sites- chat- video & photo sharin
17
HOTEL RECEPTION
The Challenge
The Hotelier doesn’t only have to deal with an overwhelming number of communication channels but also haschannels, but also has to pay up to 15% sales commissions to the booking sites!-> 100 million € sales
commission
18
HOTEL RECEPTION
12/2/2013
10
The Challenge
-> 80 million overnight stays-> 4 billion € transaction
• Professional on-line marketing for attracting potential customers requires:
– Content of high quality and originality
– Visible presented at multiple places in various formats and adaptations
• Professional on-line sales for gaining customers requires:– On-line bookability
– Tight alignment of content and bookings opportunities (i.e., content and booking data)
– Multiple booking opportunities (fluid booking)
– Multi channel yield management and revenue optimization (swarm intelligence and big
25
data)
• Obviously even more is needed to cover to full-fledged customer journey
The solution
A shared IT infrastructure for effective and efficient marketing and sales
26STI Innsbruck
12/2/2013
14
The kernel
Where we go …
Semantic Alignment
Semantic Annotations
Repository
Touristic Ontology TISs
CMSs
TSPs TAs Tirol Werbung
External Data (LOD)and Content
Semantic Search
g
Web pages
Social MediaChannels
Mobile Channels
27
The kernel
Semantic Alignment
Semantic Annotations
Repository
Touristic Ontology TISs
CMSs
TSPs TAs Tirol Werbung
External Data (LOD)and Content
Semantic Search
g
Web pages
Social MediaChannels
Mobile Channels
28
Repository
Touristic OntologyTISs
CMSs
.. and how we start:
12/2/2013
15
The kernel
Repository
Touristic OntologyTISs
CMSs
29
The kernel - Repository
• “A content repository is a store of digital content with an associated set of datamanagement, search and access methods allowing application-independent accessto the content ” (Wikipedia 2013)to the content… (Wikipedia, 2013).
• In terms of CMS a repository acts as a ground layer for long-term storage, structureddata management and digital preservation.
• Common features:
– Data storage
– Data querying
– Data managementRepository
30
– Data management
• Semantic repository is a store of structured data allowing– to store, query and manage the data,
– and automatically reason about the data by using ontologies as semantic schemata.
12/2/2013
16
The kernel - Repository
• Efficient data access and collaborative data management allowing working with the same content consistently.
• Greater focus on digital preservation by better support of archiving, versioning, etc.
• Abstraction layer support for employing complex functionality in a uniform way.
31
• Reasoning capabilities by implementing inference layers.
• Synchronization on the repository level.
The kernel - Repository
• Native: Persistent storage systems with their own implementation ofRDF databases. Provide support for transactions, own query compilerand generally their own procedure language
• Non-Native: Persistent storage systems set-up to run on third partyDBs.
– E.g. Jena SDB.
• In-Memory: RDF Graph is stored as triples in main memory
E S iftOWLIM All G h S N ti
32
– E.g. SwiftOWLIM, AllegroGraph, Sesame Native.
• Non-in-Memory: RDF Graph is stored not in main memory
– E.g. BigOWLIM, AllegroGraph.
12/2/2013
17
The kernel - Repository
• AllegroGraph is a native RDF graph database:– Scaling to billions of triples
– Support of RDF/XML, N3, N-Triples serialization formatsSupport of RDF/XML, N3, N Triples serialization formats
– SPARQL, PROLOG queries
– Built-in reasoners (Jena, Sesame, Racerpro)
– Free and Commercial versions.
• Virtuoso is a native, RDBMS-based semantic repository:– Scaling to billions of triples
– Support of RDF/XML, N3 serialization formats
– SPARQL/SPASQL queries
– Built-in reasoners (Jena, Sesame, Redland)
Free and Commercial versions
33
– Free and Commercial versions.
• Oracle11g:– Scaling to millions of triples
– Support of RDF/XML, N-triples serialization formats
– SQL and SPARQL queries
– Native inferencing and 3rd party reasoned support (Jena)
– Commercial version.
The kernel - Repository
• OWLIM is a scalable semantic repository which allows:
– Management, integration, and analysis of heterogeneous data
– Combined with light-weight reasoning capabilities
– Available as a Storage and Inference Layer (SAIL) for Sesame Open RDF
• Sesame’s infrastructure
• Support for multiple query language (RQL, RDQL, SeRQL)
• Support for import and export formats (RDF/XML, N-Triples, N3).
34
• SwiftOWLIM: in-memory reasoning and query evaluation, fast retrieval, query evaluation, scales to ~100M statements.
• BigOWLIM: more scalable not-in-memory enterprise class repository.
12/2/2013
18
The kernel - Ontology
• Define a domain-specific data model in the repository.
• This touristic Ontology can be used to mediate between different data models reducing the mapping problem from n! to n .
Repository
Touristic Ontology
mapping problem from n! to n .
35
The Problem Multiple & heterogeneous sources of information
The kernel - Ontology
- Different formats & structures- Common usage purpose
- Touristic Information Systems- Content Management Systems- Web pages- Social Media Channels
Mobile Channels
36
- Mobile Channels- External Data (LOD) and Content- Hotel on-line content and data- Content and data of the various touristic associations- Content and data of the Tirol Werbung
12/2/2013
19
The kernel - Ontology
The Solution Touristic Ontology
Integration Layer to provide a common and clear understanding to:
Touristic Ontology
- Unify data & management
37
Repository
First … what is an ontology?
“An ontology is a formal explicit specification of a shared conceptualization”
The kernel - Ontology
An ontology is a formal, explicit specification of a shared conceptualization Studer, Benjamin, Fensel. Knowledge Engineering: Principle and Methods. Data and knowledge engineering, 25 (1998) 161-197
In simple words….
• Ontologies represent concepts and basic relations for the purpose ofcomprehension of a knowledge domain area. To develop an ontologymeans to formalize a common view of a certain area.
38
• Ontologies model knowledge about a specific domain.
• They provide a common vocabulary, the meaning of the terms and also therelation among the terms in order to provide a common and sharedunderstanding for the comprehension of a domain.
12/2/2013
20
Specifically, in the Tourism domain…Vocabulary and relations for describing hotel elements and characteristics.
The kernel - Ontology
• Example of a Tourism ontology: Accommodation Ontology Metadata[University of Innsbruck, Martin Hepp]
– This Accommodation Ontology is an extension of GoodRelations.
– Provides the additional vocabulary elements for describing hotel rooms, hotels, camping sites, and other formsof accommodations, their features, and modeling compound prices as frequently found in the tourism sector,e.g. weekly cleaning fees or extra charges for electricity in vacation homes based on metered usages.
– http://ontologies.sti-innsbruck.at/acco/ns.html
39
• Schema.org vocabularies- Schema.org provides a collection of shared vocabularies webmasters can use to mark up their pages in ways that
can be understood by the major search engines.
– LodgingBusiness > Hotel , Bed &Breakfast, Hostel, Motel
– Also vocabularies for describing events, restaurants & touristic attractions
location Place or PostalAddress The location of the event, organization or action.
i H D ti Th i h f b i
Thing
Place
Organization
41
openingHours Duration The opening hours for a business.
priceRange Text The price range of the business, for example $$$.
paymentAccepted Text Cash, credit card, etc.
LocalBusiness
Which is the added value of the ontology?
- Common knowledge model Clear understanding
The kernel - Ontology
- Common knowledge model Clear understanding
- Integration layer - Reduce the number of mappings between information sources (n:n n:1) from n! to n.
- Provide a background knowledge for systems to automatize certain tasks. Can be used to perform meaningful and inteligent queries Semantic Search
- Allows to exchange data and to interpret the information in the data that has been
42
Allows to exchange data and to interpret the information in the data that has been exchanged in the right context Semantic Alignment
- Allow the creation of machine readable annotations Semantic Annotations
12/2/2013
22
Touristic Information Systems include:
• Property Management Systems,
H l I f i S
The kernel – Touristic Information Systems
• Hotel Information Systems,
• Points of Sales,
• Accounting and Payroll Systems,
• Inventory control Systems,
• Booking and B ki Ch l
Repository
Touristic OntologyTISs
CMSs
43
Booking Channel Management Systems.
The kernel – Touristic Information Systems
Information integration and use across these systems is not a trivial task, because:
foundations and concepts behind backend solutions for e Tourism• foundations and concepts behind backend solutions for e-Tourism (Touristic Information Systems) vary.
• knowledge on the backend solutions for e-Tourism and the functionalities they provide needs to be applied in a consistent manner.
• existing implementation of backend systems and their use in practice varies.
• being able to select, in the light of concepts, functionalities and existing solutions the appropriate backend system for a given tourism business
44
solutions the appropriate backend system for a given tourism business setting and make that easily interoperable is essential.
12/2/2013
23
Import of data from Touristic Information Systems (TISs) will facilitate multi-platform touristic operation, such as:
The kernel – Touristic Information Systems
• unifying principles and methods that enable bookings electronically;
• enabling advanced concepts on booking, direct and multi-channel booking;
• understanding, anticipation and influencing tourists behavior in order to maximize hotel yield or profits;
• facilitating getting a clear overview of the varieties in booking software
45
solutions available on the market;
• making the appropriate solution choices and apply booking and yield management practically.
The kernel – Touristic Information Systems
Addressing the needs of various stakeholders with the ontologized layer for integration of TISs data:
• Tourists, e.g.:– Effective integrated availability of offers availability data
– Possibility to more efficiently combine available offers
• Hotels and tourism service providers, e.g.:– Improved information and process management across heterogeneous TISs
– Receiving data across TISs to enrich their own data and content (e.g. linking to the events in the area which could draw tourists)
• Touristic associations, e.g.:
46
, g– Enabled easier access to the data from hotels and other tourism service providers
– Possibility to easier combine and aggregate data, serve as a better intermediary
12/2/2013
24
The kernel – Content Management Systems
Import from and export to Content Management Systems (CMSs)
Repository
Touristic OntologyTISs
CMSs
47
The kernel – Content Management Systems
• 38,4% of the websites use Web Content Management Systems and the CMS market share is constantly increasing (W3Techs.com report, Nov. 2013)
• CMS is referred to “a computer program that allows publishing, editing and modifying content as well as maintenance from a central interface” (Wikipedia, 2013).
• Types:– Component Content Management System (CCMS)
• Methods and tools for managing and storing re-usable assets, concept items, topics within documents.
– Enterprise Content Management Systems (ECMS)
48
Enterprise Content Management Systems (ECMS)
• Methods and tools for managing and storing enterprise’s content and documents related to organizational processes.
– Web Content Management System (WCMS, often referred as CMS)
• Methods and tools for online and/or offline managing and storing the website content.
• Uses content repository or a database for data storage.
• Supplied as off-the-shelf or custom-developed tailored to specificneeds.
49
The kernel – Content Management Systems
• Based on content and design separation.
• Presentation layer consists of basic CMS templates, which act as a webpage blueprint.
• Dynamic page creation on demand through data extraction from database/content repository.
Customization is achieved by adding plug ins with custom features
50
• Customization is achieved by adding plug-ins with custom features.
• Major technologies used: Java, PHP, Python, MySQL, Perl.
12/2/2013
26
The kernel – Content Management Systems
44%Use a CMS
system
38%Use a CMS
system
51
WCMS distribution of Austrian hotels
WCMS distribution world-wide
W3Techs.com, Nov.2013
Feature 1: Sharing of Content and Data
Solving the Data and Content Bottleneck: Content and Data sharing between different touristic agents
Repository
Touristic OntologyTISs
CMSs
52
TSPs TAs Tirol Werbung
12/2/2013
27
Feature 1: Sharing of Content and Data
Solving the Data and Content Bottleneck: Content and Data sharing between different touristic agents.
• Successful on-line marketing requires high and up-to-date quality on-line content.
• Successful on-line sales requires and up-to-date booking data describing available offers and their• Successful on-line sales requires and up-to-date booking data describing available offers and their prices.
• Providing these at the proper level of quantity, quality, and up-to-date is rather cost-intensive and therefore a major obstacle.
• This problem appears at the layer of the
• Touristic service provider
• Touristic association
• Tirol Werbung
• Large budgets are spent on this issue and many web presences miss a lot of potential due to the related costs.
53
• Therefore, we propose an infrastructure that makes sharing of content and data easy and cheap.
• Based on injecting content and data in our kernel it can be easily exported towards the on-line presence of a certain agent as well as it can be shared with and reused by other touristic agents.
Feature 2: Enrichment
Solving the Data and Content Bottleneck: Enrich with external content and data
Repository
Touristic OntologyTISs
CMSs
54
TSPs TAs Tirol Werbung
External Data (LOD)and Content
12/2/2013
28
Feature 2: Enrichment
Web of Content
The Web is a web of content The Web is a web of content
Content is any textual, visual or aural information that could be found on the
websites.
For a specific entity, e.g. a hotel, we can find on the Web a lot of content
scattered across various websites.
Review sites
55
Sharing sites
Wikis
etc.
Feature 2: Enrichment
• Web of Content
Web of content
VideosTextReviewSites
Wikipedia
VimeoYouTube etc.
56
ImagesFlickr Picasa etc.
12/2/2013
29
Feature 2: Enrichment
― Content can be crawled or retrieved via APIs, like in Hotelnavigator and Trust You services
― Make it accessible through the kernel of the architecture
57
Feature 2: Enrichment
• Web of Documents • Web of Data
Hyperlinks
Typed Links
5858
“Documents”“Things”
12/2/2013
30
Feature 2: Enrichment
• Characteristics:– Links between arbitrary things
• Web of Data
y g(e.g., persons, locations, events, buildings)
– Structure of data on Web pages is made explicit
– Things described on Web pages are named and get URIs
– Links between things are
Typed Links
5959
Links between things are made explicit and are typed
“Things”
Feature 2: Enrichment
• The Web of Data is envisioned as a global database
– consisting of objects and their descriptionsg j p
– objects are linked with each other
– with a high degree of object structure
– with explicit semantics for links and content
• Linked Data is about the use of Semantic Web technologies to publish structured data on the Web and set links between data sources.
6060 Figure from C. Bizer
12/2/2013
31
Feature 2: Enrichment
Facts:• 295 data sets• Over 31 billion triples• Over 504 billion RDF links between data sources
6161
Figure from http://www4.wiwiss.fu-berlin.de/lodcloud/state/, September 2011
Feature 2: Enrichment
• Linked Open Data can be seen as a global data integration platform– Heterogeneous data items from different data sets are linked to each other following the
Linked Data principlesLinked Data principles
– Widely deployed vocabularies (e.g. FOAF) provide the predicates to specify links between data items
• Data integration with LOD requires:1. Access to Linked Data
• HTTP, SPARQL endpoints, RDF dumps
• Crawling and caching
2. Normalize vocabularies – data sets that overlap in content use different vocabularies• Use schema mapping techniques based on rules (e.g. RIF, SWRL) or query languages (e.g. SPARQL
Construct, etc.)
62
3. Resolve identifies – data sets that overlap in content use different URIs for the same real world entities
• Use manual merging or approaches such as SILK (part of Linked Data Integration Framework) or LIMES
4. Filter data• Use SIVE ((part of Linked Data Integration Framework)
62
See: http://www4.wiwiss.fu-berlin.de/bizer/ldif/
12/2/2013
32
Feature 2: Enrichment
• Use LOD to integrate and lookup data about
– places and routes
f– time-tables for public transport
– hiking trails
– ski slopes
– points-of-interest
6363
Feature 2: Enrichment
• Open Streetmap• Google Places • Databases of government
– TIRIS
– DVT
• Tourism & Ticketing association • IVB (busses and trams) • OEBB (trains) • Ärztekammer• Supermarket chains: listing of products • Hofer and similar: weekly offers • ASFINAG: Traffic/Congestion data • Herold (yellow pages)
(y p g )• City archive • Museums/Zoo • News sources like TT (Tyrol's major daily
newspaper) • Statistik Austria
• University of Innsbruck (Curricula, student statistics, study possibilities)
• IKB (electricity, water consumption) • Entertainment facilities (Stadtcafe,
Cinema...) • Special offers (Groupon)
64
12/2/2013
33
Feature 2: Enrichment
• Lots of public data sources
– public transportation - Verkehrsverbund Tirol - Innsbrucker Verkehrsbetriebe (IVB)
– public safety - avalanche - Tyrolean Avalanche Warning Service - traffic - Tyrolean Regional Hazard Warning Centre - weather - Tyrolean Regional Hazard Warning Centre
– public health services - regional hospitals/clinics
65
- UMIT - health and life science university
– culture - cultural events (Tourismusverband Innsbruck, etc.)
- http://www.tirol.gv.at/kunst-kultur/
– and many more …
65
Feature 2: Enrichment
• External content from web site, audio streams, videos and mobile devices
• New touristic services can be provided based on content but requires first:
– Content annotation
– Linking to content
– Enhanced content consumption and delivery
66
12/2/2013
34
Feature 3: Semantic Search
Solving the Search Problem: Increase visibility of contents!
Repository
Touristic OntologyTISs
CMSs
Semantic Search
67
TSPs TAs Tirol Werbung
External Data (LOD)and Content
Feature 3: Semantic Search
Solving the Search Problem: Increase visibility of contents!
• Providing the contents to be searchable is only one step to increase their visibility
A h i d t id l f t t id th t l t d f l h• A search engines need to consider several factors to provide the most relevant and useful search queries, including:
• Intent of the search the results should not based on the specific words used in the query, but more on the intent of the search
• Variations of words should consider tenses, plural, singular, etc.
• Synonyms give same results on any synonyms of the word in query
• Generalized and specialized queries should be able to set relations between generalized and specialized queries
• Concept matching should understand the broad concept of the query
• Natural language queries should be able to process the natural language used by the
68
g g q p g g yusers
• Location of search should be able to provide results based on the current location of the search
12/2/2013
35
Feature 3: Semantic Search
• Semantic search is designed to improve the traditional web searching by improving the search accuracy though the “meaning” of the documents made available for search.
• “Meaning” is established through a semantic model which essentially captures interrelationships• Meaning is established through a semantic model, which essentially captures interrelationships between syntactic elements and their interpretations.
• The semantic models allow for a more precise matching of queries against the documents.
69
• Typically, semantic search engines rely on certain ontology structures which are built from concepts, properties, constraints and possibly axioms of the documents.
Example: Increase the visibility of contents on Mobile Devices
• Typically, the documents are grouped into several main categories and sub-categories as shown at the picture.
• The sub-categories are dynamically shown according to the selection of their main category
• On desktop or personal computer, there is no problem to navigate to those sub-categories, ...
• but on the mobile devices (with limited resources), the sub-categories will be inaccessible decrease the visibility
Th f ti d l i i d t t
70
• Therefore, a semantic model is required to represent those super and sub-categories relationships, such that performing a search to a main category will also include its sub-categories
• all documents will be visible to the search engine including on the mobile devices
12/2/2013
36
Feature 4: Semantic Alignment of Data and Content
Solving the Data and Content
Alignment Problem:
I th i t !Semantic Alignment
Increase the conversion rate!
Repository
Touristic OntologyTISs
CMSs
Semantic Search
71
TSPs TAs Tirol Werbung
External Data (LOD)and Content
Feature 4: Semantic Alignment of Data and Content
Solving the Data and Content Alignment Problem: Increase the conversion rate!
• Achieving visibility through high-quality content, proper dissemination, interaction and engagement with customers are essential corner stones in successful eTourismusengagement with customers are essential corner stones in successful eTourismus.
• However, what matters in the achieving booking volume in terms of numbers of bookings, achieved pricing, and required commission fees.
• Achieving a high conversation rate in relation to successful contents is what matters.
• In consequence, it is essential to align dissemination and communication of content with bookable touristic service offers, i.e., with data related to available booking offers.
• How can this be achieved in a scalable fashion with steadily changing content and context it is presented?
72
12/2/2013
37
Feature 4: Semantic Alignment of Data and Content
• Semantic alignment is the automated process of determining correspondences between different data sources based on a semantic analysis of their relationships.
• A set of correspondences is also called an alignment• A set of correspondences is also called an alignment.
• Natural language analysis and matchmaking of semantic annotations can be used to dynamically establish such correspondences on the fly.
• Based on this different content and data sources of a related topic can be grouped and disseminated together automatically.
73
Feature 4: Semantic Alignment of Data and Content
Show together for: “I want to skii in Mayerhofen”
• Cool content with … booking data and service
74
• Based on corresponding ontological characterization of both.
12/2/2013
38
Feature 5: Semantic Annotations
Provide SEO beyond
keywords and link farms:
S ti A t tiSemantic Alignment
Semantic Annotations
Semantic Annotations
Repository
Touristic OntologyTISs
CMSs
Semantic Search
75
TSPs TAs Tirol Werbung
External Data (LOD)and Content
What are Semantic Annotations?
Feature 5: Semantic Annotations
Innsbruck is the capital city of the federal state of Tyrol (Tirol),
located at 47°16′N 11°23′E.
The name of the Place It is contained in another Place -> Tyrol
In DeutschIn English
76
Refers to Geo coordinates
Talks about a Place
In DeutschIn English
12/2/2013
39
Which is the added value of Semantic Annotations?
Feature 5: Semantic Annotations
annotate
Search EnginesRead and index
77
Feature 5: Semantic Annotations
Which is the added value of Semantic Annotations?
d t dSearch engines understand the content of the pages.
Called “rich snippet” by Google
78
“These rich snippets help users recognize when your site is relevant to their search, and may result in more clicks to your pages.” [4]
pp y g
12/2/2013
40
Feature 5: Semantic Annotations
How do we create Semantic Annotations?
We need:We need:
• A technical way to add them
• A vocabulary with terms about the concept that we want to annotate
Vocabulary created byBing, Google, Yahoo!, Yandex
79
to support the semantic annotationsand speak the same language with the
Description The Grand Hotel Europa combines two worlds[...]
Price range $$$
Payment accepted available credit cards, cash payment
12/2/2013
41
Feature 5: Semantic Annotations
What categories of concepts do exist in schema.org?
For example:For example:
• …
• Event
• Organization, Hotel, Restaurant
• Offer
• Place
• Product
81
• Review, Rating
• …
Feature 5: Semantic Annotations
Summary
In TITi when we talk about Semantic Annotations we imply the usage of the• In TITi when we talk about Semantic Annotations we imply the usage of the schema.org vocabulary.
• The architecture could be extended to support more vocabularies (e.g. the Accommodation Ontology).
• The content is annotated automatically.
82
12/2/2013
42
Feature 6: Multi-channel Communication
Read and write in hundred
thousands of channels:
S l bl M lti h lSemantic Alignment
Semantic Annotations
Scalable Multi-channel
communication
Repository
Touristic OntologyTISs
CMSs
Semantic SearchWeb pages
Social MediaChannels
Mobile Ch l
83
TSPs TAs Tirol Werbung
External Data (LOD)and Content
Channels
Feature 6: Multi-channel Communication
84
12/2/2013
43
Feature 6: Multi-channel Communication
Platform types
• Static Broadcasting• Static Broadcasting
• Dynamic Broadcasting
• Sharing
• Collaboration
• Group Communication
• Semantic-based Dissemination
85
Feature 6: Multi-channel Communication
Static Broadcasting
Homepages / Static Websites
Homepage ExampleStatic Website Example
Entry in Wikipedia for Hotel Goldener Adler
86
Although created through a collaborative process, Wiki websites can be considered static forms of online broadcasting as the information contained in them remains the same for long periods of time.
12/2/2013
44
Feature 6: Multi-channel Communication
Dynamic Broadcasting
• Small piece of content that is dependent upon constraints such as time and p p plocation.
• With Web 2.0 technologies have created dedicated means for publishing streams and interacting with content generated by users.
• Examples of platforms types (organized considering first the length of message and second – the level of interactivity):
N F d RSS
87
• News Feeds: e.g. RSS
• Newsletters
• Email / Email lists
• Microblogs: e.g. Twitter, Tumblr
• Blogs: e.g. Blogger, BuzzFeed
• Social networks: e.g. Facebook, Google+, LinkedIn, Xing
• Chat and instant messaging applications: e.g. Skype, Talk, Meebo.
Feature 6: Multi-channel Communication
Sharing
• There are a large number of Web 2.0 websites that support the sharing of g pp ginformation items such as: bookmarks, images, slides, and videos, etc.
• Provided by hosting services (images, videos, slides are stored on a server).
• Can use specialized applications (see below) of features of other platforms and services (e.g. share photos through Facebook).
88
• Examples: • Picture sharing: e.g. Flickr, Instagram, Picasa, Pinterest
• Slide sharing: e.g. Slideshare, MyPlick, Slideboom, Prezi
• Video sharing: e.g. YouTube, Vimeo, Videolectures
• Social Bookmark sites: e.g. Delicious, Digg, StumbleUpon
• Social News websites: e.g. Reddit.
12/2/2013
45
Feature 6: Multi-channel Communication
Collaboration
• Collaborative software helps to facilitate action-oriented teams working together p g gover geographic distances, and by providing tools that aid communication, collaboration and the process of problem solving.
• Examples: • Wikis: e.g. Wikipedia.
• Collaborative tagging: adding metadata to shared content, e.g. Delicious.
• Document & Application collaboration:
89
e.g. Google Docs, EtherPad.
Feature 6: Multi-channel Communication
Group Communication
• Platforms for sharing and exchanging information but also to collecting feedback g g g gor discussing certain issues.
• Examples: • Social networks: e.g. Facebook, Google+, My Space, Xing, LinkedIn.
• Internet forums: e.g. Quora, Ask.com.
• Online discussion groups: e.g. Google Groups, Facebook Groups, Yahoo! Groups, Meetup, GroupSpaces, Windows Live Groups.
90
12/2/2013
46
Feature 6: Multi-channel Communication
Semantic-based Dissemination• Scope: add machine-processable semantics to the information
Search and aggregation engines can provide much better service in finding and gg g g p gretrieving information.
• Search Engine Optimization• Are potential customers finding your web site?
• Is it possible that potential customers might not be aware that your site exists?
• Do your targeted search terms have high search engine rankings?
• Does your website attract a large number of daily visitors?
Semantic Search:
91
• Semantic Search:• Semantic search tries to understand the searcher’s intent and meaning of the query instead of
parsing the keywords like a dictionary.
Feature 6: Multi-channel Communication
Semantic-based Dissemination
• A (Semantic Web) vocabulary can be considered as a special form of (usually light-weight) t l ti l l ll ti f URI ith ( ll i f ll ) d ib dontology, or sometimes also merely as a collection of URIs with an (usually informally) described
meaning*.
• URI = uniform resource identifier
• Semantic vocabularies include: Schema.org, FOAF, Good Relations, Dublin Core.
• Format is an explicit set of requirements to be satisfied by a material, product, or service.
• The most known examples are RDF and OWL.
• Implementation realization of an application, plan, idea, model,
Format e.g. RDFa
92
p pp , p , , ,
or design
• Semantic repositories.Implementation e.g. OWLIM
Vocabulary e.g. foaf
* http://semanticweb.org/wiki/Ontology
12/2/2013
47
Feature 6: Multi-channel Communication
Semantic-based Dissemination: Formats
<div xmlns:dc=http://purl.org/dc/elements/1.1/about="http://www example com/books/globaltourism">
• HTML Meta Elementsabout= http://www.example.com/books/globaltourism > <span property="dc:title">Global Tourism</span> <span property="dc:creator">William Theonbald</span><span property="dc:date">2004‐10‐01</span>
</div>
• RDFa
• OWL (OWL-Lite, OWL-DL,
OWL-Full, OWL2)
• RIF
• Microformats<ul class="vcard">
li l "f " J D /li
RDFa usage example
93
• Microformats
• Microdata
• RDF
• SPARQL
<li class="fn">Joe Doe</li> <li class="org">The Example Company</li> <li class="tel">604‐555‐1234</li><li><a class="url“
• Schemas for a large number of concepts and domains, such as creative works (e.g. movies, music, TV, shows), places, products, organizations, lodging businesses, etc.
• FOAF
• Uses RDF to describe the relationship people have to other “things” around them.
• FOAF permits intelligent agents to make sense of the thousands of connections people have with each other, their jobs and the items important to their lives.
• Good Relations
94
• A lightweight ontology for annotating offerings and other aspects of e-commerce on the Web.
• The only OWL DL ontology officially supported by both Google and Yahoo.
• Dublin Core
• A set of vocabulary terms used to describe a full range of web resources: video, images, web pages etc. and physical resources such as books and objects like artworks.
12/2/2013
48
Feature 6: Multi-channel Communication
Channels:
• A channel is part of a platform specified by the type of information item it can handle.
<Platform>::<InformationItemType>::<AccountUID>
– Facebook :: Event :: STI account
– Facebook :: Image :: STI account
– Facebook :: Link :: STI account
– Facebook :: Video :: STI account
– Google+ :: MicroBlogPost :: STI account
G l I STI t
95
– Google+ :: Image :: STI account
– Twitter :: Link :: STI account
– YouTube :: Video :: STI account
Feature 6: Multi-channel Communication
Challenges:
• Scalability– The overwhelming amount of available communication channels cannot be easily
managed by content publishers
• Costs– Social Media experts needed to handle communication channels
– Not feasible for hoteliers
• Domain personalization
96
– Adaptation, alignment and definition of the content for several channels
– Generic and automatic solutions can be personalized to any domain
• Bilateral communication– Feedback and engagement
– Reputation management
12/2/2013
49
Feature 6: Multi-channel Communication
Web/Blog Conte
Content Semantic Alignment
Semantic AnnotationsCollect feedback
statistics
Social Web
Weaver
ent Extractio
n an
d Public
t/Chan
nel m
apping &
ad
Repository
Touristic Ontology
Semantic Search
Semantic Alignment
97
Web 3.0/Mobile/Other
cation Laye
r
aptatio
n laye
r
Distribute content
Feature 6: Multi-channel Communication
Dynamic rules:
• Content is mapped & adapted to specific channels using business rules, pp p p g ,defined by domain experts
• Domain ontologies applied to map concepts with channels in rules
• Publication can be scheduled
• Feedback is collected to improve the engagement and adapt dissemination rules
whenThere is a new Event ev
98
There is a new Event evthen
Publish ev in Facebook channelPublish ev in Drupal/News channel
12/2/2013
50
Feedback of Dissemination
Feedback collection:
Refers to the response of an a dience to a message or acti it• Refers to the response of an audience to a message or activity.
• Giving the audience a chance to provide feedback is crucial for maintaining an opencommunication climate.
• Views and clicks
• Unary feedback
• Binary feedback
99
• Binary feedback
• Ratings
• Re-publication
• Comments:
Feature 6: Multi-channel Communication
Engagement process =
Infinite loop between the listening and responding steps, interweaving publishing and listeningpublishing and listening
Listen Analyze Understand Respond
In order to retrieve the desired information, several API calls have to be performed at the dissemination channels:• Fetching the amount of comments of a post
• Fetching all comment of a post
100
g p
• Publish a comment to a post
12/2/2013
51
Outlook
101STI Innsbruck
Feature 7: Additional Services
Additional services can be provided:
• Data analytics and swarm-based yield management
D i k i• Dynamic packaging
• Data Value chain
• Governmental reporting and governance
Semantic Alignment
Semantic Annotations
Service Layer
102
Repository
Touristic OntologyTISs
CMSs
TSPs TAs Tirol Werbung
External Data (LOD)and Content
Semantic Search
g
Web pages
Social MediaChannels
Mobile Channels
12/2/2013
52
Data analytics and swarm-based yield management
• “is the discovery and communication of meaningful patterns in data. Itrelies on the simultaneous application of statistics, computer
f fprogramming and operations research to quantify performance.Analytics often favors data visualization to communicate insight.”
http://en.wikipedia.org/wiki/Analytics
•Large amount of data are usually process in order to uncover hidden patterns, unknown correlations or other useful information
103
•That information can provide:– Advantage over competitors
– Business benefits
– Effective marketing and increased revenue
Data analytics and swarm-based yield management
• Typical problems data analytics is used to:– Make predictionsp
Data analytics and swarm-based yield management in Tourism
• Used for revenue management, capacity and network planning, inventory management, dynamic pricing, last-minute reservations
• Enables better understanding of tourists’ needs and preferences, which can be latter used to design a highly-customized sales and service process to meet those needs
• Helps revealing and analyzing causal relationships that explain the attractiveness of hotels or destinations and preference patterns of
106
attractiveness of hotels or destinations and preference patterns of tourists
12/2/2013
54
Data analytics and swarm-based yield management in Tourism
• Hotels are confronted with a multitude of online booking channels.
• Hotels should provide their available rooms and their rates to most if not all of the channels to prevent not meeting their potential customers.
• In many channels, visibility is achieved through low prices.– However, often channels also require constraints on the price offers in other channels.
• Some channels generate costs without guaranteeing actual income.
107107
Data analytics and swarm-based yield management in Tourism
• A multi-directional multi-channel approach also must rely on Swarm intelligence.
• Observing in real time the reaction of customers and competitors is the key to achieving on-line marketing.
• Adopting your offer and your price dynamically in response to the behavior of your (on-line visible) environment becomes a key for economic success http://en.wikipedia.org/wiki/Swarm_intelligence
• Many solutions to yield management are based on complex statistical methods and complex domain assumptions on how variation of the price can influence the amount of bookings of a service
108
price can influence the amount of bookings of a service.
108
12/2/2013
55
Data analytics and swarm-based yield management in Tourism
• Swarm intelligence support is an essential feature of the yield management.
• Yield management could be realized utilizing reputation and usage values collected from
– different channels, as well as
– tourism information systems.
• Key is the data and content available in Titi!
109109
Dynamic packaging
• Dynamic packages play a more and more important role in tourism
• Tourists want to buy not only single services but service packages
• Most touristic platforms offer the possibility to book a single service (e.g. hotel, flight, car, events) but few provide service packages
• Very often in order to build interesting packages, services offered
110
y g p g ,by different online platform needs to be combined
12/2/2013
56
Dynamic packaging
• Data and services from from various providers can be integrated for booking g gpackages containing:
– Flight
– Hotels
– Restaurants
– Cultural and entertainment events
– Sightseeing
– Shops
111111
Dynamic packaging
• Service packaging involves:
– Understanding requirements and preferences
– Finding relevant services that can fulfill part of the overall request
– Composing relevant services from various sources in packages
– Having a single price for service packages
– Selecting and recommending the optimal
112
Selecting and recommending the optimal service package
– Booking of the service package
– Realize player technology with our architecture
12/2/2013
57
Data Value Chain
“Your data is worth more if you give it away.”
Commission Vice President
Neelie Kroes
113
Data Value chain
• A "DATA Value” based economy driven by the open data strategy
• It will enable and foster best possible social and commercial added value based on intelligent use, management and re-use of data sources in Europe.
• This will lead to– increased business intelligence and efficiency of private and public
sectors
114
sectors
– world class applications
– new business opportunities involving SMEs - (open) data friendly policy and business environment
Based on Márta Nagy-Rothengass – Head of Unit, EC DG Information Society and Media keynote talk “Leveraging the data potential in Europe” at EDF2012
12/2/2013
58
Data Value chain
• Non tangible assets (i.e. data) play a significant role in creation of the economic value.
Data is nowadays more important than for example• Data is nowadays more important than for example search or advertisement.
• The value of the data, its potential to be used to create new products and services, is more important than the data itself.
• New businesses can be built on the back of this data.
115
• Data is an essential raw material for a wide range of new information products and services.
• Facilitating re-use of this raw data will create jobs and thus stimulate growth.
Data Value chain
• Open Data can be integrated into new products and services
• A whole new industry implementing services on top of large data sets is emerging.
• So…Data Value Chain in the Tourism domain refers to…
Chain of activities performed to get profitable value of the Touristic data through different phases.
116
12/2/2013
59
Data Value Chain
To exploit data value chain, TiTi offers:
Service Layer• Integrated and uniform data layer
th t b d f diff t
• To export EDF (Einheitliches Daten Format), OTDS
Repository
Touristic Ontology
Semantic Search
Semantic Alignment
Semantic Annotations
that can be used for different purposes in the tourism domain.
• Hub functionalities: allows to repeat through all its output channels an input data.
(Open Travel Data Standard).
• Semantic infrastructure to be used under a multi-purpose service layer for different touristic activities.
Which can be used…
117
-To promote coordination between different players Coordination of services
E.g Travel agency + hotel+ Tour+ Bus services
Data Value Chain
Example in the Touristic area: Tourism Association
EDF
Resell and package touristic offers
Semantic Search
Semantic Alignment
Semantic Annotations
Service Layer
EDFGlobal TypsODTS
118
Repository
Touristic Ontology
Hotel
Rooms availability
12/2/2013
60
Governmental reporting and governance
• Governments need data from strategic sectors– To measure performance, analyze risks, foresee trends…
– To inform strategic decisions
– For policy making
• Opening up data will give allow businesses to– comply with regulations
– sell data-related services to government
– improve collaboration with local governments
– provide insight into investment and innovation
119
– support public-private partnerships
• Example in the touristic sector – Data from bookings can be used to analyze tourism trends, make new policies, target
specific groups…Based on “Open Data: Driving growth, ingenuity and innovation”, Deloitte 2012. Available online at http://www.deloitte.com/assets/dcom-unitedkingdom/local%20assets/documents/market%20insights/deloitte%20analytics/uk-insights-deloitte-analytics-open-data-june-2012.pdf
Summary
120STI Innsbruck
12/2/2013
61
The Problem
• The number of people who plan and book their travel activities online is steadily growing.
• This confronts touristic service providers with many new challenges– To attract potential customers they have to be present in a multitude of channels
in various formats and adaptions.
– The content in the various channels should be of high quality and originality.
– There should be multiple online booking opportunities for their offers (fluid booking).
– Multi channel yield management and revenue optimization is necessary.
121
HOTEL RECEPTION
The overall picture
Service Layer
Repository
Touristic OntologyTISs
CMSs
Semantic Search
Semantic Alignment
Semantic Annotations
Web pages
Social MediaChannels
Mobile Channels
122
TSPs TAs Tirol Werbung
External Data (LOD)and Content
12/2/2013
62
Key elements of our solution
The Kernel:
•A repository for structured data management and digital preservation.
•An Ontology to mediate between different data models•An Ontology to mediate between different data models.
•The import from Touristic Information Systems (TISs).
•The import and export from Content Management Systems (CMS).
Touristic OntologyTISs
123
Repository
Touristic Ontology
CMSs
Key elements of our solution
The Features:1.The sharing of content and data between different touristic agents.
2 The enrichment with external content and data2.The enrichment with external content and data.
3.Increased visibility of content using Semantic Search.
4.Increased conversion rate using Semantic Alignment of Data and Content.
5.SEO beyond keywords and link farms using Semantic Annotations.