This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
1The company, product and service names used in this web site are for identification purposes only.
The key feature offered by semantics – and in particular Ask Data Anything - is adding additional layers on top of data (which are not explicitly in the data itself) e.g., ask for results over cities, countries, continents when the
data only contains information about cities.
A way to achieve this is by defining a structure of knowledge for any sort of domain (taxonomies), with: nouns representing classes of objects
Ontologies & Ada dimensions (Taxonomy embodiment)ada
dimensions
localization
city
country
continent
…
temporal
day
month
year
…
hierarchical
goods
clothing
shoe
sport-shoes
high-heel-shoes
pants
jeans
Shorts
…
Poland is a country. Germany is a country.Krakow is a city.Warsaw is a city.Krakow is-located-in Poland.Warsaw is-located-in Poland.Hamburg is a city.Berlin is a city.Hamburg is-located-in Germany.Berlin is-located-in Germany.
Every column in the input data is marked With one of the data dimensions and with a Concept (e.g., localization and city)
Every location is an ada-dimension. Every country is a location. Every city is a location. Every continent is a location.
Every hierarchical is an ada-dimension. Every brand is hierarchical. Every status is a hierarchical. Every vendor is hierarchical. Every good is hierarchical.
Every temporal is an ada-dimension. Every second is temporal. Every month is temporal. Every year is temporal.
Technically, Ask Data Anything is capable of performing projection, sub-setting and aggregation operations, providing answers for queries involving the following information:
What quantitative field to use?
How the output is to be displayed?
Where (Optional) to restrict the results? Seizes containment relation
By what to aggregate? (Optional) – needs an aggregating operation to be provided.
Aggregating operation (Optional) – tied to the type (e.g. double, string) of What
Data and Models are tightly coupled, as models provide an interface to query the data so everything aimed to be queried needs to be modeled in the underlying ontology.
In the case of csv data, Ontorion Text Mining AddIn for excel allows to extract meaningful information from raw data, helping in the taxonomy creation process