Introduction Motivation Math. Semantics Ontology Integration & Interoperability Software Verification Conclusion Linking Big Data to Rich Process Descriptions Christoph Lange 1 1 Project ‘‘Formal Mathematical Reasoning in Economics’’, School of Computer Science, University of Birmingham, UK http://cs.bham.ac.uk/~langec 2013-09-19 Lange Linking Big Data to Rich Process Descriptions 2013-09-19 1
Linked (Open) Data is one key to coping with Big Data: it enables decentralised, collaborative management of big datasets, low-overhead information retrieval, and scalable reasoning. Big Data are created or consumed by technical processes or business processes. Their formal description, e.g. for software verification or compliance checking, requires logics whose complexity far exceeds that of the data. Restricting LOD to the RDF logic does not allow for integrating rich process descriptions with the data that these processes create, and therefore does not enable knowledge management, information retrieval and reasoning to take full advantage of rich background knowledge. In this talk I demonstrate different frontiers at which I have worked towards achieving an integration of process descriptions and data.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
2011 Ph.D. (Jacobs Univ. Bremen, with MichaelKohlhase): Enabling Collaboration onSemiformal Mathematical Knowledge bySemantic Web Integration [Lan11]
2011–12 Postdoc (Univ. Bremen, with John Bateman,Till Mossakowski): Ontology Integration andInteroperability (OntoIOp)↝ DistributedOntology Language (DOL) OMG standard [13]
2012–13 Postdoc (Univ. Birmingham, with ManfredKerber, Colin Rowat): Formal MathematicalReasoning in Economics (ForMaRE) [KLR]
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 2
Linked data is one key to coping with big data.Big data are created or consumedby technical/business processes.Formal process descriptions aremore complex than data.Why integrate process descriptions and data?How to integrate them?
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 3
Knowledge Mgmt. Under what experimental setupwere these measurements taken?
Reasoning Given the current variance ofmeasurements, would it help to use asensor with different specifications?Which trading strategy responds bestto the current offers?
Inform. Retrieval Which of my friends are actuallyinterested in my latest video upload?Where can I buy the cheapest parts tofeed into my manufacturing process?
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 8
RDF data and RDFS vocabularies do not suffice formodelling processes – so . . . ?
☀ make your stuff available on the Web(whatever format) under an open license☀☀ make it available as structured data (e.g.,Excel instead of image scan of a table)☀☀☀ use non-proprietary formats (e.g., CSVinstead of Excel)☀☀☀☀ useURIs to denote things, so that peoplecan point at your stuff☀☀☀☀☀ link your data to other data to providecontext [12]
Who says it needs to be RDF?
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 9
RDF data and RDFS vocabularies do not suffice formodelling processes – so . . . ?
☀ make your stuff available on the Web(whatever format) under an open license☀☀ make it available as structured data (e.g.,Excel instead of image scan of a table)☀☀☀ use non-proprietary formats (e.g., CSVinstead of Excel)☀☀☀☀ useURIs to denote things, so that peoplecan point at your stuff☀☀☀☀☀ link your data to other data to providecontext [12]
Who says it needs to be RDF?Lange Linking Big Data to Rich Process Descriptions 2013-09-19 9
How to validate the derived values?How to compute them for new data points?How to collect data points and their dependencies?
Make themathematical semantics explicit!unemp. rate = unemployed
population ⇒ link to “division”(→OpenMath Content Dictionaries) [Vra+10; Lan10]OpenMath CDs are LOD: decentrally extensibleFuture work: OpenMath SPARQL entailment regime
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 12
Towards Device InteroperabilityAmbient Assisted Living ScenarioClara, a vegetarian, instructs her wheelchair to get her tothe kitchen (next door to the living room). For dinner,she would like to take a pizza from the freezer and bakeit in the oven. Afterwards she goes to bed.
Existing ontologies (e.g. OpenAAL) cover core of that:
. . . but not all required concepts (e.g. foodingredients⇒ need other ontologies/modules; tapinto the Web of (Product, Geo) Data). . . not necessarily at the required level ofcomplexity (e.g. space/time⇒ need other logics)
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 14
Towards Device InteroperabilityAmbient Assisted Living ScenarioClara, a vegetarian, instructs herwheelchair to get herto the kitchen (next door to the living room). Fordinner, she would like to take a pizza from the freezerand bake it in the oven. Afterwards she goes to bed.
Existing ontologies (e.g. OpenAAL) cover core of that:
. . . but not all required concepts (e.g. foodingredients⇒ need other ontologies/modules; tapinto the Web of (Product, Geo) Data). . . not necessarily at the required level ofcomplexity (e.g. space/time⇒ need other logics)
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 14
Towards Device InteroperabilityAmbient Assisted Living ScenarioClara, a vegetarian, instructs herwheelchair to get herto the kitchen (next door to the living room). Fordinner, she would like to take a pizza from the freezerand bake it in the oven. Afterwards she goes to bed.
Existing ontologies (e.g. OpenAAL) cover core of that:. . . but not all required concepts (e.g. foodingredients⇒ need other ontologies/modules; tapinto the Web of (Product, Geo) Data)
. . . not necessarily at the required level ofcomplexity (e.g. space/time⇒ need other logics)
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 14
Towards Device InteroperabilityAmbient Assisted Living ScenarioClara, a vegetarian, instructs herwheelchair to get herto the kitchen (next door to the living room). Fordinner, she would like to take a pizza from the freezerand bake it in the oven. Afterwards she goes to bed.
Existing ontologies (e.g. OpenAAL) cover core of that:. . . but not all required concepts (e.g. foodingredients⇒ need other ontologies/modules; tapinto the Web of (Product, Geo) Data). . . not necessarily at the required level ofcomplexity (e.g. space/time⇒ need other logics)
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 14
Different Devices = Different LogicsLight Switch: propositional logic “switched on if andonly if someone is in and it’s dark outside”light_on ≡ person_in_room ∧ dark_outsideFreezer: description logic (Pizza ontology) “alltoppings of a vegetarian pizza are vegetarian”VegetarianPizza ≡ Pizza ⊓ ∀hasTopping.VegetarianWheelchair: first order logic (RCC-style spatialcalculus) “two areas are either the same, or intersect, orborder, or separate, or one is part of the other”∀a1, a2.equal(a1, a2) ∨ overlapping(a1, a2) ∨bordering(a1, a2) ∨ disconnected(a1, a2) ∨part_of(a1, a2) ∨ part_of(a2, a1)
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 17
OntoIOp (Ontology Integration andInteroperability) initiative started in 2011 with ISO
now continued with OMGRequest for Proposals to be issued this autumnproposals due Dec. 2014
50 experts participate, ∼ 15 have contributedRelevant communities represented:
different ontology languages and logicsconceptual and theoretical foundationstechnical foundationsapplications: manufacturing, business rules,model-driven software engineering
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 18
Importance of AuctionsAuctions: a mechanism to distribute resourcesApplications eBay, mobile spectrum, internet domainsSignificance $268.5 billion in 2008 in the US
Given a set of bids on goods (proxying valuations)Goals give goods to those valuing themmost
determine pricesmaximise revenueattract participantsincentive compatibility(no need for tactic over-/underbidding)
Auctions are designed; properties are tested andproved.
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 23
Formal descriptions help to understand, verify andimprove processes in general.Process executions create or consume data.Integrating process descriptions and data improves
A wider view on linked data (beyond RDF) helps tointegrate . . .
process descriptions(often ≥ first-order logic; expressive)big data created or consumed by processes(often RDF; scalable)
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 28
References
References I
5 star Open Data. Apr. 3, 2012. url:http://5stardata.info/ (visited on 2013-09-18).
OntoIOp (Ontology Integration and Interoperability)Standard Development Initiative. 2013. url:http://ontoiop.org (visited on 2013-09-18).
L. M. Ausubel and P. Milgrom. “The Lovely but LonelyVickrey Auction”. In: Combinatorial auctions. Ed. byP. Cramton, Y. Shoham, and R. Steinberg. MIT Press,2006. Chap. 1, pp. 17–40.
M. A. Beyer and D. Laney. The Importance of ‘Big Data’:A Definition. June 21, 2012. url:http://www.gartner.com/resId=2057415.
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 29
References IIF. Badra, F.-P. Servant, and A. Passant. “A SemanticWeb Representation of a Product RangeSpecification based on Constraint SatisfactionProblem in the Automotive Industry”. In: Proceedingsof the 1st Workshop on Ontology and Semantic Web forManufacturing, Extended Semantic Web Conference.(Hersonissos, Crete, Greece, May 29, 2011). Ed. byA. García Castro, C. Toro, L. Ramos, and L. Schröder.CEUR Workshop Proceedings 748. Aachen, 2011,pp. 37–50. url: http://ceur-ws.org/Vol-748/.
M. B. Caminati, M. Kerber, C. Lange, and C. Rowat.Proving soundness of combinatorial Vickrey auctionsand generating verified executable code. 2013. arXiv:1308.1779 [cs.GT].
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 30
P. Cramton, Y. Shoham, and R. Steinberg, eds.Combinatorial auctions. MIT Press, 2006.
M. Kerber, C. Lange, and C. Rowat. ForMaRE. FormalMathematical Reasoning in Economics. url: http://cs.bham.ac.uk/research/projects/formare/(visited on 2013-02-10).
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 31
C. Lange, T. Mossakowski, O. Kutz, C. Galinski,M. Grüninger, and D. Couto Vale. “The DistributedOntology Language (DOL): Use Cases, Syntax, andExtensibility”. In: Terminology and KnowledgeEngineering Conference (TKE). (Madrid, Spain,June 20–21, 2012). Ed. by G. Aguado de Cea,M. C. Suárez-Figueroa, R. García-Castro, andE. Montiel-Ponsoda. 2012, pp. 33–48. arXiv:1208.0293 [cs.AI]. url: http://oeg-lia3.dia.fi.upm.es/tke2012/proceedings.
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 32
T. Mossakowski, O. Kutz, and C. Lange. “ThreeSemantics for the Core of the Distributed OntologyLanguage”. In: Formal Ontology in InformationSystems. 7th International Conference (FOIS 2012).(Graz, Austria, July 24–27, 2012). Ed. by M. Donnellyand G. Guizzardi. Frontiers in Artificial Intelligenceand Applications 239. (The paper has won the bestpaper award. Also published at IJCAI 2013 track on BestPapers in Sister Conferences.) Amsterdam: IOS Press,2012, pp. 337–352. url:http://interop.cim3.net/file/pub/OntoIOp/Publications/FOIS_2012/paper.pdf.
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 34
D. Vrandečić, C. Lange, M. Hausenblas, J. Bao, andL. Ding. “Semantics of Governmental Statistics Data”.In: Proceedings of WebSci’10: Extending the Frontiers ofSociety On-Line. Web Science Trust, 2010. url:http://journal.webscience.org/400/.
Lange Linking Big Data to Rich Process Descriptions 2013-09-19 35