This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Mastering Enterprise Metadata with Seman2c Modeling
Enterprise Metadata: The descrip4on of the organiza4onal context – processes, roles, policies, products and offerings, etc. – that are implicitly part of the enterprise informa4on ecosystem. Seman2c Modeling: A data model linked to the real world through a conceptual model
If we combine enterprise metadata with an enterprise seman4c web ini4a4ve, we can create a knowledge fabric that can completely change the way we think of enterprise soPware
The seman4c model is not just part of the metadata, it is the metadata
SID: ORCL
Seman2c Model: A data model linked to the real world through a conceptual model
Ted
A person in the real world, who just happens to be playing the role of the customer at this
point in 4me
We need a legal name for this person: A name registered with the government body within whose legal jurisdic4on we are
engaging with this person
An address is not just a couple of lines. Is it a residence? Does the customer have “in the town / in the country” homes? What sort of neighborhood is the house in?
The state that issued the license tells us the primary state of domicile of the person. The date of first issue tells us how long the person has been in the state
Data in & about a record also gives us technical context: which table in which database on
which server. When was this record created & by whom
John Smith is a customer of the bank with a large investment porQolio. He is a cau4ous investor with a preference for energy stocks and commodity futures. PB banker is James M
Real world facts
<a human >
“John Smith”
Has name
<a por_olio account>
Operates
<a banker> [James M]
Is advised by
<a security holding> includes
<a security>
For Security
<a company> [Venture Solar]
<a corporate ac4on>
ini4ates Is no4fied
of
Issued By
<a banker> [Frank K]
Is advised by
plans
Public Informa4on
Works with
“Energy”
Ac4vity Area
plans
<a country> (Germany)
Tax Jurisdic4on
<a company> Benthik
Petroleum
Will become subsidiary of
<a country> (Syria)
Operates in
Has posi4ons in Headquartered in
<a state> [California]
<LOB> [Private Bank]
Works in
Works in
<LOB> [Corporate Advisory
Services]
John Smith holds posi2ons in Venture Solar, an energy company, currently headquartered in California, USA
John Smith currently works in Germany and is subject to German tax laws
Venture Solar is corporate customer. Frank K from corporate advisory services is currently helping them put together a reverse merger with Benthik Petroleum
Benthik Petroleum is authorized to operate in Syria
<a planned corporate ac4on>
Non public Informa4on
Taking this one step further… express actual data as graphs as well
• Build repository & import meta data • Boqom up seman4c modeling • Import Business Process Models • Import Enterprise Architecture models • Import Interface specifica4ons • Wrap data sources with SPARQL
Enterprise Architecture
Tasks
Basic Ontologies for each Domain
Interrogate and discover business & IT knowledge across the en2re ecosystem
Deep, complete seman4c models are very difficult to manage in a project context, and do not add significant value in the integra4on phase
Lessons Learned – Phase 1
Avoid “shortcuts” – model the real world
Tradi4onal database designs use a variety of short cuts to make real world complexity manageable. Le}ng these propagate into the seman4c model results in an ontology that is specific to the project context, and therefore does not survive well into later phases
Avoid abstract classes
Without a context to anchor them on, discussions on abstract classes tend to go into free fall, and added liqle to know value
Evolve classes & taxonomy from defining characteris2cs
Explicitly declared taxonomies proved difficult to reverse engineer in later phases of the project
If a rule is not easy to define, check the model first
• Majority of cases where a “reasonable” rule was proving difficult to implement, the root cause was a poor model of the real world. Fixing the model made rule defini4on easier
Retain graph models through all the layers
• When consuming seman4c models in Java or other non-‐seman4c languages, we learned to retain the graph models through all applica4on 4ers
• Majority of developers do not use Java objects as pure “logical models” of the real world. Instead they use families of classes to enable implementa4on of logic and ensure maintainability
• Retaining the seman4c model through all 4ers was the only way to retain the Ontology overlays and enable use of business logic in all 4ers
Gaps in governance of ontology, rule and data cura4on will kill projects
• Errors in ontology and rule models are rela4vely painless to fix, they can require people to walk back mul4ple threads of thought, which can be painful
Data graphs • Quads, not tuples (nothing in the default graph) • Reified statements where provenance is important
Ontology Encoding • OWL 2.0 • Tight version control • All ontologies must be uploaded to the repository
Elements in the Solu2on
Repositories • Single scalable repository for meta data • En4tlement enforced at named graph level • Federated front end only to merge meta data with actual data
Inference levels • Inferred rela4onships • Inferred taxonomy • Seman4c Web Rule Language = declara4ve rules. Rules produce new facts based on exis4ng facts in the model • Rule Interchange Format = forward & backward chaining + other business rules variants: given an outcome, can figure out the star4ng configura4on that would lead to this result
Interfaces • Interac4ve SPARQL endpoint for power users • Custom HTML 5.0 screens + canned queries + model naviga4on for occasional users • Import + export through RDF / TTL / TRIG format. Convert to RDF / TTL / TRIG using custom Java code
THANK YOU About MphasiS MphasiS an HP Company is a USD 1 billion global service provider, delivering technology-‐based solu4ons across industries, including Banking & Capital Markets, Insurance, Manufacturing, Communica4ons, Media & Entertainment, Healthcare & Life Sciences, Transporta4on & Logis4cs, Retail & Consumer Goods, Energy & U4li4es and Governments around the world. MphasiS’ integrated service offerings in Applica4ons, Infrastructure Services and Business Process Outsourcing help organiza4ons adapt to changing market condi4ons and derive maximum value from IT investments. For more informa4on about MphasiS, log on to www.mphasis.com
Presenta2on by Suresh Nair Vice President & Chief Architect, Banking & Cap Markets