Earley & Associates, Inc. | Classification: PUBLIC USE Copyright © 2012 Earley & Associates, Inc. All Rights Reserved. Semantic Technologies and the Public Sector January 24 th , 2012 Seth Earley, CEO, Earley & Associates
Mar 24, 2016
Earley & Associates, Inc. | Classification: PUBLIC USE Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Semantic Technologies and the Public Sector
January 24th, 2012Seth Earley, CEO, Earley & Associates
2Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• Co-author of Practical Knowledge Management from IBM Press • 17 years experience building content and knowledge
management systems, 20+ years experience in technology• Former Co-Chair, Academy of Motion Picture Arts and Sciences,
Science and Technology Council Metadata Project Committee• Founder of the Boston Knowledge Management Forum• Former adjunct professor at Northeastern University• Editor Information Professional Magazine from the IEEE• Guest speaker for US Strategic Command briefing on knowledge
networks• Currently working with enterprises to develop knowledge and
digital asset management systems, taxonomy and metadata governance strategies
• Founder of Taxonomy Community of Practice – host monthly conference calls of case studies on taxonomy derivation and application. http://finance.groups.yahoo.com/group/TaxoCoP 150+calls since 2005
• Co-founder Search Community of Practice:http://tech.groups.yahoo.com/group/SearchCoP
Seth Earley, Founder & CEO, Earley & Associates
3Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• What are semantic technologies?• What are some practical applications in the public sector? • What do we need to do to prepare for semantic technology adoption?
Agenda
4Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
4Copyright © 2011 Earley & Associates, Inc. All Rights Reserved.
What do we mean by semantics?
The study of language, study of linguistic meaning, the meaning of words, patterns of concepts
Meanings applied to symbols (words are symbols)
Semantics means the study of meaning
(Circular logic definition)
5Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
5Copyright © 2011 Earley & Associates, Inc. All Rights Reserved.
What do we mean by semantics?
“Implied meaning of data”
6Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
6Copyright © 2011 Earley & Associates, Inc. All Rights Reserved.
More Hype?
Isn’t everything Semantic?
Taxonomies are semantic
Search is semantic
Data integration is semantic
7Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
7Copyright © 2011 Earley & Associates, Inc. All Rights Reserved.
Applied Semantics
• The ability to model concepts, attributes and relations to support the integration of heterogeneous applications and systems and enhance the value of information
• Web services (SOA – Services Oriented Architecture) are built on common definitions and attributes
• The goal is to allow applications to be assembled into new configurations in order to support unanticipated user needs (think Mashup of Google Maps with crime statistics data)
8Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
8Copyright © 2011 Earley & Associates, Inc. All Rights Reserved.
Applied Semantics
• Data integration (Linked Data) is a class of semantic technology
• Linked Data consists of approaches for connecting structured and unstructured data on the web.
• A mash-up leverages Linked Data, but also leverages application functionality based on Web Services
• Enterprise search applications and content management tools are increasingly adopting semantic technology constructs
9Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• Consistent ways of representing concepts• Agreed upon terminology• Mechanisms for leveraging concept relationships• Algorithms to extract meaning make inferences or ‘reason’
based on logical models and heuristics (rules synthesized as “artificial intelligence”)
Characteristics of Semantic Approaches
10Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• Consistent ways of representing concepts• Agreed upon terminology
Characteristics of Semantic Approaches
11Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Consistent Concepts and Agreed Upon Terminology
Sou
rce:
Lou
rose
nfel
d.co
m
Structural metadata – that’s goodNon-interoperable – that’s bad
Standardization on Dublin Core
Structural interoperabilityvia Dublin Core -- better
Inconsistent semantic valuesremain a problem
Semantic merging – Ideal…
12Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• … make inferences or ‘reason’ based on logical models and heuristics… ‑ Query: I am looking for photos of people smiling
• When you become happy you smile• You become happy when someone you love accomplishes a milestone• Taking ones first step is a milestone• Parents love their children
Characteristics of Semantic Approaches
Title: A mother watching baby take first steps
13Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
13Copyright © 2011 Earley & Associates, Inc. All Rights Reserved.
Public Sector Initiatives
• National program supported by the federal government to support people and organizations engaged in integration efforts
• Develops common vocabularies around integration approaches
• Associations, federal agencies, states and industry collaborate to share approaches and practices.
• Includes international entities
• Does not actually create standards but facilitates standards creation
National Information Exchange Model (NIEM)
14Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
1. Biometrics – fingerprint, DNA, facial images
2. CBRN – Chemical, Biological, Radiological and Nuclear – information exchange to detect and interdict threats
3. Children, Youth and Family Services- Improving outcomes for children and youth at risk
4. Cyber- Coordinated efforts to improve computer security
5. Emergency Management - coordinate responses to recover from natural disasters or acts of terrorism
6. Immigration – support immigration related services
7. Infrastructure protection – strengthen and protect critical infrastructure
8. Intelligence - exchanging intelligence among federal, state, and local agencies on a foreign or domestic basis
9. International trade – Protect movement of trade across US borders
10. Justice- Supports Global Justice XML project to share information between jurisdictions
11. Maritime – Anything associated with global maritime
12. Screening –In support of a wide range of screening and credentialing activities for Homeland Security
13. Human Services - communications that promote the economic and social well-being of families, children, individuals, and communities.
NIEM Domains
15Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
EURopean Inter-Disciplinary research on Intelligent Cargo for Efficient safe and environment-friendly logistics
• Platform to enable cargo items to interact with systems and users along the logistics chain
• Scenarios include: ‑ Connected transport and production processes ‑ Active cold-chain monitoring‑ Cargo controlling transportation in 3PL services to final customer‑ Cooperative warehousing through cargo-centric information services‑ Self-returning empty pallets and boxes‑ Cargo-assisted intermodal transport‑ Intelligent routing through cargo-infrastructure cooperation‑ Automated clearance and billing of transiting goods
EURADICE
Source: http://www.euridice-project.eu/index.php/web/page/66
16Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
EURADICE Scenarios
Source: http://www.euridice-project.eu/index.php/web/page/66
17Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• Connected transport and production processes Objective to increase synchronization between transportation and manufacturing processes, through:
‑ Real time detection of exceptions in terms of missing boxes or delays that can trigger changes to the production orders and plans.
‑ Intelligent Cargo is able to detect and inform the user about deviations during the transport process that can bring changes on the production order to which the cargo is assigned and on the future production plans.
‑ Calculation, monitoring and updating of the harmonized delivery date of shipments that will make a unique assembled final product.
‑ The related Intelligent Cargoes are aware of their interdependency and are able to update their combined ETA at the supplier plant.
EURADICE
Source: http://www.euridice-project.eu/index.php/web/page/66
18Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• Active cold-chain monitoringObjective is to improve monitoring of the cargo physical conditions and of the cargo delivery process:‑ Automated monitoring of the order fulfillment, through self-detection of deviations,
where the Intelligent Cargo detects deviations in terms of quantity, type, date and place of cargo loaded with respect to the distributor’s order
‑ Automated acceptance of cargo, where the Intelligent Cargo prompts acceptance by the customer if quantity, type, date, place and temperature of delivered goods throughout the transport have been within the allowed limits.
‑ Self-monitoring of cargo physical conditions, which consists of active recording of cargo temperature and cargo seal condition, diagnosing current status and issuing alarms in case of anomalies (e.g.: out-of-bounds temperature readings, cargo seal removal).
EURADICE
Source: http://www.euridice-project.eu/index.php/web/page/66
19Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• Programs require process definition and analysis• Business rules creation • Common terminology, mapped metadata and data translations • Mechanisms to infer logic • Application to Application (M2M) communication
EURADICE
Source: http://www.euridice-project.eu/index.php/web/page/66
20Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• IBM Watson – IBM’s Jeopardy playing computer ‑ Not exactly practical - $25 mm and 4yrs, $10mm of hardware‑ Uses taxonomies and ontologies, multiple algorithms to “understand”
content (keyword matching, “temporal” reasoning, “statistical paraphrasing” , “geospatial reasoning, etc.)
Application of Semantics to Search and Content Findability
21Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• Search Enhancement – Ability to provide users what they want, not what they are asking for‑ Leverages taxonomies and thesaurus structures (and ontologies) to
provide things that are conceptually related‑ As searchers enter a term, the system says “hey, these items are
related”. ‑ For example, if you search on methodologies, the system may also return
related example projects. “Example projects” was not contained in the search query but is conceptually related to “methodologies”
Application of Semantics to Search and Content Findability
22Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
“Sound bite” definitions
• A Taxonomy is a list of terms that enable classification of information‑ Method used to organize Subject/Topic metadata‑ Typically expresses hierarchical relationships (parent/child)‑ Emphasizes context
• A Thesaurus is a specialized taxonomy‑ Equivalence relationships (synonyms)‑ Associative relationships (related terms – “see also”)‑ Preferred terms, variant terms
• An Ontology is a collection of taxonomies and thesauri‑ A body of knowledge is represented by multiple lists of categories‑ Categories of various types are conceptually related
23Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Telecommunications Product Ontology
Product Categories• Hardware• Software• Infrastructure• Power systems• Third party equipment• …
Solution• Abnormal traffic inspection• Backhaul• Digital media• GSM – R• Last mile access• Mobile operators• Ring network• RNP & RNO• Security• Softswitch• WiMax• …
Document types• Acceptance manual• Branding • Brochure• Carrier profile• Case study• Competitor profile• News• Product data sheets• Statement of compliance• Test report• Universal certification • …
Business Units• Applications and Software• Core Networks• Networks• Wireless
A
Competitors• Ericsson• Nokia Siemens• Alcatel Lucent• Cisco • Zte
24Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Concept Modeling – “Breach”
25Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Semantics Applied to Search
26Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Semantics Applied to Search
Best Bet - Suggest Breach Type
27Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Semantics Applied to Search
Refinement by Metadata Facet
28Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Semantics Applied to Search
“Breach” Related Terms
29Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Semantics Applied to Search
“breach” = “violation”
30Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• Marketing propaganda at the back of the room (or someplace)
• Special deal for TRB attendees for SharePoint IA training
• Something else that he thinks is special and wants me to pitch to you guys (something about a SharePoint IA assessment and planning workshop – see propaganda item #2)
So my marketing VP is not mad at me when I return:
31Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Founded - 1994 Headquarters - Boston, MA
What we do – Design and deliver content management and search solutions for companies and their customers
Our core team – 35 information and system architects, library scientists, process improvement consultants, project managers and other information management specialists
Our unique offering – Content Choreography™
Retail
High Tech & Manufacturing
Pharmaceuticals & Life Sciences
Financial Services & Insurance
Media & Entertainment
Our clients include – Global 2000, major non-profits and government entities
Earley & Associates Overview
32Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
clients
Partial Client List
33Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Communities of Practice• SharePoint IA Group:
http://tech.groups.yahoo.com/group/SharePointIACoP/ • Taxonomy Group: http://finance.groups.yahoo.com/group/TaxoCoP• Search Group: http://tech.groups.yahoo.com/group/SearchCoP
Upcoming Webinar Events
• February 1, 2012 – Business Value of Taxonomy• More to come soon…
Events and Communities
34Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Training Opportunities
City Dates Early Reg. Discount before
Houston, TX January 25-27, 2012 January 11, 2012
Arlington, VA February 8-10, 2012 January 18, 2012
Chicago, IL March 14-16, 2012 February 22, 2012
Anaheim, CA April 11-13, 2012 March 21, 2012
SharePoint Information Architecture (3 days) $1995 ($1795, early reg)
AIIM IOA Master Certificate Course (4 days) $2995
City Dates
Stamford, CT February 28-March 2, 2012
Boston, MA March 27-30, 2012
35Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• The Information Architecture Process‑ User Research & Requirements Gathering‑ Audience and Process Analysis ‑ Roles, Responsibilities, Use cases, Personas and scenario development‑ Content Modeling and Content Type Definitions‑ Metadata Schemas and Taxonomy Development‑ Search Integration
• Term Store Management‑ Creating and Managing Groups
• Creating and Managing Content Types‑ Properties (Site Columns, Workflow, IM Policies)‑ Overview of Content Hubs‑ Adding Content Types to Document Libraries
• Creating Metadata for Content Enrichment‑ Core Metadata Schemas‑ Leveraging Managed Metadata and the Term Store
• Governance‑ Governance planning‑ Operational zing governance using platform capability
SharePoint Information Architecture 3 Day Hands-on Course
36Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
• What you will learn‑ Enterprise search‑ Content inventory and classification‑ Categorization and clustering‑ Fact and entity extraction‑ Taxonomy creation and management‑ Information presentation‑ Information governance
• Who should attend?‑ Anyone with a stake in the success of
your organization’s IOA initiatives
• Certificate options‑ Practitioner (days 1&2)‑ Practitioner + Specialist (days 3&4) =
Master
Information Organization and Access (IOA) – 4 days
37Copyright © 2012 Earley & Associates, Inc. All Rights Reserved.
Contact
Seth EarleyCEOEarley & Associates
Phone: 781-820-8080Email: [email protected]
Follow me on twitter: sethearleyConnect with me on LinkedIn: www.linkedin.com/in/sethearley
38Copyright © 2011 Earley & Associates, Inc. All Rights Reserved.
Thank you