What is a Semantic Wiki Why should you want to know or care? Jesse Wang, Wil Smith 2012.04.11 For Semantic Web Meetup in San Diego
What is a Semantic WikiWhat is a Semantic WikiWhy should you want to know or care?
Jesse Wang, Wil Smith
2012.04.11
For Semantic Web Meetup in San Diego
2
Crowdsourcing for Better Knowledge AcquisitionCrowdsourcing for Better Knowledge Acquisition
3
A Key Feature of WikiA Key Feature of Wiki
Consensus
This distinguishes wikis from other publication tools
4
Consensus in Wikis Comes fromConsensus in Wikis Comes from
Collaboration– ~17 edits/page on average in
Wikipedia (with high variance)– Wikipedia’s Neutral Point of View
Convention– Users follow customs and
conventions to engage with articles effectively
5
Software Support Makes Wikis SuccessfulSoftware Support Makes Wikis Successful
Trivial to edit by anyone Tracking of all changes, one-
step rollback Every article has a “Talk” page
for discussion Notification facility allows
anyone to “watch” an article Sufficient security on pages,
logins can be required A hierarchy of administrators,
gardeners, and editors Software Bots recognize certain
kinds of vandalism and auto-revert, or recognize articles that need work, and flag them for editors
6
Success of WikisSuccess of Wikis
One of human’s greatest inventions
Finding information…Finding information…
Wikipedia has articles about…• … all cities with info on their
populations, locations and skyscrapers, etc.… all German cars with engine size, accelerating data…
Can you find: Skyscrapers with 50+ floors and built after 2000 in Shanghai (or Chinese cities with 1,000,000+ people)?
Or German(Porsche) cars that accelerate from 0-100km/h in 5 seconds? 7
8
Can Search Solve the Problem?Can Search Solve the Problem?
9
How Wikipedia Answers – List!How Wikipedia Answers – List!
http://en.wikipedia.org/wiki/List_of_fastest_cars_by_acceleration
10
And Tables…And Tables…
11
Answer is Hidden Deeply InAnswer is Hidden Deeply In
List of ArticlesAnd
Tables, InfoBoxes, etc.
12
Look into List in WikipediaLook into List in Wikipedia
http://en.wikipedia.org/wiki/List_of_German_cars
13
Editing Standard Wiki Article – Static ListEditing Standard Wiki Article – Static List
14
Static List, Tables, …, Not Useable EnoughStatic List, Tables, …, Not Useable Enough
http://en.wikipedia.org/wiki/List_of_lists_about_Oregon
15
We need structured data
with clear and consistent
semantics
To Find More InfoTo Find More Info
• All Porsche vehicles made in Germany that accelerate from 1-100 km/h less than 4 seconds
• Sci-Fi movies made after year 2000 that cost less than $10M and gross more than $30M
• A map showing where all Mercedes-Benz vehicles are manufactured
• All skyscrapers in China (Japan, Thailand,…) of 50 (40/60/70) floors or more, and built in year 2000 (2001/2002) and after, sorted by built year, floors…, grouped by cities, regions…
• And many more Semantics Come To Rescue
16
What is a Semantic WikiWhat is a Semantic Wiki
A wiki that has an underlying model of the knowledge described in its pages.
To allow users to make their knowledge explicit and formal Semantic Web Compatible
Semantic Wiki
17
Two PerspectivesTwo Perspectives
Wikis for Metadata
Metadata for Wikis
Characteristics of Semantic WikisCharacteristics of Semantic Wikis
Semantic Wikis
18
19
Basics of Semantic WikisBasics of Semantic Wikis
Still a wiki, with regular wiki features– Category/Tags, Namespaces, Title, Versioning, ...
Typed Content (built-ins + user created, e.g. categories)– Page/Card, Date, Number, URL/Email, String, …
Typed Links (e.g. properties)– “capital_of”, “contains”, “born_in”…
Querying Interface Support– E.g. “[[Category:Member]] [[Age::<30]]” (in SMW)
20
Why Semantic Wiki?Why Semantic Wiki?
Annotation of existing structures with machine readable metadatalinks carry meaning, typing of links, typing of pages
Context dependent adaptation and presentationdifferent domains have different ways of presenting content, personal preferences, etc.
Improved, “intelligent”, search and navigationqueries to the structure, visualisation of structure, derived information
Improved interoperability between systemsexchange of content, integration of different systems, agents, etc.
21
What is the Promise of Semantic Wikis?What is the Promise of Semantic Wikis?
Semantic Wikis facilitate Consensus over Data
Combine low-expressivity data authorship with the best features of traditional wikis
User-governed, user-maintained, user-defined
Easy to use as an extension of text authoring
The ultimate data aggregator
22
One Key Helpful Feature of Semantic WikisOne Key Helpful Feature of Semantic Wikis
Semantic Wikis are “Schema-Last”Databases require DBAs and schema design;
Semantic Wikis develop and maintain the schema in the wiki
List of Semantic WikisList of Semantic Wikis
AceWikiArtificialMemoryWagn - Ruby on Rails-basedKiWi – Knowledge in a WikiKnoodl – Semantic Collaboration tool and application platformMetaweb - the software that powers FreebaseOntoWikiOpenRecordPhpWiki
Semantic MediaWiki - an extension to MediaWiki that turns it into a semantic wikiSwirrl - a spreadsheet-based semantic wiki applicationTaOPis - has a semantic wiki subsystem based on Frame logicTikiWiki CMS/Groupware integrates Semantic links as a core featurezAgile Wikidsmart - semantically enables Confluence
23
24
Short History of Semantic MediaWiki (SMW)Short History of Semantic MediaWiki (SMW)
Born at AIFB– Typed links and types and more– Export articles as RDF– Maximally flexible for the wiki user
SMW 0.1 released by AIFB in Sept 2005– Parser/storage support for typed links – [[type::link | label]]– FactBox for semantic relations at end of article– Special:SearchSemantic, with basic auto-completion for link types– Simple query language (“ask”)
Vulcan kicks off Halo Extensions to SMW project in August 2007
SMW 1.0 released by AIFB in Dec 2007, Ontoprise releases Halo Extension 1.0 in parallel– “Property” instead of “Relation” and “Attribute”– Many new datatypes/special pages/UI features
25
Overview of Semantic MediaWiki (SMW)Overview of Semantic MediaWiki (SMW)
Open source (GPL)– Well documented, active user forum
Active development– Commercial support (SMW+) available
World-wide community– International Conferences
• Next SMWCon on 4/25-27, 2012 in Carlsbad, CA
Very stable core, various extensions
http://semantic-mediawiki.org/http://smwplus.com/
26
Semantic MediaWiki (SMW) Markup SyntaxSemantic MediaWiki (SMW) Markup Syntax
[[Property::Value | Display]]
Tsinghua is a university located in [[Has location::Beijing]], with
[[Has population::27000|about 27 thousands]] students.
In page "Property:Has location":
[[Has type::Page]]
In page "Property:Has population":
[[Has type::number]]
Also Special Properties
27
Define ClassesDefine Classes
Beijing is a city in [[Has country::China]], with population [[Has population::2,200,000]].
[[Category::Cities]]
Categories are used to define classes because they are better for class inheritance.
The Jin Mao Tower (金茂大厦 ) is an 88-story landmark supertall skyscraper in …
[[Categories: 1998 architecture | Skyscrapers in Shanghai | Hotels in Shanghai | Skyscrapers over 350 meters | Visitor attractions in Shanghai | Landmarks in Shanghai | Skidmore, Owings and Merrill buildings]]
Category:Skyscrapers in China Category: Skyscrapers by country
28
Possible Database-style Query over Data Possible Database-style Query over Data
{{#ask:[[Category:Skyscrapers]][[Located in::China]][[Floor count::>50]][[Year built::<2000]] …
}}
Ex: Skyscrapers in China higher than 50 stories, built before 2000
ASK/SPARQL query target
Data via Dbpedia
29
Semantic MediaWiki StackSemantic MediaWiki Stack
MediaWiki (XAMPP)
Extension: Semantic MediaWiki
More Extensions and Applications
30
Simple Example: Semantic Sci-Fi Movie WikiSimple Example: Semantic Sci-Fi Movie Wiki
Demo
31
SMW Extensions – Help Build Great ThingsSMW Extensions – Help Build Great Things
• Halo Extensions, Semantic Forms, Semantic Notification, …
Data I/O
• Semantic Toolbar, Semantic Drilldown, Faceted Search, Full-text Retrieval…
Query and Browsing
• Semantic Result Printers, Tree View, Exhibit, Flash charts…
Visualization
• HaloACL, Wiki Admin Tool Triplestore Connector, Simple Rules…• Semantic WikiTags and Subversion Integration extensions • Linked Data Integration Framework with R2R and SILK from F.U.Berlin
Other useful extensions
32
Example: Ultrapedia – Semantic WikipediaExample: Ultrapedia – Semantic Wikipedia
Ultrapedia: An SMW demo built to explore general knowledge acquisition in a wiki
Wikipedia merged with the power of a database Help Readers and Writers Be More Productive
An Analytical Encyclopedia
Standard View of the Wiki DataStandard View of the Wiki Data
http://wiking.vulcan.com/up/index.php/Porsche_996
Dynamic View of the Acceleration DataDynamic View of the Acceleration Data
Graph View of the Acceleration DataGraph View of the Acceleration Data
Dynamic Mapping and ChartingDynamic Mapping and Charting
37
Information Discovery via VisualizationInformation Discovery via Visualization
38
Video: Semantic Wikis for A New ProblemVideo: Semantic Wikis for A New Problem
Social tag-based characterization
Keyword search over tag data
Inconsistent semantics
Easy to engineer
Increasing technical complexity → ← Increasing User Participation
Algorithm-based object characterization
Database-style search
Consistent semantics Extremely difficult to
engineer
Social database-style characterization
Database search + wiki text search
Semantic consistency via wiki mechanisms
Easy to engineer
Semantic Entertainment
Wiki
39
Semantic Seahawks Football WikiSemantic Seahawks Football Wiki
Semantic Entertainment: Query Result Highlight ReelSemantic Entertainment: Query Result Highlight Reel
Commercial Look/Feel
Play-by-play video search
Highlight reel generation
Search on crowd-defined patterns (“touchdowns with big hits”)
Tree-based navigation widget
Very favorable economics
Demo
42
The InspirationThe Inspiration
We started with a
We could have an
wiki site
web application
43
Application 1: Project Management with SMWApplication 1: Project Management with SMW
Automatically populate tables
Just the data you want, At the level you want Calendars and
timelines Workflows Personal menus Form-oriented inputs Notifications via
email/RSS MS Office integration SVN integration
Vulcan Project Management Wiki (Story)Vulcan Project Management Wiki (Story)
Template and style sheet
customizations
Related content
automatically included
45
Vulcan Project Management Wiki (Task)Vulcan Project Management Wiki (Task)
Color codes to indicate types
and status
SVN Integration automatically “Completed”
task and relate to repository
46
Vulcan Project Management Wiki (Visualizations)Vulcan Project Management Wiki (Visualizations)
47
Screenshot of a Sprint pageScreenshot of a Sprint page
http://wiking.vulcan.com/dev/index.php/Sprint_101020
Data automatically generated via template queries on page
48
Requirements for Wiki “Developers”Requirements for Wiki “Developers”
One need not– Write code like a hardcore programmer– Design, setup RDBMS or make frequent
schema changes– Possess knowledge of a senior system
admin Instead one need
– Configure the wiki with desired extensions– Design and evolve the data model
(schema)– Design Content
• Customize templates, forms, styles, skin, etc.
49
Effectiveness of SMW as a Platform ChoiceEffectiveness of SMW as a Platform Choice
Packaged Software
☺Very quick to obtainN Hard to customizeN Expensive
Microsoft Project Version One Microsoft
SharePoint
Custom Development
N Slow to develop☺Extremely flexibleN High cost to develop and maintain
.NET Framework J2EE, … Ruby on rails
SMW + Extensions
☺ Still quick to program☺ Easy to customize☺ Low-moderate cost
Vulcan Project Wiki B.L.S. RPI map
Application 2: Human Brain MapApplication 2: Human Brain Map
• Open, public online access• A detailed, interactive three-dimensional
anatomic atlas of the "normal" human brain• Data from multiple human brains• Genomic analysis of every brain structure,
providing a quantitative inventory of which genes are turned on where
• High-resolution atlases of key brain structures, pinpointing where selected genes are expressed down to the cellular level
• Navigation and analysis tools for accessing and mining the data
Biological Linked Data MapBiological Linked Data Map
• Open, public online access• Data from multiple RDF data stores• Complete import pipeline using LDIF
framework• Outlines of each imported instance
embedding inline wiki properties and providing views of imported properties from original RDF datasets
• Charting tools that ‘pivot’ SPARQL queries providing several views of each query
• Navigation and composition tools for accessing and mining the data
Where did we get the data?Where did we get the data?
KEGG : Kyoto Encyclopedia of Genes and Genomes– “KEGG GENES is a collection of gene catalogs for all complete genomes generated
from publicly available resources, mostly NCBI RefSeq.” Diseasome
– “The Diseasome website is a disease/disorder relationships explorer and a sample of an innovative map-oriented scientific work. Built by a team of researchers and engineers, it uses the Human Disease Network dataset.”
DrugBank– “The DrugBank database is a unique bioinformatics and cheminformatics resource
that combines detailed drug data with comprehensive drug target information.” SIDER
– “SIDER contains information on marketed medicines and their recorded adverse drug reactions. The information is extracted from public documents and package inserts.”
Wiki Ontology MapWiki Ontology Map
• Genes• DrugBank : 4,553• Diseasome : 3,919• KEGG : 9,841
• Diseases• Diseasome : 4,213• KEGG : 459
• Drugs• DrugBank : 4,772• KEGG : 2,482• SIDER : 924
• Effects• SIDER : 1,737
• Pathways• KEGG : 28,442
We chose to intentionally simplify the ontology due to disagreements between researchers about entity relationships and subclasses.
61,342 Instances Available for Import
Embedding SPARQL Semantic Results FormattersEmbedding SPARQL Semantic Results Formatters
Every piece of content on every instance page is generated by Semantic Result Formatters interpreting SPARQL results.
Most inline properties are embedded in templates returned by SPARQL formatters.
All 3 dynamic graph types are interpreting results of SPARQL queries and injecting a JavaScript template into the head of the page.
The outline template takes selected predicates and objects from a SPARQL query, defined in the query embedding, and generates an HTML template for the page.
Four Initial Templates for Each Instance by Category Four Initial Templates for Each Instance by Category
1. Custom infobox within outline template• Visible inline properties
2. Outline template providing instance information
3. Widget template displaying dynamic charts or third party services• Donut charts and disease Twitter feed
4. Broad table SPARQL queries showing instance relationships
5. Hidden inline properties for other extensions
Demo LinksDemo Links
http://neurowiki.alleninstitute.org/index.php/Main_Page http://neurowiki.alleninstitute.org/index.php/AR http://neurowiki.alleninstitute.org/index.php/Propofol http://neurowiki.alleninstitute.org/index.php/AIBS:Main_Page http://neurowiki.alleninstitute.org/index.php/AIBS:AR http://neurowiki.alleninstitute.org/index.php/Frontal_Lobe http://neurowiki.alleninstitute.org/index.php/NEUROLEX:Main_Page http://neurowiki.alleninstitute.org/index.php/Adrenoleukodystrophy
Neurowiki in Action!Neurowiki in Action!
Which drugs are used in Chemotherapy? What are the dangers of Propofol? How are base entities like Calcium represented? How are new inline properties added to entities?
– Can these be searched?– Can these be queried using ASK?
Do existing extensions work with the framework?
58
Openness of SMW as a PlatformOpenness of SMW as a Platform
Open Source
Open Content
Open Metadata
59
Semantic MediaWiki Enables CollaborationSemantic MediaWiki Enables Collaboration
Create and Manage Real Knowledge
Build Social Semantic Web Applications
In an Efficient and Cost-Effective Way
60
April 25-27, Tri-City Wellness CenterApril 25-27, Tri-City Wellness Center
Want to learn more?
62
Tutorial: Wednesday 4/25Tutorial: Wednesday 4/25
MediaWiki– How to Edit, Format Text, Use Template– Infoboxes, Parser Functions, Extensions…
Semantic MediaWiki – Features, usage, queries, patterns– Extensions: data i/o, forms, maps, visualizations…
SMW+– Usability Enhancement, Wiki Admin Tools, Installation Packages– Enterprise Wiki, Microsoft Office Integration, Triple-store support…
Edithon– Supervised individual and group collaboration time
63
Conference Days (4/26-4/27)Conference Days (4/26-4/27)
Two Keynote Talks– Semantics and Wikis: from Genomes to X Games by Tom Marsh– Text, Data, Structure, Semantics and Knowledge by Mark Greaves
Variety of Presentations– Open and closed data, including talks on wikidata– Social and semantics, mash-ups– R&D: life science, neuroscience, biology, finance…– News, updates, demos and apps – Lightning Talks
Plenty of Group Social Time– Breakfast, Coffee breaks, Lunch and Dinner– Movie screening, restaurant and bars, …
64
Social and Semantic Web at a Great Value Social and Semantic Web at a Great Value
Thanks to our sponsor ai-one and local organizers We’d be able to provide
– Quality lunch– Drinks and pastry/snacks for coffee breaks (breakfast)– Conference dinner– Business relation breakfast on Friday 4/27
Opportunity to meet a variety of people– Achievers– Hackers– Bar-hoppers
Register Now! http://bit.ly/scs12
AcknowledgementAcknowledgement
Paul Allen
Mark Greaves
Project Halo
Karlsruhe Institute of Technology
Ontoprise GmbH
Wil Smith
Free University of Berlin
Vulcan Technology
TeamMersion LLC
Jesse Wang
65
Thank you!
66
Backups start here
(End of Slides)