1/26 Improving volunteered geographic data quality Improving volunteered geographic data quality using semantic similarity measurements using semantic similarity measurements Arnaud Vandecasteele - Arnaud Vandecasteele - Rodolphe Devillers Rodolphe Devillers Memorial University of Newfoundland, Canada Memorial University of Newfoundland, Canada 8th International Symposium on Spatial Data Quality, 30 May - 1 June 2013
26
Embed
Improving volunteered geographic data quality using semantic similarity measurements
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
1/26
Improving volunteered geographic data quality Improving volunteered geographic data quality using semantic similarity measurementsusing semantic similarity measurements
Arnaud Vandecasteele - Arnaud Vandecasteele - Rodolphe DevillersRodolphe Devillers Memorial University of Newfoundland, CanadaMemorial University of Newfoundland, Canada
8th International Symposium on Spatial Data Quality, 30 May - 1 June 2013
2/26
Outline
Introduction
Conclusion
Semantic SimilarityP-Rank algorithmTobler's Law
OSM Semantic PluginDescriptionExamples
3/26
IntroductionNational Mapping Agencies
What make National Mapping Agencies Authoritative ?
Positional Accuracy
Completeness
Attribute Accuracy
ISO 19113ISO 19115
...ISO 19157
4/26
IntroductionGeographic Information Quality view asa Project Management Triangle
5/26
IntroductionGeographic Information Quality view asa Project Management Triangle
Really?
6/26
Introduction
Could Another Map be authoritative* ?
* and cheap, and fast, accurate and in the better of worlds free
7/26
IntroductionVolunteered Geographic Information (VGI)
8/26
IntroductionVolunteered Geographic Information (VGI)
the widespread engagement of large numbers of private citizens, often with little in the way of formal qualifications, in the creation of geographic information
“
Goodchild - 2007
”
9/26
Source: http://wiki.openstreetmap.org/wiki/Stats
OpenStreetMap (OSM) is a collaborative project to create a free editable map of the
world
+ 1 million
+ 1.8 billion nodes+ 180 million ways+ 1.9 million relations
Started in 2004
IntroductionThe OpenStreetMap project
10/26
IntroductionData Quality & Volunteered Geographic Information
What aboutData Quality ?
Good geometric accuracyHaklay – 2010, Girres and Touya – 2010, Ludwig et al., - 2011
ButGeographic coverage patchwork
Goodchild - 2007
Semantics can be inconsistentBallatore et al., - 2012, Mooney and Corcoran - 2012
11/26
Introduction
VGI changed the way we produce, publish and share Geographic Information
BUT
Semantic Quality is still an important issue
How to improve semantic quality using a VGI approach ?
Research Problem
12/26
Semantic SimilarityWhat is Semantic Similarity ?
Landuse =
Forest
How to describe a forest in OpenStreetMap
Natural =
Wood
One concept, different representation !Q ? -> When should we use landuse=forest rather than natural=wood?* https://help.openstreetmap.org/questions/324/when-should-we-use-landuseforest-rather-than-naturalwood
11 different answers and no real general agreement
13/26
Semantic SimilarityHow to measure the semantic similarity ?
Semantic Similarity can be used to enhance the quality of VGI dataset
OSM Semantic plugin uses a collaborative approach to reduce the potential semantic similarity
How to improve the results:● Using the Tag Info database to know the most used tags ● By mixing the Geographic and the semantic approach (Ballatore + Mooney)
26/26
Questions ?
Rodolphe DevillersMarine Geomatics Labhttp://www.marinegis.com/ Memorial University of Newfoundland
Acknowledgements
Natural Science and Engineering Research Council of Canada (NSERC)Andrea Ballatore for sharing his results