Linked Data for Development: Part 2: Downscaling Linked Data. Victor de Boer. With significant input from Christophe Guéret, Martin Murillo, Stephane Boyera, Stefan Schlobach, Bernie Innocenti, Walter Bender, Claudia Urrea, Anna Bon, Hans Akkermans, Nana Gyan, Amadou Tangara, Mary Allen,…

Ld4d 2013 part 2

Jan 26, 2015


Education

Victor de Boer

Slides for the second part of the Linked Data for Development (LD4D) tutorial, held at WSSF2013 in Montreal, Canada.

In this presentation I talk about downscaling the Semantic Web, taking into account issues around 1) infrastructure and hardware, 2) interfaces, and 3) relevant data.
Transcript
Page 1: Ld4d 2013 part 2

Linked Data for Development: Part 2: Downscaling Linked Data

Victor de Boer

With significant input from Christophe Guéret, Martin Murillo, Stephane Boyera, Stefan Schlobach, Bernie Innocenti, Walter Bender, Claudia Urrea, Anna Bon, Hans Akkermans, Nana Gyan, Amadou Tangara, Mary Allen,…

Page 2: Ld4d 2013 part 2

LD4D at ISWC2012

Page 3: Ld4d 2013 part 2

Outline

• Part 2:
  – Why Linked Data for Development
  – Bringing the Semantic Web and Linked Data to the Base of the Pyramid
    • Relevancy
    • Infrastructure and connectivity
    • Interfaces
  – IATI as Linked Data
  – Voice-based access to Market data in the Sahel
  – Distributed data sharing: OLPC and ERS

• Part 3: Hands-on session!

Page 4: Ld4d 2013 part 2
Page 5: Ld4d 2013 part 2

CAUTION! DIGITAL DIVIDE AHEAD

Img: Internet World Stats

Page 6: Ld4d 2013 part 2

Digital divide in classrooms

Page 7: Ld4d 2013 part 2

ICT4D

• Technology is a development tool
  – Education
  – Healthcare
  – Livelihood
  – etc.

• Leveraging communication independently of physical/geographical barriers

• Improving transparency, accountability, efficiency of governments

• Developing nations can leapfrog directly into the information age, jumping many phases of immature technologies

Based on Sbc4d.com

Page 8: Ld4d 2013 part 2

Information sharing needs

• Agriculture
  – Market prices
  – Business opportunities
  – Support
  – Sharing indigenous knowledge
  – Etc.

• Health
  – Prevention
  – Access to healthcare
  – Detection of disease outbreak
  – etc.

• Education
• Etc.

Based on Sbc4d.com

Page 9: Ld4d 2013 part 2

Web Alliance for Regreening in Africa

Washington, 13-15 May 2013

W4RA : Information exchange and knowledge sharing in rural Africa

Page 10: Ld4d 2013 part 2

World Wide Web as Instrument of Empowerment

“Our success will be measured by how well we foster the creativity of our children. Whether future scientists have the tools to cure diseases. Whether people, in developed and developing economies alike, can distinguish reliable information from propaganda or commercial chaff.

Whether the next generation will build systems that support democracy and accountable debate.

I hope that you will join this global effort to advance the Web to empower people.”

Sir Tim Berners-Lee, inventor of the Web:

Page 11: Ld4d 2013 part 2

Why the Semantic Web?

• Information (from NGOs) in silos
  – Specific products
  – Specific communities

• A lot of knowledge is lost due to lack of publication

Sharing (heterogeneous) knowledge is essential

• LD is well-suited because of:
  – Language-agnostic
  – Interface-agnostic
  – De-centralised authoring
    • Slicing
  – Re-usability
    • Local
    • Global

Img: flickr/elcovs

Page 12: Ld4d 2013 part 2

Why linked data (1/2)

Slide stolen from Christophe Gueret

Page 13: Ld4d 2013 part 2

Slide stolen from Christophe Gueret

Why linked data (2/2)

Page 14: Ld4d 2013 part 2

Web of Documents (WWW)

Linked Documents

Page 15: Ld4d 2013 part 2

Web of Data

Linked Data

Page 16: Ld4d 2013 part 2

Barriers to the Internet

1. Technology: The lack of connectivity and electricity, cost of devices and cost of connection are limiting the adoption and usage of new technologies;

2. Capacity: Lack of time and resources limits the participation in data sharing processes. There are also issues related to low education levels, low capacity to interpret data, and illiteracy;

3. Relevance: Power balance, culture, apathy, lack of incentives, lack of interest and dis-empowerment are also all threats to having citizens engage in data sharing.

Stephane Boyera (SBC4D.com)

Page 17: Ld4d 2013 part 2

Sem.tech/Linked Data should be made

1. usable on small, affordable, hardware deployed in various connectivity contexts;

2. accessible to individuals with varied cultural backgrounds / literacy levels;

3. relevant and directly useful to the target public they aim to empower.

Infrastructure

Interface Relevancy

Page 18: Ld4d 2013 part 2

Infrastructure

• No internet
• No bandwidth
• No computer
• No electricity

• Cost
  – Total cost of ownership

Page 19: Ld4d 2013 part 2

Interface

• Low literacy
• Low education
• Small languages
• Low capacity to interpret data, and illiteracy

Page 20: Ld4d 2013 part 2

Relevancy

• No local content
• No local ownership

• Power balance, culture, apathy, lack of incentives, dis-empowerment

Subsecretario de transparencia, Alcaldes y la gente http://www.youtube.com/watch?v=q0S3juRQXR0 Max Rodriguez

Page 21: Ld4d 2013 part 2

New ways of connecting to the (Semantic) Web

Page 22: Ld4d 2013 part 2

Mobile phones

Page 23: Ld4d 2013 part 2

Radio

• No. 1 source of information
• Interactive radio programs
• Huge listening base

Page 24: Ld4d 2013 part 2

Low-powered hardware

• OLPC XO laptop
• Raspberry Pi
• Sheevaplug etc.

Page 25: Ld4d 2013 part 2

With the mainstream

• Developing countries can leapfrog directly into the information age,
  – jumping many phases of immature technologies

• Linked Data is mainstream computer science research.
  – Let’s worry about the 4.5 B unconnected prosumers now!

Img: flickr/n3v3rv0id

Page 26: Ld4d 2013 part 2

Voice-based Web access in Africa

Page 27: Ld4d 2013 part 2

• Integrate local community radios and mobile ICT for knowledge sharing

• Better support and integrate local languages in voice-based services
  – Development of appropriate speech elements (text-to-speech and speech recognition)

• Develop a free and open source toolbox for local developers.
  – Investigate self-sustainability
  – Develop appropriate business models
  – In collaboration with local communities.

Page 28: Ld4d 2013 part 2

Bottom-up

• Involvement of local communities
  – Trust and ownership
  – Co-creation

• Bottom-up: field visits, workshops, demos, roadshows, etc.

• Local communities: innovation co-creation, “Living Labs” socio-technical approach
  – Use case gathering
  – Observation and prototyping
  – Test, adapt

Page 29: Ld4d 2013 part 2

From 20 use cases to 3 voice systems

Market Information

Citizen Journalism

Event Organiser

1 m-Milk ordering and delivery service of Tominian Milk producers and NGO
2 m-Tree protection alert service Sahel Eco Farmers and NGO
3 mobile-web Event organizer for vaccination of herds Farmers
4 m-Farmer-expert directory service Farmer organization
5 NGO info-line about legal issues in several languages Sahel Eco
6 Leave announcement or select your favourite song Radio
7 Shea butter and honey trading service Radio and Sahel Eco
8 Access radio programs and announcements on your phone Radio
9 Gourcy seed producers seed certification service Farmer organization
10 Radio questions and answers about agricultural issues Radio
11 m-collective purchase organizing service Local buyers
12 m-GIS regreening service Sahel Eco
13 m-Farmer social network Sahel Eco
14 mobile-web regional market system Farmer organization
15 Sahel Eco portal to Regreening and access to m-services Sahel Eco
16 m-event organizer for re-greening events Sahel Eco, farmers

Page 30: Ld4d 2013 part 2

Local market data

Communiqué

GSM/Voice interface

Web Interface Text-To-Speech

Community radio

Sahel Eco operative

Buyers

Page 31: Ld4d 2013 part 2

“Slot and Filler” Text-to-Speech

English: 15 liters of honey offered by Zakari Diarra

Bambara: 15_ba.wav, L_ba.wav, Of_ba.wav, Honey_ba.wav, offered by.wav, Z_Diarra_ba.wav

Spoken Language Elements Repository
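The slot-and-filler idea on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the RadioMarché implementation: the template, the function name `build_playlist`, the fragment order, and the `<element>_<language>.wav` naming convention are all assumptions, loosely modelled on the file names shown on the slide.

```python
from typing import Dict, List

# Hypothetical template for one market offering: fixed carrier fragments
# interleaved with slots ({quantity}, {unit}, {product}, {contact}).
# Fragment names and their order are assumptions for illustration.
OFFER_TEMPLATE: List[str] = [
    "{quantity}", "{unit}", "of", "{product}", "offered_by", "{contact}",
]


def build_playlist(template: List[str], slots: Dict[str, str], lang: str) -> List[str]:
    """Fill each slot, then map every element to a pre-recorded audio file."""
    playlist = []
    for element in template:
        if element.startswith("{") and element.endswith("}"):
            # Slot: replace it with the filler value for this offering.
            element = slots[element[1:-1]]
        # Assumed naming convention: <element>_<language code>.wav
        playlist.append(f"{element}_{lang}.wav")
    return playlist


offering = {"quantity": "15", "unit": "L", "product": "Honey", "contact": "Z_Diarra"}
bambara_playlist = build_playlist(OFFER_TEMPLATE, offering, "ba")
print(bambara_playlist)
```

Because only the fillers change between offerings, a new language needs one recording per fragment rather than a full speech synthesizer, which is the point of the approach for small languages.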

Page 32: Ld4d 2013 part 2

VoiceXML

<?xml version="1.0" encoding="ISO-8859-1"?>
<vxml version="2.0" xml:lang="en" xmlns="http://www.w3.org/2001/vxml">
  <form>
    <field>
      <prompt bargein="false">
        Welcome to RadioMarche!
        <audio src="audio/communique_1_bambara.wav"/>
      </prompt>
      <option dtmf="1" value="1">Press one for X</option>
      <option dtmf="2" value="2">Press two for Y</option>
      <!-- ... -->
    </field>
  </form>
</vxml>

DTMF = Dual-tone multi-frequency signaling
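A menu like the one above is usually generated rather than written by hand, since the options change with the data. The following is a minimal sketch using Python's standard `xml.etree.ElementTree`; the function name `build_menu_vxml`, the field name `choice`, and the one-DTMF-key-per-option layout are assumptions for illustration, not the project's actual generator.

```python
import xml.etree.ElementTree as ET
from typing import List


def build_menu_vxml(welcome: str, audio_src: str, choices: List[str]) -> str:
    """Return a VoiceXML 2.0 document with one DTMF option per choice."""
    vxml = ET.Element("vxml", {"version": "2.0", "xmlns": "http://www.w3.org/2001/vxml"})
    form = ET.SubElement(vxml, "form")
    # "choice" is a hypothetical field name holding the caller's selection.
    field = ET.SubElement(form, "field", {"name": "choice"})
    prompt = ET.SubElement(field, "prompt", {"bargein": "false"})
    prompt.text = welcome
    ET.SubElement(prompt, "audio", {"src": audio_src})
    # Keys are assigned in order: first choice on key 1, second on key 2, ...
    for key, label in enumerate(choices, start=1):
        option = ET.SubElement(field, "option", {"dtmf": str(key), "value": str(key)})
        option.text = label
    return ET.tostring(vxml, encoding="unicode")


document = build_menu_vxml(
    "Welcome to RadioMarche!",
    "audio/communique_1_bambara.wav",
    ["Press one for X", "Press two for Y"],
)
print(document)
```

The sketch assumes at most nine choices, so that each option maps to a single key on a simple phone keypad.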

Page 33: Ld4d 2013 part 2

Foroba Blon

Page 35: Ld4d 2013 part 2
Page 36: Ld4d 2013 part 2
Page 37: Ld4d 2013 part 2
Page 38: Ld4d 2013 part 2

Web for ALL.

Using voice technologies and available tools…

… we make the benefits of the Web available to people who use simple mobile phones.

Page 39: Ld4d 2013 part 2

Results

• RadioMarché -- Increased market for farmers.
  – Political, social, economic, ecological factors play a great role
  – Too successful: not the entire value chain is served

• Foroba Blon -- Facilitating rural citizen journalism.
  – Privacy and security
  – New business models

Voice platform with reusable components for different use cases.

Page 40: Ld4d 2013 part 2

Linked Data for RadioMarche

Page 41: Ld4d 2013 part 2

http://semanticweb.cs.vu.nl/radiomarche

Page 42: Ld4d 2013 part 2

Linked Market Data

• 1,952 RDF triples
  – 90 offerings
  – 19 contacts

• Links to
  – Data
    • DBPedia
    • GeoNames
    • Agrovoc
  – Vocabularies
    • Foaf
    • GoodRelations

Local market data

Data / communique layer

Farmers (producers)

Buyers (consumers)

Email, GSM/Voice, Web, SMS

Interface handler layer

Local radio

Page 43: Ld4d 2013 part 2

Sharing across regions/NGOs

Local market data

RadioMarché market information system

Farmers (producers)

Buyers (consumers)

Email, GSM/Voice, Web, SMS

Data / communique platform

Local radio

RadioMarché in second region

Local market data

Data / communique layer

Farmers (producers)

Buyers (consumers)

Email, GSM/Voice, Web, SMS

Interface handler layer

Local radio

Page 44: Ld4d 2013 part 2

Re-use: EcoMash

Henk Kroon

Page 45: Ld4d 2013 part 2

Speakle voice labels

rm:offering0001  rm:product_name  rm:shea_butter .
rm:offering0001  rm:quantity      rm:1000 .
rm:offering0001  rm:unit_measure  rm:kilo .
rm:offering0001  rm:has_contact   rm:Mazankuy_Diarra .

rm:shea_butter  rdfs:label             "Amande de Karité"@fr .
rm:shea_butter  rdfs:label             "Shea Nuts"@en .
rm:shea_butter  speakle:voicelabel_ba  rm:audio_shea_ba.wav .
rm:shea_butter  speakle:voicelabel_nl  rm:audio_shea_nl.wav .

rm:1000  rdfs:label             "1000" .
rm:1000  speakle:voicelabel_ba  rm:audio_1000_ba.wav .
rm:1000  speakle:voicelabel_nl  rm:audio_1000_nl.wav .

rm:kilo  rdfs:label             "kilo"@en .
rm:kilo  speakle:voicelabel_ba  rm:audio_kilo_ba.wav .
rm:kilo  speakle:voicelabel_nl  rm:audio_kilo_nl.wav .
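To show how voice labels attached to Linked Data resources can drive a spoken message, here is a minimal Python sketch. It reads the graph on this slide as subject-predicate-object triples (the pairing of edges and nodes is inferred from the labels on the slide) and keeps them in a plain list; the helper names `first_object` and `voice_labels` are hypothetical, and a real deployment would query a triple store instead.

```python
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]

# Triples read off the slide's graph; storing them in a plain Python list
# is an assumption made to keep the sketch self-contained.
TRIPLES: List[Triple] = [
    ("rm:offering0001", "rm:quantity", "rm:1000"),
    ("rm:offering0001", "rm:unit_measure", "rm:kilo"),
    ("rm:offering0001", "rm:product_name", "rm:shea_butter"),
    ("rm:1000", "speakle:voicelabel_ba", "rm:audio_1000_ba.wav"),
    ("rm:1000", "speakle:voicelabel_nl", "rm:audio_1000_nl.wav"),
    ("rm:kilo", "speakle:voicelabel_ba", "rm:audio_kilo_ba.wav"),
    ("rm:kilo", "speakle:voicelabel_nl", "rm:audio_kilo_nl.wav"),
    ("rm:shea_butter", "speakle:voicelabel_ba", "rm:audio_shea_ba.wav"),
    ("rm:shea_butter", "speakle:voicelabel_nl", "rm:audio_shea_nl.wav"),
]


def first_object(subject: str, predicate: str) -> Optional[str]:
    """Return the first object matching (subject, predicate), or None."""
    for s, p, o in TRIPLES:
        if s == subject and p == predicate:
            return o
    return None


def voice_labels(offering: str, lang: str) -> List[str]:
    """Collect the audio files for quantity, unit and product in one language."""
    audio = []
    for prop in ("rm:quantity", "rm:unit_measure", "rm:product_name"):
        resource = first_object(offering, prop)
        if resource is None:
            continue
        label = first_object(resource, f"speakle:voicelabel_{lang}")
        if label is not None:
            audio.append(label)
    return audio


bambara_audio = voice_labels("rm:offering0001", "ba")
print(bambara_audio)
```

Switching the language code from `ba` to `nl` returns the Dutch recordings for the same offering, which is how the same data can serve callers in different languages.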

Page 46: Ld4d 2013 part 2
Page 47: Ld4d 2013 part 2
Page 48: Ld4d 2013 part 2

https://github.com/biktorrr/speakle

Page 49: Ld4d 2013 part 2

Voice browser Tel: +31208080855 Skype: +990009369996162208

Welcome

Choose application and language

dtmf

About which product (EN)

About which product (NL)

List all products (EN)

dtmf

List product offerings

dtmf

List product offerings

1

2

3

1..n

1..n

Page 50: Ld4d 2013 part 2

Current status

• Linked Market Data
  – Locally created
  – Linked Data makes re-use possible (NGO, others)
  – LD voice labels
    • Can be (re)used to develop voice applications with this data

• To go beyond proof-of-concept
  – More localization needed
  – Local hardware/services (Emerginov / OfficeRoute)
  – User testing
  – More sophisticated translations (VoiceSPARQL)

Page 51: Ld4d 2013 part 2

Infrastructure

Interface Relevancy

Page 52: Ld4d 2013 part 2

Icon-based interaction

Page 53: Ld4d 2013 part 2

Icon-based interaction

NCR ATM interface for illiterate 'grammar' - ISOTYPE by Otto Neurath available at http://imaginarymuseum.org/MHV/PZImhv/NeurathPictureLanguage.html

Page 54: Ld4d 2013 part 2

Crowdsourcing voice fragment gathering

Page 55: Ld4d 2013 part 2

One Laptop Per Child (OLPC), Sugar and the Entity Registry System

Bernie Innocenti, Walter Bender, Christophe Guéret,Claudia Urrea

Page 56: Ld4d 2013 part 2

OLPC mission and vision

• Develop (and deploy) a low-cost laptop in order to revolutionize how we educate the world's children

• What motivates learning is not carrots or sticks, but rather:
  – autonomy,
  – mastery, and
  – a sense of purpose.

• A laptop makes learning more flexible: Children learn by teaching and actively helping each other; the teacher is free to focus expertise where it is needed

Page 57: Ld4d 2013 part 2

How is learning with the XO different?

OLPC

• Computer for learning
• Student-centric
• Teacher as mentor
• Voice, text
• Learning to learn
• Critical thinking

Page 58: Ld4d 2013 part 2

Sugar

• Operating system for XO laptops

• Learner centric
• Activities (Apps)

Page 59: Ld4d 2013 part 2

Different activities

Page 60: Ld4d 2013 part 2

The numbers (2012)

• 2,000,000+ children with XOs
• 1,000,000,000 children w/o laptops
• 150+ language projects
• 40+ countries
• 500+ Sugar activities

Page 61: Ld4d 2013 part 2

Efficient Knowledge sharing with SemanticXO and ERS

Page 62: Ld4d 2013 part 2

Mesh VS Infrastructure network

Page 63: Ld4d 2013 part 2

Christophe Gueret

Page 64: Ld4d 2013 part 2

Christophe Gueret

Page 65: Ld4d 2013 part 2

Christophe Gueret

Page 66: Ld4d 2013 part 2

Christophe Gueret

Page 67: Ld4d 2013 part 2

Christophe Gueret

Page 68: Ld4d 2013 part 2

Christophe Gueret

Page 69: Ld4d 2013 part 2

Christophe Gueret

Page 70: Ld4d 2013 part 2

Christophe Gueret

Page 71: Ld4d 2013 part 2

Hybrid solution

http://www.firstmilesolutions.com/documents/DakNet_IEEE_Computer.pdf

Page 72: Ld4d 2013 part 2

Sneakernet

Throughput

Latency

“Never underestimate the bandwidth of a station wagon full of tapes hurtling down the highway.”

—Andrew Tanenbaum
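The throughput/latency trade-off behind the sneakernet can be made concrete with a little arithmetic. The sketch below is illustrative only: the function name and the example figures (a 32 GB storage device carried on a 4-hour trip) are hypothetical and do not come from the slides.

```python
def effective_bandwidth_mbps(payload_gigabytes: float, trip_hours: float) -> float:
    """Average throughput, in megabits per second, of a physically carried payload."""
    bits = payload_gigabytes * 8 * 1000**3  # decimal gigabytes to bits
    seconds = trip_hours * 3600
    return bits / seconds / 1e6


# Hypothetical example: a 32 GB device carried on a 4-hour trip.
sneakernet_mbps = effective_bandwidth_mbps(32, 4)
print(round(sneakernet_mbps, 2))
```

The throughput can rival a wired connection, but the latency equals the full trip time: nothing arrives until the carrier does. That is why this kind of transport suits bulk data synchronisation rather than interactive use.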

Page 73: Ld4d 2013 part 2

Infrastructure

Interface Relevancy

Page 74: Ld4d 2013 part 2

Linked Data for

Kasper Brandt Victor de Boer

Page 75: Ld4d 2013 part 2

Introduction - IATI

“IATI is a voluntary, multi-stakeholder initiative that seeks to improve the transparency of aid in order to increase its effectiveness in tackling poverty.”

As of 2013, over 150 donors, NGOs and governments have registered at IATIregistry.org by publishing their aid activities in this XML standard.

Now: 180+

Page 76: Ld4d 2013 part 2

Introduction - IATI users

• Funders
  o Where is the money of my organisation spent?
  o Where do other organisations spend their money?

• Governments
  o How much money is spent in my country?
  o What are the budgets or planned disbursements for my country?

• Locals
  o What organisations are working in my area?
  o What projects are currently going on in my area?

• Public
  o Where is my tax money going?
  o What are the organisations doing with my donations?

Page 77: Ld4d 2013 part 2

Introduction - IATI model

Organisations

Activities

Page 78: Ld4d 2013 part 2

Introduction - Why IATI Linked Data?

1. Reusable vocabularies
  o Extract information automatically from the IATI data by making use of applications which are able to interpret standard vocabularies

2. Enrich IATI data
  o Link IATI data to external datasets in order to enrich the IATI data with additional information or metadata.

3. Donors can use their own Linked Data specification.
  o @Linked-data-uri attribute already exists in the IATI model.

Page 79: Ld4d 2013 part 2

Model and links based on requirements elicited from experts

Iterative Requirements Engineering Process Model by Loucopoulos and Karakostas

Page 80: Ld4d 2013 part 2

Linked Data model - Example

iati:activity/GB-CHC-285776-CHA024
    iati:activity-transaction
    iati:activity/GB-CHC-285776-CHA024/transaction/42737 .

iati:activity/GB-CHC-285776-CHA024/transaction/42737
    iati:transaction-tied-status
    iati:codelist/TiedStatus/5 .

Page 81: Ld4d 2013 part 2

Linked Data model - Provenance

• On file level
  o Not on activity level

• A named graph per file, e.g.: iati:graph/dataset/Worldbank
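File-level provenance through named graphs amounts to storing a fourth element, the graph, next to every triple. The following Python sketch illustrates the idea with plain tuples; the helper names and the `EXAMPLE-1` activity identifiers are hypothetical, and only the `iati:graph/dataset/` prefix is taken from the slide.

```python
from typing import Iterable, List, Set, Tuple

Triple = Tuple[str, str, str]
Quad = Tuple[str, str, str, str]  # (subject, predicate, object, named graph)

GRAPH_PREFIX = "iati:graph/dataset/"  # prefix as shown on the slide


def graph_for_file(dataset_name: str) -> str:
    """Name of the graph that holds every triple from one source file."""
    return GRAPH_PREFIX + dataset_name


def load_file(store: List[Quad], dataset_name: str, triples: Iterable[Triple]) -> None:
    """Add a file's triples to the store, tagged with that file's graph."""
    graph = graph_for_file(dataset_name)
    for s, p, o in triples:
        store.append((s, p, o, graph))


def sources_of(store: List[Quad], subject: str) -> Set[str]:
    """Provenance lookup: which files (graphs) say something about a subject?"""
    return {g for s, _, _, g in store if s == subject}


store: List[Quad] = []
# Hypothetical activity identifiers, for illustration only.
load_file(store, "Worldbank", [
    ("iati:activity/EXAMPLE-1", "iati:activity-transaction",
     "iati:activity/EXAMPLE-1/transaction/1"),
])
sources = sources_of(store, "iati:activity/EXAMPLE-1")
print(sources)
```

Tracking provenance per file rather than per activity keeps the bookkeeping simple: reloading an updated file only requires dropping and refilling one graph.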

Page 82: Ld4d 2013 part 2

Linked Data model - Vocabularies

Page 83: Ld4d 2013 part 2

Linked Data model - Triple store

• Triples loaded into a ClioPatria triple store:
  o http://semanticweb.cs.vu.nl/iati/
  o SPARQL endpoint
  o Dereferenceable URIs (http://purl.org/collections/iati/codelist/Sector/11420)

• Total number of triples: 36,629,017

• Total number of named graphs: 4,790
  o Largest activities graph is UNOPS, containing 1,231,896 triples

• Takes approximately 30 minutes to load all data into the triple store.

RDFLib, Python, RDF/Turtle

Page 84: Ld4d 2013 part 2

Linking datasets - Approach

1. In total, how much does a given country receive in aid?
2. A comparative index of aid versus the Human Development Index.
3. What is the geographic location of a project? How much aid went to a given province, constituency or village?
  o Is the aid spent in places where the need is highest? Is it well distributed across the country?
  o Can we attribute sub-national breakdowns for aid so we can see how much goes to different parts of recipient countries?
4. How does violent conflict in recipient countries affect aid activities?
5. How does aid spending as registered in the IATI standard compare to World Bank indicators?

Page 85: Ld4d 2013 part 2

Linking datasets

Page 86: Ld4d 2013 part 2

Linking Data applications - Approach

1. In total, how much does a given country receive in aid?
2. A comparative index of aid versus the Human Development Index.
3. What is the geographic location of a project? How much aid went to a given province, constituency or village?
  o Is the aid spent in places where the need is highest? Is it well distributed across the country?
  o Can we attribute sub-national breakdowns for aid so we can see how much goes to different parts of recipient countries?
4. How does violent conflict in recipient countries affect aid activities?
5. How does aid spending as registered in the IATI standard compare to World Bank indicators?

Page 87: Ld4d 2013 part 2

http://iati2lod.appspot.com/

1. In total, how much does a given country receive in aid?

Page 88: Ld4d 2013 part 2

2. A comparative index of aid versus the Human Development Index.

http://iati2lod.appspot.com/

Page 89: Ld4d 2013 part 2

http://iati2lod.appspot.com/

4. How does violent conflict in recipient countries affect aid activities?
5. How does aid spending as registered in the IATI standard compare to World Bank indicators?

Page 90: Ld4d 2013 part 2
Page 91: Ld4d 2013 part 2
Page 92: Ld4d 2013 part 2

Links to DBPedia

IDS: document 0001 Theme:”Food Security”

DBPedia:”Food Security”

Analysis of approaches to understanding and addressing food security issues; examination of the structural causes of food insecurity and different policy responses

Theme:”Food aid emergencies”

Person:”David Pimentel”

Organisation:”FAO”

“Voedselzekerheid”@NL

Page 93: Ld4d 2013 part 2

Links to IATI

IDS: document 0003 Theme:”Higher education”

IATI Sector:”Higher Education”

Theme Education

Organisation : UN Habitat

Activity: Multi donor fund to support civil society in democracy related issues

Degree and diploma programmes at universities, colleges and polytechnics; scholarships.

Page 94: Ld4d 2013 part 2

Linked Data for Landportal.info

[M.Sc. thesis by Alan Chavoshe]

• The Land Portal is an easy access, easy-to-use platform to share land related information, to monitor trends, and identify information gaps to promote effective and sustainable land governance.

Page 95: Ld4d 2013 part 2

Nichesourcing for pluvial data digitization for the Sahel

[M.Sc. thesis by Binyam Tesfa]

Page 96: Ld4d 2013 part 2

DBpedia

GeoNames

Agrovoc

Inst. of Development Studies LOD

Citizen Journalism data

Sahel Pluvial data

SemanticXO

RadioMarché Linked market data

Linked Data for Development (LD4D)

IATI data

Page 97: Ld4d 2013 part 2

Infrastructure

Interface Relevancy

Page 98: Ld4d 2013 part 2

Take home

• Knowledge sharing is a tool for development

• Linked Data is well-suited because of
  – Language- and interface-agnostic characteristics
  – Decentralizability
  – Reusability outside of original context

• Downscaling
  – Interface
  – Infrastructure
  – Relevancy

Img: flickr/TomJByrne

Page 99: Ld4d 2013 part 2

What we need from you?

• Data
• Cases
  – Transparency, Governance, Democracy
  – Economic development, Healthcare

• Reflection
  – Ethics of ICT4D

• Open Data
• Linked Data

Img: flickr/wetwebwork

Page 100: Ld4d 2013 part 2

More information?

http://worldwidesemanticweb.org

http://w4ra.org

http://iati2lod.appspot.com/

http://[email protected]

Page 101: Ld4d 2013 part 2
Page 102: Ld4d 2013 part 2
Page 103: Ld4d 2013 part 2

The Tabale Platform

NGO staff

Malian French

Bomu

Bambara

Record multiple messages in different languages

yes

no

don’t know

Page 104: Ld4d 2013 part 2

VUI design (three languages)