4/23/18
An Introduction
Linked Data: 1 2 3 4 5
1 - One goal
2 - Two types of questions
3 - RDF triples
4 - Four principles
5 - Five-star LOD
Learn by Understanding
Learn by Analyzing
Marcia Zeng, 2018 DIS
5 - Five-Star LOD ★★★★★
Sir Tim Berners-Lee, the inventor of the WWW and the initiator of Linked Data, presented a Star Scheme for measuring the rank of a dataset.
https://www.w3.org/DesignIssues/LinkedData.html
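The star scheme is cumulative: a dataset earns a star only if it also meets every criterion below it. A minimal Python sketch of that idea follows. The class and field names are hypothetical, chosen for illustration only; the criteria in the comments paraphrase the W3C note linked above.

```python
from dataclasses import dataclass


@dataclass
class DatasetProfile:
    """Hypothetical description of a published dataset (names are illustrative)."""
    open_license: bool         # 1 star: on the Web under an open license
    structured: bool           # 2 stars: machine-readable structured data
    non_proprietary: bool      # 3 stars: non-proprietary format (e.g., CSV)
    uses_uris_and_rdf: bool    # 4 stars: W3C open standards identify things
    links_to_other_data: bool  # 5 stars: links to other people's data


def lod_stars(profile: DatasetProfile) -> int:
    """Count stars cumulatively, stopping at the first unmet criterion."""
    criteria = [
        profile.open_license,
        profile.structured,
        profile.non_proprietary,
        profile.uses_uris_and_rdf,
        profile.links_to_other_data,
    ]
    stars = 0
    for met in criteria:
        if not met:
            break
        stars += 1
    return stars


# An open CSV file that does not use URIs/RDF stops at three stars,
# even if it happens to contain links to other data.
open_csv = DatasetProfile(True, True, True, False, True)
print(lod_stars(open_csv))
```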
Learn by Analyzing
Cases: Using LOD in the LAMs*
1. Special Collections, Archives
   a. Linked Jazz
   b. Online Coins of the Roman Empire (OCRE)
2. Bibliographic data
   a. WorldCat
   b. The British National Bibliography (BNB)
3. Knowledge organization systems (KOS) - thesauri, name authorities, and others
   a. FAST
   b. Getty Vocabs
4. Digital Scholarships - VIVO based - Scholars@Cornell
*LAMs = Libraries, archives, and museums
1. Special Collections, Archives
a. Linked Jazz
The project focuses on digitized archives of jazz history to expose relationships between musicians and reveal their community's network.
http://linkedjazz.org/
http://linkedjazz.org/network/
Methodology Summary:
• A natural language processing tool pulls excerpts that mention a relationship with another jazz musician from transcripts of interviews with jazz musicians.
• After synonyms were controlled and ambiguity was eliminated, the musician names were mapped to DBpedia, and data about each person was obtained.
• The relationships were represented based on an ontology.
• A visualization tool was used to present a unique interactive interface.
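The name-mapping and relationship steps above can be sketched in Python. This is a toy illustration under stated assumptions, not the Linked Jazz project's own code: the lookup table stands in for the synonym-control and disambiguation step, and the predicate URI is a placeholder rather than a term from the project's ontology.

```python
# Hypothetical lookup table: normalized name variants -> DBpedia resource URI.
# In the real project this mapping came out of synonym control and
# disambiguation; here it is hard-coded for illustration.
NAME_TO_URI = {
    "mary lou williams": "http://dbpedia.org/resource/Mary_Lou_Williams",
    "duke ellington": "http://dbpedia.org/resource/Duke_Ellington",
    "ellington": "http://dbpedia.org/resource/Duke_Ellington",  # variant form
}

# Placeholder predicate; the actual ontology terms are not given in the slides.
KNOWS_OF = "http://example.org/rel/knowsOf"


def resolve(name):
    """Normalize case and whitespace, then look the name up; None if unknown."""
    return NAME_TO_URI.get(" ".join(name.lower().split()))


def mentions_to_triples(interviewee, mentioned_names):
    """Turn 'interviewee mentions X' observations into de-duplicated triples."""
    subject = resolve(interviewee)
    if subject is None:
        return []
    triples = []
    for name in mentioned_names:
        obj = resolve(name)
        if obj is None or obj == subject:
            continue  # skip unresolved names and self-references
        triple = (subject, KNOWS_OF, obj)
        if triple not in triples:
            triples.append(triple)
    return triples


triples = mentions_to_triples(
    "Mary Lou Williams", ["Ellington", "Duke  Ellington", "Unknown Person"]
)
for t in triples:
    print(t)
```

Both spellings of Ellington resolve to the same URI, so only one triple is produced. This is the "linking things, not strings" effect the mapping step is after.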
Linked Jazz and Carnegie Hall (In development)
• Visualized result in Gephi. (Data mashup between Linked Jazz musicians and Carnegie Hall performers.)
http://pfch.nyc/linked_jazz_meets_carnegie_hall/CH-LJ_network/index.html#Mary%20Lou%20Williams
(cont.) 1. Special Collections, Archives
b. Online Coins of the Roman Empire (OCRE)
http://numismatics.org/ocre/
1.b OCRE
• Ontological classes -- browse
• Modeling in an ontology (formed in classes, properties, relationships);
• Following Linked Data principles;
• Using RDF triples for entities;
• Querying in SPARQL language.
1.b OCRE
http://numismatics.org/ocre/id/ric.6.lon.66
For an individual object, a user can find auto-generated data related to it, the map(s), and quantitative analysis.
1.b OCRE
• Visualize your queries on-the-fly. How? http://wiki.numismatics.org/numishare:visualize
• Using SPARQL queries to find;
• Auto-visualizing;
• A user does not need to see or use SPARQL language.
1.b OCRE
• Interact with a map
http://numismatics.org/ocre/
Online Coins of the Roman Empire (OCRE)
For more information
• Webinar: Ethan Gruber: From 0 to 60 on SPARQL queries in 50 minutes, May 13, 2015. Watch on YouTube: https://www.youtube.com/watch?v=3YhG5QQmhvU
• Ethan Gruber's webpage http://numismatics.org/ethangruber/
  – Where you can connect to his GitHub, https://github.com/ewg118
  – SPARQL queries https://gist.github.com/ewg118
• A final report submitted to the funder, NEH, 2017
  – http://www.dayofarchaeology.com/final-report-to-the-neh-for-online-coins-of-the-roman-empire/
Case 2.a WorldCat
Pride and Prejudice <http://www.worldcat.org/oclc/246662790>
  schema:creator
creator <http://viaf.org/viaf/102333412>
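The diagram on this slide is one RDF triple: the WorldCat work is the subject, schema:creator is the predicate, and the VIAF identifier is the object. As a small sketch, the same statement can be written out as an N-Triples line in Python. The helper name is hypothetical, and it handles URI objects only, not literals.

```python
# schema:creator expanded to its full URI in the schema.org namespace.
SCHEMA_CREATOR = "http://schema.org/creator"


def to_ntriple(subject, predicate, obj):
    """Serialize a triple of three URIs as one N-Triples line."""
    return f"<{subject}> <{predicate}> <{obj}> ."


work = "http://www.worldcat.org/oclc/246662790"  # Pride and Prejudice
creator = "http://viaf.org/viaf/102333412"       # the creator's VIAF URI

line = to_ntriple(work, SCHEMA_CREATOR, creator)
print(line)
```

Because both ends of the statement are URIs rather than name strings, any other dataset that uses the same VIAF URI is automatically talking about the same creator.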
Case 2.b The British National Bibliography (BNB) -- uses its own ontology
About:
• The British Library is the national library of the UK and is responsible for distributing metadata describing its collections and recording UK publishing output in the British National Bibliography (BNB) http://bnb.data.bl.uk.
• In 2011, the British Library began publishing a LOD version of the BNB as part of its open metadata strategy. The move to LOD BNB proved influential among the library community in moving the Linked Data 'debate' from theory to practice.
• The LOD BNB has continued to evolve with regular monthly updates, the inclusion of new links (e.g. to the ISNI) and content (e.g. serials).
Try:
• Go to its Flint SPARQL Endpoint at: http://bnb.data.bl.uk/flint-sparq
• Use the sample queries to see examples.
• Try to form your own queries and get different datasets.
Read:
• How to get the bulk download http://www.bl.uk/bibliographic/download.html
1. Query text for "Which titles by detective writer Ian Rankin appear in the BNB?"
SELECT * WHERE { ?s ?p ?o }
2. Results in plain text. Other output options are XML and JSON.
References: Lists of all classes, properties, and prefixes of the metadata vocabularies used by BNB.
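The SELECT line on the slide is only the generic "match everything" pattern. A query for the Ian Rankin question might look like the sketch below, built as a string in Python. The choice of dct:creator, dct:title, and foaf:name is an assumption about how BNB models authors and titles; check it against the endpoint's own sample queries and the vocabulary lists in the References before relying on it.

```python
def titles_by_author_query(author_name, limit=50):
    """Build a SPARQL query for titles by a named author.

    Assumption: works link to authors with dct:creator, carry dct:title,
    and authors carry foaf:name. These property choices are illustrative.
    """
    # Escape backslashes and double quotes for a SPARQL string literal.
    escaped = author_name.replace("\\", "\\\\").replace('"', '\\"')
    return f"""PREFIX dct: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>

SELECT ?book ?title WHERE {{
  ?book dct:creator ?author ;
        dct:title ?title .
  ?author foaf:name "{escaped}" .
}}
LIMIT {limit}"""


query = titles_by_author_query("Ian Rankin")
print(query)
```

The printed text can be pasted into the endpoint's query box. Matching on a name string is the weakest link here; once the author's URI is known, using it directly in place of the foaf:name pattern is more reliable.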
3. KOS a. FAST
Source: extracted screenshots (2017-07-12) from http://fast.oclc.org/searchfast/
3. KOS a. FAST
John F. Kennedy's entry in FAST is enriched with other sources.
• The DBpedia identifiers allow FAST terms to include detailed information that is usually excluded in authority records.
• The VIAF URI allows FAST terms to take advantage of all of the various string values included in VIAF without having to manually include the values in the RDF triples for the specific term.
Source: extracted screenshots (2017-07-12) at http://experimental.worldcat.org/fast/35588/rdf.xml
3. KOS a. FAST
The GeoNames data is used to power MapFAST, which is a Google Maps mash-up.
Image source: Captured Aug. 2017. http://experimental.worldcat.org/mapfast/
3. KOS b. Getty Vocabs
Getty Vocabs LOD: AAT, TGN, ULAN, [CONA]
http://vocab.getty.edu
http://vocab.getty.edu/sparql
http://vocab.getty.edu/queries
We will try them at the hands-on session.
3.b. Getty Vocabs
ULAN: Union List of Artist Names
Name authorities offer foundational structured data for network analyses.
What kinds of query examples?
At the query templates page (http://vocab.getty.edu/queries), find the section for ULAN. There are many interesting query examples.
ULAN
How to use a template?
(1) Choose a query (left), e.g., #5.2; the template box will show up (lower-right).
(2) At the template box's upper-right corner, click on the SPARQL sign; the query will automatically jump up to the Query box (top).
(3) Submit.
ULAN
The result: "all associative relationships of ulan:500115493 Duerer, Albrecht" (showing a portion of the results).
TGN: Thesaurus of Geographic Names (TGN)
Browse the examples of queries: http://vocab.getty.edu/queries#Top-level_Subjects
You can obtain special RDF graphs or datasets for very complicated questions, and reveal unknown relationships.
KOS in LOD become knowledge bases for research.
TGN
E.g., Look for castles around The Netherlands (within the boundary of 50.787185 3.389722 53.542265 7.169019)
Steps:
(1) Choose the #4.18 query.
(2) Click on the SPARQL sign in the 4.18 template box. The query should then be loaded automatically into the top box.
(3) Submit.
Note: Since this is a complicated query, it will run for a few seconds. Be patient.
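The boundary in this example is a latitude/longitude bounding box. The Python sketch below shows the test that such a query applies to each place. It assumes the four numbers are ordered south latitude, west longitude, north latitude, east longitude, which fits the Netherlands but is an interpretation of the slide, and the sample coordinates are approximate values chosen for illustration.

```python
from typing import NamedTuple


class BBox(NamedTuple):
    """Bounding box in decimal degrees (assumed order: S, W, N, E)."""
    south: float
    west: float
    north: float
    east: float

    def contains(self, lat, lon):
        """True if the point lies inside the box (edges included).

        Boxes that cross the 180-degree meridian are not handled.
        """
        return (self.south <= lat <= self.north
                and self.west <= lon <= self.east)


# The boundary used in the castle example.
NETHERLANDS = BBox(50.787185, 3.389722, 53.542265, 7.169019)

# Approximate coordinates, for illustration only.
places = {
    "castle near Amsterdam": (52.334, 5.071),
    "castle near London": (51.484, -0.604),
}

for name, (lat, lon) in places.items():
    print(name, NETHERLANDS.contains(lat, lon))
```

The second place falls inside the latitude range but outside the longitude range, which is why both coordinates have to be checked.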
TGN
E.g., Look for castles around The Netherlands
(4) Download the dataset in a selected format. The best way is to download the CSV file.
(5) You should either keep the query in your CSV file or make a note of what you searched for and in which boundary.
Finished.
Optional:
(6) Click on any castle's ID and open the single data record for this concept.
(7) Click on the Website to see its normal HTML view.
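One way to follow step (5) is to write the search note as a comment line at the top of the CSV file. The Python sketch below does this in memory. The column names and the sample row are hypothetical, not the actual columns of the TGN download.

```python
import csv
import io


def csv_with_note(rows, fieldnames, note):
    """Return CSV text whose first line is a '#' comment recording the search."""
    buf = io.StringIO()
    buf.write(f"# {note}\n")
    writer = csv.DictWriter(buf, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Hypothetical columns and values, for illustration only.
rows = [{"place": "ex:castle-1", "name": "Example Castle", "lat": 52.3, "long": 5.1}]
text = csv_with_note(
    rows,
    ["place", "name", "lat", "long"],
    "TGN query 4.18: castles within 50.787185 3.389722 53.542265 7.169019",
)
print(text)
```

A leading comment line is not part of the CSV format itself, so tools reading the file need to skip it; keeping the note in a separate text file next to the CSV avoids that at the cost of a second file.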
TGN
E.g., Query a specific place type (e.g., World Heritage Sites) in a geographic boundary. Get the results and downloadable datasets:
World Heritage Sites within (24.75083 28.95778 43.80722 108.92861) around the Silk Road.
(cont.) 3. KOS b. Getty Vocabs
AAT: Art and Architecture Thesaurus (AAT)
Create microthesauri or pick lists from the Getty LOD Vocabularies
Use a <Guide Term> to obtain all concept URIs and preferred terms in the hierarchies (for a microthesaurus or a pick list) in <xyz>
Microthesaurus = designated subset of a thesaurus that is capable of functioning as a complete thesaurus. -- ISO 25964-2:2013
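Extracting a microthesaurus amounts to walking the hierarchy downward from the chosen guide term and collecting each concept's URI and preferred term. A minimal Python sketch of that traversal follows. The small hierarchy, the ex: identifiers, and the labels are all hypothetical stand-ins for data that would come from the AAT.

```python
from collections import deque

# Hypothetical hierarchy: each term maps to its direct narrower concepts.
NARROWER = {
    "ex:guide-term": ["ex:concept-a", "ex:concept-b"],
    "ex:concept-a": ["ex:concept-a1"],
}

# Hypothetical preferred terms for each concept.
PREF_LABEL = {
    "ex:concept-a": "concept A",
    "ex:concept-b": "concept B",
    "ex:concept-a1": "concept A1",
}


def microthesaurus(guide_term):
    """Collect (URI, preferred term) for every concept below the guide term.

    Breadth-first, so concepts are listed level by level; each concept is
    listed once even if it is reachable through more than one parent.
    """
    seen = set()
    result = []
    queue = deque(NARROWER.get(guide_term, []))
    while queue:
        uri = queue.popleft()
        if uri in seen:
            continue
        seen.add(uri)
        result.append((uri, PREF_LABEL.get(uri, "")))
        queue.extend(NARROWER.get(uri, []))
    return result


pick_list = microthesaurus("ex:guide-term")
for uri, label in pick_list:
    print(uri, label)
```

The guide term itself is left out of the result because guide terms organize the hierarchy rather than serve as indexing terms; include it if the pick list should show the grouping.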
Scholars@Cornell
https://scholars.cornell.edu/
Scholars@Cornell offers integrated and individualized profiles about:
• faculty,
• institute units,
• research domains,
• collaborating networks, and
• academic outcomes.
E.g., Choose one faculty member or researcher
For example, when looking for an information science researcher, I found a professor, Susan R. Fussell.
• What does this profile tell us about this person?
• How are things connected?
"Co-Authors" "Co-Investigators"
• https://scholars.cornell.edu/
All interactive. Data values in various ontological classes are connected and integrated.
VIVO (not an acronym) is a well-known ontology-based scholarly networking and discovery tool for managing information and knowledge in large institutions and associations, as demonstrated by VIVO-powered websites, e.g.,
• Scripps Research Institute,
• U.S. Department of Agriculture,
• UNAVCO, and
• many others - see registry: http://duraspace.org/registry/vivo
• Other VIVO-based sites also offer a Map of Science.
Summary
The changing concepts
• (seen from the content)
  – From "Web of Documents" to "Web of Data"
  – From linking strings to linking things
  – From digitization to datalization
• (seen from the results)
  – From "On the Web" to "Of the Web"
What is Linked Data?
• -- is a term used to describe a method of exposing, sharing, and connecting data on the Web using URIs and RDF
• -- is about:
  • using the Web to connect related data that was not previously linked,
  • using the Web to lower the barriers to linking data currently linked using other methods. [1]
[1] http://linkeddata.org/