Top Banner
ISICIL semantic and social intraweb for corporate intelligence and watch ANR project CONTINT 2009-2011 Fabien Gandon, http://fabien.info Leader Wimmics research team (INRIA, CNRS, Univ. Nice) W3C AC Rep. for INRIA
95
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: semantic and social intraweb for corporate intelligence and watch

ISICIL semantic and social intraweb for corporate intelligence and watch

ANR project CONTINT 2009-2011

Fabien Gandon, http://fabien.info

Leader Wimmics research team (INRIA, CNRS, Univ. Nice)

W3C AC Rep. for INRIA

Page 2: semantic and social intraweb for corporate intelligence and watch

Social web

Semantic web

Linked data

Social semantic web

ISICIL: semantic social intraweb

Page 3: semantic and social intraweb for corporate intelligence and watch

isicil.inria.fr

• enterprise social networking

• business intelligence, watching, monitoring

• communities of interest, of practice, of experts

Page 4: semantic and social intraweb for corporate intelligence and watch

semantic intrawebs & social intrawebs

Page 5: semantic and social intraweb for corporate intelligence and watch

reconcile latest viral applications of the web with formal models and business processes

new tools to support business intelligence and technological watch

interfaces of web 2.0 app. for interaction (blog, wikis, social bookmarking, feeds, etc.)

semantic web formalisms and processing

social epistemology as theoretical framework

ISICIL

Page 6: semantic and social intraweb for corporate intelligence and watch

consortium

INRIA - Wimmics team (leader)

CNRS - I3S – KEWI

Telecom ParisTech

UTT - Tech-CICO

ADEME

Orange Labs R&D

Page 7: semantic and social intraweb for corporate intelligence and watch

proposed overview… integrating requirement analysis methods

examples of challenges and derived functionalities

overview of this open-source platform

http://isicil.inria.fr

Page 8: semantic and social intraweb for corporate intelligence and watch

MERGING METHODOLOGIES extracts of the requirement analysis and specifications

Page 9: semantic and social intraweb for corporate intelligence and watch

analyze and model key business processes

Analyze interactions between members of the group ADEME « roadmap for urban mobility »

campaigns of questionnaires at Orange Labs

trend analysis of intelligence market and watch

comparison of the APIs, widgets and other applications

usage analysis an specification

Page 10: semantic and social intraweb for corporate intelligence and watch

ex. study of transformations: existing target

Page 11: semantic and social intraweb for corporate intelligence and watch

shared online referential

ARIS portal for ISICIL: http://aris2.utt.fr:9090/businesspublisher/

Page 12: semantic and social intraweb for corporate intelligence and watch

convergence matrix

detections of needs or redundancies in key scenarios

Etapes des scénarios Fonctionnalités identifiées Fonctions SI

Présenter problématique au SVIC Mailing, Q&A Envoyer

Demander ce qui est incontournable et ce que font les autres ingénieurs

Consultation d’experts Extraire, filtrer

Prendre en compte demande Workflow, Outils de collaboration communiquer

Préparer requêtes Moteur de recherche, équation de

recherche

rechercher

Recueillir résultats Abonnement, push… Extraire, annoter

Vérifier pertinence des résultats Analyses, outils de filtrage filtrer

Informer l'ingénieur Messagerie électronique, chat,

vidéo-conférence

envoyer

S'approprier les résultats et les requêtes

Equation de recherche, profil, tags Annoter, organiser

Devenir le destinataire des alertes

Diffusion par profil diffuser

Page 13: semantic and social intraweb for corporate intelligence and watch

business intelligence market analysis

Page 14: semantic and social intraweb for corporate intelligence and watch

proposing functionalities & prototypes

Prioritization of functionalities Frequent functionalities and dependencies

Page 15: semantic and social intraweb for corporate intelligence and watch

EXAMPLES OF FONCTIONNALITIES examples of challenges and implementations (web 3.0 & enterprise 3.0)

Page 16: semantic and social intraweb for corporate intelligence and watch
Page 17: semantic and social intraweb for corporate intelligence and watch

a tag a data attached to an object

geometry

Page 18: semantic and social intraweb for corporate intelligence and watch

collaboratively create and manage tags to annotate and categorize content

SOCIAL TAGGING

Page 19: semantic and social intraweb for corporate intelligence and watch
Page 20: semantic and social intraweb for corporate intelligence and watch

a crowd of users creating massive categorizations

Page 21: semantic and social intraweb for corporate intelligence and watch

assited structuring of folksonomies

flat folksonomies web 2.0

[Limpens et al.]

pollution

soil pollution

has narrower

pollutant energy

related related

thesaurus

?

SKOS

Page 22: semantic and social intraweb for corporate intelligence and watch

global giant graph link users, actions, knowledge, resources, groups, etc.

#Freddy

#bk81

hasBookmark

hasTag

#tag27

industry

hasLabel

#Fabien

#bk34

#tag92

industries

hasBookmark

hasLabel hasTag

Page 23: semantic and social intraweb for corporate intelligence and watch

folksonomies → ontologies contributions…

… [Mika, 2005] hierarchies / community inclusion.

… [Heymann et al., 2006] hierarchies / centrality in graph Tag-Ressource

… [Schmitz, 2006] hierarchies / conditional probabilies & co-occurrence

… [Cattuto et al., 2008] [Markines et al., 2009] different metrics

… [Specia et al., 2007] [Begelman et al., 2006] clustering de tags

variations around metrics & space (tag-resource-user).

[Limpens et al ]

Page 24: semantic and social intraweb for corporate intelligence and watch

folksonomies + ontologies contributions…

... [Gruber, 2005] [Tanasescu et al., 2007] tagging tags

… [Specia et al., 2007][Cattuto et al., 2008][Giannakidou et al., 2008] [Ronzano et al., 2008] [Tesconi et al., 2008] automated structuring using external linguistic resources.

... [Good et al., 2007] manual disambiguation referencing a vocabulary

… [Passant et al., 2007] manual disambiguation referencing a thesaurus

… [Huynh-Kim Bang et al. , 2008] structured tagging “Paris<France”

[Limpens et al ]

Page 25: semantic and social intraweb for corporate intelligence and watch

ontologies → folksonomies contributions…

… [Gruber, 2005] [Newman et al., 2005] ontology of the tagging act

… [Breslin et al., 2005] SIOC resources shared on social web sites

… [Kim et al., 2007] SCOT representing tags and their cloud

… [Passant et al., 2008] MOAT, associating a meaning to a tag.

[Limpens et al ]

Page 26: semantic and social intraweb for corporate intelligence and watch

SoA… you are here Computed Tag

similarity Tag-Concept

mapping Users' contrib.

Sem-Web formalism

Multi-points of view

Angeletou et al. (2008)

✓ ✓ ✓

Huynh-Kim Bang et al. (2008)

✓ ✓

Passant & Laublet(2008)

✓ ✓ ✓

Lin & Davis (2010) ✓ ✓ ✓ ✓

Braun et al. (2007) ✓ ✓

Limpens et al. (2010)

✓ ✓ ✓ ✓

Page 27: semantic and social intraweb for corporate intelligence and watch

pollution

pollutant pollution

pollution pollution pollution pollution Soil pollutions

edition distances

Page 28: semantic and social intraweb for corporate intelligence and watch

0

0,1

0,2

0,3

0,4

0,5

0,6

0,7

0,8

0,9

1

Close match Aire sous f cm_A

Tag1 broader than Tag2 Aire sous f t1bt2_A

Related Aire sous f rel_A

evaluating distances c.f. [Limpens et al.]

Page 29: semantic and social intraweb for corporate intelligence and watch

determine thesaurus relations

Comparison of the mean value of the JaroWinkler metric for each type of semantic relation

Mean value of the difference s(t1,t2) - s(t2,t1) with s being the Monge-Elkan QGram metric for each set of tag pairs.

Page 30: semantic and social intraweb for corporate intelligence and watch

Node size ↔ InDegree

◉ tags (delicious + thesenet)

◉ svic keywords

examples of results

Page 31: semantic and social intraweb for corporate intelligence and watch

tag1 tag2 tag3

tag1 freq (tag1) cooc (tag1, tag2) cooc (tag1, tag3)

tag2 cooc (tag2, tag1) freq (tag2) cooc (tag2, tag3)

tag3 cooc (tag3, tag1) cooc (tag3, tag2) freq (tag3)

contextual distance: co-occurrence vector cosine distance to detect related tags

21

2121,cos

tagtag

tagtagtagtag

[Cattuto et al. 2008]

Page 32: semantic and social intraweb for corporate intelligence and watch

example of results CADIC, ADEME

Page 33: semantic and social intraweb for corporate intelligence and watch

environment

interest comunity inclusion detecting narrower tags [Mika et al.]

agriculture

12 ,,21

tagrhasnarrowetaguseruser tagtag

Page 34: semantic and social intraweb for corporate intelligence and watch

Arrows mean "has broader" thickness ≈ weight

examples of results del.icio.us

Page 35: semantic and social intraweb for corporate intelligence and watch

combining metrics edition distances Monge-Elkan Soundex, JaroWinkler,

asymmetry Monge-Elkan Qgram

contextual metric cosinus vector co-occurring tags

social metrics inclusion of communities of interest

football

sport

+

+

Page 36: semantic and social intraweb for corporate intelligence and watch

83 027 relations / 9 037 tags

68 633 related

11 254 hyponyms

3 193 spelling variants

Page 37: semantic and social intraweb for corporate intelligence and watch

structuring as a side effect

Page 38: semantic and social intraweb for corporate intelligence and watch

handling conflicts arbitration rules

IF num(narrower)/num(broader) ≥ c

THEN narrower/broader

ELSE related

purely automatic

conflicting

arbitrated conflict

debated

consensual

Page 39: semantic and social intraweb for corporate intelligence and watch

folksonomy enrichment lifecycle

ADDING TAGS

Automatic processing

User-centric structuring

Detect conflicts

Global structuring

Flat folksonomy

Structured folksonomy

[Limpens et al.]

Page 40: semantic and social intraweb for corporate intelligence and watch
Page 41: semantic and social intraweb for corporate intelligence and watch

social networks networking is not that new e.g. commerce

social network analysis beginning of the 20th century

Page 42: semantic and social intraweb for corporate intelligence and watch

800 millions

Chine: 1 600 millions

Inde: 1 200 millions

acebook

Page 43: semantic and social intraweb for corporate intelligence and watch

Graphs, graphs, graphs

Fabien owner

author

Researcher

type

doc.html author

Semantic web is not antisocial

Adult

Researcher

sub property sub class

semantic web

title

Fabien

Marco Guillaume

Nicolas

Michel

Rémi

social network analysis

),(;)( pxrelxpdin

4)( Guillaumedin

owner

Adult

type

Page 44: semantic and social intraweb for corporate intelligence and watch

semantic social network analysis contributions…

… [Goldbeck et al 2003] propagating trust

… [Finin et al 2005] power law of degrees & community struct

… [Paolillo et al 2006] classical SNA on FOAF from LiveJournal

… [Goldbeck et Rothstein 2008] merging FOAF profiles

… [Anyanwu et al 2007] [Kochut et al 2007] [Corby et al 2004] [Corby 2008] [Baget et al, 2007] paths in SPARQL

… [Ereteo et al 2009] type-parameterized SNA and SemTagP

… [Rowe et al. 2011] User Behaviour in Online Communities

[Ereteo et al ]

Page 45: semantic and social intraweb for corporate intelligence and watch

Directed

networks

Weighted

networks

Labelled

network

Parametrized

operators Network size

Graph Theory ✔ ✔ ✔ 106 nodes

107 edges

[Brandes 2009] ✔ ✔ ✔ 104 nodes

[Paolillo & Wright

2006] ✔ ✔ ~ 104 nodes

~ 105 edges

[San Martin &

Gutierrez 2009] ✔ ✔ ~ 104 nodes

~ 104 - 105

edges

SEMSNA ✔ … ✔ ✔ 104 nodes

~ 105 edges

[Erétéo et al.]

Page 46: semantic and social intraweb for corporate intelligence and watch

parent sibling

mother father brother sister

colleague

knows Gérard

Fabien

Mylène

Michel

Yvonne

<family> (guillaume)=5 d (guillaume)=3 guillaume

c.f. [Erétéo et al.]

Page 47: semantic and social intraweb for corporate intelligence and watch

eg. typed proximity centrality

select distinct ?y ?to

pathLength($path) as ?length

(1/sum(?length)) as ?centrality

where{

?y s (foaf:knows*/rel:worksWith)::$path ?to

}group by ?y

1

GEx

worksWithknows

c

worksWithknows xkglengthkC ,/*/*

Page 48: semantic and social intraweb for corporate intelligence and watch

CORESE/ KGRAM [Corby et al.]

Page 49: semantic and social intraweb for corporate intelligence and watch

e.g.

Page 50: semantic and social intraweb for corporate intelligence and watch
Page 51: semantic and social intraweb for corporate intelligence and watch

ipernity.com dataset in RDF

61 937 actors & 494 510 relationships –18 771 family links between 8 047 actors –136 311 friend links implicating 17 441 actors –339 428 favorite links for 61 425 actors etc.

c.f. [Erétéo et al.]

Page 52: semantic and social intraweb for corporate intelligence and watch

some interpretations validated with managers of ipernity.com

friendOf, favorite, message, comment

small diameter, high density

family as expected: large diameter, low density

favorite: highly centralized around Ipernity animator.

friendOf, family, message, comment: power law of

Page 53: semantic and social intraweb for corporate intelligence and watch

some interpretations existence of a largest component in all sub networks "the effectiveness of the social network at doing its job" [Newman 2003]

0

10000

20000

30000

40000

50000

60000

70000

number actors size largest component

knows

favorite

friend

family

message

comment

Page 54: semantic and social intraweb for corporate intelligence and watch

e.g. of results: different key actors for different kinds of links

c.f. [Erétéo et al.]

Page 55: semantic and social intraweb for corporate intelligence and watch

PERFORMANCES & LIMITS Knows 0.71 s 494 510

Favorite 0.64 s 339 428

Friend 0.31 s 136 311

Family 0.03 s 18 771

Message 1.98 s 795 949

Comment 9.67 s 2 874 170

Knows 20.59 s 989 020

Favorite 18.73 s 678 856

Friend 1.31 s 272 622

Family 0.42 s 37 542

Message 16.03 s 1 591 898

Comment 28.98 s 5 748 340

Shortest paths used

to calculate

Knows Path length <= 2: 14m 50.69s

Path length <= 2: 2h 56m 34.13s

Path length <= 2: 7h 19m 15.18s

100 000

1 000 000

2 000 000

Favorite Path length <= 2: 5h 33m 18.43s 2 000 000

Friend Path length <= 2: 1m 12.18 s

Path length <= 2: 2m 7.98 s

1 000 000

2 000 000

Family Path length <= 2 : 27.23 s

Path length <= 2 : 2m 9.73 s

Path length <= 3 : 1m 10.71 s

Path length <= 4 : 1m 9.06 s

1 000 000

3 681 626

1 000 000

1 000 000

)(GComp rel

)(, yD rel 1

)(bC relb

time projections

Page 56: semantic and social intraweb for corporate intelligence and watch

SEMSNA SCHEMA annotating the networks with analysis results

high centrality

Page 57: semantic and social intraweb for corporate intelligence and watch

SemSNA an ontology of SNA

http://ns.inria.fr/semsna/

Page 58: semantic and social intraweb for corporate intelligence and watch

4

Philippe colleague

2

colleague

supervisor

Degree

Guillaume

Gérard

Fabien

Mylène

Michel

Yvonne

Ivan Peter

example of SemSNA

Page 59: semantic and social intraweb for corporate intelligence and watch

ADD {

?y semsna:hasInDegree _:b0

_:b0 semsna:forProperty param[type]

_:b0 rdf:value ?indegree

_:b0 semsna:hasLength param[length]

}

SELECT ?y count(?x) as ?indegree {

?x $path ?y

filter(match($path, star(param[type])))

filter(pathLength($path)<= param[length])

} group by ?y

parameterized in-degree

)(dolengthtype, yin

[PhD Guillaume Erétéo]

Page 60: semantic and social intraweb for corporate intelligence and watch

conceptual and software framework for a semantic analysis of social networks using semantic web frameworks

Page 61: semantic and social intraweb for corporate intelligence and watch

gro

up

s &

re

aso

ns

Page 62: semantic and social intraweb for corporate intelligence and watch

hierarchical algorithms

output dendrograms of larger and larger communities from top to bottom.

• agglomerative algorithms [Donetti & Munoz 2004] [Zhou & Lipowsky 2004] [Xu et al 2007] [Newman 2004]

• divisive algorithms [Girvan & Newman 2002] [Radicchi et al 2004]

[Eretéo et al., 2011]

Page 63: semantic and social intraweb for corporate intelligence and watch

heuristic based algorithms

• similarity with electrical networks [Wu 2004]

• random walk [Dongen 2000] [Pons et al 2005]

• label propagation [Raghavan et al 2007]

[Eretéo et al., 2011]

Page 64: semantic and social intraweb for corporate intelligence and watch

detecting AND labeling communities

?

?

Page 65: semantic and social intraweb for corporate intelligence and watch

propagating tags to discover communities of interest

Page 66: semantic and social intraweb for corporate intelligence and watch

tags to detect and label communities

extension of algorithm RAK/LP : from random labels to structured tags

salt, water

pepper, wine

mustard

rugby, foot

foot, movie

hockey sport sport

sport

condiment

condiment condiment

[Eretéo et al., 2011]

Page 67: semantic and social intraweb for corporate intelligence and watch

experimented algorithm 1. Algorithm SemTagP(RDFGraph network, Type relation)

2. DO

3. old_network = network

4. FOREACH user in network.users

5. user.tag = mostUsedNeighborTag(user, relationType)

6. END FOREACH

7. WHILE modularity(network) > modularity(old_network)

8. RETURN old_network

[Eretéo et al., 2011]

inject semantics here

Page 68: semantic and social intraweb for corporate intelligence and watch

semantic tag propagation exploit folksonomy for label assignment

a d f

e

g c

b

wiki mobile

[Eretéo et al., 2011]

sweetwiki

mobile inria

mobile

Page 69: semantic and social intraweb for corporate intelligence and watch

semantic tag propagation apply social pressure of RAK/LP

a d f

e

g c

b

wiki mobile

[Eretéo et al., 2011]

sweetwiki

mobile inria

mobile

Page 70: semantic and social intraweb for corporate intelligence and watch

semantic tag propagation take thesaurus into account in propagating

a d f

e

g c

b

wiki mobile

[Eretéo et al., 2011]

sweetwiki

mobile inria

mobile

wiki

sweetwiki mediawiki

skos:narrower

Page 71: semantic and social intraweb for corporate intelligence and watch

semantic tag propagation take thesaurus into account in propagating

a d f

e

g c

b

wiki mobile

[Eretéo et al., 2011]

sweetwiki

wiki inria

mobile

wiki

sweetwiki mediawiki

skos:narrower

Page 72: semantic and social intraweb for corporate intelligence and watch

semantic tag propagation etc. leading to 2 communities

a d f

e

g c

b

wiki mobile

[Eretéo et al., 2011]

wiki

wiki mobile

mobile

Page 73: semantic and social intraweb for corporate intelligence and watch

applied to Ademe Ph.D. network 1 853 agents

1 597 academic supervisors

256 ADEME engineers.

13 982 relationships

10 246 rel:worksWith

3 736 rel:colleagueOf

6 583 tags

3 570 skos:narrower relations between 2 785 tags

Page 74: semantic and social intraweb for corporate intelligence and watch

-1,2

-1,1

-1

-0,9

-0,8

-0,7

-0,6

-0,5

-0,4

-0,3

-0,2

-0,1

0

0,1

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

R AK

T agP

S emT agP

controlled S emT agP (env)

controlled S emT agP (env, energ)

controlled S emT agP (env, energ, model)

MODULARITY COMPARISONS X axis: propagation iterations, Y axis: modularity

Page 75: semantic and social intraweb for corporate intelligence and watch

results 1. pollution

2. sustainable

development

3. energy

4. chemistry

5. air pollution

6. metals

7. biomass

8. wastes

75

Page 76: semantic and social intraweb for corporate intelligence and watch

controled abstraction and merge

lead

Page 77: semantic and social intraweb for corporate intelligence and watch

PLATFORM AND PROTOTYPES overview of the architecture

Page 78: semantic and social intraweb for corporate intelligence and watch

a web 3.0 solution

open integration and standard in the front-end

Page 79: semantic and social intraweb for corporate intelligence and watch
Page 80: semantic and social intraweb for corporate intelligence and watch

tow

ards rich

we

bm

arks

Page 81: semantic and social intraweb for corporate intelligence and watch
Page 82: semantic and social intraweb for corporate intelligence and watch
Page 83: semantic and social intraweb for corporate intelligence and watch
Page 84: semantic and social intraweb for corporate intelligence and watch

navigating in the expert network

Page 85: semantic and social intraweb for corporate intelligence and watch

activity flow and notification

Page 86: semantic and social intraweb for corporate intelligence and watch

export to Gephi for visualization & analyze

Page 87: semantic and social intraweb for corporate intelligence and watch

web-scraping: archiving and integrating

Page 88: semantic and social intraweb for corporate intelligence and watch

Fresnel lenses to adapt results

Page 89: semantic and social intraweb for corporate intelligence and watch

create dynamic reports in the wiki

Page 90: semantic and social intraweb for corporate intelligence and watch

going mobile

Page 91: semantic and social intraweb for corporate intelligence and watch

to know more deployment & test campaign (4… 20… +) .

deliverables and publications http://isicil.inria.fr

open source code on INRIA forge https://gforge.inria.fr/projects/isicil/

models http://ns.inria.fr/

Page 92: semantic and social intraweb for corporate intelligence and watch

doggy-bag of the talk

Page 93: semantic and social intraweb for corporate intelligence and watch

social web 2.0 epistemology semantic web

theoretical framework

extensible models

process and interaction

services and interfaces

Page 94: semantic and social intraweb for corporate intelligence and watch

tomorrow, he, who controls the metadata,

controls the web.

@fabien_gandon http://fabien.info

Page 95: semantic and social intraweb for corporate intelligence and watch

What is WWW2012?

21st International World Wide Web Conference

a “A rated” scientific conference ~12% acceptance & 1000-1500 participants

Lyon- France from 16t to 20th April 2012

RESEARCHERS

INDUSTRIALS USERS

www2012.org @www2012Lyon