Top Banner
Graduate School of Informatics Graduate School of Informatics Kyoto University, November 21, 2001 Kyoto University, November 21, 2001 Technologies of the Technologies of the Interspace Interspace Peer-Peer Semantic Indexing Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory Graduate School of Library and Information Science University of Illinois at Urbana-Champaign www.canis.uiuc.edu, [email protected]
22

Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Jan 29, 2016

Download

Documents

Augustus Lang
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Graduate School of InformaticsGraduate School of InformaticsKyoto University, November 21, 2001Kyoto University, November 21, 2001

Technologies of the InterspaceTechnologies of the Interspace Peer-Peer Semantic IndexingPeer-Peer Semantic Indexing

Bruce SchatzCANIS Laboratory

Graduate School of Library and Information ScienceUniversity of Illinois at Urbana-Champaign

www.canis.uiuc.edu, [email protected]

Page 2: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

THE THIRD WAVE OF NET EVOLUTIONTHE THIRD WAVE OF NET EVOLUTION

PACKETS

OBJECTS

CONCEPTS

Page 3: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

SCALABLE SEMANTICSSCALABLE SEMANTICS

Automatic indexing Domain-Independent indexing Statistical clustering

Compute Context of

concepts within documents documents within repositories

Page 4: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

CROSS-OVERS IN SEMANTIC INDEXINGCROSS-OVERS IN SEMANTIC INDEXING

Page 5: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

1992 1993 1995 1996 1998

COMPUTING CONCEPTSCOMPUTING CONCEPTS

‘92: 4,000 (molecular biology)

‘93: 40,000 (molecular biology)

‘95: 400,000 (electrical engineering)

‘96: 4,000,000 (engineering)

‘98: 40,000,000 (medicine)

Page 6: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

SIMULATING A NEW WORLDSIMULATING A NEW WORLD Obtain discipline-scale collection

MEDLINE from NLM, 10M bibliographic abstracts human classification: Medical Subject Headings

Partition discipline into Community Repositories 4 core terms per abstract for MeSH classification 32K nodes with core terms (classification tree)

Community is all abstracts classified by core term 40M abstracts containing 280M concepts concept spaces took 2 days on NCSA Origin 2000

Simulating World of Medical Communities 10K repositories with > 1K abstracts (1K w/ > 10K)

Page 7: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

COMMUNITY PROCESSINGCOMMUNITY PROCESSING

Page 8: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Existing TechnologiesExisting Technologies Extracting Concepts (AI)

Canonical noun phrases Generic statistical parser

Computing Context (IR) Co-occurrence frequency, in collection Useful interactively, not strict ordering

Page 9: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

CONCEPT NAVIGATIONCONCEPT NAVIGATION

Semantic Indexes for Community Repositories

Navigating Abstractions within Repository concept space category map

Interactive browsing by Community experts

Page 10: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.
Page 11: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Category MapCategory Map

Page 12: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Category Navigation

Category Navigation

Page 13: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Concept NavigationConcept Navigation

Page 14: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

CONCEPT SWITCHINGCONCEPT SWITCHING

“Concept” versus “Term” set of “semantically” equivalent terms

Concept switching region to region (set to set) match

term

Semantic region

Concept SpaceConcept Space

Page 15: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Medicine SessionMedicine Session

Page 16: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Categories and ConceptsCategories and Concepts

Page 17: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Concept SwitchingConcept Switching

Page 18: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Document RetrievalDocument Retrieval

Page 19: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Future TechnologiesFuture Technologies Concept Switching

Spreading activation, similarity clusters

Path Matching Aggregating indexes, many repositories

Dynamic Indexing On-the-fly collections, during session

Page 20: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Peer-Peer ComputationsPeer-Peer Computations Local Interaction

Your PC does small computations e.g. screensaver for SETI

Global Merging Partition computation into small parts Each local forms part of global whole

Large-Scale Distribution 3M users of SETI@Home Public Health. www.intel.com/cure

Page 21: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

THE NET OF THE 21st CENTURYTHE NET OF THE 21st CENTURY

Beyond Objects to Concepts Beyond Search to Analysis Problem Solving via Cross-Correlating

Multimedia Information across the Net

Every community has its own special library Every community does semantic indexing

Page 22: Graduate School of Informatics Kyoto University, November 21, 2001 Technologies of the Interspace Peer-Peer Semantic Indexing Bruce Schatz CANIS Laboratory.

Zen of Information RetrievalZen of Information Retrieval Searching without Searching

Navigate concepts into documents Based on interactive recognition

Indexing without Indexing Compute context on dynamic collections Based on distributed extraction

Sharing without Sharing Record paths during user sessions Based on community practices