Learning Patterns on the World Wide Web Andrew Hogue Advisor: David Karger October 17, 2003

Page 1: Learning Patterns on the World Wide Web

Learning Patterns on the World Wide Web

Andrew Hogue

Advisor: David Karger

October 17, 2003

Page 2: Learning Patterns on the World Wide Web

Agenda

What is a pattern? How do we make one? How do we use it? Why do you want one? Demo

Page 3: Learning Patterns on the World Wide Web

What is a pattern?

Objects in the world have certain semantic properties

A pattern is a way of recognizing the semantic properties of an object we’ve seen before

A pattern is a structure with semantic slots to be filled in

Page 4: Learning Patterns on the World Wide Web

Example – Books

Define an object’s semantics (ontology):

Class: Book

Property: Author

Property: Title

Property: Price

Property: Publisher

Property: ISBN

. . .
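The ontology above can be written down as a small record type. This is only a sketch: the slides do not say how the class and its properties are represented, so the `Book` dataclass and the `slot_names` helper are hypothetical names chosen for illustration, and only the five properties listed on the slide are included.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class Book:
    """Hypothetical record mirroring the Book class on the slide."""
    # Every property is optional: a pattern may fill only some slots.
    author: Optional[str] = None
    title: Optional[str] = None
    price: Optional[str] = None
    publisher: Optional[str] = None
    isbn: Optional[str] = None


def slot_names(cls) -> List[str]:
    """List the semantic slots (property names) a class exposes."""
    return [f.name for f in fields(cls)]


if __name__ == "__main__":
    print(slot_names(Book))
```

Keeping the properties as plain optional strings leaves room for a page that shows, say, a title and a price but no ISBN.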

Page 8: Learning Patterns on the World Wide Web

Creating a Pattern

Choose positive examples

Page 11: Learning Patterns on the World Wide Web

Creating a Pattern

Choose positive examples

Find best mapping between examples

Page 13: Learning Patterns on the World Wide Web

Creating a Pattern

Choose positive examples

Find best mapping between examples

Merge mapped elements and assign semantic labels

Page 18: Learning Patterns on the World Wide Web

Creating a Pattern

Choose positive examples

Find best mapping between examples

Merge mapped elements and assign semantic labels

Eliminate unmapped elements
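The four steps can be sketched on a toy input. The slides do not state how the best mapping is computed, so this sketch makes loud assumptions: each example is flattened to a list of (tag, text) pairs rather than a page tree, the mapping is a plain sequence alignment on tag names using Python's `difflib.SequenceMatcher`, and the labels are supplied by hand. The function `build_pattern` and the sample data are hypothetical.

```python
from difflib import SequenceMatcher

# Step 1: two positive examples, flattened to (tag, text) pairs.
# (Assumption: a real page would be a tree, not a flat list.)
example_a = [("b", "Dune"), ("span", "by"), ("i", "Frank Herbert"),
             ("span", "$9.99"), ("img", "")]
example_b = [("b", "Emma"), ("span", "by"), ("i", "Jane Austen"),
             ("span", "$7.50")]


def build_pattern(a, b, labels):
    """Align two examples by tag, merge aligned elements, drop the rest."""
    tags_a = [tag for tag, _ in a]
    tags_b = [tag for tag, _ in b]
    # Step 2: find a mapping between the examples (alignment on tag names).
    matcher = SequenceMatcher(a=tags_a, b=tags_b, autojunk=False)
    pattern = []
    slot_count = 0
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            tag, text_a = a[block.a + k]
            _, text_b = b[block.b + k]
            if text_a == text_b:
                # Step 3a: identical text in both examples stays a literal.
                pattern.append((tag, "literal", text_a))
            else:
                # Step 3b: differing text becomes a slot with a semantic label.
                if slot_count < len(labels):
                    label = labels[slot_count]
                else:
                    label = "slot%d" % slot_count
                pattern.append((tag, "slot", label))
                slot_count += 1
    # Step 4: elements outside every matching block were never appended,
    # so unmapped elements (the lone "img" here) are eliminated.
    return pattern


if __name__ == "__main__":
    for element in build_pattern(example_a, example_b,
                                 ["Title", "Author", "Price"]):
        print(element)
```

Aligning on tag names alone is a deliberate simplification; any mapping that scores structural similarity between the examples could be swapped in at step 2 without changing the merge and elimination steps.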

Page 21: Learning Patterns on the World Wide Web

Matching Patterns

Given a pattern with slots and a page to search:

Look for items on page with same structure

Map pattern slots to page text
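Matching can be illustrated with a deliberately strict sketch. It assumes the page has been flattened to a list of (tag, text) pairs and that a pattern is a list of (tag, kind, value) triples, where kind is "slot" or "literal"; both representations, the function `match_pattern`, and the sample data are hypothetical. The slides do not describe how the actual matcher tolerates variation, so this version accepts only exact structural matches.

```python
# Hypothetical pattern: three labeled slots and one literal element.
pattern = [("b", "slot", "Title"), ("span", "literal", "by"),
           ("i", "slot", "Author"), ("span", "slot", "Price")]

# Hypothetical page, flattened to (tag, text) pairs.
page = [("h1", "Bestsellers"),
        ("b", "Dune"), ("span", "by"), ("i", "Frank Herbert"), ("span", "$9.99"),
        ("b", "Emma"), ("span", "by"), ("i", "Jane Austen"), ("span", "$7.50"),
        ("p", "footer")]


def match_pattern(pattern, page):
    """Return one {label: text} dict per place where the pattern fits."""
    if not pattern:
        return []
    results = []
    width = len(pattern)
    i = 0
    while i + width <= len(page):
        record = {}
        for (p_tag, kind, value), (tag, text) in zip(pattern, page[i:i + width]):
            # Same structure: tags must agree, and literals must match exactly.
            if p_tag != tag or (kind == "literal" and value != text):
                record = None
                break
            if kind == "slot":
                # Map the pattern slot to the text found on the page.
                record[value] = text
        if record is None:
            i += 1          # no fit here, slide the window by one element
        else:
            results.append(record)
            i += width      # skip past the matched item
    return results


if __name__ == "__main__":
    for found in match_pattern(pattern, page):
        print(found)
```

Each returned dictionary is one recognized item, with the page text filed under the semantic label of the slot it filled.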

Page 24: Learning Patterns on the World Wide Web

Applications

Extract search engine results

Extract and email news headlines

Watch sites for updates

Reformat sites for easier reading

Monitor bank account balances

Page 25: Learning Patterns on the World Wide Web

Demo

Page 26: Learning Patterns on the World Wide Web

More Information

http://haystack.lcs.mit.edu

[email protected]