Top Banner
1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of the 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA’05) Advisor Jia-Ling Koh Speaker Chun-Wei Hsieh
15

1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

Jan 19, 2016

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

1

Maintaining Knowledge-Bases of Navigational Patterns from Streams of

Navigational Sequences

Ajumobi Udechukwu, Ken Barker, Reda Alhajj

Proceedings of the 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA’05)

Advisor : Jia-Ling Koh

Speaker : Chun-Wei Hsieh

Page 2: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

2

Introduction

Navigational patterns: traversal patterns

Two broad techniques for mining

navigational patterns– 1. level-wise, apriori-based techniques– 2. tree-based techniques

Page 3: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

3

Methodology

Sliding window Batch-update strategy

– Batch: the web log in the base time unit

Example

B1

4

B1 B2B1 B2 B3B1 B2 B3 B4B1 B2 B3 B4 B5B1 B2 B3 B4 B5 B6

Page 4: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

4

Adapted GST

Adapted generalized suffix tree Appending a stop symbol to all strings Mining without thresholds

Page 5: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

5

1,12,1

1,22,2

1,32,3

LQ R$$R Q

R$

1,42,43,3

$

3,1

$

3,1

$

1,11,2

1,3

LQR$$RQ

R$

1,4

$

1,12,1

1,22,2

1,32,3

LQR$$RQ

R$

1,42,4

$

Adapted GST

LQR

1,1LQR$

1,11,2LQR$$RQ

1,11,2

1,3

LQR$$RQ

R$

LQR LQ

Page 6: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

6

Adapted GST

Page 7: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

7

The Challenge of Adapted GST

”LQ” occurs in B1 with support count of 4

and “L” occurs independently in B2 with support count of 2

Total count of “L” should be 4 + 2

Page 8: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

8

AC-NAP tree 1

Page 9: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

9

AC-NAP tree 2

Output all node labels and counts to a database

Page 10: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

10

Maintaining patterns within a window

Page 11: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

11

Maintaining patterns within a window

Count total support

Remove out_of_date patterns

Page 12: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

12

Experiments

OS: Microsoft Windows XP professional edition CPU: 2GHz Intel Pentium 4 RAM: 512MB Program language: Java DBMS: MySQL Data: real-world web logs of ”msnbc.com”

Page 13: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

13

Experiments

Page 14: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

14

Experiments

Page 15: 1 Maintaining Knowledge-Bases of Navigational Patterns from Streams of Navigational Sequences Ajumobi Udechukwu, Ken Barker, Reda Alhajj Proceedings of.

15

Experiments