Top Banner
Web Services and Application of Multi- Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington 2005
26

Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Jan 02, 2016

Download

Documents

Claud Dalton
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Web Services and Application of Multi-Agent Paradigm for DL

Yueyu Fu & Javed MostafaSchool of Library and Information Science

Indiana University, Bloomington2005

Page 2: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Outline

• Background

• Centralized vs. Distributed Classification

• Multi-agent Classification

• Discussion

Page 3: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Background

• Information overload– MEDLINE database contains over 12 million records

dating back to the mid-1960’s. – Google claims that it can search more than 8 billion

web pages, which is only a small fraction of the whole web.

• Information organization– Document classification

• Document classification is an important operational problem in digital library research.

Page 4: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

What is document classification?

• Document classification --- a process of assigning natural language texts to predefined categories.

– a news article about a basketball game – sport– a patent document about computer chips - technology– a new article about war in Iraq – politics/economic

Page 5: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Why classification is important?

CategorizationClassification

Retrieval

NLP

ClusteringFiltering

Extraction

Routing

Page 6: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Document Classification

• Human/machineManual classificationAutomatic classification

• Organization structureCentralized classificationDistributed classification

Page 7: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Manual Classification

• Traditional approach – Manual classification• Principal schemes:

• Dewey Decimal Classification• Universal Decimal Classification• Library of Congress Classification

Con: heavily rely on domain experts and human judgments

Page 8: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Automatic Classification

• Alternative approach – Automatic classification• Automatically classify texts based on a set of pre-

classified documents using machine learning techniques

• Classifiers built in a centralized and monolithic manner

Pro: automation, efficient, and consistent.

Page 9: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Centralized Classification

Page 10: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Distributed Classification

Page 11: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Centralized vs. Distributed Classification

• Centralized approach – Classify the documents independently using a

centralized and monolithic classification program

• Distributed approach– Allows multiple classification programs to

work together to classify the documents in a distributed computing environment

Page 12: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Centralized Classification: disadvantages

• Limited by its knowledge

• Limited by its computing power

• Performance bottleneck

• Single point of failure

Page 13: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Distributed Classification: advantages

• More knowledge

• More computing resources

• Reliable --- avoid single point of failure

• Scalable --- dealing with large data set

Page 14: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Multi-Agent Paradigm

• Evolved from distributed artificial intelligence in the late 80’s

• Multi-agent system (MAS) is “a loosely coupled network of problem solvers that work together to solve problems that are beyond their individual capabilities.” (Durfee & Montgomery, 1989)

Page 15: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

MAS: characteristics

• Composed of multiple autonomous components, called agent

• Each agent has incomplete capabilities to solve a problem

• No global system control

• Data is decentralized

• Computation is asynchronous

Page 16: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

MAS: advantages

• Distributes computational resources and capabilities across a network of interconnected agents

• Avoids the “single point of failure” problem • A modular, scalable architecture.• Solutions to problems that can naturally be

regarded as a society of autonomous interacting components-agents.

• Solutions that efficiently use information sources that are spatially distributed.

• Solutions in situations where expertise is distributed.

• Enhances overall system performance.

Page 17: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Multi-Agent Collaboration and Classification of Information (MACCI)

Agent-1 Agent-2

Agent-6 Agent-3

Agent-5 Agent-4

AdminAgent

Page 18: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

MACCI - Experiment

• Data set – RCV1-v2: 800,000 manually categorized

newswire stories from Reuters, Ltd.

• Classification method– Cosine similarity

• Effectiveness measure

callecision

callecisionF

RePr

RePr21

Page 19: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

MACCI - Results

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

F1-micro F1-macro

Centralized-Industry

Centralized-ours

MACCI

Page 20: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Agent Collaboration

• Agents collaborate to help each other.

• Agent communication and interaction are controlled by agent collaboration strategies.

• Collaboration strategies– Random strategy– Good-Neighbor strategy

Page 21: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Multi-Agent Collaboration and Classification of Information (MACCI)

Agent-1 Agent-2

Agent-6 Agent-3

Agent-5 Agent-4

AdminAgent

Page 22: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Random Strategy

1. An agent (A) asks another agent (B) for help randomly when it fails to classify a document.

2. Then agent B tries to classify the document and report the result to agent A.

i. If agent B classifies the document successfully, then this tasks is finished;

ii. If agent B fails to classify, agent A will repeat the steps to ask other agents for help until the document has been classified or all the other agents in the environment has been asked.

Page 23: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Multi-Agent Collaboration and Classification of Information (MACCI)

Agent-1 Agent-2

Agent-6 Agent-3

Agent-5 Agent-4

AdminAgent

Page 24: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Good-Neighbor Strategy1. The administration agent distributed a document from the

document pool to a randomly chosen classification agent. The process continues until the document pool is empty.

2. If an agent successfully classifies a document sent from the administration agent, it sends the document to all the agents in its success list for other potential classification. The help degree is set to 1.

3. If an agent fails to classify a document sent from the administration agent, it sends the document to the four top level parent agents in its failure list for help. The help degree is set to 1.

4. If an agent successfully classifies a document sent from another classification agent and the help degree is smaller than 4, it sends the document to the agents that represent its child classes in its success list. The help degree is incremented by 1.

5. If an agent fails to classify a document sent from another classification agent, it doesn’t take any action.

Page 25: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Multi-Agent Collaboration and Classification of Information (MACCI)

Agent-1 Agent-2

Agent-6 Agent-3

Agent-5 Agent-4

AdminAgent

Page 26: Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.

Conclusion

• The multi-agent approach can successfully achieve the same level of effective for document classification as the centralized approaches do.

• High level of effectiveness can be achieved by adapting carefully designed agent collaboration strategies.