Top Banner
Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT
25

Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Dec 25, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Dependency Network Based Real-time Query Expansion

Jiaqi Zou, Xiaojie WangCenter for Intelligence Science and Technology,

BUPT

Page 2: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Outline

• Introduction– What is RTQE?– Benefits of RTQE– Related Research– Improvements in our work

• Method– Query Intention– Dependency Relation Network– RTQE Method

Page 3: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Outline

• Experiments– Test of operation numbers– Test of expansion success percentage– Test of retrieval performance– Comparison with Bing

• Conclusion

Page 4: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Introduction- What is RTQE?

• RTQE is a kind of query expansion.• RTQE methods expand queries at

the same time when users type queries into the search box.

Page 5: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Introduction- Benefits of RTQE

• RTQE reduces user’s keystrokes and time to perform a query, especially useful for mobile device users.

• RTQE improves the query quality.

Page 6: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Related Research

• Most widely used method: string matching method using query log.

• Little work on RTQE takes query intention into account.– Strohmaier et al. suggested that explicit

queries containing at least one verb word might reflect possible user intentions.

Page 7: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Improvements in our work

• Represent query intention better.• Construct a RTQE method which

expands components of possible user query intentions.

• This RTQE method improves the retrieval performance.

Page 8: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Query Intention

• Task-oriented classification of query intention: – Navigational– Informational– Transactional

• Duan et al. suggested dependency related verb-noun pairs are good representation of informational and transactional query intentions.

Page 9: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Query Intention

• Verb-noun pair is not sufficient to represent query intention, other parts like attributes of noun are also very important.

• New representation:Verb-Attributes-Noun

• Example: buy new car tire, cook Chinese food

Page 10: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Dependency Relation Network

• To do query intention related RTQE, we built a dependency relation network which is a collection of numbers of query intentions.

• Steps:– Do dependency parsing on large corpus.– Extract all the verb-attributes-noun

structures. – Combine these structures to be the Network.

Page 11: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Dependency Relation Network

• Example : How to change a car tire• Extracted : change car tire

Page 12: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

RTQE method

Page 13: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

RTQE Example

Page 14: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Experiments

• Corpus: www.ehow.com – 915,000 articles– 20 categories(Health, Cars,

Food&Drink, etc)

Page 15: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of operation numbers

• Keystrokes and mouse clicks needed to generate a query is recorded. Each keystroke or mouse click is recorded as an operation.

1

1

( )n

x xx

n

xx

OpFull OpExpandedSaved

OpFull

Page 16: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of operation numbers

• Average saved operations is 63.75% after RTQE

Average number of operations

Without RTQE With RTQE

15.0 5.437

Page 17: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of expansion success percentage

• For a given query intention, if the user can find a query exactly related to this intention from the expanded list, we call it a successful expansion.

SuccessNumSuccessPercentage

AllNum

Page 18: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of expansion success percentage

Times

Query expansion success

Query expansion fail

168 32

• Expansion success percentage is 84%.

Page 19: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of retrieval performance

• We compare the retrieval performance of the three: – original query user typed in– the query after verb-noun expansion– the query after verb-attributes-noun

expansion.

• We use precision and nDCG score for evaluation.

Page 20: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of retrieval performance

Query Type Precision nDCG score

Original query word 0.73% 13.11%

Query after verb-noun expansion

9.47% 37.37%

Query after verb-attributes-noun expansion

79.2% 88.95%

Page 21: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Comparison with Bing

• The RTQE result of Bing differs a lot if the word order of a query changes.

Page 22: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Comparison with Bing

• Categories the RTQE result of Bing into 3 groups:– NOT: cannot get correct

recommendations – NORMAL: get correct recommendations

only in normal word order– ALL: can get correct recommendations

both in normal order and other word orders

Page 23: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Comparison with Bing

Group NOT NORMAL ALL

Percentage 49% 33% 18%

Page 24: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Conclusion

• Presented a novel RTQE method using a dependency relation network.

• This RTQE method is proved to be effective in representing user query intention and hence improve retrieval performance.

Page 25: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Thank you!