YOU ARE DOWNLOADING DOCUMENT

Please tick the box to continue:

Transcript
Page 1: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Dependency Network Based Real-time Query Expansion

Jiaqi Zou, Xiaojie WangCenter for Intelligence Science and Technology,

BUPT

Page 2: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Outline

• Introduction– What is RTQE?– Benefits of RTQE– Related Research– Improvements in our work

• Method– Query Intention– Dependency Relation Network– RTQE Method

Page 3: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Outline

• Experiments– Test of operation numbers– Test of expansion success percentage– Test of retrieval performance– Comparison with Bing

• Conclusion

Page 4: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Introduction- What is RTQE?

• RTQE is a kind of query expansion.• RTQE methods expand queries at

the same time when users type queries into the search box.

Page 5: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Introduction- Benefits of RTQE

• RTQE reduces user’s keystrokes and time to perform a query, especially useful for mobile device users.

• RTQE improves the query quality.

Page 6: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Related Research

• Most widely used method: string matching method using query log.

• Little work on RTQE takes query intention into account.– Strohmaier et al. suggested that explicit

queries containing at least one verb word might reflect possible user intentions.

Page 7: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Improvements in our work

• Represent query intention better.• Construct a RTQE method which

expands components of possible user query intentions.

• This RTQE method improves the retrieval performance.

Page 8: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Query Intention

• Task-oriented classification of query intention: – Navigational– Informational– Transactional

• Duan et al. suggested dependency related verb-noun pairs are good representation of informational and transactional query intentions.

Page 9: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Query Intention

• Verb-noun pair is not sufficient to represent query intention, other parts like attributes of noun are also very important.

• New representation:Verb-Attributes-Noun

• Example: buy new car tire, cook Chinese food

Page 10: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Dependency Relation Network

• To do query intention related RTQE, we built a dependency relation network which is a collection of numbers of query intentions.

• Steps:– Do dependency parsing on large corpus.– Extract all the verb-attributes-noun

structures. – Combine these structures to be the Network.

Page 11: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Dependency Relation Network

• Example : How to change a car tire• Extracted : change car tire

Page 12: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

RTQE method

Page 13: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

RTQE Example

Page 14: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Experiments

• Corpus: www.ehow.com – 915,000 articles– 20 categories(Health, Cars,

Food&Drink, etc)

Page 15: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of operation numbers

• Keystrokes and mouse clicks needed to generate a query is recorded. Each keystroke or mouse click is recorded as an operation.

1

1

( )n

x xx

n

xx

OpFull OpExpandedSaved

OpFull

Page 16: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of operation numbers

• Average saved operations is 63.75% after RTQE

Average number of operations

Without RTQE With RTQE

15.0 5.437

Page 17: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of expansion success percentage

• For a given query intention, if the user can find a query exactly related to this intention from the expanded list, we call it a successful expansion.

SuccessNumSuccessPercentage

AllNum

Page 18: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of expansion success percentage

Times

Query expansion success

Query expansion fail

168 32

• Expansion success percentage is 84%.

Page 19: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of retrieval performance

• We compare the retrieval performance of the three: – original query user typed in– the query after verb-noun expansion– the query after verb-attributes-noun

expansion.

• We use precision and nDCG score for evaluation.

Page 20: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Test of retrieval performance

Query Type Precision nDCG score

Original query word 0.73% 13.11%

Query after verb-noun expansion

9.47% 37.37%

Query after verb-attributes-noun expansion

79.2% 88.95%

Page 21: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Comparison with Bing

• The RTQE result of Bing differs a lot if the word order of a query changes.

Page 22: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Comparison with Bing

• Categories the RTQE result of Bing into 3 groups:– NOT: cannot get correct

recommendations – NORMAL: get correct recommendations

only in normal word order– ALL: can get correct recommendations

both in normal order and other word orders

Page 23: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Comparison with Bing

Group NOT NORMAL ALL

Percentage 49% 33% 18%

Page 24: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Conclusion

• Presented a novel RTQE method using a dependency relation network.

• This RTQE method is proved to be effective in representing user query intention and hence improve retrieval performance.

Page 25: Dependency Network Based Real-time Query Expansion Jiaqi Zou, Xiaojie Wang Center for Intelligence Science and Technology, BUPT.

Thank you!


Related Documents