Top Banner
HYP Progress Update By Zhao Jin
19

HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Jan 01, 2016

Download

Documents

Agatha Owen
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

HYP Progress Update

By Zhao Jin

Page 2: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Outline

• Background

• Progress Update

Page 3: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Background

• Query (Text-based)– The set of keywords to be entered into the

system to retrieve the desired information or resources

– Main category• Traditional IR • Web (ex. Google)• OPAC (ex. LINC)• Video (ex. TRECVID)

Page 4: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Background

• Query Analysis– To analyze the pattern and hidden information

in the queries

– To efficiently classify and support such queries.

Page 5: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Progress update

• Mid-May to Early June– Background reading– Around 30 to 40 papers on various topic– Summarizing of key points in the paper

Page 6: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Progress update

• Mid-June to late-June– Log analysis

• BBC Video Query• NUS OPAC Query

– Background reading on OPAC and TRECVID

Page 7: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Progress update

• July to now– Follow up on two main topics

• Query classification and division on content-based and feature-based keywords (OPAC)

• Identifying ASR-oriented keywords in a video query (TRECVID)

– Background reading on MARC, wordnet and LOC subject heading

Page 8: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Progress update

• Plan for the near future– Refine and experiment with the current ideas

– Log analysis

– Background reading (Textbook & Related paper)

– Preparation for implementation

Page 9: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Q&A?

Page 10: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

End of progress update

• Thank you for your attention!

Page 11: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Two types of keywords

• Content-Based Keyword (CBK)– The keywords that concern what the item is

about– Ex. title, subject heading, etc

• Feature-Based Keyword (FBK)– The keywords that concern the features of the

item.– Ex. author, publisher, genre, medium

Page 12: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Benefits

• Benefits:– Faster retrieval – More precise retrieval– Help in relevance ranking

Page 13: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Possible implementation

• Possible implementation: – term co-occurrence for concept division

– list of special words and machine learning for FBK and CBK division

– wordnet for classification among CBKs

Page 14: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Possible implementation

• Possible implementation: – CL and IL search algorithms for actual

searching with CBKs.

– list of special words and machine learning for classification among FBKs.

– Marc record search algorithms for actual searching with FBKs.

Back

Page 15: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Means to retrieve shots

• Example:– To find shots of “Bill Clinton”

• Face recognition

• Closed-caption

• Automatic Speech Recognition (ASR)

Page 16: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Metrics

• Common VS Special (In reality) – How common in reality is the concept

represented by the keyword.

• Generic VS Specific – How generic is the concept represented by the

keyword.

Page 17: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Metrics

• Concrete VS Abstract – Whether the keyword represented is concrete

or abstract

• Topic frequency (Low VS High) – How often the keyword becomes (closely

related to) a topic.

Page 18: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Metrics

• Formal VS Informal – Whether the keyword is in formal or informal

language

• Written VS spoken – Whether the keyword is in spoken or written

language

Page 19: HYP Progress Update By Zhao Jin. Outline Background Progress Update.

Metrics

• Feature-level VS Content-level – Whether the keyword is about the feature of

the video (ex. camera motion) or the content of the video

Back