Text- and Content- based Approaches to Image Retrieval for the ImageCLEF 2009 Medical Retrieval Track Matthew Simpson, Md Mahmudur Rahman, Dina Demner- Fushman, Sameer Antani, George R. Thoma Lister Hill National Center for Biomedical Communications, National Library of Medicine, NIH, Bethesda, MD, USA CLEF 2009
12
Embed
Text- and Content-based Approaches to Image Retrieval for the ImageCLEF 2009 Medical Retrieval Track Matthew Simpson, Md Mahmudur Rahman, Dina Demner-Fushman,
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Text- and Content-based Approaches to Image
Retrieval for the ImageCLEF2009 Medical Retrieval Track
Matthew Simpson, Md Mahmudur Rahman, Dina Demner-Fushman, Sameer Antani, George R. Thoma
Lister Hill National Center for Biomedical Communications, National Library of Medicine, NIH, Bethesda, MD, USA
CLEF 2009
Retrieval tasks and approaches
• ITI project long term goal– Find a way to combine image and text features so
that the whole is greater than the sum of its parts
• Indexing:– Create image documents for ad-hoc image
retrieval– Create surrogate documents for case-based
retrieval– Index using Essie
• term normalization using the SPECIALIST Lexicon• query expansion based on UMLS synonymy• term weighting based on location in the document• Phrase-based search
Text documents
• Image document– Title and caption provided by organizers– Mention extracted from paper– MEDLINE citation (abstract +MeSH)– PICO frame of the caption + image modality
(structured caption summary)
• Surrogate document– MEDLINE citation – caption, mention, and structured caption summary of
each image contained in the article
Text retrieval
• PICO-based structured query and case representation– <topicID>19</topicID> <description>Crohn's disease CT</description>