Top Banner
-TURKUAZ Project- RadeX Tahir Bilal Onur Deniz Soner Kara M. Mert Karadağlı Assistant: Umu Instructor: Me
15

Tiran Software

Dec 31, 2015

Download

Documents

Tiran Software. -TURKUAZ Project- RadeX Tahir Bilal Onur Deniz Soner Kara M. Mert Karadağlı. Assistant: Umut Eroğul Instructor: Meltem T. Yöndem. Outline. Problem Definition Important Aspects Our Approach General Structure Analyzer Component Searcher Component - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Tiran Software

-TURKUAZ Project-

RadeX

Tahir BilalOnur DenizSoner Kara

M. Mert Karadağlı Assistant: Umut Eroğul Instructor: Meltem T. Yöndem

Page 2: Tiran Software

Problem DefinitionImportant AspectsOur ApproachGeneral Structure

Analyzer ComponentSearcher Component

Current StatusPrototypeTool and ResourcesQ/A

Page 3: Tiran Software

Problem Definition

Billions of radiology reportsUnfortunately, they are stored in free-text formatHard to search and retrieveNeed for searchable information

Page 4: Tiran Software

Important AspectsText Mining

NLPInformation ExtractionMorphological AnalysisNamed Entity Recognition

Machine LearningNeural Networks, Decision Trees ...

Page 5: Tiran Software

Our Approach

RadeX, Radiology Data Extractor will enable..

Modular machine learning component

Support for internal/external dictionary connection

Template-based approach for finalizing

Page 6: Tiran Software

General Structure

Page 7: Tiran Software

Analyzer ComponentPreprocess free textLook-up internal and external lexiconsGives semantic to wordsExtracts searchable data

Searcher ComponentSend query strings to databaseRetrieve corresponding information

General Structure (cont.)

Page 8: Tiran Software
Page 9: Tiran Software
Page 10: Tiran Software
Page 11: Tiran Software

Preprocessing.

Connecting and using external sources.

Database implementation.

Applying SVM to unrelated but tagged corpus.

Current Status

Page 12: Tiran Software

Mapping Turkish terms to English translations.

Finding stem of unknown words.

Constructing lexicons.

Features of verbs, adjectives, nouns...

Current Status (cont.)

Page 13: Tiran Software

..decompose reports into sub-parts, sentences and words,

.. analyze words using Zemberek and a stemmer.

.. give semantics to words via internal/external lexicons

.. extract simple information using pre-defined templates

In Prototype we will be able to...

Page 14: Tiran Software

SVM-LightWordNetJWNLTDK / ZarganZemberek,PostgreSQL

Tools & Resources

Page 15: Tiran Software

Any Questions?