Top Banner
Medical Named Entity Recognition on Twitter Data Group:56 Chinmay Bapna Juhi Tandon Kalpish Singhal 201302182 201225032 201505513
25

Ire presentation team56

Jan 18, 2017

Download

Engineering

Nitish Jain
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Ire presentation team56

Medical Named Entity

Recognition on Twitter Data

Group:56Chinmay Bapna Juhi Tandon Kalpish Singhal201302182 201225032 201505513

Page 2: Ire presentation team56

Page 3: Ire presentation team56

Page 4: Ire presentation team56

● Medical Entity Recognition is a crucial step towards efficient medical texts analysis. The task of a Medical Name Entity Recognizer is two fold. -

■ Identification of entity boundaries in the sentences

■ Entity categorization.

Our objective is to extend medical entity recognition for tweets.

● Medical Entities

a. Diseases

b. Drugs

c. Symptoms

Page 5: Ire presentation team56

●●●●●

Page 6: Ire presentation team56
Page 7: Ire presentation team56

1. Each tweet is stored in a separate file along with a corresponding annotation

file with manually labelled entities

2. Next comes the word-context feature, along with the term, a 2-term context

window, preceding and succeeding the concerned term were added as

another feature

3. Metamap tagging: For each tweet, metamap tags are extracted

4. POS tagging: For each tweet, part of speech tags are extracted

5. Orthographic tagging: For each term, orthographic features are identified and

labelled

Page 8: Ire presentation team56
Page 9: Ire presentation team56
Page 10: Ire presentation team56
Page 11: Ire presentation team56
Page 12: Ire presentation team56

●●●●●

Page 13: Ire presentation team56

Page 14: Ire presentation team56
Page 15: Ire presentation team56
Page 16: Ire presentation team56

Page 17: Ire presentation team56
Page 18: Ire presentation team56
Page 19: Ire presentation team56
Page 20: Ire presentation team56

Page 21: Ire presentation team56

❖ The dataset provided to us consists of 85% of ‘None’ entity tags.

❖ Thus with any random designation of tags too we will have 80% or more accuracy

❖ This conveys that accuracy is not the correct metric for correct evaluation of feature models.

❖ Since this metric isn’t good-enough to tell us significance of one feature model over another,

❖ we tried following metrics:

1. Precision,

2. Recall

3. F-Score

❖ In this project we experimented with different feature sets and analyzed their significance based on intuition and

prior knowledge .

❖ Thereafter we evaluated their efficiency in Medical-NER.

Page 22: Ire presentation team56

Page 25: Ire presentation team56