NATURAL LANGUAGE UNDERSTANDING WITH MACHINE LEARNED ANNOTATORS & DEEP LEARNED ONTOLOGIES AT SCALE David Talby Ph.D., MBA, CTO @ Atigeo

Semantic Natural Language Understanding with Machine Learned Annotators and Deep Learned Ontologies at Scale: Spark Summit East talk by David Talby

Jan 22, 2018

Transcript
Page 1: Semantic Natural Language Understanding with Machine Learned Annotators and Deep Learned Ontologies at Scale: Spark Summit East talk by David Talby

NATURAL LANGUAGE UNDERSTANDING WITH MACHINE LEARNED ANNOTATORS & DEEP LEARNED ONTOLOGIES AT SCALE

David Talby, Ph.D., MBA, CTO @ Atigeo

Page 2

The problem

Who needs to be vaccinated?

Who fits this clinical trial?

Who is at risk for sepsis?

Who is getting meds they’re allergic to?

Who on this protocol did not have this side effect?

Page 3

At the beginning, there was search

Scalable & robust indexing pipeline

Tokenizers & analyzers

Synonyms, spellers & auto-suggest

File formats & header boosting

Rankers, link & reputation boosting

Page 4

Then there was semantic search

“cheap red prom dresses”

“laptops under $500”

“italian restaurants near me that deliver”

“captain america civil war tonight”

“nba scores”

Dictionary Based Attribute Extraction

Dell - XPS 15.6 4K Ultra HD Touch-Screen Laptop - Intel Core i5 - 8GB Memory - 256GB Solid State Drive - Silver

Machine Learned Attribute Extraction

If you go for the ambience, you'll be disappointed. If you go for good, inexpensive and authentic Mexican food, then you're in the right place.
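
Dictionary-based attribute extraction, as named on this slide, can be sketched roughly as below. This is only an illustration: the `ATTRIBUTE_DICTIONARY` contents and the `extract_attributes` helper are hypothetical and not taken from the talk.

```python
import re

# Hypothetical attribute dictionaries; a real catalog would hold far more entries.
ATTRIBUTE_DICTIONARY = {
    "brand": ["dell", "hp", "lenovo"],
    "color": ["silver", "black", "red"],
    "storage_type": ["solid state drive", "hard drive"],
    "cpu": ["intel core i5", "intel core i7"],
}


def extract_attributes(title):
    """Return {attribute: matched phrase} for dictionary phrases found in the title."""
    text = title.lower()
    found = {}
    for attribute, phrases in ATTRIBUTE_DICTIONARY.items():
        for phrase in phrases:
            # Word boundaries stop short phrases from matching inside longer words.
            if re.search(r"\b" + re.escape(phrase) + r"\b", text):
                found[attribute] = phrase
                break  # keep the first matching phrase per attribute
    return found


title = ("Dell - XPS 15.6 4K Ultra HD Touch-Screen Laptop - Intel Core i5 - "
         "8GB Memory - 256GB Solid State Drive - Silver")
print(extract_attributes(title))
```

A dictionary like this works on structured product titles, but it has nothing to say about the restaurant review above, which is where a learned extractor comes in.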

Page 5

Then, you need to understand language

Prescribing sick days due to diagnosis of influenza. [Positive]

Jane complains about flu-like symptoms. [Speculative]

Jane may be experiencing some sort of flu episode. [Possible]

Jane’s RIDT came back negative for influenza. [Negative]

Jane is at high risk for flu if she’s not vaccinated. [Conditional]

Jane’s older brother had the flu last month. [Family history]

Jane had a severe case of flu last year. [Patient history]
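
One way to picture the labels above is a cue-phrase classifier. The sketch below is a toy: the `CUE_RULES` list and its ordering are assumptions made up for these seven sentences, not the annotators used in the talk, and real clinical text would break it quickly.

```python
import re

# Hypothetical cue phrases, checked in order; the first matching rule wins.
CUE_RULES = [
    ("Family history", r"\b(brother|sister|mother|father)\b"),
    ("Patient history", r"\blast (year|month)\b"),
    ("Negative", r"\b(negative for|no signs of|denies)\b"),
    ("Conditional", r"\bif\b"),
    ("Possible", r"\b(may|might|possibly)\b"),
    ("Speculative", r"\bcomplains about\b"),
]


def assertion_status(sentence):
    """Return the label of the first cue found, or 'Positive' when no cue matches."""
    for label, pattern in CUE_RULES:
        if re.search(pattern, sentence, flags=re.IGNORECASE):
            return label
    return "Positive"


print(assertion_status("Jane's RIDT came back negative for influenza."))
```

Every sentence on this slide mentions flu, so keyword search alone would return all seven; only the assertion label separates the confirmed case from the rest.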

Page 6

1. Language gets complex and domain specific

Page 7

Human language is wonderfully nuanced

Joe expressed concerns about the risks of bird flu. [Nothing]

Joe shows no signs of stroke, except for numbness. [Double negative]

Nausea, vomiting and ankle swelling negative. [Compound]

(it gets worse – in reality a lot of text isn’t valid English)

Patient denies alcohol abuse. [Speculative]

Allergies: Penicillin, Dust, Sneezing. [Compound]

Page 8

Let’s build this!

The input (patient records)

The processing framework

The output

The query engines

Page 9

SENTENCE DETECTION

SECTION DETECTION

TOKENIZER

LEMMATIZER

STOPWORD REMOVAL

NEGATION DETECTION

CONDITIONAL SCOPE

SPECULATIVE SCOPE

DATE

NUMBER

UNIT

QUANTITY

CONCEPT EXTRACTION
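
The annotators listed on this slide run as stages over the same record. A minimal sketch of that chaining, covering only three of the stages, might look like the following. The function names, the `STOPWORDS` list and the record layout are all assumptions for illustration, not the framework shown in the talk.

```python
import re

STOPWORDS = {"a", "an", "the", "of", "for", "is", "to", "and"}  # toy list, an assumption


def sentence_detection(record):
    # Naive rule: split after '.', '!' or '?' when followed by whitespace.
    parts = re.split(r"(?<=[.!?])\s+", record["text"])
    record["sentences"] = [p.strip() for p in parts if p.strip()]
    return record


def tokenizer(record):
    # One token list per sentence; keeps hyphenated words such as "flu-like" together.
    pattern = r"[A-Za-z0-9]+(?:[-'][A-Za-z0-9]+)*"
    record["tokens"] = [re.findall(pattern, s) for s in record["sentences"]]
    return record


def stopword_removal(record):
    record["content_tokens"] = [
        [t for t in tokens if t.lower() not in STOPWORDS]
        for tokens in record["tokens"]
    ]
    return record


# Order matters: each annotator reads fields written by the ones before it.
PIPELINE = [sentence_detection, tokenizer, stopword_removal]


def run_pipeline(text):
    record = {"text": text}
    for annotator in PIPELINE:
        record = annotator(record)
    return record


result = run_pipeline(
    "Jane had a severe case of flu last year. "
    "Her RIDT came back negative for influenza."
)
print(result["content_tokens"])
```

Later stages such as negation detection or concept extraction would be appended to `PIPELINE` in the same way, each adding its own field to the record.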

Page 10
Page 11
Page 12

2. You’ll need machine learning early

Page 13

Machine learned annotators

Sometimes, it’s easier to just code an annotation’s business logic:

Grammatical patterns: If… then…

Direct inferences: Age < 18 ==> Child

Lookups: RIDT (lab test)

But sometimes it’s easier to learn it from examples:

Under-diagnosed conditions: flu, depression

Implied by context: relevant labs normal
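
The hand-coded kind of annotator is the easier one to illustrate. The sketch below encodes the slide's direct inference (Age < 18 ==> Child) and its lookup example (RIDT as a lab test). The function names and the dictionary layout are hypothetical.

```python
# Lookup table: the RIDT entry comes from the slide; the dict layout is an assumption.
LAB_TEST_LOOKUP = {"RIDT": "lab test"}


def direct_inferences(age):
    """Direct inference coded by hand: Age < 18 ==> Child."""
    return ["Child"] if age < 18 else []


def lookups(tokens):
    """Tag every token that appears in the lookup table."""
    return [(token, LAB_TEST_LOOKUP[token]) for token in tokens if token in LAB_TEST_LOOKUP]


def annotate(age, tokens):
    return {"inferred": direct_inferences(age), "lookups": lookups(tokens)}


print(annotate(12, ["Jane's", "RIDT", "came", "back", "negative"]))
```

Rules like these are cheap to write and easy to audit. Annotations such as an under-diagnosed condition or a fact implied by context have no such one-line rule, which is the case for learning them from labeled examples instead.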

Page 14
Page 15

3. Bootstrap and then expand your vocabulary

Page 16
Page 17

Expanding & updating ontologies

Word2Vec
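
The idea behind using Word2Vec here is that words with nearby vectors are candidates for the same ontology concept. The sketch below shows only that nearest-neighbor step. The three-dimensional `EMBEDDINGS` are invented placeholders (a real system would load vectors from a trained Word2Vec model), and `suggest_terms` with its `threshold` is a hypothetical helper.

```python
import math

# Toy vectors invented for illustration; real ones come from a trained model.
EMBEDDINGS = {
    "flu": [0.9, 0.1, 0.0],
    "influenza": [0.85, 0.15, 0.05],
    "sepsis": [0.1, 0.9, 0.1],
    "penicillin": [0.0, 0.1, 0.95],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def suggest_terms(seed, known_terms, threshold=0.95):
    """Return (word, score) pairs close to the seed that the ontology lacks."""
    seed_vector = EMBEDDINGS[seed]
    candidates = []
    for word, vector in EMBEDDINGS.items():
        if word == seed or word in known_terms:
            continue  # skip the seed itself and terms already in the ontology
        score = cosine(seed_vector, vector)
        if score >= threshold:
            candidates.append((word, score))
    return sorted(candidates, key=lambda pair: pair[1], reverse=True)


suggestions = suggest_terms("flu", known_terms={"flu"})
print([word for word, _ in suggestions])
```

Suggestions produced this way are candidates for review rather than automatic additions, since nearby vectors can also belong to related but distinct concepts.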

Page 18

Let’s build this too!

Page 19
Page 20
Page 21

Summary: How

1. Language gets complex and domain specific

2. You’ll need machine learning early

3. Bootstrap & then expand your vocabulary

Summary: Why

Who needs to be vaccinated?

Who fits this clinical trial?

Who is at risk for sepsis?

Page 22

Thank You!

github.com/atigeo/nlp_demo

@davidtalby