Computing meets Language Kevin Duh Dept. of Computer Science & Human Language Technology COE Johns Hopkins University

Mar 22, 2022

Page 1: Computing meets Language

Computing meets Language
Kevin Duh

Dept. of Computer Science & Human Language Technology COE, Johns Hopkins University

Page 2: Computing meets Language

What does a Computer Scientist do?

Computer Science is more than just programming & computers!

Image source: Almonroth, CC BY-SA via Wikimedia Commons https://commons.wikimedia.org/wiki/File:Typing_computer_screen_reflection.jpg

Page 3: Computing meets Language

Computational Thinking

Thinking like a computer scientist means more than being able to program a computer.

It requires thinking at multiple levels of abstraction.

Jeannette Wing (Columbia University), Communications of the ACM, 2006 https://www.cs.cmu.edu/~15110-s13/Wing06-ct.pdf

Image source: World Economic Forum, CC BY-SA via Wikimedia Commons https://en.wikipedia.org/wiki/File:Jeannette_Wing,_Davos_2013.jpg

Page 4: Computing meets Language

Examples of computational thinking at work

• Build a “model”
  • Abstracts the key properties of what you’re studying
  • Allows you to run simulations and predictions

• Examples:
  • Computational Biology
  • Computational Finance
  • Computational Linguistics

Image sources: (1) Probkos13, CC BY-SA via Wikimedia Commons https://commons.wikimedia.org/wiki/File:Punnett_Square.svg (2) Garwood, Sharma, Dunlop, Giribet, CC BY via Wikimedia Commons https://commons.wikimedia.org/wiki/File:Phylogenetic_Analyses_of_Opiliones_2014-A.png

Page 5: Computing meets Language

Modeling language?

• You say you “know English”
  • What exactly is it that you know?
  • How would you write it down? In what notation?

• How do toddlers learn their first language?
• Can we program a computer to understand human language?
  • Exploit large amounts of data & build probabilistic models of language (e.g. via machine learning)

Page 6: Computing meets Language

Example: which phrase is more likely according to data?
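The slide's own example did not survive in this transcript, so here is a minimal sketch of the idea instead. The toy corpus and the two phrases ("strong tea" vs. "powerful tea") are made up for illustration; a real system would count over a very large amount of text and use a proper probabilistic model rather than raw counts.

```python
from collections import Counter

# Hypothetical toy corpus; a real system would use millions of sentences.
corpus = [
    "i drink strong tea every morning",
    "she likes strong tea with milk",
    "he made a strong argument",
    "the computer is powerful",
    "a powerful computer helps research",
]


def bigram_counts(sentences):
    """Count every pair of adjacent words in the corpus."""
    counts = Counter()
    for sentence in sentences:
        words = sentence.split()
        counts.update(zip(words, words[1:]))
    return counts


counts = bigram_counts(corpus)


def phrase_count(phrase):
    """Look up how often a two-word phrase occurs in the corpus."""
    first, second = phrase.split()
    return counts[(first, second)]


strong = phrase_count("strong tea")      # appears in two sentences
powerful = phrase_count("powerful tea")  # never appears
more_likely = "strong tea" if strong > powerful else "powerful tea"
print(more_likely)
```

Both phrases are grammatical English, yet only one shows up in the data. That gap is the kind of signal a data-driven language model can pick up.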

Page 7: Computing meets Language

Outline

1. Introduction: Computer Science → Computational Thinking
2. My field: Computational Linguistics
3. Example research topic: how Google Translate works
4. How to begin CS research as a high schooler

Page 8: Computing meets Language

Computational Linguistics, a.k.a. Natural Language Processing

• We want to study:
  • How to model human language
  • How to program computers to interpret and process human language

• Interdisciplinary field → good if you like both STEM and humanities!
  • Computer Science & Engineering
  • Linguistics, Cognitive Science
  • Statistics, Machine Learning

Page 9: Computing meets Language

My path into this field

Page 10: Computing meets Language

Modeling language at multiple levels

• Sound
• Word
• Sentence

Jerry hit Tom.

Tom was hit by Jerry.

Jerry

Tom

Page 11: Computing meets Language

This isn’t easy! Unlike programming languages, human language can be ambiguous.

Image source: http://walkinthewords.blogspot.com/2010/07/syntax-with-sherlock-sentence-ambiguity.html

Sherlock saw the man using binoculars
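One way to see the ambiguity is to write down two different structures for the same string of words. The sketch below uses nested tuples as a simplified, hypothetical stand-in for parse trees; it is not a real parser or a standard linguistic notation.

```python
# Two hypothetical bracketings of the same sentence, written as nested tuples.
# Reading A: "using binoculars" attaches to the verb (Sherlock has the binoculars).
reading_a = ("Sherlock", ("saw", ("the", "man"), ("using", "binoculars")))
# Reading B: "using binoculars" attaches to the noun phrase (the man has them).
reading_b = ("Sherlock", ("saw", (("the", "man"), ("using", "binoculars"))))


def flatten(tree):
    """Return the words of a nested-tuple tree in left-to-right order."""
    if isinstance(tree, str):
        return [tree]
    words = []
    for child in tree:
        words.extend(flatten(child))
    return words


sentence_a = " ".join(flatten(reading_a))
sentence_b = " ".join(flatten(reading_b))
same_words = sentence_a == sentence_b
different_structure = reading_a != reading_b
print(sentence_a)
print(same_words, different_structure)
```

The two readings flatten to identical word sequences but are different structures, so a program that sees only the words has to choose between them.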

Page 12: Computing meets Language

What applications are possible?

• Currently we don’t yet have a model that really understands language fully, but we have some usable ones

Strong AI vs Weak AI

Page 13: Computing meets Language

Applications: Analyzing online reviews

Page 14: Computing meets Language

Applications: Extracting info from emails

Page 15: Computing meets Language

Applications: Finding answers in long articles (i.e. helping you do homework)

Page 16: Computing meets Language

Applications: Machine translation

世界には6000の言語があります。

There are 6000 languages in the world.

Page 17: Computing meets Language

Outline

1. Introduction: Computer Science → Computational Thinking
2. My field: Computational Linguistics
3. Example research topic: how Google Translate works
4. How to begin CS research as a high schooler

Page 18: Computing meets Language

When I look at an article in Russian, I say: “This is really written in English, but has been coded in some strange symbols.”

Warren Weaver, American scientist (1894-1978)

Image courtesy: Biographical Memoirs of National Academy of Science, vol. 57

Page 19: Computing meets Language

1a)evas dlrow-eht

1b)

2a)dlrow-eht si detcennoc

2b)

3a)hcraeser si tnatropmi

3b)

4a)ew eb-ot-mia tseb ni dlrow-eht

4b)

Your mission: We found 4 sentence pairs from two ancient Martian languages. Figure out which “word” translates to which.

Page 20: Computing meets Language

1a)evas dlrow-eht

1b)

2a)dlrow-eht si detcennoc

2b)

3a)hcraeser si tnatropmi

3b)

4a)ew eb-ot-mia tseb ni dlrow-eht

4b)

Page 21: Computing meets Language

1a)evas dlrow-eht

1b)

2a)dlrow-eht si detcennoc

2b)

3a)hcraeser si tnatropmi

3b)

4a)ew eb-ot-mia tseb ni dlrow-eht

4b)

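[Figure: frequency table for the words "dlrow-eht" (counts 3 and 1) and "si" (counts 2 and 1). The words from the second Martian language were not preserved in this transcript.]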
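The counting step behind this puzzle can be sketched in a few lines. The "b)" sentences are blank in this transcript, so this sketch only counts how many of the "a)" sentences each word appears in. Solving the full puzzle would also need counts of which "b)" words occur in the same sentence pairs; that step is an assumption about the exercise and is not shown here.

```python
from collections import Counter

# The four "a)" sentences from the puzzle, exactly as given.
sentences_a = [
    "evas dlrow-eht",
    "dlrow-eht si detcennoc",
    "hcraeser si tnatropmi",
    "ew eb-ot-mia tseb ni dlrow-eht",
]

# Count in how many sentences each word appears.
frequency = Counter()
for sentence in sentences_a:
    frequency.update(set(sentence.split()))

# Show the two most frequent words with their counts.
for word, count in frequency.most_common(2):
    print(word, count)
```

"dlrow-eht" appears in three sentences and "si" in two, while every other word appears once. Frequent words give the most evidence for guessing a translation, which is why they are a natural place to start.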

Page 22: Computing meets Language

Life in the day of a researcher

1. Think up a new model for language translation
2. Program it
3. Feed the model lots of data
4. Test it. Read other researchers’ papers to get more ideas.
5. Go back to (1) until satisfied, then publish

Page 23: Computing meets Language

Outline

1. Introduction: Computer Science → Computational Thinking
2. My field: Computational Linguistics
3. Example research topic: how Google Translate works
4. How to begin CS research as a high schooler

Page 24: Computing meets Language

Practical suggestions for gaining Computer Science (CS) research experience

• Reality:
  1. CS is not just about programming, but strong programming skill is a must!
  2. There are many research areas related to CS – enough to fit anyone’s interest, but also so many that you might not know what is out there

• Suggested plan:
  1. Improve your programming skills
  2. Contact professors for intern opportunities

Page 25: Computing meets Language

Improving your programming skills

• Pick one programming language and become really good at it
  • e.g. Java, Python, C++, JavaScript

• How to be good?
  • Program a lot.
  • Read other people’s code. Work with a friend, or join others’ GitHub projects
  • Learn about data structures & algorithms. Take Computer Science classes (at school or Coursera, etc.)

• Create a portfolio on GitHub that you can show during applications

Page 26: Computing meets Language

Contacting professors for opportunities

• Write a polite email
  • Be specific about what you are looking for
  • Add a link to your GitHub repo and explain your interest & experience

• Don’t expect a reply
  • Professors get so many emails like this every day from around the world…
  • Professors have changing commitments. A no-go this year doesn’t mean no chance for next year.

• If you’re lucky and get a project:
  • Be proactive in figuring out how you can contribute.
  • Be comfortable working on something when you don’t know all the details.
  • Be independent. Learn when to ask questions and when to self-study.

Page 27: Computing meets Language

Additional comments

• Structured internship programs are also good ways to learn, e.g.
  • Johns Hopkins Applied Physics Lab (APL) ASPIRE program
  • More resources: https://cty.jhu.edu/resources/academic-opportunities/internships/math.html

• If interested in Machine Learning & AI subareas of CS, then math and probability/statistics are also important.

Page 28: Computing meets Language

Summary

1. Introduction: Computer Science → Computational Thinking
2. My field: Computational Linguistics
3. Example research topic: how Google Translate works
4. How to begin CS research as a high schooler

Page 29: Computing meets Language

Questions / Comments?