Assignment 6: Motif Finding, Bio5488, 3/24/17!! Review :)

Source: genetics.wustl.edu/bio5488/files/2017/03/Assignment-6-Review-.pdf

Oct 04, 2020


dariahiddleston
Transcript
Page 1

Assignment 6: Motif Finding
Bio5488
2/24/17 3/24/17!!

Review :)

Page 2

Assignment 6: Motif finding

• Input
  • Promoter sequences
  • PWMs of DNA-binding proteins
• Goal
  • Find putative binding sites in the sequences by scanning the sequences for matches to the PWM
• Output
  • List of the locations and scores of putative binding sites
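The scan described above can be sketched roughly as follows. This is only a minimal illustration, not the assignment's script: the function name `scan_sequence`, the list-of-dicts PWM layout, the additive scoring (sum of one matrix entry per position), and the 0-based, stop-exclusive coordinates are all assumptions. The matrix values are made up for illustration, and the sequence is assumed to contain only uppercase A, C, G, and T.

```python
def scan_sequence(pwm, sequence):
    """Score every window of len(pwm) bases along the forward strand.

    pwm: list of dicts, one per motif position, mapping base -> score
         (a hypothetical layout; the real script may store it differently).
    Returns a list of (start, stop, score) tuples, 0-based, stop exclusive.
    """
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        # Additive scoring: one matrix entry per position in the window.
        score = sum(column[base] for column, base in zip(pwm, window))
        hits.append((start, start + width, score))
    return hits


# Illustrative 0/1 matrix for a 3-base motif (not a real TF matrix).
toy_pwm = [
    {"A": 1, "C": 0, "G": 0, "T": 0},
    {"A": 1, "C": 0, "G": 0, "T": 0},
    {"A": 0, "C": 0, "G": 1, "T": 0},
]

for start, stop, score in scan_sequence(toy_pwm, "CAAGT"):
    print(start, stop, score)
```

Every window gets a score here; deciding which windows count as putative binding sites is a separate step.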

[Figure: a PWM aligned to a putative binding sequence within a promoter]

Page 3

Assignment TODOs

• Determine the highest-affinity binding site for each PWM
  • Calculate by hand or write a script :)
• Comment the existing code
  • Comment the user-defined functions with function docstrings
• Modify the script to scan the reverse complement of the input sequence
• Modify the script to report only hits that have scores above a given threshold
• Scan promoters (n=2) to find putative binding sites for each DNA-binding protein (n=2)
• Answer follow-up questions
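A few of these TODOs can be sketched in a handful of lines. The helper names below (`best_site`, `reverse_complement`, `filter_hits`) are hypothetical and are not taken from the provided script. The sketch assumes an additive PWM stored as a list of base -> score dicts, and it reads "above a given threshold" as strictly greater than, which is an assumption worth checking against the instructions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def best_site(pwm):
    """Return (sequence, score) of the highest-scoring site for an additive PWM.

    With additive scoring, each position can be maximized independently.
    Ties go to the first base in the column's dict order.
    """
    site = "".join(max(column, key=column.get) for column in pwm)
    score = sum(max(column.values()) for column in pwm)
    return site, score


def filter_hits(hits, threshold):
    """Keep (start, stop, score) hits scoring strictly above threshold."""
    return [hit for hit in hits if hit[2] > threshold]


# Illustrative 0/1 matrix (made-up values, not a real TF matrix).
toy_pwm = [
    {"A": 1, "C": 0, "G": 0, "T": 0},
    {"A": 1, "C": 0, "G": 0, "T": 0},
    {"A": 0, "C": 0, "G": 1, "T": 0},
]

print(best_site(toy_pwm))
print(reverse_complement("AAG"))
print(filter_hits([(0, 3, 1), (1, 4, 3)], threshold=2))
```

Computing the best site by hand for the same matrix is a quick way to confirm the script agrees.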

Page 4

TF Scoring Matrix

Page 5

Indexing

• Indexing is somewhat arbitrary; however, it's important to follow conventions:
  • The start position of a feature is smaller than the stop position
  • The coordinates are relative to the forward strand
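One way to honor both conventions when a hit is found on the reverse complement is to convert its position back to forward-strand coordinates. The sketch below assumes 0-based, stop-exclusive coordinates and a hypothetical helper named `to_forward_coords`; the assignment's own indexing (for example 1-based) may differ, so treat the arithmetic as illustrative.

```python
def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def to_forward_coords(rc_start, width, seq_len):
    """Map a hit on the reverse-complemented sequence to the forward strand.

    rc_start: 0-based start of the hit on the reverse complement.
    Returns (start, stop) on the forward strand, with start < stop.
    """
    start = seq_len - (rc_start + width)
    stop = seq_len - rc_start
    return start, stop


seq = "CTTGG"
rc = reverse_complement(seq)          # the minus strand, read 5' to 3'
rc_start = rc.find("AAG")             # where the motif sits on that strand
start, stop = to_forward_coords(rc_start, 3, len(seq))

# The forward-strand slice is the reverse complement of the motif.
print(start, stop, seq[start:stop])
```

Reporting the strand alongside the coordinates keeps the hit unambiguous even though both numbers refer to the forward strand.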

Page 6

Use Toy Data Sets!!!

            Base
Position    A  C  G  T
0           1  0  0  0
1           1  0  0  0
2           0  0  1  0

Look at our examples/instructions so you give us the right answers :)
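A toy matrix like the one on this slide makes it easy to check a script against answers worked out by hand. The sketch below is hedged: the whitespace-separated text layout, `parse_matrix`, and `score_site` are hypothetical and may not match the assignment's file format or function names, and the scoring is assumed to be additive.

```python
TOY_MATRIX = """\
A C G T
1 0 0 0
1 0 0 0
0 0 1 0
"""


def parse_matrix(text):
    """Parse a header line of bases followed by one row per motif position."""
    lines = text.strip().splitlines()
    bases = lines[0].split()
    return [dict(zip(bases, map(int, row.split()))) for row in lines[1:]]


def score_site(pwm, site):
    """Additive score of a site the same length as the PWM."""
    return sum(column[base] for column, base in zip(pwm, site))


pwm = parse_matrix(TOY_MATRIX)

# Hand-computed expectations: only A, A, G at positions 0, 1, 2 score 1.
assert score_site(pwm, "AAG") == 3
assert score_site(pwm, "AAT") == 2
assert score_site(pwm, "CCC") == 0
print("toy checks passed")
```

If any of these checks fails after a change to the script, the toy case is small enough to trace by hand.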