DNA Motif and protein domain discovery Presented by: Deeter Neumann Peter St. Andre PDB; zinc finger 224 PDB; human enhancer binding protein

Dec 22, 2015


Janis Scott
Transcript
Page 1:

DNA Motif and protein domain discovery

Presented by:

Deeter Neumann

Peter St. Andre

PDB; zinc finger 224 PDB; human enhancer binding protein

Page 2:

Outline

What are DNA motifs & protein domains?

Their importance and function

motif algorithms

locating domain/motif experimentally

available programs: PFAM & SMART

Taken from wikimedia.org

Page 3:

What are DNA sequence motifs?

“Sequence motifs are short recurring patterns in DNA that are presumed to have biological function.”

D’haeseleer, P. Nature Biotechnology 24, 423-425 (2006).

Image taken from bio.miami.edu

Page 4:

Why are DNA sequence motifs important to know?

Indicates common structural protein domains

Identifies similar function

Other possible biological functions, e.g. transcription factors, mRNA processing

Page 5:

What is the function of DNA-binding domains?

specific and non-specific interactions

permits binding of transcription factor to target gene

sequence-specific recognition

Human Molecular Genetics 3; Strachan & Read

Page 6:

What are protein domains?

Protein sequences and structures that evolve, function, and exist independently from the rest of the protein

They often form functional units, like metal-binding domains

Image of human zinc finger domain

Taken from .ionchannels.org

Page 7:

Why are Protein Domains Important?

Bind to other molecules in the cell

Signal transduction pathways

Genetically engineering novel proteins

Pharmaceutical importance

Page 8:

Algorithmic Approaches for both DNA motifs and protein domain searches

Three general approaches are used:

Enumeration

Deterministic optimization

Probabilistic optimization

Page 9:

Enumeration

Employs the broadest approach

Looks at all possible motifs

Few restrictions are placed on the search

Page 10:

Enumeration, cont.

Key point: Covers all possible sequence motifs with few limitations

Pros: Does not get stuck in a local optimum

Cons: May overlook subtle patterns

Programs like WeederWeb and YMF use this type of algorithm
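
As a rough illustration of the enumerative idea (not the actual WeederWeb or YMF algorithm), the sketch below scores every possible word of a given width by the number of input sequences that contain it, optionally allowing a few mismatches. The function names and the toy sequences are assumptions made for this example.

```python
from itertools import product


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def occurs(word, seq, max_mismatch):
    """True if `word` matches some window of `seq` within `max_mismatch`."""
    w = len(word)
    return any(hamming(word, seq[i:i + w]) <= max_mismatch
               for i in range(len(seq) - w + 1))


def enumerate_motifs(seqs, width, max_mismatch=0):
    """Score all 4**width DNA words by how many sequences contain them."""
    support = {}
    for letters in product("ACGT", repeat=width):
        word = "".join(letters)
        support[word] = sum(occurs(word, s, max_mismatch) for s in seqs)
    # Best-supported words first; ties broken alphabetically.
    return sorted(support.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy input (hypothetical): the word TGACGT is planted in every sequence.
toy_seqs = ["ACCTTGACGTAAGC", "TGACGTCCATAGGA",
            "GGATCCTGACGTTA", "CATTGACGTGGCAA"]
print(enumerate_motifs(toy_seqs, width=6)[0])  # ('TGACGT', 4)
```

Because every word is visited, the search cannot settle on a local optimum, but the cost grows as 4^width, which is why real enumerative tools restrict motif length and mismatch count.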

Page 11:

WeederWeb

Page 12:

WeederWeb Results

Page 13:

Deterministic optimization

Uses an expectation maximization (EM) model and a position weight matrix (PWM)

MEME is one program that uses this approach

What does this mean?
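
One way to read this: start from a guessed position weight matrix, compute how likely each window of each sequence is to be the motif site (E-step), then rebuild the matrix from those weighted windows (M-step), and repeat. The sketch below is a heavily simplified version that assumes exactly one site per sequence and a uniform background. It is not MEME's implementation, and all names in it are hypothetical.

```python
BASES = "ACGT"


def seed_pwm(seed, hit=0.7):
    """Initial PWM: the seed base gets probability `hit`, the rest share the remainder."""
    miss = (1.0 - hit) / 3
    return [{b: (hit if b == s else miss) for b in BASES} for s in seed]


def em_step(seqs, pwm, background=0.25, pseudo=0.1):
    """One EM iteration, assuming one motif site per sequence."""
    w = len(pwm)
    counts = [{b: pseudo for b in BASES} for _ in range(w)]
    for seq in seqs:
        # E-step: likelihood ratio of motif vs. background for each window.
        ratios = []
        for i in range(len(seq) - w + 1):
            r = 1.0
            for j in range(w):
                r *= pwm[j][seq[i + j]] / background
            ratios.append(r)
        total = sum(ratios)
        # Each window contributes to the counts in proportion to its posterior.
        for i, r in enumerate(ratios):
            z = r / total
            for j in range(w):
                counts[j][seq[i + j]] += z
    # M-step: normalize the weighted counts into a new PWM.
    return [{b: col[b] / sum(col.values()) for b in BASES} for col in counts]


def consensus(pwm):
    """Most probable base at each motif position."""
    return "".join(max(col, key=col.get) for col in pwm)


def run_em(seqs, seed, iterations=10):
    pwm = seed_pwm(seed)
    for _ in range(iterations):
        pwm = em_step(seqs, pwm)
    return pwm
```

The procedure is deterministic: the same seed and sequences always give the same matrix, which is also why the result depends on the starting point.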

Page 14:

Deterministic optimization, cont.

Page 15:

Deterministic optimization, cont.

Taken from ws.nbcr.net/app1234127263839/meme.html

Page 16:

Probabilistic optimization

Uses a Gibbs sampling approach

– Randomized implementation of the expectation maximization model

How is this applied?

Page 17:

Probabilistic optimization, cont.

Selects random sites and weights each one against the current motif model

Allows program to add or remove sequences and continuously update motifs
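
The loop described above can be sketched as a basic Gibbs site sampler. This is a minimal sketch that assumes one site per sequence and a uniform background. It is not MotifSampler or any other published program, and the function names and parameters are made up for illustration.

```python
import random

BASES = "ACGT"


def profile(seqs, starts, width, skip, pseudo=1.0):
    """PWM built from the current sites of every sequence except `skip`."""
    cols = [{b: pseudo for b in BASES} for _ in range(width)]
    for n, (seq, s) in enumerate(zip(seqs, starts)):
        if n == skip:
            continue
        for j in range(width):
            cols[j][seq[s + j]] += 1
    return [{b: c[b] / sum(c.values()) for b in BASES} for c in cols]


def gibbs_sample(seqs, width, iterations=200, seed=0):
    """Return one sampled motif start position per sequence."""
    rng = random.Random(seed)
    # Start from random sites.
    starts = [rng.randrange(len(s) - width + 1) for s in seqs]
    for _ in range(iterations):
        # Hold one sequence out and model the motif from the others.
        held = rng.randrange(len(seqs))
        pwm = profile(seqs, starts, width, skip=held)
        seq = seqs[held]
        # Weight every window of the held-out sequence against that model.
        weights = []
        for i in range(len(seq) - width + 1):
            w = 1.0
            for j in range(width):
                w *= pwm[j][seq[i + j]] / 0.25
            weights.append(w)
        # Sample (rather than maximize) the new site in proportion to its weight.
        starts[held] = rng.choices(range(len(weights)), weights=weights)[0]
    return starts
```

Sampling instead of always taking the best window is what lets the method escape poor starting points. It also means different random seeds can give different answers.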

Page 19:

Results

Page 20:

Which one to use?

Recent research showed that enumeration approaches worked very well

Generally accepted that no one approach is the best

Programs that incorporate several approaches work the best

Important to rerun programs

Page 21:

Examples of programs

WeederWeb is a web-based interface with an enumerative approach

YMF is another enumerative program

MEME is an online program that uses a deterministic optimization approach

MotifSampler is a program that combines Gibbs sampling with a third-order Markov model

Page 22:

YMF

Page 23:

YMF results

Page 24:

Measurements used to score sequence motifs

Three main statistics used:

Information content

Log likelihood

MAP score
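
The slide lists these statistics without formulas. As one hedged example, information content is commonly computed as the relative entropy of each motif column against a background base distribution, summed over positions and reported in bits. The sketch below assumes a uniform background by default; the function name and the PWM layout (a list of base-to-probability dictionaries) are choices made for this example only.

```python
import math


def information_content(pwm, background=None):
    """Sum over columns of sum_b p(b) * log2(p(b) / q(b)), in bits."""
    if background is None:
        background = {b: 0.25 for b in "ACGT"}  # assumed uniform background
    total = 0.0
    for col in pwm:
        for base, p in col.items():
            if p > 0:  # a zero-probability base contributes nothing
                total += p * math.log2(p / background[base])
    return total


# A fully conserved column carries 2 bits; a uniform column carries none.
conserved = {"A": 1.0, "C": 0.0, "G": 0.0, "T": 0.0}
uniform = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}
print(information_content([conserved, uniform]))  # 2.0
```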

Page 25:

Other measures of motif quality

Group specificity, or site specificity

• Probability of having a certain number of target sequences with the site in question

Sequence specificity

• Accounts for both the number of sequences with the sites in question and the number of sites per sequence

Positional bias, or uniformity

• Looks at how uniformly the sites in question are distributed with respect to the transcription start sites of the gene

Page 26:

Identification and preliminary characterization of a protein motif related to the zinc finger

Lovering et al. (1993)

Page 27:

What is a zinc finger?

PDB; single zinc finger in solution

autonomously folding domain

structural motif

zinc required for folding and DNA interactions

part of protein that is used to regulate DNA

Page 28:

Classic zinc finger

conserved cysteines and histidines

binds with zinc

Tetrahedral structure

antiparallel two-stranded β-sheet and an α-helix

image from wikipedia

Page 29:

Figure 1A

Lovering et al.

Page 30:

Actual RING1 sequence

MTTPANAQNASKTWELSLYELHRTPQEAIMDGTEIAVSPRSLHSELMCPICLDMLKNTMTTKECLHRFCSDCIVTALRSGNKECPTCRKKLVSKRSLRPDPNFDALISKIYPSREEYEAHQDRVLIRLSRLHNQQALSSSIEEGLRMQAMHRAQRVRRPIPGSDQTTTMSGGEGEPGEGEGDGEDVSSDSAPDSAPGPAPKRPRGGGAGGSSVGTGGGGTGGVGGGAGSEDSGDRGGTLGGGTLGPPSPPGAPSPPEPGGEIELVFRPHPLLVEKGEYCQTRYVKTTGNATVDHLSKYLALRIALERRQQQEAGEPGGPGGGASDTGGPDGCGGEGGGAGGGDGPEEPALPSLEGVSEKQYTIYIAPGGGAFTTLNGSLTLELVNEKFWKVSRPLELCYAPTKDPK

Page 31:

RING finger

Cys1-Xaa-hydrophobic aa-Cys2-Xaa(9-27)-Cys3-Xaa(1-3)-His-Xaa-hydrophobic aa-Cys4-Xaa(2)-Cys5-hydrophobic aa-Xaa(5-47)-Cys6-Xaa(2)-Cys7
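
A consensus like this translates fairly directly into a regular expression. The sketch below is one possible encoding, not a published pattern: the set of residues counted as "hydrophobic" (A, V, L, I, M, F, W, Y) is an assumption, and the test fragment is copied from the RING1 sequence shown on the previous slide.

```python
import re

H = "[AVLIMFWY]"  # assumed definition of "hydrophobic aa"

RING_PATTERN = re.compile(
    "C." + H + "C"      # Cys1-Xaa-hydrophobic-Cys2
    + ".{9,27}" + "C"   # Xaa(9-27)-Cys3
    + ".{1,3}" + "H"    # Xaa(1-3)-His
    + "." + H + "C"     # Xaa-hydrophobic-Cys4
    + ".{2}" + "C"      # Xaa(2)-Cys5
    + H                 # hydrophobic
    + ".{5,47}" + "C"   # Xaa(5-47)-Cys6
    + ".{2}" + "C"      # Xaa(2)-Cys7
)

# Fragment of the RING1 sequence from the slide above.
ring1_fragment = "SELMCPICLDMLKNTMTTKECLHRFCSDCIVTALRSGNKECPTCRKKLVSKRS"

match = RING_PATTERN.search(ring1_fragment)
print(match.group(0))  # CPICLDMLKNTMTTKECLHRFCSDCIVTALRSGNKECPTC
```

A regular expression only tests the spacing of the conserved residues. Profile-based tools such as the HMM searches in Pfam and SMART also weigh the residues in between.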

Page 32:

Figure 1B

Fig. 1B Lovering et al.

Gene expression similar in variety of cell lines

Page 33:

Figure 2

Lovering et al.

DNA binding

regulation

recombination

repair

Page 34:

RING1 peptide

55 aa synthetic peptide (residues 12-66 in RING1 seq) RING finger

metal binding ---> prefers Zinc

cobalt

cadmium

copper

Page 35:

Figure 3A

Fig. 3A Lovering et al.

___ cobalt

----- zinc

S-Co(II)

Co(II) d-d transitions

Page 36:

Figure 4A

Zinc-dependent binding

Page 37:

RING1 function

1992 No known function (not published until 1993)

2004 Inhibit transactivation of recombination signal binding protein-J (RBP-J) (Hongyan et al.)

Ubiquitin-protein ligases

Page 38:

Pfam database

http://pfam.sanger.ac.uk/

Database that contains a large collection of protein domains and families

Represented as sequence alignments and HMMs

List of key features about protein

New interface that combined other Pfam versions

New updates have made it more user-friendly

Page 39:

Pfam search of RING1

Page 40:

Pfam search

Page 41:

Pfam search results

Page 42:

Pfam search results

Page 43:

Pfam link out

Page 44:

HMM logo of sequence motif

Page 45:

SMART

Multiple sequence alignment of members

>400 domains in >54,000 different proteins

Searches database using HMMs

http://smart.embl-heidelberg.de/

Page 46:

SMART

2 different modes

normal

Swiss-Prot

SP-TrEMBL

Ensembl

genomic

proteomes of sequenced genomes

Page 47:

SMART

Page 48:

SMART

Page 49:

SMART

Page 50:

SMART

Page 51:

SMART

Page 52:

SMART

Page 53:

SMART

Page 54:

More motif madness

Page 55:

PRINTS

Page 56:

PRINTS

Page 57:

PROSITE

Page 58:

PROSITE

Page 59:

Questions?

Page 60:

How primitive is this RING-finger motif? The author only discusses genes containing this motif that come from eukaryotes. Is this motif found in prokaryotes as well?