Top Banner
A Bayesian Sta,s,cal Approach to Modeling Gene Regulatory Pathways in Human Placental Data Elinor Velasquez Dept. of Biology San Francisco State University
39
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: MSDefenseTalk2008

A"Bayesian"Sta,s,cal"Approach"to"Modeling"Gene"Regulatory"Pathways"in"

Human"Placental"Data"

Elinor"Velasquez"Dept."of"Biology"

San"Francisco"State"University"

Page 2: MSDefenseTalk2008

Outline"of"talk"

•  Introduc,on"•  The"experimental"approach"

•  Results"from"experiments"

•  Discussion"•  Conclusions"and"future"work"•  Acknowledgements"

Page 3: MSDefenseTalk2008

Introduc,on"

Page 4: MSDefenseTalk2008

Overall"Goal"

hMp://www.biotechnologycenter.org/hio/assets/hisimages/placenta/placenta44.jpg"

To use a bioinformatics model for which to better understand the human placenta

Page 5: MSDefenseTalk2008

The"human"placenta"

http://www.uchsc.edu/winnlab/index.html

Page 6: MSDefenseTalk2008

EGFR"pathway"

•  "EGFR,"cell"surface"receptor"for"epidermal"growth"factors"

•  "Poten,ally"important"gene"for"the"placenta"

British Journal of Cancer (2006) 94, 184 – 188

Page 7: MSDefenseTalk2008

EGFR"regulates"gene"expression""

EGFR

ANGPT2 CSPG2 DCN

Page 8: MSDefenseTalk2008

Causal"rela,onships""

EGFR

ANGPT2 CSPG2 DCN

Page 9: MSDefenseTalk2008

Example"of"a"gene"regulatory"network"

Gene""1"

Gene"2"

Gene""3"

Gene""5"

Gene""6"Gene"

"4"

Page 10: MSDefenseTalk2008

Defini,on"of"a"Bayesian"network"

•  There"exist"nodes"(disks)"

•  There"are"edges"(arrows)"between"some"of"the"nodes"

•  Causality"is"implied"by"the"edges"

•  Acyclic"

Gene""1"

Gene"2"

Gene""3"

Gene""5"

Gene""6"Gene"

"4"

Page 11: MSDefenseTalk2008

The"experimental"approach"

Page 12: MSDefenseTalk2008

What"is"a"probe"set?"

•  Several"oligonucleo,des"designed"to"hybridize"to"various"parts"of"the"mRNA"generated"from"a"single"gene""

Probe set

mRNA

gene

Page 13: MSDefenseTalk2008

Data"collected"from"microarrays"

•  Data"comes"from"experiments"conducted"by"Virginia"Winn"et"al."at"the"SJ"Fisher"lab,"UCSF"

•  Gene"expression"profiling"experiments"

45000 dots (25-mer oligo probe sets)

representing the human genome

cRNA

hybridization

Page 14: MSDefenseTalk2008

Microarray"data"

•  The"normalized"log"2"intensity"values"were"centered"to"the"median"value"of"each"probe"set,"by"Winn"et"al."

•  Red"denotes"the"up"regulated"expression"and"green"denotes"the"down"regulated"expression"rela,ve"to"the"median"value"

Genes differentially expressed in the basal plate of placentas: Rows contain data from a single basal plate cRNA sample and columns correspond to a single probeset.

http://www.uchsc.edu/winnlab/index.html

Page 15: MSDefenseTalk2008

Summary"of"data"used"in"bioinforma,cs"experiments"

•  36"placentas"•  509"probe"sets"(418"genes)"

•  Probe"sets"chosen"according"to"pcvalue"(p"<"0.05)"

•  Timecseries"data"from"14c16"weeks"to"term"

Average expression value

Page 16: MSDefenseTalk2008

Outline"of"bioinforma,cs"experimental"design"

•  Create"a"naïve"Bayesian"network"using"the"probe"set"data"•  Score"that"network"•  Randomly"add/delete"an"edge"and"rescore"the"Bayesian

"network"•  Con,nue"un,l"best"score"reached"•  Combine"probe"sets"to"create"the"gene"regulatory"pathways"for

"the"placenta"

PS#1#

PS##2# PS#

3#

PS#4#

Page 17: MSDefenseTalk2008

Naïve"Bayesian"network"

•  Choose"a"root"node"•  All"other"nodes"branch"off"of"the"root"node"

•  PS1"is"the"parent"node"

PS#1#

PS##2# PS#

3#

PS#4#

Page 18: MSDefenseTalk2008

Step"1:"Create"a"naïve"Bayesian"network"

•  Use"data"from"one",me"segment"•  Choose"Weeks"23c24"data"(6"placentas)"

PS1

PS2 PS3 PS4

Page 19: MSDefenseTalk2008

Placenta"data"for"Weeks"23c24"

201984 corresponds to EGFR

236034, 211148, 205572 correspond to ANGPT2

204620 corresponds to CSPG2

209335 corresponds to DCN

Page 20: MSDefenseTalk2008

Step"2:"Score"the"naïve"Bayesian"network"

•  We"want"to"score"this"network:"

201984

236064 204620 211148

Page 21: MSDefenseTalk2008

The"network"score"is"a"func,on"of"condi,onal"probabili,es"

•  Probability"is"defined"to"be"the"chance"that"something"will"happen"

•  Condi,onal"probability,"""""Prob(N"|"Pa(N)),"is"the"probability"of"child"node"N"given"parent"of"N"

•  Example:"Given"a"parent"201984’s"node"has"an"associated"expression"value"10,"what"is"the"probability"that"its"child"node,"204620,"has"an"expression"value"of"8?"

201984

204620

Page 22: MSDefenseTalk2008

Condi,onal"probability"

•  EGFR"(201984)"is"the"parent"node"and""""has"value"10.""•  CSPG2"(204620)"is"the"child"node"and"has""""value"8"two",mes"

•  Condi,onal"probability"="2/6"

201984

204620

Page 23: MSDefenseTalk2008

Score"for"naïve"network"

•  The"score"of"the"naive"network"equals"the"product"of"all"the"nonzero"condi,onal"probabili,es"associated"with"the"network:"

•  P(N1,"N2,"N3,"N4)"""=""Π"P(Ni"|"pa(Ni))"

•  P(N1,"N2,"N3,"N4)"""="?"201984

204620

i=1

4

236064 211148

Page 24: MSDefenseTalk2008

Step"3:"Delete/Add"an"edge"

•  The"score"becomes"?."•  Since"the"score"is"a"probability,"we"want"the"score"to"be"high."

•  The"previous"network"is"the"beMer"choice."

201984

236064

204620

211148

Page 25: MSDefenseTalk2008

Final"Network"for"four"probe"sets/three"genes"

EGFR

ANGPT2 CSPG2

Page 26: MSDefenseTalk2008

How"do"we"create"all"possible"networks?"

•  1"probe"set"1"Bayesian"network"•  2"probe"sets"2"possible"Bayesian"networks"•  3"probe"sets"12"possible"Bayesian"networks"•  4"probe"sets"144"possible"Bayesian"networks"•  5"probe"sets">"4800"possible"Bayesian"networks!"•  6"probe"sets"…"??"•  And"so"on…"

Page 27: MSDefenseTalk2008

Welcome"to"“Modern"Heuris,cs”"•  Step"1."Representa,on"of"a"model"•  Step"2."The"evalua,on"func,on"•  Step"3."Defining"the"search"problem"

•  Step"4."Neighborhoods"and"local"op,ma"score

local

change

Page 28: MSDefenseTalk2008

Step"1:"Representa,on"of"the"model""

•  The"model"is"a"gene"regulatory"pathway."

•  The"number"of"possible"pathways"is"so"large"as"to"forbid"an"exhaus,ve"search"for"the"best"answer."

•  It’s"complicated"so"we"have"to"simplify."

•  We"are"going"to"assume"a"Bayesian"model"for"our"probe"set:""

PS#1#

PS##2# PS#

3#

PS#4#

Page 29: MSDefenseTalk2008

Step"2."The"evalua,on"func,on"

•  Introduce"the"idea"of"randomness."

•  Choose"100"networks"randomly."Each"node"is"randomly"chosen"with"probability"p.#

•  Simplify:#Assume"the"network"has"a"Dirichlet"distribu,on"which"is"similar"to"a"mul,nomial"distribu,on.#

www.wikipedia.com

Page 30: MSDefenseTalk2008

The"new"scoring"func,on"

•  Probability"of"a"fixed"network"given"the"specified"database"equals"product"of"condi,onal"probabili,es",mes"the"Dirichlet"distribu,on:"

Page 31: MSDefenseTalk2008

Local"Search"for"Possible"Network"

•  HillcClimbing"is"a"tradi,onal"method"for"search"techniques"

•  Can"get"caught"on"local"maxima"

•  K2"is"a"type"of"hillcclimbing"algorithm"

From http://content.answers.com/

score

local

change

Page 32: MSDefenseTalk2008

MRC2

ATP5E

ERG

PECAM1 IL2RB CECAM1 CYP19A1

USP6NL

EGFR

ADAM9

GLB1

CCNG2

RAP2B

P4HA1

BAMBI

INHBA

CSPG2

DCN COL3A1

COL1A2

SPP1

ANGPT2

SFRP1

Time Segment:

Week 14-16 weeks

Page 33: MSDefenseTalk2008

Results"for"the"Human"Placenta"

Page 34: MSDefenseTalk2008

Gene"egfr"regulated"by:"

Page 35: MSDefenseTalk2008

Gene"Regulatory"Network"for"Gene"egfr#

CCNG2

PLAU CSPG2

DCN

ADAM9

PECAM1 INHBA

COL1A2

Page 36: MSDefenseTalk2008

Sorware"

•  Weka"sorware"system"from"the"University"of"Waikato,"New"Zealand"

•  hMp://www.cs.waikato.ac.nz/~ml/index.html"

Page 37: MSDefenseTalk2008

Future"Direc,ons"

•  A"threecdimensional"viewer"with"numerical"values"will"be"implemented"to"use"with"the"Weka"sorware""

•  Use"molecular"gene,cs"techniques"to"validate"a"por,on"of"the"results"

•  Design"a"gene,c"programming"algorithm"(evolu,onary"algorithm)"to"create"a"Bayesian"network"

Page 38: MSDefenseTalk2008

Acknowledgements"

•  Le,cia"MárquezcMagaña"and"laboratory"members"(San"Francisco"State"University)"

•  Susan"J."Fisher""(U.C."San"Francisco)"•  MaMhew"Gormley"(U.C."San"Francisco)"

•  M.B.R.S.cR.I.S.E."Grant""5"c"R25cGM59298"

Page 39: MSDefenseTalk2008