Top Banner
Co-occupancy networks for histone modifications and chromatin associated proteins Martin Vingron MPI for Molecular Genetics Acknowledgements: Ho-Ryun Chung, Rosa Karlic, Julia Lasserre, Juliane Perner
36

Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Jul 08, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Co-occupancy networks for histone modifications and

chromatin associated proteins

Martin Vingron MPI for Molecular Genetics

Acknowledgements: Ho-Ryun Chung, Rosa Karlic, Julia Lasserre, Juliane Perner

Page 2: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Co-occupancy Networks • Can we predict gene expression? Predict from

what? Histone modifications? • Do „things“ occupy DNA together? • Histone modification networks • Partial correlation, Gaussian Graphical Models • Histone modifications plus chromatin modifiers • Compare to: Gene expression networks, e.g.

BNs, p>>n problem

Page 3: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Histone modifications

Picture from: http://chemistry.gsu.edu/faculty/Zheng/research.html

Page 4: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

From: Li, Carey, Workman (2007) Cell 128:707-719

Page 5: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from
Page 6: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

18 Histone acetylations

Gene expression data

All data from a single cell type: human CD4+ T-cells Control: CD4+ goat IgG and CD4+ rabbit IgG (Wang et al., 2009

Page 7: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Histone modifications and transcription level

Histone modification vector Transcription level

promoter transcript

?

Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from Keiji Zhao lab. Results in a data matrix size #promoters x #histone modifications.

This and following slides: Rosa Karlic and Ho-Ryun Chung et al

Page 8: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Model -2000 +2000

Determine the number Ni of tags mapping to ± 2000 base pairs from the TSS

Transform Ni by Xi = log(Ni + αi) αi (pseudocount) is chosen such that the correlation with the expression value is maximal

Standard linear regression Yi = β0 + β1Xi1 + β2Xi2 + … + βpXip + εi, i=1,…,n

Page 9: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Histone modifications and transcription level

Page 10: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Feature Selection

• Identify best three-modification linear models • Find overrepresented modifications

Page 11: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Human Promoter Classes

Mikkelsen et al. (2007) Nature 448, 548

N = 4,183 N = 10,619

Page 12: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Informative modifications stratified by CpG contents

Page 13: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Correlations among modifications

Correlation between different histone modifications in the promoter regions of 14,802 human genes

Page 14: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Look at co-occurrence of TFBSs in a window on the genome

window

Thomas Manke (J. Mol. Biol., 2003)

Page 15: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Alena van Bömmel, BMC Genomics and PhD thesis

Page 16: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Predicted interactions in hematop. stem cells

Hematopoiesis

Page 17: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Correlations among modifications

Correlation between different histone modifications in the promoter regions of 14,802 human genes

Page 18: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

H variables

H v

aria

bles

From correlations to partial correlations (Gaussian Graphical Models)

Data matrix of dimension NxH N: number of genes H: number of variables

For each pair (hi,hj) of variables compute the correlation cij between ri and rj

This and following slides: Julia Lasserre et al

Page 19: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

H variables

N g

enes

vs

vs

residuals

residuals

cor( , )

Partial correlation coefficient for variables i and j: Regress i and j on the remaining variables. Determine the two sets of residuals after explaining i and j. Partial correlation between i and j is defined as the correlation between the two vectors of residuals.

Page 20: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

H variables

H v

aria

bles

N

gen

es

Page 21: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Theorem on how to compute partial correlations: Partial correlation coefficients can be computed from the entries of the inverse of the variance-covariance matrix. Theorem on the meaning: Under the assumption that the variables are multivariate Gaussian, the partial correlation ρXY·Z is zero if and only if X is conditionally independent from Y given Z.

Page 22: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Example

37

X3

X1

X2

X4

1.0 2.0 6.0 3.0

2.0 5.0 15.1 7.6

6.0 15.1 46.3 23.1

3.0 7.6 23.1 12.6

𝑪𝑪𝑪𝑪𝑪𝑪 𝑿𝑿 = 𝚺𝚺 𝑋𝑋1 𝑋𝑋2 𝑋𝑋3 𝑋𝑋4

𝑋𝑋1

𝑋𝑋2

𝑋𝑋3

𝑋𝑋4

𝑥𝑥1 𝑥𝑥2 𝑥𝑥3 𝑥𝑥4

5.0 -2.0 0.0 0.0

-2.0 10 -3.0 0.0

0.0 -3.0 1.2 -0.5

0.0 0.0 -0.5 1.0

𝑪𝑪𝑪𝑪𝑪𝑪 𝑿𝑿 −𝟏𝟏 = 𝚺𝚺−𝟏𝟏

Σ𝑖𝑖𝑖𝑖 = 0 ↔

Independence

Σ𝑖𝑖𝑖𝑖−1 = 0 ↔

Conditional independence

3 1 2 4

3

1

2 4

𝑋𝑋1 ~ 𝑁𝑁 0,1 𝑋𝑋2 ~ 𝑁𝑁(2𝑋𝑋1 + 1, 1) 𝑋𝑋3 ~ 𝑁𝑁(3𝑋𝑋2 − 0.5, 1) 𝑋𝑋4 ~ 𝑁𝑁(0.5𝑋𝑋3,1)

𝑋𝑋1 𝑋𝑋2 𝑋𝑋3 𝑋𝑋4

𝑋𝑋1

𝑋𝑋2

𝑋𝑋3

𝑋𝑋4

𝑿𝑿

Slide by Juliane Perner

Page 23: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

CD4+ network

Page 24: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Odds and Ends

- We use rank-sorted data (corresponding to a rank correlation coefficient)

- There is huge number of entries in the vectors for which we compute partial correlation coefficients. Therefore we get tiny p-values.

- Remedy: Choose a p-value cutoff and resample from the promoters.

- Accept edges with more than, say, 70% bootstrap support.

Page 25: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Enter chromatin modifiers ….

Combinatorial patterning of chromatin regulators uncovered by genome-wide location analysis in human cells. Oren Ram, Alon Goren, Ido Amit, Noam Shoresh, Nir Yosef, Jason Ernst, Manolis Kellis, Melissa Gymrek, Robbyn Issner, Michael Coyne, Timothy Durham, Xiaolan Zhang, Julie Donaghey, Charles B. Epstein, Aviv Regev, Bradley E. Bernstein Cell, Vol. 147, No. 7. (23 December 2011), pp. 1628-1639

This and following slides: Juliane Perner et al

Page 26: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Correlations among Histone Modifications plus Chromatin Modifiers

Page 27: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Partial Correlations among Histone Modifications plus Chromatin Modifiers

Page 28: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

More network construction … • We modeled expression from HMs, with

subsequent feature selection:

HM2

HM3

HM1

HM5

HM4

HM6

expression

Page 29: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

More network construction: HM-> CM

• Why not model a HMs from CMs?

CM2

CM3

CM1

CM5

CM4

CM6

HM1

Page 30: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

More network construction: HM-> CM

• Why not model all HMs from CMs?

CM2

CM3

CM1

CM5

CM4

CM6

HM1

HM2

Page 31: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

More network construction: sparse linear regression, elastic net

• Sparse regression replaces feature selection

CM2

CM3

CM1

CM5

CM4

CM6

HM1

HM2

Page 32: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Sparse linear model (elastic net) explaining Histone Modifications from Chromatin Modifiers

Page 33: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Chromatin-signalling network.

Perner J et al. Nucl. Acids Res. 2014;42:13689-13695

© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

aka LSD1=lysine specific histone demethylase, demethylates mono- and di-methylated H3K4

Page 34: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Verification of two predicted interactions links H4K20me1 to Polycomb-mediated repression.

Perner J et al. Nucl. Acids Res. 2014;42:13689-13695

© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Page 35: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

#nodes >> #conditions? Not with histone modifications!

all p

rom

oter

s

Histone marks

all genes

cond

ition

s

Page 36: Co-occupancy networks for histone modifications and ......transcript ? Focus on - Sum of tags in promoter - For many histone modifications - in one and the same cell line. Data from

Acknowledgements Ho-Ryun Chung, Rosa Karlic … Linear models, HMs Julia Lasserre .... Gaussian Graphicial Models Juliane Perner … HM-CM networks Sarah Kinkley … validation experiments