Top Banner
State-of-the-art normalization of RT-qPCR data presented by dr Jo Vandesompele prof, Ghent University CEO, Biogazelle May 9, 2012
41

State-of-the-Art Normalization of RT-qPCR Data

Aug 28, 2014

Download

Health & Medicine

Setting up a qPCR experiment is so simple that it actually becomes dangerous. Without appropriate controls and data normalization, results can be misleading at best. This presentation addresses selection and validation of suitable reference genes as well as the use of the global mean normalization method to obtain accurate data. Also discussed are tools for data generation and analysis.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: State-of-the-Art Normalization of RT-qPCR Data

State-of-the-art normalization of RT-qPCR data

presented by dr Jo Vandesompele prof, Ghent University

CEO, Biogazelle

May 9, 2012

Page 2: State-of-the-Art Normalization of RT-qPCR Data

Biogazelle is a qPCR company

The most powerful, flexible and user-friendly real-time PCR data-analysis software

Easy.Fast.Reliable.

qbasePLUS analysis software

Learn.Experience. Achieve.

Easy.Fast.Reliable.

Biogazelle course qPCR experiment design and data-analysisqPCR courses

qPCR services Convenient.Rigorous.C omprehensive.

Analysing and interpreting gene expression data

data mining service

Page 3: State-of-the-Art Normalization of RT-qPCR Data

full text available - biogazelle > resources > articles

http://www.biogazelle.com

Page 4: State-of-the-Art Normalization of RT-qPCR Data

weekly qPCR tips and tricks via Twitter

https://twitter.com/#!/Biogazelle

Page 5: State-of-the-Art Normalization of RT-qPCR Data

critical elements contributing to successful qPCR results

Derveaux et al., Methods, 2010

“normalization is the single most important factor contributing to (more) accurate qPCR results”

Page 6: State-of-the-Art Normalization of RT-qPCR Data

Why do we need normalization?

n  2 sources of variation in gene expression results n  biological variation (true fold changes) n  experimentally induced variation (noise and bias)

n  purpose of normalization is removal or reduction of the experimental variation n  input quantity: RNA quantity, cDNA synthesis efficiency, … n  (input quality: RNA integrity, RNA purity, …)

Page 7: State-of-the-Art Normalization of RT-qPCR Data

various normalisation strategies

Huggett et al., Genes and Immunity, 2005

Page 8: State-of-the-Art Normalization of RT-qPCR Data

various normalisation strategies

n  sample size or volume n  total RNA n  rRNA genes (e.g. 18S rRNA) n  spike-in molecules n  reference genes (mRNA) (‘housekeeping genes’)

Page 9: State-of-the-Art Normalization of RT-qPCR Data

the problem of using a single non-validated reference gene

GOI 21.0 23.0

T U

ACTB T U

18.0 19.0

GAPDH T U

21.0 19.4

Cq values

normalized relative quantities

T U

GOIACTB

2

1

T U

GOIGAPDH 1

3

6-fold difference

Page 10: State-of-the-Art Normalization of RT-qPCR Data

the geNorm solution to the normalisation problem

n  framework for qPCR gene expression normalisation using the reference gene concept: n  quantified errors related to the use of a single reference gene

(> 3 fold in 25% of the cases; > 6 fold in 10% of the cases) n  developed a robust algorithm for assessment of expression stability of

candidate reference genes n  proposed the geometric mean of multiple reference genes for accurate

normalisation

n  Vandesompele et al., Genome Biology, 2002

Page 11: State-of-the-Art Normalization of RT-qPCR Data

candidate reference genes

n  RT-qPCR analysis of 5 candidate reference genes (belonging to different functional and abundance classes) on 7 normal blood samples

0

1

2

3

4

ACTB

HMBS

HPRT1

TBP

UBC

A B C D E F G

Page 12: State-of-the-Art Normalization of RT-qPCR Data

geNorm expression stability parameter

n  pairwise variation V (between any 2 candidate reference genes) n  gene stability measure M

average pairwise variation V of a given reference gene with all other candidate reference genes

n  iterative procedure of removing the worst reference gene followed by

recalculation of M-values

gene A gene B

sample 1 a1 b1 log2(a1/b1) sample 2 a2 b2 log2(a2/b2) sample 3 a3 b3 log2(a3/b3) … … … … sample n an bn log2(an/bn)

standard deviation = V

Page 13: State-of-the-Art Normalization of RT-qPCR Data

geNorm algorithm

n  ranking of candidate reference genes according to their stability n  determination of how many genes are required for reliable normalization n  http://www.genorm.info

Page 14: State-of-the-Art Normalization of RT-qPCR Data

calculation of the normalization factor

n  geometric mean of 3 reference gene expression levels

n  controls for outliers n  compensates for differences in expression level between the reference

genes

geometric mean = (a x b x c) 1/3

arithmetic mean = a + b + c

3

Page 15: State-of-the-Art Normalization of RT-qPCR Data

n  robust – insensitive to outliers

geNorm validation (I)

NF

ACTB HMBS HPRT1 TBP UBC

Page 16: State-of-the-Art Normalization of RT-qPCR Data

0.003

0.006 0.021 0.023 0.056

NF4

NF1

n  cancer patients survival curve statistically more significant results

geNorm validation (II)

log rank statistics

Hoebeeck et al., Int J Cancer, 2006

Page 17: State-of-the-Art Normalization of RT-qPCR Data

n  mRNA haploinsufficiency measurements accurate assessment of small expression differences

geNorm validation (III)

Hellemans et al., Nature Genetics, 2004

n  patient / control n  3 independent experiments n  95% confidence intervals

Page 18: State-of-the-Art Normalization of RT-qPCR Data

n  geNorm is the de facto standard for reference gene validation and normalization n  > 4,400 citations of our geNorm technology n  > 15,000 geNorm software downloads worldwide

normalization using multiple stable reference genes

Page 19: State-of-the-Art Normalization of RT-qPCR Data

large and active geNorm discussion community

> 1000 members, almost 2000 posts

http://tech.groups.yahoo.com/group/genorm/

Page 20: State-of-the-Art Normalization of RT-qPCR Data

improved geNorm is genormPLUS

classic geNorm

improved geNorm (genormPLUS)

platform Excel Windows

qbasePLUS Win, Mac, Linux

speed 1x 20x

expert interpretation + report - +

ranking best 2 genes - +

handling missing data - +

raw data (Cq) as input - +

>5000 qbasePLUS downloads in past 14 months

Page 21: State-of-the-Art Normalization of RT-qPCR Data

geNorm pilot experiment

n  3 simple steps

1.  generate data on qPCR instrument a recommended pilot experiment contains -  8 candidate reference genes -  10 representative samples -  nicely fits in a single 96-well plate

2.  export Cq values from instrument software and import in qbasePLUS

3.  in qbasePLUS: go to Analyze > geNorm and inspect results

“a couple of hours work to get more accurate results for the rest of your lab life”

Page 22: State-of-the-Art Normalization of RT-qPCR Data

genormPLUS result interpretation

n  expert report without need to understand formulas n  time saver n  higher confidence in the results

Page 23: State-of-the-Art Normalization of RT-qPCR Data

n  differences in reference gene stability ranking between high and low quality RNA (Perez-Novo et al., Biotechniques, 2005)

intermezzo - RNA quality has impact on expression stability

most stable

least stable

low low high high quality

Page 24: State-of-the-Art Normalization of RT-qPCR Data

n  755 microRNAs (OpenArray) n  1718 long non-coding RNAs (SmartChip) n  gene panels (96 or 384)

large scale gene expression studies… require something different

Page 25: State-of-the-Art Normalization of RT-qPCR Data

a new normalization method: global mean normalization

n  hypothesis: when a large set of genes are measured, the average expression level reflects the input amount and could be used for normalization n  microarray normalization (lowess, mean ratio, …) n  RNA-sequencing read counts

n  the set of genes must be sufficiently large and unbiased

n  we test this hypothesis using genome-wide microRNA data from experiments in which Biogazelle quantified a large number of miRNAs in different studies

n  cancer biopsies & serum o  neuroblastoma, T-ALL, EVI1 leukemia, retinoblastoma

n  pool of normal tissues, normal bone marrow set n  induced sputum of smokers vs. non-smokers

Page 26: State-of-the-Art Normalization of RT-qPCR Data

How to validate a new normalization method?

n  geNorm ranking global mean vs. candidate reference genes n  reduction of experimental noise n  balancing of expression differences (up vs. down) n  identification of truly differentially expressed genes

n  original global mean (Mestdagh et al., 2009) n  improved global mean (D’haene et al., 2012)

n  mean center the data > equal weight to each gene n  allow PCR efficiency correction

n  improved global mean on common targets (D’haene et al., 2012) n  improved global mean n  average only genes that are expressed in all samples

Page 27: State-of-the-Art Normalization of RT-qPCR Data

n  lower M-value means better stability

00,20,40,60,8

11,21,41,61,8

expr

essi

on s

tabi

lity

geNorm ranking (T-ALL) (I)

Page 28: State-of-the-Art Normalization of RT-qPCR Data

geNorm ranking (I)

bone marrow pool normal tissues

neuroblastoma leukemia EVI1 overexpression

Page 29: State-of-the-Art Normalization of RT-qPCR Data

n  cumulative noise distribution plot (more to the left is better, less noise) n  global mean methods remove more experimental noise

reduction of experimental variation (neuroblastoma) (II)

Page 30: State-of-the-Art Normalization of RT-qPCR Data

reduction of experimental variation (II)

bone marrow pool normal tissues

T-ALL leukemia EVI1 overexpression

Page 31: State-of-the-Art Normalization of RT-qPCR Data

n  U6 normalization (the only expressed small RNA) induces more noise than not normalizing

n  improved global mean is better than original global mean method

reduction of experimental variation (induced sputum) (II)

Page 32: State-of-the-Art Normalization of RT-qPCR Data

balancing differential expression (III)

n  fold changes in 2 cancer patient subgroups n  global mean normalization results in equal number of downregulated and

upregulated miRs

Page 33: State-of-the-Art Normalization of RT-qPCR Data

better identification of differentially expressed miRs (IV)

n  miR-17-92 expression in 2 subgroups of neuroblastoma (MYCN amplified vs. MYCN normal)

n  global mean enables better appreciation of upregulation

Page 34: State-of-the-Art Normalization of RT-qPCR Data

strategy also works for mRNA data

0

0,1

0,2

0,3

0,4

0,5

0,6

0,7

expr

essi

on s

tabi

lity

n  4 MAQC samples (Canales et al., Nature Biotechnology, 2006) n  201 MAQC consensus genes are measured n  geNorm analysis

n  10 classic reference genes n  global mean of 201 mRNAs

Page 35: State-of-the-Art Normalization of RT-qPCR Data

conclusions

n  novel and powerful (miRNA) normalization strategy n  best ranking according to geNorm n  maximal reduction of experimental noise n  balancing of differential expression n  improved identification of differentially expressed genes

n  Mestdagh et al., Genome Biology, 2009 (original global mean) n  D’haene et al., Methods Mol Biol, 2012 (improved global mean)

Page 36: State-of-the-Art Normalization of RT-qPCR Data

normalization info is part of the MIQE guidelines

Table 1. MIQE checklist for authors, reviewers, and editors.a

Item to check Importance Item to check Importance

Experimental design qPCR oligonucleotides

Definition of experimental and control groups E Primer sequences E

Number within each group E RTPrimerDB identification number D

Assay carried out by the core or investigator’s laboratory? D Probe sequences Dd

Acknowledgment of authors’ contributions D Location and identity of any modifications E

Sample Manufacturer of oligonucleotides D

Description E Purification method D

Volume/mass of sample processed D qPCR protocol

Microdissection or macrodissection E Complete reaction conditions E

Processing procedure E Reaction volume and amount of cDNA/DNA E

If frozen, how and how quickly? E Primer, (probe), Mg2!, and dNTP concentrations E

If fixed, with what and how quickly? E Polymerase identity and concentration E

Sample storage conditions and duration (especially for FFPEb samples) E Buffer/kit identity and manufacturer E

Nucleic acid extraction Exact chemical composition of the buffer D

Procedure and/or instrumentation E Additives (SYBR Green I, DMSO, and so forth) E

Name of kit and details of any modifications E Manufacturer of plates/tubes and catalog number D

Source of additional reagents used D Complete thermocycling parameters E

Details of DNase or RNase treatment E Reaction setup (manual/robotic) D

Contamination assessment (DNA or RNA) E Manufacturer of qPCR instrument E

Nucleic acid quantification E qPCR validation

Instrument and method E Evidence of optimization (from gradients) D

Purity (A260/A280) D Specificity (gel, sequence, melt, or digest) E

Yield D For SYBR Green I, Cq of the NTC E

RNA integrity: method/instrument E Calibration curves with slope and y intercept E

RIN/RQI or Cq of 3" and 5" transcripts E PCR efficiency calculated from slope E

Electrophoresis traces D CIs for PCR efficiency or SE D

Inhibition testing (Cq dilutions, spike, or other) E r2 of calibration curve E

Reverse transcription Linear dynamic range E

Complete reaction conditions E Cq variation at LOD E

Amount of RNA and reaction volume E CIs throughout range D

Priming oligonucleotide (if using GSP) and concentration E Evidence for LOD E

Reverse transcriptase and concentration E If multiplex, efficiency and LOD of each assay E

Temperature and time E Data analysis

Manufacturer of reagents and catalogue numbers D qPCR analysis program (source, version) E

Cqs with and without reverse transcription Dc Method of Cq determination E

Storage conditions of cDNA D Outlier identification and disposition E

qPCR target information Results for NTCs E

Gene symbol E Justification of number and choice of reference genes E

Sequence accession number E Description of normalization method E

Location of amplicon D Number and concordance of biological replicates D

Amplicon length E Number and stage (reverse transcription or qPCR) of technical replicates E

In silico specificity screen (BLAST, and so on) E Repeatability (intraassay variation) E

Pseudogenes, retropseudogenes, or other homologs? D Reproducibility (interassay variation, CV) D

Sequence alignment D Power analysis D

Secondary structure analysis of amplicon D Statistical methods for results significance E

Location of each primer by exon or intron (if applicable) E Software (source, version) E

What splice variants are targeted? E Cq or raw data submission with RDML D

a All essential information (E) must be submitted with the manuscript. Desirable information (D) should be submitted if available. If primers are from RTPrimerDB,information on qPCR target, oligonucleotides, protocols, and validation is available from that source.

b FFPE, formalin-fixed, paraffin-embedded; RIN, RNA integrity number; RQI, RNA quality indicator; GSP, gene-specific priming; dNTP, deoxynucleoside triphosphate.c Assessing the absence of DNA with a no–reverse transcription assay is essential when first extracting RNA. Once the sample has been validated as DNA free,

inclusion of a no–reverse transcription control is desirable but no longer essential.d Disclosure of the probe sequence is highly desirable and strongly encouraged; however, because not all vendors of commercial predesigned assays provide this

information, it cannot be an essential requirement. Use of such assays is discouraged.

MIQE Guidelines for qPCR Special Report

Clinical Chemistry 55:4 (2009) 613

Bustin et al., Clin Chem, 2009

Page 37: State-of-the-Art Normalization of RT-qPCR Data

n  developed by the founders of Biogazelle n  peer-reviewed, widely used and cited

n  Vandesompele et al., Genome Bology, 2002 (multiple) reference genes

n  D’haene et al., Methods Mol Biol, 2012 improved global mean + global mean on common targets

normalisation strategies in qbasePLUS

Page 38: State-of-the-Art Normalization of RT-qPCR Data

summary

n  normalization is the single most important factor that increases the accuracy and resolution of RT-qPCR results

n  through a pilot experiment, the geNorm algorithm can identify suitable reference genes from a set of tested candidate reference genes

n  global mean normalization is a powerful alternative normalization strategy for larger scale gene expression studies

n  qbasePLUS accomodates both normalization approaches

Page 39: State-of-the-Art Normalization of RT-qPCR Data

acknowledgments

n  UGent n  Katleen De Preter n  Pieter Mestdagh n  Joëlle Vermeulen n  Stefaan Derveaux n  Filip Pattyn n  Frank Speleman

n  Biogazelle n  Barbara D’haene n  Jan Hellemans

Page 40: State-of-the-Art Normalization of RT-qPCR Data

Biogazelle’s founders are researchers of the year

… in their dreams.

Page 41: State-of-the-Art Normalization of RT-qPCR Data

State-of-the-Art Normalization of RT-qPCR Data presented by Dr Jo Vandesompele Prof Functional Genomics and Applied Bioinformatics Ghent University May 9, 2012

qPCR application guide Newly revised and updated.

Available in electronic or print format at www.idtdna.com

qbasePLUS

Receive 30% off the list price of a premium qbasePLUS license at

tinyurl.com/cbmcvgu (until end of June)