eScholarship provides open access, scholarly publishing services to the University of California and delivers a dynamic research platform to scholars worldwide. Center for Bioinformatics and Molecular Biostatistics UC San Francisco Title: Variants in the CDKN2B and RTEL1 regions are associated with high-grade glioma susceptibility Author: Xiao, Yuanyuan , University of California, San Francisco Publication Date: 07-08-2009 Publication Info: Center for Bioinformatics and Molecular Biostatistics, UC San Francisco Permalink: http://escholarship.org/uc/item/32p785g8 Additional Info: in press Keywords: adult glioma and genome-wide association study
7
Embed
Variants in the CDKN2B and RTEL1 regions are associated with high-grade glioma susceptibility
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
eScholarship provides open access, scholarly publishingservices to the University of California and delivers a dynamicresearch platform to scholars worldwide.
Center for Bioinformatics and MolecularBiostatistics
UC San Francisco
Title:Variants in the CDKN2B and RTEL1 regions are associated with high-grade glioma susceptibility
Author:Xiao, Yuanyuan, University of California, San Francisco
Publication Date:07-08-2009
Publication Info:Center for Bioinformatics and Molecular Biostatistics, UC San Francisco
Variants in the CDKN2B and RTEL1 regions areassociated with high-grade glioma susceptibilityMargaret Wrensch1,2,12, Robert B Jenkins3,12, Jeffrey S Chang4,12, Ru-Fang Yeh4,12, Yuanyuan Xiao4,Paul A Decker5, Karla V Ballman5, Mitchel Berger1, Jan C Buckner6, Susan Chang1, Caterina Giannini3,Chandralekha Halder3, Thomas M Kollmeyer3, Matthew L Kosel5, Daniel H LaChance7, Lucie McCoy1,Brian P O’Neill7, Joe Patoka1, Alexander R Pico8, Michael Prados1, Charles Quesenberry9, Terri Rice1,Amanda L Rynearson3, Ivan Smirnov1, Tarik Tihan10, Joe Wiemels2,4, Ping Yang11,13 & John K Wiencke1,2,13
The causes of glioblastoma and other gliomas remainobscure1,2. To discover new candidate genes influencingglioma susceptibility, we conducted a principal component–adjusted3 genome-wide association study (GWAS) of 275,895autosomal variants among 692 adult high-grade glioma cases(622 from the San Francisco Adult Glioma Study (AGS) and70 from the Cancer Genome Atlas (TCGA))4 and 3,992controls (602 from AGS and 3,390 from Illumina iControlDB(iControls)). For replication, we analyzed the 13 SNPs withP o 10�6 using independent data from 176 high-grade gliomacases and 174 controls from the Mayo Clinic. On 9p21,rs1412829 near CDKN2B had discovery P ¼ 3.4 � 10�8,replication P ¼ 0.0038 and combined P ¼ 1.85 � 10�10.On 20q13.3, rs6010620 intronic to RTEL1 had discoveryP ¼ 1.5 � 10�7, replication P ¼ 0.00035 and combinedP ¼ 3.40 � 10�9. For both SNPs, the direction of associationwas the same in discovery and replication phases.
Subject characteristics, including participation rates for the discoveryGWAS and replication phases, are listed in Supplementary Table 1a,b.The distribution of P values from the principal component–adjustedlogistic regression additive model across the genome for high-gradeglioma cases versus controls (Fig. 1) suggests potentially meaningfulassociations for several SNPs on chromosomes 1, 5, 9, 11 and 20.Supplementary Table 2 summarizes results for the 13 SNPs withP o 10�6 for association with high-grade glioma in discovery dataalong with results from replication data; SNPs with Hardy-WeinbergPo 10�5 in controls or with 45% missing data in any case or controlgroup were excluded. Three of these 13 SNPs (rs1412829 on 9p21, andrs6010620 and rs4809324 intronic to RTEL1 on 20q13.3) had
significant association with high-grade glioma risk in the discoveryphase (principal component analysis P o 1.8 � 10�7), were inde-pendent risk predictors in a multivariable analysis of 13 top hits, andwere replicated in the Mayo Clinic dataset (Table 1). As shown inTable 1 and Supplementary Table 2, the minor allele frequencies forthe three SNPs consistently differed in the same direction betweenhigh-grade glioma cases and controls regardless of data source (AGS,TCGA, iControls or Mayo Clinic). Supplementary Table 3 showsresults from the multivariable model of discovery data that includedall 13 SNPs (four from the 9p21 region, three in RTEL1, plus six othersin other locations). Eight SNPs, including one in the 9p21 region andtwo intronic to RTEL1, remained independently associated with high-grade glioma risk after adjustment for other SNPs. This was expectedgiven the strong linkage disequilibrium (LD) evident for the four 9p21SNPs and two of the three RTEL1 SNPs (Supplementary Table 4).
In discovery data, only the interaction between chromosome 9p21SNP rs1412829 and TERT SNP rs2736100 on chromosome 5 wasstatistically significant with Wald test P ¼ 0.019 (see Supplementary
Figure 1 Distribution of P values from principal component–adjusted logistic
regression additive model across the genome for high-grade glioma cases
versus controls. The 13 SNPs with P o 10�6 are shown in red.
Received 13 March; accepted 1 June; published online 5 July 2009; doi:10.1038/ng.408
1Department of Neurological Surgery, University of California, San Francisco, San Francisco, California, USA. 2Institute of Human Genetics, University of California,San Francisco, San Francisco, California, USA. 3Department of Experimental Pathology, Mayo Clinic, Rochester, Minnesota, USA. 4Department of Epidemiologyand Biostatistics, University of California, San Francisco, San Francisco, California, USA. 5Division of Biostatistics, 6Department of Oncology and 7Department ofNeurology, Mayo Clinic, Rochester, Minnesota, USA. 8Gladstone Institute of Cardiovascular Disease, University of California, San Francisco, San Francisco, California,USA. 9Division of Research, Kaiser Permanente, Oakland, California, USA. 10Department of Pathology, University of California, San Francisco, San Francisco,California, USA. 11Division of Epidemiology, Mayo Clinic, Rochester, Minnesota, USA. 12These authors contributed equally to this work. 13These authors jointly directedthe work. Correspondence should be addressed to M.W. ([email protected]).
Table 5 and its accompanying graph). However, close inspectionrevealed that the interaction probably resulted from an associationof the two SNPs in the controls.P values of SNPs on 9p21 and the LD plot (Fig. 2) show that the top
9p21 SNPs are located in or around CDKN2B. Haplotype analyses(Table 2) showed that a single haplotype for the four 9p21 SNPs wasmore common in cases than controls. Two haplotypes in RTEL1 wereassociated with increased and decreased risk, respectively (Table 2).The Mayo replication data also defined the identical haplotypeassociated with high-grade glioma risk for the four linked 9p21SNPs as identified during the discovery phase (Table 2). In addition,one of the two RTEL1 haplotypes identified in the discovery phase wasalso significantly associated in the replication samples (Table 2).Mantel-Haenszel combined P values for the UCSF and Mayo samplesfor the 13 SNPs are shown in Supplementary Table 6. The UCSFGWAS and Mayo replication suggest that regions of 9p21 (CDKN2B)and 20q13.3 (RTEL1) harbor SNPs associated with high-grade gliomarisk, as discussed further below.
The strongest and most consistent associations in the GWAS werewith a series of four SNPs within noncoding regions of the CDKN2Blocus on 9p21. CDKN2B lies adjacent to the well-known tumorsuppressor gene CDKN2A (encoding p16INK4A and p14ARF) in aregion that is frequently mutated, deleted or hypermethylated in awide variety of tumors, including high-grade glioma. The region iswithin 20 kb of constitutional deletions, including the hemizygousgermline deletion of CDKN2A that has been reported by the Mayogroup and others to be linked to familial melanoma and glioblastomasyndrome (Fig. 2)5. Mice with homozygous deletion of Cdkn2aand/or Cdkn2b are predisposed to develop tumors, including
gliomas6. CDKN2B, like CDKN2A, is a cyclin-dependent kinaseinhibitor which forms a complex with CDK4 or CDK6 and preventsthe activation of the cyclin-D–dependent kinases, thus regulating cellgrowth and cell cycle G1 progression. CDKN2B is frequently inacti-vated in glioma by homozygous deletion along with CDKN2A; 50–70% of primary high-grade gliomas show deletion of this region.Whereas tumor suppressor functions for CDKN2A have been firmlyestablished, only recently has CDKN2B been recognized as an effective‘backup’ for loss of CDKN2A7. In glioblastoma cells, overexpression ofCDKN2B in a CDKN2A-deficient background inhibited cell growth,induced replicative senescence and inhibited telomerase activity8. Incontrast to CDKN2A, CDKN2B is markedly induced by TGF-b. It hasthus been hypothesized that CDKN2B may be engaged under specialcircumstances, whereas CDKN2A plays a more general tumor sup-pressor function in response to DNA damage and hyperproliferativesignals8. TGF-b signaling information is relayed from the cell surfaceto the nucleus via the phosphorylation of SMAD proteins. A recentstudy identified a SMAD-binding region in the CDKN2B promoter; itis of interest that the SNPs associated with glioma in the current studyare in LD with the rs2069418 G4C SNP that lies in the crucialconserved 3¢ box adjacent to the SMAD binding element9. If a SNP inthis region reduces the responsiveness of CDKN2B to TGF-b, it couldallow cancer precursor cell populations to expand, thereby promotinggliomagenesis. It is unknown, however, whether any SNP in the regioncan affect TGF-b or any other cytokine signaling processes.
Although recent studies10–13 have identified chromosome 9p21 asan important region for coronary artery diseases (CAD) and type 2diabetes (T2D), the four glioma-associated chromosome 9p21SNPs are not in LD with SNPs associated with CAD or T2D
OR and 95% CI 1.42 (1.27–1.58) 0.66 (0.57–0.76) 1.60 (1.37–1.87)
Principal component analysis implemented with EIGENSTRAT software. Complete results for 13 top hits with P o 10�6 from UCSF GWAS and Mayo Clinic replication P values areshown in Supplementary Table 2, with Mantel-Haenszel combined results presented in Supplementary Table 6.
2 ADVANCE ONLINE PUBLICATION NATURE GENETICS
LET TERS
(Supplementary Fig. 1). This suggests that separate regions onchromosome 9p21 may contribute to the risk of high-grade gliomaversus CAD and T2D.
Although replication results were not statistically significant, TERT isanother interesting gene identified in our GWAS; it encodes humantelomerase, which is a ribonucleoprotein polymerase that maintainstelomere ends by addition of the telomere repeat TTAGGG. TERTactivity is increased in glioblastoma14,15 and contributes to glioma cellgrowth16. A recent GWAS associated the region containing TERT with
idiopathic pulmonary fibrosis17. AnotherGWAS reported a significant associationbetween TERT SNP rs2736100 and lung can-cer18. On chromosome 20, we found that arelated gene, RTEL1, contains two SNPswithin intron 12 (rs6010620) and intron 17(rs4809324) significantly associated with high-grade glioma. RTEL1 is a DNA helicase critical
for regulation of telomere length in mice, and its loss is associated withshortened telomere length, chromosome breaks and translocations19.
This study is strengthened by the use of principal components3 toadjust for any residual population stratification after using severalquality control measures to assure that only unrelated subjects ofEuropean ancestry were included in the analyses (see Online Methodsfor details). Because glioma is a relatively rare disease, very largematched sets of glioma cases and controls are not currently availablefor GWAS. Consequently, we used a publicly available control group
aPrincipal component–adjusted odds ratios, confidence intervals and P values were estimated using EIGENSTRAT software; SNPs with individual P o 10�6 were included in the haplotype analyses.bRare haplotypes (o5%) were grouped together for these analyses.
chr9: 21800000
MTAP
CDKN2ACDKN2ACDKN2A
ANRILCDKN2B
BC038540
MTAP
7.5 5.0
3.3
1.7
0
5.0
2.5
UC
SF
–lo
g P
Mayo –log P
0
21900000 22000000
Family BFamily A
Other reported SNP disease associationsCoronary diseaseMyocardial infarction
Type 2 diabetes
Deletions in familial melanoma/glioblastoma syndrome (6)
22100000 22200000 22300000 22400000UCSC genes based on RefSeq, UniProt, GenBank, CCDS and comparative genomics
a
b
c
d
Figure 2 Map of the associated 9p21 region
in high-grade glioma. (a) Genes within region.
(b) Location of hemizygous deletion regions
previously linked to familial melanoma/
glioblastoma syndrome (blue)5. Also shown are
SNPs within the region that have been previously
reported to be associated with heart disease and
diabetes risk11. (c) –log P for SNPs within region;
note different scales for UCSF discovery phase
(blue bars, left x axis) and Mayo Clinic replication
phase (red bars, right x axis). P values are from
single point association tests of principal
component–adjusted additive logistic regression
of cases versus controls for 0, 1 or 2 minor
alleles. (d) LD of HapMap SNPs in region.
NATURE GENETICS ADVANCE ONLINE PUBLICATION 3
LET TERS
from Illumina to provide large numbers of controls for the discoveryphase. To minimize the possibility of false positives that might resultfrom using a nonmatched control set, we used carefully matched high-grade glioma cases and controls from the Mayo Clinic for replicationanalyses. In summary, this report shows that high-grade glioma risk isassociated with inherited variation in a region of 9p21 containingCDKN2B and a region of 20q13.3 tagged by two intronic SNPs inRTEL1. That the 9p21 region is frequently deleted in high-gradegliomas lends further biological plausibility to these findings.
METHODSMethods and any associated references are available in the onlineversion of the paper at http://www.nature.com/naturegenetics/.
Note: Supplementary information is available on the Nature Genetics website.
ACKNOWLEDGMENTSWork at University of California, San Francisco (UCSF) has been supported by USNational Institutes of Health grants R01CA52689 and UCSF Brain Tumor SPORE,P50CA097257, as well as by grants from the National Brain Tumor Foundation,the UCSF Lewis Chair in Brain Tumor Research and by donations from familiesand friends of J. Berardi, H. Glaser and E. Olsen. J.S.C. was also supported by afellowship from the National Cancer Institute (grant R25 CA 112355). Work at theMayo Clinic has been supported by the Mayo Clinic Brain Tumor SPORE (NIHP50 CA108961), the Mayo Clinic Comprehensive Cancer Center (P30 CA15083)and the Bernie and Edith Waterman Foundation. The San Francisco Adult GliomaStudy thanks the Northern California Cancer Center for identifying glioma cases;we also thank K. Aldape for pathology review and the pathology departmentsof Alexian, Alta Bates, Brookside, California Pacific, Doctors Pinole, Eden,El Camino, Good Samaritan, Highland, John Muir, Kaiser Redwood City,Kaiser San Francisco, Kaiser Santa Teresa, Los Gatos, Los Medanos, Marin General,Merrithew, Mills Peninsula, Mt. Diablo Hospital, Mt. Zion, Naval Hospital,O’Connor, Ralph K Davies, Saint Louise, San Francisco General, San Jose,San Leandro, San Mateo County, San Ramon Valley, Santa Clara Valley, Sequoia,Seton, St. Francis, St. Luke’s, St. Rose, Stanford, Summit, UC San Francisco, ValleyLivermore, Veterans Palo Alto, Veterans SF, and Washington Hospitals and MedicalCenters for providing tumor specimens for review. Genotyping services forSan Francisco study subjects were provided by deCODE Genetics, Iceland. Thecompany provided SNP and normalized CNV data and technical support indata analysis, including conference call tutorials in the use of the Disease MinerSoftware. We thank B. Scheithauer and C. Gianinni for their careful histologicalreview of all the primary high-grade gliomas collected at the Mayo Clinic for thisstudy. The Mayo Clinic Comprehensive Cancer Center Biospecimens andProcessing (TACMA), Gene Analysis, Biostatistics and Bioinformatics SharedResources were essential for the success of this study. We also thank K. Kelseyfor helpful suggestions on genotyping and interpretation of results, N. Risch forvery helpful suggestions on this paper and S. Sen for helpful discussions andsuggestions on statistical methods. Some computations were performed usingthe UCSF Biostatistics High Performance Computing System.
AUTHOR CONTRIBUTIONSM.W. was the overall UCSF study principal investigator who was responsible forsubject recruitment, oversaw all analyses and wrote parts of and synthesized thepaper. R.B.J. was the overall co-principal investigator of the Mayo study whooversaw the entire study (particularly laboratory quality control), interpreted theresults and wrote parts of the paper. J.S.C. was the UCSF epidemiologist whocontributed to development of the analysis plan, conducted statistical analyses andwrote parts of the paper. R.-F.Y. was the UCSF biostatistician who oversaw andconducted statistical analyses of the discovery phase and wrote parts of the paper.Y.X. was the UCSF biostatistician who conducted statistical analyses of thediscovery and combined phases and wrote parts of the paper. P.A.D. was the Mayostatistician who performed all Mayo data analysis. K.V.B. was the Mayo leadstatistician who participated in study design and the analysis plan. M.B.was the principal investigator of the UCSF Brain Tumor SPORE and a clinicalcollaborator who provided access for subject recruitment. J.C.B. was the Mayoneuro-oncologist who led subject recruitment. S.C. was the co-director of theUCSF neuro-oncology clinic who assisted in subject recruitment. C.G. was theMayo pathologist who verified all pathologic diagnosis of Mayo cases. C.H. wasthe Mayo laboratory technologist responsible for specimen preparation forgenotyping. T.M.K. was the Mayo laboratory manager responsible for specimen
storage and retrieval. M.L.K. provided statistical support for all Mayo analyses.D.H.L. was the Mayo neuro-oncologist who facilitated subject enrollment andmedical record data collection. L.M. was the UCSF project coordinator responsiblefor subject recruitment and preparation of datasets for analyses, and alsoconducted analyses. B.P.O. was the principal investigator of Mayo brain tumorSPORE and neurologist who facilitated subject enrollment and medical record datacollection. J.P. was the UCSF laboratory manager responsible for specimen storage,retrieval and preparation for genotyping. A.R.P. was the UCSF/Gladstonebioinformatician who participated in selecting the genotyping platform, developingthe analytical plan and reviewing the paper. M.P. was the co-director of the UCSFneuro-oncology clinic who assisted in subject recruitment. C.Q. participated insubject recruitment and pathology specimen accrual from Kaiser PermanenteNorthern California. T.R. was the UCSF project coordinator responsible for subjectrecruitment, prepared datasets for analyses, conducted analyses and wrote parts ofpaper. A.L.R. was the Mayo project coordinator responsible for subjectrecruitment. I.S. was the UCSF bioinformatician who participated in developingthe analytical plan, data analysis and interpreting results. T.T. was the UCSFneuropathologist who participated in subject identification, accrual anddevelopment of the analytical strategy. J.W. was the UCSF epidemiologist whoparticipated in choice of genotyping platform and development of the analyticalstrategy and oversaw sample preparation. P.Y. was the overall co-principalinvestigator of the Mayo study who oversaw the entire study (particularly studydesign for subject recruitment, control enrollment, data quality control andanalyses), interpreted results and wrote parts of the paper. J.K.W. was the UCSFstudy co-principal investigator who oversaw all aspects of laboratory work,participated in study design, subject accrual and development of the analysis plan,and wrote the discussion portion of the paper.
Published online at http://www.nature.com/naturegenetics/.
Reprints and permissions information is available online at http://npg.nature.com/
reprintsandpermissions/.
1. CBTRUS. Primary Brain Tumors in the United States, Statistical Report 2000–2004 (Central Brain Tumor Registry of the United States, Chicago, Illinois, 2008).
3. Price, A.L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909 (2006).
4. Cancer Genome Atlas Research Network. Comprehensive genomic characterizationdefines human glioblastoma genes and core pathways. Nature 455, 1061–1068(2008).
5. Bahuau, M. et al. Germ-line deletion involving the INK4 locus in familial proneness tomelanoma and nervous system tumors. Cancer Res. 58, 2298–2303 (1998).
6. Sharpless, N.E. et al. Loss of p16Ink4a with retention of p19Arf predisposes mice totumorigenesis. Nature 413, 86–91 (2001).
7. Krimpenfort, P. et al. p15Ink4b is a critical tumour suppressor in the absence ofp16Ink4a. Nature 448, 943–946 (2007).
8. Fuxe, J. et al. Adenovirus-mediated overexpression of p15INK4B inhibits humanglioma cell growth, induces replicative senescence, and inhibits telomerase activitysimilarly to p16INK4A. Cell Growth Differ. 11, 373–384 (2000).
9. Seoane, J. et al. TGFb influences Myc, Miz-1 and Smad to control the CDK inhibitorp15INK4b. Nat. Cell Biol. 3, 400–408 (2001).
10. Lemmens, R. et al. Variant on 9p21 strongly associates with coronary heart disease,but lacks association with common stroke. Eur. J. Hum. Genet. advance onlinepublication, doi:10.1038/ejhg.2009.42 (25 March 2009).
11. Mohlke, K.L., Boehnke, M. & Abecasis, G.R. Metabolic and cardiovascular traits: anabundance of recently identified common genetic variants. Hum. Mol. Genet. 17,R102–R108 (2008).
12. Schaefer, A.S. et al. Identification of a shared genetic susceptibility locus for coronaryheart disease and periodontitis. PLoS Genet. 5, e1000378 (2009).
13. Schunkert, H. et al. Repeated replication and a prospective meta-analysis of theassociation between chromosome 9p21.3 and coronary artery disease. Circulation117, 1675–1684 (2008).
14. Shervington, A. et al. Glioma: what is the role of c-Myc, hsp90 and telomerase? Mol.Cell. Biochem. 283, 1–9 (2006).
15. Maes, L. et al. Relation between telomerase activity, hTERT and telomere length forintracranial tumours. Oncol. Rep. 18, 1571–1576 (2007).
16. Falchetti, M.L. et al. Telomerase inhibition impairs tumor growth in glioblastomaxenografts. Neurol. Res. 28, 532–537 (2006).
17. Mushiroda, T. et al. A genome-wide association study identifies an association of acommon variant in TERT with susceptibility to idiopathic pulmonary fibrosis. J. Med.Genet. 45, 654–656 (2008).
18. McKay, J.D. et al. Lung cancer susceptibility locus at 5p15.33. Nat. Genet. 40,1404–1406 (2008).
19. Ding, H. et al. Regulation of murine telomere length by Rtel: an essential geneencoding a helicase-like protein. Cell 117, 873–886 (2004).
rs2383208 at positions 22088618 to 22122075), with maximum r2 ¼ 0.47
between rs4977756 and rs2891168. For rs1412829, the 9p21 SNP independently
associated with glioma risk in multivariate analysis, the maximum correlation
with the CAD- or T2D-associated SNP (rs2891168) was r2 ¼ 0.23.
For completeness for other glioma genetics researchers, we also present
SNP-disease association data for any SNP with P o 0.001 for tests of
associations with high-grade glioma (Supplementary Table 7). We also
conducted all analyses on glioblastoma only cases; no major differences for
statistically significant SNPs were found (data not shown).
To obtain combined estimates of high-grade glioma risk for top SNPs from
UCSF and Mayo, we used the Mantel-Haenszel method to estimate the OR,
95% CI and P value; the test is for equality of allele frequencies between cases
and controls.
Subjects for replication phase. The Mayo Clinic case group included 176
individuals with glioblastoma and anaplastic astrocytomas newly diagnosed
between 2005 and 2008. Cases were identified within 24 h of diagnosis, except
for those who had their initial diagnosis elsewhere, followed by verification at
the Mayo Clinic. The cases consisted of 67 (38%) women and 109 (62%) men
who were 53.8 ± 12.6 years old; 174 (98%) were white; 114 (65%) had
glioblastoma; 62 (35%) had anaplastic astrocytomas. Pathologic diagnosis was
confirmed by review of the primary surgical material for all cases by two Mayo
Clinic neuropathologists based on surgically resected material. The control
group consisted of consented individuals who had a general medical exam at
the Mayo Clinic. Matching variables were sex, date of birth (within two and one
half years), self-identified race (Hispanic white, non-Hispanic white, American
Indian, African American, Asian, Pacific Islander, Other) and residence.
Geographic region of residence was matched in three zones based on the
distance to the Mayo Clinic Rochester: Olmsted County; the rest of Minnesota,
Wisconsin, Iowa, North Dakota and South Dakota; and the rest of the United
States and Canada. Excluded were individuals under the age of 18 and those
with a history of brain tumor. The Mayo Clinic case and control enrollment
research protocol was approved by Mayo Institutional Review Board. These
cases and controls were genotyped using Illumina 610Quad arrays.
Sample preparation and genotyping for replication phase and quality
control. DNA was isolated from snap-frozen, buffy-coat samples using an
AutoGenFlex STAR system (AutoGen) with Qiagen’s FlexiGene DNA AGF3000
kit and AutoGen’s blood DNA finishing kit. DNA was quantified using a
ND-1000 spectrophotometer (Thermo Scientific) and normalized to 50 ng/ml
using 10 mM Tris HCl, 0.1 mM EDTA, pH 8.0 buffer (Teknova). Genotyping
was performed using Illumina 610Quad SNP arrays (Illumina) according to the
manufacturer’s recommendations. Briefly, 200 ng of genomic DNA was
amplified then fragmented. The fragmented DNA was hybridized on Illumina’s
Human 610-Quad BeadChip. Fluorescent labeling was performed by single-
base extension using labeled nucleotides. The BeadChip was then scanned with
Illumina’s Bead Array Reader. Samples, including positive controls, were
processed in a 96-well format.
We carried out allele calling using Illumina’s Genotyping Module version
3.3.7 in BeadStudio version 3.1.3.0. We summarized concordance in interplate,
intraplate and overall subject replicates to investigate potential genotyping
error. Subject-level call rates were calculated and those subjects with call rates
o0.9 were excluded from further analysis. Individual SNP call rates were
summarized and SNPs with call rates o0.9 were excluded from the analysis.
The minor allele frequency (MAF) was calculated for each SNP, and SNPs with
MAF o 0.01 were excluded from further analysis. The above analyses were
done on the complete set of data, and each analysis was repeated separately for
each plate to investigate any potential plate effects. The overall Illumina subject
call rate across all SNPs for Mayo Clinic cases and controls was 97.5 ± 0.02
(median 98.3; range 90.0–98.4). Inter- and intraplate replicate analysis was
performed for the 13 SNPs summarized in Table 1. For all 13 SNPs, all inter-
and intraplate replicates were identical.
Statistical methods for replication phase. The frequency distribution at each
SNP locus was tested against the Hardy-Weinberg equilibrium (HWE) under
the allele mendelian biallelic expectation using the w2 test. SNPs with HWE
P values o0.001 for control subjects were excluded from the analysis. The
principal component approach was implemented in EIGENSTRAT to deter-
mine whether there was any evidence of population stratification in the Mayo
cases and controls3,27. We used an additive logistic regression model for 0, 1 or
2 copies of the minor allele for candidate SNPs to investigate the association of
glioma risk. Significant principal components from the population stratifica-
tion analysis were included as covariates in the logistic regression models.
Haplotype blocks were estimated using Haploview26. The multiple SNP
marker-disease association by estimated haplotype was evaluated using
haplo.score (a software developed by the Mayo Clinic), which accounts for
ambiguous linkage phase28.
20. Felini, M.J. et al. Reproductive factors and hormone use and risk of adult gliomas.Cancer Causes Control 20, 87–96 (2009).
21. Wrensch, M. et al. Nonsynonymous coding single-nucleotide polymorphisms spanningthe genome in relation to glioblastoma survival and age at diagnosis. Clin. Cancer Res.13, 197–205 (2007).
22. Davis, F.G. et al. Issues of diagnostic review in brain tumor studies: from the BrainTumor Epidemiology Consortium. Cancer Epidemiol. Biomarkers Prev. 17, 484–489(2008).
23. Pico, A.R. et al. SNPLogic: an interactive single nucleotide polymorphism selection,annotation, and prioritization system. Nucleic Acids Res. 37, D803–D809 (2009).
24. Dausset, J. et al. Centre d’etude du polymorphisme humain (CEPH): collaborativegenetic mapping of the human genome. Genomics 6, 575–577 (1990).
25. Hung, R.J. et al. A susceptibility locus for lung cancer maps to nicotinic acetylcholinereceptor subunit genes on 15q25. Nature 452, 633–637 (2008).
26. Barrett, J.C., Fry, B., Maller, J. & Daly, M.J. Haploview: analysis and visualization of LDand haplotype maps. Bioinformatics 21, 263–265 (2005).
27. Patterson, N., Price, A.L. & Reich, D. Population structure and eigenanalysis. PLoSGenet. 2, e190 (2006).
28. Schaid, D.J., Rowland, C.M., Tines, D.E., Jacobson, R.M. & Poland, G.A. Score testsfor association between traits and haplotypes when linkage phase is ambiguous. Am. J.Hum. Genet. 70, 425–434 (2002).