www.sciencetranslationalmedicine.org/cgi/content/full/10/472/eaap8914/DC1 Supplementary Materials for Gut microbiota composition and functional changes in inflammatory bowel disease and irritable bowel syndrome Arnau Vich Vila, Floris Imhann, Valerie Collij, Soesma A. Jankipersadsing, Thomas Gurry, Zlatan Mujagic, Alexander Kurilshikov, Marc Jan Bonder, Xiaofang Jiang, Ettje F. Tigchelaar, Jackie Dekens, Vera Peters, Michiel D. Voskuil, Marijn C. Visschedijk, Hendrik M. van Dullemen, Daniel Keszthelyi, Morris A. Swertz, Lude Franke, Rudi Alberts, Eleonora A. M. Festen, Gerard Dijkstra, Ad A. M. Masclee, Marten H. Hofker, Ramnik J. Xavier, Eric J. Alm, Jingyuan Fu, Cisca Wijmenga, Daisy M. A. E. Jonkers, Alexandra Zhernakova, Rinse K. Weersma* *Corresponding author. Email: [email protected]Published 19 December 2018, Sci. Transl. Med. 10, eaap8914 (2018) DOI: 10.1126/scitranslmed.aap8914 The PDF file includes: Materials and Methods Fig. S1. Comparison of microbial richness between cohorts. Fig. S2. Venn diagram of overlapping taxa between IBD and clinical IBS. Fig. S3. Cohorts, sample collection, and sample processing algorithm. Fig. S4. Principal coordinate analysis plot on Bray-Curtis dissimilarities of controls. Fig. S5. Phenotype data processing algorithm. Fig. S6. Metagenomic sequencing data pipeline. Fig. S7. Overview of statistical analyses. Fig. S8. Prediction model to distinguish cohort of origin in disease. Fig. S9. Prediction model to distinguish cohort of origin in controls. References (37–51) Other Supplementary Material for this manuscript includes the following: (available at www.sciencetranslationalmedicine.org/cgi/content/full/10/472/eaap8914/DC1) Table S1 (Microsoft Excel format). Summary statistics of phenotypes. Table S2 (Microsoft Excel format). Summary statistics of gut microbiome taxonomy. Table S3 (Microsoft Excel format). Variables included in linear models case-control analyses. Table S4 (Microsoft Excel format). Taxonomy results of CD versus controls. Table S5 (Microsoft Excel format). Taxonomy results of UC versus controls. Table S6 (Microsoft Excel format). Taxonomy results of IBS-GE versus controls. Table S7 (Microsoft Excel format). Taxonomy results in the overlap of IBD and IBS-GE. Table S8 (Microsoft Excel format). Taxonomy results of IBS-POP versus controls.
31
Embed
Supplementary Materials for · composition, and Shannon index in IBS-GE. Table S21 (Microsoft Excel format). Associated phenotypes on gene richness, gut microbiome composition, and
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Gut microbiota composition and functional changes in inflammatory bowel disease
and irritable bowel syndrome
Arnau Vich Vila, Floris Imhann, Valerie Collij, Soesma A. Jankipersadsing, Thomas Gurry, Zlatan Mujagic, Alexander Kurilshikov, Marc Jan Bonder, Xiaofang Jiang, Ettje F. Tigchelaar, Jackie Dekens, Vera Peters,
Michiel D. Voskuil, Marijn C. Visschedijk, Hendrik M. van Dullemen, Daniel Keszthelyi, Morris A. Swertz, Lude Franke, Rudi Alberts, Eleonora A. M. Festen, Gerard Dijkstra, Ad A. M. Masclee, Marten H. Hofker,
Ramnik J. Xavier, Eric J. Alm, Jingyuan Fu, Cisca Wijmenga, Daisy M. A. E. Jonkers, Alexandra Zhernakova, Rinse K. Weersma*
Published 19 December 2018, Sci. Transl. Med. 10, eaap8914 (2018)
DOI: 10.1126/scitranslmed.aap8914
The PDF file includes:
Materials and Methods Fig. S1. Comparison of microbial richness between cohorts. Fig. S2. Venn diagram of overlapping taxa between IBD and clinical IBS. Fig. S3. Cohorts, sample collection, and sample processing algorithm. Fig. S4. Principal coordinate analysis plot on Bray-Curtis dissimilarities of controls. Fig. S5. Phenotype data processing algorithm. Fig. S6. Metagenomic sequencing data pipeline. Fig. S7. Overview of statistical analyses. Fig. S8. Prediction model to distinguish cohort of origin in disease. Fig. S9. Prediction model to distinguish cohort of origin in controls. References (37–51)
Other Supplementary Material for this manuscript includes the following: (available at www.sciencetranslationalmedicine.org/cgi/content/full/10/472/eaap8914/DC1)
Table S1 (Microsoft Excel format). Summary statistics of phenotypes. Table S2 (Microsoft Excel format). Summary statistics of gut microbiome taxonomy. Table S3 (Microsoft Excel format). Variables included in linear models case-control analyses. Table S4 (Microsoft Excel format). Taxonomy results of CD versus controls. Table S5 (Microsoft Excel format). Taxonomy results of UC versus controls. Table S6 (Microsoft Excel format). Taxonomy results of IBS-GE versus controls. Table S7 (Microsoft Excel format). Taxonomy results in the overlap of IBD and IBS-GE. Table S8 (Microsoft Excel format). Taxonomy results of IBS-POP versus controls.
Table S9 (Microsoft Excel format). Taxonomy results of all diseases versus controls. Table S10 (Microsoft Excel format). Strain diversity results of all diseases versus controls. Table S11 (Microsoft Excel format). Bacterial growth rate results of all diseases versus controls. Table S12 (Microsoft Excel format). Prediction accuracy of all prediction models. Table S13 (Microsoft Excel format). Top 20 gut microbiome features in the prediction model. Table S14 (Microsoft Excel format). Summary statistics of gut microbiome MetaCyc function. Table S15 (Microsoft Excel format). Pathway results of all diseases versus controls. Table S16 (Microsoft Excel format). Virulence factor results of all diseases versus controls. Table S17 (Microsoft Excel format). Antibiotic resistance gene results of all diseases versus controls. Table S18 (Microsoft Excel format). Associated phenotypes on gene richness, gut microbiome composition, and Shannon index in CD. Table S19 (Microsoft Excel format). Associated phenotypes on gene richness, gut microbiome composition, and Shannon index in UC. Table S20 (Microsoft Excel format). Associated phenotypes on gene richness, gut microbiome composition, and Shannon index in IBS-GE. Table S21 (Microsoft Excel format). Associated phenotypes on gene richness, gut microbiome composition, and Shannon index in IBS-POP. Table S22 (Microsoft Excel format). Correlation phenotypic factors with an FDR <0.1 from the Adonis analysis in CD. Table S23 (Microsoft Excel format). Correlation phenotypic factors with an FDR <0.1 from the Adonis analysis in UC. Table S24 (Microsoft Excel format). Correlation phenotypic factors with an FDR <0.1 from the Adonis analysis in IBS-GE. Table S25 (Microsoft Excel format). Correlation phenotypic factors with an FDR <0.1 from the Adonis analysis in IBS-POP. Table S26 (Microsoft Excel format). Variables included in multivariate linear models within disease cohorts. Table S27 (Microsoft Excel format). Taxonomy results within the CD univariate model. Table S28 (Microsoft Excel format). Taxonomy results within the CD multivariate model. Table S29 (Microsoft Excel format). Taxonomy results within the UC univariate model. Table S30 (Microsoft Excel format). Taxonomy results within the UC multivariate model. Table S31 (Microsoft Excel format). Taxonomy results within the IBS-GE univariate model. Table S32 (Microsoft Excel format). Taxonomy results within the IBS-GE multivariate model. Table S33 (Microsoft Excel format). Taxonomy results within the IBS-POP univariate model. Table S34 (Microsoft Excel format). Taxonomy results within the IBS-POP multivariate model. Table S35 (Microsoft Excel format). Pathway results within the CD univariate model. Table S36 (Microsoft Excel format). Pathway results within the CD multivariate model. Table S37 (Microsoft Excel format). Pathway results within the UC univariate model. Table S38 (Microsoft Excel format). Pathway results within the UC multivariate model. Table S39 (Microsoft Excel format). Pathway results within the IBS-GE univariate model. Table S40 (Microsoft Excel format). Pathway results within the IBS-GE multivariate model. Table S41 (Microsoft Excel format). Pathway results within the IBS-POP univariate model. Table S42 (Microsoft Excel format). Pathway results within the IBS-POP multivariate model. Table S43 (Microsoft Excel format). Cohort-associated taxa and IBD versus IBS taxonomical associations.
SUPPLEMENTARY MATERIALS
Materials and Methods
I. Cohorts, sample collection, sample processing, and metagenomic sequencing
Stool samples and phenotypic data of three cohorts were collected and uniformly processed
using the algorithm depicted in Fig. S3.
A. Cohorts
In this study, we used data and biomaterials from three cohorts from the Netherlands:
Cohort 1 - LifeLines DEEP
The first cohort is the LifeLines DEEP cohort, which is a subset of the Dutch general population
cohort, LifeLines. Both LifeLines and LifeLines DEEP have been previously described (37, 38).
In summary, LifeLines is a three-generation cohort that comprises approximately 167,000
participants residing in the three northern provinces of the Netherlands. All participants will be
followed-up prospectively for at least 30 years. Participants regularly undergo physical
examinations and fill in extensive questionnaires. In addition, blood and urine samples are
collected. Each participant is asked to fill in health, lifestyle, and quality-of-life questionnaires
every 1.5 years, whereas each participant is invited for a follow-up visit to a LifeLines clinic
every 5 years (37).
LifeLinesDEEP comprises approximately 1,500 LifeLines participants. The aim of the
LifeLinesDEEP cohort is to investigate different -omics layers. Therefore, additional
biomaterials were collected, including fecal samples. Participants who consented to giving a
fecal sample were also asked to fill in extensive questionnaires on their gastrointestinal (GI)
health (38).
Cohort 2 - University Medical Center Groningen IBD
The second cohort is the University Medical Center Groningen IBD (UMCG IBD) cohort.
Patients with inflammatory bowel disease (IBD) were recruited at the specialized IBD outpatient
clinic of the Department of Gastroenterology and Hepatology, UMCG, as described previously
(7). The IBD diagnosis was made based on accepted radiological, endoscopic and
histopathological evaluation. All patients were 18 years or older at the time of fecal sample
collection.
Cohort 3 - Maastricht IBS
The third cohort is the Maastricht IBS (MIBS) cohort and comprises Irritable Bowel Syndrome
(IBS) patients and healthy controls. IBS patients were recruited at the out-patient department of
the Gastroenterology-Hepatology division of the Maastricht University Medical Center+, a
secondary and tertiary referral center, and via general practitioners from the Maastricht area. All
IBS patients were diagnosed by a gastroenterologist after an extensive work-up that usually
included a colonoscopy. Healthy controls were age- and sex-matched and had a medical
examination to exclude any gastrointestinal disorders and determine current or previous
gastrointestinal complaints. In addition, patients were asked to fill in extensive questionnaires on
gastrointestinal health (39).
B. Analysis groups
The three cohorts were combined and the samples were subsequently divided into analysis
groups. Samples were exclusively assigned to one of the following four analysis groups:
Analysis group 1 - Population controls
The first analysis group comprised participants from both the LifeLinesDEEP cohort as well as
from the MIBS cohort. From LifeLinesDEEP, participants with self-reported IBD and self-
reported IBS were excluded, and fecal samples from 926 participants were analyzed. From the
MIBS cohort, 144 healthy controls were included. After removing samples with a read count of
less than 10 million after metagenomic sequencing (n=45), analysis group 1 - population controls
consisted of 1025 samples (893 from LifeLinesDEEP, and 132 from MIBS). Comparability
between controls was assessed by a principal coordinate analysis on Bray-Curtis dissimilarities.
No significance differences were observed in the first three principal coordinates (Wilcoxon test:
p= 0.07, p=0.95, p=0.56, respectively) (Fig S4). However, when testing individual microbial
feature associations between controls from the LifeLinesDEEP cohort versus controls from the
Maastricht cohort, the relative abundance of 42 taxa were found to be statistically significantly
different between cohorts (FDR<0.01) (Table S43). In order to remove any batch effect in the
case-control analyses, a cohort covariate was forced into the linear models.
Analysis group 2 - IBD patients diagnosed by a gastroenterologist (IBD)
The second analysis group comprised 427 patients with IBD, i.e. the entire cohort UMCG IBD.
Patients were excluded from analyses when they had a stoma, a pouch or a short bowel (n=47).
Moreover, two samples were excluded due to accidental sampling of peri-anal abscess content
instead of stool. After removing samples with a read count less than 10 million reads after
metagenomic sequencing (n=23), analysis group 2 - IBD consisted of 355 IBD patients
comprising 208 patients with CD, 126 patients with UC, and 21 patients with IBD-Undetermined
(IBDU).
Analysis group 3 - IBS patients diagnosed by a gastroenterologist (IBS-GE)
The third analysis group comprised 188 IBS patients from the MIBS cohort, who were diagnosed
by their treating gastroenterologist based on to the ROME III criteria and the exclusion of other
GI diseases (39). This group is referred to as IBS-gastroenterologist (IBS-GE). To exclude any
other organic diseases, additional tests were performed when deemed necessary by the
gastroenterologist, i.e. biopsies from endoscopy, abdominal imaging, blood, breath, and fecal
analyses. After removing samples with a read count less than 10 million reads after metagenomic
sequencing (n=7), analysis group 3 - IBS-GE consisted of 181 patients. In this group, patients
were also assigned to IBS subtypes according to the ROME III criteria, indicating the most
predominant bowel habit of the patients: 65 patients with IBS diarrhea (IBS‐D), 33 patients with
IBS constipation (IBS‐C), 73 patients with IBS mixed stool pattern (IBS‐M) and 10 patients with
the IBS unspecified subtype (IBS‐U).
Analysis group 4 - IBS patients diagnosed by self-filled-in ROME III questionnaire (IBS-POP)
The fourth group comprised 242 IBS patients from the LifeLines DEEP cohort. After the
completed ROME III questionnaires had been evaluated, these patients met the ROME III
criteria for IBS. This group is referred to as IBS population (IBS-POP). After removing patients
with a read count less than 10 million reads after metagenomic sequencing (n=11), analysis
group 4 - IBS-POP consisted of 231 IBS patients. Patients were assigned to IBS subtypes based
on predominant bowel habits according to the self-reported ROME III criteria and to the self-
reported Bristol Stool Form Scale information (40): 42 patients with IBS diarrhea (IBS‐D), 65
patients with IBS constipation (IBS‐C), 111 patients with IBS mixed stool pattern (IBS‐M) and
13 patients with the IBS unspecified subtype (IBS‐U).
C. Fecal sample collection
All participants of the LifeLines DEEP cohort and the UMCG IBD cohort were asked to produce
a fecal sample at home and place it in their home freezer (-20°C) within 15 minutes after
production. Subsequently, a nurse visited all participants to pick up the fecal samples on dry ice
and transfer them to the laboratory of the Department of Genetics, UMCG. Aliquots were made
and these were stored at -80°C until further processing.
All participants of the MIBS cohort were asked to produce a fecal sample at home and
place it in their home refrigerator (4°C) and bring it to the hospital within 24 hours. Here, the
samples were collected and aliquots were made. The aliquots were then stored at -80°C. Next,
aliquots of the samples were shipped on dry ice to the Department of Genetics, UMCG, for
further processing.
D. DNA extraction from fecal samples
All fecal sample aliquots remained frozen until microbial DNA extraction. Fecal DNA isolation
was performed using the AllPrep DNA/RNA Mini Kit (Qiagen; cat. # 80204) with the addition
of mechanical lysis as previously described (8).
E. Metagenomic sequencing
After fecal sample collection and DNA extraction, fecal DNA was sent to the Broad Institute of
Harvard and MIT in Cambridge, Massachusetts, USA, where library preparation and whole
genome shot-gun sequencing was performed on the Illumina HiSeq platform. From the raw
metagenomic sequencing data, low quality reads were discarded by the sequencing facility using
an in-house pipeline. Samples with a read depth of less than 10 million reads were excluded from
subsequent analyses. Next, quality trimming and adapter removal was performed using
Trimmomatic (v.0.32), setting the minimum length to 70% of the total input read length (41).
F. Fecal biomarker measurements
In this study, we used three fecal biomarkers: fecal calprotectin, chromogranin A (CgA), and
human-β-defensin-2 (HBD-2). Fecal calprotectin levels were available for all three cohorts. The
levels are a marker for inflammation in the gastrointestinal tract (42). Fecal calprotectin levels
were measured at the UMCG using a commercial enzyme-linked immunosorbent assay (ELISA,
Bühlmann Laboratories, Switzerland). CgA and HBD-2 were available for patients in the
analysis groups IBS-GE and IBS-POP. CgA marks the activation of the neuroendocrine system
(27) and HBD-2 marks the defense mechanisms against invasion of microbes (43). As described
previously, CgA and HBD-2 were measured at the ‘Medische Laboratoria Dr. Stein & Collegae’
(the Netherlands) using a commercial radioimmunoassay (RIA, Euro-Diagnostica, Sweden) and
a commercial enzyme-linked immunosorbent assay (ELISA, Immunodiagnostik AG, Germany),
respectively (39).
II. PROCESSING OF PHENOTYPE DATA
The phenotypes and summary statistics of all 1792 individuals are presented in Table S1
(depicted in Fig S5)
A. Phenotypes of Analysis group 1 - Population controls
In population controls, 25 phenotypes were collected and subsequently used for all case-control
analyses (Table S3): 4 intrinsic factors, 19 medication categories, and 2 smoking factors. These
phenotypes have been related to the gut microbiome composition in the general population (10).
In addition, these phenotypes were available for all four analysis groups (10, 39).
B. Phenotypes of Analysis group 2 - IBD patients
For patients with IBD, 159 phenotypes were collected and subsequently associated with the gut
microbiome composition. These factors included 5 intrinsic factors, 20 IBD-specific phenotypes,