Top Banner
Point and Click Microbiome Analysis Tools from the BioHPC and BICF
31

Point and Click Microbiome Analysis Tools from the BioHPC ...

Jan 31, 2022

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Point and Click Microbiome Analysis Tools from the BioHPC ...

Point and Click Microbiome Analysis

Tools from the BioHPC and BICF

Page 2: Point and Click Microbiome Analysis Tools from the BioHPC ...

Allows groups to give easy-access to their analysis pipelines via the web

Astrocyte – BioHPC Workflow Platform

Standardized Workflows

Simple Web Forms

Online documentation & results visualization*

Workflows run on HPC cluster without developer or user needing cluster knowledge

Slide contribution: David Trudgian@BioHPC

astrocyte.biohpc.swmed.edu

Page 3: Point and Click Microbiome Analysis Tools from the BioHPC ...

https://astrocyte.biohpc.swmed.edu/brand/bicf/browse/

Page 4: Point and Click Microbiome Analysis Tools from the BioHPC ...

Create a new project

Page 5: Point and Click Microbiome Analysis Tools from the BioHPC ...

Add data to your project

Page 6: Point and Click Microbiome Analysis Tools from the BioHPC ...

Add data to your project

For NGS experiment, this is recommended.

Page 7: Point and Click Microbiome Analysis Tools from the BioHPC ...

Make your design filegroup This ID should match the name in the fastq file ie S0001.R1.fastq.gz the sample ID is S0001 Note: SampleID shouldn't start with numbers ie 10C should be changed to S10C condition This is the group that will be used for pairwise differential abundance analysis

group conditionGut1 GutGut2 GutMouth1 MouthMouth2 MouthNasal1 NasalNasal2 Nasal

Page 8: Point and Click Microbiome Analysis Tools from the BioHPC ...

Make your design file• Use tab as delimiter – Excel save as “Text (tab delimited)”

• For all contents, no “-” • For all contents, no spaces • Columns names MUST be exactly the same as

documented

Page 9: Point and Click Microbiome Analysis Tools from the BioHPC ...

Select your data files and set up workflow and submit

SELECT YOUR FILES

Page 10: Point and Click Microbiome Analysis Tools from the BioHPC ...

Project is running

Page 11: Point and Click Microbiome Analysis Tools from the BioHPC ...

Timeline of the whole run

Page 12: Point and Click Microbiome Analysis Tools from the BioHPC ...

Common errors and solutions

• Make sure the delimiter is tab • Make sure the column name are the same

as mentioned in documentation • Make sure the file names match

Page 13: Point and Click Microbiome Analysis Tools from the BioHPC ...

Common errors and solutions

• Not all files are uploaded

• It’s about the proxy setting

• Use auto-detect proxy

Page 14: Point and Click Microbiome Analysis Tools from the BioHPC ...

Marker Genes Allow For Taxonomic Profiling

Page 15: Point and Click Microbiome Analysis Tools from the BioHPC ...

Marker Genes Allow For Taxonomic Profiling

• Should be present in all prokaryotic organisms compared

• Vertically and slowly evolving • Amplify-able with small set of “universal

primers” • Has an established database of reference

sequences

Page 16: Point and Click Microbiome Analysis Tools from the BioHPC ...

rRNAs as phylogenetic markers• Ribosomal RNAs are present in all living organisms

– 16S present in all prokaryotes – 18S present in all eukaryotes

• rRNAs are vertically and slowly evolving – Play a critical role in protein translation – rRNAs are relatively conserved and rarely acquired

horizontally – rRNAs are amplify-able with small set of “universal

primers” • rRNAs has an established reference database

Page 17: Point and Click Microbiome Analysis Tools from the BioHPC ...

rRNA Reference Databases

Cole, J. R., Q. Wang, J. A. Fish, B. Chai, D. M. McGarrell, Y. Sun, C. T. Brown, A. Porras-Alfaro, C. R. Kuske, and J. M. Tiedje. 2014. Ribosomal Database Project: data and tools for high throughput rRNA analysis Nucl. Acids Res. 42(Database

issue):D633-D642; doi: 10.1093/nar/gkt1244 [PMID: 24288368]

Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO (2013) The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucl. Acids Res. 41

(D1): D590-D596.

DeSantis, T. Z., P. Hugenholtz, N. Larsen, M. Rojas, E. L. Brodie, K. Keller, T. Huber, D. Dalevi, P. Hu, and G. L. Andersen. 2006. Greengenes, a

Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARB. Appl

Environ Microbiol 72:5069-72.

Page 18: Point and Click Microbiome Analysis Tools from the BioHPC ...

Other Marker Genes

• Intergenic Transcribed Spacer (ITS)

• RecA: Response to DNA Stress in Bacteria

• Cpn60: Chaperonin Database

Page 19: Point and Click Microbiome Analysis Tools from the BioHPC ...

Overall Analysis PipelineInput Seq

QC Barcode/Primer + Quality

Trimming; Min Read Length

Align Sequences to 16S Reference DB

Taxonomic Assignment

OTU Clustering

Alpha Diversity

Beta DiversityRarefaction

PCoA NMDS

Stat Analysis

Page 20: Point and Click Microbiome Analysis Tools from the BioHPC ...

Alpha Diversity

• Species richness is a survey of the number of distinct organism in a community

• Rarefaction is a method to assess species richness • Species evenness measures how equal the

community ie 2 taxa each at 50% abundance vs 9 to 1 ratio.

• Alpha diversity is a measurement composed of richness and evenness.

Page 21: Point and Click Microbiome Analysis Tools from the BioHPC ...

Beta-Diversity

• Beta-diversity measures including absolute or relative overlap describe how many taxa are shared between habitats

• Beta diversity acts like a similarity score between populations, allowing analysis by sample clustering or, again, by dimensionality reductions such as PCA

• Beta diversity can be measured by simple taxa overlap such as Bray-Curtis dissimilarity

Page 22: Point and Click Microbiome Analysis Tools from the BioHPC ...

Unifrac

• A distance metric used for comparing biological communities

• It differs from distance metrics (Bray Curtis) as it incorporates phylogenetic distances (tree based) between observed organisms in the computation

• Weighted Unifrac also incorporates taxonomic abundances

Page 23: Point and Click Microbiome Analysis Tools from the BioHPC ...

Sample Comparison based on OTU Composition

PCoA

Page 24: Point and Click Microbiome Analysis Tools from the BioHPC ...

Astrocyte Workflow• Uses Mothur’s MiSeq SOP • https://www.mothur.org/wiki/MiSeq_SOP

• Reference Database and Taxonomy • Silva • GreenGenes

• Allows users to visualize results (VizApp)

Page 25: Point and Click Microbiome Analysis Tools from the BioHPC ...
Page 26: Point and Click Microbiome Analysis Tools from the BioHPC ...

Alpha Diversity

Page 27: Point and Click Microbiome Analysis Tools from the BioHPC ...

PCOA and NMDS

Page 28: Point and Click Microbiome Analysis Tools from the BioHPC ...
Page 29: Point and Click Microbiome Analysis Tools from the BioHPC ...
Page 30: Point and Click Microbiome Analysis Tools from the BioHPC ...
Page 31: Point and Click Microbiome Analysis Tools from the BioHPC ...