Living Large: Elucidation of the Frankia EAN1pec Genome Sequence Shows Gene Expansion and Metabolic Versatility Louis S Tisa 1 , David R Benson 2 , Gary B. Smejkal 4 , Pascal Lapierre 2 , J. Peter Gogarten 2 , Philippe Normand 5 , M. Pilar Francino 3 , and Paul Richardson 3 1 Dept. Microbiology , U New Hampshire, Durham, NH, USA; 2 Dept. Mol. Cell Biol., U Connecticut , Storrs, CT, USA; 3 JGI, Walnut Creek, CA, USA, 4 Pressure Biosciences, Inc, Bridgewater, MA, USA, 5 Ecologie Microbienne UMR CNRS 5557, Université Lyon, Villeurbanne, France
31
Embed
Living Large: Elucidation of the Frankia EAN1pec Genome Sequence Shows Gene Expansion and Metabolic Versatility Louis S Tisa 1, David R Benson 2, Gary.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Living Large: Elucidation of the Frankia EAN1pec Genome
Sequence Shows Gene Expansion and Metabolic
VersatilityLouis S Tisa1, David R Benson2, Gary B. Smejkal4, Pascal Lapierre2, J. Peter Gogarten2, Philippe Normand5, M. Pilar
Francino3, and Paul Richardson3
1Dept. Microbiology , U New Hampshire, Durham, NH, USA; 2Dept. Mol. Cell Biol., U Connecticut , Storrs, CT, USA; 3 JGI, Walnut Creek, CA, USA,
4 Pressure Biosciences, Inc, Bridgewater, MA, USA, 5 Ecologie Microbienne UMR CNRS 5557, Université Lyon, Villeurbanne, France
Actinorhizal Symbiosis
• Symbiotic association between Frankia and woody dicotyledonous plants – results in formation of
root nodules
• over 250 species of Actinorhizal plants
Frankia• Member of the
Actinomycetales
• Hyphal bacteria– 67-72% GC – generation time 24-48 h
• Structures– vesicles– spores
in planta vesicle clusters Bar = 10 µm
Three Frankia genotypes
Betulaceae
Myricaceae
Casuarinacee
Elaeagnuaceae
Rhamnaceae
Myricaceae
Gymnostoma
CoriaiaceaeDatiscaceaeRosaceaeCeanothus
Why Sequence CcI3 and EAN1pec?
CcI3• Metabolism • Member of Group I• Narrow Host range • Markers:
KanR,GenR,KasR,NalR
AsO43-
EAN1pec• Diverse metabolism• Member of Group III
(globally distributed) • Broader Host range
Markers:
NovR, LinR,KasR,NalR, AsO4
3-,Pb2+ and CrO4
2+
• Limited genetics
Surprise One: Three different genome sizes
ACN 6783 CDS 2 rRNA 72.8% GC
CcI3 4515 CDS 2 rRNA 70.1% GC
EAN 7492 CDS 3 rRNA 71.0% GC
Circular Topology
Comparison of the CDS Frankia ACN14a
Frankia CcI3 Frankia
Ean1pec
2291
630
587
1190
2730
1333 3725
reciprocal blast search with a cutoff of 10-4.
Comparative distribution of ORF function
0
200
400
600
800
1000
1200
1400 Frankia sp. CcI3
Frankia sp. EAN1pec
Acidothermus cellulolyticus 11B
Arthrobacter sp. FB24
Kineococcus radiotolerans SRS30216
Streptomyces coelicolor A3(2)
COG Functional Groups
-1500
-1000
-500
0
500
1000
1500
T + I Duplicates ORFans
CcI3
ACN
EaNGene Duplication Level is higher in EAN (18.5% of the ORFs) than ACN (7.5 %) or CcI3 (9.8 %)
1355
The EAN Genome is Expanding
1054
CcI3 has an accelerated rate of gene loss compared to EAN and ACN
What are the Major families of duplicated genes in these Frankia strains?
BlastClust (NCBI) analysis 25% identity over at least 40% of the length (30% identity /52% length same result)
An analysis of the Top 20 duplicated gene families showed major differences in
functional groups
CcI3 (165/444)• 116 out 165 (70%) duplicated genes belonged to
several classes of transposases and genes associated with prophage and plasmids
EAN (406/1355)• Transport proteins,
Dioxygenases, Short chain dehydrogenases/reductases (SDR), Regulatory proteins, cytochrome P450, monooxygenases
• also like CcI3 132 out of 406 (32.5%) genes associated with integrases, transposases
ACN (151/512)• Transport proteins,
SDR,serine-threonine protein kinases, methyltransferases, endonucleases, & a variety of dehydrogenases
• no transposases in 151 genes of the top 20 families
CcI3
•loss of genes associated with transport and metabolism
b, EAN :Elaeagnaceae (pink), Myricaceae (green) Rhamnaceae (blue, Tribe Colletieae in South America, Australia and New Zealand). Areas of overlap are brown and dark blue).
Present day native distribution of actinorhizal plant hosts.
a, ACN: Betulaceae (orange) Myricaceae (green) and their overlap (khaki).
c, CcI3: Casuarina and Allocasuarina of the Casuarinaceae (light blue).
What about genes identified as potentially involved in Symbiosis?
• Nitrogenase components
• Hopanoid biosynthesis
• Uptake Hydrogenase biosynthesis
• Hemoglobin
• Nodulation
Nitrogenase Cluster for EAN1pec
NifK, NifD, NifHNifX, NifN, NifE
hypothetical proteins
NifB NifZ NifW
3 Fd genesNifS
NifV (homocitrate synthase) is located in another region of the chromosome
HboO expression is up-regulated under hypoxic conditions
Nitrogen status did not significantly affect expression
Why the large genome (9.1 Mb) for Frankia EAN1pec?
• many soil dwellers have large genomes (Streptomyces, Bradyrhizobium, Burkholderia, etc.
• these “boy-scouts” are always prepared for changing conditions of the soil environment– wide array of substrates (uptake systems)– need for tight regulation
Why the large genome (9.1 Mb) for Frankia EAN1pec?
• many soil dwellers have large genomes (Streptomyces, Bradyrhizobium, Burkholderia, etc.
• these “boy-scouts” are always prepared for changing conditions of the soil environment– wide array of substrates (uptake systems)– need for tight regulation
Metabolism
• Complete Embden-Meyerhof, TCA and Pentose Phosphate pathways
• wide arsenal of transport genes • large numbers of genes for short chain
dehydrogenase/reductase, dioxygenase, etc.
Regulatory mechanisms • Large number of DNA binding proteins• Two-component systems• Sigma Factors • Anti-sigma Factors• Anti-sigma Factor Antagonists