Top Banner
Sequence Similarity as a Predictor of the Transmembrane Topology of Membrane-Intrinsic Subunits of Bacterial Respiratory Chain Enzymes* Richard A. Rothery 1 , Navita Kalra 1 , Raymond J. Turner 2 , and Joel H. Weiner 11 CIHR Group in the Molecular Biology of Membrane Proteins, Department of Biochemistry, 474 Medical Sciences Building, University of Alberta, Edmonton, Alberta T6G 2H7 2 Department of Biological Sciences, 156 Biological Sciences Building, University of Calgary, 2500 University Drive N.W., Calgary, Alberta, Canada, T2N 1N4 Abstract Integral membrane proteins usually have a pre- dominantly a-helical secondary structure in which transmembrane segments are connected by membrane-extrinsic loops. Although a number of membrane protein structures have been reported in recent years, in most cases transmembrane topologies are initially predicted using a variety of theoretical techniques, including hydropathy ana- lyses and the ‘‘positive inside’’ rule. We have explored the use of plots of the distribution of sequence similarity within families of membrane proteins comprising homeomorphic domains as a new method for the prediction/verification of the orientation of transmembrane topology models within certain families of multimeric respiratory chain enzymes. Within such proteins, analyses of sequence similarity can: i) identify heme and/or quinol binding sites; ii) identify potential electron- transfer conduits to/from prosthetic groups; and iii) locate regions defining potential subunit-sub- unit interactions. We mined emerging bioinfor- matic data for sequences of 11 families of membrane-intrinsic proteins that are part of multi- meric respiratory chain complexes that also have membrane-extrinsic subunits. The sequences of each family were then aligned and the resultant alignments converted into a graphical format recording an empirical measure of the sequence similarity plotted versus residue position. In each case, this plot was compared to the predicted transmembrane topology. With one exception, there is a strong correlation between the existence of membrane-extrinsic loop-localized sequence similarity and predicted subunit-subunit interac- tions. Introduction Adaptable bacteria that can exploit various sources of metabolic energy are able to assemble a diverse array of respiratory chains in response to available respira- tory reductants and oxidants. Given the early emer- gence of molecular genetic approaches and the accumulation of sequence data for Escherichia coli , much of the progress in delineating these chains is based on studies of this organism as a model system (Berks et al. , 1995a; Gennis and Stewart, 1996; Richardson and Watmough, 1999; Unden and Bon- gaerts, 1997). Respiratory chain complexes can: i) be comprised primarily of membrane-intrinsic subunits (e.g. cytochromes bo 3 and bd; quinol:oxygen oxido- reductases); ii) be comprised of membrane-extrinsic subunits that are anchored to the membrane by at least one membrane-intrinsic subunit (i.e. quinol:fumarate oxidoreductase, FrdABCD; 1 quinol:nitrate oxidoreduc- tase, NarGHI); or iii) be comprised of many (>2) membrane-extrinsic subunits, as well as many mem- brane-intrinsic subunits (>2) (NADH:quinol oxidoreductase, complex I; quinol:cytochrome c oxi- doreductase, complex III). In addition to their mem- brane-anchoring function, the membrane-intrinsic subunits also provide sites for quinone interaction and heme coordination (Eposti, 1989; Lancaster et al., 1999; Magalon et al., 1998; Nakamura et al., 1996). Interaction with quinones provides for a lateral flux of reducing equivalents within the membrane between the primary dehydrogenase and terminal reductase of each respiratory chain. Within the relatively simple respira- tory chain complexes (#2 membrane-extrinsic and membrane-intrinsic subunits), there is a distinctive orientation of the subunit-subunit interactions between # 2002 Horizon Scientific Press *This work was funded by a grant from the Canadian Institutes of Health Research to J.H.W. N.K. was supported by a NSERC Summer Studentship. For correspondence. Tel. (780) 4922761; Fax. (780) 4920886. 1 Abbreviations. DmsABC, S- and N- oxide (SNO) reductase from Escherichia coli ; FdnGHI, formate dehydrogenase from E. coli ; FrdABCD, fumarate reductase from E. coli ; FrdCAB, fumarate reductase from Wolinella succinogenes; HyaABC, hydrogenase-1 (HYD-1) from E. coli ; HybOBC, hydrogenase-2 (HYD-2) from E. coli; Mo-bisMGD, molybdo-bis(molybdopterin guanine dinucleotide); MTT, membrane targeting and translocation system; NarGHI, nitrate reductase; NrfABCD, nitrite reductase from E. coli ; Q-pool, quinone pool; PsrABC, polysulfide reductase from W. succinogenes; Q-site, quinone/quinol binding site; SdhCAB, Bacillus subtilis succinate dehydrogenase; SdhCDAB, succinate dehydrogenase from E. coli; TM, transmembrane. J. Mol. Microbiol. Biotechnol. (2002) 4(2): 133–150. JMMB Research Article
19

Sequence Similarity as a Predictor of the Transmembrane ...

Nov 23, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Sequence Similarity as a Predictor of the Transmembrane ...

Sequence Similarity as a Predictor of theTransmembrane Topology of Membrane-IntrinsicSubunits of Bacterial Respiratory Chain Enzymes*

Richard A. Rothery1, Navita Kalra1,Raymond J. Turner2, and Joel H. Weiner1†

1CIHR Group in the Molecular Biology of MembraneProteins, Department of Biochemistry, 474 MedicalSciences Building, University of Alberta, Edmonton,Alberta T6G 2H72Department of Biological Sciences, 156 BiologicalSciences Building, University of Calgary, 2500University Drive N.W., Calgary, Alberta, Canada,T2N 1N4

Abstract

Integral membrane proteins usually have a pre-dominantly a-helical secondary structure in whichtransmembrane segments are connected bymembrane-extrinsic loops. Although a number ofmembrane protein structures have been reportedin recent years, in most cases transmembranetopologies are initially predicted using a variety oftheoretical techniques, including hydropathy ana-lyses and the ‘‘positive inside’’ rule. We haveexplored the use of plots of the distribution ofsequence similarity within families of membraneproteins comprising homeomorphic domains as anew method for the prediction/verification of theorientation of transmembrane topology modelswithin certain families of multimeric respiratorychain enzymes. Within such proteins, analyses ofsequence similarity can: i) identify heme and/orquinol binding sites; ii) identify potential electron-transfer conduits to/from prosthetic groups; andiii) locate regions defining potential subunit-sub-unit interactions. We mined emerging bioinfor-matic data for sequences of 11 families ofmembrane-intrinsic proteins that are part of multi-meric respiratory chain complexes that also havemembrane-extrinsic subunits. The sequences ofeach family were then aligned and the resultantalignments converted into a graphical formatrecording an empirical measure of the sequencesimilarity plotted versus residue position. In eachcase, this plot was compared to the predictedtransmembrane topology. With one exception,there is a strong correlation between the existence

of membrane-extrinsic loop-localized sequencesimilarity and predicted subunit-subunit interac-tions.

Introduction

Adaptable bacteria that can exploit various sources ofmetabolic energy are able to assemble a diverse arrayof respiratory chains in response to available respira-tory reductants and oxidants. Given the early emer-gence of molecular genetic approaches and theaccumulation of sequence data for Escherichia coli,much of the progress in delineating these chains isbased on studies of this organism as a model system(Berks et al., 1995a; Gennis and Stewart, 1996;Richardson and Watmough, 1999; Unden and Bon-gaerts, 1997). Respiratory chain complexes can: i) becomprised primarily of membrane-intrinsic subunits(e.g. cytochromes bo3 and bd; quinol:oxygen oxido-reductases); ii) be comprised of membrane-extrinsicsubunits that are anchored to the membrane by at leastone membrane-intrinsic subunit (i.e. quinol:fumarateoxidoreductase, FrdABCD;1 quinol:nitrate oxidoreduc-tase, NarGHI); or iii) be comprised of many (>2)membrane-extrinsic subunits, as well as many mem-b rane - i n t r i ns i c subun i t s (>2 ) (NADH:qu i no loxidoreductase, complex I; quinol:cytochrome c oxi-doreductase, complex III). In addition to their mem-brane-anchoring function, the membrane-intrinsicsubunits also provide sites for quinone interaction andheme coordination (Eposti, 1989; Lancaster et al.,1999; Magalon et al., 1998; Nakamura et al., 1996).Interaction with quinones provides for a lateral flux ofreducing equivalents within the membrane between theprimary dehydrogenase and terminal reductase of eachrespiratory chain. Within the relatively simple respira-tory chain complexes (#2 membrane-extrinsic andmembrane-intrinsic subunits), there is a distinctiveorientation of the subunit-subunit interactions between

# 2002 Horizon Scientific Press

*This work was funded by a grant from the Canadian Institutes ofHealth Research to J.H.W. N.K. was supported by a NSERC SummerStudentship.

†For correspondence. Tel. (780) 4922761; Fax. (780) 4920886.

1Abbreviations. DmsABC, S- and N- oxide (SNO) reductase fromEscherichia coli; FdnGHI, formate dehydrogenase from E. coli;FrdABCD, fumarate reductase from E. coli; FrdCAB, fumaratereductase from Wolinella succinogenes; HyaABC, hydrogenase-1(HYD-1) from E. coli; HybOBC, hydrogenase-2 (HYD-2) from E. coli;Mo-bisMGD, molybdo-bis(molybdopterin guanine dinucleotide); MTT,membrane targeting and translocation system; NarGHI, nitratereductase; NrfABCD, nitrite reductase from E. coli; Q-pool, quinonepool; PsrABC, polysulfide reductase from W. succinogenes; Q-site,quinone/quinol binding site; SdhCAB, Bacillus subtilis succinatedehydrogenase; SdhCDAB, succinate dehydrogenase from E. coli;TM, transmembrane.

J. Mol. Microbiol. Biotechnol. (2002) 4(2): 133–150. JMMB Research Article

Page 2: Sequence Similarity as a Predictor of the Transmembrane ...

• MALDI-TOF Mass Spectrometry in Microbiology

Edited by: M Kostrzewa, S Schubert (2016) www.caister.com/malditof

• Aspergillus and Penicillium in the Post-genomic Era

Edited by: RP Vries, IB Gelber, MR Andersen (2016) www.caister.com/aspergillus2

• The Bacteriocins: Current Knowledge and Future Prospects

Edited by: RL Dorit, SM Roy, MA Riley (2016) www.caister.com/bacteriocins

• Omics in Plant Disease Resistance

Edited by: V Bhadauria (2016) www.caister.com/opdr

• Acidophiles: Life in Extremely Acidic Environments

Edited by: R Quatrini, DB Johnson (2016) www.caister.com/acidophiles

• Climate Change and Microbial Ecology: Current Research and Future Trends

Edited by: J Marxsen (2016) www.caister.com/climate

• Biofilms in Bioremediation: Current Research and Emerging Technologies

Edited by: G Lear (2016) www.caister.com/biorem

• Microalgae: Current Research and Applications

Edited by: MN Tsaloglou (2016) www.caister.com/microalgae

• Gas Plasma Sterilization in Microbiology: Theory, Applications, Pitfalls and New Perspectives

Edited by: H Shintani, A Sakudo (2016) www.caister.com/gasplasma

• Virus Evolution: Current Research and Future Directions

Edited by: SC Weaver, M Denison, M Roossinck, et al. (2016) www.caister.com/virusevol

• Arboviruses: Molecular Biology, Evolution and Control

Edited by: N Vasilakis, DJ Gubler (2016) www.caister.com/arbo

• Shigella: Molecular and Cellular Biology

Edited by: WD Picking, WL Picking (2016) www.caister.com/shigella

• Aquatic Biofilms: Ecology, Water Quality and Wastewater Treatment

Edited by: AM Romaní, H Guasch, MD Balaguer (2016) www.caister.com/aquaticbiofilms

• Alphaviruses: Current Biology

Edited by: S Mahalingam, L Herrero, B Herring (2016) www.caister.com/alpha

• Thermophilic Microorganisms

Edited by: F Li (2015) www.caister.com/thermophile

• Flow Cytometry in Microbiology: Technology and Applications

Edited by: MG Wilkinson (2015) www.caister.com/flow

• Probiotics and Prebiotics: Current Research and Future Trends

Edited by: K Venema, AP Carmo (2015) www.caister.com/probiotics

• Epigenetics: Current Research and Emerging Trends

Edited by: BP Chadwick (2015) www.caister.com/epigenetics2015

• Corynebacterium glutamicum: From Systems Biology to Biotechnological Applications

Edited by: A Burkovski (2015) www.caister.com/cory2

• Advanced Vaccine Research Methods for the Decade of Vaccines

Edited by: F Bagnoli, R Rappuoli (2015) www.caister.com/vaccines

• Antifungals: From Genomics to Resistance and the Development of Novel Agents

Edited by: AT Coste, P Vandeputte (2015) www.caister.com/antifungals

• Bacteria-Plant Interactions: Advanced Research and Future Trends

Edited by: J Murillo, BA Vinatzer, RW Jackson, et al. (2015) www.caister.com/bacteria-plant

• Aeromonas

Edited by: J Graf (2015) www.caister.com/aeromonas

• Antibiotics: Current Innovations and Future Trends

Edited by: S Sánchez, AL Demain (2015) www.caister.com/antibiotics

• Leishmania: Current Biology and Control

Edited by: S Adak, R Datta (2015) www.caister.com/leish2

• Acanthamoeba: Biology and Pathogenesis (2nd edition)

Author: NA Khan (2015) www.caister.com/acanthamoeba2

• Microarrays: Current Technology, Innovations and Applications

Edited by: Z He (2014) www.caister.com/microarrays2

• Metagenomics of the Microbial Nitrogen Cycle: Theory, Methods and Applications

Edited by: D Marco (2014) www.caister.com/n2

Caister Academic Press is a leading academic publisher of advanced texts in microbiology, molecular biology and medical research. Full details of all our publications at caister.com

Further Reading

Order from caister.com/order

Page 3: Sequence Similarity as a Predictor of the Transmembrane ...

membrane-intrinsic and membrane-extrinsic domains.2

Our hypothesis is that the orientation of topologymodels for the membrane-intrinsic domain can bepredicted from the distribution of sequence similaritywithin the loops connecting the transmembrane (TM)segments. Alternatively, the distribution of sequencesimilarity within the loops can provide information on theorientation of the membrane-extrinsic subunits. In thelargely membrane-intrinsic complexes, subunit-subunitinteractions occur primarily laterally between the sub-units within the membrane, and therefore are unlikely toprovide information on the orientation of topologymodels.

Relatively reliable computational methods havebeen developed for the prediction both of the presenceof a-helical TM segments, and for the orientation ofpredicted topologies within the membrane. These areoften based on hydropathy analyses of individualamino acid residues averaged in various ways over amoving window of residues (Kyte and Doolittle, 1982;von Heijne, 1992). The orientation of possible modelsof transmembrane topology is generally predicted bythe ‘‘positive inside’’ rule of von Heijne (von Heijne,1989; von Heijne, 1992). Alternatively, the orientationof possible models can be tested by analyses ofsequence conservation of selected residues withinmultiple sequence alignments (Persson and Argos,1997). The application of neural network-based meth-ods that include evolutionary analyses of relatedproteins and multiple sequence alignments has addedadditional levels of sophistication and potential accu-racy (Rost et al., 1995). Entire genomes have beenanalyzed for the presence of membrane-integralproteins, and the statistical distribution of orientationsof proteins containing discrete numbers of TM seg-ments has been determined (Jones, 1998). Theplethora of currently available methodologies isclaimed to work at accuracies up to 86% judgedagainst proteins of experimentally-estimated topology(Rost et al., 1996).

Relatively few membrane-integral or membrane-bound protein structures are available at atomicresolution compared to those of soluble proteins. Formembrane proteins, topology models and their orien-tation are typically determined experimentally usingmethods that include: gene fusions within themembrane-extrinsic loops, glycosylation tagging (ineukaryotic cells), chemical modification, and epitopetagging (van Geest and Lolkema, 2000). However,despite the apparent accuracy of both predictive andexperimental topology prediction methods, conflictssometimes arise between methods that address thetopology of a membrane-integral subunit of amembrane-bound enzyme complex, and those thataddress the overall structure and function of the entirecomplex. Examples include the topology of the asubunit of the F0 domain of the E. coli F0F1 ATPase(Deckers-Heberstreit et al., 2000), and the DmsCsubunit of the E. coli S- and N-oxide reductase

(DmsABC) (Rothery and Weiner, 1996; Weiner et al.,1993). Others are listed by Rost et al. (Rost et al.,1996).

The class of bacterial oxidoreductases that con-tain #2 membrane-intrinsic and membrane-extrinsicsubunits offers an additional potential method for theprediction of the overall orientation of transmembranetopology models. The interface between the mem-brane-intrinsic and membrane-extrinsic domains ispresumably structurally and functionally important,and it would be expected that significant sequenceconservation would exist on the side of the membranedomain defining subunit-subunit interactions com-pared to the side that is simply exposed to the aqueousmilieu. Emerging sequence data allows facile classifi-cation of sequences into families corresponding to themembrane-anchor subunit(s) of a range of oxidore-ductases. In this paper, we have mined membrane-anchor sequences from sequence databases belong-ing to 11 families of membrane-bound oxidoreductaseanchor subunits. We have analyzed the sequenceconservation within these 11 families relative to theproposed transmembrane topologies and the proposedor actual locations of the membrane-extrinsic subunits.These analyses present a new method for the predic-tion of the orientation of transmembrane proteintopology models for oxidoreductases with membrane-extrinsic subunits.

Results

Figure 1 shows the range of bacterial oxidoreductaseschosen for study herein. In each case, the enzyme ispresent in a range of species, and for at least onespecies there is experimental data suggesting theindicated orientation of the membrane-extrinsic sub-units. Sequences for the membrane-anchors weremined using as bait the sequences for which topolo-gical data is available. This resulted in the identifica-tion of 11 discrete families of membrane-anchorsubunits (Table 1). As described in the EXPERIMEN-TAL PROCEDURES, two tests, one simple and oneempirical, were developed to evaluate models oftransmembrane topology. In the simple test, a scoreof +1 indicates complete agreement with a proposedmodel of transmembrane topology, and a score of -1indicates complete disagreement. In the empirical test,a positive score indicates agreement with a model oftransmembrane topology in which the membrane-extrinsic subunits are cytoplasmically localized. Anegative score suggests a periplasmic location forthese subunits. The oxidoreductases, their membrane-anchors, the evidence for the location of the mem-brane extrinsic subunits, and the results of theanalyses presented herein (Tables 2 and 3, seeEXPERIMENTAL PROCEDURES) are as follows:

(1) FrdABCD (QH2:fumarate oxidoreductase). Thisenzyme supports anaerobic respiratory growth onfumarate and is similar in overall architecture toSdhCDAB (succinate:Q oxidoreductase, complexII) (Hagerhall, 1997; Ohnishi et al., 2000). The

2For convenience, the membrane-intrinsic domain is defined as themembrane-intrinsic subunit(s) including the loops.

134 Rothery et al.

Page 4: Sequence Similarity as a Predictor of the Transmembrane ...

Figure 1. Overall topologies of membrane-bound oxidoreductases that have membrane-extrinsic catalytic subunits. In each case there are either oneor two membrane-anchor subunits. Evidence for the proposed topologies is either biochemical (e.g. DmsABC) or comes from a combination ofbiochemical and structural (crystallography) data (e.g. FrdABCD and FrdCAB/SdhCAB). In one case, the membrane-anchor (NrfD/PsrC) appears tobe part of complexes with significantly different overall architectures. Membrane-extrinsic subunits depicted above the membrane arecytoplasmically oriented, those below are periplasmically oriented.

Membrane Protein Topology Prediction. 135

Page 5: Sequence Similarity as a Predictor of the Transmembrane ...

structure of the E. coli enzyme has recently beendetermined by X-ray crystallography (Iverson et al.,1999) and extensive experimental data indicates

that the FrdAB catalytic dimer is cytoplasmicallyoriented (Cole et al., 1985; Hagerhall, 1997;Ohnishi et al., 2000). FrdA contains a FAD cofactor

Table 1. Membrane-anchor Subunit Families

Familya Membersb Structurec TMSd Dimer MembersLocatione (name, accession number, organism)

FrdC 4 Crystal 3 C frdc_ecoli, P03805, E. coli; frdc_provu, P20923, Proteus vulgaris; frdc_haein, P44892,Haemophilus influenzae; frdc_myctu, Q10762, Mycobacterium tuberculosis

FrdD 4 Crystal 3 C frdd_ecoli, P03806, E. coli; frdd_provu, P20924, Proteus vulgaris; frdd_haein, P44891,Haemophilus influenzae; frdd_myctu, Q10763, Mycobacterium tuberculosis

SdhC 21 Inf., Expt. 3 C dhsc_ecoli, P10446, E. coli; dhsc_strco, CAB89077, Streptomyces coelicolor; dhsc_mycle,Q49919, Mycobacterium leprae; dhsc_neima, AAF41354, Neisseria meningitidis;dhsc_coxbu, P51055, Coxiella burnetii; frd_rhodo, O82999, Rhodoferax fermentans;c560_marpo, P35721, Marchantia polymorpha; c560_recam, P80481, Reclinomonasamericana; c560_chocr, P48934, Chondrus crispus; c560_cyaca, P48935, Cyanidiumcaldarium; c560_cyame, Q9ZZR3, Cyanidioschyzon merolae; dhsc_parde, Q59659,Paracoccus denitrificans; dhsc_brady, O68000, Bradyrhizobium japonicum; dhsc_ricpr,P41085, Rickettsia prowazekii; dhsc_ricbt, Q9RHE9, Rickettsia (bunchy top disease);c560_bovin, P35720, Bos taurus; c560ii_bovin, Q9T2T6, Bos taurus; c560_human,Q99643, Homo sapiens; c560_cricg, P70097, Cricetulus griseus; c560_drosom, Q9VGS3,Drosophila melanogaster; ym07_yeast, Q04487, Saccharomyces cerevisiae

SdhD 9 Inf., Expt. 3 C dhsd_ecoli, P10445, E. coli; dhsd_ neima, AAF41355, Neisseria meningitidis; frd_rhodo,O83000, Rhodoferax fermentans; dhsd_shepu, O33737, Shewanella putrefaciens;dhsd_coxbu, P51057, Coxiella burnetii; dhsd_ricpr, P41086, Rickettsia prowazekii;dhsd_recam, P80482, Reclinomonas americana; dhsd_chocr, P54323, Chondrus crispus(Carragheen); dhsd_parde, Q59660, Paracoccus denitrificans

FrdC_W 7 Crystal 5 C frdc_wolsu, P17413, Wolinella succinogenes; frdc_helpy, O06912, Helicobacter pylori;frdc_camje, Q9PI97, Campylobacter jejuni; frdc2_wolsu, O34253, W. succinogenes;dhsc_paema, P70932, Paenibacillus macerans; dhsc_bacsu, P08064, Bacillus subtilis;dhsc_strco, Q9RCY6, Streptomyces coelicolor

NarI 10 Inf., Expt. 5 C nari_ecoli, P11350, E. coli; narv_ecoli, P19316, E. coli; nari_pseae, O54046, Pseudomonasaeruginosa; nari_parde, Q56357, Paracoccus denitrificans; nari_myctu, O06562, Myco-bacterium tuberculosis; nari_strco, O86714, Streptomyces coelicolor; nari2_strco,Q9RI29, Streptomyces coelicolor; nari_bacsu, P42177, Bacillus subtilis; nari_staca,Q9ZIF5, Staphylococcus carnosus; nari_theac, O06462, Thermus aquaticus

FdnI 6 Inf., Expt. 4 C fdni_ecoli, P24185, E. coli; fdoi_ecoli, P32174, E. coli; fdni_camje, Q9PMF4, Campylobacterjejuni; fdhc_wolsu, P28180, W. succinogenes; fdxi_haein, P44451, Haemophilus influen-zae; fdoi_aquifex, O67148, Aquifex aeolicus

HyaC 16 Inf. 4 P hyac_ecoli (cybh_ecoli), P19929, E. coli; hyac_helpy, Q9ZLK3, Helicobacter pylori J99;hydc_helpy, O25350, Helicobacter pylori; cybh_wolsu, P31875, W. succinogenes;hydc_camje, Q9PN33, Campylobacter jejuni; cybh_alceu, P31898, Alcaligenes eutrophus;hupc_pseud, P95495, Pseudomonas hydrogenovora; cybh_rhoca, P16145, Rhodobactercapsulatus; hupc_rhosh, O86470, Rhodobacter sphaeroides; cybh_braja, P21960,Bradyrhizobium japonicum; cybh_rhilv, P27648, Rhizobium leguminosarum; cybh_azoch,Q43953, Azotobacter chroococcum; cybh_azovi, P23000, Azotobacter vinelandii;hupc_thiro, Q56361, Thiocapsa roseopersicina; hoxz_pseca, P81606, Pseudomonascarboxydovorans; hoxz_aquae, O66896, Aquifex aeolicus

DmsC 4 Inf., Expt. 8 C dmsc2_ecoli (ynfh_ecoli), P76173, E. coli; dmsc_ecoli, P18777, E. coli; dmsc_yerpe,Q9X6B4, Yersinia pestis; dmsc_haein, P45002, H. influenzae

NrfD 4 Inferred 8 P nrfd_ecoli, P32709, E. coli; nrfd_haein, P45014, H. influenzae; psrc_wolsu, P31077,W.succinogenes; o30248_arcfu, O30284, Archaeoglobus fulgidus

HybB 4 Inferred 10 P hybb_ecoli, P37180, E. coli; hmc3_desvh, P33390, Desulfovibrio vulgaris; baa94029_rhoge,BAA94029, Rhodocyclus gelatinosus; o30063_arcfu, O30063, Archaeoglobus fulgidus

aThe name of the family is taken from the E. coli member. In the case of FrdC_W, which does not have an E. coli homolog, it is taken from the W. succinogeneshomolog.bNumber of family members identified on the basis of homeomorphic domain structure and low expectation value in a BLASTP search.cCrystal, structure is known from protein crystallography. Inferred, predicted using hydropathy analyses and the "positive-inside" rule (von Heijne, 1989; vonHeijne, 1992) with little or no experimental data. Expt., experimental data exists to support proposed topology. Inf., topology inferred from hydropathyanalyses.dTMS, number of TM segments.eExperimentally supported location of membrane-extrinsic dimer.

136 Rothery et al.

Page 6: Sequence Similarity as a Predictor of the Transmembrane ...

at the site of fumarate reduction, and FrdB containsa [2Fe-2S] cluster, a [3Fe-4S] cluster, and a [4Fe-4S] cluster. The enzyme has two membrane-anchors, FrdC and FrdD, each with 3 TM segmentsand a cytoplasmic N-terminus. 4 FrdC and 4 FrdDsequences were collected.

FrdC. Figure 2A shows the predicted topology andthe absolutely conserved residues of FrdC. Abso-lutely conserved residues within the loops arealmost exclusively oriented towards the cytoplas-mic side of the membrane. In the structure ofFrdABCD, the N-terminus appears to interactdirectly with the membrane-extrinsic FrdAB dimer(Iverson et al., 1999). Figure 2B shows a plot of theaverage similarity of the features of FrdC identifiedin Figure 2A throughout the corresponding Clus-talW alignment of the 4 FrdC sequences. Note thatthe BLOSUM62 substitution matrix used in theanalysis weights conservation of each residuedifferently, and scores substitutions (assigningboth positive and negative scores, depending onthe amino acids involved) (Henikoff and Henikoff,1993; Henikoff and Henikoff, 1996). Above loop-average similarity is observed in the cytoplasmicloops, and below loop-average similarity is ob-

served in the periplasmic loop. Thus, analysis ofthe sequence similarity within FrdC is entirelyconsistent with the hypothesis presented herein.The simple topology test results in a score of +1(Table 2), and the empirical test results in a score(aC –aP) of +4.33 (Table 3; see ExperimentalProcedures). Thus, both tests agree with thetopology presented in Figure 2A and the actualtopology revealed by the crystal structure.

FrdD. As is the case with FrdC, almost all of theabsolutely conserved residues within the loops areoriented towards the cytoplasmic side (Figure 2C),and the N-terminus in the crystal structure alsointeracts with the membrane-extrinsic FrdAB di-mer. Figure 2D shows a plot of the averagesimilarity of the features of FrdD identified in Figure2C. In this case, above loop-average similarity isobserved in the first cytoplasmic loop, but belowloop-average similarity is observed in the secondcytoplasmic loop. No extension into the periplasmbeyond the end of the third TM helix is predicted,and below loop-average similarity is observed inthe periplasmic loop. Despite the lower than loop-average similarity observed in the cytoplasmic loopbetween TM helices 2 and 3, the presented

Table 2. Simple test of correlation between loop similarity and proposed topology amongst membrane-anchor subunits

Family Testb Loop s valueaPm¼y

m¼1 Smd A(e) Pass/Fail

Cc P C P C P C P C P C

FrdC Cyto 1 1 1 1 4 1.00 Pass

FrdD Cyto 1 1 21 1 0.33 Pass

SdhC Cyto 1 1 1 1 4 1.00 Pass

SdhD Cyto 1 1 1 1 4 1.00 Pass

FrdC_W Cyto 21 1 1 1 1 1 4 0.67 Pass

NarI Cyto 1 1 1 21 1 1 4 0.67 Pass

FdnI Cyto 21 1 1 1 1 3 0.60 Pass

HyaC Peri 1 1 21 0 21 0 0.00 –

DmsC Cyto 21 21 21 21 21 21 21 21 21 29 21.00 Fail

NrfD Peri 21 1 1 1 1 1 21 1 1 5 0.56 Pass

HybB Peri 1 1 21 21 1 1 1 21 1 21 1 3 0.27 Pass

Analysis based on helices/loops observed in actual structure:FrdC Cyto 1 1 1 1 4 1.00 Pass

FrdD Cyto 21 21 1 21 22 20.50 Fail

FrdC_W Cyto 1 0 1 1 1 1 5 0.83 Pass

aScore for each of the loops as described in the EXPERIMENTAL PROCEDURES section. +1 for agreement with hypothesis, 21 for disagreement, 0 forambivalence by inspection.bTest based on proposed location of membrane2extrinsic dimer in combination with proposed TM topology.cC, putative cytoplasmic loop; P, putative periplasmic loop.dsum of scores.

Membrane Protein Topology Prediction. 137

Page 7: Sequence Similarity as a Predictor of the Transmembrane ...

Table

3.Empiricaltestofcorrelationbetw

eenloopsim

ilarity

andproposedtopologyamongstmembraneanchorsubunits

Family

Overall

Averagea

Loop

averagevb

Loopsim

ilarity2v,(x

C2v)or(x

p2v)

SumsofDifferencesb

Cc=1

Pp=1

Cc=2

Pp=2

Cc=3

Pp=3

Cc=4

Pp=4

Cc=5

Pp=5

Cc=6

aC

aP

aC2aP

Testc

Pass/Fail

FrdC

2.17

2.23

0.49

21.04

0.07

22.73

0.56

23.77

4.33

Cyto

Pass

FrdD

2.15

1.96

0.52

20.97

20.17

0.36

20.97

1.33

Cyto

Pass

SdhC

0.84

0.97

0.97

21.19

0.26

21.04

1.23

22.23

3.46

Cyto

Pass

SdhD

0.92

0.98

0.16

20.54

0.71

20.96

0.87

21.50

2.37

Cyto

Pass

FrdC_W

0.78

0.31

20.04

20.13

0.52

20.49

0.35

20.34

0.83

20.96

1.79

Cyto

Pass

NarI

1.79

1.64

21.36

0.74

20.59

20.06

20.47

0.90

1.58

22.42

3.99

Cyto

Pass

FdnI

1.59

1.57

20.35

20.46

0.23

20.97

0.69

0.57

21.42

1.99

Cyto

Pass

HyaC

1.89

1.47

20.81

0.21

0.38

0.00

0.15

20.28

0.22

20.50

Peri

Pass

DmsC

2.62

2.84

1.75

22.39

1.51

20.51

0.24

23.01

0.15

22.81

1.29

28.73

4.94

213.66

Cyto

Fail

NrfD

1.36

0.71

20.39

20.53

2.25

20.28

2.30

20.26

20.63

21.46

1.48

22.53

5.01

27.54

Peri

Pass

HybB

0.50

0.44

20.44

1.69

0.02

20.13

20.47

0.90

20.73

20.20

21.54

0.03

20.26

23.42

2.30

25.72

Peri

Pass

Analysis

basedonhelic

es/loopsobservedin

actualstructure

FrdC

2.17

2.05

0.49

20.67

0.58

21.95

1.08

22.62

3.69

Cyto

Pass

FrdD

2.15

1.56

20.33

0.07

0.10

0.44

20.23

0.52

20.75

Cyto

Fail

FrdC_W

0.78

0.21

0.12

0.00

0.52

20.35

0.12

20.33

0.76

20.67

1.44

Cyto

Pass

aaveragesimilaritythrough

outthealigned

sequences.

bv,

averagesimilaritythrough

outalltheloops.x C

,averagesimilarityin

cytoplasm

icloopc.

x c,x p,av

eragesimilarityin

cytoplasm

icloopcorperiplasm

icloopp.aC,aP,as

defined

inEqs.(2)an

d(3).

c Testedtopology

based

onlocationofmem

brane2

extrinsicsubunitsin

relationto

theproposedtopology.If

aC2aPispositive,correlationwithacytoplasm

iclocationfortheextrinisicsubunitsisindicated,ifitisnegative,

correlationiswithaperiplasm

iclocation.

138 Rothery et al.

Page 8: Sequence Similarity as a Predictor of the Transmembrane ...

topology of FrdD yields a score of +0.33 in thesimple test, and a score of +1.33 in the empiricaltest. Thus, both tests agree with the topologypresented in Figure 4C and the actual topologyrevealed by the crystal structure.

(2) SdhCDAB (succ i na t e :Q ox i do r educ t a se ,complex II). This enzyme is an important compo-nent of the TCA cycle and the E. coli enzyme is agood model system for mitochondrial complex II(Hagerhall, 1997). A single heme b (heme b556 in E.coli) is coordinated between the two membrane-anchor subunits (Nakamura et al., 1996; Vibatet al., 1998), SdhC and SdhD, each having 3 TMsegments with cytoplasmic N-termini (Hagerhalland Herderstedt, 1996; Hagerhall, 1997). The E.coli sequences were used as bait to collect 21sequences of members of the SdhC family and 9sequences of members of the SdhD family. Incontrast to FrdC and FrdD, sequences were

identified from both eukaryotic mitochondria andprokaryotes.

SdhC. The proposed topology of SdhC (Figure 3A)is similar to that of FrdC. Because many of theSdhC sequences are eukaryotic (Table 1) andtherefore have mitochondrial targeting signals attheir N-termini, the plot of sequence similarityversus residue position starts at a nominal 115.As is the case for FrdC, the absolutely conservedresidues appear to be oriented towards thecytoplasmic side of the subunit. However,because of the phylogenic diversity of thesequences, there are fewer absolutely conservedresidues in SdhC than in either FrdC or FrdD.The simple topology test results in a score of +1,and the empirical test results in a score of +3.46,in agreement with the topology presented inFigure 3A.

Figure 2. Analysis of sequence similarity as a function of membrane sidedness for the FrdC and FrdD subunits of FrdABCD. A. Transmembranetopology of FrdC and the distribution of absolutely conserved residues. The horizontal line represents the region defining the interface between FrdCand the membrane-extrinsic FrdB subunit. B. Plot of average similarity within the features of FrdC. The average similarity within each segment ofsequence representing the features is plotted versus residue position. The horizontal line represents the average sequence similarity within thepredicted membrane-extrinsic loops (2.23). The overall similarity through the entire alignment is 2.17. C. Transmembrane topology and thedistribution of absolutely conserved residues in FrdD. D. Plot of average similarity within the features of FrdD. The average sequence similarity withinthe predicted membrane-extrinsic loops is 1.96 (horizontal line). The overall similarity through the entire alignment is 2.15. Abbreviations:C, cytoplasmic loop; P, periplasmic loop; 1, 2, 3, numbered TM helices.

Membrane Protein Topology Prediction. 139

Page 9: Sequence Similarity as a Predictor of the Transmembrane ...

SdhD. The proposed topology of SdhD (Figure 3C)is also similar to that of FrdC. Figure 3D shows aplot of the average similarity of the TM helices andloops. The simple topology test results in a score of+1, and the empirical test results in a score of+2.37, in agreement with topology presented inFigure 3C.

(3) FrdCAB/SdhCAB (QH2:fumarate/succinate:Qoxidoreductase). Extensive experimental data ex-ist supporting the overall topology of the fumaratereductase (FrdCAB) of Wolinella succinogenes(Kortner et al., 1990; Lancaster et al., 1999;Lancaster et al., 2000) and the succinate dehy-drogenase (SdhCAB) of Bacillus subtilis (Hagerhallet al., 1992; Hagerhall, 1997; Hederstedt et al.,1985) presented in Figure 1. The structure ofFrdCAB from W. succinogenes has been solvedby X-ray crystallography (Lancaster et al., 1999).Although the membrane-extrinsic subunits (FrdAB/

SdhAB) appear to have an almost identical struc-ture to those of E. coli FrdABCD (Iverson et al.,1999), the FrdC_W-type membrane-anchors arediheme cytochromes b with 5 TM segments andcytoplasmic N-termini (Figure 4A). A total of 7members of the FrdC_W family were identifiedusing the W. succinogenes and B. subtilis se-quences as bait. The two hemes are coordinatedbetween four TM segments (Hagerhall et al., 1992;Hagerhall et al., 1995; Matsson et al., 2000).Presumably to facilitate heme packing and coordi-nation, TM segments 1, 2, 3, and 4 have relativelyhigh sequence similarity associated with them thatis higher than that observed in the surroundingloops (Figure 4B). This is why our TM topologytests compare the sequence similarity in the loopswith the average similarity for all the loops in theprotein of interest (rather than comparing with theaverage similarity for the entire protein; see

Figure 3. Analysis of sequence similarity as a function of membrane sidedness for the SdhC and SdhD subunits of SdhCDAB. A. Transmembranetopology of SdhC and the distribution of absolutely conserved residues. One of these conserved residues, H84 of E. coli SdhC, provides a ligand forthe heme b that appears to be present in all SdhCDAB-type enzymes. The horizontal line represents the region defining the interface between SdhCand the membrane-extrinsic SdhB subunit. B. Plot of average similarity within the features of SdhC. The horizontal line represents the averagesequence similarity within the predicted membrane-extrinsic loops (0.97). The overall similarity through the entire alignment is 0.84.C. Transmembrane topology and the distribution of absolutely conserved residues in SdhD. H71 of SdhD provides the second His ligand to theheme b coordinated between SdhC and SdhD. D. Plot of average similarity within the features of SdhD. The average sequence similarity within thepredicted membrane-extrinsic loops is 0.98 (horizontal line). The overall similarity through the entire alignment is 0.92.

140 Rothery et al.

Page 10: Sequence Similarity as a Predictor of the Transmembrane ...

EXPERIMENTAL PROCEDURES). In the case ofthe FrdC_W family, there are no absolutely con-served residues in the loops, but significant se-quence similarity is observed. The simple topologytest results in a score of +0.67, and the empiricaltest results in a score of +1.79, in agreement withthe topology presented in Figure 4A.

(4) NarGHI (QH2:NO"3 oxidoreductase). The respira-

tory nitrate reductase A (NarGHI) from E. coli isknown to have its membrane-extrinsic subunits(NarGH) anchored to the inside of the cytoplasmicmembrane by a diheme cytochrome b (NarI) (Berkset al., 1995b; Hackett and Bragg, 1982; Jones andGarland, 1977; Jones et al., 1980; Magalon et al.,1997; Rothery et al., 1999). NarG contains amolybdo-bis(molybdopterin guanine dinucleotide)(Mo-bisMGD) cofactor, and NarH contains three[4Fe-4S] clusters and one [3Fe-4S] cluster (Gui-gliarelli et al., 1992; Rothery et al., 1998). The E.

coli sequence was used as bait to identify 10 NarIsequences. Like FrdC_W, NarI has 5 TM segments(Figure 4C), but has a periplasmic N-terminus, andthe two hemes are coordinated between two TMsegments (Magalon et al., 1997). As is the casewith FrdC_W, heme coordination and packingprobably results in the observation of relativelyhigh sequence similarity within TM helices 2 and 5.The simple topology test results in a score of+0.67, and the empirical test results in a score of+3.99, in agreement with the topology presented inFigure 4C.

(5) FdnGHI (HCOO":Q oxidoreductase). Based onsequence analyses, FdnG contains a Mo-bisMGDcofactor, and FdnH contains four [4Fe-4S] clusters,and these two subunits comprise a catalytic dimer(FdnGH) anchored to the cytoplasmic membraneby FdnI. The localization of FdnGH has beeninferred to be periplasmic (Berks et al., 1995a),

Figure 4. Analysis of sequence similarity as a function of membrane sidedness for the FrdC_W subunit of FrdCAB and the NarI subunit of NarGHI.A Transmembrane topology of FrdC_W and the distribution of absolutely conserved residues. FrdC_W is a diheme cytochrome b with the hemescoordinated between four TM segments as shown (Lancaster et al., 1999). The horizontal line represents the region defining the interface betweenFrdC_W and the membrane-extrinsic FrdB subunit. B.Plot of average similarity within the features of FrdC_W. The horizontal line represents theaverage sequence similarity within the predicted membrane-extrinsic loops (0.31). The overall similarity through the entire alignment is 0.78.C. Transmembrane topology of NarI and the distribution of absolutely conserved residues. The horizontal line represents the region defining theinterface between NarI and the membrane-extrinsic NarGH subunits. D. Plot of average similarity within the features of NarI. The average similaritythe predicted membrane-extrinsic loops is 1.64 (horizontal line). The overall similarity through the entire alignment is 1.79. Abbreviations: bP - dimer-proximal heme; bD - dimer-distal heme.

Membrane Protein Topology Prediction. 141

Page 11: Sequence Similarity as a Predictor of the Transmembrane ...

based in part on the presence at the N-terminus ofFdnG of a ‘‘twin-arginine’’ signal sequence target-ing the fully-folded FdnGH to the MTT (MembraneTargeting and Translocation) system (Berks, 1996;Sargent et al., 1998b; Weiner et al., 1998).However, a systematic study of 47 randomly-generated b-lactamase gene fusions to the minorE. coli formate dehydrogenase (FdoGHI) stronglysuggests a cytoplasmic location for FdoGH (Benoitet al., 1998). Given the very strong sequencesimilarity between FdnGHI and FdoGHI (FdnG,82% similar, 76% identical; FdnH, 81% similar,77% identical; FdnI, 58% similar, 48% identical),we conclude that both enzymes have the sameoverall architecture. The E. coli FdnI sequence wasused as bait to identify 6 sequences of the family.FdnI is a diheme cytochrome b with 4 TM segmentsand a cytoplasmic N-terminus (Figure 5A) (Benoitet al., 1998; Berks et al., 1995b). The hemes

appear to be coordinated between 3 TM helices ina manner similar to that demonstrated for themembrane-anchors of the HyaABC-type hydroge-nases (see below) (Groß et al., 1998; Meek andArp, 2000). As is the case for FrdC and FrdD, theorientation of sequence similarity within the loopsis striking. The simple topology test results in ascore of +0.60, and the empirical test results in ascore of +1.99, in agreement with the topologypresented in Figure 5A, and supporting a cytoplas-mic location for FdnGH.

(6) HyaABC (H2:Q oxidoreductase). This is the HYD-1hydrogenase isoenzyme (Bock and Sawers, 1996).It comprises two membrane-extrinsic subunits,HyaAB, anchored the periplasmic side of themembrane by HyaC. HyaA and HyaB are the smalland large hydrogenase subunits, respectively. Byanalogy to the structurally characterized Ni-Fehydrogenase from Desulfovibrio gigas (Volbeda

Figure 5. Analysis of sequence similarity as a function of membrane sidedness for the FdnI subunit of FdnGHI and the HyaC subunit is HyaABC(HYD-1). A. Transmembrane topology of FdnI and the distribution of absolutely conserved residues. FdnI appears to be a diheme cytochrome b withthe hemes coordinated between three TM segments as shown (Berks et al., 1995b). The horizontal line represents the region defining the interfacebetween FdnI and the membrane-extrinsic FdnGH subunits.B. Plot of average similarity within the features of FdnI. The horizontal line representsthe average sequence similarity within the predicted membrane-extrinsic loops (1.57). The overall similarity through the entire alignment is 1.59.C. Transmembrane topology and the distribution of absolutely conserved residues in HyaC. The horizontal line represents the region defining theinterface between HyaC and the membrane-extrinsic HyaAB subunits. D. Plot of average similarity within the features of HyaC. The averagesequence similarity within the predicted membrane-extrinsic loops is 1.47 (horizontal line). The overall similarity through the entire alignment is 1.89.Abbreviations: bP - dimer-proximal heme; bD - dimer-distal heme.

142 Rothery et al.

Page 12: Sequence Similarity as a Predictor of the Transmembrane ...

et al., 1995), HyaB contains the Ni-Fe active site,and HyaA contains one [3Fe-4S] cluster and two[4Fe-4S] clusters. Comparison with the HybOBC(HYD-2) isoenzyme (see below) and the presenceof a ‘‘twin-arginine’’ signal sequence at the N-terminus of HyaA strongly suggest a periplasmiclocation for the HyaAB dimer. Although E. coliHyaABC has been purified (Sawyers and Boxer,1986), no direct biochemical evidence exists toconfirm a periplasmic orientation for HyaAB. How-ever, the catalytic dimer of the W. succinogeneshomolog is demonstrably periplasmic (Groß et al.,1999). The E. coli HyaC sequence was used toidentify 16 members of the family that have anapparently similar TM topology to that predicted forFdnI. The HyaC homologs from W. succinogenesand Azotobacter vinelandii are diheme cyto-chromes b with 3 TM helices providing hemecoordination (Groß et al., 1998; Meek and Arp,2000) (Figure 5C). The simple topology test resultsin a score of 0.00, and the empirical test results in ascore of 20.50. A value of zero in the simple testindicates ambivalence towards the proposed mod-el, but a negative value in the empirical testindicates agreement (viz. a periplasmic locationfor HyaAB). Thus, the data suggest that twoproteins (FdnI and HyaC) with an overall similarityin TM topology and heme coordination can dockextrinsic subunits on opposite sides of the mem-brane (cf. Figure 5A and 5C).

(7) DmsABC (QH2:SNO oxidoreductase). This enzymeis able to reduce a wide range of S- and N- oxides(hence it may be referred to as an S- and N- oxidereductase) (Simala-Grant and Weiner, 1996; Si-mala-Grant and Weiner, 1998; Weiner et al., 1988;Weiner et al., 1992), and is comprised of a catalyticdimer (DmsAB) anchored to the inner surface ofthe cytoplasmic membrane by DmsC (Rothery andWeiner, 1993; Sambasivarao et al., 1990; Samba-sivarao et al., 2001). DmsA contains a Mo-bisMGDcofactor (Rothery et al., 1995), DmsB contains four[4Fe-4S] clusters (Cammack and Weiner, 1990;Rothery and Weiner, 1996), and DmsC is ahydrophobic membrane-anchor with 8 TM seg-ments. An experimentally determined topologyusing blaM and phoA gene fusions places boththe N- and C- termini on the periplasmic side of themembrane (Weiner et al., 1993). Controversysurrounds the compartmentalization of the DmsABsubunits, as DmsA has a ‘‘twin-arginine’’ signalsequence that results in assembly of the DmsABCheterotrimer to the membrane being dependent onthe MTT protein translocation/targeting machinery(Sambasivarao et al., 2000; Weiner et al., 1998).Despite convincing experimental evidence to thecontrary, it has been suggested that the DmsABdimer is periplasmically oriented (Berks, 1996;Sargent et al., 1998b).

Inspection of the absolutely conserved resi-dues in Figure 6A reveals a striking orientation ofthe conserved residues towards the periplasmicside of the proposed topology. The plot of average

sequence similarity within the features of theproposed topology model (Figure 6B) reflects theoverall orientation of absolutely conserved resi-dues within the DmsC family towards the periplas-mic side of the model. The simple topology testresults in a score of 21.00, and the empirical testresults in a score of 213.66, in disagreement withthe predicted and experimentally-determined to-pology of DmsC (see DISCUSSION).

(8) PsrABC (QH2:S2#- oxidoreductase). This enzyme

allows respiratory growth by W. succinogenes withpolysulfide as terminal electron acceptor (Krafftet al., 1995). The catalytic dimer, PsrAB, isanchored to the periplasmic side of the cytoplasmicmembrane by a hydrophobic membrane-anchor,PsrC. Sequence analysis suggests that PsrAcontains a Mo-bisMGD cofactor, and that PsrBcontains four [4Fe-4S] clusters (Krafft et al., 1992).The PsrAB dimer is targeted to the periplasm by a‘‘twin-arginine’’ signal sequence at the N-terminusof PsrA. PsrC shares significant sequence similar-ity with E. coli NrfD, a component of one of theE. coli nitrite reducing systems (NrfABCD) (Hus-sain et al., 1994). The overall architecture of thissystem differs significantly from that of PsrABC(Figure 1). NrfD appears to be the membrane-anchor to a periplasmically oriented electron-transfer subunit, NrfC that is similar to DmsB (i.e.its sequence infers the presence of four [4Fe-4S]clusters) and has a ‘‘twin-arginine’’ signal se-quence. NrfB is a periplasmic pentaheme cyto-chrome c that may form a dimer with NrfC. Thepredicted site of nitrite reduction is NrfA, aputatively soluble periplasmic pentaheme cyto-chrome c (Berks et al., 1995b). PsrC/NrfD appearto have 8 transmembrane helices and periplasmicN- and C- termini (Figure 6C). 4 proteins wereidentified as being members of the PsrC/NrfDfamily using the E. coli NrfD sequence as bait.The simple topology test results in a score of 0.56,and the empirical test results in a score of 27.54, inagreement with the topology shown in Figure 6C.

(9) HybOBC (H2:Q oxidoreductase). This is the HYD-2hydrogenase of E. coli. The large (Ni-Fe, HybC)and small ([Fe-S], HybO) subunits are anchored tothe periplasmic side of the membrane by ahydrophobic HybB membrane-anchor that is quitedistinct from the HyaC anchor described above forthe HyaABC (HYD-1) enzyme in that it has 10transmembrane helices rather than 4. HybO andHybC appear to be similar in cofactor compositionand sequence to HyaA and HyaB of HyaABC(Sargent et al., 1998a). An additional subunit,HybA, has a ‘‘twin-arginine’’ signal sequence,contains 4 [Fe-S] clusters, appears to be targetedto the periplasm via the MTT pathway, but has anundefined functional role (Frank Sargent, personalcommunication). Localization of the HybOC cata-lytic dimer to the periplasmic side of the cytoplas-mic membrane and its translocation via the MTTmachinery has been clearly demonstrated (Rodri-gue et al., 1999a; Rodrigue et al., 1999b; Wu et al.,

Membrane Protein Topology Prediction. 143

Page 13: Sequence Similarity as a Predictor of the Transmembrane ...

2000). In contrast to the case of HyaABC, only fourputative HybB-homologs could be identified usingthe E. coli HybB sequence as bait. Hydropathyanalyses suggest that members of this family ofproteins have 10 TM segments with cytoplasmic N-and C- termini (Figure 7A). It is not clear why thisprotein has so many TM segments compared to themembrane-anchor of HyaABC (10 versus 4 TMsegments). The simple topology test results in ascore of 0.27, and the empirical test results in ascore of 25.72, in agreement with the topologypresented in Figure 7A.

Analysis of the Sequence Similarity Method toTopologies Based on Crystal StructuresFigure 8 shows that there is a strong overallagreement between the predicted topologies andthe structure-based topologies for FrdC, FrdD, andFrdC_W. In order to test the accuracy of themethods presented herein, the analyses presentedabove were based on predicted topologies of theproteins for which crystal structures are available. If

the analyses are repeated using the helical seg-ments identified in the published structures ofFrdABCD and FrdCAB (Iverson et al., 1999; Lan-caster et al., 1999), similar results are obtained,except in the case of FrdD (Tables 2 and 3). In thiscase, the proposed topology fails both the simpleand empirical tests. However, it should be noted thatthe helical segments observed in the crystal struc-tures have TM segments that either overlap com-pletely with them, or are included within them. Thusanalyses of these segments using the methodspresented herein may be biased by sequencesimilarity within the non-TM sections of the identifiedhelices.

Discussion

We have demonstrated that sequence similarity withinthe membrane anchors of the respiratory chainenzymes described herein can play a significant rolein predicting the orientation of their TM topologies. In10 out of 11 families of anchor tested, strong correla-

Figure 6. Analysis of sequence similarity as a function of membrane sidedness for the DmsC subunit of DmsABC and the NrfD subunit of NarfABCD.A. Transmembrane topology of DmsC and the distribution of absolutely conserved residues. The horizontal line represents the region defining theinterface between DmsC and the membrane-extrinsic DmsAB subunits. B. Plot of average similarity within the features of DmsC. The horizontal linerepresents the average sequence similarity within the predicted membrane-extrinsic loops (2.84). The overall similarity through the entire alignmentis 2.62. C. Transmembrane topology of NrfD and the distribution of absolutely conserved residues. The horizontal line represents the region definingthe interface between NrfD and the membrane-extrinsic NrfBC subunits. D. Plot of average similarity within the features of NrfD. The averagesequence similarity within the predicted membrane-extrinsic loops is 0.71 (horizontal line). The overall similarity through the entire alignment is 1.36.

144 Rothery et al.

Page 14: Sequence Similarity as a Predictor of the Transmembrane ...

tion exists between the experimentally-determinedsidedness of the membrane-extrinsic subunits andthe predicted or experimentally-determined orientationof the anchor topology (Tables 2 and 3). Thus, it isclear that consideration of the orientation of sequencesimilarity within the loops of a TM model is pertinent inthose cases where membrane-extrinsic subunits arepresent. Categorization of emerging sequences formembrane-anchor proteins into families that includetopologically characterized members is becomingincreasingly facile, rendering the sequence similaritymethod a useful complement to existing TM topology-orientation prediction methods.

The presence of sequence similarity within theloops defining subunit-subunit interactions betweenthe membrane-intrinsic and -extrinsic domains prob-ably does not play a role in defining the orientation ofthe topology per se, but is likely to be solely a result ofthe interactions necessary to maintain overall multi-meric structure. This is because prediction of the

topologies of the 10 families that produce consistentsequence similarity analysis data is based on hydro-pathy/charge distribution analyses that ignore thepossible role of loop-localized residues in definingsubunit-subunit interactions. For the interactions toplay a role in defining the orientation of the TMtopology of the membrane anchor, the subunit-subunitinteractions would have to be in existence during theprocess of membrane insertion.

The one example of a membrane anchor thatappears to defeat the analysis presented herein is theDmsC subunit of E. coliDmsABC. Its proposed topologyis based on 17 blaM and 3 phoA gene fusions within thedmsC gene of the dmsABC operon (Weiner et al.,1993). Based on the presence of a ‘‘twin-arginine’’leader sequence in DmsA, the membrane-extrinsicDmsAB dimer is ‘‘expected’’ to be targeted to theperiplasm via the MTT apparatus (Berks, 1996; Sargentet al., 1998b). Interestingly, the analysis presentedherein provides credence to this hypothesis, as the

Figure 7. Analysis of sequence similarity as a function of membrane sidedness for the HybB subunit of HybOBC. B. Transmembrane topology andthe distribution of absolutely conserved residues. The horizontal line represents the region defining the interface between HybB and the membrane-extrinsic HybOC subunits. B. Plot of average similarity within the features of HybB. The horizontal line represents the average sequence similaritywithin the predicted membrane-extrinsic loops (0.44). The overall similarity through the entire alignment is 0.50.

Membrane Protein Topology Prediction. 145

Page 15: Sequence Similarity as a Predictor of the Transmembrane ...

loop-localized sequence similiarity is periplasmically-oriented in the putative topology. However, there issignificant biophysical/biochemical data suggesting acytoplasmic location for DmsAB within the matureholoenzyme (Rothery andWeiner, 1993; Sambasivaraoet al., 1990). In addition, it has recently been demon-strated that DmsAB expressed in the absence of DmsCremains in the cytoplasm (Sambasivarao et al., 2001). Ithas been demonstrated that the dissociable Q-site ofDmsC is conformationally-linked to one of the [Fe-S]clusters of DmsB and that H65 and E87 play animportant role in defining this site (Rothery and Weiner,1996, Rothery and Weiner, unpublished results).H65 and E87 of DmsC are placed at either end of aperiplasmic loop in the proposed topology (Figure 6A).These observations are problematic because theDmsAB dimer has been demonstrated to be bound tothe cytoplasmic side of the membrane (Sambasivaraoet al., 1990; Sambasivarao et al., 2001). One importantfeature of DmsC is its lethality when expressed inthe absence of DmsAB (Turner et al., 1997). Thus, apossible explanation for the conflicting topological datais that the experimentally-determined topology of DmsCis correct only in the context of this subunit assemblingto the membrane in the absence of the dimer. Normal

assembly of the holoenzyme may require interactionbetween DmsC and the membrane extrinsic dimer priorto membrane insertion to overcome the lethalityphenotype, i.e. the interaction between DmsC andDmsAB may be a topogenic signal. Other membraneanchors, as has been demonstrated for the NarI subunitof NarGHI (Magalon et al., 1997), may not have lethalityphenotypes associated with them. Alternatively, DmsCmay hold DmsAB in a ‘‘baseball glove’’ of 8 TMsegments. This would require insertion of a portion ofDmsAB below the inner surface of the cytoplasmicmembrane, and could also require prior association ofDmsAB with DmsC to overcome the DmsC-mediatedlethality phenotype. Obviously, the TM topology ofDmsC requires closer experimental examination,perhaps using gene/tag fusion methods that donot disrupt holoenzyme structure (e.g. sandwichfusions). Such studies are currently ongoing in ourlaboratory.

Maturation of the following oxidoreductases isdependent on the MTT system: FdnGHI, HyaABC,DmsABC, PsrABC, NrfABCD, and HybOBC. Despite apaucity of experimental data for some enzymes, it isclear that not all ‘‘twin-arginine’’ leader containingproteins are targeted to the periplasm. Known excep-

Figure 8. Comparison of predicted and actual transmembrane topologies for the membrane-anchors of bacterial fumarate:menaqiunoneoxidoreductases. Predicted topologies were obtained using the program TMpred (Hofmann and Stoffel, 1993) (www.ch.embnet.org). The TMpredprogram makes a prediction of membrane-spanning regions and their orientation. The algorithm is based on the statistical analysis of TMbase, adatabase of naturally occurring transmembrane proteins. The prediction is made using a combination of several weight-matrices for scoring.Protein sequences were searched for TM segments between 17 and 33 residues in length. Actual topologies are based on the reported lengths ofhelices that include a transmembrane segment in the published structures of E. coli FrdABCD (Iverson et al., 1999) and W. succinogenes FrdCAB(Lancaster et al., 1999).

146 Rothery et al.

Page 16: Sequence Similarity as a Predictor of the Transmembrane ...

tions appear to be DmsABC (Rothery and Weiner,1993; Sambasivarao et al., 1990; Sambasivarao et al.,2001) and FdnGHI (by analogy to topological studieson the minor FdoGHI from E. coli) (Benoit et al., 1998).This is puzzling because the MTT system is involved intargeting fully-folded, cofactor-containing, often het-erodimeric proteins to the periplasm. It is thus unclearwhy mature DmsAB and FdnGH appear to remain onthe cytoplasmic side of the membrane in theirrespective mature holoenzymes.

It is clear from the method provided herein thatsequence similarity scores within a ClustalW alignmentcan provide important information that can argue for oragainst a particular TM model for a respiratory chainmembrane anchor protein. The BLOSUM62 substitutionmatrix (Henikoff and Henikoff, 1993; Henikoff andHenikoff, 1996) allows quantification of similarity withinthe features of the proteins studied in a way that permitsoverall scoring of similarity for a particular TMmodel. Asdemonstrated herein, this is a significant advantageover simply viewing the identity within an alignment, asthe matrix scores (and penalizes) each substitutionwithin the alignment. We chose a group of bacterialmembrane anchors to respiratory chain enzymes as the‘‘bait’’ proteins for our analyses, because many of theseare studied in our laboratories, but it is likely that thisapproach would work for a wide range of membraneproteins that have extrinsic subunits, and even thosethat have membrane-intrinsic and extrinsic domainswithin a single polypeptide.

Experimental Procedures

Sequence collection. Bait sequences from the indivi-dual sequence families (see RESULTS) were used tomine sequences from the SWISS-PROT/TrEMBLdatabases at the ExPASy Molecular Biology Server(expasy.hcuge.ch) using the BLASTP database searchengine (Altschul et al., 1997) available thereon.Occasionally, it was necessary to obtain sequencesfrom the folowing additional databases: PIR, PRF,PDB, and translations from annotated coding regionsin GenBank and RefSeq. These are available at theNCBI server (Entrez Protein at www.ncbi.nlm.nih.gov).Sequences were included within families based on thefollowing criteria: i) identified sequences being ofapparently homeomorphic domain structure similar tothat of the bait sequence; and, ii) a low expectationvalue being returned for the sequence in the BLASTPoutput. In the case of the FrdC_W family of proteins,the SdhC subunits do not appear in the BLASTPoutput, but based on structural and biochemicalanalyses are clearly members of the FrdC_W family(Hagerhall and Herderstedt, 1996; Hagerhall, 1997;Lancaster et al., 1999).

Sequence alignments and topology model generation.Multiple sequence alignments were generated usingthe ClustalW alignment algorithm (Thompson et al.,1994). Topology models for each sequence in eachalignment were determined using the TMpred algo-rithm (www.ch.embnet.org) (Hofmann and Stoffel,

1993) or the TMHMM algorithm (www.cbs.dtu.dk/services/TMHMM-1.0/) (Sonnhammer et al., 1998).Typically, the predicted topology of the original baitsequence was used for further analyses as describedin the RESULTS.

Quantitation of similarity within the alignments. Thesequence similarity at each position in the alignmentswas determined using the -outfile option of thePLOTSIMILARITY program of the Wisconsin Se-quence Analysis Package (GCG Version 9.1-UNIX)using the BLOSUM62 substitution matrix (Henikoff andHenikoff, 1993; Henikoff and Henikoff, 1996) and awindow average of 1 (i.e. no averaging). Averagingwithin each feature (each TM helix and loop) wascarried out using an electronic spreadsheet. Wherelengths of N- and C-termini varied within an alignment,averaging was performed only on regions of sequenceexisting in all the aligned sequences.

Sequence similarity as a predictor of the orientation ofTM topology models. For each of the 11 families ofmembrane-anchors, sequences were mined, a Clus-talW alignment was generated, PLOTSIMILARITYdata calculated, and TM topologies of each memberpredicted. In every case, there was general agreementbetween the predicted TM topology of each member.Subsequent analysis was based on the predicted TMtopology of the bait protein, which in the majority ofcases was the family member from E. coli.

Averaging of sequence similarity values withinputative transmembrane segments and membrane-extrinsic loops in the ClustalW alignments was carriedout as described above. In each family, it is clear thatsignificant sequence similarity exists within a number ofTM segments. This similarity is likely to result from:i) definition of sites of substrate binding (i.e. Q-sites);ii) definition of heme binding sites (His ligands,residues involved in heme packing); iii) definition ofsubunit-subunit interactions between membrane-intrin-sic subunits (i.e. between SdhC and SdhD); or iv) thetendency for TM segments to contain residues withhydrophobic side chains (i.e. Ile, Val, Leu). Weeliminated this similarity from our analysis by calculat-ing the average sequence similarity for each proteinwithin the putative membrane-extrinsic loops. Twoscoring tests were devised to test proposed TMtopologies. One is based on higher or lower thanloop-average similarity (see definition, below) beingobserved where expected in cytoplasmic or periplasmicloops. The other is based on quantitation of the overalldifference between cytoplasmically and periplasmicallyoriented similarity. These tests are explained below:

(1) Simple. For each protein family, the averagesimilarity for each loop, including the N- andC-termini, was calculated. This resulted in averagesimilarity values, xn , where n is the number of theloop. Each loop was assigned a score (s) based onits average similarity score relative to the loop-average similarity (v) throughout all of the loops.The score was based on the sidedness of the

Membrane Protein Topology Prediction. 147

Page 17: Sequence Similarity as a Predictor of the Transmembrane ...

membrane-extrinsic subunits with respect to thecytoplasmic membrane. For example, based onthe hypothesis presented herein relatively highsimilarity would be expected to be observed on thecytoplasmic side of a membrane-intrinsic subunit ofan enzyme with cytoplasmically-oriented membrane-extrinsic subunits:

for a cytoplasmic loop,

if xn > v ; s ¼ 1; if xn , v ; s ¼ "1; if xn , v ; s ¼ 0;

for a periplasmic loop,

if xn , v ; s ¼ 1; if xn > v ; s ¼ "1; if xn , v ; s ¼ 0:

For loops m ¼ 1 to m ¼ y (the total number of loops)agreement, A, with the implied topology was calculatedas follows:

A ¼Pm¼y

m¼1 Sm

yð1Þ

Thus, A ¼ þ1 represented complete agreement withan implied model and A ¼ 21 represented completedisagreement.(2) Empirical. For each protein family, the averagesimilarity, xn , for each loop and the similarity through-out all the loops, v, was calculated as described above.Then sums of differences, aC and aP, were calculatedbetween the putative cytoplasmic/periplasmic loopsand the overall loop-average, v.

For cytoplasmic loops c ¼ 1 to c ¼ yC ,

aP ¼Xc¼yc

c¼1ðxc " vÞ ð2Þ

where yC is the total number of cytoplasmic loops.

For periplasmic loops p ¼ 1 to p ¼ yP ,

aP ¼Xp¼yP

p¼1ðxP " vÞ ð3Þ

where yP is the total number of periplasmic loops.

If aC " aP > 0, agreement was with a topology model inwhich the membrane-extrinsic subunits are cytoplas-mically localized. If aC " aP , 0; agreement was with atopology model in which the extrinsic subunits areperiplasmically localized.

Both methods assigned equal weight to eachmembrane-extrinsic loop, as well as the N- and C-termini, regardless of the length of these features. Asdescribed in the RESULTS, this appears to result in areasonable interpretation of the similarity data in termsof proposed topological models.

Acknowledgements

The authors would like to thank Dr. Axel Magalon of the CNRS,Marseille, France for critically reading the manuscript, and Dr. FrankSargent of the Centre for Metalloprotein Spectroscopy and Biology,School of Biological Sciences, University of East Anglia, Norwich,England for information on E coli HYD-2 prior to its publication.

References

Altschul, S.F., Madden, T.L., Schðffer, A.A., Zhang, J., Zhang, Z.,Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST:a new generation of protein database search programs. NucleicAcids Res. 25: 3389–3402.

Benoit, S., Abaibou, H., and Mandrand-Berthelot, M.-A. 1998.Topological analysis of the aerobic membrane-bound formatedehydrogenase of Escherichia coli. J. Bacteriol. 180: 6625–2234.

Berks, B.C., Ferguson, S.J., Moir, J.W.B., and Richardson, D.J.1995a. Enzymes and associated electron transport systems thatcatalyze the respiratory reduction of nitrogen oxides and oxyanions.Biochim. Biophys. Acta 1232: 97–123.

Berks, B.C., Page, M.D., Richardson, D.J., Reilly, A., Cavill, A.,Outen, F., and Ferguson, S.J. 1995b. Sequence analysis of subunitsof the membrane-bound nitrate reductase from a denitrifyingbacterium: the integral membrane subunit provides a prototype forthe diheme electron-carrying arm of a redox loop. Mol. Micro. 15:319–331.

Berks, B.C. 1996. A common export pathway for proteins bindingcomplex redox cofactors? Molec. Microbiol. 22: 393–414.

Bock, A., and Sawers, G. 1996. Fermentation. In Escherichia coli andSalmonella: cellular and molecular biology. Vol. 1. Neidhardt, F.C.ed. Washington, D.C.: ASM Press, pp. 262–282.

Cammack, R., and Weiner, J.H. 1990. Electron paramagneticresonance spectroscopic characterization of dimethyl sulfoxidereductase of Escherichia coli. Biochemistry 29: 8410–8416.

Cole, S.T., Condon, C., Lemire, B.D., and Weiner, J.H. 1985.Molecular biology, biochemistry, and bioenergetics of fumaratereductase, a complex membrane-bound iron-sulfur flavoenzyme ofEscherichia coli. Biochim. Biophys. Acta 811: 381–403.

Deckers-Heberstreit, G., Greie, J.-C., Stalz, W.-D., and Altendorf, K.2000. The ATP synthase of Escherichia coli: structure and functionof Fo subunits. Biochim. Biophys. Acta 1458: 364–373.

Eposti, D. 1989. Prediction and comparison of the heme-binding sitesin membrane hemoproteins. Biochim. Biophys. Acta 977: 249–265.

Gennis, R., and Stewart, V. 1996. Respiration. In Escherichia coli andSalmonella: cellular and molecular biology. Vol. 1. Neidhardt, F.C.Ed. Washington, D.C.: ASM Press, pp. 217–261.

Groß, R., Simon, J., Lancaster, C.R.D., and Kroger, A. 1998.Identification of histidine residues in Wolinella succinogenes hydro-genase that are essential for menaquinone reduction by H2. Molec.Micorbiol. 30: 639–646.

Groß, R., Simon, J., and Kroger, A. 1999. The role of the twin-argininemotif in the signal peptide encoded by the hydA gene of thehydrogenase from Wolinella succinogenes. Arch. Microbiol. 172:227–232.

Guigliarelli, B., Asso, M., More, C., Augier, V., Blasco, F., Pommier, J.,Giordano, G., and Bertrand, P. 1992. EPR and redox characteriza-tion of the iron-sulfur centres in nitrate redcutases A and Z fromEscherichia coli. Evidence of a high and low potential class and theirrelevance in the electron-transfer mechanism. Eur. J. Biochem. 207:61–68.

Hackett, N.R., and Bragg, P.D. 1982. The association of two distinct bcytochromes with respiratory nitrate reductase of Escherichia coli.FEMS Micro. Lett. 13: 213–217.

Hagerhall, C., Aasa, R., von Wachenfeldt, C., and Hederstedt, L.1992. Two hemes in Bacillus subtilis succinate: menaquinoneoxidoreductase (complex II). Biochemistry 31: 7411–7421.

Hagerhall, C., Friden, H., Aasa, R., and Hederstedt, L. 1995.Transmembrane topology and axial ligands to hemes in thecytochrome b subunit of Bacillus subtilis succinate:menaquinonereductase. Biochemistry 34: 11080–11089.

Hagerhall, C., and Herderstedt, L. 1996. A structural model for themembrane-integral domain of succinate:quinone oxidoreductases.FEBS Lett. 389: 25–31.

Hagerhall, C. 1997. Succinate:quinone oxidoreductases. Variations ona conserved theme. Biochim. Biophys. Acta. 1320: 107–141.

Hederstedt, L., Maguire, J.J., Waring, A.J., and Ohnishi, T. 1985.Characterization by electron paramagnetic resonance and studies ofthe subunit location and assembly of the iron-sulfur clusters ofBacillussubtilis succinate dehydrogenase. J. Biol. Chem. 260: 5554–5562.

Henikoff, S., and Henikoff, J.G. 1993. Performance evaluation ofamino acid substitution matrices. Proteins: Struct. Funct. Genet. 17:49–61.

Henikoff, J.G., and Henikoff, S. 1996. Blocks database and itsapplications. Methods Enzymol. 266: 88–105.

148 Rothery et al.

Page 18: Sequence Similarity as a Predictor of the Transmembrane ...

Hofmann, K., and Stoffel, W. 1993. TMbase - a database of membranespanning protein segments. Biol. Chem. Hoppe-Seyler 347: 166.

Hussain, H., Grove, J., Griffiths, L., Busby, S., and Cole, J. 1994. Aseven-gene operon essential for formate-dependent nitrite reductionto ammonia by enteric bacteria. Molec. Micro. 12: 153–163.

Iverson, T.M., Luna-Chavez, C., Cecchini, G., and Rees, D.C. 1999.Structure of the Escherichia coli fumarate reductase respiratorycomplex. Science 284: 1961–1966.

Jones, R.W., and Garland, B.P. 1977. Sites and specificity of thereaction of bipyridylium compounds with anaerobic respiratoryenzymes of Escherichia coli. Effects of permeability barriersimposed by the cytoplasmic membrane. Biochem. J. 164: 199–211.

Jones, W.R., Lamont, A., and Garland, P.B. 1980. The mechanism ofproton translocation driven by the respiratory nitrate reductasecomplex of Escherichia coli. Biochem. J. 190: 79–94.

Jones, D.T. 1998. Do transmembrane protein superfolds exist? FEBSLett. 423: 281–285.

Kortner, C., Lauterbach, F., Tripier, D., Unden, G., and Kroger, A.1990. Wolinella succinogenes fumarate reductase contains adiheme cytochrome b. Molec. Microbiol. 4: 855–860.

Krafft, T., Bokranz, M., Klimmeck, O., Schroder, I., Fahrenholz, F.,Kojro, E., and Kroger, A. 1992. Cloning and nucleotide sequence ofthe psrA gene of Wolinella succinogenes polysulphide reductase.Eur. J. Biochem. 206: 5456–5463.

Krafft, T., Gross, R., and Kroger, A. 1995. The function of Wolinellasuccinogenes psr genes in electron transport with polysulfide as theterminal electron acceptor. Eur. J. Biochem. 230: 601–606.

Kyte, J., and Doolittle, R.F. 1982. A simple method for displaying thehydropathic character of a protein. J. Mol. Biol. 157: 105–132.

Lancaster, C.R.D., Kroger, A., Auer, M., and Michel, H. 1999.Structure of fumarate reductase from Wolinella succinogenes at2.2 A resolution. Nature 402: 377–385.

Lancaster, C.R.D., Groß, R., Haas, A., Ritter, M., Mðntele, W., andSimon, J. 2000. Essential role of Glu-66 for menaquinol oxidationindicates transmembrane electrochemical potential generation byWolinella succinogenes fumarate reductase. Proc. Natl. Acad. Sci.USA 97: 13051–13056.

Magalon, A., Lemesle-Meunier, D., Rothery, R.A., Frixon, C.,Weiner, J.H., and Blasco, F. 1997. Heme axial ligation by the highlyconserved His residues in helix II of cytochrome bnr of Escherichiacoli nitrate reductase A (NarGHI). J. Biol. Chem. 272: 25652–25658.

Magalon, A., Rothery, R.A., Lemesle-Meunier, D., Frixon, C.,Weiner, J.H., and Blasco, F. 1998. Inhibitor binding within theNarI subunit (cytochrome bnr) of Escherichia coli nitrate reductase A.J. Biol. Chem. 273: 10851–10856.

Matsson, M., Tolstoy, D., Aasa, R., and Hederstedt, L. 2000. Thedistal heme center in Bacillus subtilis succinate:quinone reductase iscrucial for electron transfer to menaquinone. Biochemistry 39:8617–8624.

Meek, L., and Arp, D.J. 2000. The hydrogenase cytochrome b hemeligands of Azotobacter vinelandii are required for full H2 oxidationcapability. J. Bacteriol. 182: 3429–3436.

Nakamura, K., Yamaki, M., Sarada, M., Nakayama, S., Vibat, C.R.,Gennis, R.B., Nakayashiki, T., Inokuchi, H., Kojima, S., and Kita, K.1996. Two hydrophobic subunits are essential for the heme b ligationand functional assembly of complex II (succinate:ubiquinoneoxidoreductase) from Escherichia coli. J. Biol. Chem. 271: 521–527.

Ohnishi, T., Moser, C.C., Page, C.C., Dutton, P.L., and Yano, T. 2000.Simple redox-linked proton-transfer design: new insights fromstructures of quinol-fumarate reductase. Structure 8: R23–R32.

Persson, B., and Argos, P. 1997. Prediction of membrane proteintopology utilizing multiple sequence alignments. J. Prot. Chem. 16:453–457.

Richardson, D.J., and Watmough, N.J. 1999. Inorganic nitrogenmetabolism in bacteria. Curr. Opin. Chem. Biol. 3: 207–219.

Rodrigue, A., Boxer, D.H., Mandrand-Berthelot, M.A., and Wu, L.-F.1999a. Requirement for nickel of the transmembrane translocationof Ni-Fe-hydrogenase 2 in Escherichia coli. FEBS Lett. 392: 81–86.

Rodrigue, A., Chanal, A., Beck, K., Mller, M., and Wu, L.-F. 1999b. Co-translocation of a periplasmic enzyme complex by a hitchhikermechanism through the bacterial TAT pathway. J. Biol. Chem. 274:13223–13228.

Rost, B., Casadio, R., Fariselli, P., and Sander, C. 1995. Transmem-brane helices predicted at 95% accuracy. Prot. Sci. 4: 521–533.

Rost, B., Fariselli, P., and Casidio, R. 1996. Topology prediction forhelical transmembrane proteins at 86% accuracy. Prot. Sci. 5:1704–1718.

Rothery, R.A., and Weiner, J.H. 1993. Topological characterization ofEscherichia coli DMSO reductase by electron paramagnetic reso-nance spectroscopy of an engineered [3Fe-4S] cluster. Biochemistry32: 5855–5861.

Rothery, R.A., Simala Grant, J.L., Johnson, J.L., Rajagopalan, K.V.,and Weiner, J.H. 1995. Association of molybdopterin guaninedinucleotide with Escherichia coli dimethyl sulfoxide reductase:effect of tungstate and a mob mutation. J. Bacteriol. 177:2057–2063.

Rothery, R.A., and Weiner, J.H. 1996. Interaction of an engineered[3Fe-4S] cluster with a menaquinol binding site of Escherichia coliDMSO reductase. Biochemistry 35: 3247–3257.

Rothery, R.A., Magalon, A., Giordano, G., Guigliarelli, B., Blasco, F.,and Weiner, J.H. 1998. The molybdenum cofactor of Escherichia colinitrate reductase A (NarGHI): effect of a mobAB mutation andinteractions with [Fe-S] clusters. J. Biol. Chem. 273: 7462–7469.

Rothery, R.A., Blasco, F., Magalon, A., Asso, M., and Weiner, J.H.1999. The hemes of Escherichia coli nitrate reductase A (NarGHI):potentiometric effects of inhibitor binding to NarI. Biochemistry 38:12747–12757.

Sambasivarao, D., Scraba, D.G., Trieber, C.A., and Weiner, J.H.1990. Organization of dimethyl sulfoxide reductase in the plasmamembrane of Escherichia coli. J. Bacteriol. 172: 5938–5948.

Sambasivarao, D., Turner, R.J., Simala-Grant, J.L., Shaw, G., Hu, J.,and Weiner, J.H. 2000. Multiple roles for the twin arginine leadersequence of dimethyl sulfoxide reductase of Escherichia coli. J. Biol.Chem. 275: 22526–22531.

Sambasivarao, D., Dawson, H.A., Zhang, G., Shaw, G., Hu, J.,and Weiner, J.H. 2001. Investigation of Escherichia coli dimethylsulfoxide reductase assembly and processing in strains defective forthe sec-independent protein translocation system Mtt. J. Biol. Chem.276: 20167–20174.

Sargent, F., Ballantine, S.P., Rugman, P.A., Palmer, T., andBoxer, D.H. 1998a. Reassignment of the gene encoding theEscherichia coli hydrogenase 2 small subunit. Identification of asoluble precursor of the small subunit in a hypB mutant. Eur. J.Biochem. 255: 746–754.

Sargent, F., Bogsch, E.G., Stanley, N.R., Wexler, M., Robinson, C.,Berks, B., and Palmer, T. 1998b. Overlapping functions ofcomponents of a bacterial Sec-independent protein export pathway.EMBO J. 17: 3640–3650.

Sawyers, R.G., and Boxer, D.H. 1986. Purification and properties ofthe membrane-bound hydrogenase isoenzyme-1 from anaerobicallygrown Escherichia coli K12. Eur. J. Biochem. 156: 265–275.

Simala-Grant, J.L., and Weiner, J.H. 1996. Kinetic analysis andsubstrate specificity of Escherichia coli DMSO reductase. Micro-biology 142: 3231–3239.

Simala-Grant, J.L., and Weiner, J.H. 1998. Modulation of the substratespecificity of Escherichia coli dimethylsulfoxide reductase. Eur. J.Biochem. 251: 510–515.

Sonnhammer, E.L.L., von Heijne, G., and Krogh, A. 1998. A hiddenMarkov model for predicting transmembrane helices in proteinsequences. Proc. Intl. Intelligent Systems for Molecular Biology 6:175–182.

Thompson, J.D., Higgins, D.G., and Gibson, T.J. 1994. CLUSTALW: improving the sensitivity of progressive multiple sequencealignment through sequence weighting, positions-specific gappenalties and weight matrix choice. Nucleic Acids Res. 22:4673–4680.

Turner, R.J., Busaan, J.L., Lee, J.H., Michalak, M., and Weiner, J.H.1997. Expression and epitope tagging of the membrane anchorsubunit DmsC. of Escherichia coli dimethylsulfoxide reductase.Protein Engineering 10: 285–290.

Unden, G., and Bongaerts, J. 1997. Alternative respiratory pathwaysof Escherichia coli: energetics and transcriptional regulation inresponse to electron acceptors. Biochim. Biophys. Acta 1320:217–234.

van Geest, M., and Lolkema, J.S. 2000. Membrane topology andinsertion of membrane proteins: search for topogenic signals.Microbiol. Molec. Biol. Rev. 64: 13–33.

Vibat, C.R., Cecchini, G., Nakamura, K., Kita, K., and Gennis, R.B.1998. Localization of histidine residues responsible for hemeaxial ligation in cytochrome b556 of complex II (succinate:ubiquinoneoxidoreductase) in Escherichia coli. Biochemistry 37: 4148–4159.

Volbeda, A., Charon, M.-H., Piras, C., Hatchikian, E.C., Frey, M., andFontecilla-Camps, J.C. 1995. Crystal structure of the nickel-ironhydrogenase from Desulfovibrio gigas. Nature 273: 580–587.

Membrane Protein Topology Prediction. 149

Page 19: Sequence Similarity as a Predictor of the Transmembrane ...

von Heijne, G. 1989. Control of topology and mode of assembly of apolytopic membrane protein by positively charged residues. Nature341: 456–458.

von Heijne, G. 1992. Membrane protein structure prediction. Hydro-phobicity analysis and the positive-inside rule. J. Mol. Biol. 225:487–494.

Weiner, J.H., MacIsaac, D.P., Bishop, R.E., and Bilous, P.T. 1988.Purification and properties of Escherichia coli dimethyl sulfoxidereductase, an iron-sulfur molybdoenzyme with broad substratespecificity. J. Bacteriol. 170: 1505–1510.

Weiner, J.H., Rothery, R.A., Sambasivarao, D., and Trieber, C.A.1992. Molecular analysis of dimethylsulfoxide reductase: a complexiron-sulfur molybdoenzyme of Escherichia coli. Biochim. Biophys.Acta 1102: 1–18.

Weiner, J.H., Shaw, G., Turner, R.J., and Trieber, C.A. 1993. Thetopology of the anchor subunit of dimethyl sulfoxide reductase ofEscherichia coli. J. Biol. Chem. 268: 3238–3244.

Weiner, J.H., Bilous, P.T., Shaw, G.M., Lubitz, S.P., Frost, L.,Thomas, G.H., Cole, J.A., and Turner, R.J. 1998. A novel andubiquitous system for membrane targeting and secretion of cofactor-containing proteins. Cell 93: 93–101.

Wu, L.-F., Chanal, A., and Rodrigue, A. 2000. Membrane targetingand translocation of bacterial hydrogenases. Arch. Microbiol. 173:319–324.

150 Rothery et al.