Top Banner
Vol. 171, No. 1 JOURNAL OF BACTERIOLOGY, Jan. 1989, p. 172-183 0021-9193/89/010172-12$02.00/0 Copyright © 1989, American Society for Microbiology Sequence of the Pseudomonas aeruginosa trpI Activator Gene and Relatedness of trpI to Other Procaryotic Regulatory Genes MING CHANG, AYELE HADERO,t AND IRVING P. CRAWFORD* Department of Microbiology, University of Iowa, Iowa City, Iowa 52242 Received 17 June 1988/Accepted 4 October 1988 In Pseudomonas aeruginosa, the trpl gene product regulates the expression of the trpBA gene pair encoding tryptophan synthase. trpl and tipBA are transcribed divergent4r. The trpl DNA sequence and deduced amino acid sequence were determined. The trpl start codon was found to be 103 base pairs from that of tqpB. trpi encodes a 293-residue protein and the size of the trpl gene product, measured on sodium dodecyl sulfate- polyacrylamide gels, was close to that calculated from the amino acid sequence. The amino acid sequence of trpl resembles that of Enterobacter cloacae am,pR, the regulatory gene for the ampC E. The N-terminal portions of trpl and ampR resemble corresponding portions of ilvY, metR, and lysR in Escherichia col and nodD in Rhizobium melilot. This resemblance may help to define a trpl-related family of activator proteins sharing a common structural plan. In Pseudomonas aeruginosa, seven trp structural genes, scattered at four chromosomal locations, accomplish the synthesis of tryptophan from its precursor, chorismate (Fig. 1) (6, 11, 27). Early studies showed that in fluorescent pseudomonads the genes for tryptophan synthase, trpB and trpA, are coordinately induced by the substrate, indolegly- cerol phosphate, rather than repressed by the product, L-tryptophan (6, 13). More recently, the P. aeruginosa PAC174 trpBA genes along with their control region were subcloned from an R68.44-derived R-prime plasmid contain- ing about 114 kilobases (kb) of P. aeruginosa chromosomal DNA into plasmid pBR322 (19, 27). The resulting 8.5-kb plasmid, pZAZ167, enables an Escherichia coli trpE trpB trpA auxotroph to grow on anthranilate in place of trypto- phan (19, 27). Several deletion mutants of pZAZ167 located upstream of the trpBA genes cause low, constitutive levels of tryptophan synthase even in the presence of the inducer indoleglycerol phosphate but can be restored to normal inducibility in trans (28). Therefore, it was proposed that a trpI gene, mapping upstream of the trpBA genes, encodes an activator-like mediator responsible for the induction of the trpBA genes (28). In this paper, we establish the location, orientation, and size of the trpl gene and present its DNA sequence and deduced amino acid sequence. Comparison of the trpI amino acid sequence with that of ampR in Entero- bacter cloacae, ilv Y, metR, and lysR in E. coli, and nodD in Rhizobium meliloti shows that these activator proteins have significant similarity in their N-terminal portions. The diver- sity in their C-terminal portions may reflect differences in the inducer molecules with which they interact. MATERIALS AND METHODS Bacterial strains, plasmids, and media. The bacterial strains and plasmids used or constructed in this study are shown in Table 1. The bacterial strains were cultured in Vogel-Bonner minimal medium E (38), in LB medium, or on X-Gal (5-bromo-4-chloro-3-indoxyl-,3-D-galactopyranoside) agar plates (37). X-Gal was obtained from Sigma Chemical Co., St. Louis, Mo. To ensure retention of vectors or * Corresponding author. t Present address: Department of Pediatrics, SUNY Health Sci- ences Center at Brooklyn, Brooklyn, NY 11203. recombinant plasmids, appropriate antibiotics were added in the following amounts (micrograms per milliliter): ampicillin, 200; tetracycline, 10; and chloramphenicol, 12.5. X-Gal plates were used to screen pUC plasmid derivatives trans- formed into JM83 or JM101 cells. Vogel-Bonner minimal medium was used for growth rate tests. Growth rate. Growth rate tests were used to assess the activity of the trpI gene product. The principle is based on the observation of Manch and Crawford (28) that E. coli IC1107 harboring an uninducible deletion plasmid such as pZAZ131 grows slowly on minimal agar plates supplemented with anthranilate or indole but grows well on tryptophan. However, cotransformation of pZAZ131 with a trpl-carrying plasmid such as pZAZ202 allows E. coli IC1107 to grow as rapidly on anthranilate as on tryptophan, but it still grows slowly on indole. In this study, growth rate tests were carried out by cotrans ormation of two compatible plasmids, one carrying the trpI gene and the other carrying the trpBA gene pair, into E. coli IC1107. The transformed cells were streaked on four kinds of minimal plates. After 12 to 36 h of incubation at 37°C, the extent of growth of the transformed cells was estimated by the size of isolated colonies. The four minimal plates used were labeled MAHC, MTAHC, MAAHC, and MIAHC. MAHC was a Vogel-Bonner minimal agar plate containing 0.05% (wt/vol) acid-hydrolyzed casein and 0.2% glucose but no other supplements. MTAHC, MAAHC, and MIAHC were MAHC containing 0.1 mg of tryptophan, anthranilate, and indole per plate, respectively. The MAHC plate was used to test for contamination of the transformed cell culture, since the IC1107 transformed cells remain trpE auxotrophs. The MIAHC plate was used to detect the occurrence of any constitutive mutants. A trpI-containing plasmid was defined as TrpI+ when the transformed cells grew as fast on the MAAHC plate as on the MTAHC plate. If the transformed cells grew slowly on the MAAHC plate, the plasmid was defined as Trpl-. Cells transformed with either a TrpI+B+A+ plasmid or a Trpl-B+A+ plasmid were always used as controls in the growth rate test. Plasmid DNA manipulation. The procedures for plasmid purification, restriction enzyme digestion, DNA ligation, agarose gel electrophoresis, and polyacrylamide gel electro- 172 on March 6, 2021 by guest http://jb.asm.org/ Downloaded from
12

for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

Oct 11, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

Vol. 171, No. 1JOURNAL OF BACTERIOLOGY, Jan. 1989, p. 172-1830021-9193/89/010172-12$02.00/0Copyright © 1989, American Society for Microbiology

Sequence of the Pseudomonas aeruginosa trpI Activator Gene andRelatedness of trpI to Other Procaryotic Regulatory Genes

MING CHANG, AYELE HADERO,t AND IRVING P. CRAWFORD*Department of Microbiology, University of Iowa, Iowa City, Iowa 52242

Received 17 June 1988/Accepted 4 October 1988

In Pseudomonas aeruginosa, the trpl gene product regulates the expression of the trpBA gene pair encodingtryptophan synthase. trpl and tipBA are transcribed divergent4r. The trpl DNA sequence and deduced aminoacid sequence were determined. The trpl start codon was found to be 103 base pairs from that of tqpB. trpiencodes a 293-residue protein and the size of the trpl gene product, measured on sodium dodecyl sulfate-polyacrylamide gels, was close to that calculated from the amino acid sequence. The amino acid sequence of trplresembles that of Enterobacter cloacae am,pR, the regulatory gene for the ampC E. TheN-terminal portions of trpl and ampR resemble corresponding portions of ilvY, metR, and lysR in Escherichiacol and nodD in Rhizobium melilot. This resemblance may help to define a trpl-related family of activatorproteins sharing a common structural plan.

In Pseudomonas aeruginosa, seven trp structural genes,scattered at four chromosomal locations, accomplish thesynthesis of tryptophan from its precursor, chorismate (Fig.1) (6, 11, 27). Early studies showed that in fluorescentpseudomonads the genes for tryptophan synthase, trpB andtrpA, are coordinately induced by the substrate, indolegly-cerol phosphate, rather than repressed by the product,L-tryptophan (6, 13). More recently, the P. aeruginosaPAC174 trpBA genes along with their control region weresubcloned from an R68.44-derived R-prime plasmid contain-ing about 114 kilobases (kb) of P. aeruginosa chromosomalDNA into plasmid pBR322 (19, 27). The resulting 8.5-kbplasmid, pZAZ167, enables an Escherichia coli trpE trpBtrpA auxotroph to grow on anthranilate in place of trypto-phan (19, 27). Several deletion mutants of pZAZ167 locatedupstream of the trpBA genes cause low, constitutive levels oftryptophan synthase even in the presence of the inducerindoleglycerol phosphate but can be restored to normalinducibility in trans (28). Therefore, it was proposed that atrpI gene, mapping upstream of the trpBA genes, encodes anactivator-like mediator responsible for the induction of thetrpBA genes (28). In this paper, we establish the location,orientation, and size of the trpl gene and present its DNAsequence and deduced amino acid sequence. Comparison ofthe trpI amino acid sequence with that of ampR in Entero-bacter cloacae, ilv Y, metR, and lysR in E. coli, and nodD inRhizobium meliloti shows that these activator proteins havesignificant similarity in their N-terminal portions. The diver-sity in their C-terminal portions may reflect differences in theinducer molecules with which they interact.

MATERIALS AND METHODSBacterial strains, plasmids, and media. The bacterial

strains and plasmids used or constructed in this study areshown in Table 1. The bacterial strains were cultured inVogel-Bonner minimal medium E (38), in LB medium, or onX-Gal (5-bromo-4-chloro-3-indoxyl-,3-D-galactopyranoside)agar plates (37). X-Gal was obtained from Sigma ChemicalCo., St. Louis, Mo. To ensure retention of vectors or

* Corresponding author.t Present address: Department of Pediatrics, SUNY Health Sci-

ences Center at Brooklyn, Brooklyn, NY 11203.

recombinant plasmids, appropriate antibiotics were added inthe following amounts (micrograms per milliliter): ampicillin,200; tetracycline, 10; and chloramphenicol, 12.5. X-Galplates were used to screen pUC plasmid derivatives trans-formed into JM83 or JM101 cells. Vogel-Bonner minimalmedium was used for growth rate tests.Growth rate. Growth rate tests were used to assess the

activity of the trpI gene product. The principle is based onthe observation of Manch and Crawford (28) that E. coliIC1107 harboring an uninducible deletion plasmid such aspZAZ131 grows slowly on minimal agar plates supplementedwith anthranilate or indole but grows well on tryptophan.However, cotransformation ofpZAZ131 with a trpl-carryingplasmid such as pZAZ202 allows E. coli IC1107 to grow asrapidly on anthranilate as on tryptophan, but it still growsslowly on indole.

In this study, growth rate tests were carried out bycotrans ormation of two compatible plasmids, one carryingthe trpI gene and the other carrying the trpBA gene pair, intoE. coli IC1107. The transformed cells were streaked on fourkinds of minimal plates. After 12 to 36 h of incubation at37°C, the extent of growth of the transformed cells wasestimated by the size of isolated colonies. The four minimalplates used were labeled MAHC, MTAHC, MAAHC, andMIAHC. MAHC was a Vogel-Bonner minimal agar platecontaining 0.05% (wt/vol) acid-hydrolyzed casein and 0.2%glucose but no other supplements. MTAHC, MAAHC, andMIAHC were MAHC containing 0.1 mg of tryptophan,anthranilate, and indole per plate, respectively. The MAHCplate was used to test for contamination of the transformedcell culture, since the IC1107 transformed cells remain trpEauxotrophs. The MIAHC plate was used to detect theoccurrence of any constitutive mutants.A trpI-containing plasmid was defined as TrpI+ when the

transformed cells grew as fast on the MAAHC plate as onthe MTAHC plate. If the transformed cells grew slowly onthe MAAHC plate, the plasmid was defined as Trpl-. Cellstransformed with either a TrpI+B+A+ plasmid or aTrpl-B+A+ plasmid were always used as controls in thegrowth rate test.

Plasmid DNA manipulation. The procedures for plasmidpurification, restriction enzyme digestion, DNA ligation,agarose gel electrophoresis, and polyacrylamide gel electro-

172

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 2: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

PSEUDOMONAS ACTIVATOR GENE 173

tryptophan repression Constitutive InGP induction

AS PRT InGPS PRAI TS

ChorismateAntiranilatePRA InGP Tryptophan

TS-A \\ / TS-BIndole

FIG. 1. Tryptophan biosynthetic pathway, with the correspond-ing trp genes in P. aeruginosa. The trpR gene encodes a repressoracting on trpE and the trpGDC cluster. The trpI gene encodes anactivator inducing the trpBA gene pair. The trpE and trpG genesencode the large and the small subunits of anthranilate synthase, andtrpA and trpB encode the a and subunits of tryptophan synthase.The gene products and their participation in biosynthetic reactionsare indicated by arrows. Abbreviations: AS, anthranilate synthase;PRT, anthranilate phosphoribosyltransferase; InGPS, indoleglyce-rol phosphate synthase; PRAI, N-phosphoribosylanthranilate isom-erase; TS, tryptophan synthase; PRA, N-phosphoribosylanthrani-late; CDRP, 1-(o-carboxyphenylamino)-1-deoxyribulose phosphate;InGP, indoleglycerol phosphate; TS-A, tryptophan synthase Areaction; TS-B, tryptophan synthase B reaction.

phoresis were those described by Maniatis et al. (29). WhenpACYC184-derived plasmids were purified, 300 mg of spec-tinomycin was added per liter of culture in the late exponen-tial phase to amplify plasmids. E. coli cells were madecompetent for transformation by the CaCl2 method (10).Plasmid pMI10 was pZAZ125 deleted of two AvaI frag-

ments. Plasmid pZAZ125 has a BamHI-SalI fragment (Fig.2A) inserted into the tet gene of pACYC184. The only AvaIsite in pACYC184 is located downstream of the Sall site andis within the tet gene. This AvaI site and two other AvaI sitesin the trpI gene (Fig. 2B) of pZAZ125 were cleaved; the restof plasmid was then self-ligated, generating pMI10. PlasmidpMI110 was pZAZ167 with its SphI-SphI fragments removed(Fig. 2A). Plasmid pBD6 was pMIT3 deleted of BalI-BalIand BalI-StuI fragments in the middle of the trpI gene (Fig.2B).BAL 31 exonuclease was used to decrease the size of

DNA fragments or generate various-sized deletions. Theexperimental procedure used is described by Maniatis et al.(29). Each batch of plasmid DNA was digested for varioustimes by a fixed amount of enzyme to decide the properconditions. The enzyme used was obtained from BethesdaResearch Laboratories, Inc., Gaithersburg, Md. The bufferand amount of enzyme used were those suggested by themanufacturer.-

Plasmids pUC18 and pUC19 were often used as cloningvectors because their makeup facilitates identification ofrecombinants (41). The multiple cloning sites in pUC18 andpUC19 are positioned in opposite directions relative to thelac promoter. A SalI-PstI fragment containing the entire trpIregion was subcloned into the Sall and PstI sites of pUC19and pUC18, generating pMI40 and pMI50, respectively.For DNA sequencing, DNA fragments were first labeled

with 32P by one of two methods: 5' labeling with terminaltransferase and [a-32P]2',3'-dideoxyadenosine triphosphate(42) or 3' fill-in labeling with the large fragment of E. coliDNA polymerase I and the appropriate radioactive deoxy-nucleotide triphosphate (26). Sequencing reactions werecarried out by the procedure of Maxam and Gilbert (26) and

TABLE 1. Bacterial strains and plasmids

E. coli strain or Relevant characteristics Referenceplasmid or source

StrainIC762 W3110 F- his trpE3B248A2 rpsL 19IC1107 W3110 F- trpE3B248A2 recA rpsL 28JM83 ara A(lac-proAB) rpsL 480 lacZAM15 41P678-54 thr leu 1JM101 supE thi A(lac-proAB) (F' traD36 proAB lacP lacZAM15) 41JM11o rpsL thr leu thi lacY galK galT ara tonA tsx dam dcm supE44 A(lac-proAB) 41

(F' traD36 proAB lacIq lacZAM15)PlasmidpBR322 Ampr Tcr 4pACYC184 Tcr Cmr 7pUC19 Ampr 41pUC18 Ampr 41pZAZ167 Ampr TrpI+B+A+ 27pZAZ131 Ampr Trpl-B+A+; from pZAZ167 by deletion of 368-bp SacII fragment 28pZAZ133 Ampr Trpl-B+A+; from pZAZ167 by deletion of 424-bp SacII fragment 28pZAZ202-489 Cmr TrpI+ This workpZAZ202-102 Cmr TrpI+ This workpZAZ125 Cmr TrpI+B+A+ 28pMI10 Cmr Trpl-B+A+; from pZAZ125 by deletion of AvaI fragment This workpMI40 Ampr TrpI+; PstI-SalI fragment subcloned into pUC19 This workpMI50 Ampr Trpl-; PstI-SatI fragment subcloned into pUC18 This workpMI110 Ampr TrpI+; from pZAZ167 by deletion of SphI fragments This workpMIB85 Cmr TrpI+; trpB region removed from pZAZ202-489 This workpMIK13 Ampr; BamHI-KpnI fragment from pMI85 subcloned into pUC19 This workpMIE6 Ampr TrpI+; BamHI-SaIl fragment from pMI85 subcloned into pUC19 This workpMIT3 Tcr Trpl+ This workpUC19TET Tcr This workpBD6 Tcr; from pMIT3 by deletion of BalI-BalI and BalI-StuI fragments This work

VOL. 171, 1989

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 3: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

174 CHANG ET AL.

AtrpI trpBA

PvSa Sa SaBa X B5pp s P PSp S Sa P PS P B

1000 2000 3000400b4000 bp

PZAZ131.......

pZAZ133

pZAZ202-489 £ pZAZO2-102

pZAZ125

PMI10......6.................................................. .....................................

pMI40la

4- lacPPMI50

lacP -+

PMI110

PMIB85I I

B

Pv

I 6400 600

D K Ea Ba StI IIII I I0

B00 1000

4~4

pMIB85, pMIE6 & pMIT3

pMIK13

pBD6

FIG. 2. (A) Restriction endonuclease cleavage map of the trpIBA portion of pZAZ167, with distances given in base pairs. Arrows indicatethe location and orientation of the trpBA and trpI genes (18, 27, 28; this work). (B) Sequencing strategy in the trpl region. Nucleotides arenumbered as in panel A. Below the restriction map are indicated the fragments used to obtain the sequence. Restriction enzyme abbreviations:A, AvaI; B, BamHI; Ba, BalI; Bg, BgII; Bs, BsshII; D, DdeI; H, Hinfl; K, KpnI; P, PstI; Pv, PvuI; S, Sall; Sa, SacIl; Sp, SphI; St, Stul;X, XhoI. , DNA fragments retained in the corresponding plasmids; , regions deleted in the plasmids indicated. Relative positions ofthe lacP promoter and the PstI-SalI DNA fragment are shown for plasmids pMI40 and pMI50.

S

200

(A)1 0X

14001200

Bas H

1600 bp

J. BACTERIOL.

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 4: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

PSEUDOMONAS ACTIVATOR GENE 175

developed by 8% urea-polyacrylamide gel electrophoresisaccording to Sanger and Coulson (31). The sequence datawere analyzed with the aid of the PCS computer program(23).

Minicell preparation and labeling. Plasmid-specified pro-teins were synthesized and labeled in minicells (8), using theE. coli P678-54 minicell-producing strain (1). Several steps inthe usual procedure were modified as described in Results.SDS-polyacrylamide gel electrophoresis. Labeled polypep-

tides were separated and their molecular weights wereestimated on sodium dodecyl sulfate (SDS)-polyacrylamidegels (39). The gels were prepared for autoradiography by thetechnique of Bonner and Laskey (5).

Analysis of protein sequence data. The FASTP programshows the alignment of amino acid sequences and gives twosimilarity scores, an initial score and an optimized score,which are calculated according to the PAM250 matrix (14),an amino acid replaceability matrix. The statistical signifi-cance of a score, termed the z value, can be obtained by theequation: z = (similarity score - mean of random scores)/standard deviation of random scores. The similarity scores,mean of random scores, and standard deviation of randomscores are available in the FASTP and XRDF programs,both offered by Bionet. The guidelines for z values are: z >3, possibly significant; z > 6, probably significant; z > 10,significant (25).

RESULTSLocalization of the trpl gene by BAL 31 deletions. To study

the trpI gene, the 2.2-kb Sall-SalI fragment from pZAZ167(Fig. 2A) was subcloned into plasmid pACYC184 (7), avector compatible with pZAZ131. Preliminary analysis ofthe trpI and trpBA genes on pZAZ167 suggested that thisSall DNA fragment contains the first part of the trpB geneand the entire region upstream of trpB, where trpI is located.Plasmids pZAZ202-489 and pZAZ202-102 were generatedhaving opposite orientations of the inserted fragment withinthe tet gene of pACYC184.Growth rate tests showed that pZAZ202 plasmids of either

orientation were TrpI+. Either version of pZAZ202, whencotransformed into IC1107 with the trpI-deficient plasmidpZAZ131, enhanced the growth rate on MAAHC plates tothe level of cells containing pZAZ167. This finding indicatesthat the trpI gene is within the 2.2-kb Sall-Sall DNAfragment and can be expressed from its own promoter.To localize the 5' and 3' boundaries of the trpI gene, BAL

31 exonuclease was used to create a series of deletionsinvading from either end. pZAZ202-489, having a uniquePvuI site at the 3' end and a unique PstI site at the 5' end(Fig. 2), was opened at the PvuI site and then treated withBAL 31 exonuclease for varying times. Plasmids of varioussizes were self-ligated by using T4 DNA ligase and trans-formed into E. coli IC1107. Deletion plasmids were analyzedby restriction enzyme digestion and transformed intopZAZ131-containing competent cells for growth rate tests.Results of growth rate tests suggested that TrpI activity wasretained until the deletion reached the DdeI site at base pair(bp) 649. Five of these plasmids were chosen for sequencingbecause restriction enzyme mapping showed that three ofthem contained the largest TrpI+ deletions and two had thesmallest TrpI- deletions. DNA sequencing precisely definedthe missing regions (Fig. 3). The largest deletion with aTrpI+ phenotype, that in plasmid 58, ended at bp 615,whereas the smallest Trpl- deletion, that in plasmid 62,stopped at bp 690. Apparently one boundary of the trpI genelies within this 76-bp segment, from bp 615 to 690.

The same method was used to construct a series ofdeletions at the opposite end. In this case, pZAZ202-489 wasdigested with PstI to generate linear molecules for BAL 31deletion. The results from growth rate tests and restrictionenzyme mapping showed that this boundary was locatedbetween the Hinfl and the BsshII sites. Of six deletionplasmids lacking the Hinfl site at bp 1601 and retaining aBsshII site at bp 1494, three were phenotypically TrpI+, onewas intermediate between plus and minus, and two wereTrpl-. The ends of the deletions were clarified by DNAsequencing (Fig. 3). Except for deletion P98, these deletionsare consistent with a boundary lying between bp 1544 and1586.Determination of the orientation of the trpI gene. A 2.0-kb

SalI-PstI fragment slightly shorter than the Sall-SailI frag-ment was inserted into the multiple cloning sites of pUC19and pUC18 (41), generating pMI40 and pMI50, respectively.In one case, the trpl gene should be codirectional with thestrong promoter lacP.

Plasmids pMI40 and pMI50 are incompatible withpZAZ131; therefore, pMI10 was made by deleting most ofthe trpI gene from pZAZ125, a pACYC184 derivative con-taining the entire trpIBA segment. The growth rate ofpMI10-transformed cells on MAAHC plates was similar tothat of pZAZ131-transformed cells and was increased bycotransformation with plasmid pMI110, made by deletion ofthe trpBA genes from pZAZ167.The carbon sources used in this experiment were slightly

modified from those used in previous studies (28). Threedifferent carbon sources were added to MAHC in an attemptto increase the rate of transcription from the lacP promoter:0.5% (wt/vol) sodium succinate, 0.2% (wt/vol) lactose, and0.2% (wt/vol) glucose plus 10-3 M IPTG (isopropyl-p-D-thiogalactopyranoside). All three gave the same result;IC1107 containing pMI10 and pMI40 grew faster on anthra-nilate than did IC1107 containing pMI10 and pMI50, whichindicated that in pMI40 the trpI gene is oriented in the samedirection as the lacP promoter. This finding shows that theorientation of the trpl gene is opposite to that of trpBAgenes. The result of one growth rate test using sodiumsuccinate as the carbon source is presented in Table 2. It isnot clear why the IC1107 cells containing pMI40 andpZAZ131 grew more slowly on MAAHC plates than did thesame cells containing pZAZ167. Plasmid pMIE6 has a struc-ture similar to that of pMI40 except that the trpB portion isdeleted in pMIE6. When cotransformed with pZAZ131,pMIE6 was able to enhance the growth rate of IC1107 cellson MAAHC plates to the level of cells containing pZAZ167.

Determining the DNA sequence and reading frame of thetrpl gene. A restriction map of the trpl region, along with thestrategy of DNA sequencing, is shown in Fig. 2B. The DNAsequence of both strands was determined by the Maxam-Gilbert method (26). There is an open reading frame consis-tent with the trpI termini determined by BAL 31 deletions.The nucleotide sequence of P. aeruginosa PAC174 trpI andthe deduced amino acid sequence are shown in Fig. 3.As discussed below, there are several alternative open

reading frames in the trpl region of pZAZ167. Making use ofthe gene fusion method, we identified the correct one bysubcloning the BamHI-KpnI fragment from pMIB85, de-scribed in the next paragraph, into the multiple cloning siteof pUC19 to make pMIK13. The cloning strategy is shown inFig. 4. The DNA sequence of pMIK13 shows that translationfrom the lacZ start codon ends 2 bp ahead of the start codonof the presumed trpI reading frame, and after 272 codons thepresumed reading frame is fused in frame with the lacZ

VOL. 171, 1989

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 5: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

176 CHANG ET AL.

AP110 (+)AB85(+)I AP131 M I

CATGGAAGGC TCCAGGAAAT GACGTATGGG CAQAAZLTAA TCCCCCCGAC GGGCAAAAGA AACCGATAAGGTACCTTCCGA,GGTCCTTTA CTGCATACCC GTCTTAGATT AGGGGGGCTG CCCGTTTTCT TTGGCTATTC*-- trpB --- HinfI 1587

AP98 (+)

APl12(±)4 AP114(-)ATTGCTTCAA CCTGTCAGA AAACTCACGA ATAGCC

TAACGAAGTT GGACAGTCCT TTTGAGTGCT TATCGG1537

Leu Arg AlaCTGCGCBGCT

BsshII

AP137 (-)4Met Ser Arg Asp Leu Pro Ser Leu Asn AlaATG AGC CGC GAC CTG CCC TCC CTG AAT GCC

--- trp --

Phe Giu Ala Ala Ala Arg Leu His Ser Ile Ser Leu Ala Ala Glu Glu LeuTTC GAA GCC GCT GCC CGG TTG CAC AGC ATC AGC CTG GCG GCC GAG GAA CTG

1480

His Val Thr His Gly Ala ValCAC GTT ACC CAT GGC GCC GTG

Ala Leu Phe Gly Arg Asp GlyGCC CTG TTC GGC AGG GAT GGA

Asp Ala Cys Gly Asp Ala PheGAC GCC TGC GGC GAT GCG TTC

Thr Ala Glu Ala Pro Phe ValACC GCC GAG GCC CCG TTC GTC

Pro Arg Leu Asp Arg Leu AsnCCG CGG CTG GAC CGG CTC AAC

Glu Gly Glu Phe Asp Pro ArgGAG GGC GAG TTC GAT CCG CGT

Pro Trp Pro Ala Asp Met GlnCCC TGG CCG GCG GAC ATG CAG

Ser Pro Arg Leu Ala Gln GluAGC CCG CGC CTG GCG CAG GAA

Ser ArgAGC CGG1420

Arg GlyCGC GGC1360

Glu ArgGAG CGA1300

Leu GlyCTC GGC1240

Arg AlaCGT GCC1180

Arg ProCGT CCC1120

Val PheGTC TTC1060

Thr GlyACC GGC1000

Gln Val Arg Leu Leu Glu Glu Asp Leu Gly ValCAG GTG CGG TTG CTC GAG GAA GAT CTC GGG GTG

Val Lys Leu Thr Asp Ser Gly Val Arg Leu ArgGTA AAA CTC ACC GAT TCC GGC GTT CGC CTG CGA

Leu Arg Gly Val Cys Ala Glu Leu Arg Arg GlnCTG CGT GGC GTC TGT GCC GAG CTG CGC CGG CAG

Val Pro Gly Ser Leu Leu Ala Arg Trp Phe IleGTA CCT GGC AGC CTG CTG GCG CGC TGG TTC ATC

Leu Pro Asp Leu Arg Leu Gln Leu Ser Thr SerCTC CCC GAC CTG CGC CTG CAA CTG TCC ACC AGC

Gly Leu Asp Ala Met Leu Trp Phe Ala Glu ProGGC CTG GAC GCC ATG CTC TGG TTC GCC GAG CCG

Glu Leu Ala Ser Glu Arg Met Gly Pro Val LeuGAG CTG GCT TCC GAG CGC ATG GGC CCG GTG CTC

Leu Ser Gln Ala Pro Ala Ala Arg Leu Leu GlnCTG TCC CAG GCG CCC GCC GCG CGG CTG TTG CAG

Glu Pro Leu Leu His Thr AlaGAG CCG CTG CTG CAT ACC GCC

Ser ArgTCG CGG940

Pro Gln Ala TrpCCC CAG GCC..TIGG

StuI

Pro Ala Ser Ala Ser Gln GlyCCG GCC TCG GCG AGC CAG GGG

a-peptide sequence. If this presumed reading frame is ex-

pressed directly or by translational coupling, the hybrid geneshould produce a fusion protein having some a-peptideactivity. If another open reading frame is actually used fortrpI, no oa-peptide activity will be seen. A ligated mixture ofBamHI-KpnI fragments and linearized pUC19 moleculeswas transformed into JM83. The transformed JM83 cellswere grown on L agar plus X-Gal and ampicillin. Ten whiteand ten blue colonies were picked at random. Plasmids were

analyzed by restriction enzyme digestion. Only seven plas-

mids, all purified from blue colonies, contained the BamHI-KpnI insert. This result showed that JM83 transformed withpMIK13 exhibits a blue color reaction with X-Gal, diagnos-tic of a-peptide activity and confirming the reading frame ofthe trpl gene.

Determination of the size of the "Ip gene product. Becauseof the small amount of the trpI gene product in pZAZ202-489-transformed cells, it was not feasible to attempt todetermine the size of the gene product directly. PlasmidpUC19, existing in high copy in the cell and containing the

J. BACTERIOL.

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 6: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

PSEUDOMONAS ACTIVATOR GENE 177

Leu Ala Ala Glu Ala Leu ArgCTG GCG GCG GAG GCG CTG CGC

Ala Ala Val Ala Gly Leu GlyGCG GCG GTG GCG GGC CTC GGC

Ala Ala Ala Pro Gly Gly ProGCC GCG GCG CCT GGC GGG CCC

Tyr Gly GlnTAL.GGC CAG880 BalI

Val Ala IleGITG.GCCATC820 BalI

Trp Gly PheTGG GGC TTC760

Gly Phe Glu His Leu Tyr Tyr Leu Leu GluGGG TTC GAG CAT CTC TAC TAC CTG CTG GAA

Ala Pro Glu Pro Leu Val Arg Asp Asp LeuGCC CCG GAG CCA CTG GTC CGC GAC GAT CTC

Ile Glu Thr Asp Ala Arg Leu Ala Leu TrpATC GAG ACC GAC GCG CGC CTG GCC CTG TGQ

Val ProGTA CCGKpnI

Ala Arg Leu His AspGCG CGC CTG CAC GAT

Leu Ala Gly *TTG GCA GGC TGA

DdeI

fLCGCTGGCG ATTTGCTGCA649

JA47 (-)Pro ArgCCG CGT700

4A62 (-)Ala Gly Arg LeuGCC GGG CGC CTG

4A58 (+)CTTTTCCGGC TCGGCTAGTG TCCATTGAAC

AAGAACCAGG CATCGTCCGG GGAGAGAGTT

GGCTCGTCCA AGACCAGCAT500

CATGTCCAAT550

Ala Gln Trp Leu Arg Glu GlnGCG CAA TGG TTG CGG GAG CAG

A4 6 (+)LA52 (+)

AGAGGCGCCC GTCGGGCGCG600

CACCACACCT ACAAGAAGAT CGAACTGGTC

CGAGGACGCC ATCAACAACG CCCTCGCCGA

ATCTGGAATG GTTCGAGGTG GTGGATACCC GCGGGCACAT CGAGAACGGT400

AGCGGCGAAG AGCATCCAGC450

GCCGTCGGCC ATTACCAGGT

GACCCTGAAA GTAGGGTTCC GTATCGCCAA350

TAGCTGATGG CACCCTGGAA ACCTGTCCAC.GATCGCCTGCPvuI

GCCCTGGAAA300

GGCCGCCGCG TCTTTCCGCT TCGCCGGATC ATGCACAGGC

ATGGGCGGCG CGCTCGTCTC AACCTTGCCG AATTTGGCAA200

TATGAGGGCG GCGTGCCTCC250

GCCTGACCCG TTCGGGTCTT CATCCATGCA

ACGAGGGAAT GTGTTCATGA150

AGAAGTTGAT GTTGGCAGTC GGCCTGTTTG CCGTGGCGGG CAGCGCATTC100

GCCGCCAAGC CCTGTGAGGA ACTGAAAGCC GAGATCGATG CGAAGATCAA50

GGCCAACGGC GTTCCTGCCT

ACACCCTGGA AATCGTCGAC1

SalIFIG. 3. Nucleotide sequence of the trpI gene from P. aeruginosa, 5' and 3' flanking regions of the gene, and amino acid sequence of the

trpl gene product. The trpB and trpI genes are transcribed divergently (dashed arrows). Initiation codons and putative ribosome-bindingsequences complementary to 16S rRNA are underlined. The BAL 31-deleted plasmids and their deletion endpoints are indicated above thesequence. For the 5'-end deletions, nucleotides to the right of the endpoint are retrained; for the 3'-end deletion, nucleotides to the left of theendpoint are retained. (+), (-), Designations of the phenotype of each deletion. Several restriction enzyme sites mentioned in the text areindicated below the sequence.

strong lacP promoter, was used to elevate the amount of thetrpl gene product in the cell. Since no convenient restrictionenzyme sites were available to subclone the trpI gene intopUC19, the BAL 31 enzyme was used to delete the trpBregion in pZAZ202-489. The ends of the BAL 31-digestedmolecules were made blunt by DNA polymerase I largefragment in the presence of all four deoxynucleotide triphos-phates and then ligated to BamHI linkers (CGGATCCG).

Plasmid pMIB85 having a BamHI restriction enzyme sitelocated 87 nucleotides ahead of the presumed trpl startcodon, was constructed in this way. The BamHI-SalI DNAfragment from pMIB85 was cloned into pUC19 to generatepMIE6.

Plasmid-directed polypeptides synthesized in E. coli mini-cells and separated on SDS-polyacrylamide gels were usedto examine the size of the trpI protein. The preliminary

VOL. 171, 1989

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 7: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

178 CHANG ET AL. J. BACTERIOL.

TABLE 2. Growth rate test to determine the orientation of the trpl gene

Colony size on':

Cell (plasmid) MTAHC MAAHC MIAHC MAHC

22h 34h 22h 34h 22h 34h 22h 34h

IC1107(pZAZ167) + ++ + ++ + + - -IC1107(pZAZ125) + ++ + ++ - - - -IC1107(pMI10) + ++ - + - - -IC1107(pMI10, pMIll0) + ++ + ++ + + - -IC1107(pMI10, pMI40) + ++ +* +- + - -IC1107(pMI10, pMI50) + ++ - - - - -IC1107(pMI10, pMIE6) + ++ + ++ + + -

0 Media are described in Materials and Methods. Sodium succinate was used as the carbon source. The first reading was 22 h after innoculation, and the secondwas 12 h later. Colony size: -, single colonies barely visible to the naked eye; , colonies about 0.1 to 0.2 mm in diameter, easily visible to the naked eye; +*,colonies about 0.5 mm in diameter; +, colonies about 1 mm in diameter; + +, colonies about 1.5 mm in diameter.

experiment showed that among the pMIE6-directed poly-peptides, the trpI and ,-lactamase polypeptides comigratedon SDS-polyacrylamide gels. The size of the trpl geneproduct predicted from translation of its DNA sequence is31,950 daltons. The ,-lactamase encoded by the ampicillinresistance gene in the pUC19 derivative has two forms inminicells; the sizes of the unprocessed and processed 3-lactamases are 31,410 and 28,850 daltons, respectively (2,36). To visualize the trpl polypeptide, the ampicillin resis-tance gene was removed and replaced by the tetracyclineresistance gene for selection. A portion of the ampicillin

pMIB85

SalIBamHIa

XhoI KpnI

BamHI & KpnI

digestion

resistance gene was deleted, and a tetracycline gene car-tridge, the EcoRI-AvaI DNA fragment from pBR322, wasinserted in its place, resulting in pMIT3. Plasmid pMIT3 wastransformed into the minicell-producing strain E. coli K-12P678-54 for in vivo translation. Suitable amounts of minicellextracts were loaded on a 10% SDS-polyacrylamide gel (Fig.5). Comparison of the polypeptides produced from pMIT3and pUC19TET shows that there is one extra band having amolecular weight of about 29,000 and migrating to theposition of the two j-lactamase bands in pUC19. Whenproduced from the deletion mutant pBD6, this polypeptide is

lacP lacZtBamlI KpnIMultiple cloning site

I BamHlI KpnI

digestion

Ligated by T4 DNA ligase

Transformed into JM83

Plasmids purified from blue colonies

I

lacP W lacZ'BamHI_

_ ~~KpnI

FIG. 4. Construction of plasmid pMIK13. The BamHI-KpnI fragment, containing nearly 90% of the trpl gene, is positioned downstreamof the lacP promoter so that the orientation of the trpI gene is the same as the direction of transcription from the lacP promoter. The multiplecloning site of pUC19, a 54-bp synthetic oligonucleotide, is inserted between the sixth and seventh codons of lacZ'. lacZ' encodes theN-terminal 145 residues of f-galactosidase, termed the a-peptide, whose activity is not affected by insertion of the multiple cloning site. Thetrpl gene region is indicated by heavy lines. Construction of pMIB85 is described in Results.

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 8: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

PSEUDOMONAS ACTIVATOR GENE 179

1 2 3 4 5 6

97,4k

68. 0

43. l .

25.7

1 8.4

FIG. 5. Polypeptides produced by pMIT3- and pBD6-containingminicells. Polypeptides were labeled in minicells by [35S]methionineplus '4C-amino acids and separated on a 10% polyacrylamide gel inthe presence of SDS. Lane 1, "4C-labeled protein markers (97.4, 68,43, 25.7, and 18.4 kilodaltons). Polypeptides specified by the follow-ing plasmids are shown: lane 2, pMIT3; lane 3, pBD6; lane 4,pUC19tet; lane 5, pUC19; lane -6, pMIK13. Plasmid pUC19tet, a

pUC19 derivative, was constructed by the method used to constructplasmid pMIT3. The arrows beside lanes 2, 3, and 6 indicate thepolypeptides encoded by the intact trpI gene, the defective trpIgene, and the trpI-lacZ' hybrid gene, respectively.

proportionally decreased in size (the construction of pBD6 isdescribed in Materials and Methods). Apparently, this extraband is the product of the trpl gene.

Similarity among the trpl, ampR, ilvY, metR, lysR, and nodDproteins. With the help of the FASTP program (25), the trpIamino acid sequence was compared with protein sequencesin the PIR (Protein Identification Resource) and SWISS-PRO (Swiss Protein) data bases. ampR in Enterobactercloacae showed a significant similarity through the entireamino acid sequence (Fig. 6A). lysR in E. coli (35) and nodDin R. meliloti (16) were found to have some similarity to thetrpl protein in their N-terminal portions. Subsequently, themetR (30) and ilv Y (40) proteins in E. coli were examined andalso showed similarity in this region. These six proteins, allfunctioning as transcriptional activators, had various de-grees of similarity along their amino acid sequences. It isinteresting that they are all related in the N-terminal regions(Fig. 6B). The order of their listing in the figure indicatestheir relatedness based on similarity scores calculated by theXRDF program (data not shown). No gaps were introducedinto this region for alignment. If gaps are allowed, severaladditional small regions can be matched in other parts of theproteins. However, not all of these small regions are com-

mon to all six proteins. Analysis of similarity scores indi-cates that the N-terminal portions of each of the adjacentpairs are significantly similar (based on the initial z value; see

Materials and Methods).

DISCUSSIONThe approximate location of the trpI gene was established

by genetic studies using low-level, constitutive mutants of

pZAZ167; two such derivatives, pZAZ131 and pZAZ133, aredeleted in different but adjacent SacII fragments. This find-ing suggests that the trpI gene may extend over these twoSacII fragments (28). When sequencing in the vicinity oftrpI, upstream of the trpB gene in pZAZ167, we found thisregion to have a very high G+C content, resulting in severecompression in several areas of both strands of the DNAsequence. (The average G+C content in the genome of P.aeruginosa is reported to be 65% [32], and that in the trpBAgene pair is 68% [18]). The 72% G+C content in the trpIgeneregion also results in a low frequency of occurrence of stopcodons and a corresponding abundance of open readingframes on both strands of the DNA sequence. Consequently,we found it necessary to use deletions generated by BAL 31nuclease to better determine the boundaries of the trpI gene.The results of BAL 31 deletion experiments showed that

the 3' boundary of the trpl gene lies between bp 615 and 690;the 5' boundary is located between bp 1544 and 1586. Thus,the trpI gene extends from about 60 bp upstream of the trpBstart codon to the middle of one SacII fragment (Fig. 3),consistent with the findings of Manch and Crawford (28). P98and P114, 5'-end deletion mutants, have identical sequencesin the trpl gene region but differ in the extent of deletion intothe adjacent tet gene. The difference in phenotype of thesetwo deletions may be caused by the upstream flankingsequence. The orientation of the trpI gene in pZAZ202-489 isopposite to the transcriptional direction of the tet gene;therefore, it is unlikely that expression of the trpI gene isaffected by the promoter of the tet gene. Close and Rodri-guez (9) used a chloramphenicol resistance gene cartridge tomap the transcriptional activities of pBR322. They foundthat a low level of transcription occurs in the antisensedirection within the tet gene. This transcriptional activitymay extend to the trpI gene in deletion P98.The orientation of trpI is opposite to that of the trpBA gene

pair. Such an arrangement, in which a regulatory gene istranscribed divergently from the gene set that it regulates,has been observed before (24). These two genes or operonscan share a common control region. In some cases, the twostart sites are so close that the promoters overlap, as witharaC-araBAD (24) and ilvY-ilvC in E. coli (40) and nahR-nahG in the Pseudomonas NAH7 plasmid (33). In the trpland trpBA system, the 103-bp distance between the twoinitiation codons implies that the promoters may partiallyoverlap (Fig. 3).Growth rate tests were used to examine the activity of the

trpI gene product in transformed cells. These tests do not,however, precisely measure the amount of the trpl geneproduct. Once enough endogenous tryptophan is made tosupply the cell, the level of tryptophan synthase, the productof the trpBA gene pair, is not limiting for growth. PlasmidpMIK13 was used to confirm the reading frame of trpIpredicted from the DNA sequence. The trpI-lacZ' fusionprotein generated in pMIK13-containing cells can also beused to monitor the level of the trpI gene product. We foundthat transformed cells cultured in glycerol plus IPTG had thehighest level of ,B-galactosidase activity. This culture condi-tion was applied to the minicell system to generate enough ofthe trpI protein to be seen on SDS-polyacrylamide gels.One open reading frame, encoding 293 residues found

within the trpI boundaries, was confirmed to be expressed invivo. In recombinant plasmid pMIK13, this reading framewas fused in frame with the a-peptide sequence. The a-peptide activity was detected in pMIK13-transformed cells,which indicated that this fusion protein was synthesized invivo (Fig. 5). In addition, a full-sized fusion polypeptide was

VOL. 171, 1989

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 9: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

180 CHANG ET AL.

A.

Trpl

ApR

MSRDI.PSIl.NALRAFEAAAI 11.S I SI.MAAE:.I.fIVIVHGAVSRQVRLI.EEDLGVALFGRDGRGVKLTDSGVRLRDACGDAFERL 80

MTRSYI..Pl.NSt.KAFEAAARHI.SFI'iMAA I EI.NVTrIISAI SQHVKT'LEQHlINCcI,FVRVSRGLMI.TTEGEINLlIPVLNDSFDR I 80

RGVCAELRRQrAEAPFVLGVPGSI.I.ARWF I PIRLDRLNRALPDLRLQLSTSEGEFDPRRPGLDAMLWFAEPPWPAD1QVFE 160

AGMLDRFANHRAQEKLKIGVVG'tFA'T'GVLESQLEDFRRGYPHIDLQLSTHNNRVDPAAEGLDYTIRYGGGAWHGiEAEF- 159

IASERMGPVLSPRLAQETGLSQAPAARLLQEPLlHT'ASRPQ--AWPASASQGLAAEALRYGQGFEHILYYLLEAAVAGI.GV 238

LCHAPlAPLCTPDIA--ASl.-HSPAD-Il.RFTLLRSYRRDEWTAWMQAAGEHPPSPTHR-VMVFDSSVTMLEAAQAGVGI 234

AIAPEPLVRDDI.AAP-GGPWGI-lETDARlAI.WVPARLHDPRAGRLAQWLREQLAG. T AA LL RAR.......LV.....AIAPVDMFTliLLASERIVQPF^'rQIELGSYWI..TRLQSRAETPAMREFS'R:WLVEKMKK

8.

E. cloacae

P. aeruginosa

E. coli

E. coll

E. coll

R. meliloti

ampR 7

trpI 7

ilvY 2

metR

lysR

nodD

3

5

10

ET L7 V TH El A IHJEJ1;V | JG l

A M IHI P1g1p LzH HE 0 IR[ IHI T S HElMl

R I E L S QJ M

P L 1S1 S L R1 A F E A A| A R1 H I. s [H El H A AS L N A L R A F E A A A J L H S I S L J A

1JL LEADL K T L H L A E E] Q H El G R S A

E I K HI|T R N S G S A D A AN L [j H I E I [a H A V MT A G S L T E A A

L E] L [o V E 1 D AJ [iL [E 1 K A

* * * * * * *

s

s

s

s

s

S

S--

|Q H V K T L E1 H L N

flR QVRLL E ED G

H U F S D LIEEl R IG

El E L ITR F K V I G

A A E] ALRJ R T Y F GU

c

v

Q

F

L

D

QA

p

R

K

E

L F

L F

L F

L F

L F

L F

I

E

El

A

H

El

GlR JElE

G R DElSRM K E

E LSEl RS M Q i

R G ] M L T T E G [j] N L Ei 70

R S V K L T D S G V L R 70

RT V T L T D A G mE L R 65

Q P 1 R F T E[ G E V L E 66

G R 1. H P T V uG I. F 68

El E I ET [a R A [E A J. A -1

* * * *

FIG. 6. (A) Alignment of the trpl and ampR amino acid sequences, performed by the FASTP program. Symbols: :, identical residues;.,residues frequently substituted in evolution; -, insertions made during optimization. The similarity between the trpl and ampR proteins was

evaluated by the z value, calculated by the XRDF program. The optimized similarity score of the trpl-ampR pair is 374. The mean of randomscores and standard deviation of random scores were 38.3 and 8.59, respectively. Therefore, the z value is significant at 39.08 (see Materialsand Methods). (B) Alignment of the N-terminal portions of the trpI, ampR, ilvY, metR, lysR, and nodD amino acid sequences. Residuenumbers are given at the beginning and end of each sequence. Positions with identical residues are boxed; those with residues similar oridentical are starred. Similarity groups: D = E; F = Y; N = Q; K = R; S = T; and I = L = V = F = M.

293

291

J. BACTERIOL.

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 10: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

PSEUDOMONAS ACTIVATOR GENE 181

TABLE 3. Codon usage in P. aeruginosa trpI

Amino acid Codon Usage Amino acid Codon Usage Amino acid Codon Usage Amino acid Codon Usage

Phe JUU 0 Ser UCU 0 Tyr UAU 1 Cys UGU 1Phe UUC 10 Ser UCC 5 Tyr UAC 2 Cys UGC 1Leu UUA 0 Ser UCA 0 END UAA 0 END UGA 1Leu UUG 5 Ser UCG 2 END UAG 0 Trp UGG 7Leu CUU 0 Pro CCU 2 His CAU 3 Arg CGU SLeu CUC 11 Pro CCC 7 His CAC 3 Arg CGC 14Leu CUA 0 Pro CCA 1 Gln CAA 2 Arg CGA 2Leu CUG 31 Pro CCG 12 Gln CAG 10 Arg CGG 9Ile AUU 0 Thr ACU 0 Asn AAU 1 Ser AGU 0Ile AUC 4 Thr ACC 7 Asn AAC 1 Ser AGC 8Ile AUA 0 Thr ACA 0 Lys AAA 1 Arg AGA 0Met AUG 4 Thr ACG 0 Lys AAG 0 Arg AGG 1Val GUU 2 Ala GCU 3 Asp GAU 7 Gly GGU 0Val GUC 4 Ala GCC 22 Asp GAC 8 Gly GGC 18Val GUA 3 Ala GCA 1 Glu GAA 5 Gly GGA 1Val GUG 6 Ala GCG 19 Glu GAG 16 Gly GGG 5

visible on SDS-polyacrylamide gels after synthesis in theminicell system. The sizes of both the fusion polypeptide andthe intact trpI polypeptide estimated from SDS-polyacryl-amide gels were close to those predicted from the amino acidsequence. Deletion of 37 residues in the middle of thisreading frame resulted in loss of the activating function ofthe trpl protein (data not shown) and decreased the size ofthe trpI polypeptide in SDS-polyacrylamide gels.The codon usage in trpl, similar to that of other Pseudo-

monas genes, shows a marked preference for G or C in thethird position (Table 3) (12, 18). As noticed when P. aerugi-nosa trpBA was compared with trpBA from other bacteria,lysine is used less frequently, usually being substituted forby arginine (18); in the trpl gene arginine is also muchpreferred, composing 10.6% of the total residues, whereashistidine and lysine constitute 2.0 and 0.3%, respectively. Inaddition, the preference for leucine over isoleucine is re-markable in comparison with the case in other Pseudomonasgenes; leucine accounts for 16% and isoleucine accounts foronly 1.4% of the total residues. The amino acid compositionof trpl is shown in Table 4. The slightly higher G+C content(72% versus about 65%) and biased usage of certain aminoacids in the trpl gene might be accounted for by the feebleexpression of this gene, as is true of regulatory genes ingeneral.The trpl, ampR, ilvY, metR, lysR, and nodD gene prod-

ucts shown in Fig. 6B have some common features. Theyfunction as activators, required for copious expression ofother genes. In the presence of the P-lactam antibioticcefoxitin, the ampR product induces the expression ofampC, encoding cephalosporinase. The ilvY protein acti-vates the expression of iIvC, encoding acetohydroxy acidisomeroreductase, the second enzyme in the isoleucine-valine biosynthetic pathway (40). The metR product isrequired for expression of the metE and metH gene prod-ucts, enzymes responsible for the methylation of homocy-steine to methionine (30). The lysR protein activates the lysAgene, encoding diaminopimelate decarboxylase (35). In thepresence of root extract, the nodD protein induces thenodABC genes, which are involved in root invasion and thestimulation of nodule development (16, 17). In some casesthese activators may serve to repress their own synthesis.All of these activators are similar in size: the trpI, ampR,ilvY, lysR, and nodD proteins have 293, 291, 297, 311, and308 residues, respectively, whereas the metR protein has 276residues. These regulatory genes are all transcribed diver-

gently from the gene sets that they regulate, and they oftenlie close enough that the divergent promoters are overlap-ping. For the lysR-lysA pair, the start codons are separatedby 121 bp; the start codon of nodD is 266 bp from that ofnodA. For the ampR-ampC pair, the transcription initiationsites are separated by 55 bp; those of the metR-metE pairand the ilvY-i1vC pair are 25 and 45 bp apart, respectively.The significant similarity in N-terminal portions in this

family of activators probably reflects a common functionamong them. In the case of the catabolite activator protein,the N-terminal domain contains the cyclic AMP binding siteand the C-terminal domain has the DNA-binding region,which recognizes a specific DNA sequence and also inter-acts with RNA polymerase (22). In these six proteins theconserved N-terminal portions may be the DNA-bindingregions. The diversity of their C-terminal portions may relateto inducer specificity, which suggests that this portion maycontain the inducer-binding site. A 20-residue sequencewithin the N-terminal portion of lysR has been proposed to

TABLE 4. Deduced amino acid composition of the P. aeruginosatrpl gene producta

CompositionAmino acid

No. of residues t

Ala 45 15.4Arg 31 10.6Asn 2 0.7Asp 15 5.1Cys 2 0.7Gln 12 4.1Glu 21 7.2Gly 24 8.2His 6 2.0Ile 4 1.4Leu 47 16.0Lys 1 0.3Met 4 1.4Phe 10 3.4Pro 22 7.5Ser 15 5.1Thr 7 2.4Trp 7 2.4Tyr 3 1.0Val 15 5.1

a Total number of residues, 293; molecular weight, 31,950.

VOL. 171, 1989

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 11: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

182 CHANG ET AL.

be a cro-like DNA-binding region; His-29 and its adjacentresidues serve as the turn (15). The alignment of the trpIprotein with the lysR protein offers another prediction for thelocation of a helix-turn-helix; that is, Gly-35 in the trpIprotein and serine or proline for the other proteins may serveas the center of the turn. The ilvY member has been provento be a DNA-binding protein, and RNA polymerase bindingto the ilvC promoter is mediated by the ilvY protein in thepresence of the acetohydroxy acid isomeroreductase sub-strate (40a). In all likelihood, this entire family of activatorsaffects the expression of other sets of genes after binding toa specific region of DNA.

Independently, Appelbaum et al. (3) found the similaritybetween Rhizobium japonicum nodD2 and E. coli lysR. TheR. japonicum nodD2 gene product is about 69%o identical tothat ofR. meliloti nodD. Appelbaum et al. also compared theamino acid sequences of the nodDI and nodD2 gene prod-ucts from R. japonicum USDA 191 to those of the relatednodD gene products from R. meliloti, Rhizobium trifolii,Rhizobium leguminosarum, and Bradyrhizobium ("Para-sponia") sp. Interestingly, they found that the N-terminalregions had the greatest similarity; 60 of the first 80 residueswere identical in all six genes. Spaink and co-workers (34)concluded that although the variqus nodD gene products arefunctionally interchangeable, they are not identical. Horvathand co-workers (21) propose that the nodD genes mayencode determinants of host specificity by interacting withdifferent plant factors. Comparison of amino acid sequencesof the nodD family shows a similar pattern in that they areclosely related in the N-terminal portions and more diversein the C-terminal portions. This agrees with our proposal forthe structural plan for all of the trpI-related activators; theN-terminal portion is responsible for DNA binding, and theC-terminal portion interacts with the inducer.

Recently, the N-terminal portion of the catM repressor ofAcinetobacter calcoaceticus was shown to share apparentsimilarity with the six proteins mentioned above (E. L.Neidle, personal communication). This 251-residue regula-tory protein controls the induction of a series of enzymes forcatechol degradation; its transcription diverges from one ofthe operons that it regulates. Genetic tests show clearly,however, that catM encodes a repressor, not an activator.This finding further suggests that this family of regulatoryproteins may have evolved to express varied functions,while its N-terminal portion still retains specific DNA-binding ability.The trpl-related family of regulatory proteins has recently

been independently described by Henikoff et al. (20). UsingGENPRO software to search the EMBL, GENBANK, andNBR-PIF data bases, they identified three additional exam-ples from E. coli and Salmonella typhimurium and one fromAlcai'genes eutrophus. Those from the enteric bacteriaincluded cysB, an activator in the cysteine synthetic path-way, and two genes of unknown function, leuO, locatedbetween the leuACBD and ilvIH operons, and antO, locatedupstream of the gene for the Na+/H+ antiporter. The Alca-ligenes example is also an open reading frame of unknownfunction found upstream of the tfdA (2,4-dichlorophenoxya-cetate monooxygenase) gene. Henikoffet al. (20) proposed ahelix-turn-helix motif located in the N-terminal region ineach of the family members they identified. They found nosequence similarity between these regulatory proteins andthose of the ompR family.

ACKNOWLEDGMENTSThis work was supported by Public Health Service grant A120279

from the National Institutes of Health. Computer resource$ for thedata base search were provided by the National Institutes ofHealth-sponsored BIONET National Computer Resource for Mo-lecular Biology (Public Health Service grant 1 U41 RR01685).We thank R. Milkman for critically reading the manuscript and

E. L. Neidle and L. N. Ornston for communicating their results tous prior to publication.

ADDENDUM IN PROOF

These data will appear in the EMBL/GenBankIDDBJnucleotide sequence data bases under accession numberM21093.

LITERATURE CITED1. Adler, H. I., W. D. Fisher, A. Cohen, and A. HardAgres. 1967.

Miniature Escherichia coli cells deficient in DNA. Proc. Natl.Acad. Sci. USA 57:321-326.

2. Ambler, R. P., and G. K. Scott. 1978. Partial amino acidsequence of the ampicillin resistance gene of Escherichia coliplasmid R6K. Proc. Natl. Acad. Sci. USA 75:3732-3736.

3. AppelbFm, E. R., D. V. Tompson, K. Idler, and N. Chartrain.1988. Rhizobiumjaponicum USDA191 has two nodD genes thatdiffer in primary structure and function. J. Bacteriol. 170:12-20.

4. Bolivar, F., R. L. Rodriguez, P. J. Greene, M. C. Bach, H. L.Heynecker, H. W. Boyer, J. H. Crosa, - S. Faltow. 1977.Construction and characterization ofnew cloning vehicles. II. Amultipurpose cloning system. Gene 2:95-113.

5. Bonner, W. H., and R. A. Laskey. 1974. A film detection methodfor tritium-labeled proteins and nucleic acids in polyacrylamidegels. Eur. J. Biochem. 46:8348.

6. Calhoun, D. H., D. L. Pierson, and R. A. Jenen. 1973. Theregulation of tryptophan biosynthesis in Pseudomonas aerugi-nosa. Mol. Gen. Genet. 121:117-132.

7. Chang, A. C. Y., and S. N. Cohen. 1978. Construction andcharacterization of amplifiable multicopy DNA cloning vehiclesderived from the P15A cryptic miniplasmid. J. Bacteriol. 134:1141-1156.

8. Clirk-Curtiss, J. F., and R. Curtis MI. 1983. Analysis ofrecombinant DNA using Escherichia coli minicells. MethodsEnzymol. 101:347-362.

9. Close, T. J., and R. L. Rodrguez. 1982. Construction andcharacterization of chloramphenicol-resistance gene cartridge: anew approach to the transcription mapping of extrachromo-somal elements. Gene 20:305-316.

10. Cohen, S. N., A. C. Y. Chang, and L. Hsu. 1973. Construction ofbiologically functional bacterial plasmids in vitro. Proc. Natl.Acad. Sci. USA 70:3240-3244.

11. Crawford, I. P. 1987. The synthesis of tryptophan from chori-smate: comparative aspects. Methods Enzymol. 142:293-300.

12. Crawford, I. P., and L. Eberly. 1986. Structure and regulation ofthe anthranilate synthase genes in Pseudomonas aeruginosa. I.Sequence of trpG encoding the glutamine amidotransferasesubunit. Mol. Biol. Evol. 3:436448.

13. Crawford, I. P., and I. C. Gunsalus. 1966. Inducibility oftryptophan synthase in Pseudomonas putida. Proc. Natl. Acad.Sci. USA 56:717-724.

14. Dayhoff, M. 1978. Atlas of protein sequence and structure, vol.5, suppl. 3. National Biomedical Research Foundation, SilverSpring, Md.

15. Dodd, I. B., and J. B. Epn. 1987. Systematic method for thedetection of potential X Cro-like DNA-binding regions in pro-teins. J. Mol. Biol. 194:557-564.

16. Eglehoff, T. T., R. F. Fisher, T. W. Jacobs, J. T. Mula, andS. R. Long. 1985. Nucleotide sequence of Rhizobium meliloti1021 nodulation genes: nodD is read divergently from nodABC.DNA 4:241-248.

17. Fisher, R. F., H. L. Brierley, J. T. Muigan, d S. R. Long.1987. Transcription of Rhizobium meliloti nodulation genes. J.Biol. Chem. 262:6849-6855.

J. BACTERIOL.

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from

Page 12: for Sequence Pseudomonas aeruginosa trpI Activator Gene ...JOURNALOFBACTERIOLOGY, Jan. 1989, p. 172-183 Vol. 171, No. 1 0021-9193/89/010172-12$02.00/0 Copyright ©1989, American Society

PSEUDOMONAS ACTIVATOR GENE 183

18. Hadero, A., and I. P. Crawford. 1986. Nucleotide sequence ofthe genes for tryptophan synthase in Pseudomonas aeruginosa.Mol. Biol. Evol. 3:191-204.

19. Hedges, R. W., A. E. Jacob, and I. P. Crawford. 1977. Wideranging plasmid bearing the Pseudomonas aeruginosa trypto-phan synthase genes. Nature 267:283-284.

20. Henikoff, S., G. W. Haughn, J. M. Calvo, and J. C. Wallace.1988. A large family of bacterial activator proteins. Proc. Natl.Acad. Sci. USA 85:6602-6606.

21. Horvath, B., C. Bachem, J. Schell, and A. Kondorosi. 1987.Host-specific regulation of nodulation gene in Rhizobium ismediated by a plant-signal, interacting with the nodD geneproduct. EMBO J. 6:841-848.

22. Irwin, N., and M. Ptashne. 1987. Mutants of the cataboliteactivator protein of Escherichia coli that are specifically defi-cient in the gene-activation function. Proc. Natl. Acad. Sci.USA 84:8315-8319.

23. Lagrimini, L. M., S. T. Brentano, and J. E. Donelson. 1984. ADNA sequence analysis package for the IBM personal com-puter. Nucleic Acids Res. 12:605-614.

24. Lee, N. L. W. 0. Gielow, and R. G. Wallace. 1981. Mechanismof araC autoregulation and the domains of two overlappingpromoters, Pc and PBAD in the L-arabinose regulatory region ofEscherichia coli. Proc. Natl. Acad. Sci. USA 78:752-756.

25. Lipman, D. J., and W. R. Pearson. 1985. Rapid and sensitiveprotein similarity searches. Science 227:1435-1441.

26. Maxam, A. M., and W. Gilbert. 1980. Sequencing end-labeledDNA with base-specific chemical cleavages. Methods Enzymol.65:499-560.

27. Manch, J. N., and I. P. Crawford. 1981. Ordering tryptophansynthase genes of Pseudomonas aeruginosa by cloning inEscherichia coli J. Bacteriol. 146:102-107.

28. Manch, J. N., and I. P. Crawford. 1982. Genetic evidence for apositive regulatory factor mediating induction on the tryptophanpathway of Pseudomonas aeruginosa. J. Mol. Biol. 156:67-77.

29. Maniatis, T., E. F. Fritsch, and J. Sambrook. 1982. Molecularcloning: a laboratory manual. Cold Spring Harbor Laboratory,Cold Spring Harbor, N.Y.

30. Plamann, L. S., and G. V. Stauffer. 1987. Nucleotide sequenceof the Salmonella typhimurium metR gene and the metR-metEcontrol region. J. Bacteriol. 169:3932-3937.

31. Sanger, F., and A. R. Coulson. 1978. The use of thin acrylamidegels for DNA sequencing. FEBS Lett. 87:107-110.

32. Shapiro, H. S. 1968. Distribution of purine and pyrimidines indeoxynucleic acids, p. H-31-H-36. In H. A. Sober (ed.), Hand-book of biochemistry: selected data for molecular biology. CRCPress, Inc., Cleveland.

33. Schell, M. A. 1986. Homology between nucleotide sequences ofpromoter regions of nah and gal operons of NAH7 plasmid ofPseudomonas putida. Proc. Natl. Acad. Sci. USA 83:369-373.

34. Spaink, H. P., C. A. WiJffelman, I. Pees, R. J. H. Okker, andB. J. J. Lugtenberg. 1987. Rhizobium nodulation gene nodD asa determinant of host specificity. Nature (London) 328:337-340.

35. Stragier, P., and J.-C. Patte. 1983. Regulation of diaminopime-late decarboxylase synthesis in Escherichia coli. III. Nucleotidesequence and regulation of the lysR gene. J. Mol. Biol. 168:333-350.

36. Sutcliff, J. G. 1978. Nucleotide sequence of the ampicillinresistance gene of Escherichia coli plasmid pBR322. Proc. Natl.Acad. Sci. USA 75:3737-3741.

37. Vieira, J., and J. Messing. 1982. The pUC plasmids, anM13mp7-derived system from insertion mutagenesis and se-quencing with synthetic universal primers. Gene 19:259-268.

38. Vogel, H. J., and D. M. Bonner. 1956. Acetylornithinase ofEscherichia coli: partial purification and some properties. J.Biol. Chem. 218:97-106.

39. Weber, K., and M. Osborn. 1969. The reliability of molecularweight determinations by dodecyl sulfate-polyacrylamide gelelectrophoresis. J. Biol. Chem. 244:4406-4412.

40. Wek, R. C., and G. W. Hatfield. 1986. Nucleotide sequence andin vivo expression of ilvY and ilvC genes in Escherichia coliK12. J. Biol. Chem. 261:2441-2450.

40a.Wek, R. C., and G. W. Hatfield. 1988. Transcriptional activationat adjacent operators in the divergent-overlapping ilv Y and ilvCpromoters of Escherichia coli. J. Mol. Biol. 203:643-663.

41. Yanisch-Perron, C., J. Vieira, and J. Messing. 1985. ImprovedM13 phage cloning vectors and host strains: nucleotide se-quences of the M13mpl8 and pUC19 vectors. Gene 33:103-119.

42. Yousaf, S. I., A. R. Carroll, and B. E. Clarke. 1984. A new andimproved method for 3'-end labeling DNA using [a-32P]ddATP.Gene 27:309-313.

VOL. 171, 1989

on March 6, 2021 by guest

http://jb.asm.org/

Dow

nloaded from