Top Banner
Downloaded from www.microbiologyresearch.org by IP: 54.162.190.106 On: Thu, 03 Mar 2016 17:32:52 J. gen. Virol. (1989), 70, 1-11. Printedin Great Britain Key words: CMV/nucleotide sequence/RNA 1 Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1 By THOMAS M. RIZZO AND PETER PALUKAITIS* Department of Plant Pathology, Cornell University, Ithaca, New York 14853, U.S.A. (Accepted 19 September 1988) SUMMARY The nucleotide sequence of RNA 1 of the Fny strain (Subgroup I) of cucumber mosaic virus (CMV) was determined and compared at both the nucleic acid and protein levels with the corresponding sequence of RNA 1 of the Q strain (Subgroup II) of CMV. Fny-CMV RNA 1 consisted of 3357 nucleotides and contained a single long open reading frame (ORF) of 2979 nucleotides, whereas Q-CMV RNA 1 consists of 3389 nucleotides and contains a single ORF of 2973 nucleotides. The levels of sequence homology between the two RNAs were 76~ at the nucleotide level and 85~o at the protein level. These homologies were distributed widely over the molecules, with 45 of the non-conservative differences in amino acid sequence located between amino acids 503 and 705, and another 15 ~ of the differences located between amino acids 224 and 298. While the C-terminal 141 amino acids contain more basic than acidic amino acids, the region of greatest amino acid sequence heterogeneity, amino acids 503 to 600, contained a preponderance of acidic amino acids in the putative translation products of RNAs 1 of both Q-CMV and Fny-CMV. The last 180 nucleotides of the 3'-terminal non-coding region of Fny-CMV RNAs 1 and 2 were 96~ homologous, whereas the sequence homology between Fny-CMV RNA 1 and Q-CMV RNA 1 was 64~ in this region. Furthermore, the tRNA-like secondary structures formed by the 3'-terminal non-coding regions of Fny-CMV RNAs 1 and 2 were virtually identical. By contrast, there was only 84~ sequence homology between the 5"-terminal non-coding regions of these two RNAs and 81~ sequence homology between the 5'-terminal non-coding regions of Q-CMV RNA 1 and Fny-CMV RNA 1. The non-equivalent divergence in the non-coding regions of these RNAs, as well as possible functions for the translation product of RNA 1, are discussed. INTRODUCTION Cucumber mosaic virus (CMV) is a positive-sense RNA plant virus with a genome composed of three single-stranded RNAs, designated RNAs 1, 2 and 3, in decreasing order of Mr (Peden & Symons, 1973). RNAs 1 and 2 each encode one large polypeptide, and RNA 3 encodes two polypeptides, including the Mr 25 000 virus coat protein (Schwinghamer & Symons, 1977). By analogy with studies involving other viruses with tripartite genomes (Kiberstis et al., 1981; Nassuth & Bol, 1983), RNAs 1 and 2 of CMV are believed to encode proteins associated with the replication of the virus genome. There is also significant nucleotide sequence homology between the RNAs 1 and between the RNAs 2 of alfalfa mosaic virus (A1MV), brome mosaic bromovirus (BMV) and CMV (Rezaian et aL, 1984, 1985). Similar nucleotide sequence anatyses have led to suggestions that RNA 2 encodes the polymerase subunit of the replicase, while RNA 1 encodes a nucleotide-binding protein that may also be a helicase (Kamer & Argos, 1984; Hodgman, 1988). These suggestions are consistent with the results of genetic studies with two strains of CMV (Fny and Sny), which indicate a role for RNA 1 in the regulation of viral RNA synthesis (Zitter & Gonsalves, 1986). 0000-8483 © 1989 SGM
11

Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

May 13, 2023

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

J. gen. Virol. (1989), 70, 1-11. Printed in Great Britain

Key words: CMV/nucleotide sequence/RNA 1

Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

By T H O M A S M. R I Z Z O AND P E T E R P A L U K A I T I S *

Department of Plant Pathology, Cornell University, Ithaca, New York 14853, U.S.A.

(Accepted 19 September 1988)

SUMMARY

The nucleotide sequence of RNA 1 of the Fny strain (Subgroup I) of cucumber mosaic virus (CMV) was determined and compared at both the nucleic acid and protein levels with the corresponding sequence of RNA 1 of the Q strain (Subgroup II) of CMV. Fny-CMV RNA 1 consisted of 3357 nucleotides and contained a single long open reading frame (ORF) of 2979 nucleotides, whereas Q-CMV RNA 1 consists of 3389 nucleotides and contains a single ORF of 2973 nucleotides. The levels of sequence homology between the two RNAs were 76~ at the nucleotide level and 85~o at the protein level. These homologies were distributed widely over the molecules, with 45 of the non-conservative differences in amino acid sequence located between amino acids 503 and 705, and another 15 ~ of the differences located between amino acids 224 and 298. While the C-terminal 141 amino acids contain more basic than acidic amino acids, the region of greatest amino acid sequence heterogeneity, amino acids 503 to 600, contained a preponderance of acidic amino acids in the putative translation products of RNAs 1 of both Q-CMV and Fny-CMV. The last 180 nucleotides of the 3'-terminal non-coding region of Fny-CMV RNAs 1 and 2 were 96~ homologous, whereas the sequence homology between Fny-CMV RNA 1 and Q-CMV RNA 1 was 64~ in this region. Furthermore, the tRNA-like secondary structures formed by the 3'-terminal non-coding regions of Fny-CMV RNAs 1 and 2 were virtually identical. By contrast, there was only 84~ sequence homology between the 5"-terminal non-coding regions of these two RNAs and 81~ sequence homology between the 5'-terminal non-coding regions of Q-CMV RNA 1 and Fny-CMV RNA 1. The non-equivalent divergence in the non-coding regions of these RNAs, as well as possible functions for the translation product of RNA 1, are discussed.

INTRODUCTION

Cucumber mosaic virus (CMV) is a positive-sense RNA plant virus with a genome composed of three single-stranded RNAs, designated RNAs 1, 2 and 3, in decreasing order of Mr (Peden & Symons, 1973). RNAs 1 and 2 each encode one large polypeptide, and RNA 3 encodes two polypeptides, including the Mr 25 000 virus coat protein (Schwinghamer & Symons, 1977). By analogy with studies involving other viruses with tripartite genomes (Kiberstis et al., 1981; Nassuth & Bol, 1983), RNAs 1 and 2 of CMV are believed to encode proteins associated with the replication of the virus genome. There is also significant nucleotide sequence homology between the RNAs 1 and between the RNAs 2 of alfalfa mosaic virus (A1MV), brome mosaic bromovirus (BMV) and CMV (Rezaian et aL, 1984, 1985). Similar nucleotide sequence anatyses have led to suggestions that RNA 2 encodes the polymerase subunit of the replicase, while RNA 1 encodes a nucleotide-binding protein that may also be a helicase (Kamer & Argos, 1984; Hodgman, 1988). These suggestions are consistent with the results of genetic studies with two strains of CMV (Fny and Sny), which indicate a role for RNA 1 in the regulation of viral RNA synthesis (Zitter & Gonsalves, 1986).

0000-8483 © 1989 SGM

Page 2: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

T. M. RIZZO AND P. PALUKAITIS

On the basis of nucleic acid hybridization studies (Gonda & Symons, 1978; Piazzolla et al., 1979; F. Garcia-Arenal & P. Palukaitis, unpublished results), serology (Devergne & Cardin, 1975) and peptide mapping of the coat protein (Edwards & Gonsalves, 1983), all CMV strains examined can be divided into two subgroups: a major subgroup of 33 strains (Subgroup I) and a minor subgroup of 11 strains (Subgroup II). The complete nucleotide sequence of the genome of one member of Subgroup II, Q-CMV, has been determined (Davies & Symons, 1988 ; Rezaian et al., 1984, 1985), and we have determined the nucleotide sequence of RNA 2 of a Subgroup I Strain, Fny-CMV (Rizzo & Palukaitis, 1988). In this paper, we report the nucleotide sequence of Fny-CMV RNA 1, and compare this sequence to Q-CMV RNA 1. The nature and extent of sequence diversity between RNAs 1 and 2 of strains representative of Subgroups I and II are discussed.

METHODS

Complementary DNA cloning. Fny-CMV was propagated and isolated, and the viral RNAs were extracted and purified as previously described (Palukaitis & Zaitlin, 1984). Complementary DNA was prepared to total CMV RNA by the procedure of Gubler & Hoffman (1983), using a decanucleotide primer (5'-TGGTCTCCTT-3') complementary to the Y-terminal 10 nucleotides of all four CMV RNAs. This c D N A was blunt end-ligated to SmaI-linearized pUC 18 (Norrander et al., 1983) and cloned in Escherichia coli strain JM 101 (Messing, 1979) using the transformation procedure of Messing (1983). Colonies containing CMV RNA sequences were detected and recombinant clones were screened, both by nucleic acid hybridization as previously described (Rizzo & Palukaitis, 1988). A cDNA library specific to part of RNA 1 also was prepared using total CMV RNA by the method of Ahlquist (1986). First strand cDNA was primed with an oligonucleotide (5 '-GTCGACGAATTCAAGAT- GATCTTGCGCA-3'), the 3'-terminal 19 nucleotides of which are complementary to the 5' end of the cDNA insert of pFnyl00 (see Results and Discussion). The 3'-terminal six nucleotides contain an FspI site, while the 5'- terminal 12 nucleotides contain a Sai l site and an EcoRI site. Second strand cDNA was primed with an oligonucleotide (5 '-GGATCCGCATGCGTTTATTTACAAGAGCG-Y), the T-terminal 17 nucleotides of which correspond exactly to the 5' end of RNA 1 (determined by direct RNA sequencing; see below). In addition, the Y-terminal 12 nucleotides of this oligomer contain a BamHI site and an SphI site. The above cDNA was blunt end-ligated to Hincll-linearized pUC18. Cells of E. coli strain DH5ct (Jessee, 1986) were made competent and transformed with the ligation mixture. Plasmid DNAs from the resultant white colonies were screened by restriction enzyme analysis for the presence of inserts.

Nucleic acid purification, manipulation and sequencing. The large scale isolation of plasmid was done as described by Maniatis et al. (1982) using the alkaline lysis procedure. DNA fragments were separated by electrophoresis and extracted from low melting temperature agarose (Maniatis et al., 1982).

Fny-CMV RNA was decapped with tobacco acid pyrophosphatase, dephosphorylated with calf intestinal alkaline phosphatase, and 5' end-labelled with [~-32P]ATP by polynucleotide kinase as described (Garcia-Arenal et al., 1987). End-labelled RNA 1 was isolated by agarose gel electrophoresis as previously described (Rizzo & Palukaitis, 1988), and direct RNA sequencing was done as described by Garcia-Arenal et al. (1987) and Haseloff & Symons (1981).

Recipient E. coil strains JM109 (Yanisch-Perron et al., 1985) or DH5ctF' (Liss, 1987) and vectors M13mpl8 or M13mpl9 (Norrander et aL, 1983) were used for the construction of a Bal 31-generated, ordered set of deletions (Poncz et al., 1982) used for sequencing Fny-CMV RNA 1. M13 DNAs containing CMV RNA 1 sequences were prepared and sequenced by the dideoxynucleotide chain termination method of Sanger et aL (1977, 1980) with the modifications previously described (Rizzo & Palukaitis, 1988). The nucleotide sequences of the Bal 31-generated, ordered set of deletions of the cDNA clones of Fny-CMV RNA 1 were oriented with respect to each other with an IBM Personal Computer AT equipped with the Microgenie Sequence Analysis Program (Beckman). All other sequence analyses were done with this computer program as previously detailed (Rizzo & Palukaitis, 1988).

Enzymes and chemicals. Avian myeloblastosis virus reverse transcriptase was purchased from Promega Biotec and United States Biochemicals. All other enzymes and chemicals were as previously described (Rizzo & Palukaitis, 1988).

RESULTS AND DISCUSSION

Iden t i f i ca t ion o f R N A 1-speci f ic c D N A clones

A f t e r s c reen ing a n in i t ia l c D N A l i b r a ry for R N A 3- a n d R N A 4-specific c lones by co lony h y b r i d i z a t i o n w i t h r a n d o m l y p r i m e d , 3zp- labe l led c D N A p r e p a r e d to gel -pur i f ied R N A 4, the r e m a i n i n g c lones were s c reened by a n a l y s i n g in se r t size. T h e la rges t p l a s m i d (pFny200) was s h o w n to be a nea r ly fu l l - length R N A 2-specific c D N A c lone (R izzo & Pa luka i t i s , 1988). T h e

Page 3: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

CMV RNA 1 sequence 3

next largest plasmid (pFnyl00) contained an insert of length 2.9 kb. The pattern of restriction fragments after agarose gel electrophoresis was different for these two clones, suggesting that pFnyl00 was specific to RNA 1 (results not shown). A Northern blot of total CMV RNA was probed with 3zp_labelled pFny100, which specifically hybridized to RNA 1 (results not shown).

The RNA 1-specific eDNA insert in pFnyl00 was approximately 0.55 kb shorter than expected for a lull-length clone. Subsequent nucleotide sequencing of this insert, and comparison with the nucleotide sequence of Q-CMV RNA 1, indicated that pFnyl00 did not contain eDNA representing the 5'-terminal 0.55 kb of Fny-CMV RNA 1. Thus, oligonucleotide primers were used to generate a second eDNA library specific to the Y-terminal 0.55 kb of Fny- CMV RNA 1. Clones in an RNA 1-specific eDNA library (putatively carrying inserts representing the cDNA segment missing in pFny100) were screened by analysing insert size. Of plasmids screened in this library, five insert-carrying plasmids contained the expected 0-55 kb insert (results not shown), and one of these (pFnyl01) was chosen for further study.

Strategy for sequencing Fny-CMV RNA 1

The nucleotide sequence of Fny-CMV RNA 1 was determined by sequencing the overlapping eDNA inserts of pFnyl00 and pFnyl01. The construction ofa Bal 31-generated, ordered set of deletions used in sequencing these eDNA inserts has been described previously (Rizzo & Palukaitis, 1988). The set of deletions covering the inserts of pFnyl00 and pFnyl01 is illustrated in Fig. 1.

The 5'-terminal 33 nucleotides of Fny-CMV RNA 1 were determined by direct RNA sequencing. As expected, the 5' end of the eDNA insert of pFny101 contains this authentic viral sequence. By contrast, seven 3'-terminal nucleotides of the primer sequence were lost from pFny 100 during the cloning procedure. The complete nucleotide sequence of Fny-CMV RNA 1 is presented in Fig. 2.

The region of overlap between pFnyl00 and pFnyl01 is 73 nucleotides long (nucleotides 483 to 555). There is one discrepency between these two sequences, at position 484 of Fny-CMV RNA 1:pFnyl00 contains a T residue while pFnyl01 contains a C residue. However, this nucleotide is at the third position ofa codon in the large open reading frame (ORF) (see below), and both codons encode a serine residue. Hence, this difference is most likely due to the cloning of a variant RNA 1 molecule rather than a eDNA cloning artefact in either pFnyl00 or pFnyl01. This region of overlap was sequenced in three non-sibling clones carrying the 0.55 kb insert; all were identical to pFnyl01. However, this difference in nucleotide sequence may be due to 'sequence drift', since the cDNAs in pFnyl00 and pFnyl01 were synthesized from different RNA preparations.

Fny-CMV RNA 1 encodes one long open reading frame

The only long ORF of Fny-CMV RNA 1 begins at the first AUG codon at nucleotides 95 to 97 and contains 2979 nucleotides, encoding a 111404 Mr protein (993 amino acids; Fig. 2). By contrast, the translation product of Q-CMV RNA 1 has a predicted Mr of 110930 (Rezaian et al., 1985).

In Fny-CMV RNA 1, the next largest ORF on the positive strand begins at residue 2328 and is 165 nucleotides long; Q-CMV RNA 1 has a corresponding 207 nucleotide ORF also starting at residue 2328 (Rezaian et al., 1985). The longest ORF on the negative strand of Fny-CMV RNA 1 begins at position 1090 of the positive strand and extends 411 nucleotides; a counterpart ORF in Q-CMV RNA 1 begins at residue 1093 and is 198 nucleotides long. The second largest positive strand ORFs and the longest negative strand ORFs of Q- and Fny-CMV RNAs 1 encode polypeptides that are only 30~o and 23~o homologous, respectively.

Comparison of Fny-CMV RNA 1 and Q-CMV RNA 1 nucleotide sequences and translation products

The alignment of the RNA 1 sequences of Fny-CMV and Q-CMV is shown in Fig. 3. Putative translation products of the two RNAs are aligned in Fig. 4. The nucleotide sequences and corresponding translation products have overall homologies of 76~ and 85~, respectively.

Page 4: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

4

p F n y l 0 0 . • •

T. M. RIZZO AND P. PALUKAITIS

Q Kb 0

I R

1 0 0 . 1 R I A 6

100 .1A8 100.1A9

100.1AI0

100.1All

100.1AI2 100.1RIA13 100.1RIA14

100.1AI5 100.1A16

100.1A17 - -

9Fn~101.2A2,

101.2A1

1 0 1 2

- - t 0 0 . 2 R I A I 6 - - I00 .2AI5

100.2A14 100.2A13

100.2A12 100.2All

100.2AI0 100.2A9 100.2A8

100.2A7 100.2A6 100.2A5

100.2A4

100.2A3

~ I 0 0 . 2 R 1 A I 100.2R

1 2

St X

100.1A3

1 0 0 . 1 R I A 4

| 00 .1A5

1 0 0 . 1 R

p F n y l 0 1 - •

Kb 0

100.2A2

100. IR1A7

R F ©

F S p F n y l 0 1 . 1

101.1A1 - - 1 0 1 . 1 A 2 - -

0-5

Fig. l. Ordered set of overlapping deletions used for sequencing cDNA clones of Fny-CMV RNA 1 : pFnyl00 and pFnyl01. The ordered set of deletions was generated by digestion with nuclease Bal 31 from both ends of the inserts of the two clones. The ends of the cDNA corresponding to the 5' and 3' termini of RNA 1 are indicated. The heavy lines represent vector sequences. Abbreviations: F, FspI; R, EcoRI; RV, EcoRV; S, SphI; St, StuI; X, XhoI.

Unmatched nucleotides within the coding regions are localized in three specific R N A segments (Fig. 3). These segments show codon usage realignment (described as frameshifting by Rizzo & Palukaitis, 1988) with respect to one another. Hence, nucleotide sequence alignment indicates deletions of either three (both segments A and B) or 17 (segment C) nucleotides in the coding sequence. Because the differences between the two strains in segment C do not lead to the addition or deletion of amino acids, the Q-CMV R N A l-encoded polypeptide is two amino acid residues shorter than the polypeptide encoded by Fny-CMV R N A 1. All three codon usage realignments result in corresponding protein segments with no amino acid homology (Fig. 4).

The nucleotide and amino acid sequences of Q- and Fny-CMV R N A s 1 both have higher levels of homology (76 ~ and 85 ~ , respectively) than those of Q- and Fny-CMV R N A s 2 (71 and 73 ~ , respectively) (Rizzo & Palukaitis, 1988). In the case of R N A 2, however, homology at both the nucleotide coding region level and translation product level is greater in the central one-

Page 5: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

C M V RNA 1 sequence

1 m7GpppGUL~UUt~AC~U~GCGUACGGUUEJULUCCCUGc~UCCCCUGU~UU~CUACCCUUUC~U~. C C U ~ U C U U U C U ~ U ~ U U ~ U U C ~

M~t A ~ ~}" Set Se t Phe A811 l~e Ae~z CZU Lew VaZ A~a Set I/£a CZ¥ Asp ~ G~ l~w Leu A~z 7~zr AZ~ Z4u Va~ Asp L~m Thy AZG 95 A._~ GCG ACG UCC UCG UUC AAC AUC AAu £~AA ULIG GGA GCC UCC CAC GGC GAU GGA CGA CUE GCG ACE GCC CUE GUU GAU #J~G AC, A CGU

ll~n GZu Gin Le~ GZ~ GZw GZn l~u GZn Hie GZn At@ A~ GZy A~ L~e VaZ TI~J I~a Ar~ Aen VaZ L~ GZy VaZ L 8 Aep Scz" GZ~ FaZ 185 CAU GAG CA(; CUC ~G GAG C~ UGA CAG GAU CM CGU A~ GGC CGU AAG GUC GAC AUC C~ ~AC GUU UUr, ~U GUA ~G ~U UCC GAG GUC

]Z. Ar~ ASh A~ ~J~ GZ~/ Clef L~8 T~ir ~1~p L~ H~a Lew Thz" GZn G~ ~w P~ A~z Pro H~# G~ [~u A~ G~¥ A~z L~w Az~ L¢~ ~g 275 AUC EGG AAU EGG GAU GGA riGA AAG GAC GAC CUE CAU CUU ACE GAG CAG GAG UUU GCU CCC GAC GGC CUA GCU GGU GCC CUE CGC UUG

GZU Thr L~ Asp C~e L~u Aa~ Set Phe P~o Se~ ~er GZ~ Lew Ar~ G~n A~ Le~ Va~ Lew Asp Ph~ G~ G~ Se~ Tr~ V~ ~ Hi~ ~d~ 365 G~ ACU CUC GAU UGU CUA GAC UCU UUC CCU UCA UGA GGU CUG CGG CAG GAC CUC GUC UUA GAC UUC GGA GGA AGU UGG GUC ACA CAU UAC

Lea Ar~ GZ~ Hie Aen Va~ Hie eye C~e Set Pro Cye Lew GZ~ IZe Ar~ Asp L~s Met Ar~ His Ala GZu Arg Leu M~ Ann M~ ~ L~ 455 CUC CGC GGA CAU AAC GUA CAC UGU UGu UCC CCU UGU UUG GGU AUC CGC GAU AAG AUG CGC CAU GCG ~ CGC UUA AUG GAC AUG CGC AAG

I~e I~e Le~ Ash Asp Pro Gin Gin Phe Asp G~ Ar 8 G~n P~o Ae~ Phe C~e Th~ G~n ~ A~a AZa Asp C~8 L~e Va~ C~ ALa Hi~ Phe 545 AUC AUC UUG AAC GAU CCA CAA CAG UUC GAU GGU EGG CAG CCG GAU UUC UGC ACU CAA CCG GCU GCG GAU UGC ~ GUA CAA GCC GAC UUU

A~a IZe Set IZe Hi~ Gl~ GL¥ T~r Asp Me~ Gly Phe Ar~ GZ¥ Le~ C~8 GZ~ Ala Met AS~ ALa H£S GZ¥ Thr ~hr I~ L~ L~ G~ Th~ 635 GCU AUA UCU AUU CAU GGA GGU UAU GAU AUG GGC UUU AGA GGA UUA UGU GAA GCG AUG AAU GCU CAC GGA ACC ACU AUU UUG GAG GGA ACG

Me~ Met Phe Asp GZ~ A~a Me~ Met Phe Asp A8~ G~n CZ~ Val IZe PrO GZ~ Le~ Aen C~a GZn ~rp Ar~ L~e I~e Ar~ S~r AZa Ph¢ SeP 725 AUG AUG UUC GAU GGIJ GCG AUG AUG UUU GAC GAC CAA GGU GUA AUA CCC GAA CUU AAU UGU CAG UGG AGG AAG AUC AGG AGU GCU UUC UCC

G~U Thr Clu Aap Vag Thr Pro Leu Val Gly LgO Leu Ash Set Thr VoI Phe See A~ VaZ Arg Lye Phe Lys Thr Met Pal Ala Phe Aep 815 GAA ACU GAA GAC GUC ACA CCA CUG GUU GGU AAG CUff AAU UCC ACA GUU UUC UCC CGC GUG CGG AAG UUC AAG ACG AUG GUA GCU UUU GAU

FEe Ile ASh Glu Se~ Thr Met Set T~r Val His Asp T~p Glu Ann Ile L~s Se~ Phe Leu Thr Anp G~n Thr T~r Set ~Jr Ar~ Gl~ Me~ 905 UUC AUC AAC GAG UCU ACU AUG UCU UAU GUU CAU GAU UGG GAG AAU AUA AAA UCU UUU CUU ACG GAC CAG ACU UAU UCG UAC CGA GGG AUG

Thr u~r G~ Ile G~ Arg C~e Val I~e Hi~ A~a G~y I~e Met Thr Tyr Ly~ Ile I~e Gl~ Va~ Pro G~y Met C~e Pro ~o GZ~ Leu Il~ 995 ACU C GGU AUC GAA CGC UGC GUU AUC CAC GCU GGU AUU AUG ACG UAC AAG AUU AUC GGU GUA CCU GGG AUG UGC CCA CCC GAA CUC AUU

Az~ H~8 C~s I~e T~o Phe Pro Set IZ¢ L~ A~p T~r Yo~ Ol~ Le~ £~s IZe Pro AL~ Set GZn Asp LCW Fa~ C~u T1~p L~ Thr Va~ Ar~ 1085 CGA CAU UGU AUU UGG UUC CCC UCU AUU AAA GAC UAU GUU GGU CUA AAG AUC CCC GCG UCG CAG GAU UUG GUU GAG UGG GA.~ AGA GUG CGG

YZe Leu ~hr Set Thr Le~ Ar~ Glu Thr Gll~ G~ Ile AZa Met A~ 6'~6 T~r Ash A~p Lye Lye Ala Trp Met Glu G~n ~he L~8 ~al I~ ) 75 .~L~O UUA .~CG UCA ACA UUA GGU GAG ACU GAA GAG AUU GCU AUG AGG UGU UAU AAU GAU AAG AAA GCG UGG AUG GAA CAG UUC AAG GUU AUC

Le~ G~ Va~ Leu Set Ala L~8 Ser Sar Thr Ile Val Ile A~n G~ Met Set Met GZn Set Gl~ G~u Arg Ile Asp I~e Ash Asp T~r His 1265 eUA GGU GUU CUA UCC GCG AGA UCA UCU ACC AUU GUU AUU AAU GGU AUG UCC AUG CAA UCU GGC GAG CGA AUA GAC AUU AAU GAU GAU GAC

• Hr Ile G~ Phe Ala Ile Le~ Lew His Thr Lye Met Lys ~yr Gl~ G~n Lew GZy Lys Met Tyr Asp Met Trp Ash Ala Set Set I~e Set 1355 GAc AUC GGU UUC GCC AUU CUU CUU CAC ACA AAA AUG GAG UAC GAA CAA CUU GGA AAA AUG UAU GAU AUG UGG AAU GCU UCG AGU AUU UCG

Trp Ph¢ AZa A~a Leu ~hr Ar~ Pro Le~ Ar~ Val Phe Phe Set Ser Va~ v~l Hi~ Ala Leu Phe Ppo Thr Leu Arg Pro Arg G~ Glu 1445 ~ ~GG UUU GCC GGG UUG ACU CGU CCG CUG CGU GUG UUU UUC UCC AGU GUU GUU CAC GCA CUA UUC CCG ACU UUG AGA CCC GGC GAG GAA

L~e Glu Phe Leu IZe L~e Lea Set Thr Phe Val Thr Phe Ash G~u Glu Cys Set Phe Asp G~ GZ~ Glu G~u Trp Asp Val Ile Set Ser 1535 AM GAA UUU UUG AUC AAG CUC UCC ACC UUC GUG ACU UUU AAU GAA GAG UGC UCA UUU GAC GGU GGA GAG GAA UGG GAC GUG AUA UCA UCC

AZa Ala Tyr Va~ AZa Thr GZn Ala Val Thr Asp GZy Lys IZe Leu Ala A~a G~n Lys Ala Glu L~e Leu Ala C~u L~s Leu Ala Gin Pro 1625 GGU GCA UAC GUU GCU ACG GAG GCU GUU ACG GAU GAG AAG AUU UUG GCU GCG GAG AAA GCC GAG AAG CUU GCU GAG AAG CUU GGA CAA CCC

Va~ Set G~u Val Set Asp Set Pro GIa Thr Set Set GLn Thr Pro Asp A~p T~r Ala Asp V~ C~ G~ Ar~ G~ A~ GIu Va~ Set G~u 1715 GUG AGU GAG GGA UCG GAC AGU CCU GAG ACC /)CA OCU GAA ACG GCU GAU CAll AGU GCU GAU GUU UGU GGA AGG GAG CGA GAG GUU UCG GAA

Leu Asp Set Leu Set A~a CZn Th~ Arg Set Pro Ile Thr Arg VaZ Ala G~u Arg AZa Thr hZa Met Leu Glu Tyr ALa A~a T~r GZw L~a 1805 CUC GAC UCC CUA UCA GCU CAG ACA CGU UCC CCC AUC ACU AGA GUU GCU GAA AGG GCU ACU GCU AUG UUA GAG UAU GGC GCU UAU GAG AAA

G~n Leu Hi~ Asp Thr Thr Va~ Set A~ Le~ L~s Ar~ IZe Trp A~ ble~ Ala Gl~ G~ Asp Asp L~ Arg Ash Set Leu GZu Gl~ Ash Leu 1895 GAA UUG CAG GAC ACU ACA GUG UGU AAU UUA AAA CGU AUU UGG AAG AUG GCG GGC GGU GAU GAC AAA AGA AAC UGG CUC GAG GGU AAU UUG

L~ Phe Val Phe Aep'~hr T~r Phe Th~ Va~ Asp Pro Met V~ A~n I~e Hi~ Phe Set ~r G~ Arg Trp Met Arg Pro Val Pro Glu GZ~ 1985 AAG UUU GUU UUC GAU ACG UAC UUU ACC GUU GAU CCO AUG GUG AAG AUU GAU UUC UCG ACG GGU CGE UGG AUG GGU GCU GUG GEC GAG GGU

IZe Val Tgr ~er Val Gl 9 Tgr Aen G~ Arg Gly Leu Gl~ Pro L~e Set Asp Gly GZU Le~ T~r I~e Va~ ASh Set Gl~ C~a VaZ Ile C~ 2075 AUU GUU UAU UCU GUC GGU UAU AAU GAA GGC GGU UUA GGU CCG AAG UCU GAU GGA GAG CUU UAC AUU GUC AAU AGU GAA UGC GUG AUC UGU

ABn Se~ GZw Set Lea Set Thr Va~ Thr Arg S~r Leu GZn A~a GZ~ Thr Gly ~hr i~e Se~ G~ VaZ Asp G~ Val AZa G~ C~ ~ L~ 2165 AAC AGU GAA UCU UUA UCC ACU GUC ACG CGU UCU CUU CAA GCU GGA ACC GGG ACC AUU AGU CGA GUU GAC GGA GUU GCU GGU UGU GGG

Thr Thr Ala Ile L~ Set ILe Phe GZu Pro Set Thr A~p ~et IZe Va~ Thr A~a Ash Lya Z~s Set Ala G~n Asp Va~ Arg Me~ Ala Leu 2255 AGG AGG GCA AUU AAA UCC AUU UUU GAG CCG UCC AGU GAC AUG AUC GUU ACC GCG AAC AAG AAG UCE GCC CI~ GAI) GUA CGU AUG GEA CUU

Phe L~S Set Set As~ Set L~ G~u ALa C~ Thr Phe VaT~ Ar~ Uhr A~a Asp Set, Fa~ Leu Lew A~n G~ C~ Pro ~hr Val Set Az~ ~Z 2345 UUG AAA UCG UCG GAU UCC GAA GCU UGC AGC UUU GUU CGA ACA GCC GAU UCU GUG CUA CUU AAU GAA UGU GGG ACU GUU AGU AGG GUU

Leu VaZ Asp Glu VaZ Val Leu Leu H£e Phe C~ Gin Le~ C~e A~a Va~ Me~ Set L~e Leu L~e ALa VaZ Ar~ ALa IZ~ Cge Phs GZ~ A~p 2435 uur, GUU GAC GAG GUC GUU CUG CUA GAC UUU GGU CAA UUA UGU GCC GUC AUG UCU AAG UUG AAG GCC GUG CGA GCU AUA UGU UUU GGG GAU

S e t GZu 6Z. IZe A~ Phe Sat Se t Ar~ Asp AZa Se~ Phe Asp Her Ar 9 Phe Ser L ~ i l e ILe Pro ASp G~u Thr Se t Asp AZa Asp ~hr 2 5 2 5 UGG GAG CAG AUU GGG UUU UGC t~GG C~U GAU GGU UCA Ut.S GAC AUG C~U UUC tSGU GAE AUU AUU CCU GAU GA~ ACt' A~U GAU ~CU C~C ACC

~r Phe A~ Set Pro G~n Asp Va? Va~ Pro Le~ Va~ Arg Leu Met A~a 7hr Lge A~a Leu Pro £~s Gly Thr Hie Set L~s Tyr Thr L~e 2615 AGA UUC GGU AEC CCA CAA GAU GUU GUA CCG CUU GUG CGU UUA AUG GCU ACG AAG GCC CUU CGG AAA GGA A£G CAU UGA AAA GAG ACG AAA

Tr~ VaZ Set GZn Set L~s VaZ Z~e A~g Set val Thr Set Ar~ Ala Ile Ala Set Val Thr teu VaZ Asp Leu Asp Set Set Ar~ Phe ~dr 2705 UGG GUU UCU CAA UCU AAA GUG ~ AG, a UCU GUU ACA UCU CGU GCC AUC GCU AGU GUG ACA UUG GUU GAC UUG GAU UCU UCG AGG UUC UAU

I t s Thr Met Thr Gin AIa ~np L~s A~ Se t ~UU u IZe Se t Ar~ Ala £ys GZu Ne~ Arm Leu PI~ L~/S Tar Phe 7~p Aen Glu AI~ IZ¢ 2 7 9 5 AUA ACE AUG AGG r .~ GCU GAU A~G GCC UGA G AAU UCA AG~ GCG AAA GAG AUG ~U GGA CGA GA~ AEU UUG UC~ AAU C, SU~ A ~ AUU

~ r vaZ His GZu .Set GZn GZy IZe Set GZu Asp Ills VaZ The" Leu ~ Z A ~ [eu Lys Se t Thr Lye Cys Asp Leu ~he Lys GZn ~he S e t 2885 ACC GUG CAU GAG UCU GAG GGU AUC UCC GAA GAC GAG GUG ACU UUG GUA AGA UUA AAG AGU AGA /k~ UGU GAC CUG UUU AM CAG UUU UCU

~r C~S Le~ VaZ ATa Leu 7%r A~ R~s L~e Vat Thr Phe A~ 7~r G~u ~dr C~s GZ~ VaZ Leu Aen G~ Asp leu Ile ALa GZu uC~G~ Val 2975 UAU UGU CUG GUU GCA CUE ACG AGA CAC .~G GUC AGA UUC CGC UAC GAG UAC UGU GGU GGA UUG A~£ GGC GAU UUA AUC GCC GA~ GUU

A/.~ Arg A~z 3065 Gcu CGU GCU GAG CGGU~UC~C~CUU~GGGCGGGAUCuGAGUUGGGGGUAAUCUGGAMCCGUCUGA.~GUCACUAMCACAUUG~GUGGUGAACGGGUUGUCCAL~CCAG-CL~AA~GG

3 [ 80 CUA~U~.UGGUGAGUCG.UGGAGAAAUCCACGGGAGUAGAGLgJACAAG.UCUCUGAGGGAGGUUUGAJ~CCAUCUCCuAGGUUUGUUCG.GAAGGAC~C~UCCG~AC~CU~

3 2 9 9 GUGCUAGUUUCAGGGUACGGGUGCCCCCGACUUUCGUGGGGGUCUCUA/UtAGGAGACCA 3 3 5 7

Fig. 2. Nucleotide sequence of Fny-CMV RNA 1 and the encoded amino acid sequence of the large ORF. The initiation and stop codons are underlined.

Page 6: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

6 T . M , R I Z Z O A N D P , P A L U K A I T I S

L ~uuuuAuutmc~cc~u~`cG~,uuc~`cccc~ccucc~cu~u~j~ct~AcccUUu~Uuc~ucUu¢ucGAUuUwUUcU~UU~cc~u~c~ccu~u I I I I I I I I I f l l H I I r l I l I f T r J r J iJrFr l r rPi I I I I I I I I r l I I I F F r F I r F e l l I l l II ! JJ IJI r I I I I I ITTTI I I I I I I I I I I I H I I J I

[ G - mJUAUU~CAAC~GC~A. C~UUCAXUC C CU~CCUCCCCUGUAA~C ~CCCUUUC~C C UC - UC ~UCUUAAUC U - UUUCUUUC,~UUCC U ~ C ~ C C ~ U

[21 CAAI.ca~C UraGUAGCCUCCCAC C~GCC.Ct UAMGGACUAC U UGCGACAGC C C UC GUU GAU AAGACAGCUCAUGAACAGC t~G,~ GC~GC AAC U UC~CAU~C ~ C C ~ 1111111 IIIIIil111111111111111111111//I I l l l l / l l l l l l l l ) l l l l l l l l l l l l l l l l I ] l l l I11111111 I I I I T I I I I I I I I I l l l l I I I I I r l l l

118 CAAUGAAU~UAGCCU~.CA~GG~GAUA~AG~A~UA~UCGCGAC~GCCCU~GUUGAUAAC~CAGCUCA(~'AG~AGCU~C~G~G~UUACAG~A~C~G~CC~U~

241 CAUCCGUAAUGUUUUGGAUGUA~GGACUCCGAAGUCAUUCGClACUCGGUAUGGUGGUAAGUACGAC CUCCAUCUCGCCCAGCAGGAAUUAG£ UCC C CAUGGCC~CCCUCCG l l l l l g II I I I I I I I I I I I I I I I I r l l l l I I I I I I I I I I I I I I I I I f I I i r r l l r l r r l J l l l l l l T I I I I I I I I I I I i l l l l l r r I l l l p i i I I I I l l l l l p l l

238 CAUCCGGAACGUUUUGGGUGUAAAGGAUUCCC,,AGGUCAUCCGG/~UCGGUAUGGAGGAAAGUACGACC UC CAUCUUACCCAGCAGGAGUUUGCUCC CCACGGCCUAGCU~CCCUCCG

361 CUUCIUGI/C~I~ACUCUCGAUUGU~UAGACUUUUU C CCUC GU UCAGGUC U(IC GGCAGGAC C UCGUC UUAC~UUU CGC~C~GUU ~GGU CACA~CUAUCUU C G C ~ C ~ C ~ A C A ~ G I I I I H I I I I I I I I I I I I I I I r I I H l l l I l l l l l l I I i l l l l l l l l r r l l l l l r l p l r l l l l l l i i i i i i i i i ] J l l l l ] l l r l l l l i I i i p l l l l l l l i l l l l l f l i I

358 c UUGUGUCiA~ CUC U (?Ct, AUUGUC UACIA C UC UUIIC C C UU C AUC AGGUC rIG CGGC AGGAC C UC GU C UUA~CUUCG~G~UG~C~C ~C &UU~CC UC C ~ C ~ C ~ A ~ C ~

481 CUGCUCCCCAI/GUUUGGGUAUUCGUGACAAGAUGCGU CACAC UGAAAGGUIIGAUGAGCAUGCGCAAGGUCAUUUUAAACGAUCCACAACAGUUU~U~CCGCC~CC~C UUU~C II t J l l l I I I I I I I l l f r II Ir i l l l r r r r rr r i i i r i! i i r r r r rerrr r I I I I II i i i I I I I I l l l l l r r r e I I i i / rr i i i l l l l l i i i i i i i

478 UUGUUCCCCUUG~UUGGGUAUCCGCGAUAAGAUGCGCCAUGCGGAACGCUUAAUGAACAUGCGCAAGAUCAUCUUGAACGAUCCACAACAGUUCGAUGGUCGGCAGCCGGAUUUCUC~AC

~01 U JU~GUCLIGCGGCUGAAU GCA/~G UUCAAGCCCA CUUUGCU AUUUC UAUACAC GG,AGGAUAUGAUAUGGGCUUUAGAGC-,UUUAUGU ~ G G C ~ C ~ U CAC ~CUAC~U UU U ~ I I I I I II II I rPr l l IT I r r l i r a f r p l l r ] r p p r l r r l II ere r r r l r l l l l l l l l l l l l l l p I l l l l ] l l ] / r p l l l i I i r l l l l l l l II i i i ] ]111

598 UCAACCGGCUGCGGALIL~GUACAAGCCCAC UUUGCUAOAUC UAUUCAUGGAGGUUAUGAUAUGGGCUUUAGAGGAUUAUGUGAAGCGAUGAA~CUCAC~CCACUAUUUU~

7~I A~GAC, GAt.IGAUGUUCCCNCGGUGCI, IAUGAUGU'd'~ACG AC CAA C~G CU UUA ULIC C UG AAC U LIA ~ UGUC AGUGGC G ~ U U ~ U C C G C C UUUUC U ~ G ~ G~ UG C ~ C UU GUUC IP r l l l l l l l l l l l l i I l l l l I J I I I I I I I J I l / I r l J l i r I I] P l l l J I l l I J r l ] J r l [ J II r[ r rl i i i i i i I] II rl ~

718 GGGA~CGAUGAUGUUCGAUGGUGCGAUGAUGUUUGACGACCAAGGUGUAAUACCCC~AACUUAA~/UGUCAGUGGAGGAAGAUCAGGAGUGCUUUCUCCGAAACUGAA~GUCACACCACU

841 AGCA6CU.~ACUCAAUUCUAGUGUUUUUUCACGCGUGC~A~UGGGAAAACCUUAAUCG~ AUUUGACUUCGUUGAGG&AUCCACGAUGUCUUAUGUUCACG.AUUGGGA~JAAUAUAA/~UC I I I I I II 111111 I I Ipr J] l i r l l l l ] II ]] JJ ] i i ipJ i ] l l r i i Ji i i i i i i l l l l l ] J J r l I I ] ] F r l r r p p l l l p l l J l

838 GGUUGGUAAGCUUA.~UUCCACAGUUUUCUCCCGCGUGCGGAAGUUCAAGACGAUGGUAGCL~UUUGAUUUCAUCAACGAGUCUACUAUGUCUUAUGUUCAUGAUUGGGAGAAUAUAA~UC

961 GUUUAUGACAGAUCAAACGUACUCUUUCAAUGGUAUGACCUAUGGAAUUGAGCGUUGUGUUA U UUAC G C U GGC G UGAUGAC U UACAAGAU UGUUGGCGUGCC UGGUAUGUGUCCGCCCGA IIr I JI II II II II I l l I II I I l l l I¢ II I¢ ¢1 II II11¢ I l l l l l g I I l l l g I l l l l l l l l I II II I I I I I 11111 II l l l l l

95~ UUUUCUUACGGACCAGACUUAUUCGUACCGAGGGAUGACUUACGGUAUCGAACGCUGCGUUAUCCACGCUGGUAUUAUGACGUACAAGAUUAUCGGUGUACCUGGGAUGUGCCCACCCGA

1o8~ ACUCAUUCraACAUUGUAUCUGGUUCCCCUCUAUGAAGGACUAUGUUGGUCUUAAGAUUCCCGCGUEUGA UGACUUtIGUUAAAUGG~CAGUCCGUAUU UUACUGU~UUACG~ r l l r I J i i r l l l r l l l l l i i i i I i i i i i r T r r ir p r r l l r l l r r l l l l ] lJ i i i J IJJ I J II JI IJI I J r r l E l l l l l l l IJ i i i JJr J l l J r r l r r l l l l p l

1o78 ACUCAUUCGACAUUGUAUUUGGUUCCCCUCI/AUUAAAGAC UAUGUUGGUC UAAAGAUCCCCGCGUCGCAGGAUU UGGUUGAGUGGAAAACAGUGCGGAUUUUAACGUCAACAUUACGUGA

1201 GAC~AAGAAAUAGCUAUGCGUUGUUAUAACGACAAGAAGAAUUGGAUGGAUCUAUUCAAGAUUAUUCU~GGUGUUUUAUCAUCGAAGUCCUCCACGAUCGUUAUCAAUGGUAUGUCUAU I I I I I r l I | H I J ] l l l J ] l l l l l l l rr r l l l r i l l r r i l l J i ] l l J r I l r l ]1 i r l r r l i l r r r i l l pl i i / l / J I r r l l H H r l [ r r l r H 1198 GACUCd~GAGAUUGCUAt~C~GGUGUUAUAAUGAUAAGAAAGCGUGGAUGGAACAGUUCAAGGUUAUCCUAGGUGUUCUAUCCGCGAAAUCAUCUACCAUUGUUAUUAAUGGUAUGUCCAU

132~ GCAAUCCG~UGAACGUAUUGAUCUCAAU~AUUAUCAUUA~AUCGGUUUU~UAUUCUUCUCEACACGAAAUUAp~AUACG/~ACAACUUGG~UGUAC~UAUGUG~UGCUUCCUU I l l i l l II I I r i l l i i I F iF i r lF r r l l ] l r r l J i J i i J i i i i i i J ] l r l i l l l iJi I Ji i i i J I J H J I r r l J r r l l l l l r r r r r p l r r l r r p l l l l l r

1318 GCAAUCUGGCC, AGCGAAUAGACAUUAAUGAUUAUCACUACAUCGGUUUCGC CAUUC UUCUUCACACAAAAA UGAAGUACC~ACAAC UUGGAA.~AUGUAUGAUAUGUGGA, AUGCOUCGAG

1441 CA UUUGGAAGUGGUUUGCGUCUAUGUCUAGACCAUUCCGUGUUUUCUUUUCCACUGUUGUUAAGACUUUGUUUCCGACUUUGAGACCGCGCGAGGAAAAGGAGUUUUUGGUCAAACUUUC I I I I r r l r l l J r ] r l r I Ir I i J JJ i i r l l l rE Ir rFii I I I i i i r I r r i r 11111111111117 r l r r l l l l l l l rr r l rrrJ J i l l II rF

1438 UAUUUCGAAGUGGUUUGCCGCGUUGACUCGUCCGCUGCGUGU~UUUUUCUCCAGUGUUGUUCACGCACUAUUCCCGACUUUGAGACCCCGCGAGGAAAAAGAAUUUUUGA~/CAAGCUCUC

|561 CACUUU c GU C ACCUUUAA £GA GGAGIJGCUC UUUUIIAC GG AGGG.A.~GAA U~C, GA C GUGAUAUC A U C A ~ C G G CUUUC~UAGC CAC U ~ C UGU ~GC~U~C~CUAU~C ~ C C ~ I I) 11111 II I I I I I I I I l l l l r J I I I I I I I I r ]1 I i l r l l l l l i J i i i J i I I J I I rl II I Fir i I II J l ] l l l l l l I IJ I I I p i i i I ]111 I J

1558 C ACCUUCGUGACUUUUAAUGAAGAGUGCUCAUUUGACGGUG~AGAGGAAUGGGACGUGAUAUCAUCCGCUGCAUACGUUGCUACGCAGGCUGUUACCGAUGGGAAGAUUUUGGCUGCGCA A "

1681 ]11111 Irlr r Irrre I] ii Ir Jr rl rl I i i i l r l r I ~ - ~ - ~ l r r rl J i l l IIJJ I rl i r r r l r r rrp i l l GAAAGCUAAGAAAUUAGCUGAUCGUCUGGCCGU~CC UGUUGAAGAAGUUACUGC UA UUCCUGA CUCCUACACCUGUUGAUCAGGGCAC UGC UUGUGGACUGGAAACAGAAAC

1678 GO~,G¢C~GAAGCUUG¢O~AGCUUGCAC~CCCGOGAGUGAGGUAtJCGC~CAGUCC U~ CGt~q.G~C UC~ACGCC UGAUGA~A COGCUGAOGUU0GUG~G~GCGA~GGU

1798 AUc~¢UGr`.~CUcu~t~CUG¢CcAAACAcGU~cC¢¢CAUCGcACG~cGCU~C~ccGC~Au~UG~AUUCAGc~A~GA~cAAUUGCAC~AuAccACc~Uc r r l l ] l l l ]1 II II II II rr i J i i I i I i i i H J J r i ] r r l J r l [ r [ rl rl i r l r r l i rl l i t r r J r r r l l l l l l r l l l l l l l l l l p II p[ II ii

1798 UU~¢uC~cuc¢¢uAU¢A~u¢~c~c~c~u¢~¢c¢~uc~cuAc~Guu~tu~A~s~cuAcu~cuAUg~A~A~G~GCu~A~GAg~AC~uGCACc~cACuACAGuG~C

1918 A~uCu~C.*.ACG~U~c`c~c~cA~f;GcAc~GuGGCGAc~ACAAGA~C~C~UuAGA~AAU~GA~uG~G~GAcAC~A~UUcUG~C~Ccc~A~A~cAc~u Ipl p r lJ l r l l l r J i r r r l l Jr ii ii [rJl r re r r l l r i iJl i ] l l l J J ] l ] i]J]r rl Ir iI rr rrl r r lrJr r I r r l l l J i l l Ir

1918 ~AA~UAAAACG~A~UGGMcAUGGcGGGcGGUGA~GACAAAAGAAA~Uccc~cGAGGG~AA~GAA~G~CGA~ACGUAcU~uAC~G~GA~CcUA~GG~AACAU~CA~U

2038 UCC~ACUGGGAGAUGGAUGCACCCUGUGCCUC-GAG~GCGUGGUAUACUCUGUUGGUUAUAAUGAGAAGGGUCUUGGUCCUAAACUUGAUAGUGAGUUGUACAUAGUUAAUGGUGAUUGUGU J II II I IIIIIIP r r l r l l l l i l l l l J ii Ii rlrrr l l l l l l l J l l l pJr I i I i i l ii l lr l i liJ i i i i i i ii i i i l l i ] II Ii

2038 CUCCACGGGUCGGUGGAUGCGUCCUGUGCCCGAGGGUAUUGUUUAUUCUGUCGGUUAUAAUGAACGCGGUUUAGGUCCGAAGUCUGAUGGAGAGCUUUACAUUGUCAAUAGUGAAUGCGU

2~.58 C~U~UCGAACAC~CAUGAUUUGUUUUCAAUCACUAAAUCUUUGUUAGCUCCCACCGGAACCAUCAGCCAAGUCGAUGGUG~AGCUGGGUGCGGGAAAACCACUGcUA~AAAAUCCAUGUU rll i Ipll l l p rrr r J i i i i rrl i IHI Ir I i r rrrll rl ] i ] i i II Ip r{ rrrrr ir l i r l r l l i l l l li il i i i i r r l r ii

2158 GAU•UGUAACAGUGAAUCUUUAU••AC•GUCACGCGUUCUCUUCAAGCUGGAACCGGGACCAUUAGUCAAGUUGACGGAG••UG•UGGUUGUGGGAAA.ACCACGGCAA•UAAAU••A••U•

2278 u~UUC~A~AGAUAUAAUUGUCA~AGCCAACAAGAAAU~UGCUCAAGAUG~GCGUUAU~C~CUGUUUAAAUCUACL~GA~UCCAAAC~A.~GCUUGUGCUUUUGU~AGGACCGCUGAUU~ I I rl l l r i l iJ i i ir i i I i rl i J I r J ] l l IJ Ip p l l l r l l l i l l i i rr i i i l i r l J ir I J l l J r E i r r r l l l r I r l l H r rr Ir H i l l

2278 L~GAGCCGUCCACUGACAUGAUCGUUACCGCGAACAAGAAGUCCGCCCAAGAUGUACGUAUGt;CACUUUUCAAAUCGUCGGAUUCCAAAGAA~CUUGCACCUUUGUUCGAACAGCCGAUUC

2398 CAUAUUACUCAAUGAUUGC C C UAC GGUGUCUC GAGUGC U UGUGGAUGAAGUUGUUUUGUUG C ACUU UGGUCAGUUGUGCGCUGUCAUGUCGAAAC UUCA UGCUGUCAGAGCUUUGUGUUU I I I I I I I H I II II gl t l g I II I l l II ~1 II I I I II I I I I I I I I I I I I I I I I II I l l l l l l l ]1 I I l l I I I I I I T I TIITI

2398 U GUCC~ACUUAAUGAAUGUCCGACUGUUAGUAGGGUUUUGGUUGACGAGGUCGUUCUGCUACACUUUGGUCAAUUAUGUGCCGUCAUGUCUAAGuUGAAGGCCGUGcGAGCUAUAUGUUt~

2518 CGGAGACUCCC~ACAfl, AUAGCCUUCU C C UCUC GUGACGC UUCGUUCGACA UGCGUUUCUCUAAGC UUAUU CCGC.A U~GACCAGUGAUGC GGACACAACUUUCCGUAGCCCACAAC~UGU Ir rl i l i i i i i i i r l r r r I ] r r l r l i J i r l l l r II J i i ] l l l l l l J J l l l r l r i l r l r l J i i i i i i i 7711rrll i l r l r ir I r r l l r l p p l l r l l l l l l l l

2518 UGGGGAUUCGGAGCAGAUUGCCUUUUCCUCgCGUGAUGCUUCAUUUGACAUGCGUUUC UCU~AGAUUAUUCCUGAUG~AC UAGU~UGCU~CACCA CAUUCCgUAG£CCAC~GAUGU

2638 I l l l l I l l l l l l l r l l ] l J l l r l l l l l l l l II r l l l l l l l I]11 I r r l r l l ~ ~ [ ~ , ~ l T i i l l i i i l l l l l l l ! ! l r l l l l l ii i J r l r I I l l l l

AGUACCACUUGUGCGUUU('.AUGGCUACGAAGGCUC UAC CGAAAGGGA CCCGUACGAAAUA CCCAAUCUAAAGUGAGGAAGUC UGUUAC UUCGCGUGCUGUUGCUAG

2638 UGUACCGCUUGUGCGUUUAAUGGCUACGAAGGCC CUUC CGAAAGGAACCCAUUCAAAAUAqACGA~U~.GUtI~CUCAA/IC UAAAGU~UCUGUU~CUE~CAUC~U~G

2755 UGUAUCUCUAGUUGAACUGGACCCCACCAEAUUCUAUAUUACCAUGACGCAAGCUGAUAAAGCCUCACUAAUCACUAKAG•UA.AGGAACUGAACCUACCCAAGGCAUU£UACACUGAUAG Ir l J J J I I I I I I I I p rrlJ i I ] i l l l l i r l r r r l r r l l l ] J J l l l l i r l l l r l l rl i ] 1 pl I i i i i l r i r l J l r i i r r i l l p I r l l rl

2758 UGUGACAUUGGUUGACUUGGAUUCUUCCAGgUUCUAUAUAACGAUGACC CAAGC UGAUAAGGCCUCACUGAUUUCAAE, GGCGAAAG'AGA UGAAUCUACCAAAC~CUUUCUG~U~G

2875 GAUA~GAC UGUCCAUGAAUCUCA.~GGUAUUUCAGAAGAUCAUGUGACCCUGGUUAGG UUCPI, AGAGUACUA/kAUGUGAC CUGUUCAAC~AUUUUCUUACUGCUUAGUUGCAGUUACI~G I l l II II II I l r l l r l lP l r l l l r ir I l r l l i i i i rl r l l r i i i i J i ] l l l l r i l l l J l r r r l l r l r ]] i l l l l l l ] i II e i l l l l l I i i 1

2878 GAUUAAAACCGUGCAUC~GUCUCAGGGUAUCUC£GAAGA£CACGUUACUUU~UAA~UU~UAC~UGU~£C UGUUU~£AGUUUU£UUAUUGUCUCGUUG~£UGAC~G

2995 Pill l l l[ l l l l irlllrrl Jlpll l l l l l l l l l rr II ir iJllllJ r l r T rl 7]-Ii ir IT~I i l l I I i r l l fl AC ACAA GGUCACC UUC CGCUAUGAGUACU&tlGGUGUGUU~.GGUGGGGACCU AJ~UC~C UAAU~ UUC C &UUABUC~UCGCC.-C, AAGI~U C C IIACGUUAAACUACGC UCUCUUUAUU

2998 ACAC~GGU .CACAUUCCGCUACGAGUACUGUGGUGUAUUGAACGGCGAU .UUAAUC GCCG.~0GI~- - - - -GU . . . . . UGCIJC . . . . . . . GUGCIIUAG- CGGU- - - CU-CC CUC - - -UUCGG

31J5 GCGAGUGCUCtAGUUGGUAGUUUUGCUUUAAACUAUCUGAAGUCGCUAAAUC•AGUAUUGGUUGCGA.ACGGGUUG•CCAUCCAGCUUACGGCUAAAAUGG•CAGUAUGCCCCA/•.AGGcAUG I I I I I I I I I I I I I IP I rl r i l l J I i l i l l l l rprrr Ir I i I i f I r J r f l l r l f r H i i i i I I I i i i i i i i i i i i i i i i i i i i I I I i l l

3093 GCGGGAUCUGAGUUGGCGGUAAU o CUGCAAACCGUC UG/tAGUCACUAAACACAUUGU- - GUGGUGAACGGGIIUGUCCAUCCAGCUAACGGCUA.A/kAUGGUCAGUCGUG~UCCACG

3235 C C -GAC AUC C UACAA GGUUGU C C~GC UAC C C UUC-C-C-C-C-C-C~ UCA UC U C C UAKA U U UC UU C GGAA GGGC U U C G UGAG/kAGCU C GUG CA C GG U~UACAC U~ ~O UAC C ~ G~C GG~ UCG II I I I I I I I I I P r l l r/I I l r l l l I l l l l r l r l l I I ] r l l l l ] l l H i l l r l I I I I I I II Ig I I I I I I I I I I I I I t

3210 ~CAC~jAGAC'd~AC~A~'UCUC~-~AGGCACC~UUGAAACCAUCUCCUAGG~UCUUCGG~GCACUUC~UCCGUG~CUUCUA--GCAC~C-G~CUAGUUUC-A~G~C~U~--G 3354 CCUGUGGUUUUCCACAGGUUCUCCAUAAGGAGACCA 3389

rl I I I I Pl I r r l I I i i r l p r r l r 3322 CCCCCCACUUUCGUGGGGGUCUCUAAAAGGAGACCA 3357

Fig. 3. Alignment of the nucleotide sequences of Q-CMV RNA 1 (upper line of each pair) and Fny- CMV RNA 1 (lower line of each pair). Identical nucleotides are indicated by vertical lines; gaps are denoted as dashes; the initiation and stop codons are underlined; the segments showing codon usage realignment are boxed and labelled A to C,

Page 7: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

C M V R N A 1 sequence

1 ~ATssF~mE~vAs~GDKG[~A~v~KTA~EQ~EEQLQ~QRRG~KvY~Nv~OvKOsEvIRTR¥G~vD~A~E~PHG~AGA~R~cE~c~FFp II]]]]]l]!~]]l][]]]lllllllll]!!lll[]]l[I][ll !lll!!]l IIllllII i!l]]l]lllo]]l [l!lll!llllll]l!llr II

1 MATss~NINE~v~s~GDKG~AT~v~KT~EQ~EEQ~Q~QRRGRKv~R~v~vKDsEvrR~R~G~K~TQQEFAP~G~A~A~R~cET~c~DsFp

101 RsGLRQD~wDFGGs~vTH~LRGHNvHcc~cLGIRDKMRHT~RLMsMRKvIL~DPQQF~GRQP~FcTKSAAEcKv~AHFAIs~HGG~M~FR~LcEAMN Illllllll~li ' l lIllll! ' l!!ll!l]l~!lll!lllollll ~lloll]llllll!ilIlill Ilolll!lll]!lllll!llllllllllll

101 ~GLRQDLv~DFGGsWvTHYLRGH~vHcCsP~LG~RDKMRHAERLM~MRK~LnD~QQFDGRQPDFCTQ~AADcKvQAHFA~iHGG¥DMGFRGLcEAMN

201 A~GTT~LKGTMMFDGAMM~DDQGHPELKcQWRK~KsAFsEEEDm~s~AKL~ssvFsRvRNGKTL~AFDFvEEsTMsYvHD~NIKsFMTDQ~sFNGM ]lll l l l l ' l l! l l l l l l] l l l l !ill :[[ll[ol[[[I ;I I IIIlo]llllr Ii0011!!o IIIllill]l~l]ll!011lll[0 II

201 A~G~T~LKGTMMFDGAM~FDDQ~wPE~NcQWRKiRsAFsETE~vTP~v~K~NsTwsRvRKFK~MvAFDH~EsTM~Yw~EN~K~FLTDQTYsYRGM

301 ~GIER~VI¥AGVMT¥KIv~GM~PPELIRHCIWF~sMK~GLK~PA~LVK~KTvR~L~5TLRE~EEIAMR~YN~KKN~M~K~XGvL~KsST rl[!'il[[ Ilollllio!{ll!lll!l[[ll[lll!olllll]illll [[i 'lil~II Illilll]]]ll!]]lll I1o rlollI][lorl[i

301 TYGIERcvIHAGIMTYKI~Gv~GMcPPE~IR~CIWFpsIK~vGLKIPAsQ~LvEW~TvR~LTsTLRETEEIAMR~YNDKKAWME~FKv1LGv~sAKssT

401 IvINGMsMQs~ER~DL~DYHY~GFA1LLHTKLKYEQLGKMYDMWNAsF~WKWFAsMsR~FRvFFsTvvKTLFP~LRPREzKEFLwLsTFv~FNEEcsFD l¢l¢(¢lll¢ltlll'01¢¢¢g¢ll¢¢¢¢lllolf¢l¢¢¢ll¢ll/ll ¢~l¢oool ' '~llo#loo? ~': . . . . . . ~ ,,:~'lilll!ll¢ollll?lll/ill!l!

401 ~v~NGMsMQ~GERIDIN~YHY~GFA1~LH~KMKYEQLGKMYDMWNAss~KWFAAL~R~LR~FFs~HA~F~LR~REEKEFLIK~sTFvTFNEE~FD A

501 GGKE~DvIssA~FvArQAvA%T~AEEKAKK~ADR~AvPv~EvT~pE~sp~Pv~TA~G~ETET~E~Ds~sA~rRsP~AR~AERArA~E~sA~EK II I I I l i l l l l o l l l l l ! o l [ III II l l loo!i '1 ilo I l l l l ~[I I l l ! l l l l l l l l l l l l ! l o l o l l l l i [ ! l l l o l l l l

501 GGez~Dvr5S~AYvm~avTD~Kr~a~Ex~zEKL~vsEvsD~.s~rP~Dra~vcGRzRzvsE~DsLsA~RSP:rRvazR~T~LzYz~Yz ~

600 Q~HDTTvSNLQR~WCMA~GDNKRN~LESNLKFVFDT~F~vDA~VNVHFPTGRWMHpV~EGVVYSvGY~EK~LGPKLDSELY~VNGDCVISNsHDLFS~TK IIIIIIIIll It ~ iI]!l III!!1 I[11]1[111oli oll~,ll ,el ~ ~ '~l!ollll[]l!o!]lll I III111 o111 !1 I oolo

601 QL~vsNLKRIWnMAGGDDKR~sLEGNLKFvFD~F~v~MvN~HFs~GRWMRPwEG~v¥svG~r~ERG~G~Ks~GEL~vNsEcv~c~sEsL~v~R

700 sLLA~GT~s~vDGvAGcGKT~A~KsMFNPsTD~vTA~KKsAQDvR~A~FKsTDsKEAcAFw~ADs~LLNDcP~vsRvLvDEvv~FGQLcAvMsKL 1] r [l[lll]ll[[ll[[lll[llo] ]lllol]l[[l[ll[lll lllllo[IIl!!ollll[llol!lolilllll[l[lllllllI$1!lll![ ~

701 sL~AG~G~sQvDGvAGcGKTTA~Ks~FE~s~DM~v~ANKKsAQDvRMALFKssDsKEAcTFvR~ADsvLLNEcP~vsRvLvDEvvLLHFGQLcAvMsKL . . . . . . B . . . .

800 HAVRALCFGDsEQIAFSSRDASFDMRFSKL1pDETSDADTTF~SPQDvVPLVRLbiATKALPKGTRTKY~=D''~QSKVRKSVTSRAVASVSLVELDPTRFY o l l l l o l l l l l l l [ i l l l l ] l l ] ] ] l [ ] l o l l l l l [ l l [ l l l l l [ I ] [ [ [ l ] l l l l l l l l l l ] l o o l l ~ ~ l ] i [oo l l l~ ! ]o l l l o [ lo [ I orlr

801 ~A~RA~FG~E~IAFS~R~ASFDMRFSK~ET~A~TTFR~QD~V~LVR~MA~KAL~K~TH~K~-~W~QSKVKRSVT~RA~ASVT~VDL~SSRFY . . . . . . . . . C

899 ~TMTQADKASL~T~AKELNLPKAFYTDR~KTV~ESQG~SEDH~TLVRLK~KCDLFKKFSYCLVAVTRH~VTFRYE~CGVLGGD~AN~[ 991 ] / ] ] l / I ] / IJ /o[ ] ] /o l l l lo ]o o l l l ] ] ] ] ] l ] ] ] ] ] ] l ] l l l ] l l l l / / ] [ ] I I I I ] l ] o l l I I I I ] ] l l l l l ] l I l l ! l I~ I

901 ~TMT~ADKASL~RAKE~NL~KTFWNER~KT~ES~G~SE~VTLVRLKSTK~DLFK~FSY~LVALT~HK~TFR~E~CGYLNG~L~AEA~'R~J 993

Fig. 4. Alignment of the putative translation products of Q-CMV RNA 1 (top line) and Fny-CMV RNA 1 (bottom line). Identical amino acids are indicated by vertical lines; chemically similar amino acids [as described by Dayhoffet al. (1972)] are indicated by circles between lines; gaps are denoted by dashes. Amino acid sequences enc(~:led by segments A to C (see Fig. 3) are boxed.

third than in either the N- or the C-terminal thirds. Rather, with Q- and Fny-CMV R N A s 1, nucleotide and amino acid sequence homologies generally are more conserved in the terminal regions than in the central region. Thus, excluding conservative amino acid substitutions, 45 % of the differences in amino acid sequence occur between amino acids 503 and 705, and another 15 % of the differences occur between amino acids 224 and 298, with the remaining differences distributed over the rest of the molecule (Fig. 4). This distribution of homologous and non- homologous sequences is less apparent at the nucleotide sequence level (Fig. 3) because of additional sequence divergence masked at the protein level by the degeneracy of the genetic code.

The distribution of charged amino acids also displays a bias: the C-terminal 141 amino acids contain considerably more basic amino acids [27 in Q-la (the putative protein encoded by Q- CMV R N A 1) and 28 in Fny-la] than acidic amino acids (12 in Q- la and 12 in Fny-la), as is often observed in the C-terminal domains of nucleic acid-binding proteins (Galibert et al., 1979; Pasek et al., 1979); the region of greatest heterogeneity (amino acid residues 503 to 600) contains more acidic amino acids (17 in Q- la and 19 in Fny-la) than basic amino acids (nine in Q- la and 10 in Fny-1 a); most other regions of the two putative proteins contain similar levels of basic and acidic amino acids (estimated by analysing groups of 1 O0 amino acids), with the exception of the region comprising amino acids 101 to 200 which contains more basic amino acids (19 in Q- la and 17 in Fny-la) than acidic amino acids (10 in both Q- la and Fny-ta).

Similarities in amino acid sequence between the translation products of specific genes of a number of viruses, including the Q- la protein, have been observed (Haseloff et al., 1984; Rezaian et al., 1985). Since the Fny-1 a protein is v e ~ similar in amino acid sequence to the Q-1 a protein, the above similarities would also apply to the putative translation product of Fny-CMV

Page 8: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

T. M. R I Z Z O A N D P . P A L U K A I T I S

RNA 1. Of the other virus gene products that were compared with the Q-1 a protein, protein la of BMV showed the most overall amino acid sequence homology. The Fny-la protein did not have a greater sequence homology to the BMV-la protein (44.5~) than did the Q-la protein (43.8 ~); however, the region of greatest amino acid sequence divergence between the Fny-1 a protein and the Q-1 a protein (Fig. 4, residues 501 to 600) also includes the region of greatest heterogeneity between the Q-la protein and BMV-la protein (Rezaian et al., 1985). Furthermore, the corresponding, aligned region of BMV-la (residues 470 to 578) also contains more acidic amino acids (24) than basic amino acids (20), in contrast to the amino acids in the BMV-la protein flanking this region. Similarly, an analysis of the corresponding aligned sequences of A1MV and tobacco mosaic virus (TMV) (Fig. 2 of Haseloff et al., 1984) shows the presence of a domain containing marginally more acidic (26 for the A1MV-1 a protein and 25 for the TMV Mr 126000 protein) than basic (22 for the A1MV-la protein and 17 for the TMV Mr 126000 protein) amino acids, while the regions flanking this domain contain either equivalent numbers of basic and acidic amino acids, or more basic than acidic amino acids. Hence, there is a domain present near the central region of several viral translation products analogous to the CMV-la protein, which shows little amino acid sequence homology but contains a preponderance of acidic amino acids.

By comparison with other proteins involved in nucleic acid replication, Hodgman (1988) has noted that proteins encoded by several plant viruses contain a series of conserved motifs. Thus, the 1 a proteins of A1MV, BMV and Q-CMV all show some conservation of amino acid sequence with other nucleotide-binding proteins, further suggesting some role for these proteins in replication. These conserved protein sequence motifs are also present in the putative protein encoded by Fny-CMV RNA 1.

Non-coding regions of Fny-CMV RNA 1

In the non-coding 5'-terminal regions, the 94 nucleotides of Fny-CMV RNA 1 and the 97 nucleotides of Q-CMV RNA 1 are 81 ~ homologous. The 86 nucleotides in the non-coding 5' terminus of Fny-CMV RNA 2 are 84~ homologous to the corresponding region of Fny-CMV RNA 1. The distribution of homologous nucleotide sequences between Fny-CMV RNA 1 and Fny-CMV RNA 2 is similar to that described by Rezaian et al. (1985) for Q-CMV RNAs I and 2; i.e. there are few differences in the first 50 nucleotides. Moreover, both the sequence homology in this region between Q-CMV RNAs 1/2 and BMV RNAs 1/2 and the sequence complementarity between the 5' non-translated region of Q-CMV RNAs 1/2 and the satellite RNA of CMV (Rezaian et al., 1985) are also present in the non-coding 5'-terminal region of RNAs 1 and 2 of Fny-CMV.

The non-coding 3'-terminal region of Fny-CMV RNA I is 284 nucleotides long (including the stop codon), which compares with 319 nucleotides in Q-CMV RNA 1. The 3'-terminal 180 nucleotides of Fny-CMV RNAs 1 and 2 are 96 ~ homologous; the sequence homology between Fny-CMV RNA 1 and Q-CMV RNA 1 in this region is 64~o. Two specific secondary structures involving this segment of Fny-CMV RNA 2 have been illustrated elsewhere (Rizzo & Palukaitis, 1988); the predominance of one conformation over the other depends on the presence or absence of magnesium ions. The secondary structure of the 3' end of Fny-CMV RNA 1 in the absence of magnesium ions is illustrated in Fig. 5. Also shown are the nucleotide differences between Fny-CMV RNAs 1 and 2 occurring in this structure.

It is evident from Fig. 5 that the seven nucleotide substitutions and the single nucleotide deletion that occur in Fny-CMV RNA 2 relative to RNA 1 are predominantly outside the base- paired regions of the secondary structure. There are three exceptions to this. In hairpins A and G, three nucleotide substitutions occur within the base-paired stems; however, these changes do not disrupt the proposed secondary structure occurring in either the presence or absence of magnesium ions. At position 13 in stem A, the U residue in RNA 1 and the C residue in RNA 2 can both base pair with the same G residue. In stem G, a G-C base pair in RNA 1 (positions 161 and 152, respectively) is replaced by an A-U base pair in RNA 2.

The extent of sequence divergence in the non-coding regions between RNAs of the two subgroups of CMV is not equivalent; i.e. there is greater homology in the 5'-terminal non-coding

Page 9: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

C M V R N A 1 sequence

3 r i A

G t~ ~A A F A 11 GAGG A A G A A C . . . . A

A A 160A-U U A C U C U A A a U G" C U- A (U)20 c

U "G *G -Cu 140-C.C cG I C'G U-A A'U GGGGUG U

180 -G'CAGUC "GCCAGUA'UCUGAG.C C c G'C

C-G I A,U

51 C-G

CU~cCU 120-C. G-40 U C ACCAAAG UUU .GCAUGGG ACU E . . . . . . . . . . . . u AGGUUUC GUACUUc.G U

1~o u u u u A C A G A.U GG A ~,C G.C

"C C .G A.U-80 cA. U C.G U G AAC U C C ~60 D

Fig. 5. Secondary structure of the 3' terminus of Fny-CMV R N A 1 in the absence of magnesium ions. Stem and loop structures are labelled A to H. Nucleotide substitutions that occur in Fny-CMV RNA 2 (Rizzo & Palukaitis, 1988) relative to RNA 1 are indicated by smaller letters adjacent to their corresponding nucleotides in Fny-CMV RNA 1. The residue in parentheses is missing in Fny-CMV RNA 2.

regions (81% for RNA 1 and 80~ for RNA 2) than in the Y-terminal non-coding regions (66% for RNA l and 65 % for RNA 2) of the two CMV strains. However, the Y-terminal regions show a strong conservation of secondary structure (e.g. Fig. 5). Thus, at the 5' termini, the actual nucleotide sequence may be important for various RNA-protein interactions, while at the 3' termini, the secondary structure formed by the nucleotide sequence may be more important in such interactions. This may also account for the difference in the extent of divergence between the 5'- and Y-terminal regions of RNAs 1 and 2 (of Fny-CMV); i.e. 16% in the 5' termini and 4% in the 3' termini, since compensatory changes would have to occur in the Y-terminal region to maintain the same secondary structure.

Genetic analysis and function o f RNA 1

Although genetic mapping studies with several strains of CMV have indicated that the domains for pathogenicity and host range are on RNA 2 and/or RNA 3 (Marchoux et al., 1974; Hanada & Tochihara, 1980; Rao & Francki, 1982; Edwards et al., 1983; Lakshman & Gonsalves, 1985), two studies have implicated RNA 1 in such roles (Lakshman & Gonsalves, 1985; Zitter & Gonsalves, 1986). In the latter case, RNA 1 of Fny-CMV was associated with rapidity in the initial increase in virus titre in squash, as well as the rapidity and extent of symptom development. However, we have observed that these effects are specific to certain cucurbit species, and are not seen in either tomato, tobacco, Nicotiana clevelandii or cucumber. Moreover, the acceleration and enhancement of pathogenicity is correlated with decreased levels of (encapsidated) RNA 1, both with Fny-CMV and another strain of CMV (Rm-CMV) that also exhibits the above properties (M. Roossinck & P. Palukaitis, unpublished results). Therefore, RNA 1 may play a major role in the regulation of viral RNA synthesis, and thus only indirectly affect pathogenicity.

Clearly, more work is needed both in defining the roles of the translation products of RNAs 1 and 2 in virus replication and in delineating various functional domains in these proteins.

This work was supported by grant no. 86-CRCR-l-1983 from the USDA CGO and grant no. DE-FG02- 86ER13505 from the Department of Energy.

R E F E R E N C E S

AHLQUIST, P. (1986). In vitro transcription of infectious viral RNA from cloned cDNA. Methods in Enzymology 118, 709 716.

DAVIES, C. & SYMONS, R. H. (1988). Further implications for the evolutionary relationships between tripartite plant viruses based on cucumber mosaic virus RNA 3. Virology 165, 216-224.

DAYHOFF, M. O., ECK, R. V. & PARK, C. M. (1972). A model of evolutionary change in proteins. In Atlas of Protein Sequence and Structure, vol. 5, pp. 89-99. Edited by M. O. Dayhoff. Washington, D.C. : National Biomedical Research Foundation.

DEVERGNE, J. C. & CARDIN, L. (1975). Relations s~rologiques entre cucumovirus (CMV, TAV, PSV). Annales de Phytopathologie 7, 255 276.

Page 10: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

10 T. M. R I Z Z O AND P. P A L U K A I T I S

EDWARDS, M. C. & GONSALYES, D. (1983). Grouping of seven biologically defined isolates of cucumber mosaic virus by peptide mapping. Phytopathology 73, 1117-1120.

EDWARDS, M. C., GONSALVES, D. & PROVVIDENTI, R. (1983). Genetic analysis of cucumber mosaic virus in relation to host resistance: location of determinants for pathogenicity to certain legumes and Lactuca saligna. Phytopathology 73, 269-273.

GALIBERT , F., MANDART, E., FITOUSSI, F., TIOLLAIS, P. & CHARNAY, P. (1979). Nucleotide sequence of the hepatitis B virus genome (subtype ayw) cloned in E. coll. Nature, London 281, 646-650.

GARCIA-ARENAL, F., ZAITLIN, M. & PALUKAITIS, P. 0987). Nucleotide sequence analysis of six satellite R N A s of cucumber mosaic virus: primary sequence and secondary structure alterations do not correlate with differences in pathogenicity. Virology 158, 339-347.

GONDA, T. J. & sYMoNs, R. H. (1978). The use of hybridization analysis with complementary D N A to determine the R N A sequence homology between strains of plant viruses: its application to several strains of cucumoviruses. Virology 88, 361-370.

GUnLER, U. & HOFFMAN, B. J. (1983). A simple and very efficient method for generating c D N A libraries. Gene 25, 263-269.

HANADA, K. & TOCHIBARA, H. (1980). Genetic analysis of cucumber mosaic, peanut stunt and chrysan themum mild mottle viruses. Annals of the Phytopathological Society of Japan 46, 159-168.

HgSELOFF, J. & SYMONS, R. H. (1981). Chrysan themum stunt viroid: primary sequence and secondary structure. Nucleic Acids Research 9, 2741-2752.

HASELOFF, l., GOELET, P., ZIMMERN, D., AHLQUIST, P., DASGUPTA, R. & KAESBERG, P. (1984). Striking similarities in amino acid sequence among nonstructural proteins encoded by R N A viruses that have dissimilar genomic organization. Proceedings of the National Academy of Sciences, U.S.A. 81, 4358-4362.

HODGMAN, T. C. (1988). A new superfamily of replicative proteins. Nature, London 333, 22-23. JESSEE, J. (1986). New subcloning efficiency competent cells: > 1 x 106 transformants/ktg. Focus 8 (4), 9-10.

Gaithersburg: Bethesda Research Laboratories. KAMER, G. & ARGOS, P. (1984). Primary structural comparison of RNA-dependent R N A polymerases from plant,

animal and bacterial viruses. Nucleic Acids Research 12, 7269-7282. KIBERSTIS, P. A., LOESCH-FRIES, L. S. & HALL, T. C. (1981). Viral protein synthesis in barley protoplasts inoculated

with native and fractionated brome mosaic virus RNA. Virology 112, 804-808. LAKSHMAN, D. K. & GONSALVES, D. (1985). Genetic analyses of the large lesion mutants of two cucumber mosaic

virus strains. Phytopathology 75, 758-762. LISS, L. R. (1987). New M13 host: DH5ctF' competent cells. Focus 9 (3), 13. Gaithersburg: Bethesda Research

Laboratories. MANIATIS, T., FRITSCH, E. F. & SAMBROOK, J. (1982). Molecular Cloning: A Laboratory Manual. New York: Cold

Spring Harbor Laboratory. MARCBOUX, G., MAROU, J. & QUIOT, J.-n. (1974). Compl6mentat ion entre A R N de diff6rentes souches du virus de la

mosaiqu¢ du concombre. Mise en 6vidence d 'une interaction entre deux A R N pour d6terminer un type de sympt6me. Comptes rendus hebdomadaires des sdances de l'Acad~mie des sciences, Sdrie D 278, 1943-1946.

MESSING, J. (1979). A multipurpose cloning system based on the single-stranded D N A bacteriophage M13. Recombinant DNA Technical Bulletin. Bethesda: N I H Publication No. 7 9 4 9 , 2(2), 43-48.

MESSING, J. (i983). New M13 vectors for cloning. Methods in Enzymology 101, 20-78. NASSUTB, A. & BOL, J. F. (1983). Altered balance of the synthesis of plus- and minus-s t rand R N A s induced by R N A s

1 and 2 of alfalfa mosaic virus in the absence of R N A 3. Virology 124, 75-85. NORRANDER, J., KEMPE, T. & MESSING, J. (1983). Construction of improved M13 vectors using oligonucleotide-

directed mutagenesis. Gene 26, 101-106. PALUKAITIS, P. & ZAITLIN, M. (1984). Satellite R N A s of cucumber mosaic virus: characterization of two new

satellites. Virology 132, 426-435. PASEK, M., GOTO, T., GILBERT, W., ZINK, B., SCHALLER, H., MACKAY, P., LEADBETYER, G. & MURRAY, K. (1979).

Hepati t is B virus genes and their expression in E. coll. Nature, London 282, 575-579. PEDEN, K. W. c. & SYMONS, R. H. (1973). Cucumber mosaic virus contains a functionally divided genome. Virology

53, 487-492. PIAZZOLLA, P., DIAZ-RUIZ, J. g. & KAPER, J. M. (1979). Nucleic acid homologies of eighteen cucumber mosaic virus

isolates determined by competition hybridization. Journal of General Virology 45, 361-369. PONCZ, M., SOLOWIEJCZYK, D., BALLANTINE, M., SCHWARTZ, E. & SURREY, S. (1982). " N o n r a n d o m " D N A sequence

analysis in bacteriophage M13 by the dideoxy chain-termination method. Proceedings of the National Academy of Sciences, U.S.A. 79, 4298-4302.

RAO, A. L. N. & FRANCKI, R. I. B. (1982). Distr ibution of determinants for symptom production and host range on the three R N A components of cucumber mosaic virus. Journal of General Virology 61, 197-205.

REZAIAN, M. A., WILLIAMS, R. H. V., GOULD, A. R. & SYMONS, R. H. (1984). Nucleotide sequence of cucumber-mosaic- virus R N A 2 reveals a translation product significantly homologous to corresponding proteins of other viruses. European Journal of Biochemistry 143, 277-284.

REZAIAN, M. A., WILLIAMS, R. H. V. & SYMONS, R. H. (1985). Nucleotide sequence of cucumber mosaic virus R N A 1. Presence of a sequence complementary to part of the viral satellite R N A and homologies with other viral RNAs. European Journal of Biochemistry 150, 331-339.

RlZZO, T. i . & PALUKAITIS, P. (1988). Nucleotide sequence and evolutionary relationships of cucumber mosaic virus (CMV) strains: C M V R N A 2. Journal of General Virology 69, 1777-1787.

Page 11: Nucleotide Sequence and Evolutionary Relationships of Cucumber Mosaic Virus (CMV) Strains: CMV RNA 1

Downloaded from www.microbiologyresearch.org by

IP: 54.162.190.106

On: Thu, 03 Mar 2016 17:32:52

C M V R N A 1 sequence 11

SANGER, F., NICKLEN, S. & COULSON, A. R. (1977). DNA sequencing with chain-terminating inhibitors. Proceedings of the National Academy of Sciences, U.S.A. 74, 5463-5467.

SANGER, F., COOLSON, A. R., BARRELL, B. G., SMITH, A. J. H. & ROE, B. A. (1980). Cloning in single-stranded bacteriophage as an aid to rapid DNA sequencing. Journal of Molecular Biology 143, 161-178.

SCHWINGHAMER, M. W. & SYMONS, R. H. (1977). Translation of the four major RNA species of cucumber mosaic virus in plant and animal cell-free systems and in toad oocytes. Virology 79, 88-108.

YANISCH-PERRON, C., VIEIRA, J. & MESSING, J. (1985). Improved M13 phage cloning vectors and host strains: nucleotide sequences of the M13 mpl8 and pUC19 vectors. Gene 33, 103-119.

ZITTER, T. A. & GONSALVES, D. (1986). Analysis of pseudorecombinants of two strains of cucumber mosaic virus differing in symptom expression and aphid transmissibility in squash. Proceedings of the Workshop on Epidemiology of Plant Virus Diseases, Orlando, Florida, August 6-8, 1986. pp. 19-21.

(Received 3 June 1988)