Top Banner
Vol. 66, No. 11 JOURNAL OF VIROLOGY, Nov. 1992, p. 6572-6577 0022-538X/92/116572-06$02.00/0 Copyright © 1992, American Society for Microbiology Sequence Similarity between Borna Disease Virus p40 and a Duplicated Domain within the Paramyxovirus and Rhabdovirus Polymerase Proteins MARCELLA A. McCLURE,1* KELLY J. THIBAULT,2 CAROLYN G. HATALSKI,3 AND W. IAN LIPKIN2,3,4* Department of Ecology and Evolutionary Biology, 1 Department of Microbiology and Molecular Genetics, 2 Department of Neurology,' and Department of Anatomy and Neurobiology, University of California, Irvine, California 92717 Received 30 June 1992/Accepted 18 August 1992 We report the sequence of a Borna disease virus clone (pBDV-40) that encodes a 40-kDa protein (p40) found in the nuclei of infected cells. Comparative sequence analysis indicates that p40 is distantly similar to two different regions in the L-polymerase proteins encoded by paramyxoviruses and rhabdoviruses. The p40 sequence similarity indicates a previously undetected duplication in these viral polymerases. Phylogenetic reconstruction suggests that the gene that encodes p40 last shared a common ancestor with these viral polymerase genes prior to the duplication event. These findings support the hypothesis that Borna disease virus is a negative-strand RNA virus and suggest that p40 is involved in transcription and/or replication. The discovery of a duplication within the polymerase proteins of paramyxoviruses and rhabdoviruses has profound implications for the mapping of enzymatic activities within these multifunctional proteins. Borna disease (BD) is a naturally occurring immune- mediated neurologic disease of horses and sheep caused by infection with an RNA virus, BD virus (BDV) (10, 24, 25). The experimental host range for BDV includes birds, ro- dents, and primates (25). Antibodies reactive with BDV proteins have been described in patients with bipolar depres- sion (2, 33), schizophrenia (33), and AIDS encephalopathy (4), suggesting that BDV, or a related infectious agent, is pathogenic in humans. Although BD was described in the early 1800s, BDV, the infectious agent that induces the disease, remains unclassi- fied. BDV has been refractory to classical methods of virus purification (25). Isolation of BDV cDNA clones has been critical in demonstrating that BD is due to infection, that the infectious agent contains nucleic acids, and that the 24- and 40-kDa proteins associated with BD are encoded by the infectious agent (24). Little is known about the molecular biology of the BDV infectious cycle. BDV transcribes its mRNAs in the nuclei of infected cells (6, 10), and p40 is frequently isolated in association with p24 (3). Biochemical and molecular studies have indicated that the BDV genome is a single-stranded RNA between 8.5 (10, 24) and 10 (32, 36) kb long. The designation of BDV genome polarity has been controversial: we have proposed that it is negative stranded (10, 24); others have suggested that it is positive stranded (32, 36). Rhabdoviridae, Paramyxoviridae, and Filoviridae are RNA enveloped viruses with nonsegmented genomes of negative polarity constituting the order Mononegavirales. Basic features of genome organization and analogous protein function are common among these families and support a shared evolutionary history (14, 19). Gene order for the rhabdovirus genome is 3'-NP-P-G-L-5', and that of the paramyxoviruses is 3'-NP-P(C/V)-M-F-HN-L-5' (14). The RNA-dependent RNA polymerase (approximately 240 kDa) * Corresponding authors. is encoded by the L gene that accounts for slightly more than half of the virus genome. The L protein is a multifunctional enzyme that, in conjunction with the NP-RNA template and P proteins, performs a variety of functions, including repli- cation, transcription, polyadenylation, capping, and methyl- ation. We cloned and sequenced a BDV cDNA clone inferred to encode a 39.5-kDa protein. In vitro transcription-translation experiments confirmed that the clone encodes a 40-kDa protein. Sequence analysis revealed that p40 is similar to an extensive region of duplication in the L-polymerase proteins of paramyxoviruses and rhabdoviruses. These findings have implications for understanding the role of p40 in the life cycle of BDV and the evolution of the monopartite, negative- strand RNA virus polymerase proteins. MATERIALS AND METHODS Library screening. Approximately 20,000 recombinants in a BDV rat brain cDNA plasmid library (pcDNAII; Invitro- gen, La Jolla, Calif.) were hybridized with a 32P-labeled oligonucleotide, TGGAGGCCGACTGTA, representing a sequence toward the 5' end of the BDV p40 open reading frame in cDNA clone AB5 (24). DNA sequencing. Plasmid DNA was sequenced on both strands by the dideoxynucleotide chain termination method by using bacteriophage T7 DNA polymerase (Sequenase; United States Biochemical). In vitro transcription-translation and immunoprecipitation. Capped mRNAs were synthesized in vitro (21) by using linearized pBDV-40 plasmid DNA as the template. Synthetic mRNAs and poly(A)+ RNAs from normal rat brain or BD rat brain tissue were translated in vitro by using rabbit reticulocyte lysates (Promega Biotech) (29). Translated [35S] methionine-labeled proteins were immunoprecipitated by incubation with serum from BD rats, rabbit anti-rat immu- noglobulin G (Pel-Freez), and staphylococcal protein A (Pansorbin; Calbiochem) (30). Immunoprecipitated proteins 6572
6

Sequence Similarity Disease Virus p40 and ...

Feb 09, 2022

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Sequence Similarity Disease Virus p40 and ...

Vol. 66, No. 11JOURNAL OF VIROLOGY, Nov. 1992, p. 6572-65770022-538X/92/116572-06$02.00/0Copyright © 1992, American Society for Microbiology

Sequence Similarity between Borna Disease Virus p40 and aDuplicated Domain within the Paramyxovirus and

Rhabdovirus Polymerase ProteinsMARCELLA A. McCLURE,1* KELLY J. THIBAULT,2 CAROLYN G. HATALSKI,3

AND W. IAN LIPKIN2,3,4*Department ofEcology and Evolutionary Biology, 1 Department ofMicrobiology and Molecular Genetics, 2Department ofNeurology,' and Department ofAnatomy and Neurobiology, University of California,

Irvine, California 92717

Received 30 June 1992/Accepted 18 August 1992

We report the sequence of a Borna disease virus clone (pBDV-40) that encodes a 40-kDa protein (p40) foundin the nuclei of infected cells. Comparative sequence analysis indicates that p40 is distantly similar to twodifferent regions in the L-polymerase proteins encoded by paramyxoviruses and rhabdoviruses. The p40sequence similarity indicates a previously undetected duplication in these viral polymerases. Phylogeneticreconstruction suggests that the gene that encodes p40 last shared a common ancestor with these viralpolymerase genes prior to the duplication event. These findings support the hypothesis that Borna disease virusis a negative-strand RNA virus and suggest that p40 is involved in transcription and/or replication. Thediscovery of a duplication within the polymerase proteins of paramyxoviruses and rhabdoviruses has profoundimplications for the mapping of enzymatic activities within these multifunctional proteins.

Borna disease (BD) is a naturally occurring immune-mediated neurologic disease of horses and sheep caused byinfection with an RNA virus, BD virus (BDV) (10, 24, 25).The experimental host range for BDV includes birds, ro-dents, and primates (25). Antibodies reactive with BDVproteins have been described in patients with bipolar depres-sion (2, 33), schizophrenia (33), and AIDS encephalopathy(4), suggesting that BDV, or a related infectious agent, ispathogenic in humans.Although BD was described in the early 1800s, BDV, the

infectious agent that induces the disease, remains unclassi-fied. BDV has been refractory to classical methods of viruspurification (25). Isolation of BDV cDNA clones has beencritical in demonstrating that BD is due to infection, that theinfectious agent contains nucleic acids, and that the 24- and40-kDa proteins associated with BD are encoded by theinfectious agent (24). Little is known about the molecularbiology of the BDV infectious cycle. BDV transcribes itsmRNAs in the nuclei of infected cells (6, 10), and p40 isfrequently isolated in association with p24 (3). Biochemicaland molecular studies have indicated that the BDV genomeis a single-stranded RNA between 8.5 (10, 24) and 10 (32, 36)kb long. The designation of BDV genome polarity has beencontroversial: we have proposed that it is negative stranded(10, 24); others have suggested that it is positive stranded(32, 36).Rhabdoviridae, Paramyxoviridae, and Filoviridae are

RNA enveloped viruses with nonsegmented genomes ofnegative polarity constituting the order Mononegavirales.Basic features of genome organization and analogous proteinfunction are common among these families and support ashared evolutionary history (14, 19). Gene order for therhabdovirus genome is 3'-NP-P-G-L-5', and that of theparamyxoviruses is 3'-NP-P(C/V)-M-F-HN-L-5' (14). TheRNA-dependent RNA polymerase (approximately 240 kDa)

* Corresponding authors.

is encoded by the L gene that accounts for slightly more thanhalf of the virus genome. The L protein is a multifunctionalenzyme that, in conjunction with the NP-RNA template andP proteins, performs a variety of functions, including repli-cation, transcription, polyadenylation, capping, and methyl-ation.We cloned and sequenced a BDV cDNA clone inferred to

encode a 39.5-kDa protein. In vitro transcription-translationexperiments confirmed that the clone encodes a 40-kDaprotein. Sequence analysis revealed that p40 is similar to anextensive region of duplication in the L-polymerase proteinsof paramyxoviruses and rhabdoviruses. These findings haveimplications for understanding the role of p40 in the life cycleof BDV and the evolution of the monopartite, negative-strand RNA virus polymerase proteins.

MATERIALS AND METHODS

Library screening. Approximately 20,000 recombinants ina BDV rat brain cDNA plasmid library (pcDNAII; Invitro-gen, La Jolla, Calif.) were hybridized with a 32P-labeledoligonucleotide, TGGAGGCCGACTGTA, representing asequence toward the 5' end of the BDV p40 open readingframe in cDNA clone AB5 (24).DNA sequencing. Plasmid DNA was sequenced on both

strands by the dideoxynucleotide chain termination methodby using bacteriophage T7 DNA polymerase (Sequenase;United States Biochemical).

In vitro transcription-translation and immunoprecipitation.Capped mRNAs were synthesized in vitro (21) by usinglinearized pBDV-40 plasmid DNA as the template. SyntheticmRNAs and poly(A)+ RNAs from normal rat brain or BDrat brain tissue were translated in vitro by using rabbitreticulocyte lysates (Promega Biotech) (29). Translated [35S]methionine-labeled proteins were immunoprecipitated byincubation with serum from BD rats, rabbit anti-rat immu-noglobulin G (Pel-Freez), and staphylococcal protein A(Pansorbin; Calbiochem) (30). Immunoprecipitated proteins

6572

Page 2: Sequence Similarity Disease Virus p40 and ...

BDV p40 AND PARAMYXOVIRUS AND RHABDOVIRUS POLYMERASES 6573

were analyzed by sodium dodecyl sulfate-polyacrylamide gelelectrophoresis (SDS-PAGE) and autoradiography.Computer analyses. All computer analyses were con-

ducted on a SPARCstation 2GS running SUN OS 4.1.1. Thedata base accessed for searches and sequence extraction wasthe nonredundant data base composed of PIR, 30.0, SWISS-PROT, 20.0, and GenPept (translated GenBank, 70.0) anddeveloped by the National Center for Biotechnology Infor-mation, National Library of Medicine (17a).The strategy used to detect the relationships between the

p40 and L-polymerase sequences consisted of four stages:sensitive data base screening, initial pairwise alignment,multiple alignment, and manual refinement (26, 28). The database screening was performed by using the original BasicLocal Alignment Search Tool (gBLAST) that allows forextensions containing gaps (source code provided by E. W.Myers) (1). The gBLAST program retrieves many sequencesthat are clearly unrelated to the probe sequence, and biolog-ical knowledge is necessary to distinguish potential relation-ships from this high background. On the basis of smallregions of identity found by gBLAST, segments equivalentin size to the p40 sequence were cut from the rhabdovirus

CTAAAGCCCAAGAGACGCCTGGTTGATGACGCCGATGCC ATG GAG GAC CAA GAT TTA TAT GAA CCC +27M E D Q D L Y E P

CCA GCG AGC CTC CCC AAG CTC CCC GGA AAA TTC CTA CAA TAC ACC GTT GGG GGG TCT +84P A S L P K L P G K F L 0 Y T V G G S

GAC CCG CAT CCG GGT ACA GGG CAT GAG AAG GAT ATC AGG CAG AAC GCA GTG GCA TTG +141D P H P G T G H E K D I R 0 N A V A L

TTA GAC CAG TCA CGG CGC GAT ATG TTT CAT ACA GTA ACG CCC AGC CTT GTG TTT CTA +198L D 0 S R R D M F H T V T P S L V F L

TGT TTG CTA ATC CCA GGA CTG CAC GCT GCG TTT GTT CAC GGA GGG GTG CCT CGT GAA +255C L L I P G L H A A F V H G G V P R E

TCT TAC CTG TCG ACG CCT GTT ACG CGT GGG GAA CAG ACT GTC GTT AAG ACT GCA AAG +312S Y L S T P V T R G E Q T V V K T A K

TTT TAC GGG GAA AAG ACA ACA CAG CGT GAT CTC ACC GAG CTG GAG ATC TCC TCT ATA +369F Y G E K T T 0 R D L T E L E I S S I

TTC AGC CAT TGT TGC TCA TTA CTA ATT GGG GTT GTG ATA GGA TCG TCA TCT AAG ATT +426F S H C C S L L I G V V I G S S S K I

AAA GCA GGA GCC GAG CAG ATC AAG AAA AGG TTT AAA ACT ATG ATG GCA GCC TTA AAC +483K A G A E Q I IK K R F KI T M M A A L N

CGG CCA TCC CAT GGT GAG ACT GCT ACA CTA CTT CAG ATG TTT AAT CCA CAT GAG GCT +540R P S H G E T A T L L 0 M F N P H E A

ATA GAT TGG ATT AAC GGC CAG CCC TGG GTA GGC TCC TTT GTG TTG TCT CTA CTA ACT +597I D W I N G 0 P W V G S F V L S L L T

ACA GAC TTT GAG TCC CCA GGT AAA GAA TTC ATG GAT CAG ATT AAA CTT GTC GCA AGT +654T D F E S P G K E F M D 0 I K L V A S

TAT GCG CAG ATG ACT ACG TAC ACT ACT ATA AAG GAG TAC CTC GCA GAA TGT ATG GAT +711Y A 0 M T T Y T T I K E Y L A E C M D

GCT ACC CTT ACA ATC CCT GTA GTT GCA TAT GAG ATT CGT GAC TTT TTA GAA GTT TCA +768A T L T I P V V A Y E I R D F L E V S

GCA AAG CTT AAA GAG GAA CAT GCT GAC CTG TTT CCG TTC CTG GGG GCT ATT CGG CAC +825A K L K E E H A D L F P F L G A I R H

CCC GAC GCT ATC AAG CTT GCG CCA CGG AGC TTT CCC AAT CTG GCT TCT GCA GCG TTT +882P D A I K L A P R S F P N L A S A A F

TAC TGG AGT AAG AAG GAG AAT CCC ACA ATG GCG GGC TAC CGG GCC TCC ACC ATC CAG +939Y W S K K E N P T M A G A S V K E G Y

CCG GGC GCG AGT GTC AAG GAG ACC CAG CTT GCC CGG TAT AGG CGC CGC GAG ATA TCT +996R A S T I Q P T 0 L A R Y R R R E I S

CGC GGG GAA GAC GGG GCA GAG CTC TCA GGT GAG ATC TCT GCC ATA ATG AGA ATG ATA +1053R G E D G A E L S G E I S A I M R M I

GGT GTG ACT GGT CTA AAC TAG AAAACAATGAACAAACCAATAAAAAAA +1074G V T G L N *

FIG. 1. Nucleic acid sequence and deduced amino acid sequencefor BDV p40. Underlined nucleotides indicate a consensus sequencefor initiation of translation; an asterisk indicates the terminationcodon for a protein with a predicted molecular mass of 39.5 kDa.Boxed residues indicate a putative nuclear targeting motif. Under-lined amino acids correspond to direct microsequence data for a40-kDa protein extracted from BD rat brain tissue (11).

A B C D E F

46-

P4030-

FIG. 2. In vitro translation of a 40-kDa protein from clonepBDV-40. Synthetic mRNAs transcribed from pBDV-40 RNA andpoly(A)+ RNAs from normal rat brain or BD rat brain weretranslated in vitro by using rabbit reticulocyte lysates. [35S]methio-nine-labeled translation reactions were immunoprecipitated firstwith normal rat serum or BD rat serum, second with rabbit anti-ratimmunoglobulin G, and last with staphylococcal protein A cells.Immunoprecipitated protein products were analyzed by autoradiog-raphy after SDS-PAGE. Lanes: A, normal rat brain RNA, normalrat serum; B, normal rat brain RNA, BD rat serum; C, BD rat brainRNA, normal rat serum; D, BD rat brain RNA, BD rat serum; E,pBDV-40 RNA, normal rat serum; F, pBDV-40 RNA, BD ratserum. Molecular weight markers (103) are indicated at the left.

and paramxyovirus L-polymerase sequences and used in thesubsequent stages of analysis. Pairwise alignments wereperformed for all combinations of the p40 sequence and allL-polymerase segments (13). The scoring matrices employedwere the unitary matrix and the minimum-mutation matrix(8, 34). Progressive multiple alignments were generated witha user-specified weighting option set at a value of 2 (12).Manual refinements were introduced to identify small re-gions of similarity not detected by the multiple alignmentprogram, to allow alternative gapping when this wouldproduce more consistent local region relationships or tominimize the mutational events required to align one set ofsequences to another.

Phylogenetic reconstruction was conducted by using amethod that finds the most parsimonious tree on the basis ofthe multiple alignment (15, 17). The tree on which the fewestnucleotide substitutions are required to account for all of theamino acid differences among the input sequences aligned isthen corrected for missing nucleotide substitutions. Thiscorrection may underestimate the total number of substitu-tions, but it does not bias the relative rates of substitution(16).

Nucleotide sequence accession number. The BDV p40 genesequence (see Fig. 1) has been deposited in GenBank underaccession number M99375.

RESULTSIsolation and sequencing of clone pBDV-40. BDV cDNA

clones were isolated from a BD rat brain cDNA library by asubtractive-hybridization method. One of these clones hy-brid arrested translation of a BDV-specific 40-kDa proteinfrom a BD rat brain poly(A)+ RNA fraction (24). This clonepredicts a 34-kDa protein, and an oligonucleotide corre-sponding to the 5' end of the protein was used as a probe torescreen the library for full-length cDNAs that encode p40.Two independent clones were identified and sequenced

VOL. 66, 1992

Page 3: Sequence Similarity Disease Virus p40 and ...

6574 McCLURE ET AL.

*W.*.4.3 A :3:PC 3.3:

3040M40W1.330 ... .. . - a

44444 to. : 34AA

x x33o Po. 000 > 33 3p*..MaWM:*w x

0m,o 4 ixI__ga

4433I *4.1.13zz">4 :>A. 10

03I,thx : 1S.:. 1:

44M .M

W zg. ~0 UJM333

>:A 41.44MW4

,14 P.F 04 a" "0 PC Peas,

a1::3)3A. OMON4 0003J>>FE~~~~~c 'aasEe::,> :>

*#,:W.)4W wNiIi03 to 3a

I 4SC IC bbo oI- owa a

McMc a >is a 4 33a

00;k:iZ:W*:X-a F:ft Os P

01. F.11..4:-03 M z

33331.0,43 ~~~0. 1

to0 1 03 033Ay:3:434434 F :4433t1.1.b

:00 3 ::

eiSAR.R~~14C--> PBx

004 1.11

33331.330 01.3~~~n03 04

1.X. 031. I33

c*A l,,3,0aA3 A

333.13 .1

go" 004 .

1.31 N1.UNEyU^XP |9

:B01 341.3104~>P3M M

000.01.1.33M.4.014 44X .

4000.;u 1.

_xx>

a,~~~I ""ox aco

M4M 1.U.M4rIUa

>u>0033. MMP

4..04333 *4 00 '

bChCKSIChCY,Y, ,YYd m:tA)K1.1.3031.33~~~~~84 8

.0M*41A WW0m 9aMMCIA D. .

a#a4*4*4a-a . K44*4*

W 1.-mUo 04 44wa44

us40. -:4*3k41.3 0o4AK 0McX 4 4300

3441.1.3 :03*00:44.1033 0333341

33:t *:bW:bw: :3*:43

_ c z a zy m

go >e 04F 4 > 3P3

333 .1 00003~E- H 0

> >

2aesomF toF

zl3 X~~~~~~bb""IC a

313053 M~U.Gc3..40U

:_0030. :^ :00:0:

as o lx P.,c_l

.4 .4a > > ut:t:st7 tXpt:sto

44

K4 6

~~A~~~353A~~3333

U-1 94aQ:Acl

*444,304 43.3 3

330*4 33~~~X*4A

O 001

00 4003Le

MMM_

33 :443433g 0 0 34 M734om

Xkmasto....

0i :bczZPCaZ9

P.qt0 - PonXPPPOP

33041.*4*4 1.~ Mt 1 m

z ~ ~~~~Xto

UW: 31. *4 :041

"W~~~~~~~ uo:W.oCa KM Cm C

3 3

0X 3 331.333

aina

:4:W 3 :4344:3

043 -E0, 4 4*

1..1.1.301.010OOOMs .

oaa

" 333 3 s"4D"3,4

0 MMM .3OK44O03

>3> 33 0 3 34 1 33

1.31. x1.1.*4 343.131

oUa.1OK3::i::1:.33.1o . 3 *4*334144

:3 zzz43, _ *444MM1333 000 003 .**Y9v*33433331. 1.u 3MM>s^

3s 3e 0000:Ks:| 'SA

1.:3sa 4444 ;wZP"<PI3333aaaEs : _ 1.*4,:|*:a:|*:P : *1.1i.1 #.t.1ecolSlac<1.41.1.

: ¢as X:B331.3*4^lCP@£Z0 44430aoPeP<A@XMP PZoa440,4,4,** 4 sc3 333333444."*3p 4o*1 3444*44333~ : XR444444)34 - w444:1.4434330 441.

3333*3 4t1

1.0MM;6W>*4 E < 4:;P-i|:

030"*4,4w" 3 .::EF343431,4. * 3a 41.330a1.1iX.33:.1333a 33.MM o 1 1

in aciatotoml000:01.13z 3Bo Efl1 M UstZAN "No "as<""e

MMg" OK _ 'iozMUZ3So .1.13.3.13 333 *4 331.1433W

353 1.35<^X:AE

0 331.E4 WA3 Mu044 *41M::34:0 4 £444,4

SSlCW F¢Za e 4ocoaa a Z a aat1.4MZMWM3M C4333 00033M3M0 .I..1.1

FF-h>4va~~~~It3 :0 301.

J:.p:. 0 91.30414e:4:j :a; 4 :*i 43

~~33000 1~.1.M040

:0:00*3:000 n.awl

0 0NOK 3MM3

~~3~~3~~~3~ MDi3O 4MoMMMmo ~~~4 333433331

00000 1. Ml. 3

MMM01a 1. 041333433Do34Oi4*4M41.3

mm'" Z-fi: i^.t .....a.g!01-{

:,W.4 .oot 3 33 3

g_ oa lm lm4N=oi

*oM. >M04 M3 A30*. Mc

W0004 I4 N ml in l o0443jo*i

*4444*111 4 M3M11.3

.)30WzOM33.~.tMN0

E*> -a bqSC_

,iwq-:i; in-p P . 04*t< *1>~~~~~~~~I Cmt4n>:CACPC as

:43444OsCB1

:Ii4 K-,W4 04tMMMM mg3 -t000 soF

:W,,:WW:4CX o o > :BSr": NC

0333 WM .4.4

3oo P .4.4.4 3scbcbsaF P o8t~~~~~~1

0031.1.4

M3a

*4*4,41.0

ag

1 -~~~~~~~9 _HIa

oa 0a a 0" " o04030>03

bh.BB .3333.1000a

>t*-mfi a331.31.il 00

aaM a.a1P1 P.1..> >0a s : 3:>0

M IKWO.*S A

033*3O 4.40331w *

u u> usoml

......

P ,aIE3.4fl000001400

0~ ~~~4IH1 4*

M 4 X

Po:* M M

A003W1 *060-N *4 30 014

34", 4Z l4Z

ko: .t t P.:

1.34

4 M3 3- f

.4 l P

o3 .4

Z l"

33A">_

_ 4

.133 B :| : 643l a B

!.4 ll*4ll

__________~~~~~~Cco4 en___to__A.__

2.. !. 2 . -.:,.,b. " :.-- :.-

II I I to . i-. b. '441 . ,. 'N X4 p 'IN, 1.4- .

J. VIROL.

I

is 3c In 19u 0 PI.!!a At 09AOft.

Wi 'o

x P1.4.4.4.40 30 3K u C) u u u

14 P.- " bo I" 14

1. 64 9-14 14Am

Page 4: Sequence Similarity Disease Virus p40 and ...

BDV p40 AND PARAMYXOVIRUS AND RHABDOVIRUS POLYMERASES 6575

Fes lnase-like kinaselie

1 335 7i8 1073 1404 2109

amino acid residuesFIG. 4. Gene map of the rhabdovirus polymerase indicating the

positions of the duplicated regions relative to the previously de-scribed kinase-like regions. Fes is the feline sarcoma virus oncogeneproduct, and PDGFR is the platelet-derived growth factor receptor;both are tyrosine kinase proteins. The numbers indicate the begin-ning and end of each segment in the VSV Indiana strain polymeraseprotein sequence.

along both strands. The two clones are identical in sequence.Each has a translation initiation signal (20) and encodes a357-residue protein with a predicted molecular mass of 39.5kDa. The amino acid sequence contains a positively charged5-amino-acid residue region consistent with a nuclear target-ing signal (7) (Fig. 1, boxed residues). The inferred proteinsequence is similar to a BDV p40 microsequence extractedfrom BD rat brain tissue (Fig. 1, underlined residues) (11).

In vitro transcription-translation of clone pBDV40. Toconfirm the size of the inferred protein, coupled in vitrotranscription-translation experiments were performed byusing clone pBDV-40 as a transcription template. Poly(A)+RNA from normal rat brain and BD rat brain tissues weretranslated in vitro as controls. Protein products were immu-noprecipitated with serum from normal or BD rats andanalyzed by SDS-PAGE and autoradiography. Both clonepBDV-40 template RNA and poly(A)+ RNA from BD ratbrain tissue directed translation of a 40-kDa protein that wasspecifically immunoprecipitated by serum from BD rats.Normal rat brain poly(A)+ RNA translation products werenot immunoprecipitated by serum from BD rats. Normal ratserum did not precipitate translation products (Fig. 2).

14SENDAI-1R A 3~~~PRA-1

114 22s MEASLES-1116 W339 NCDV-2134,442 RSV-1

VSV-IND-1

VSV-NJ-1

27 RBIES-1

4~~~~~~~~~~~~~~nSENDAI-2

,o _ -_4-MEASLES-2

691~~~~~7

-2 NCDV-2

691 BDV-P40

0 200 400 600 800 1000

NUCLEOTIDE SUBSTITUTIONSFIG. 5. The most parsimonious tree for the 17 amino acid

sequences of the multiple alignment (Fig. 3). Each branch numberindicates the number of nucleotide substitutions required in thedescent from the ancestral node to each tip. The black oval indicatesthe duplication event that gave rise to the paralogous segments ofthe L gene of paramyxoviruses and rhabdoviruses prior to theirdivergence from one another. All abbreviations are as in Fig. 3. Thenumbers indicate duplications I and II, respectively.

VOL. 66, 1992

Page 5: Sequence Similarity Disease Virus p40 and ...

6576 McCLURE ET AL.

Analysis of p40 sequence. The nonredundant data base wassearched with the p40 sequence by using gBLAST (seeMaterials and Methods). The L-polymerase proteins ofparamyxo- and rhabdoviruses were the only viral sequencesconsistently retrieved by the program. Protein sequences ofpositive-strand RNA viruses were not found to have anysimilarities to p40 at the level detected by gBLAST.Two different regions in the L-polymerases of the two

negative-strand RNA virus families are similar to the p40protein sequence. Expansion of each region of similaritybetween the polymerase and p40 sequences was carried outby generating a multiple alignment (12). A block of residuesis considered a motifwhen the p40 sequence is identical to atleast two residues within each set of rhabdovirus andparamyxovirus sequences in each duplication. Most of thesequences analyzed exhibited conservation of motifs in aspecific order (Fig. 3, regions 1 to 9). Region 1 and thecentral part of region 6 are highly conserved among theparamyxo- and rhabdoviruses in the region we designatedduplication I (35).The duplication detected by the p40 protein sequence

probe indicated a previously undetected relationship withinthe large negative-strand RNA polymerases. The L gene ofboth vesicular stomatitis virus (VSV) and rabies virus con-tains two distinct regions with distant similarity to the felinesarcoma virus oncogene product (FES) and the platelet-derived growth factor receptor (PDGF-R) tyrosine kinaseproteins, respectively (28). Only serine kinase activity hasbeen reported for the rhabdoviruses; however, it has re-cently been demonstrated that several protein sequencesresembling serine-threonine kinases phosphorylate tyrosineresidues (23). Duplication I is proximal to the FES-likesegment, while duplication II is distal to this segment,overlapping the PDGFR-like kinase region (Fig. 4). Theparamyxoviruses also contain distantly similar kinase-likesequences in analogous positions (27). The overlappingportion of p40 does not contain the residues conserved in thekinase domains (18).Our data also indicate that an internal duplication event

occurred in the duplication I segment of the ancestralparamyxovirus L-polymerase gene and that an even smallerduplication occurred subsequent to the divergence of respi-ratory syncytial virus (Fig. 3, boxed segments). The asparticacid residue conserved in the larger of the two duplicationsis suggested to be involved in the catalytic site of theparamyxovirus polymerase (Fig. 3, asterisks) (31).The regions conserved between duplications I and II for

both paramyxovirus and rhabdovirus begin to define theresidues that may play a role in L-polymerase functions (Fig.3, shaded columns). Motifs that are conserved betweenduplications I and II and p40 are good candidates forresidues involved in enzymatic functions that have beenconserved since the divergence of these sequences from oneanother (Fig. 3, regions 1 to 9).The phylogenetic tree, based on the multiple alignment

illustrated in Fig. 3, exhibits topological congruency for thelineages of the duplication I and II segments (Fig. 5). Thistopology is in agreement with a consensus phylogenetic treederived from the L gene sequences of paramyxovirus andrhabdovirus (35). The sequence analysis suggests that p40last shared a common ancestor with the rhabdovirus andparamyxovirus sequences prior to the duplication event thatgave rise to the extant polymerase genes (Fig. 5). If thishypothesis is correct, p40 should be a mosaic of duplicationsI and II; i.e., some regions of p40 should resemble duplica-tion I, while others should resemble duplication II, owing to

chance exploration of alternative sequence space (Fig. 3,vertical bars).

DISCUSSION

We have isolated and sequenced a BDV-specific cDNAfrom a BD-infected rat brain plasmid library. The inferredprotein sequence predicts a 39.5-kDa protein, and in vitrotranscription-translation studies confirm that clone pBDV-40does encode a 40-kDa protein (Fig. 1 and 2).A search of the nonredundant protein sequence data base

revealed a similarity between the inferred p40 sequence ofBDV and the L-polymerase proteins of paramyxovirusesand rhabdoviruses. This relationship defines a duplicatedregion within the polymerase genes (Fig. 3). Duplications Iand II are located distal and proximal, respectively, to theFES-like kinase sequences of the VSV polymerase (Fig. 4).Phylogenetic reconstruction indicates that p40 last shared acommon ancestor with the rhabdovirus and paramyxoviruspolymerase genes prior to the event that gave rise to theparalogous portions of these proteins (Fig. 5).

Little is known about the distribution of enzymatic func-tions along the L-polymerase proteins. It has been suggestedthat viral RNA-dependent RNA polymerases and reversetranscriptase proteins share a common ancestry on the basisof the conservation of four putative motifs, each of whichconserves only one amino acid (9, 31). Three of theseconserved residues (aspartic acid, glycine, and aspartic acid)are found in the consensus context in the L-polymeraseproteins of paramyxoviruses and rhabdoviruses (Fig. 3,asterisks). Functional studies of reverse transcriptase dem-onstrated that the two aspartic acid residues are essential topolymerase activity and cannot be substituted by glutamicacid (5, 22). It has been suggested that the presence of theseresidues in the L-polymerase of negative-strand RNA vi-ruses defines the site of polymerase activity (9, 31, 35). Thismodel suggests that p40 is unlikely to be a polymerasebecause aspartic acid residues are not found in the appropri-ate positions (Fig. 3, asterisks). There is no direct evidence,however, to indicate that the duplication I region is the siteof polymerase activity. If p40 is a viral polymerase, assequence similarities to the L-polymerase proteins suggest,it is necessary to consider a new model for the site ofpolymerase activity in paramyxoviruses and rhabdoviruses.One or more of the conserved motifs we have described maybe the active site.The identification of a duplicated region within the

paramyxovirus and rhabdovirus L-polymerase genes raisesthe possibility that one duplication provides the replicationfunction (i.e., readthrough of termination signals) while theother acts as the transcriptase. Another possibility is that theencoded proteins fold so that the duplications act in concertwith one another like a double-domain protein providingboth replication and transcriptase functions. By analogy, p40would be predicted to act as a dimer. Although the role ofp40 in the virus life cycle remains to be determined, itsnuclear localization and sequence similarity to the rhabdo-virus and paramyxovirus polymerase sequences suggest thatit is likely to function as a polymerase and that BDV is likelyto be related to the order Mononegavirales.

ACKNOWLEDGMENTSWe thank W. M. Fitch, J. Perrault, and D. Summers for construc-

tive criticism of the manuscript.Support was provided by NIH grants AI-28309 (M.A.M.) and

NS-29425 (W.I.L., K.J.T., and C.G.H.) and UC Taskforce on AIDS

J. VIROL.

Page 6: Sequence Similarity Disease Virus p40 and ...

BDV p40 AND PARAMYXOVIRUS AND RHABDOVIRUS POLYMERASES 6577

grant R91I047 (W.I.L., K.J.T., and C.G.H.). W.I.L. is a recipient ofa Young Investigator Award from NARSAD and a Pew ScholarsAward from the Pew Charitable Trusts.

REFERENCES1. Altschul, S. F., W. Gish, W. Miller, E. W. Myers, and D. J.

Lipman. 1990. Basic local alignment search tool. J. Mol. Biol.215:403-410.

2. Amsterdam, J., A. Winokur, W. Dyson, S. Herzog, F. Gonzalez,R. Rott, and H. Koprowski. 1985. Borna disease virus. Arch.Gen. Psychiatr. 42:1093-1096.

3. Bause-Niedrig, I., G. Pauli, and H. Ludwig. 1991. Borna diseasevirus-specific antigens: two different proteins identified bymonoclonal antibodies. Vet. Immunol. Immunopathol. 27:293-301.

4. Bode, L., S. Riegel, H. Ludwig, J. Amsterdam, W. Lange, and H.Koprowski. 1988. Borna disease virus-specific antibodies inpatients with HIV infection and with mental disorders. Lancetii:689.

5. Boyer, P. L., A. L. Ferris, and S. H. Hughes. 1992. Cassettemutagenesis of the reverse transcriptase of human immunode-ficiency virus type I. J. Virol. 66:1031-1039.

6. Carbone, K., T. Moench, and W. I. Lipkin. 1991. Borna diseasevirus replicates in astrocytes, Schwann cells and ependymalcells in persistently infected rats: location of viral genomic andmessenger RNAs by in situ hybridization. J. Neuropathol. Exp.Neurol. 50:205-214.

7. Chelsky, D., R. Ralph, and G. Jonak 1989. Sequence require-ments for synthetic peptide-mediated translocation to the nu-cleus. Mol. Cell. Biol. 9:2487-2492.

8. Dayhoff, M. 0. 1978. A model of evolutionary change inproteins. Detecting distant relationships, p. 345-358. In M. 0.Dayhoff (ed.), Atlas of protein sequence and structure. NationalBiomedical Research Foundation, Washington, D.C.

9. Delarue, M., 0. Poch, N. Tordo, D. Moras, and P. Argos. 1990.An attempt to unify the structure of polymerases. Protein Eng.3:461-467.

10. de la Torre, J., K. Carbone, and W. I. Lipldn. 1990. Molecularcharacterization of the Boma disease agent. Virology 179:853-856.

11. Dietzschold, B. Personal communication.12. Feng, D.-F., and R. F. Doolittle. 1987. Progressive sequence

alignment as a preresequisite to correct phylogenetic trees. J.Mol. Evol. 25:351-360.

13. Feng, D.-F., M. S. Johnson, and R. F. Doolittle. 1985. Aligningamino acid sequences: comparison of commonly used methods.J. Mol. Evol. 21:112-125.

14. Fields, B. N., and D. M. Knipe. 1991. Fundamental virology, 2nded. Raven Press, New York.

15. Fitch, W. M. 1971. Minimum change for a specific tree topology.Syst. Zool. 20:406-416.

16. Fitch, W. M., and M. Bruschi. 1987. The evolution of prokary-otic ferredoxins-with a general method correcting for unob-served substitutions in less branched lineages. Mol. Biol. Evol.4:381-394.

17. Fitch, W. M., and J. S. Farris. 1974. Evolutionary trees with theminimum nucleotide replacements from amino acid sequences.J. Mol. Evol. 3:263-278.

17a.Gish, W. Personal communication.

18. Hanks, S. K., A. M. Quinn, and T. Hunter. 1988. The proteinkinase family: conserved features and deduced phylogeny of thecatalytic domains. Science 241:42-52.

19. Kingsbury, D. W. 1991. In H. Fraenkel-Conrat and R. R.Wagner (ed.), The paramyxoviruses. Plenum Press, New York.

20. Kozak, M. 1987. An analysis of 5'-noncoding sequences from699 vertebrate messenger RNAs. Nucleic Acids Res. 15:8125-8148.

21. Krieg, P. A., and D. A. Melton. 1984. Functional messengerRNAs are produced by SP6 in vitro transcription of clonedcDNAs. Nucleic Acids Res. 12:7057-7070.

22. Larder, B. A., D. J. M. Purifoy, K. L. Powell, and G. Darby.1987. Site-specific mutagenesis of AIDS virus reverse tran-scriptase. Nature (London) 327:716-717.

23. Lindberg, R. A., A. M. Quinn, and T. Hunter. 1992. Dual-specificity protein kinases: any hydroxyl will do. Trends Bio-chem. Sci. 17:114-119.

24. Lipkin, W. I., G. Travis, K. Carbone, and M. Wilson. 1990.Isolation and characterization of Boma disease agent cDNAclones. Proc. Natl. Acad. Sci. USA 87:4184-4188.

25. Ludwig, H., L. Bode, and G. Gosztonyi. 1988. Boma disease: apersistent virus infection of the central nervous system. Prog.Med. Virol. 35.

26. McClure, M. A. 1992. Evolution of retroposons by acquisitionor deletion of retrovirus-like genes. Mol. Biol. Evol. 8:835-856.

27. McClure, M. A. Unpublished data.28. McClure, M. A., and J. Perrault. 1989. Two domains distantly

related to protein-tyrosine kinases in the vesicular stomatitisvirus polymerase. Virology 172:391-397.

29. Pelham, H. R. B., and R. J. Jackson. 1976. An efficient mRNA-dependent translation system from reticulocyte lysates. Eur. J.Biochem. 67:247-256.

30. Persson, H., L. Henninhausen, R. Taub, W. DeGrado, and P.Leder. 1984. Antibodies to human c-myc oncogene product:evidence of an evolutionarily conserved protein induced duringcell proliferation. Science 225:689-693.

31. Poch, O., I. Sauvaget, M. Delarue, and N. Tordo. 1989. Identi-fication of four conserved motifs among the RNA-dependentpolymerase encoding elements. EMBO J. 8:3867-3874.

32. Richt, J., S. VandeWoude, M. Zinc, 0. Narayan, and J. Clem-ents. 1991. Analysis of Borna disease virus-specific RNAs ininfected cells and tissues. J. Gen. Virol. 72:2251-2255.

33. Rott, R., S. Herzog, B. Fleischer, A. Winokur, J. Amsterdam,W. Dyson, and H. Koprowski. 1985. Detection of serum anti-bodies to Borna disease virus in patients with psychiatricdisorders. Science 228:755-756.

34. Schwartz, R. M., and M. 0. Dayhoff. 1978. Matrices for detect-ing distant relationships, p. 353-358. In M. 0. Dayhoff (ed.),Atlas of protein sequence and structure. National BiomedicalResearch Foundation, Washington, D.C.

35. Stec, D. S., I. Hill, and P. L. Collins. 1991. Sequence analysis ofthe polymerase L gene of human respiratory syncytial virus andpredicted phylogeny of nonsegmented negative-strand viruses.Virology 183:273-287.

36. VandeWoude, S., J. Richt, M. Zink, R. Rott, 0. Narayan, and J.Clements. 1990. A Borna virus cDNA encoding a proteinrecognized by antibodies in humans with behavioral diseases.Science 250:1276-1281.

VOL. 66, 1992