Top Banner
THE JOURNAL OF BIOLOGICAL CHEMISTRY Vol. 261, No. 34, Issue of December 5, pp. 15989-15994,1986 Printed in U. S. A. Identification of a Second Mutation in the Protein-coding Sequence of the Z Type Alpha l-Antitrypsin Gene* (Received for publication, May 27, 1986) Toshihiro NukiwaS, Ken Satoh, Mark L. Brantly, FumitakaOgushi, Gerald A. Fells, Michael Courtney, and Ronald G. Crystal From the Pulmonary Branch, National Heart, Lung, and Blood Institute, Bethesda, Maryland 20892 and Transgene, SA Strasbourg, France This study reports the entire nucleotide sequence of the protein codingregionsequence of the alpha 1- antitrypsin (alAT) Z gene, a common form of the alAT gene associated with serum alAT deficiency. In addi- tion to Glu342 to LysS4’ mutation in exon V which has been previously identified by peptide analysis, another point mutation (GTG to GCG in exon 111) in the gene sequence predicts a secondaminoacidsubstitution (Va1213 to Ala213) in the Z protein. This Va1213 to Ala213 mutation was confirmed to be a general finding in Z type a1AT gene by evaluating genomic DNA from 40 Z haplotypes using synthetic oligonucleotide gene probes directed toward the mutated exon I11 sequences in the Z gene. Furthermore, the exon I11 Va1213 to Ala213 mutation eliminates a BstEII restriction endonuclease site in the alAT Z gene, allowing rapid identification of this Va1213 to Ala213 substitution at the genomic DNA level. Surprisingly, when genomic DNA samples from individuals thought to be homozygous for the M1 gene (the most common alAT normal haplotype) were eval- uated with BstEII, 23% of the M1 haplotypes were BstEII site negative, thus identifying a new form of M1 (Le. M1(Ala2I3)), likely identical to M1 but with an isoelectricfocusing“silent”aminoacidsubstitution (Va1213 to Ala213).Although the relative importance of the newly identified exon I11 Va1213 to Ala213 mutation to the pathogenesis of theabnormalities associated with the Z gene is not known, it is likely that M1(Ala213) gene represents a common “normal” poly- morphism of the alAT gene that served as an evolu- tionary intermediate between the M1(VaP3) and Z genes. Alpha l-antitrypsin (a1AT’) is an antiprotease that func- tions primarily asaninhibitor of neutrophil elastase, an * The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “aduertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. The nucleotide sequence(s) reported in this paper ha been submitted to the GenBankTM/EMBL Data Bank with accession number(s) 50261 4. 4 To whom reprint requests should be addressed Bldg. 10, Rm 6D03, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892. For clarity, we have adopted the following nomenclature. If the IEF is M1 but the sequence unknown, the haplotype is referred to as M1. If the genomic DNA sequence around the 213 regionis examined (e.g. by the restriction endonuclease BstEII or with oligonucleotide probes), M1(VaP3) and M1(Ala213) will be used to identify the two haplotypes (i.e. the IEF pattern is M1, but the two proteins differ at residue 213.) Other abbreviations used are: kb, kilobase pairs; IEF, isoelectric focusing. omnivorous protease capable of destroyingmostforms of connective tissue (1). Coded for by a 10.2 kb at the chromo- somal segment of 14q31-32 (2), the a1AT gene is comprised of five exons and four introns (3). The a1AT gene is expressed in liver hepatocytes and in mononuclear phagocytes as a 1.75- kbmRNA that is translated and secreted into the rough endoplasmic reticulum as a 418-amino acid precursor protein containing a 24-residue signalpeptide (3-6). In the rough endoplasmic reticulum, N-linked carbohydrates areadded to each of 3 asparaginyl residues, the protein is translocated to the Golgi where the high mannose carbohydrates are trimmed, and the glycosylated a1AT is secreted as a mature protein of 394 amino acids (5-8). The mature protein circulates in the plasma with a half-life of approximately 5 days (9, lo), and it diffuses into all tissues where it functions to inhibit neutrophil elastase. The a1AT gene is highly pleomorphic with more than 30 known haplotypes of a1AT identified by isoelectric focusing (IEF) of serum (11-13). The two parental haplotypes are codominantly expressed. The most common haplotypes are of the “family, including M I (haplotype frequency in the United States Caucasian population 68-76%), M2 (14-20%), and M3 (10-12%) (11,14,15). Inheritance of any homozygous or heterozygous combinations of the “family proteins is associatedwith “normal” levels of a1AT (150-350 mg/dl) (ll), and they function similarly as excellent inhibitors of neutrophil elastase (16), with an association rate constant in the order of lo7 M” s” (17). In contrast to the “family haplotypes, the Z haplotype is associated with low plasma levels of d A T , i.e. “alAT defi- ciency” (11, 13). Typically, individuals homozygous for the Z protein have a1AT levels 10-15% of normal (11). The Z gene represents 1-2% of all a1AT haplotypes of individuals of European descent (14, 15). Importantly, the ZZ homozygous state is associated in children with neonatal hepatitis,choles- tasis, and cirrhosis, and in adults, with emphysema developing by ages 30-40 (11, 18-20). The emphysema is thought to develop because there is insufficient alAT available to protect the fragile alveolar structures from their burden of neutrophil elastase; as a result there is slow, progressive destruction of the alveolar walls from the uninhibitedelastase (19). Since theobservation in 1976 that a tryptic digest of the Z protein contained a single amino acid substitution (MI G ~ u ~ ~ to Z LYS~~’), it has been assumed that this substitution is responsible for the deficiency and associated clinical manifes- tations of the ZZ homozygous state. As part of ageneral evaluation of alAT gene structure associated with alAT deficiency, we have cloned and sequenced the Z alAT gene. To our surprise, we found that in addition to the “classic” exon V G ~ u ~ ~ ~ to Lys342 substitution, the Z gene contains a second amino acid mutation (exon 111,Va1213 to Ala213). Fur- 15989
6

Identification of a second mutation in the protein-coding sequence of ...

Dec 31, 2016

Download

Documents

dinhhanh
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Identification of a second mutation in the protein-coding sequence of ...

THE JOURNAL OF BIOLOGICAL CHEMISTRY Vol. 261, No. 34, Issue of December 5, pp. 15989-15994,1986 Printed in U. S. A.

Identification of a Second Mutation in the Protein-coding Sequence of the Z Type Alpha l-Antitrypsin Gene*

(Received for publication, May 27, 1986)

Toshihiro NukiwaS, Ken Satoh, Mark L. Brantly, Fumitaka Ogushi, Gerald A. Fells, Michael Courtney, and Ronald G. Crystal From the Pulmonary Branch, National Heart, Lung, and Blood Institute, Bethesda, Maryland 20892 and Transgene, SA Strasbourg, France

This study reports the entire nucleotide sequence of the protein coding region sequence of the alpha 1- antitrypsin (alAT) Z gene, a common form of the alAT gene associated with serum alAT deficiency. In addi- tion to Glu342 to LysS4’ mutation in exon V which has been previously identified by peptide analysis, another point mutation (GTG to GCG in exon 111) in the gene sequence predicts a second amino acid substitution (Va1213 to Ala213) in the Z protein. This Va1213 to Ala213 mutation was confirmed to be a general finding in Z type a1AT gene by evaluating genomic DNA from 40 Z haplotypes using synthetic oligonucleotide gene probes directed toward the mutated exon I11 sequences in the Z gene. Furthermore, the exon I11 Va1213 to Ala213 mutation eliminates a BstEII restriction endonuclease site in the alAT Z gene, allowing rapid identification of this Va1213 to Ala213 substitution at the genomic DNA level. Surprisingly, when genomic DNA samples from individuals thought to be homozygous for the M1 gene (the most common a lAT normal haplotype) were eval- uated with BstEII, 23% of the M1 haplotypes were BstEII site negative, thus identifying a new form of M1 (Le. M1(Ala2I3)), likely identical to M1 but with an isoelectric focusing “silent” amino acid substitution (Va1213 to Ala213). Although the relative importance of the newly identified exon I11 Va1213 to Ala213 mutation to the pathogenesis of the abnormalities associated with the Z gene is not known, it is likely that M1(Ala213) gene represents a common “normal” poly- morphism of the alAT gene that served as an evolu- tionary intermediate between the M1(VaP3) and Z genes.

Alpha l-antitrypsin (a1AT’) is an antiprotease that func- tions primarily as an inhibitor of neutrophil elastase, an

* The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “aduertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper h a been submitted to the GenBankTM/EMBL Data Bank with accession number(s) 50261 4.

4 To whom reprint requests should be addressed Bldg. 10, Rm 6D03, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892.

For clarity, we have adopted the following nomenclature. If the IEF is M1 but the sequence unknown, the haplotype is referred to as M1. If the genomic DNA sequence around the 213 region is examined (e.g. by the restriction endonuclease BstEII or with oligonucleotide probes), M1(VaP3) and M1(Ala213) will be used to identify the two haplotypes (i.e. the IEF pattern is M1, but the two proteins differ at residue 213.) Other abbreviations used are: kb, kilobase pairs; IEF, isoelectric focusing.

omnivorous protease capable of destroying most forms of connective tissue (1). Coded for by a 10.2 kb at the chromo- somal segment of 14q31-32 (2), the a1AT gene is comprised of five exons and four introns (3). The a1AT gene is expressed in liver hepatocytes and in mononuclear phagocytes as a 1.75- kb mRNA that is translated and secreted into the rough endoplasmic reticulum as a 418-amino acid precursor protein containing a 24-residue signal peptide (3-6). In the rough endoplasmic reticulum, N-linked carbohydrates are added to each of 3 asparaginyl residues, the protein is translocated to the Golgi where the high mannose carbohydrates are trimmed, and the glycosylated a1AT is secreted as a mature protein of 394 amino acids (5-8). The mature protein circulates in the plasma with a half-life of approximately 5 days (9, lo), and it diffuses into all tissues where it functions to inhibit neutrophil elastase.

The a1AT gene is highly pleomorphic with more than 30 known haplotypes of a1AT identified by isoelectric focusing (IEF) of serum (11-13). The two parental haplotypes are codominantly expressed. The most common haplotypes are of the “family, including MI (haplotype frequency in the United States Caucasian population 68-76%), M2 (14-20%), and M3 (10-12%) (11,14,15). Inheritance of any homozygous or heterozygous combinations of the “family proteins is associated with “normal” levels of a1AT (150-350 mg/dl) (ll), and they function similarly as excellent inhibitors of neutrophil elastase (16), with an association rate constant in the order of lo7 M” s” (17).

In contrast to the “family haplotypes, the Z haplotype is associated with low plasma levels of d A T , i.e. “alAT defi- ciency” (11, 13). Typically, individuals homozygous for the Z protein have a1AT levels 10-15% of normal (11). The Z gene represents 1-2% of all a1AT haplotypes of individuals of European descent (14, 15). Importantly, the ZZ homozygous state is associated in children with neonatal hepatitis, choles- tasis, and cirrhosis, and in adults, with emphysema developing by ages 30-40 (11, 18-20). The emphysema is thought to develop because there is insufficient a lAT available to protect the fragile alveolar structures from their burden of neutrophil elastase; as a result there is slow, progressive destruction of the alveolar walls from the uninhibited elastase (19).

Since the observation in 1976 that a tryptic digest of the Z protein contained a single amino acid substitution (MI G ~ u ~ ~ ~ to Z LYS~~’), it has been assumed that this substitution is responsible for the deficiency and associated clinical manifes- tations of the ZZ homozygous state. As part of a general evaluation of a lAT gene structure associated with a lAT deficiency, we have cloned and sequenced the Z a lAT gene. To our surprise, we found that in addition to the “classic” exon V G ~ u ~ ~ ~ to Lys342 substitution, the Z gene contains a second amino acid mutation (exon 111, Va1213 to Ala213). Fur-

15989

Page 2: Identification of a second mutation in the protein-coding sequence of ...

15990 Alpha 1 -Antitrypsin 2 Gene

thermore, through the evaluation of genomic DNA of what were thought to be a1AT M1 homozygotes, we have identified a new, common form of a1AT type M1 which shares the VaP3 to Ala213 mutation with the Z gene but has the same G ~ u ~ ~ ~ sequence as the common M1 gene.

EXPERIMENTAL PROCEDURES

Sources of Genomic DNA-Genomic DNA was isolated from white blood cells of individuals with various alAT phenotypes by the method of Jeffreys and Flavell (21). The a1AT phenotypes were identified by a combination of serum IEF, serum a1AT levels, and family studies (13, 22). The alAT serum levels were measured by radial immunodiffusion using the commercial standard (Behring Di- agnostics). In addition to the alAT heterozygote M3Z used for the cloning of the Z gene, genomic DNA was evaluated from 26 individuals with the serum phenotype MlMl and 20 individuals with the serum phenotype ZZ. As controls, DNA was evaluated from individuals with haplotypes M2 (n = 18), M3 (n = 6), and S (n = 7).

Cloning and Sequencing of the Protein-coding Sequence of the Z Type alAT Gene-Using complete EcoRI digestion, a 10-kb EcoRI fragment of genomic DNA from an individual with the alAT phe- notype M3Z encompassing the entire protein-coding regions (exons 11-V) of the alAT gene (3) was cloned into XgtWES as described previously (23). The Z and M3 clones were identified by hybridization with 19-mer oligonucleotide'gene probes specific for the DNA se- quences complementary to the amino acid sequences centered about Lys3" (Z gene) and G1u3" (M3 gene) (23, 24). The 10-kb Z clone was digested into three fragments with PstI (1.6 kb containing exon 11, 2.4 kb containing exons 111 and IV, and 1.1 kb containing exon V) and subcloned into pUC13. The double-stranded plasmid DNA with the insert was directly sequenced by the dideoxynucleotide chain termination method using bidirectional primers (25); 12,15-mer oli- gonucleotides were used to cover the sense sequence and 13,15-mer oligonucleotides to evaluate the antisense sequence of exons 11-V and neighboring intron regions (3).

Evaluation of alAT Genes for Restriction Fragment Length Poly- morphisms-After sequencing of the Z a1AT gene demonstrated a mutation in exon I11 at residue 213 (see "Results"), it became apparent that this mutation should result in a loss in a restriction site for the endonuclease BstEII (GJGTNACC). To evaluate this, a 2.4-kb region encompassing exons I11 and IV of the cloned Z gene was placed in pUC13, and a 0.95-kb fragment encompassing the entirety of exon I11 was isolated. This exon I11 probe included the sequence of the Z gene from a PstI site 5' to exon I11 to a BstEII site just 3' to exon 111. After labeling with [cx-~'P]~CTP by nick translation (261, this exon I11 probe was used to evaluate genomic DNA (5 rg/lane) digested with restriction endonucleases PstI and BstEII according to the manufacturer's recommendations and analyzed by the Southern pro- cedure.

Detection of Single Point Mutations Using Oligonucleotide Probes- To determine whether the two mutations found in the cloned Z type alAT gene were universally present in all a1AT genes of a population of individuals who appear to be Z homozygotes by conventional criteria utilizing serum a1AT analysis, 19-mer oligonucleotide probes were constructed complementary to the two Z gene mutations coding for amino acid substitutions (exon I11 M1 Va1213 to Z Ala213 mutation with the probes G-GAC-CAG-GTG-ACC-ACC-GTG for the M1 gene and G-GAC-CAG-GGG-ACC-ACC-GTG for the Z gene; exon V M1 Glu3" to Z Lys342 mutation with the probes AAC-ATC-GAC-GAG- AAA-GGG-A for the M1 gene and ACC-ATC-GAC-UG-AAA-GGG- A for the Z gene). The genomic DNA to be evaluated was cut with the restriction endonucleases EcoRI and BglI; together these two enzymes conveniently cleave the alAT gene such that each of the five exons are contained within each of five different size DNA fragments (23). All oligonucleotide probes were labeled with incor- poration of [32P]dNTPs by Escherichia coli DNA polymerase (Klenow fragment) using reverse complementary templates and a small 8-mer primer (27). The evaluation of various genomic DNA samples using these oligonucleotides was carried out as previously described (23), except that the washing step at 55 "C for 3 min was omitted with the exon I11 probes.

RESULTS AND DISCUSSION

Soon after the discovery of a lAT deficiency in 1963 (28), it was recognized that the Z protein had a different charge

than what was then called the M protein (29). In 1976, a tryptic peptide of the Z protein was found to differ from the corresponding peptide of the M protein by a loss of a glutamic acid residue and an addition of a lysine (30). Subsequent sequencing of an 8-amino acid segment of this region of the M and Z proteins confirmed the Glu to Lys substitution (31). Finally, when the entire M1 protein was sequenced by Carrel1 et al. (32), the partial human cDNA and the entire baboon cDNA by Kurachi et al. (33), and the entire M1 cDNA by Long et al. (3), it became apparent that the involved glutamic acid was situated at residue 342. The universality of the G ~ u ~ ~ ~ to Lys342 (GAG to M G in the genome) difference in all Z genes was confirmed at the genomic level by Kidd et al. (24) and by Nukiwa et al. (23) using 19-mer oligonucleotide probes centered about the G ~ u ~ ~ ~ to Lys342 substitution.

With this as a background, despite the fact that more than 60% of the Z protein or gene had not been sequenced (12,31, 34), it has been generally assumed that the G ~ u ~ ' ~ to Lys342 substitution was the only difference in the primary structure between the Z and M1 proteins (32). However, when we sequenced the entire protein-coding exon region of the Z a1AT gene, we found that in addition to the classic exon V G ~ u ~ ~ ~ to Lys342 substitution, the Z gene contained a second substitution (exon 111, VaP3 to Ala213; Fig. 1). In addition, the Z gene contains a silent base change in exon I1 (AAG to AAA) that codes for L Y S ' ~ ~ in both the M1 and Z proteins.

Inheritance of the Z a1AT gene has several consequences: 1) the Z protein aggregates in the rough endoplasmic reticu- lum of the alAT secreting cells (35, 36); 2) there is a reduced rate of secretion of the molecule by these cells (6, 37-39); 3) the plasma levels of alAT are markedly reduced (28); and 4) the Z protein does not function as well as an inhibitor of neutrophil elastase (40). All available evidence suggests that in the ZZ homozygous state that the Z gene is transcribed in a normal fashion, that a1AT synthesizing cells have normal levels of d A T mRNA, and that the Z type mRNA can be translated in a normal fashion (6, 37-39). However, studies with liver and mononuclear phagocytes of such individuals have shown that these cells secrete less alAT than those of normals (6, 39). Consistent with this fact, light microscopic evaluation of biopsies of liver of ZZ individuals demonstrates intracellular accumulation of alAT, and transmission elec- tron microscopic evaluation of these specimens has shown that the alAT accumulates in the rough endoplasmic reticu- lum (36). Furthermore, evaluation of the intracellular form of a1AT recovered from such livers demonstrated that it con- tains "high mannose" carbohydrate side chains (41). To- gether, this evidence has led to the concept that as the Z protein is produced, N-linked carbohydrates are normally added. However, liver accumulation and the plasma deficiency associated with the homozygous Z state result from a de- creased rate of folding of the high mannose form of alAT in the rough endoplasmic reticulum, allowing hydrophobic resi- dues in adjacent molecules to interact, leading to aggregation. Interestingly, those Z type alAT molecules that are translo- cated to the Golgi undergo normal trimming of the carbohy- drate side chains, and such molecules are normally secreted (41) and have a normal circulating half-life (9). However, a recent study by Ogushi et al. (40) has demonstrated that the Z type molecule has a significantly reduced association rate constant for neutrophil elastase. In this context, in addition to the fact that the ZZ homozygous state is associated with a marked reduction in a1AT levels, on the average, the Z type molecule takes longer than does the M type molecule to inhibit an equivalent amount of neutrophil elastase.

The relative importance of the newly identified VaP3 to

Page 3: Identification of a second mutation in the protein-coding sequence of ...

Alpha 1-Antitrypsin Z Gene 15991

Ala mutation compared to the classic G ~ u ~ ~ ' t o L Y S ~ ~ ' mutation to each of these abnormalities associated with the Z protein is not known. The normal alAT protein has been crystallized and its three-dimensional structure determined, but the three- dimensional structure of the Z protein has not been evaluated (42). In the M protein structure, the GW4' residue is located in sheet A strand 5 and ValzL3 residue at the turn of segment 202-223 which forms a strongly twisted, double-stranded anti- parallel ladder. It has been hypothesized that the G ~ u ~ ~ ~ to Lysq4' substitution results in a loss of a critical salt bridge (Glu"' to LysZgo) which has an effect on the rate folding of the inhibitor, perhaps explaining the reduction in the rate of three-dimensional folding of the Z protein in the rough en- doplasmic reticulum. The Va1213 does not appear to participate in any critical salt bridges, nor does it appear in the three- dimensional structure near the active site a t Met3". I t is, however, reasonably close (in the tertiary structure) to and hence to a carbohydrate attachment site (42). Whether this has any consequence to the intracellular handling of the molecule, or whether the Va1213 to Ala213 substitution (or the G ~ u " ~ ~ to Lys"' substitution) has any affect on the association rate constant of the interaction with neutrophil elastase, is unknown.

Despite the fact that the importance of the VaP3 to Ala2I3 substitution is not known, the knowledge of its presence has led us to the identification of a previously unrecognized, but common polymorphic form of the normal M1 gene. Evaluation of the normal M1 gene sequence in the V a P 3 region revealed that the endonuclease BstEII normally cuts in the sequences in exon I11 coding for the amino acids Gln212-Va1213-Thr214. Theoretically, however, with the substitution GTG to GCG ( V a P to Ala213) in the Z protein, this BstEII restriction site would be lost. Evaluation of genomic DNA from individuals homozygous for the M1 gene and those homozygous for the Z gene demonstrated this to be the case. In this context, if the Z gene is cut with PstI and BstEII, there is no BstEII site in exon 111, and thus a single 0.95-kb fragment is generated that can be detected with an exon I11 probe (Fig. 2, lane I). In contrast, in the M1 gene, the presence of the exon I11 BstEII site leads to the generation of the 0.72-kb fragment (Fig. 2, lane 2; a 0.23-kb fragment is also generated, but it does not appear on the autoradiogram because it does not bind effi- ciently to the filter). We initially thought this loss of a restriction site associated with the Z gene would be useful as a method to uniquely identify the Z gene from the common "family haplotypes. However, in evaluating this hypothesis, we soon realized that a significant proportion of genomic DNA samples that had been identified as being M l M l homo- zygotes by conventinal criteria (13) could be further subgrouped depending on whether they contained or did not contain the BstEII restriction site. In this context, some M l M l samples contained the BstEII site (Fig. 2, lane 2), while others were homozygous for the absence of this site (Fig. 2, lane 3), and still others were heterozygous for this site (Fig. 2, lane 4) . However, when M2, M3, and S haplotypes were evaluated, all were BstEII positive (Le. all have the same sequence in the 213 region as the classic M1 gene; data not shown). Thus, it became apparent that d A T haplotypes thought to be M1, can actually be M l ( V a P ) or M1(Ala'l3).'

Comparison of the IEF patterns of serum of individuals homozygous for M1(ValZL3) and M1(Ala2I3) demonstrated they were identical (data not shown), as might be expected by a

An unpublished sequence of an alAT cDNA (S. L, C. Woo, and E. W. Davie) referred to by Carrel1 et al. (32) and also by Rosenberg et al. (43) showed an Ala at amino acid 213; presumably this cDNA represents M1(Ala2I3).

I II Ill IV v

1 -

072'0.23

z 0.95

ZZ MIMI M IMI M lM l

Amino acid 212 213 214 Gln Val Thr Kb

Est E 11

Gln Ala Thr 0.95* C A G GiCjG A C C 0.72* G T C C%;C T G G

m .

Q

1 2 3 4

FIG. 2. Identification of a restriction fragment length pol- ymorphism in the Z a l A T gene resulting from the exon I11 VaP3 mutat ion and the ident i f icat ion of a form of the M1 gene [M1(A1a2l3)] with this same polymorphism. Shown at the top is a schematic of the alAT gene with its five exons (solid boxes, I -V) and four introns. The exon 111 VaP3 mutation is indicated by V. Based on the sequence of the M1 and Z genes (Fig. l), indicated are the expected restriction sites for the enzymes PstI (P) and BstEII ( B ) and the predicted fragments that should be generated from M1 (0.72,0.23 kb) and Z (0.95 kb) genomic DNA, respectively. Shown at the left is the sequence of the M1 and Z genes around the 213 region, indicating the expected cutting site of BstEII in the M1 gene and the expected loss of this site in the Z gene. After the DNA was cut with the two enzymes, the resulting fragments were analyzed by Southern transfer (44) and hybridized with a 32P-labeled exon I11 probe (see "Experimental Procedures"). Lane 1, genomic DNA from an individ- ual with the phenotype ZZ showing only the expected 0.95-kb frag- ment. Lane 2, genomic DNA from an individual with the phenotype MlMl showing only the expected 0.72-kb fragment. The expected 0.23-kb band (a) does not appear on the autoradiogram because this small fragment does not bind to a nitrocellulose filter efficiently. Lane 3, genomic DNA from an individual with the alAT phenotype M1(Ala2'3)M1(Ala213). This individual has the protein phenotype MlM1, but with the BstEII 0.95-kb fragment instead of the expected 0.72-kb fragment; this identifies a newly recognized polymorphic form of M1 that does not have the BstEII restriction site in exon 111 (referred to as M1(Ala2I3)). Lane 4, genomic DNA from an individual heterozygous for M1(VaI2l3) and M1(AlaZ1')).

substitution of the neutral amino acid Ala213 for the neutral amino acid Va1213. However, both M1(Val2l3) and M1(Ala'l3) could be easily distinguished from the other common M- family haplotypes M2 and M3. Furthermore, the M1(Ala213) haplotypes was found to be transmitted in a codominant autosomal fashion and was associated with normal serum levels of d A T (not shown).

Construction of 32P-labeled oligonucleotide probes comple- mentary to the sequence differences among the M1(Va1'13), Ml(Ala'''), and Z genes centered about residues 213 and 342 verified that those genes corresponding to IEF patterns of serum identified as ZZ together with genomic DNA BstEII patterns that were -/- (i.e. only 0.95-kb fragment generated) are homozygous for the Ala213 a n d L Y S ~ ~ ' sequences (Fig. 3A). Those genes corresponding to IEF patterns of serum identi- fied as MlMl together with genomic DNA BstEII patterns that were +/+ (i.e. only 0.72-kb fragment seen) are homozy- gous for the VaP3 and sequences (Fig. 3B). However, the M1(Ala213)M1(Ala213) genes ( i e . those corresponding to the IEF patterns of serum identified as M l M l but with the

Page 4: Identification of a second mutation in the protein-coding sequence of ...

15992 Alpha 1 -Antitrypsin Z Gene Kb

I I I I I I I I 1 I I 1 I I I 1 1 2 3 4 5 6 7 8 9 1 0 1 1 1 2 1 3 1 4 1 5

L 4 . 8 0.05

I1 10.0 I E I I EE I 0.9 1 I I E I F 2 . 0 3 11 - 4.3 I

I I t !I - L 1 . 5 - f - 1 . 3 - 1 I !

I B B II B B B B I

5'

. . . . . . . . . . . . . . . .

ExonI I l~a1~ '~probe: G G A C C A G G@G A C C A C C G T G E x o n V g l ~ ~ ~ ~ p r o b e : A C C A T C G A C H A G A A A G G G A * * * * * * * * * * * * * * * * * * * * *

Exon Ill ala213 probe: G G A C C A G G I G A C C A C C G T G Exon V 1 ~ s ~ ' probe: A C C A T C G A C I A G A A A G G G A * * * * * * * * * * * * * * * * * * * * *

A. Evaluation of IEF ZZ, B. Evaluation of IEF MlMl , C: Evaluation of IEF MlM1, Bst E II -1- genomic DNA Bst E II + I + genomic DNA Bst E II - / - genomic DNA

Probe Probes Probes Probes

Exon kb cDNA 111 vaIzr3 Ill V gluM2 V lysM2 111 vaI2l3 Ill alazr3 V gluM2 V lysM2 111 vaI2l3 111 alazr3 V gluM2 V 1 ~ s ~ '

M m m A n F m F I 1 4.3 v

I

I 2.0 - IV 1.5 r. V 1.3 0 --- --- "- "

"""- -"

FIG. 3. Detection of the exon I11 and exon V sequence differences of the Z, M1(Valzls). and M1(AlaZ1') alAT genes using oligonucleotide probes. At the top of the figure is a schematic of the alAT gene showing the five exons (solid boxes labeled Z-V), the putative promoter site 5' to exon I, the start codon in exon 11, and the stop codon in exon V. With EcoRI ( E ) and BglI ( B ) , the five exons are conveniently cut into five different size DNA fragments, each containing a single exon. Thus, after Southern transfer and hybridization with a "P-labeled full length alAT cDNA (45), an autoradiogram reveals fragments of 4.3, 2.0, 1.5, 1.3, and 0.9 kb correspond to exons 11, I, IV, V, and 111, respectively (23) (far left lane, labeled Probe, cDNA). The exon 111 Va1213-Ala213 substitution ( M l ( V a P ) to M1(Ala2l3) or Z) is caused by a base change of GTG to GCG. The 19-mer oligonucleotide gene probes used to detect this change are indicated as the "exon 111 VaTL3 probe" and "exon 111 A1a213 probe." From the BstEII data (Fig. 2), it would be expected that the exon 111 V a P probe would hybridize to the M1(VaI2l3) haplotype but not to the Z or M1(Ala2I3) haplotype while the exon 111 Ala2I3 probe would hybridize in the reverse order. The exon V Glu"" to Lys mutation (M1(VaI2l3) or M1(Ala2I3) to Z) is caused by a base change of GAG to

and the "exon V LYS"'~ probe." From prior studies (23, 24), it is known that the exon V GIu3'* probe hybridizes to all M1 genes tested (presumably including M1(Ala'I3) as well) but not to the Z gene, while the exon V L ~ S ~ ' ~ probe hybridizes to all Z genes but not M1 (and presumably M1(Ala213)). All oligonucleotide probes were labeled with incorporation of [32P]dNTPs by E. coli DNA polymerase (Klenow fragment; Pharmacia) using reverse complemen- tary templates and a small 8-mer primer (27). Bases in the probes that are labeled are indicated by *. Genomic DNA from various sources were cut with the endonucleases BglI and EcoRI, 5 pg were electrophoresed, transferred to nitrocellulose, hybridized with a labeled probe, washed and autoradiographed. Shown are the evaluations of genomic DNA from individuals with various alAT phenotypes. (A) ZZ (isoelectric focusing of serum pattern ZZ, genomic DNA BstEII pattern -/-); note hybridization with the exon 111 AlaZ1' probe and exon V Lysw' probe, but not with the exon 111 VaP3 probe or exon V GIU"" probe. ( B ) M1(Va1213)M1(Va121R) (isoelectric focusing pattern MlM1, genomic DNA BstEII pattern +/+); note hybridization with the exon 111 probe and exon V G1u3'* probe, but not with the exon 111 Ala213 probe or exon V LYS"~ probe. ( C ) M1(Ala21')M1(Ala21'') (isoelectric focusing pattern MlM1, genomic DNA BstEII pattern -/-); note hybridization with the exon 111 Alazt3 probe and exon V Glu"' probe, but not with the exon 111 probe or the exon V Lys"'' probe.

- AAG. The 19-mer oligonucleotide gene probes used to detect this change are indicated as the "exon V GluaTprobe"

Page 5: Identification of a second mutation in the protein-coding sequence of ...

Alpha 1 -Antitrypsin Z Gene 15993

TABLE I Comparison of the alpha 1-antitrypsin BstEZIpatterns and oligonucleotide evaluation of alpha I-antitrypsin

sequences centered at amino acid residues 213 and 342 Number positive with"

Serum isoelectric BstEIIb focusing pattern pattern

Phenotype' n Exon I11 Exon I11 Exon V Glu"'

Exon V

Drobe Drobe Drobe Drobe va1213 ~ 1 ~ 2 1 3 Lys3'2

MlMl +I+ M1(Va1213)M1(Va1213) 16 16 0 16 0 M1(Va1213)M1(Ala213) MlMl +/- 8 8 8 8

MlMl -1- M1(Ala213)M1(Ala213) 2 0 2 0

zz zz 2 0

-1- 20 0 20 0 20 Oligonucleotide evaluation was carried as described in the legend to Fig. 3. Note the exact correlation among

the BstEII patterns and the oligonucleotide patterns. *The BstEII pattern of genomic DNA was determined as described in the legend to Fig. 2. +/+ = 0.72-kb

fragment present, 0.95-kb fragment absent; +/- = both 0.72- and 0.95-kb fragments present; -1- = 0.95-kb fragment present, 0.72-kb fragment absent.

e The alpha 1-antitrypsin phenotype was determined by a combination of IEF, serum levels, family studies, and genomic DNA BstEII patterns. M1(Va1213)M1(Va1213) = IEF pattern MlM1, normal serum levels, BstEII +/+; M1(Va1213)M1(Ala213) = IEF pattern MlM1, normal serum levels, BstEII +/-; M1(Ala213) M1(Ala213) = IEF pattern MlM1, normal serum levels, BstEII -1-; ZZ = IEF pattern ZZ, serum levels <50 mg/dl, BstEII -1-.

/ ' '\ / / '\

Baboon a1 AT Human alAT M1 ~ys'", ala"', g W 2 ~ys'", alaz1', glu"'

AAG GCG GAG

Human alAT M1 ( V ~ I ~ ' ~ ) Human alAT 2

AAG GTG GAG ~ys'", vaP3, g W Z 1 ~ys'", a~a"', lyssu

A M GCG AAG 1 FIG. 4. Possible evolutionary relationships among the var-

ious a lAT genes. The partial sequence data for the M1(Ala213) gene3 suggests that the M1(Ala213) gene is more closely linked to the baboon than M1(VaP3) or the Z genes.

genomic DNA BstEII patterns that were -/-) are homozy- gous for the Ala213 and G ~ u ~ ~ ~ sequences (Fig. 3C). Further- more, when 46 genomic samples identified by the combination of isoelectric focusing of serum, serum a lAT levels, family studies, and BstEII restriction patterns of genomic DNA as being M1(Va1213)M1(Va1213), M1(Va1213)M1(Ala213), M1(Ala213)M1(Ala213), or ZZ, there was a 100% correlation with the corresponding Va1213-G1~342 (Ml(Va1213)), Ala213- G ~ u ~ ~ ~ (M1(Ala213)), or Ala213-Lys342 (Z) sequences as identi- fied with the respective oligonucleotide probes (Table I).

Interestingly, when we used the exon I11 VaP3 and exon I11 Ala213 oligonucleotide probes to evaluate genomic DNA from individuals previously identified as being M l M l homo- zygotes, it became apparent that the M1(Ala213) haplotype was relatively common. In this regard, oligonucleotide evalu- ation of 26 genomic samples of Caucasians thought to be M l M l homozygotes revealed that 16 were homozygous with the exon I11 VaP3probe (i.e. true M1(Va1213)M1(Va1213) homo- zygotes), eight were heterozygous for the exon I11 VaP3 and exon I11 Ala213 probes ( i e . , M1(Va1213)M1(Ala213) heterozy- gotes), and two were homozygous for the exon I11 Ala213 probe (i.e. M1(Ala213)M1(Ala213) homozygotes). Assuming these fre- quencies hold for the Caucasian population as a whole, these data suggest haplotype frequencies (among haplotypes previ- ously identified as MI) for M1(Va1213) of 77% and for M1(Ala213) of 23%. In this context, the M1(Ala213) gene is likely as frequent as the previously identified "family hap- lotype M2 and more frequent than M3 (14, 15).

Like its unknown importance to the Z gene or protein, the functional importance of the Va1213 to Ala213 mutation to the

M1 gene or protein is unknown. However, our preliminary studies comparing the individuals homozygous for M1- (Va1213)M1(Va1213) to those homozygous for M1(Ala213)M1- (Ala213) have failed to reveal any marked differences in a lAT levels or function.

The fact that the Z gene differs from the M1(Va1213) gene by more than one mutation, and that some individuals thought to have the M1(Va1213) a lAT haplotype actually have the M1(Ala213) haplotype, leads to two interesting conclusions. First, since the Z sequence differs from the M1(VaP3) se- quence at two sites (amino acids 213 and 342), the Z gene could not have evolved from the M1(VaP3) gene (or vice versa) directly by a single mutational event (Fig. 4). Second, the available evidence suggests that the M1(Ala213) gene was an evolutionary intermediate between the M1(Va1213) and Z genes. In this regard, the M1(Ala213) gene sequence3 is iden- tical to the baboon d A T sequence (33) at the codons LyslZ9 (AAG), Ala213 (GCG), and G ~ u ~ ~ ~ (GAG). In contrast, the M1(VaP3) sequence differs from the baboon at one codon (M1(Va1213) (GTG), baboon Ala213 (GCG)) and the Z gene differs from the baboon at two codons (Z Lys'*' (AAA), Z L ~ s ~ ~ ~ (AAG); baboon LyslZ9 (AAG), baboon G ~ u ~ ~ ~ (GAG)) (Fig. 4).

Acknowledgment-We would like to thank Dr. R. Huber, Max Planck Institute, for his helpful discussions relating to the three- dimensional structure of the alAT molecule.

REFERENCES

2. Schroeder, W. T., Miller, M. F., Woo, S. L. C., and Saunders, G. F. (1985) 1. Travis, J., and Salvesen, G. S. (1983) Annu. Rev. Eiochem. 52,655-709

3. Long, G. L., Chandra, T., Woo, S. L. C., Davie, E. W., and Kurachi, K.

4. Gelger, T., Northemann, W., Schmelzer, E., Gross, V., Gauthier, F., and

5. Perlmutter, D. H., Cole, F. S., Kilbrid e, P , Rowing, T. H., and Colten, H.

6. Mornex, J. F., Chytil-Weir, A., Courtney, M., LeCocq, J.-P., and Crystal,

7. Mega, T., Lu'an, E., and Yoshida, A. (1980) J. Biol. Chem. 255,4053-4056 8. Carrell, R. Id., Jeppsson, J.-O., Vaugban, L., Brennan, S. O., Owen, M. C.,

9. Laurell, C. B., Nosslin, B., and Jeppsson, J. 0. (1977) Clin. Sci. Molec. Med. and Boswell, D. R. (1981) FEBS Lett. 135,301-303

10. Jones, E. A., Vergalla, J., Steer, C. J., Bradley-Moore, P. R., and Vierling, 52, 457-461

11. Gadek, J. E., and Crystal, R. G. (1982) in The Metabolic Basis ojInherited J. M. (1978) Clin. Sci. Mol. Med. 55, 139-148

Disease (Stanbury, J. B., Wyngaarden, J. B., Frederickson, D. S., Gold- stein, J. L., and Brown, M. s., eds) 5th Ed., pp. 1450-1467, McGraw-

12. Carrell, R. W., and Owen, M. C . (1979) Essays Med. Biochem. 4, 83-119 Hill, New York

13. Cox, D. W., Johnson, A. M., and Fagerhol, M. K. (1980) Hum. Genet. 53,

Am. J. Hum. Genet. 37,868-872

(1984) Biochemistry 23,4828-4837

Heinrich, P. C . (1982) Eur. J . Biochem. 126, 189-195

R. (1985) Proc. Nutl. Acud. Sci. 17. 8. A. 82, 795-799

R. G. (1986) J. Clin. Invest. 77, 1952-1961

429-433

T. Nukiwa and R. Crystal, unpublished observation.

Page 6: Identification of a second mutation in the protein-coding sequence of ...

15994 Alpha 1 -Antitrypsin Z Gene 14. Dykes, D. D., Miller, S. A., and Polesky, H. F. (1984) Hum. Hered. 34 , 31. Owen, M. C., and Carrell, R. W. (1977) FEBS Lett. 79 , 245-247

15. Kueppers, F., and Christopherson, M. J. (1978) Am. J. Hum. Genet. 30 , 32. Carrell, R. W., Jeppsson, J.-0.. Laurell, C.-B., Brennan, S. O., Owen, M.

C., Vaughan, L., and Boswell, D. R. (1982) Nature 298,329-334

16. Oakeshott, J. G., Muir, A,, Clark, P., Martin, N. G., Wilson, S. R., and 33. Kurachi, K., Chandra, T., Friezner Degen, S. J., White, T. T., Marchioro,

T. L., Woo, S. L. C., and Davie, E. W. (1981) Proc. Natl. Acad. Sci. U. S.

17. Beatty, K., Bieth, J., and Travis, J. (1980) J. Biol. Chem. 255,3931-3934 34. Jeppsson, J. O., and Ericksson, S. (1985) Biochim. Biophys. Acta 831.30- A. 78,6826-6830

18. Sveger, T. (1976) N. Engl. J. Med. 294,1316-1321 19. Gadek, J. E., Fells, G. A., Zimmerman, R. L., Rennard, S. I., and Crystal, 35. Eriksson, S., and Larsson, C. (1975) N . Engl. J. Med. 2 9 2 , 176-180

33

20. Higgins, M. W., and DeMets, D. (1981) in Epidemiology of Respiratory J. (1984) Liver 4,325-337

308-310

359-365

Whitfield, J. B. (1985) Ann. Hum. Biol. 12 , 149-160

R. G. (1981) J. Clin. Inuest. 68, 889-898 36. Callea, F., Fevery, J., Massi, G., Lievens, C., de Groote, J., and Desmet, V.

Diseases, Task Force Report, July 1979 (Lenfant, C., chairman), pp. 11- 37. Errington, D. M., Batburst, I. C., Janus, E. D., and Carrell, R. W. (1982)

of Health, publ. No. 82-2019 31, U. S. Department of Health and Human Services, National Institutes FEBS Lett. 148,83-86

38. Foreman, R. C., Judah, J. D., and Colman, A. (1984) FEBS Lett. 168.84- 21. Jeffreys, A. J., and Flavell, R. A. (1977) Cell 12 , 429-439 22. Constans, J., Viau, M., and Gouaillard, C. (1980) Hum. Genet. 5 5 , 119-121 39. Perlmutter, D. H., Kay, R. M., Cole, F. S., Rossing, T. H., Van Thiel, D., 23. Nukiwa, T., Brantly, M., Garver, R., Paul, L., Courtney, M., LeCocq, J.-P., and Colten, H. R. (1985) Proc. Natl. Acad. Sci. U. S . A. 82,6918-6921

24. Kidd, V. J., Wallace, R. B., Itakura, K., and Woo, S. L. C. (1983) Nature and Crystal, R. G. (1986) J. Clin. Inuest. 77, 528-537 40. Ogushi, F., Fells, G., Hubbard, R., Straus, S., and Crystal, R. G. (1986) Am.

Reu. Respir. Dis. 133, (suppl.) A218 304,230-234 41. Bathurst, I. C., Travis, J., George, P. M., and Carrell, R. W. (1984) FEBS

25. Vieira, J., and Messing, J. (1982) Gene 19 , 259-268 26. Rigby, P. W. J., Dieckmann, M., Rhodes, C., and Berg, P. (1977) J . Mol. 42. Loebermann, H., Tokuoka, R., Deisenhofer, J., and Huber, R. (1984) J.

Lett. 177,179-183

27. Studenckl, A. B., and Wallace, R. B. (1984) DNA (N . Y.) 3,7-15 43. Rosenberg, S., Barr, P. J., Najarian, R. C., and Hallewell, R. A. (1984) Mol. Biol. 1 7 7 , 531-557

28. Laurell, C. B., and Eriksson, S. (1963) Scand. J. Clin. Lab. Inuest. 15 , 132- Nature 312 , 77-80

29. Laurell, C. B. (1965) Scand. J. Clin. Lab. Inuest. 17 , 271-274 44. Southern, E. M. (1975) J. Mol. Biol. 98,503-517

30. Yoshida, A,, Lieberman, J., Gaidulis, L., and Ewing, C. (1976) Proc. Natl. 45. Courtney, M., Buchwalder, A,, Tessier, L.-H., Jaye, M., Benavente, A,,

Balland, A., Kohli, V., Lathe, R., Tolstoshev, P., and LeCocq, J.-P. (1984)

88

Biol. 113,237-251

140

Acad. Sci. U. S. A. 73,1324-1328 Proc. Natl. Acad. Sci. U. S. A. 81,669-673

APPENDIX

FIG. 1. Sequence of the protein coding regions of the d A T Z gene. Shown is the sequence of the protein coding exons (11-V) of the Z a1AT gene; for clarity it is shown in a continuous form from one exon to the next, with dotted vertical lines separating the exons. The translated amino acid sequence for the Z gene is immediately below the nucleic acid sequence. The numbering starts from N-terminal glutamic acid of the mature alAT protein; amino acids within the signal peptide are identified with minus numbers. Point mutations and amino acid changes are indicated in dotted boxes. Also identified are the carbohydrate attachment sites (CHO-) and the active inhibitory site at Met358.

20 CAT GAT CAG GAT CAC CCA ACC TTC M C M G ATC ACC CCC AAC CTG GCT GAG TTC GCC TTC AGC CTA T I C CGC CAG CTG GCA CAC CAG TCC M C AGC ACC M T ATC TTC TTC TCC CCA GTG Hm Asp Gln Asp Hnr Pro Thr Phe As" Lys 11. Thr Pro Asn Leu Ala Glu Phe Ala PhS S e r Leu Tyr Arg GI" Leu Ala HIS G k Ssr *an Sa Thr As" IIe Phe Phe S s r PI0 Val

30 40 M

60 70 80 Cm,

AGC ATC GCT ACA GCC TTT GCA ATG CTC TCC CTG GGG ACC M G GCT GAC ACT CAC GAT G M ATC CTG GAG GGC CTG M T TTC M C CTC ACG GAG ATT CCG GAG GCT CAG ATC CAT G M GGC S e r 110 Ala Thr Ala Phs Ala Me1 Ley S s r Leu Gly Thl Lys Ala Asp Thr HS Asp Glu IIe Leu Glu Gly Leu As" Phe As" Leu Thr Glu I h PI0 Glu A h Gln 110 H48 Glu GlY

90

I CK)

1 w 110 120 TTC CAG G M CTC CTC COT ACC CTC AAC CAG CCA GAC AGC CAG CTC CAG CTG ACC ACC GGC M T GGC CTG TTC CTC AGC GAG GGC CTG M G CTA GTG Phe Gln Glu Leu Leu Alp Thr Leu Asn Gln Pro Asp Ser Gln Leu Gln Leu Thr Thr Gly Asn Gly Leu Phe Leu S e r Glu Gly Leu Lye Leu Val

MI

1 4 0 1 50 M G TTG TAC CAC TCA G M GCC l T C ACT GTC M C TTC GGG GAC ACC G M GAG GCC AAG M A CAG ATC M C GAT TAC GTG GAG M G GGT ACT C M GGG AM ATT GTG GAT TTG GTC M G GAG Lys Leu Tyr HIS Ser Glu Ala Phe Thr Val As" Phe Gly Asp Thr GI" GI" Ala Lya Lys Gln Ile As" Asp Tyr Val GI" Lys Gly Thr Gln Gly Lys Ih Val Asp Leu Val Lys Glu

1 6 0

Exon I1 1 Exon 111 213

c n GAc AGA GAc AcA GTT m GCT CTG GTG U T TAC A r c r r c m &c MA TGG GAG AGA ccc m G*A GTc M G GAC ACC GAG G M GAG GAc n c cAC GTG GAC CAG~XGIACC ACC 1 8 0 190 XI0 210 1-l

Leu Asp Arg Asp Thr Val Phe Ala Leu Val Asn Tyr 11. Pha Phe Lys Gly Lys Trp GI" Arg Pro Phe Glu Val Ly6 Asp Thr Glu Glu Glu Asp Phe H18 Val Asp Gln I Ala I Thr Thl

I M1 IGTG: L 3

220 GTG M G GTG CCT ATG AT0 M G COT TTA GGC ATG TTT M C ATC CAG CAC TGT M G M G CTG TCC AGC TGG GTG CTG CTG ATG MA TAC CTG GGC M T GCC ACC GCC ATC TTC TTC CTG CCT

230 240 250

Val Lys Val Pro Me1 Me1 Lyr Arg Leu Gly Mal Phe A m Ile Gln Hm Cys Lys Lyr Leu Set Ser Try Val Leu Leu MU Lys Tyr Leu Gly Asn Ala Thr Ala IIe Phe PhS Leu P m

E X O ~ 111 I Exon IV CK)

260 GAT GAG GGG AM CTA CAG CAC CTG GAA M T G M CTC ACC CAC GAT ATC ATC ACC AAG TTC CTG G M M T G M GAC AGA AGO TCT GCC AGC TTA CAT TTA CCC AM CTG TCC ATT ACT GOA As+ Glu Gly Lya Leu Gln Has Leu Glu A m Glu Leu Thr HIS Asp 11e 110 Thr Lya Phe Leu Glu Asn GI" Asp Arg Arg Ser Ala S e r Leu HIS Lou Pro Lys Leu S e r 111) Thr GlY

270 280 290

I I Exon IV I Exon V

300 ACC TAT GAT CTG M G AGC GTC CTG GOT C M CTG GGC ATC ACT M G GTC TTC AGC M T GGG GCT GAC CTC TCC GGG GTC ACA GAG GAG GCA CCC CTG M G CTC TCC M G GCC GTG CAT M G Thr Tyr Asp Leu Lys S e r Val Leu Gly Gln Ley Gly Ile Thr Lys Val Phe Ser Asn Gly Ala Asp Leu S e r Gly Val Thr Glu Glu A h Pro Leu LyS Leu Ser Lysl Ala Val HlS L p

310 320

A a k site I

350

. ""

GGG ACT GM GCT GCT GGG GCC ATG m TTA GAG GCC ATA CCCIATG~TCT ATC ccc ccc GAG GTC MG TTC MC AM ccc m GTC TTC TTA ATG ATT

r-1 380 370

Ala Val Leu Thr 11. Asp1 LysI Lya Gly Thr Glu Ala Ala Gly Ala Me1 Phe Leu Glu Ala Ile PmlMel I S e r 11e P m P m Glu Val Lys Phe As" Lya Pro Phe Val Phe Leu Me1 I*

b o l Lo?:

L A

380 G M C M U T ACC M G TCT CCC CTC TTC AT0 GOA AM GTG GTG M T CCC ACC C M AM T M CTG CCT CTC GCT CCT C M CCC CTC CCC TCC ATC CCT GGC CCC CTC CCT GOA TGA CAT T M Glu Gln As" Thr Lya Ser Pro Leu Phe MU Gly Lya Val Val Asn Pro Thr GI" Lys

390 394

AGA AGO GTT GAG CTG G