Top Banner
Protein Structures DOI: 10.1002/anie.200603246 A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays** Michael Portwich, Sandro Keller, HolgerM. Strauss, Carsten C. Mahrenholz, Ines Kretzschmar, Achim Kramer, and Rudolf Volkmer* The a-helical coiled coil was one of the first protein motifs whose structure was elucidated in detail. [1] First described by Crick in 1953, [2] the coiled coil is composed of at least two right-handed amphipathic a helices that wrap around each other into a left-handed supercoil such that their hydrophobic surfaces are in continuous contact (Figure 1 a). Coiled-coil structures can associate up to pentamers, form homomeric and heteromeric complexes at different stoichiometries, and be aligned in parallel and antiparallel topologies. [1] A characteristic of all coiled coils is the presence of heptad repeat sequences [abcdefg] i , where i denotes the heptad number (Figure 1 a). Hydrophobic amino acids are usually required at core positions a and d and are crucial for folding. [3] Positions b, c, e, f , and g face the outside of the assembly and are generally occupied by hydrophilic residues. Furthermore, inter- and intrahelical salt bridges formed by charged residues frequently found at positions b, c, e, and g may stabilize the structure. [4] The “peptide velcro” hypothesis [5] summarizes these findings and outlines that 1) a and d are hydrophobic, 2) e and g are charged, and 3) b, c, and f are hydrophilic. Although coiled-coil motifs can be predicted with a high degree of confidence, [6] predicting their association states and topologies is still a great challenge. Different selection systems have been established for screening recombinantly expressed coiled-coil complexes. [7] An impressive library-versus-library approach for identifying pairs of interacting coiled-coil-forming polypeptides was developed by Plɒckthun, Michnick, and co-workers. [8] Recently, a glass-chip experiment has been used to monitor a network of bZIP transcription factors at the protein level. [9] Herein we report on the development of optimized synthetic peptide arrays [10] useful for analyzing and characterizing coiled-coil associations at the amino acid level. In contrast to rational design, which mostly depends on short model peptides, our approach relies on the full-length homodimeric GCN4 leucine-zipper coiled-coil domain (Fig- ure 1 a). [11] The influence of amino acid substitutions on the association is tested and the stoichiometry is examined by biophysical methods. We expect that the results will be helpful for a better predictability of coiled-coil interactions. First, we synthesized a peptide array comprising 589 single-substitution variants of the GCN4 leucine-zipper sequence on cellulose membranes [12] and probed it for binding to the native GCN4 sequence (wild type (wt); Table 1, Figure 1 b). Individual spot signal intensities of the pattern were measured and evaluated as described. [13, 14] The replace- ment variability [14] of each sequence position was calculated (Figure 1 c) and classified as low (V 20 %), medium (20 % V 50 %), or high (V 50 %). As expected, leucine at the core positions d I d IV is classified as having low variability, with the exception of Leu12 (d II ), which is replaceable by Ala. In contrast, core positions a I a V are of intermediate variability; synonymous residues within this class are characterized by an expanded physicochemical similarity [14] to the exchanged amino acids. The solvent-exposed positions f , b, and c are all classified as highly variable, whereas the pattern emerging for the core-flanking positions e and g is more complex. As the e positions are highly variable, we anticipated a similar behavior for the g positions. However, the variability of the g sites depends on the sequence position, ranging from low (g III = Glu 22) to medium (g I = Lys 8, g IV = Leu 29) to high (g II = Lys 15). This discrepancy between the two core-flanking Table 1: Sequences of selected GCN4 variants. g abcdefg abcdefg abcdefg abcdefg ab I II III IV wt [a] R MKQLEDK VEELLSK NYHLENE VARLKKL VG 1 R MKQLEDK VEELLSK NYHLENE KARL EKL VG 2 R MKQLEDK VEELLSK YYH TENE VARLKKL VG 3 R MKQLEDK VEELLSK IYH NENE VARLKKL VG [a] The sequence corresponds to residues 249–279 of the GCN4 protein of Saccharomyces cerevisiae. The underlined residues highlight the substitutions. For the synthesis see S1 and S2 in the Supporting Information . [*] Dr. M. Portwich, [$] C. C. Mahrenholz, I. Kretzschmar, Prof.Dr. A. Kramer, Dr. R. Volkmer Institut fɒr Medizinische Immunologie CharitȖ-UniversitȨtsmedizin Berlin Campus CharitȖ Mitte Schumannstrasse 20–21, 10117 Berlin (Germany) Fax: (+ 49) 30-450-524942 E-mail: [email protected] Homepage: http://www.charite.de/immunologie/research/agsm/ index.html Dr. S. Keller, [$] Dr. H. M. Strauss [+] [$] Leibniz Institute of Molecular Pharmacology FMP Robert-RɆssle-Strasse 10, 13125 Berlin (Germany) [ + ] Present address: Max Planck Institute of Colloids and Interfaces Am Mɒhlenberg 1, 14424 Golm (Germany) [ $ ] These authors have contributed equally. [**] This work was supported by the Deutsche Forschungsgemeinschaft (SFB 449 to R.V. and SFB 498 to H.S.), the Ernst Schering Research Foundation, the Jɒrgen Manchot Foundation, and the Universi- tȨtsklinikum CharitȖ Berlin. Supporting information for this article is available on the WWW under http://www.angewandte.org or from the author. Communications 1654 # 2007 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim Angew. Chem. Int. Ed. 2007, 46, 1654 –1657
16

A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

Mar 06, 2023

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

Protein StructuresDOI: 10.1002/anie.200603246

A Network of Coiled-Coil Associations Derived from Synthetic GCN4Leucine-Zipper Arrays**Michael Portwich, Sandro Keller, Holger M. Strauss, Carsten C. Mahrenholz, Ines Kretzschmar,Achim Kramer, and Rudolf Volkmer*

The a-helical coiled coil was one of the first protein motifswhose structure was elucidated in detail.[1] First described byCrick in 1953,[2] the coiled coil is composed of at least tworight-handed amphipathic a helices that wrap around eachother into a left-handed supercoil such that their hydrophobicsurfaces are in continuous contact (Figure 1a). Coiled-coilstructures can associate up to pentamers, form homomericand heteromeric complexes at different stoichiometries, andbe aligned in parallel and antiparallel topologies.[1] Acharacteristic of all coiled coils is the presence of heptadrepeat sequences [abcdefg]i, where i denotes the heptadnumber (Figure 1a). Hydrophobic amino acids are usuallyrequired at core positions a and d and are crucial for folding.[3]

Positions b, c, e, f, and g face the outside of the assembly andare generally occupied by hydrophilic residues. Furthermore,inter- and intrahelical salt bridges formed by charged residuesfrequently found at positions b, c, e, and g may stabilize thestructure.[4] The “peptide velcro” hypothesis[5] summarizesthese findings and outlines that 1) a and d are hydrophobic,2) e and g are charged, and 3) b, c, and f are hydrophilic.Although coiled-coil motifs can be predicted with a highdegree of confidence,[6] predicting their association states andtopologies is still a great challenge.

Different selection systems have been established forscreening recombinantly expressed coiled-coil complexes.[7]

An impressive library-versus-library approach for identifying

pairs of interacting coiled-coil-forming polypeptides wasdeveloped by Pl2ckthun, Michnick, and co-workers.[8]

Recently, a glass-chip experiment has been used tomonitor a network of bZIP transcription factors at theprotein level.[9] Herein we report on the development ofoptimized synthetic peptide arrays[10] useful for analyzing andcharacterizing coiled-coil associations at the amino acid level.In contrast to rational design, which mostly depends on shortmodel peptides, our approach relies on the full-lengthhomodimeric GCN4 leucine-zipper coiled-coil domain (Fig-ure 1a).[11] The influence of amino acid substitutions on theassociation is tested and the stoichiometry is examined bybiophysical methods. We expect that the results will be helpfulfor a better predictability of coiled-coil interactions.

First, we synthesized a peptide array comprising 589single-substitution variants of the GCN4 leucine-zippersequence on cellulose membranes[12] and probed it for bindingto the native GCN4 sequence (wild type (wt); Table 1,Figure 1b). Individual spot signal intensities of the patternwere measured and evaluated as described.[13, 14] The replace-ment variability[14] of each sequence position was calculated(Figure 1c) and classified as low (V� 20%), medium (20 %�V� 50 %), or high (V� 50 %). As expected, leucine at thecore positions dI–dIV is classified as having low variability, withthe exception of Leu12 (dII), which is replaceable by Ala. Incontrast, core positions aI–aV are of intermediate variability;synonymous residues within this class are characterized by anexpanded physicochemical similarity[14] to the exchangedamino acids. The solvent-exposed positions f, b, and c are allclassified as highly variable, whereas the pattern emerging forthe core-flanking positions e and g is more complex. As the epositions are highly variable, we anticipated a similarbehavior for the g positions. However, the variability of theg sites depends on the sequence position, ranging from low(gIII =Glu22) to medium (gI =Lys8, gIV=Leu29) to high(gII =Lys15). This discrepancy between the two core-flanking

Table 1: Sequences of selected GCN4 variants.

gabcdefgabcdefgabcdefgabcdefgab

I II III IV

wt[a] RMKQLEDKVEELLSKNYHLENEVARLKKLVG

1 RMKQLEDKVEELLSKNYHLENEKARLEKLVG

2 RMKQLEDKVEELLSKYYHTENEVARLKKLVG

3 RMKQLEDKVEELLSKIYHNENEVARLKKLVG

[a] The sequence corresponds to residues 249–279 of the GCN4 proteinof Saccharomyces cerevisiae. The underlined residues highlight thesubstitutions. For the synthesis see S1 and S2 in the SupportingInformation .

[*] Dr. M. Portwich,[$] C. C. Mahrenholz, I. Kretzschmar,Prof. Dr. A. Kramer, Dr. R. VolkmerInstitut f9r Medizinische ImmunologieCharit:-Universit=tsmedizin BerlinCampus Charit: MitteSchumannstrasse 20–21, 10117 Berlin (Germany)Fax: (+49)30-450-524942E-mail: [email protected]: http://www.charite.de/immunologie/research/agsm/

index.html

Dr. S. Keller,[$] Dr. H. M. Strauss[+] [$]

Leibniz Institute of Molecular Pharmacology FMPRobert-RHssle-Strasse 10, 13125 Berlin (Germany)

[+] Present address:Max Planck Institute of Colloids and InterfacesAm M9hlenberg 1, 14424 Golm (Germany)

[$] These authors have contributed equally.

[**] This work was supported by the Deutsche Forschungsgemeinschaft(SFB 449 to R.V. and SFB 498 to H.S.), the Ernst Schering ResearchFoundation, the J9rgen Manchot Foundation, and the Universi-t=tsklinikum Charit: Berlin.

Supporting information for this article is available on the WWWunder http://www.angewandte.org or from the author.

Communications

1654 � 2007 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim Angew. Chem. Int. Ed. 2007, 46, 1654 –1657

Page 2: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

positions has also been observed by Matthews, Vinson and co-workers.[15]

As multiple substitutions can entail a leap from dimeric totrimeric or tetrameric structures,[3a, 16] we extended thepeptide-array approach to an investigation of double sub-stitutions, focusing on residues located close to and within thecore (see Table S4.1 in the Supporting Information). Doublesubstitutions at positions a/d, a/e, and d/g of the GCN4leucine-zipper sequence resulted in a synthetic peptide arrayof 4320 double-substitution variants, which were then probedfor binding the native GCN4 sequence wt (Figure 2a,b).Altogether, 933 heteromeric associations were observed andare summarized in Figure 2c. Heptad I is characterized by thehighest tolerance toward substitution, whereas heptads II–IVshow different substitution tolerances according to I> III>II> IV for a/d, I> III> IV> II for a/e, and I> II> III> IVfor d/g. Given their classification as moderately and highlyvariable (Figure 1c), it is not surprising that double substitu-tions at a/e also resulted in the highest number of associations.This implies that single-point substitutions at positions a and eact additively upon simultaneous exchange. Surprisingly,however, a/d and d/g substitutions were equally poorlytolerated, suggesting that, besides the well-known corepositions a and d, the core-flanking position g plays animportant role in coiled-coil formation. The particular statusof position g implied by single-point substitutions was laterconfirmed by the double-substitution analysis.

To analyze if pairs of favorable or unfavorable substitu-tions exist, we classified amino acids into two sets: a hydro-philic set (x) comprising Arg, Asn, Asp, Gln, Glu, His, Lys,Ser, and Thr; and a hydrophobic set (F) comprising Ala, Gly,Ile, Leu, Met, Phe, Pro, Trp, Val, and Tyr. We then analyzed towhat extent the GCN4 heptads tolerate combinations of F/F,x/F, F/x, and x/x substitutions, as depicted by the color-codedslices in Figure 2c. All combinations are allowed for a/esubstitutions, with the exception of the second heptad, wheresubstitutions are restricted to F/F and F/x combinations. Atpositions a/d and d/g, however, combinations of hydrophilic(x/x) substitutions hardly occur and, if so, only in the firstheptad. Astonishingly, we observed a significant number of F/x replacements at positions a/d. Except for heptad IV, thehydrophobic amino acid at position d can be replaced by ahydrophilic one if the residue at position a is simultaneouslyexchanged by a hydrophobic one. Such an effect was notfound for d/g substitutions. With the exception of the firstheptad, only a hydrophobic amino acid could replace ad residue independent of the g replacement. Interestingly,both a/d and d/g pairs in heptad IV seem to be crucial forassociation.

Overall, three substitution effects were observed: First,several combinations of two previously tolerated singlesubstitutions (Figure 1b) were also accepted in double-substitution variants (addition effect). For example, variant1 included a basic Lys instead of Val at position aIV and anacidic Glu instead of Lys at position eIV (Table 1). Second,some previously untolerated single substitutions wereaccepted when they were combined with an already previ-ously tolerated substitution (transition effect). In variant 2,for instance, a hydrophilic Thr was accepted instead of the

Figure 1. a) A helical-wheel diagram depicting the homodimeric leucinezipper (coiled coil) of the yeast bZIP transcription factor GCN4 asviewed from the N terminus. The residues (single-letter code foramino acids is used) next to the viewer are surrounded by circles.Crossed arrows in the center denote interactions in the hydrophobicinterface termed the core and dashed arrows represent inter- andintrahelical salt bridges apparent in the structure.[11] b) Completesingle-point substitution analysis of the GCN4 leucine-zippersequence. Pink spots indicate interactions between cellulose-mem-brane-bound variants and a dye-labeled wild-type (wt) GCN4 leucine-zipper sequence that was synthesized by standard solid-phase peptidesynthesis and labeled at the N terminal with tetramethylrhodamine(Tamra) (see S1 and S2 in the Supporting Information). Each spotcorresponds to a variant in which one residue of the wt sequencegiven at the top was replaced by one of the 20 gene-encoded aminoacids as specified on the left. Spots in the first row represent replicasof the wt sequence. c) Percentage of replacement variability (V) ofeach sequence position. All spot signals on the array shown in (b)were measured quantitatively and successful replacements (countablebinding spots) were determined (see S4 and S5 in the SupportingInformation). V was calculated as V= (number of binding spots/20)M100[14] and plotted against the GCN4 leucine-zipper sequence.

AngewandteChemie

1655Angew. Chem. Int. Ed. 2007, 46, 1654 –1657 � 2007 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim www.angewandte.org

Page 3: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

usually highly sensitive Leu at position dIII when a Tyr waslocated close in space to position aIII. Third, severalcombinations of two previously untolerated single sub-stitutions were allowed when they were combined in theGCN4 sequence (saltation effect). For example, variant 3included Ile at position aIII and Asn at dIII, neither of whichwere tolerated as a single substitution (Figure 1 b).

As the peptide-array experiments used were designedto detect heterospecific associations, homospecific inter-actions of GCN4 variants were studied by a differentapproach. The synthetic GCN4 variants 1, 2, and 3 (see S3in the Supporting Information) were immobilized intriplicate on a cellulose membrane.[17] Each array wasthen probed with one of the variants and revealed apattern of homo- and heteromeric associations (Figure 3).In contrast to 2 and 3, 1 did not show homospecificassociation. Moreover, heterospecific association wasobserved between 2 and 3, whereas 1 associated hetero-specifically only with 3. CD spectroscopy affirmed thecoiled-coil character of the associations, which becameapparent in an increase in the helicity upon mixing twoassociating variants (see S6 in the Supporting Informa-tion).

Analytical ultracentrifugation was employed to char-acterize the homospecific association processes. Although2 (Figure 4a) and 3 both folded into stable homotrimericstructures (termed 23 and 33, respectively), 1 was mainlymonomeric (see S7 in the Supporting Information).Isothermal titration calorimetry allowed a deeper viewinto the heterospecific association processes. Uponmixing, 23 and 33 rearranged to form a heteromericcoiled coil with a 1:1 stoichiometry (Figure 4b and S7 inthe Supporting Information). Monomeric 1 also formed a1:1 complex when mixed with 33, whereas no heterospe-cific interaction was observed between 1 and 23. Ultra-centrifugation demonstrated that all heteromeric com-

Figure 2. a) Synthetic peptide arrays displaying the complete set of GCN4leucine-zipper sequences with double substitutions at heptad positions a/d,a/e, and d/g in heptads I–IV by using all gene-encoded amino acids exceptfor Cys. For practical reasons, each assembly comprised 26M14 synthesissites, thus enabling the implementation of controls. Each colored spotrepresents a variant that is heterospecifically associated with a wild-typepeptide that is marked with Tamra on the N terminus (see S2 in theSupporting Information ). b) Double substitutions of aIV/eIV (V23/K27).Arrays were analyzed with a Lumi-Imager and the signal intensities werethen translated into Boehringer light units (BLUs), charted, and grayscaledfor better evaluation (see S4 and S5 in the Supporting Information ). Rowsrepresent the aIV position and columns the eIV position, both of which werescanned by using the 19 amino acids shown. Gray and black cells denoteassociating and strongly associating variants, respectively. See S5 in theSupporting Information for the complete set of tables. c) Pie chartsstanding for a/d, a/e, and d/g substitutions and summarizing heterospe-cific associations observed in (a). Each chart represents one type of doublesubstitution in all four heptads, shown as quarters I–IV. The radius of aquarter correlates with the number of possible substitutions in thecorresponding heptad. The surface of each slice scales with the number ofsubstitutions classified as hydrophobic–hydrophobic (F/F ; blue), hydro-philic–hydrophobic (x/F ; turquoise), hydrophobic–hydrophilic (F/x ;orange), or hydrophilic–hydrophilic (x/x ; red).

Figure 3. Array analysis of homo- and heterospecific associationsof GCN4 leucine-zipper variants 1–3 (Table 1). Three identicalarrays a)–c) were generated by immobilizing the synthetic peptideswt, 1, 2, and 3 on a cellulose support (see S1 and S3 in theSupporting Information ). Each array contained variants 1–3 andthe native GCN4 leucine-zipper wt as a control in triplicate. Arrayswere incubated with biotinylated derivatives of 1–3 as indicated onthe right. The associations were visualized by subsequent incuba-tion with peroxidase-labeled streptavidin (see S2 and S4 in theSupporting Information).[17b]

Communications

1656 www.angewandte.org � 2007 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim Angew. Chem. Int. Ed. 2007, 46, 1654 –1657

Page 4: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

plexes formed were dimers and confirmed the absence ofheteroassociation between variants 1 and 2 (see S7 in theSupporting Information). Figure 4c summarizes the interac-tions and stoichiometries of all coiled coils formed by thethree variants.

In conclusion, the presented synthetic peptide arrays canbe applied to the study of coiled-coil associations, without,however, providing information about the stoichiometry andtopology of the coiled coils. Our array analyses lead to a newappraisal of the heptad position g, whose replacement sensi-tivity was found to be significantly higher than that of theformally similar position e. Double-point substitutions canentail addition, transition, or saltation effects, which allowedus to create an interaction network of three GCN4 leucine-

zipper sequence variants including a monomer as well as twohomotrimeric and two heterodimeric coiled coils. PositionsaIII and dIII act as a switch for the transition from the nativehomodimeric coiled coil to homotrimeric structures. Hence,the native function of the GCN4 leucine zipper, that is, theformation of a homodimeric protein, can be fundamentallyaltered by only two substitutions. Conceivably, this supra-molecular flexibility of the coiled-coil motif is the quality thatmakes it such a versatile actor in nature.

Received: August 9, 2006Published online: January 9, 2007

.Keywords: helical structures · peptide arrays · peptides ·protein structures · spot synthesis

[1] a) J. M. Mason, K. M. Arndt, ChemBioChem 2004, 5, 170;b) A. N. Lupas, M. Gruber, Adv. Protein Chem. 2005, 70, 37.

[2] F. H. S. Crick, Acta Crystallogr. 1953, 6, 689.[3] a) P. B. Harbury, T. Zhang, P. S. Kim, T. Alber, Science 1993, 262,

1401; b) P. B. Harbury, P. S. Kim, T. Alber, Nature 1994, 371, 80;c) D. L. Akey, V. N. Malashkevich, P. S. Kim, Biochemistry 2001,40, 6352; d) B. Tripet, K. Wagschal, P. Lavigne, C. T. Mant, R. S.Hodges, J. Mol. Biol. 2000, 300, 377.

[4] a) N. E. Zhou, C. M. Kay, R. S. Hodges, J. Mol. Biol. 1994, 237,500; b) D. Krylov, I. Mikhailenko, C. Vinson, EMBO J. 1994, 13,2849; c) W. D. Kohn, C. M. Kay R. S. Hodges, J. Mol. Biol. 1998,283, 993; d) R. A. Kammerer, D. Kostrewa, P. Progias, S.Honnappa, D. Avila, A. Lustig, F. K. Winkler, J. Pieters, M. O.Steinmetz, Proc. Natl. Acad. Sci. USA 2005, 102, 13891.

[5] E. K. OJShea, K. J. Lumb, P. S. Kim, Curr. Biol. 1993, 3, 658.[6] a) Multicoil: E. Wolf, P. S. Kim, B. Berger, Protein Sci. 1997, 6,

1179; b) Socket: J. Walshaw, D. N. Woolfson, J. Mol. Biol. 2001,307, 1427; c) Coils: A. Lupas, M. van Dyke, J. Stock, Science1991, 252, 1162; d) Paircoil: B. Berger, D. B. Wilson, E. Wolf, T.Tonchev, M. Milla, P. S. Kim, Proc. Natl. Acad. Sci. USA 1995,92, 8259; e) J. M. Mason, M. A. Schmitz, K. M. M2ller, K. M.Arndt, Proc. Natl. Acad. Sci. USA 2006, 103, 8989.

[7] J. C. Hu, E. K. OJShea, P. S. Kim, R. T. Sauer, Science 1990, 250,1400.

[8] J. N. Pelletier, K. M. Arndt, A. Pl2ckthun, S. W. Michnick, Nat.Biotechnol. 1999, 17, 683.

[9] J. R. Newman, A. E. Keating, Science 2003, 300, 2097.[10] R. Frank, J. Immunol. Methods 2002, 267, 13.[11] E. K. OJShea, J. D. Klemm, P. S. Kim, T. Albert, Science 1991,

254, 539.[12] For details about the synthesis and purity of spot-synthesized

cellulose-membrane-bound peptide arrays please refer to S1 inthe Supporting Information.

[13] A. A. Weiser, M. Or-Guil, V. Tapia, A. Leichsenring, J.Schuchhardt, C. FrKmmel, R. Volkmer-Engert, Anal. Biochem.2005, 342, 300.

[14] J. Przdzezdiak, S. Tremmel, I. Kretzschmar, M. Beyermann, M.Bienert, R. Volkmer-Engert, ChemBioChem 2006, 7, 780.

[15] a) B. Ibarra-Molero, J. A. Zitzewitz, C. R. Matthews, J. Mol.Biol. 2004, 336, 989; b) C. Vinson, A. Acharya, E. J. Taparowsky,Biochim. Biophys. Acta 2006, 1759, 4.

[16] X. Zeng, H. Zhu, H. A. Lashuel, J. C. Hu, Protein Sci. 1997, 6,2218.

[17] H. K. Rau, N. DeJonge, W. Haehnel, Angew. Chem. 2000, 112,256; Angew. Chem. Int. Ed. 2000, 39, 250.

Figure 4. a) Homoassociation of variant 2 as followed by analyticalultracentrifugation. Peptide samples at concentrations of 3.0 mgmL�1

(black circles), 1.5 mgmL�1 (red circles), and 0.3 mgmL�1 (bluecircles) were spun at 50 krpm to give the local refractive indexdifference, Dn, as a function of the squared distance from the rotoraxis, d2, at sedimentation and chemical equilibrium. For clarity, onlyevery 10th data point is shown. The best fits (lines) were obtainedusing a nonideal monomer-dimer-trimer model (see Supporting Infor-mation S7). b) Heteroassociation of variants 2 and 3 as monitored byisothermal titration calorimetry. Aliquots (10 mL) of a 1.08 mm solutionof 3 were injected into 124 mm 2 at 25 8C. The best fit (blue line) to thenormalized heats of reaction, Q (red circles), yielded an associationconstant of 1.2M105m�1 and a stoichiometry of 1:1. R is the molarratio of 3 :2. c) Diagram of mutual associations between nonimmobi-lized variants 1, 2, and 3 as derived from the biophysical experimentsshown in a) and b) (see also S7 in the Supporting Information ).

AngewandteChemie

1657Angew. Chem. Int. Ed. 2007, 46, 1654 –1657 � 2007 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim www.angewandte.org

Page 5: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

1

A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine Zipper

Arrays

Michael Portwich, Sandro Keller, Holger M. Strauss, Carsten C. Mahrenholz, Ines

Kretzschmar, Achim Kramer, and Rudolf Volkmer

S1: Solid-phase peptide synthesis of GCN4 leucine zipper variants

Soluble GCN4 leucine zipper variants for dye labeling, biotin labeling, immobilization, and

biophysical experiments were synthesized by solid-phase peptide synthesis according to the

Fmoc strategy. Variants wt, 1, 2, and 3 were prepared with either an N-terminal (Cββ-) or a

C-terminal (-ββC) elongation (β = β-alanine). Peptides without elongation were used for

biophysical experiments. The following sequences were used for:

dye labeling: CββRMKQLEDKVEELLSKNYHLENEVARLKKLVG (Cββ-wt)

biotin labeling: CββRMKQLEDKVEELLSKNYHLENEKARLEKLVG (Cββ-1)

CββRMKQLEDKVEELLSKYYHTENEVARLKKLVG (Cββ-2)

CββRMKQLEDKVEELLSKIYHNENEVARLKKLVG (Cββ-3)

immobilization: RMKQLEDKVEELLSKNYHLENEVARLKKLVGββC (wt-ββC)

RMKQLEDKVEELLSKNYHLENEKARLEKLVGββC (1-ββC)

RMKQLEDKVEELLSKYYHTENEVARLKKLVGββC (2-ββC)

RMKQLEDKVEELLSKIYHNENEVARLKKLVGββC (3-ββC)

Automatic solid-phase peptide synthesis was performed on Tentagel-SRam resin (Rapp

Polymere, Tübingen, Germany). Reactions were performed in plastic syringes at room

temperature on a multi-peptide synthesis robot (Syro2000, MultiSynTech, Witten, Germany).

Couplings were achieved by reacting 4 equivalents Fmoc-AA-OH with 4 equivalents PyBOB

and 8 equivalents NMM in DMF. A solution of piperidine (20%) in DMF was used to remove

the Fmoc protecting group. Peptides were deprotected and cleaved from the resin using a

mixture of 10 mL TFA, 0.75 g phenol, 0.5 mL water, 0.5 mL methylphenylsulfide, and

0.25 mL 1,2-ethandithiol. After 3 h at room temperature, the cleavage solution was collected,

and the crude peptides were precipitated from the solution with dry ether at 0 °C. HPLC

purification and analysis were achieved using a linear solvent gradient (A: 0.05% TFA in

water; B: 0.05% TFA in acetonitrile; gradient: 5–60% B over 30 min; UV detector at 214 nm;

Page 6: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

2

RP-18 column). The identities of the peptides were validated by mass spectrometry using

MALDI-TOF (VoyagerLT, Applied Biosystems, Weiterstadt, Germany) and ESI (Q-

TOFmicro, Micromass, Manchester, UK).

SPOT synthesis: Synthesis of cellulose-membrane-bound arrays was done according to: H.

Wenschuh, H. Gausepohl, L. Germeroth, M. Ulbricht, H. Matuschewski, A. Kramer, R.

Volkmer-Engert, N. Heine, T. Ast, D. Scharn, J. Schneider-Mergener in Combinatorial

Chemistry, A Practical Approach. (Ed.: H. Fenniri), Oxford University Press, Oxford, 2000,

pp. 95–116. The purity of SPOT-synthesized peptides is generally in the range of 40–85%,

which is adequate for screening assays. Even longer peptides can be SPOT-synthesized in

sufficient quality, as recently shown by Przezdziak et al.[1] The SPOT-synthesized FBP28

WW domain, containing 34 residues, was obtained in ~65% purity as determined by HPLC,

and the identity was confirmed by mass spectrometry.[1]

S2: Dye and biotin labeling

Cββ-wt was dye-labeled by Michael addition of its N-terminal Cys thiol group to

tetramethylrhodamine-6-maleimide (Molecular Probes, Eugene, USA). The peptide was

dissolved in sodium phosphate buffer (10 mM, pH 7.4, argon-sparged) at a concentration of

6.8 mM. Then, 1.5 equivalents maleimide dye compound and 1.5 equivalents tris(2-

carboxyethyl)phosphine hydrochloride were added, and the reaction mixture was stirred for

4 h at room temperature. Afterwards, 10 equivalents 2-mercaptoethanol was added to cap

unmodified maleimido groups (1 h) before adding 100 equivalents dimethylsulfoxide (1 h).

The dye-labeled peptide was purified by HPLC as described in S1; detection was carried out

at 540 and 214 nm. The identity of the peptide was confirmed by mass spectrometry (see S1).

According to the same protocol, variants Cββ-1, Cββ-2, and Cββ-3 were N-terminally

labeled with biotin using the Nα-(3-maleimidylpropionyl)biocytin reagent (Molecular Probes,

Eugene, USA), purified, and confirmed.

S3: Immobilization of wt-ββC, 1-ββC, 2-ββC, and 3-ββC on cellulose membranes

The purified and analytically characterized peptides (see S1) were immobilized via their C-

terminal Cys to a maleimide-functionalized Whatman 50 cellulose support as described by

Otte et al.[2]

S4: Binding assay on cellulose membranes

Page 7: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

3

General: Membrane-bound GCN4 variants were washed with DMF, ethanol, and three times

with TBS buffer (13.7 mM NaCl, 0.27 mM KCl, 5 mM Tris (tris(hydroxymethyl)amino-

methane), pH 8.0) for 10 min each. Membranes were incubated for 3 h with blocking buffer

(for 50 mL blocking buffer: 5 mL blocking buffer (Sigma–Aldrich, Steinheim, Germany),

2.5 g sucrose, 5 mL 10 × TBS buffer (137 mM NaCl, 2.7 mM KCl, 50 mM Tris, pH 8.0),

filled up to 50 mL with water).

Single-substitution arrays: Arrays were incubated with a solution of N-terminally tetra-

methylrhodamine-labeled Cββ-wt at a concentration of 10 µM in blocking buffer at room

temperature overnight. Arrays were washed three times with TBS, and the resulting spot

patterns were recorded with a flatbed scanner (Snapscan e40, Agfa, Mortsel, Belgium).

Subsequently, arrays were analyzed densitometrically to yield spot signal intensities in terms

of Boehringer light units using the Lumi-Analyst software (Roche, Mannheim, Germany; see

also S5). Arrays were regenerated by washing with DMF (4 times, 20 min each) followed by

washing with ethanol (3 times, 3 min each) and drying. As a negative control, arrays were

probed with 2-mercaptoethanol coupled to tetramethylrhodamine-6-maleimide (prepared

according to S2).

Double-substitution arrays: Cellulose membranes were incubated with a solution of N-

terminally tetramethylrhodamine-labeled Cββ-wt at a concentration of 50 µM in blocking

buffer at room temperature overnight before proceeding as described above. Results are

shown in tabular form in S5.

Table S4.1: GCN4 heptad positions and residues selected for double substitutions.

Position Substituted residues Location a/dI Met2/Leu5 core/core aII/dII Val9/Leu12 core/core aIII/dIII Asn16/Leu19 core/core aIV/dIV Val23/Leu26 core/core aI/eI Met2/Glu5 core/flank aII/eII Val9/Leu13 core/flank aIII/eIII Asn16/Glu20 core/flank aIV/eIV Val23/Lys27 core/flank dI/gI Leu5/Lys8 core/flank dII/gII Leu12/Lys15 core/flank dIII/gIII Leu19/Glu22 core/flank dIV/gIV Leu26/Leu29 core/flank

Mutual associations between variants wt, 1, 2, and 3: Three identical arrays were incubated

simultaneously, each with one of the biotin-labeled variants (Cββ-1, Cββ-2, and Cββ-3) at

10 µM in blocking buffer at room temperature overnight. After washing 3 times with TBS,

Page 8: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

4

arrays were incubated for 30 min with horse-radish-peroxidase-labeled streptavidin (Sigma,

St. Louis, USA) according to the supplier’s instructions and subsequently washed five times

with TBS. Binding was visualized using a chemoluminescent substrate (Uptilight HRP,

Uptima, Interchim, Montluçon, France) and a Lumi-Imager (Roche, Mannheim, Germany).

S5: Quantification of spot signal intensities

A single- or double-substitution variant was classified as associating if its spot signal intensity

was greater than the lower cutoff value given by Ø – 1.96s , where Ø is the average and s the

standard deviation of 62 signal intensities (single-substitution arrays) or 30 signal intensities

(double-substitution arrays) as calculated for the spots of the native GCN4 leucine zipper

sequence. A double-substitution variant was further classified as strongly associating if its

spot signal intensity was greater than the higher cutoff value given by Ø + 1.96s .

Spot signal intensities obtained from all double-substitution binding experiments are shown in

Tables S5.1–S5.12. Each table represents an array after incubation with a dye-labeled native

GCN4 leucine zipper sequence. Amino-acid residues used for the more N-terminal of the two

substitutions are indicated on the left, whereas residues used for the more C-terminal

substitution are given at the top of each table. Gray cells denote spot signal intensities within

the interval delimited by Ø ± 1.96s , whereas black cells denote intensities greater than

Ø + 1.96s .

A D E F G H I K L M N P Q R S T V W Y

A 39 94 105 102 57 79 81 45 118 79 62 53 56 67 76 75 62 181 80D 72 50 59 97 58 50 43 104 87 89 56 64 68 63 65 56 70 161 94E 54 56 63 90 64 76 72 54 108 82 71 25 41 43 88 60 60 166 73F 92 81 50 186 85 70 123 55 307 118 103 70 66 137 65 77 98 328 134G 51 25 49 82 57 62 81 55 101 78 65 63 49 89 65 68 81 227 113H 66 72 56 193 54 49 98 62 110 85 51 52 72 78 75 90 61 196 109I 60 42 42 151 71 70 113 54 231 124 49 51 66 71 60 26 50 293 102K 62 64 42 179 58 75 73 92 87 71 69 57 72 82 82 94 61 307 112L 73 61 39 131 73 75 150 64 263 102 63 48 66 67 47 68 74 230 160M 78 44 57 193 60 60 152 62 234 99 32 71 78 86 78 74 80 317 96N 64 63 44 128 57 82 96 73 119 86 61 73 76 87 59 63 80 213 136P 40 40 56 194 44 73 76 58 120 68 45 66 52 104 78 79 78 217 86Q 89 67 52 146 65 33 91 40 100 81 42 49 47 83 70 56 78 246 155R 77 67 60 193 76 99 145 85 172 105 75 64 60 139 51 74 88 293 177S 47 53 55 180 63 61 71 66 132 86 61 75 54 99 69 69 82 276 116T 36 65 38 120 44 51 91 63 142 89 70 42 70 74 52 57 78 265 118V 59 58 43 175 71 62 94 54 98 61 54 42 39 63 36 54 93 338 163W 252 168 117 680 168 222 701 158 714 414 165 215 189 395 161 178 188 1634 1140Y 81 66 46 254 85 190 215 130 472 299 70 127 67 130 108 101 157 1392 465

Table S5.1: Double substitutions at positions aI (M2) and dI (L5). .

Page 9: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

5

A D E F G H I K L M N P Q R S T V W YA 21 30 26 35 19 18 35 25 103 48 31 37 34 42 43 41 40 73 27D 26 25 18 34 23 29 21 48 28 36 38 28 16 26 32 35 30 49 51E 30 28 40 43 45 29 26 40 53 30 24 36 29 13 29 24 38 28 33F 103 22 322 40 46 36 74 39 237 59 46 44 99 36 40 31 63 60 93G 36 3 23 29 22 38 37 37 26 40 22 42 42 40 23 15 40 59 35H 41 34 40 50 21 48 36 36 35 40 50 42 43 43 27 47 41 51 42I 288 25 74 53 54 42 80 31 403 55 43 24 69 55 48 8 58 56 71K 30 31 35 27 21 27 42 31 76 36 19 43 43 53 29 49 20 66 46L 86 48 76 29 61 24 56 24 177 41 26 22 44 30 47 31 61 60 74M 68 67 50 45 41 47 51 47 107 29 29 58 101 44 29 23 28 70 41N 31 38 36 38 23 24 35 39 29 44 23 30 37 50 47 41 32 31 43P 30 27 22 31 29 35 19 29 26 51 33 27 27 37 42 36 36 54 48Q 26 48 32 52 26 11 59 31 39 30 31 27 38 36 44 23 35 62 42R 44 38 45 47 43 45 35 37 81 29 47 35 0 61 53 32 40 142 39S 19 36 24 23 35 38 33 36 39 36 49 41 30 30 28 43 31 75 38T 5 52 42 21 38 37 39 34 37 42 18 35 38 43 33 19 37 63 41V 198 24 43 33 56 38 55 16 158 67 35 21 33 22 33 28 29 45 64W 121 28 262 44 54 70 80 66 123 94 61 41 119 84 29 38 71 155 111Y 122 19 507 53 22 36 43 29 475 43 23 14 137 43 49 43 89 114 417

Table S5.2: Double substitutions at positions aII (V9) and dII (L12).

A D E F G H I K L M N P Q R S T V W YA 33 32 38 61 23 26 19 6 76 36 29 10 29 67 26 31 17 33 48D 19 18 19 16 17 21 1 32 30 31 35 30 21 26 21 18 28 37 25E 30 35 25 23 27 28 28 39 49 34 31 29 39 14 69 47 34 39 29F 35 30 32 40 34 46 49 39 133 56 40 37 49 43 42 141 55 72 49G 25 4 27 21 41 23 39 24 73 41 32 32 31 35 30 30 29 41 38H 37 34 35 45 36 30 120 19 857 69 26 46 27 47 41 50 41 49 41I 52 28 22 60 45 236 40 65 41 65 186 31 72 37 38 37 57 62 60K 32 26 18 51 26 39 31 27 173 43 30 36 21 49 38 27 31 62 38L 20 26 30 23 25 34 38 33 48 29 32 24 44 32 22 40 32 43 36M 45 35 22 41 31 57 58 34 395 73 42 34 51 52 43 78 47 68 44N 27 44 15 48 26 38 59 24 223 48 43 31 18 30 29 21 36 65 37P 41 37 20 32 20 34 20 25 35 24 30 34 23 31 27 19 22 54 27Q 35 23 23 43 29 23 97 44 342 93 29 18 42 52 27 41 109 52 46R 27 28 26 72 38 66 49 22 405 50 41 22 26 37 48 20 43 59 48S 20 17 18 52 35 39 126 28 328 69 36 26 25 57 23 41 74 62 51T 30 30 42 69 41 44 275 28 736 215 73 29 29 46 50 95 156 68 28V 40 12 23 23 36 47 28 11 34 55 52 44 42 42 40 49 15 42 35W 34 44 47 57 79 59 83 46 180 83 47 60 69 49 26 106 76 166 107Y 44 40 30 90 21 36 251 45 1363 316 68 25 45 47 39 237 124 100 101

Table S5.3: Double substitutions at positions aIII (N16) and dIII (L19).

A D E F G H I K L M N P Q R S T V W YA 54 36 24 33 26 33 39 18 100 27 23 38 20 34 35 14 17 35 43D 24 23 23 33 24 33 6 40 56 36 19 24 18 29 26 17 20 61 31E 33 20 41 27 23 26 27 20 36 35 25 22 26 24 24 45 31 36 30F 32 35 33 32 38 23 27 38 123 32 27 19 23 19 22 13 26 73 33G 31 2 39 31 23 38 30 29 30 28 14 28 22 27 36 17 17 39 34H 34 12 22 24 37 36 25 24 39 37 43 29 23 35 31 23 34 73 37I 32 23 31 41 29 27 110 28 227 40 27 18 21 25 18 4 30 62 63K 29 16 23 28 30 35 50 39 173 47 38 38 19 33 42 30 41 84 60L 29 27 30 0 45 62 54 27 157 42 35 43 40 45 44 37 45 110 65M 37 32 41 45 39 41 27 36 91 34 0 91 62 45 44 23 25 54 63N 28 42 39 28 21 42 36 20 25 39 22 21 32 44 22 43 26 21 47P 58 31 16 30 27 32 27 30 32 44 36 34 36 39 26 24 26 72 35Q 39 28 22 50 33 0 41 58 60 31 33 22 39 49 44 28 42 76 48R 47 32 23 45 28 41 52 41 178 44 25 39 0 61 69 33 38 92 49S 36 29 29 47 42 56 44 33 45 45 32 31 30 37 28 37 29 80 39T 0 35 66 48 50 32 40 25 39 39 39 38 34 41 34 23 32 79 36V 24 24 32 63 38 47 69 1 155 87 60 43 32 49 26 36 47 93 85W 33 37 37 64 43 46 37 66 138 61 49 61 60 95 18 54 76 123 90Y 18 22 20 41 35 46 42 34 829 33 30 32 18 28 16 21 32 243 252

Table S5.4: Double substitutions at positions aIV (V23) and dIV (L26).

Page 10: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

6

A D E F G H I K L M N P Q R S T V W YA 186 56 112 214 118 134 204 136 241 204 103 115 142 193 186 110 164 1546 358D 107 53 120 211 108 103 117 90 120 100 101 87 66 168 124 143 123 485 189E 93 56 98 277 102 122 150 195 129 157 97 88 94 112 127 82 131 503 223F 329 161 251 498 303 307 441 368 374 1063 303 206 322 477 861 487 698 1034 1168G 191 63 87 260 152 160 216 145 253 224 94 131 146 154 144 182 265 679 426H 224 120 202 807 198 220 440 167 238 218 181 125 240 174 222 200 227 1019 516I 216 69 270 400 288 305 363 398 347 375 213 172 205 398 428 193 286 876 841K 158 97 119 508 142 166 236 145 213 204 158 157 161 321 226 194 308 874 573L 283 107 278 378 246 331 430 286 294 342 188 159 298 320 388 358 320 1632 1211M 296 97 270 530 289 307 443 293 530 412 169 99 247 377 177 213 290 934 1180N 173 105 123 328 142 166 238 212 229 187 186 146 134 230 240 132 170 501 220P 134 65 140 233 176 140 182 197 171 212 98 154 108 238 213 197 200 1214 353Q 136 105 140 344 162 127 264 119 197 174 110 167 97 205 239 166 212 888 316R 303 160 222 594 260 265 461 142 434 350 153 237 161 222 221 216 317 908 494S 167 80 174 411 194 160 272 168 312 280 135 150 210 230 220 214 167 788 366T 170 60 138 304 123 161 237 139 304 284 140 172 149 201 169 164 261 827 419V 324 96 179 457 162 203 209 206 281 251 112 134 132 267 230 220 202 1059 522W 920 1384 1350 1688 1554 1670 1497 1123 1296 1713 1501 546 1791 667 1515 1351 1269 1846 2092Y 371 190 842 850 455 735 1048 769 502 1358 475 313 924 1194 1232 1242 1229 2121 1800

Table S5.5: Double substitutions at positions aI (M2) und eI (E5).

A D E F G H I K L M N P Q R S T V W YA 50 22 50 41 31 33 60 43 65 86 27 15 41 96 47 44 60 227 146D 34 21 29 26 22 40 5 27 30 21 20 28 17 19 13 15 8 36 23E 19 30 21 32 29 22 37 30 31 47 14 24 38 37 14 34 29 64 22F 79 38 57 179 27 51 195 118 258 204 58 33 121 120 217 69 134 485 295G 26 23 18 30 33 28 19 17 21 17 25 22 18 32 14 19 14 86 39H 27 16 28 56 19 33 63 48 69 34 21 22 30 61 21 33 46 120 35I 252 255 378 394 93 263 389 279 263 484 283 17 237 359 243 149 218 566 682K 37 26 55 57 35 40 61 37 70 42 37 29 34 72 31 28 26 112 67L 195 70 109 255 38 59 229 143 231 239 75 32 106 213 83 86 195 525 630M 85 65 92 133 48 58 155 84 105 82 32 20 59 82 54 38 70 234 188N 41 27 23 37 29 20 25 47 31 41 23 22 42 32 27 35 47 49 25P 28 33 22 28 28 36 28 42 27 51 30 33 28 39 28 26 26 66 23Q 32 40 17 34 22 19 23 31 34 44 23 25 26 37 34 40 33 121 49R 39 32 45 62 26 45 59 61 68 74 34 32 51 85 35 38 67 176 98S 40 31 30 44 34 43 36 41 52 35 44 25 36 42 68 38 30 102 57T 28 15 19 32 30 39 52 34 48 46 17 32 30 36 35 36 42 170 42V 334 144 123 312 94 71 273 164 188 155 70 12 155 230 111 79 184 1334 606W 134 53 68 164 72 132 258 298 243 140 105 37 109 246 136 88 112 1635 454Y 124 9 43 439 36 86 288 290 755 349 74 22 204 378 215 82 222 1372 629

Table S5.6: Double substitutions at positions aII (V9) und eII (L13).

A D E F G H I K L M N P Q R S T V W YA 16 40 44 26 45 29 20 35 28 47 29 27 25 27 30 24 19 78 32D 24 14 32 14 26 18 8 34 14 36 31 39 52 42 56 30 18 67 35E 36 23 42 44 27 46 45 60 45 82 47 12 42 62 38 35 40 108 64F 61 128 171 56 93 57 59 125 54 116 90 31 131 129 199 227 64 95 74G 61 19 171 56 56 65 52 73 49 120 46 34 73 78 65 57 33 104 57H 445 164 459 207 85 264 340 1256 466 719 377 23 478 1398 774 1065 944 509 315I 36 41 31 32 25 32 30 37 51 29 33 29 26 41 42 19 25 64 54K 62 47 280 71 37 60 102 97 219 199 33 37 166 75 96 152 79 122 72L 30 46 35 24 35 46 40 40 43 46 33 23 42 42 43 30 23 68 44M 177 120 600 93 116 109 160 303 307 230 102 30 341 396 173 218 192 293 149N 254 82 301 170 142 331 183 443 222 497 473 25 405 1034 293 340 271 409 312P 19 17 28 32 23 23 25 28 36 41 39 22 31 41 30 28 18 67 45Q 314 311 1210 413 480 146 434 516 264 887 191 14 740 926 774 448 257 1068 762R 168 62 419 128 106 107 275 193 466 424 94 34 305 176 134 233 216 230 175S 130 176 674 83 110 68 61 93 68 155 86 54 109 112 135 109 76 135 112T 101 254 971 84 135 70 122 98 102 306 87 39 204 126 207 152 77 297 118V 39 41 46 42 54 56 33 20 15 31 25 41 32 43 38 38 48 97 57W 170 230 230 136 184 228 139 358 150 471 383 71 271 253 326 202 169 187 222Y 1028 1185 1756 546 1563 631 191 1232 530 1372 986 47 1339 1313 1247 1545 502 1092 1180

Table S5.7: Double substitutions at positions aIII (N16) und eIII (E20).

Page 11: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

7

A D E F G H I K L M N P Q R S T V W YA 76 23 46 82 56 55 73 159 124 120 42 27 128 580 74 68 54 180 283D 30 26 30 30 24 30 15 26 22 27 26 22 17 23 25 18 37 30 34E 28 25 28 23 29 44 33 59 72 36 31 26 28 39 18 29 27 49 36F 122 51 102 143 70 64 171 242 267 144 102 28 202 171 218 257 164 169 163G 19 15 15 34 37 41 43 36 49 32 35 34 36 45 43 24 16 62 33H 53 31 56 65 54 41 107 68 95 65 39 35 70 76 90 68 142 134 78I 274 205 840 679 222 141 265 334 439 388 837 32 398 404 260 313 252 759 886K 172 94 934 241 124 126 1499 303 1861 834 335 12 566 205 334 357 342 931 648L 140 72 161 191 55 113 209 327 265 253 197 19 347 474 150 231 239 778 515M 68 44 65 78 48 81 111 81 197 90 38 21 128 99 82 59 109 117 125N 35 21 29 57 36 48 23 36 32 42 26 30 29 35 23 21 24 31 28P 40 23 38 30 68 29 25 41 35 47 36 46 32 38 23 17 34 58 26Q 62 27 59 66 32 0 43 63 96 58 55 39 74 71 58 58 45 158 66R 969 218 854 411 134 181 1180 298 1131 758 274 32 894 413 533 451 1058 1291 1320S 47 34 43 57 28 62 36 75 55 57 33 39 42 79 32 26 37 113 64T 0 9 39 48 18 23 37 60 39 43 30 25 44 68 39 44 33 95 64V 281 85 278 496 168 109 292 318 163 233 259 31 245 412 184 283 208 506 367W 116 111 74 122 64 119 180 146 153 146 180 38 105 111 120 84 175 220 255Y 338 127 152 478 93 108 1108 971 1253 1102 237 19 656 859 631 481 1078 959 644

Table S5.8: Double substitutions at positions aIV (V23) und eIV (K27).

A D E F G H I K L M N P Q R S T V W YA 27 33 44 137 47 48 63 72 65 77 48 46 55 93 84 50 68 197 121D 56 65 42 112 43 58 145 32 81 54 29 32 40 39 43 54 49 190 84E 32 40 30 70 28 50 78 68 85 56 63 69 58 119 34 46 49 150 75F 105 45 57 161 46 156 208 191 191 170 75 79 109 234 141 117 185 689 378G 59 71 38 100 49 42 88 27 89 57 31 48 49 66 74 70 54 199 112H 61 55 41 128 57 82 148 91 134 49 48 47 64 123 66 70 67 328 185I 77 52 70 166 51 100 130 106 209 119 66 46 98 127 92 118 90 252 203K 41 36 37 81 54 65 91 71 109 96 56 40 67 90 61 73 76 179 134L 191 85 142 203 38 153 220 213 222 249 105 47 298 211 114 216 178 651 268M 86 32 58 194 58 80 193 111 187 163 54 27 58 92 65 65 119 452 537N 51 32 36 107 49 50 60 54 88 60 41 48 67 91 82 66 83 216 122P 45 30 39 96 42 54 127 52 110 72 42 46 67 90 51 43 67 177 97Q 45 41 43 113 46 64 74 50 102 76 41 55 49 87 82 69 56 215 120R 79 54 46 136 47 47 135 51 105 80 72 77 70 112 85 76 103 204 191S 45 32 53 112 68 58 103 52 109 64 53 21 32 83 52 46 83 227 157T 54 52 37 130 58 64 107 85 99 73 55 57 63 127 70 57 76 229 153V 46 24 47 60 38 74 89 88 109 104 45 38 44 116 65 106 101 331 208W 250 237 324 1074 148 251 365 271 376 314 174 162 319 513 238 390 301 1773 1630Y 132 129 97 439 131 165 389 224 254 320 87 84 128 230 113 170 133 2044 775

Table S5.9: Double substitutions at positions dI (L5) und gI (K8).

A D E F G H I K L M N P Q R S T V W YA 64 27 23 188 57 113 203 256 385 265 96 7 255 384 66 80 103 1012 148D 28 24 35 38 27 35 42 13 20 32 15 19 21 34 18 20 16 58 16E 16 22 27 32 30 35 38 59 29 40 24 25 46 82 10 22 31 73 47F 31 20 21 40 28 49 43 37 38 48 33 25 40 57 32 13 29 162 81G 44 29 16 20 14 24 27 35 42 55 19 19 37 61 20 39 28 112 29H 34 22 20 30 17 36 33 38 43 18 16 16 27 35 22 24 24 97 50I 70 36 22 53 28 35 37 63 68 51 40 19 33 73 57 29 33 266 58K 16 24 30 20 17 31 18 27 23 31 25 16 24 36 31 32 21 86 45L 207 70 134 153 38 97 294 318 232 187 98 16 176 388 188 134 182 1658 1563M 40 23 33 54 45 31 53 60 70 40 38 19 26 75 19 33 42 272 126N 27 29 24 33 35 21 29 30 30 30 29 24 34 39 30 20 31 107 18P 31 10 28 18 14 24 17 20 21 37 18 25 35 30 30 21 22 72 35Q 17 37 30 42 33 33 45 49 52 35 25 24 28 56 19 26 47 187 135R 45 19 21 31 22 39 34 39 34 43 19 30 35 25 23 16 27 132 51S 33 27 20 29 29 25 40 36 35 34 24 13 33 47 24 22 30 150 51T 29 22 21 34 33 36 27 31 29 34 21 24 32 33 36 20 27 124 28V 32 17 25 31 23 29 44 30 35 30 16 21 31 67 23 29 29 169 79W 64 44 44 89 45 61 89 67 73 56 38 35 49 79 48 35 42 588 269Y 51 27 23 53 40 51 60 93 55 65 46 19 27 91 36 36 38 1202 227

Table S5.10: Double substitutions at positions dII (L12) and gII (K15).

Page 12: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

8

A D E F G H I K L M N P Q R S T V W YA 0 5 14 14 15 16 8 16 10 29 5 14 19 27 23 19 20 31 28D 13 13 18 17 17 17 15 13 23 38 14 21 32 38 16 14 32 38 24E 38 35 34 33 25 33 32 40 23 39 27 24 29 17 17 23 26 36 35F 26 34 36 30 32 31 45 35 42 53 36 43 47 58 36 51 50 79 38G 28 39 30 36 39 20 27 32 36 39 15 30 30 33 31 36 38 63 43H 41 35 36 57 36 60 62 50 30 32 27 42 26 34 30 31 27 56 64I 30 57 71 39 35 32 45 45 46 48 36 30 50 39 37 26 22 52 34K 26 22 32 31 38 30 37 33 32 47 43 30 34 45 32 42 40 87 44L 88 91 201 56 25 73 98 127 156 174 55 25 198 120 77 139 119 115 165M 50 54 48 48 39 45 41 58 45 68 31 32 42 44 44 29 37 64 48N 31 25 39 41 31 52 37 33 31 31 36 36 41 41 30 39 42 43 29P 30 35 23 37 31 43 24 26 30 43 19 31 23 39 38 29 32 67 48Q 24 27 34 39 44 33 23 39 30 44 25 23 35 45 30 34 26 75 35R 34 30 37 58 32 45 53 60 32 58 60 48 48 49 45 40 42 95 44S 27 31 23 35 39 43 37 50 43 45 34 62 42 40 31 45 38 97 54T 37 35 39 34 42 27 32 38 25 42 30 17 34 34 26 31 36 63 50V 33 35 49 32 38 24 46 42 19 52 33 21 48 29 41 26 25 52 31W 53 41 68 93 50 54 54 79 53 73 38 65 67 100 43 47 73 164 228Y 37 37 49 58 31 55 36 39 42 48 44 40 44 54 53 46 48 122 89

Table S5.11: Double substitutions at positions dIII (L19) and gIII (E22).

A D E F G H I K L M N P Q R S T V W YA 9 23 23 19 15 20 22 27 25 28 22 26 18 36 37 22 26 41 22D 36 30 30 29 28 54 38 18 26 31 17 28 17 32 23 19 24 38 16E 13 23 38 27 25 60 44 39 44 41 26 37 59 19 18 30 29 34 23F 30 31 26 36 31 27 40 33 44 40 55 34 42 61 45 55 23 72 67G 43 35 20 27 30 46 19 31 33 22 23 27 17 29 23 17 25 46 45H 51 30 35 26 26 26 28 45 37 36 28 26 29 37 25 34 16 57 25I 42 36 36 43 35 42 82 53 59 67 34 24 50 62 39 26 52 105 58K 26 65 23 34 32 40 25 39 36 35 26 29 39 40 40 35 35 59 33L 77 46 63 72 44 43 328 92 243 121 41 32 90 148 63 68 86 215 155M 45 48 42 39 49 64 33 30 54 53 17 24 54 46 35 35 45 72 40N 17 12 20 43 41 41 26 35 43 32 30 44 26 36 32 19 38 49 18P 29 24 32 39 34 20 13 22 14 31 36 40 34 45 54 23 26 56 51Q 53 38 24 46 41 25 27 24 21 48 37 16 17 28 23 16 22 59 44R 46 40 50 46 31 46 56 52 43 50 42 59 22 52 46 39 30 103 49S 30 44 28 36 23 46 49 38 41 44 33 25 22 50 33 30 25 83 67T 7 12 18 27 22 35 29 46 29 38 30 27 45 31 41 30 21 50 38V 24 32 29 29 27 43 47 27 32 46 23 18 19 30 29 31 33 65 35W 70 71 63 119 72 107 123 94 153 109 61 61 68 197 20 48 69 362 170Y 25 29 18 43 35 46 44 26 89 27 21 14 25 37 28 23 24 166 62

Table S5.12: Double substitutions at positions dIV (L26) and gIV (L29).

Page 13: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

9

S6: Circular dichroism

195 210 225 240

-15.0

-7.5

0.0

7.5

195 210 225 240

a) b)

Θ /k° cm2 dmol-1

λ / nm

Figure S6.1: Circular-dichroism analysis of GCN4 variants 1, 2, and 3 as well as equimolar

mixtures thereof in TBS (154 mM NaF, 10 mM tris(hydroxymethyl)aminomethane, pH 8.0,

room temperature). a) 1 (black line), 2 (red line), and 3 (blue line). b) Equimolar mixtures of

1 and 2 (black line), 1 and 3 (red line), and 2 and 3 (blue line). Spectra were recorded on a

Jasco (Tokyo, Japan) J-720 spectropolarimeter at total peptide concentrations of 75 µM and

corrected by subtracting the buffer baseline. The mean residue ellipticity, Θ, is plotted versus

the wavelength, λ.

S7: Analytical ultracentrifugation and isothermal titration calorimetry

Centrifugation: Sedimentation equilibrium experiments were conducted in a BeckmanCoulter

Xl-I analytical ultracentrifuge at 20 °C using the interference optics of the instrument. Peptide

solutions were brought to dialysis equilibrium with 50 mM Tris buffer, pH 8.0, containing

137 mM NaCl and 2.7 mM KCl with PD10 columns (Amersham Bioscience, Freiburg,

Germany). Peptide stock solutions had a concentration of 3 mg mL–1 of total peptide. 250 µL

of three different dilutions (1:1, 1:2, and 1:10) were loaded into artificial-boundary

centerpieces and spun at 50 or 60 krpm, depending on the expected molecular weight.

Attainment of apparent sedimentation and chemical equilibrium was ascertained by

comparison of consecutive scans using MATCH. Blank-corrected scans were first analyzed

by calculating point-average molecular weights from the slope of a plot of the natural

logarithm of the concentration versus the squared distance from the rotor axis. Visual

inspection of the overlay of local molecular weights versus concentration for the different

loading concentrations was used to check for reversibility and suggested the existence of

Page 14: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

10

limiting species at the lower and upper ends of the concentration scale. Where appropriate,

this information was used to select distinct models to describe the data. These models were

then directly fitted to the data using NONLIN. A model was judged to be an adequate

description of the data if the residuals were random by visual inspection and if no other model

explained the data significantly better as judged by the variances of the fits.[3]

The density of the buffer used and the partial specific volume (PSV) of the different peptides

were measured using a DMA 5000 densitometer (Anton Paar, Graz, Austria) as described

elsewhere.[4] For mixtures of two peptides, the weighted average of the individual values of

the PSV was used to calculate point-average molecular weights. A conversion factor of

3.29 fringes mg–1 mL–1 was used to convert values from fringe units to molar quantities.

Tables S7.1–S7.3 display the fitting results returned by different homoassociation models for

variants 1, 2, and 3 as reported by NONLIN. Values are given in fringe units rather than

molar quantities. Where appropriate, 95%-confidence limits are given below the best-fit

parameters. Parameters for which no confidence limits are given were held fixed. Best-fit

values are shown in bold (d.o.f.: degrees of freedom, σ: reduced buoyant molecular weight as

defined in Reference 4, B: colligative 2nd virial coefficient as reported by NONLIN, n.a.: not

applicable).

variance d.o.f. σ ln(K2 × M) ln(K3 × M2) B

3.9156e–3 2536 1.666 n.a. n.a. –6.22e–3 –6.48e–3/–6.0e–3

7.5418e–4 2536 1.666 –3.602 –3.633/–3.571 n.a. 0

7.2672e–4 2535 1.666 –3.519 –3.617/–3.421

n.a. 2.71e–4 –1.59e–5/5.57e–4

Table S7.1: Homoassociation of variant 1. PSV: 0.720 mL g–1, ρ: 1.007 g mL–1, Mw: 3741 Da, σ60krpm, theoretical: 1.666.

variance d.o.f. σ ln(K2 × M) ln(K3 × M2) B

1.0915e–2 2645 1.232 n.a. –2.518 –2.67/–2.362 0

9.1454e–4 2644 1.232 n.a. –1.644 –1.719/–1.556

1.66e–3 1.58e–3/1.75e–3

1.7804e–3 2644 1.232 –0.244 –0.451/–0.029

–1.204 –1.367/–1.035 0

7.1164e–4 2643 1.232 –1.885 –2.087/–1.692

–1.443 –1.485/–1.401

1.28e–3 1.18e–3/1.38e–3

Table S7.2: Homoassociation of variant 2. PSV: 0.703 mL g–1, ρ: 1.007 g mL–1, Mw: 3748 Da, σ50krpm, theoretical: 1.232.

Page 15: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

11

variance d.o.f. σ ln(K2 × M) ln(K3 × M2) B

1.0481e–2 2684 1.216 n.a. –3.474 –3.587/–3.358 0

7.3351e–4 2683 1.216 n.a. –2.823 –2.879/–2.766

1.55e–3 1.47e–3/1.62e–3

2.8006e–3 2683 1.216 –1.506 –1.693/–1.320

–2.9 –2.985/–2.806 0

7.7608e–4 2682 1.216 –17.28 –1.3e4/–9.26e3

–2.836 –2.841/–2.834

1.55e–3 1.56e–3/1.58e–3

Table S7.3: Homoassociation of variant 3. PSV: 0.704 mL g–1, ρ: 1.007 g mL–1, Mw: 3711 Da, σ50krpm, theoretical: 1.216.

Table S7.4 summarizes the homoassociation constants determined for variants 1, 2, and 3.

Again, 95%-confidence limits are given below the best-fit parameters.

variant K2 (M–1) K3 (M–2)

1 182 165/201 n.a.

2 936 765/1136

1.2 × 107 1.15/1.25 × 107

3 n.a. 0.3 × 107 0.28/0.31 × 107

Table S7.4: Homoassociation constants of variants 1, 2, and 3.

Calorimetry: High-sensitivity isothermal titration calorimetry (ITC) was performed at 25 °C

on a VP-ITC (MicroCal, Northampton, USA) by injecting 10-µL aliquots of a concentrated

solution (~1 mM) of variant 1, 2, or 3 into a dilute solution (~100 µM) of another variant.

Both samples were gently degassed under stirring before the experiment to avoid air bubbles.

Baseline subtraction and peak integration were accomplished using Origin 5.0 as described by

the manufacturer (MicroCal Software, Northampton, USA). All reaction heats were

normalized with respect to the molar amount of peptide injected and corrected by subtracting

the heat of dilution (taken as the constant heat measured toward the end of the titration).

Binding constants, K, and binding stoichiometries, N, were calculated with Origin 5.0 using

the “One Set of Sites” model. Table S7.5 lists the K and N values determined by ITC for all

possible combinations of variants 1, 2, and 3.

variants K (M–1) N 1/2 n.a. n.a. 1/3 5.0 × 104 1.0 2/3 1.2 × 105 1.0

Table S7.5: Heteroassociation constants of variants 1, 2, and 3.

Page 16: A Network of Coiled-Coil Associations Derived from Synthetic GCN4 Leucine-Zipper Arrays

12

Raw data of heteroassociation experiments:

0 10 20 30

3

4

5

6

7

0 40 80 120

-20

-15

-10

-5

0

∆p / µJ s-1

a) b)

Mw / kDa

∆n t / min

Figure S7.1: Heteroassociation reactions of variants 2 and 3. a) Analytical

ultracentrifugation. Both peptides were mixed at a molar ratio of 1:1 at total peptide

concentrations of 3.0 mg mL–1 (black line), 1.5 mg mL–1 (red line), and 0.3 mg mL–1 (blue

line) and brought to sedimentation and chemical equilibrium at 50 krpm. The local molecular

weight, Mw, determined from these gradients is shown as a function of ∆n and approaches the

molecular weight expected for a heterodimer (dotted line). The overlay of the curves indicates

that free monomers of 2 or 3 were undetectable. b) ITC data obtained upon injecting 10-µL

aliquots of a 1.08 mM solution of 3 into a 124 µM solution of 2 at 25 °C. Differential heating

power, ∆p, versus time, t. The strongly exothermic peaks decreased in magnitude as less and

less free 2 was available for binding upon gradual addition of 3.

References

[1] J. Przezdziak, S. Tremmel, I. Kretzschmar, M. Beyermann, M. Bienert, R. Volkmer-

Engert, ChemBioChem 2006, 7, 780–8.

[2] L. Otte , U. Wiedemann, B. Schlegel, J. R. Pires, M. Beyermann, P. Schmieder, G.

Krause, R. Volkmer-Engert, J. Schneider-Mergener, H. Oschkinat, Protein Sci. 2003,

12, 491–500.

[3] O. Kratky, H. Leopold, H. Stabinger, Methods Enzymol. 1973, 27, 98–110.

[4] T. M. Laue, Methods Enzymol. 1995, 259, 427–52.

[5] D. A. Yphantis, Biochemistry 1964, 3, 297–317.