JBC Origene Your Gene Company

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Originally published In Press as doi:10.1074/jbc.M201622200 on March 8, 2002

J. Biol. Chem., Vol. 277, Issue 25, 22639-22647, June 21, 2002
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
277/25/22639    most recent
M201622200v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Zurovec, M.
Right arrow Articles by Sehnal, F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zurovec, M.
Right arrow Articles by Sehnal, F.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Unique Molecular Architecture of Silk Fibroin in the Waxmoth, Galleria mellonella*

Michal &Zbreve;urovec and Frantisek SehnalDagger

From the Institute of Entomology, Academy of Sciences, and the Faculty of Biological Sciences, University of South Bohemia, Branisovská 31, 370 05 Ceské Budejovice, Czech Republic

Received for publication, February 18, 2002, and in revised form, March 8, 2002

    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Proteins of silk fibers are characterized by reiterations of amino acid repeats. Physical properties of the fiber are determined by the amino acid composition, the complexity of repetitive units, and arrangement of these units into higher order arrays. Except for very short motifs of 6-10 residues, the length of repetitive units and the number of these units concatenated in higher order assemblies vary in all spider and lepidopteran silks analyzed so far. This paper describes an exceptional silk protein represented by the 500-kDa heavy chain fibroin (H-fibroin) of the waxmoth, Galleria mellonella. Its non-repetitive N-terminal (175 residues) and C-terminal (60 residues) parts, the overall gene organization, and the nucleotide sequence around the TATA box show that it is homologous to the H-fibroins of other Lepidoptera. However, over 95% of the protein consists of highly ordered repetitive structures that are unmatched in other species. The repetitive region includes 11 assemblies AB1AB1AB1AB2(AB2)AB2 of remarkably conserved polypeptide repeats A (63 amino acid residues), B1 (43 residues), and B2 (18 residues). The repeats contain a high proportion of Gly (31.6%), Ala (23.8%), Ser (18.1%), and of residues with long hydrophobic side chains (16% for Leu, Ile, and Val combined). The presence of the GLGGLG and SSAASAA(AA) motifs suggests formation of pleated beta -sheets and their stacking into crystallites. Conspicuous conservation of the apolar sequence VIVI followed by DD or ED is interpreted as indicating the importance of hydrophobicity and electrostatic charge in H-fibroin cross-linking. The environment of G. mellonella larvae within bee cultures requires continuous production of silk that must be both strong and elastic. The spectacular arrangement of the repetitive H-fibroin region apparently evolved to meet these requirements.

    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Silk is a proteinaceous polymer secreted by specialized exocrine glands in several groups of arthropods. Silk quality reached highest level of functional specialization in spiders and in lepidopteran larvae (caterpillars). The remarkable mechanical properties of the silks (1) inspired numerous studies on their composition and structural arrangement. Early amino acid analyses and x-ray diffraction studies revealed that the silks of caterpillars contain pleated beta -sheets (2). The sheets formed by short iterated repeats of small amino acid residues are stacked into crystallites that reinforce the silk fiber (3). Recent data (1, 4) on the DNA sequence demonstrated the presence of short amino acid repeats in spider silks and proved that the composition of repeats determines physical properties of the silk fiber, such as strength and elasticity.

Caterpillars produce silk from a pair of labial glands, each of which consist of silk-secreting posterior and middle regions, and an outlet (5). The posterior region produces fibrous silk core, whereas the middle region provides a sticky coating of the fiber and adds several low molecular components with presumably protective functions to the silk (6, 7). The silk core is typically composed of 3 types of proteins as follows: heavy chain fibroin (H-fibroin),1 light chain fibroin (L-fibroin), and P25 glycoprotein (8-10). For the silkworm Bombyx mori it was shown that the H-fibroin (~390 kDa) and L-fibroin (~30 kDa) molecules are linked by a disulfide bond, and six such heterodimers are assembled with a single P25 molecule into a complex called elementary fibroin unit (11). The core of tussah silk, which is produced by Antheraea silkmoths, is made up of H-fibroin homodimers (10, 12).

H-fibroin makes up the bulk of the lepidopteran silk fiber, and its structure determines the physical properties of the silk. Five structural silk categories were recognized on the basis of x-ray diffraction studies and amino acid analyses (13). The silk of B. mori was classified as group 1, which is characterized by dense molecule packing (intersheet distance 9.3 Å) corresponding to high glycine content (14), whereas the tussah silk of Antheraea pernyi was classified as group 3 with 10.6 Å intersheet packing that is consistent with high alanine content (15). These data were recently supported by full elucidation of the primary H-fibroin structures. The H-fibroin of B. mori was shown to consist of 12 large "crystalline" domains separated by short spacers; each crystalline domain includes a number of glycine-rich repeats dominated by the GAGAGS motif (16). By contrast, the H-fibroin of A. pernyi includes 80 tandemly arranged repeats, each containing a crystalline region of 5-15 alanine residues flanked by an "amorphous" motif (12).

The silk of the waxmoth, Galleria mellonella, a species distant in evolution from both B. mori and A. pernyi, was assigned to structural group 3 as the tussah silk (13), but partial sequence of the waxmoth H-fibroin did not disclose any regular polyalanine repeats (17). Also, the silk of G. mellonella has different physical properties than tussah silk. It has a tensile strength 7.5 × 108 Nm-2 and 101% extendibility (18), whereas the corresponding values for the tussah silk are 5.8 × 108 Nm-2 and 35%, respectively (19). Our goal was to clarify the structure of G. mellonella H-fibroin as a step toward understanding the molecular architecture that lends to this silk its unusual physical properties.

In a previous study (17), we isolated from the posterior silk gland region of G. mellonella larvae a cDNA clone designated PG-2. Except for a non-repetitive 0.3-kb 3' terminus, the clone consisted of repetitive motifs. The corresponding mRNA was not detectable in any other body tissue. Its large size, high abundance in the posterior silk gland section, and the sequence of its putative translation product indicated that PG-2 was derived from the 3' end of the H-fibroin gene. By using PG-2 as a probe, we have now identified genomic clones that encode considerable portions of both 3' and 5' ends of the gene. Sequence analysis revealed that the structure of G. mellonella H-fibroin is exceptionally homogenous and resembles certain spider silks more than the H-fibroins of B. mori and A. pernyi.

    EXPERIMENTAL PROCEDURES
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Insects and Tissue Preparation-- Our stock of the waxmoth, G. mellonella L. (Lepidoptera, Pyralidae) was maintained at 30 °C on a semiartificial diet (20). The posterior parts of silk glands were dissected from anesthetized last instar larvae whose age was measured from the preceding ecdysis. Dissected tissues were rapidly frozen in liquid nitrogen or on dry ice and stored at -80 °C.

Isolation of Genomic Clones and Hybridization Conditions-- A genomic library was prepared from 15- to 20-kb Sau3A fragments of G. mellonella DNA (21). Fragments were ligated to XhoI half-site arms of the LambdaGEMTM-12 vector (Promega) that was grown in the LE392 strain of Escherichia coli. About 4 × 104 plaques were screened with 1.2-kb cDNA probe PG-2 representing the 3' end of the H-fibroin gene (17). The positive clones were re-screened with the extreme 3' end of PG-2 (0.3 kb), which encoded non-repetitive C-terminal amino acid sequence. Hybridization with both the full and the truncated PG-2 probes was done under high stringency conditions at 65 °C in 5× SSPE, 5× Denhardt's solution, 0.5% SDS, and washing at room temperature twice in 2× SSPE, 0.1% SDS, and twice in 0.2× SSPE, 0.1% SDS. The clones, which failed to react with the short version of PG-2, were screened with the PstI-BclI fragment of B. mori H-fibroin gene (positions -139 to +69) that contained the 5' upstream region and the first exon of this gene (22). This hybridization was done under low stringency conditions at 55 °C in 5× SSPE, 5× Denhardt's solution, and 0.5% SDS, with two subsequent washings in 2× SSPE, 0.1% SDS at room temperature, and two washings in 1× SSC, 0.1% SDS at 55 °C. All probes were 32P-labeled with the aid of the Random Primed DNA Labeling Kit (United States Biochemical Corp.).

Sequencing-- Selected genomic clones were characterized with restrictases and the restriction fragments subcloned in pBluescript SK(+/-) (Stratagene). Part of the non-repetitive region was sequenced with primer walking. Single- or double-stranded templates were sequenced either manually or with capillary DNA sequencer ABI 310 (PerkinElmer Life Sciences). The dideoxy chain termination reaction with [alpha -35S]dATP, T3 or T7 primers, and Sequenase version 2.0 DNA Sequencing Kit (Amersham Biosciences), was employed in the manual sequencing. Automatic sequencing was done with Thermosequenase Cycle Sequencing Kit (Amersham Biosciences) or ABI Dye Terminator Cycle Sequencing Kit (PerkinElmer Life Sciences). Most of the analyzed gene regions were sequenced repeatedly from both DNA strands as depicted in Fig. 1. Established sequences were analyzed with the DNAstar software (Lasergene) and with the aid of EMBL network services.

Southern and Northern Analysis-- For the Southern analysis, high molecular weight genomic DNA was prepared from the newly ecdysed last instar larvae, and 5-µg DNA aliquots were digested with restrictases specified under "Results," electrophoresed on 0.8% agarose gel, and blotted on nylon membrane. Aliquots of 5 µg of total RNA, which was prepared from the posterior and middle silk gland regions and from the body carcass devoid of the silk glands, were taken for the Northern blotting. By using high stringency conditions described above, Southern blots were probed with the 32P-radiolabeled 1.2-kb PG-2 probe and Northern blots with the XbaI-BamHI fragment of the lambda X1 genomic clone. Relative positions of HindIII-digested lambda  phage DNA fragments were used as size markers.

Reverse Transcription-PCR-- The reverse transcription step of reverse transcription-PCR was performed with 5 µg of total RNA, 5 units of avian leukemia virus reverse transcriptase, and 50 ng of primer 5' ATCCAGATGAACCACCT 3' (positions +2869 to +2853 of the gene, i.e. 4136-4110 in the GenBankTM accession number AF095239) in a 20-µl reaction mix. Subsequently, 1 µg of the reaction mixture was taken for cDNA amplification using a primer corresponding to the gene sequence between positions +9 and +28 (Fig. 3A), and a reverse primer corresponding to positions +2851 to +2832 (4108-4309 in the GenBankTM accession number AF095239). PCR included initial denaturation as follows: 1 min at 94 °C, 35 cycles of 20 s at 94 °C, 20 s at 55 °C, and 20 s at 72 °C, and final extension 10 min at 72 °C.

Primer Extension-- The procedure was described previously (6). A 35-nt synthetic oligonucleotide complementary to the region -30 to +4 of the gene (Fig. 3A) was employed for primer extension analysis using total RNA from the posterior section of silk glands of well staged last instar larvae. The primer was 5'-labeled with [gamma -32P]ATP with the aid of T4 polynucleotide kinase and hybridized to 10 µg of total RNA for 16 h at 32 °C. The following extension reaction was carried out with murine leukemia virus reverse transcriptase (Amersham Biosciences) for 2 h at 42 °C.

    RESULTS
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Isolation of the Genomic Clones-- Screening of our G. mellonella genomic library with the PG-2 cDNA clone led to isolation of nearly 60 positive recombinant phages. Most of them were likely to contain inserts from the central gene region with repeated sequences that are difficult to clone and analyze. We therefore decided to select clones extending to the 5' or the 3' regions of the H-fibroin gene. Eight clones of the 3' region were obtained using the unique 0.3-kb 3' terminus of PG-2 as probe. Clone lambda XF, which gave the strongest hybridization signal, was chosen for further work. The remaining 52 clones hybridizing with the whole PG2 were screened with a probe derived from the 5' end of B. mori H. fibroin. Based on experience with other silk genes (21), we expected some similarity between the H. fibroin genes of B. mori and G. mellonella. Indeed, under low stringency conditions, we detected weak hybridization of the B. mori probe to two genomic clones of G. mellonella. The DNA of these clones was isolated, cut with PstI, and re-hybridized with the H-fibroin probe of B. mori. Both clones proved to contain a single 0.7-kb fragment similar in sequence to the first exon of B. mori H-fibroin gene. One of the two clones was taken for further research under the name lambda X1. Extensive mapping and subcloning revealed the presence of StyI, AccI, and SacI restriction sites that were instrumental in elucidating large parts of the repetitive gene region. The 50 clones, which hybridized only with the full PG-2 probe, apparently contained only the internal sequences of the H-fibroin gene and were not analyzed.

H-fibroin Gene Structure and Expression-- Genomic Southern blotting (data not shown) allowed us to construct a restriction map of the whole H-fibroin gene and to assess the size of the entire repetitive region. Localization of the restriction sites XbaI and BamHI in the clone lambda X1 and of the SalI and SacI sites in the clone lambda XF was essential for the overall gene analysis. Genomic DNA fragments obtained by double digestions with XbaI + SalI, XbaI + SacI, BamHI + SalI, and BamHI + SacI, respectively, were probed with PG-2 and some also with the XbaI-BamHI fragment of the clone lambda X1 and/or the SalI-SacI fragment of the clone lambda XF (data not shown). Analysis of the data allowed one possible arrangement of the fragments as shown in Fig. 1. The overlapping digestion fragments of the lambda X1 and lambda XF inserts were subcloned into pBluescript SK(+) according to the strategy summarized in Fig. 1 and sequenced. The results (GenBankTM accession numbers AF095239 and AF095240) confirmed that the clones represented the 5' and 3' ends, respectively, of the H-fibroin gene. By analogy with the H-fibroin gene structure in other lepidopteran species (12, 16), we assume that the internal XbaI-XbaI fragment (7 kb) and adjacent regions represent central part of the second exon (Fig. 1) that is made up of similar repeats as found in clones lambda X1 and lambda XF (see below).


View larger version (25K):
[in this window]
[in a new window]
 
Fig. 1.   Restriction map of the H-fibroin gene in G. mellonella and the sequencing strategy. The general gene organization (exons E1 and E2 are represented by hatched boxes) and several restriction sites are presented in the composite genomic map below the size bar. Additional restriction sites, which were used for subcloning, are shown in the aligned genomic clones lambda X1 and lambda XF. The sequenced regions are indicated by arrows (both the vector primers and the specific synthesized primers were used for sequencing). Restriction enzymes included AccI (A), BglII (B), BamHI (Bm), KpnI (K), PstI (P), SacI (S), SacII (Sc), SalI (Sl), Sty (St), and XbaI (X).

Northern blots probed with the labeled XbaI-BamHI fragment of clone lambda X1 (encoding H-fibroin N terminus) disclosed the presence of a single, very large posterior silk gland-specific mRNA (Fig. 2A). A similar transcript was previously detected with the PG2 probe that encodes H-fibroin C terminus (17).


View larger version (63K):
[in this window]
[in a new window]
 
Fig. 2.   Transcription of the H-fibroin gene. A, Northern analysis demonstrating tissue specificity of the expression. 5 µg of total RNA aliquots from the posterior (P) and middle (M) silk gland regions and from the body carcass without silk glands (L) were probed with the XbaI-BamHI fragment (5' end of the gene) of clone lambda X1. Ribosomal RNA stained with methylene blue was used to check RNA integrity. B, primer extension mapping of the transcription start. 5'-Radiolabeled (5 × 104 cpm) primer corresponding to the gene region -30 to +4 (Fig. 3A) and 10 µg of total RNA were used to synthesize the cDNA with the aid of murine leukemia virus reverse transcriptase. The template RNA was prepared from the posterior section of silk glands taken from the last instar larvae 1 day (lane 1) and 3 days (lane 3) after ecdysis and at the prepupal stage (lane P). The products were analyzed in a sequencing gel (8% polyacrylamide, 7 M urea) along with the C, T, A, G ladders of the lambda X1 genomic clone that was sequenced with the same primer. The results reveal two transcription starts (arrows) whose use depends on the age of the larva.

The Gene Sequence-- The sequenced part of the lambda X1 clone included 1285 nt of the 5' non-coding region, 42 nt putatively encoding 14 amino acids (beginning with an initiation codon in position +29), 1310 nt evidently non-coding and therefore regarded as an intron, and 4362 nt with a continuous open reading frame. Sequences between positions -426 and -346 in the upstream region, and between +1123 and +1203 in the intron, exhibited about 90 and 70% identity, respectively, with the 3' end of the repetitive element Gm1 (21).

Transcription start of the H-fibroin gene was determined by primer extension analysis using posterior silk gland RNA (Fig. 2B). The analysis disclosed two different transcription starts that appeared to be used singly or simultaneously, depending on the developmental stage of the last instar larva. The transcription start used by the newly ecdysed last instar larvae was assigned +1, and the whole gene was numbered relative to this position. This site was also identified in the H-fibroin gene of other Lepidoptera (Fig. 3A). In the course of the last larval instar of G. mellonella, the transcription start was shifted to position -18, and the +1 position ceased to be used. Two sequences possibly served as TATA boxes, each of them being associated with one transcription start: a TATAAAA sequence at -30 to -24 and a TAATATA sequence at -49 to -43 (Fig. 3A). An initiator sequence TCAGT (23) at +2 to +6 followed immediately after the transcription start at +1, and a similar sequence TCAGA was localized at -12 to -8, i.e. just a few nucleotides downstream from the transcription start at -18.


View larger version (111K):
[in this window]
[in a new window]
 
Fig. 3.   Terminal parts of the H-fibroin gene of A. yamamai (A.y., GenBankTM entry X05578; last 13 residues shown in italics actually represents the closely related gene of A. pernyi, AF083333), G. mellonella (G.m., AF095239 and AF095240), and B. mori (B.m., V00094 and AF226688). Sequences are aligned to maximize homologies indicated with gray background. The non-coding parts are printed in lowercase letters, and the coding parts in uppercase letters. A, the H-fibroin 5' ends with the upstream flanking regions. The TATA motifs are boxed, and the initiation codons are printed in bold. Sequences of the upstream region and the first exon are numbered from the commonly used transcription start; the partial intron sequences (italics) are not numbered, and the following coding sequences are numbered from the start of the 2nd exon to emphasize conserved nucleotide positions. B, the H-fibroin 3' ends aligned to match the positions of the termination codon (in bold). The polyadenylation signals are boxed.

The exact delimitation of the intron was determined by sequencing across the exon/intron junctions in a cDNA clone obtained by reverse transcription-PCR. The amplified cDNA fragment began at +9 and ended at +2851, thus including almost the entire exon 1 and a part of the large exon 2. The sequencing proved that the first exon consisted of the 28-nt leader sequence and the 42-nt coding sequence (Fig. 3A). The second exon began with a unique sequence of 532 nt and continued as regular repeats that are described below. The repeats were iterated until the 3' end of the sequenced part of clone lambda X1 and the same type of repeats appeared at the 5' end of clone lambda XF (Fig. 4). The sequenced portion of the latter clone included 1331 nt of the coding region, of which about 70% consisted of the repeats, and 183 nt of adjacent non-coding sequence (GenBankTM accession number AF095240). The point of transcription termination, i.e. the 3' terminus of the second exon, was established by comparing the sequence of lambda XF with that of the PG-2 cDNA clone (17). It was confirmed that the 3'-untranslated region consisted of 94 nt and included polyadenylation signal AATAAA (24) localized 23 nt upstream from the transcription terminus. The last 89 nt identified in the lambda XF clone represented non-transcribed spacer DNA flanking the 3' end of the H-fibroin gene.


View larger version (111K):
[in this window]
[in a new window]
 
Fig. 4.   Sequences of the H-fibroin repeats. The repeats A (189 nt), B1 (129 nt), and B2 (54 nt) are numbered according to their position in the gene (numbering begins with 1) or in the lambda XF clone (numbering starts with n + 1). The sequenced parts of clones lambda X1 (5742 nt) and lambda XF (1514 nt) contained AB1AB1AB1AB2AB2AB1AB1AB1AB2AB2AB2AB1AB1A and B2AB2AB1AB1A, respectively, repeat iterations (cf. Fig. 5). Nucleotides deviating from the usual sequence of the repeats are highlighted, and deviations changing codon assignment are printed in lowercase letters.

Predicted Amino Acid Sequence-- The identified lambda X1 sequence encodes a putative peptide of 1468 amino acid residues (Fig. 5A). The expanse of the first 188 residues exhibits no obvious sequence iterations, but the remainder consists of regular repeats. Compliance with the general rules (25) indicates that a signal peptide is cleaved off between residues 18 and 19. The following tract of 170 residues is remarkable by the high content of charged amino acids that are often grouped by 2. The 4.5 pK value reflects prevalence of the negatively charged residues. About one-third of residues in the non-repetitive N terminus is hydrophobic and one-third is polar. Notable is a degenerate doubling of two short sequences.


View larger version (86K):
[in this window]
[in a new window]
 
Fig. 5.   H-fibroin structure deduced from the gene sequence. A, amino acid sequences of the non-repetitive N terminus (189 residues numbered from the translation start, with the presumed signal peptide printed in italics), the repetitive central parts, and non-repetitive C terminus. The N-terminal and C-terminal regions contain doubled short motifs (underlined, each pair in different way). The repetitive central region is arranged in the repeats A (yellow), B1 (blue), and B2 (green), which are numbered as the corresponding polynucleotide blocks in Fig. 4. Variations in the amino acid residues that occur in at least two copies of the respective repeat are shown on the gray, and singular replacements on the white background. B, consensus sequences of the repeats A (yellow), B1 (blue), and B2 (green) aligned to accentuate homologies. The hydrophobic motif GLGLG and its variants are shown on gray background.

The putative translation product of the sequenced part of clone lambda XF includes 443 amino acid residues, of which 373 are arranged in repeats, and 70 constitute a non-repetitive C terminus (Fig. 5A). It is characterized by high content of arginine that shifts the pK value to 11.6, and by the presence of two similar stretches of 7-8 amino acids.

The repeats found in clones lambda X1 and lambda XF are identical, indicating that they are iterated throughout the central gene region between the two clones. From the genomic Southern mapping summarized in Fig. 1, we estimate that the entire coding region is ~18,000 nt long and encodes ~6000 amino acids. The deduced molecular mass of G. mellonella H-fibroin is thus close to 500 kDa, and over 95% of the molecule consists of repeats. They are built up of predominantly neutral amino acids; only about 2.7% residues carry a strong negative charge and 0.6% a positive charge. The composition of repeats is decisive for the amino acid representation in the whole fibroin. Relative representation of amino acid residues in the deduced translation products of lambda X1 and lambda XF is consistent with the results of direct amino acid analysis of fibroin extracted from the silk glands (26) or the cocoons (27, 28).

Repeated Motifs in the H-fibroin Protein-- Beginning at the predicted amino acid residue 189, the sequence of G. mellonella H-fibroin continues as a regular repetitive arrangement of three types of complex repeats (Fig. 5A). The repeat A consists of 63 residues, and the repeats B1 and B2 consist of 43 and 18 amino acid residues, respectively. The repeats are combined in higher order repetitions AB1 and AB2, and these are arranged into assemblies (AB1AB1AB1AB2(AB2)AB2). We estimate that fibroin encompasses about 11 such ensemble repeats. Both the absence of any variability in the length of repeats and the high conservation of their sequence are striking. Hydrophobic residues dominate, but their stretches are broken into short motifs by intercalated polar residues. Charged amino acids are rare and occur exclusively in the repeat A. Only a few amino acid residues are occasionally replaced with others of similar properties, e.g. Leu alternates with Ile and Val, and Asp with Glu. Replacements between Gly, Ala, and Ser are also relatively common. Most variations are confined to certain positions and occur in several copies of the respective repeat. The total variation never exceeds 20%, and certain regions are identical in all copies.

The short motif GLGGLG, in which L may be replaced with I, V, and exceptionally by S, is shared by all repeats (Fig. 5B). The A repeats usually contain a GVGGLG modification in positions 38-43 (numbered from the repeat beginning) and a PLGGIG modification in positions 49-54. Each B1 repeat contains 3 similar motifs in positions 4-9, 15-20, and 33-38, respectively. The B2 repeat includes just one motif that is flanked by similar amino acid residues as the last such motif in B1, indicating that B2 and the second portion of B1 may have common origin. The hydrophobic motifs always alternate with short sequences containing polar residues Ser or Thr. Two similar and relatively long motifs, SSAASAAAA and SSAASAAS, are present in the A repeat and as highly altered and truncated modifications also in the B1 and B2 repeats (Fig. 5B). The hydrophobic group VIVI, which is followed by two acidic residues (in a single case, DD was changed to ND) and preceded either by PAP or AGE tripeptide, is a notable feature of the A repeat (Fig. 5A).

    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

H-fibroin gene of Lepidoptera-- The H-fibroin gene of G. mellonella is composed of one very short and one very long exon (Fig. 1), as are the homologous genes in B. mori (16) and A. pernyi (12). Because G. mellonella is phylogenetically rather remote from the two other species, the described arrangement of the fibroin gene is probably standard for most Lepidoptera. Sequence homologies at the 5' and 3' ends indicate the existence of a common ancestral H-fibroin gene.

The DNA sequence of several hundreds nucleotides upstream from the transcription start was identified in the H-fibroin gene of B. mori (29), Antheraea yamamai (30), A. pernyi (12), and now G. mellonella. In all these cases, the upstream region is AT-rich, with frequent concatenations of up to 7 A or T nucleotides. Interspecies homology is obvious downstream from about -50 position, including the presence of a TATA box at -0 to -24 (Fig. 3A). G. mellonella acquired, probably by coincidental mutations, another TATA box at -49 to -43. The existence of two transcription starts (standard at +1 and additional at -18) indicates strongly that both TATA boxes are functional. We cannot explain, however, why the start is at +1 in young larvae and at -18 in old larvae (Fig. 2B).

Homology between H-fibroin genes of different species is very clear through the first exon, including the exon/intron boundary (Fig. 3A). A few deletions and insertions modified the length of the leader sequence, but the following part of the first exon, which encodes the signal peptide, is remarkably preserved. The length of the intron and its internal sequence are dissimilar, but similarities between the compared species are found in the sequence of about 250 nt beginning around the intron/exon boundary and continuing in the second exon. The following major part of the exon is composed of highly species-specific repeated motifs. Similarity between the species occurs again at the 3' end of the gene (Fig. 3B). For example, the non-translated tail sequence is of similar length and includes a GGATACAT motif localized 27-31 nt prior to the polyadenylation signal.

The H-fibroin Proteins-- The electrophoretic mobility of G. mellonella H-fibroin in polyacrylamide gel indicated a size of 240 kDa (25), which is considerably less than the 500 kDa deduced from the gene size (Fig. 1). However, size estimation in the gel is inaccurate, especially in the case of large hydrophobic molecules such as H-fibroin. Partial degradation of the analyzed H-fibroin during solubilization in 8 N urea cannot be excluded. We trust size assessment based on the gene analysis, and we emphasize that the H-fibroin of G. mellonella is larger than in other lepidopterans so far examined. A size of 391 kDa was computed for the H-fibroin of B. mori (16), and 216 kDa can be deduced for A. pernyi from the molecular data of Ref. 12. The last figure is in good agreement with the 220-kDa assessment of the extracted H-fibroin (31). The size of H-fibroins in Antheraea myllita was estimated as ~197 kDa (32), and H-fibroins of two other saturniid species, A. yamamai and Philosamia cynthia ricini, behaved as proteins of 250 and 230 kDa, respectively (31).

Interspecific diversity of the H-fibroin size is due to differences in the large central protein region that is composed of repeats, whereas the terminal non-repetitive sequences are of similar length in all species. In G. mellonella, the non-repetitive N terminus encompasses 173 amino acid residues, of which a stretch of more than 100 is clearly homologous with the H-fibroins of B. mori and A. pernyi (Fig. 6A), as well as Bombyx mandarina (33) and A. yamamai (30). The major part of the signal sequence, which is encoded by the first exon, is nearly identical in all these species. The non-repetitive sequence encoded by the second exon has diversified but more than 20 residues have maintained similar spacing. They are arranged in conserved groups, with occasional insertions or deletions between them. A major deletion of 20 amino acids (in comparison with the H-fibroin of B. mori) follows after the residue 31 in the H-fibroins of A. pernyi and A. yamamai. Despite great phylogenetic distance, the conservation of amino acid positions between G. mellonella (superfamily Pyraloidea) and Antheraea is similar or even higher than between Antheraea and Bombyx which both belong to the superfamily Bombycoidea. The non-repetitive H-fibroin N terminus of all species contains a high proportion of charged amino acids: 10 basic and 13 acidic in G. mellonella, 10 and 11 in B. mori, and 10 and 12 in A. pernyi, respectively. This and other conserved attributes of the N terminus suggest that it may play a role in silk fiber assembly.


View larger version (46K):
[in this window]
[in a new window]
 
Fig. 6.   Comparison of the non-repetitive terminal sequences of H-fibroin proteins. A, N-terminal parts exhibiting similarities between the three compared species: A. pernyi (A.p.), G. mellonella (G.m.), and B. mori (B.m.). Signal sequence, encoded by the first exon, is typed in italics. B, extreme C termini of G. mellonella (G.m.) numbered as in Fig. 4, and B. mori (B.m.) numbered according to Ref. 16.

The C terminus of the H-fibroin is characterized by the presence of 3 cysteines that are located in the proximity of eight basic residues (Fig. 6B). For B. mori it has been shown that the cysteine most distant from the protein end forms a disulfide linkage to the L-fibroin, whereas the more proximal and the terminal cysteines form an intramolecular disulfide bond (34). We have previously demonstrated homology between the L-fibroins of B. mori and G. mellonella and suggested that they have similar functions (35). The finding of conserved positions of the 3 cysteines in the H-fibroin (Fig. 6B) strongly indicates that the H-fibroin-L-fibroin complex of this species is assembled in a way similar to B. mori.

The C terminus of Antheraea H-fibroin is short and includes only 3 basic residues. It does contain 3 cysteines, but the alignment to the H-fibroin C termini of B. mori and G. mellonella is ambiguous. The diversification of Antheraea H-fibroin C terminus from the consensus found in B. mori and G. mellonella is associated with the loss of L-fibroin (10). It was proposed that Antheraea H-fibroin forms homodimers that are held together by a disulfide bridge in which one of the cysteines is engaged (30).

Organization of the Repetitive H-fibroin Core-- Amino acid composition of the repetitive region of G. mellonella H-fibroin is dominated by residues Gly, Ala, Ser, and those with bulky apolar side chains (Table I). Amino acid sequence and high conservation of the A, B1, and B2 repeats, and the regularity of their assemblies AB1AB1AB1AB2(AB2)AB2 (Fig. 5A), are very spectacular features of G. mellonella H-fibroin. The basic repeats A, B1, and B2 are longer and more complex than the elementary repetitive units in other spider and lepidopteran silks. Their arrangement in the AB1 and AB2 subdomains and concatenation in about 11 assemblies are extremely regular. The H-fibroins of B. mori and A. pernyi are also composed of about 12 large ensemble repeats, but their sizes vary due to irregular numbers of the basic repetitive units they contain. For example, the numbers of residues range from 147 to 596 in the ensemble repeats of B. mori H-fibroin, and only the length (42-44 residues) of the amorphous spacers that separate the assemblies is conserved (36). The H-fibroin of A. pernyi is made up of 80 units that are each composed of a track of 4-14 alanines followed by one of 4 different types of non-crystalline motifs (12). A periodic occurrence of the shortest non-crystalline motif (11 residues) after each 5th to 8th unit breaks the repetitive region into the 12 repeat assemblies of unequal length.

                              
View this table:
[in this window]
[in a new window]
 
Table I
Total number and percent of selected amino acids in the repeated region of known H-fibroins
Only part of the region (1651 residues) was sequenced in G. mellonella, but complete sequences are available for B. mori (5263 residues) and A. pernyi (2639 residues).

The Maintenance of Uniform Repeats-- All three types of repeats in G. mellonella H-fibroin are remarkably conserved throughout the sequenced part of the gene. Their uniformity at the DNA level (Fig. 4) is strengthened by the preferential use of certain isocodons. The high content of codons rich in G and C in the first two positions is compensated for by high frequency of isocodons ending with U and A. Similar compensation occurs in various types of DNA sequences (37). In the H-fibroin genes, however, the rate of recurrence of U or A in the third position is not random, and the frequency of isocodons is species-specific (Table II). The incidence of "silent" mutations is similar to the rate of base replacements that change codon specification. For example, within the 4950 nt of the analyzed G. mellonella repeats (Fig. 4), we found 5.47% silent and 3% assignment-altering mutations.

                              
View this table:
[in this window]
[in a new window]
 
Table II
Codon usage (in %) for the most common amino acids in repeated region of the H-fibroins

The codon choice pattern in B. mori H-fibroin is correlated with skewed cellular contents of isoaccepting tRNAs (38, 39). Mita et al. (40) proposed that the restricted codon usage is dictated by the most stable conformations of chromatin or the encoded mRNA, whereas the composition of the tRNA pool is a secondary adaptation. We propose that the codon bias is also related to the stability of reiterated motifs. The tandems of short motifs, such as the minisatellites, are prone to replication slippage, unequal sister chromatid exchange, and unequal allelic recombination that generate variations in the length of the tandems (41). Allelic divergence in spider silk genes (42) and 15% length variation of B. mori H-fibroin (43) demonstrate the importance of this mechanism in the silk genes with short iterated motifs. By contrast, larger DNA blocks without intrinsic repeats are less likely to be internally misaligned during crossing over. We assume that a lack of reiterated short motifs within the A, B1, and B2 DNA blocks is a prerequisite for the maintenance of their length. The unique parts of some other silk genes, for example introns and coding "spacers" in a spider silk gene (44) and amorphous linkers joining the repetitive domains of B. mori H-fibroin (36), are also characterized by length conservation.

The crossover and misreplication between the iterated DNA segments drive their concerted evolution and lead to segment homogenization (45, 46). Homogenization of DNA repeats has been reported for microsatellites, repetitive regions of mitochondrial DNA, tandem gene arrays, and repetitive sequences within large genes. The process of sequence homogenization on one hand suppresses and on the other hand promotes fast spreading of base substitutions among the repeats. Mutations changing codon assignment are initially stochastic events, but the function of the encoded protein imposes constraints on their maintenance and propagation (47). Mutations improving the function are favored by natural selection that fosters spreading of the altered nucleotide to other repeats.

Molecular Conformation of G. mellonella H-fibroin-- Classical x-ray diffraction studies of lepidopteran silk revealed that fibroin molecules are cross-linked by hydrogen bridges formed between the amide and the carbonyl groups of adjacent polypeptide strands (2). Amino acid repeats such as GAGAS or AAAAA are thereby aligned into pleated beta -sheets, and interactions between the side chains of the amino acids of adjacent sheets cause their stacking into crystallites reinforcing the silk fiber. Intersheet distances in the crystallites are determined by the side chains of participating amino acids. The results of measurements of these distances (14, 15) are consistent with the domination of glycine in the H-fibroin of B. mori and the presence of polyalanine tracks in A. pernyi H-fibroin. The x-ray spectrum of G. mellonella silk indicated the presence of crystallites with a similar intersheet packing as in the silk of A. pernyi (3). Indeed, the sequences SSAASAAAA and SSAASAAS in the A repeat of G. mellonella H-fibroin (Fig. 4) resemble the polyalanine tracks of the A. pernyi H-fibroin. However, the repeats of G. mellonella H-fibroin also contain a high proportion of Gly alternating with Ala, Ser, and a few other residues. Some of these alternations are regular, and their conservation in reiterated repeats indicates that they can also form crystallites, albeit of different periodicity. New physical measurements are needed to resolve the structure of crystallites in G. mellonella H-fibroin.

In the H-fibroins of B. mori (16) and A. yamamai (12), the reiterated crystalline motifs are periodically interspaced with distinct "boundary" sequences. They contain residues with bulky side chains, e.g. Tyr, Trp, Glu, and Arg, that interrupt the beta -sheet regions and allow random orientation of the H-fibroin molecule conferring flexibility to the silk fiber (19). In the H-fibroin of G. mellonella, residues Glu, Asp, Trp, and Asn occupy specific sites in close proximity to short crystalline regions. This arrangement suggests that this H-fibroin may contain numerous small crystallites separated by short peptide strands, as has been suggested for a spider dragline silk (48). The residues with apolar bulky side chains, such as Leu, Ile, and Val, may have a dual role in G. mellonella H-fibroin. In some positions they appear to be incidentally replaced by Pro or Tyr, indicating that in these positions they are involved in disturbing the formation of beta -sheets and prevent their stacking. The high conservation of Leu, Ile, and Val in other positions, usually in alternation with Gly, indicates a more specific role, including formation of some kind of crystallites.

Some highly elastic spider silks are composed of repeats allowing formation of beta -spirals rather than sheets (4). Peptide chain conformations other than the beta -sheet crystallites may also be important for the physical properties of G. mellonella silk. Similarly, high conservation of the apolar sequence VIVI followed by DD or ED (Fig. 5A) in all analyzed A repeats (in a single case DD was replaced by ND) of the H-fibroin suggests that hydrophobic interactions and electrostatic bonds may be involved in H-fibroin cross-linking. More data are needed to reconcile molecular conformation of the H-fibroin fibers.

Functional Considerations-- In the presence of long repeats of distinct internal structures, the H-fibroin of G. mellonella resembles certain spider silks (1, 49). Spiders typically produce several types of silk whose propensities match the functional requirements. The dragline and web frame silks are very strong, whereas the orb web silks excel in elasticity. The caterpillars possess only one type of silk gland, and their silk is generally used for the construction of cocoons in which they pupate. Cocoon silk should be strong and persistent, and elasticity is irrelevant. These properties are met in the silk of B. mori and A. pernyi that have tensile strengths 7.4 × 108 and 5.8 × 108 Nm-2, and extensibility 24 and 35%, respectively (19).

In a number of lepidopteran families, the use of silk is not limited to cocoon spinning. The larvae of G. mellonella, which develop in bee colonies, produce large quantities of silk from the second larval instar on, to construct silk tubes protecting them against the detection and killing by bees. The tubes are continuously renewed, but each section is used for several days. It must be strong and extensible to allow the larva inside to grow and to turn around without leaving the tube. Older parts of the tube are eaten and digested by the larva. Upon reaching its full size in the seventh larval instar, the larva abandons the tube and seeks a suitable place for cocoon spinning. The larva of 200 mg can then suspend on the silk thread and descend while spinning.

The functional requirement for a combination of strength and elasticity probably drove the silk evolution in G. mellonella. The elasticity of G. mellonella silk is obvious from the observations of larvae moving in their silken tubes, but it has not been measured. A note on extensibility hysteresis upon cyclical application of the strain load indicates elasticity, but no details were given (18). The tensile strength of 7.5 × 108 Nm-2 and the 101% extensibility of G. mellonella silk are the highest values found in Lepidoptera (18). The reported increase of extensibility in water may be regarded as an adaptive feature. The environment of bee nests is humid, whereas cocoons, for which extensibility is rather disadvantageous, are usually formed outside the nests in a dry place. G. mellonella silk is "designed" to be strong and flexible in the silk tubes produced in the warm and moist environment of the bee nest, digestible at the alkaline pH in the gut of caterpillars, and rigid and persistent in the cocoons that must last for weeks or months. These features are embodied in the structure and extreme regularity of the H-fibroin repeats.

    ACKNOWLEDGEMENTS

We thank Drs. Dalibor Kodrík and Changsong Yang for sharing unpublished data on the fibroin protein analysis and on the developmental changes in H-fibroin gene expression, respectively. The PstI-BclI fragment of B. mori H-fibroin gene was kindly provided by Dr. P. Couble of the University of Lyon, France. We also thank Dr. J. S. Edwards of the University of Washington, Seattle, WA, for critical reading of the manuscript.

    FOOTNOTES

* This work was supported by Grants 204/96/1100 and 204/00/0019 from the Grant Agency of the Czech Republic and by Grant ME 204 from the Czech Ministry of Education, Youth, and Sports.The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBankTM/EBI Data Bank with accession number(s) AF095239 and AF095240.

Dagger To whom correspondence should be addressed: Entomological Institute, Academy of Sciences, Branisovská 31, 370 05 Ceské Budejovice, Czech Republic. Tel.: 420-38-5300350; Fax: 420-38-5300354; E-mail: sehnal@entu.cas.cz.

Published, JBC Papers in Press, March 8, 2002, DOI 10.1074/jbc.M201622200

    ABBREVIATIONS

The abbreviations used are: H-fibroin, heavy-chain fibroin; L-fibroin, light-chain fibroin; nt, nucleotide.

    REFERENCES
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

1. Gosline, J. M., Guerette, C. S., Ortlepp, C. S., and Savage, K. N. (1999) J. Exp. Biol. 202, 3295-3303[Abstract]
2. Pauling, L., and Corey, R. B. (1953) Proc. Natl. Acad. Sci. U. S. A. 39, 253-256[Free Full Text]
3. Lucas, F., and Rudall, K. M. (1968) in Comprehensive Biochemistry (Florkin, M. , and Stota, E. H., eds), Vol. 26B , pp. 475-558, Elsevier Science Publishers, Amsterdam
4. Hayashi, C. Y., and Lewis, R. V. (1998) J. Mol. Biol. 275, 773-784[CrossRef][Medline] [Order article via Infotrieve]
5. Sehnal, F., and Akai, H. (1990) Int. J. Insect Morphol. Embryol. 19, 79-132[CrossRef]
6. &Zbreve;urovec, M., Yang, C., Sehnal, F. (1998) J. Biol. Chem. 273, 15423-15428[Abstract/Free Full Text]
7. Nirmala, X., Kodrík, D., &Zbreve;urovec, M., Sehnal, F. (2001) Eur. J. Biochem. 268, 1-10[Medline] [Order article via Infotrieve]
8. Tanaka, K., Mori, K., and Mizuno, S. (1993) J. Biochem. (Tokyo) 114, 1-4[Abstract/Free Full Text]
9. &Zbreve;urovec, M., Kodrík, D., Yang, C., Sehnal, F., Scheller, K. (1998) Mol. Gen. Genet. 257, 264-270[CrossRef][Medline] [Order article via Infotrieve]
10. Tanaka, K., and Mizuno, S. (2001) Insect Biochem. Mol. Biol. 31, 665-677[CrossRef][Medline] [Order article via Infotrieve]
11. Inoue, S., Tanaka, K., Arisaka, F., Kimura, S., Ohtomo, K., and Mizuno, S. (2000) J. Biol. Chem. 275, 40517-40528[Abstract/Free Full Text]
12. Sezutsu, H., and Yukuhiro, K. (2000) J. Mol. Evol. 51, 329-338[Medline] [Order article via Infotrieve]
13. Warwicker, J. O. (1960) J. Mol. Biol. 2, 350-362[Medline] [Order article via Infotrieve]
14. Marsh, R. E., Corey, R. B., and Pauling, L. (1955) Biochim. Biophys. Acta 16, 1-34[Medline] [Order article via Infotrieve]
15. Marsh, R. E., Corey, R. B., and Pauling, L. (1955) Acta Crystallogr. 8, 710-715[CrossRef]
16. Zhou, C.-Z., Confalonieri, F., Medina, N., Zivanovic, Y., Esnault, C., Yang, T., Jacquet, M., Janin, J., Duguet, M., and Perasso, R. (2000) Nucleic Acids Res. 28, 2413-2419[Abstract/Free Full Text]
17. &Zbreve;urovec, M., Sehnal, F., Scheller, K., Kumaran, A. K. (1992) Insect Biochem. Mol. Biol. 22, 55-67
18. Hepburn, H. R., Chandler, H. D., and Davidoff, M. R. (1979) Insect Biochem. 9, 69-77
19. Denny, M. W. (1980) Symp. Soc. Exp. Biol. 34, 247-272[Medline] [Order article via Infotrieve]
20. Sehnal, F. (1966) Z. Wiss. Zool. 174, 53-82
21. Yang, C., Teng, X., &Zbreve;urovec, M., Scheller, K., Sehnal, F. (1998) Gene (Amst.) 209, 157-165[CrossRef][Medline] [Order article via Infotrieve]
22. Ohshima, Y., and Suzuki, Y. (1977) Proc. Natl. Acad. Sci. U. S. A. 74, 5363-5367[Abstract/Free Full Text]
23. Cherbas, L., and Cherbas, P. (1993) Insect Biochem. Mol. Biol. 23, 81-90[CrossRef][Medline] [Order article via Infotrieve]
24. Proudfoot, N. J. (1991) Cell 64, 671-674[CrossRef][Medline] [Order article via Infotrieve]
25. Nielsen, H., Engelbrecht, J., Brunak, S., and von Heijne, G. (1997) Protein Eng. 10, 1-6[Abstract/Free Full Text]
26. Michalik, J., Sehnal, F., and Lassota, Z. (1984) Acta Biochim. Pol. 31, 139-148[Medline] [Order article via Infotrieve]
27. Lucas, F., Shaw, J. T. B., and Smith, S. G. (1960) J. Mol. Biol. 2, 339-349[Medline] [Order article via Infotrieve]
28. Bonnot, G., and Delobel, B. (1970) Ann. Zool. Ecol. Anim. 2, 595-605
29. Tsujimoto, Y., and Suzuki, Y. (1979) Cell 18, 591-600[CrossRef][Medline] [Order article via Infotrieve]
30. Tamura, T., Inoue, H., and Suzuki, Y. (1987) Mol. Gen. Genet. 206, 189-195[CrossRef]
31. Tamura, T., and Kubota, T. (1989) in Wild Silkmoths 88 (Akai, H. , and Kiguchi, M., eds) , pp. 67-72, National Institute Sericultural Insect Science, Tsukuba, Japan
32. Datta, A., Ghosh, A. K., and Kundu, S. C. (2001) Insect Biochem. Mol. Biol. 31, 1013-1018[CrossRef][Medline] [Order article via Infotrieve]
33. Kusuda, J., Tazima, Y., Onimaru, K., Ninaki, O., and Suzuki, Y. (1986) Mol. Gen. Genet. 203, 359-364[CrossRef]
34. Tanaka, K., Kajiyama, N., Ishikura, K., Waga, S., Kukuchi, A., Ohtomo, K., Takagi, T., and Mizuno, S. (1999) Biochim. Biophys. Acta 1432, 92-103[CrossRef][Medline] [Order article via Infotrieve]
35. &Zbreve;urovec, M., Vasková, M., Kodrík, D., Sehnal, F., Kumaran, A. K. (1995) Mol. Gen. Genet. 247, 1-6[CrossRef][Medline] [Order article via Infotrieve]
36. Zhou, C.-Z., Confalonieri, F., Jacquet, M., Perasso, R., Li, Z.-G., and Janin, J. (2001) Proteins Struct. Funct. Genet. 448, 119-122
37. Nakamura, T., Soyama, A., and Wada, A. (1991) FEBS Lett. 289, 123-125[CrossRef][Medline] [Order article via Infotrieve]
38. Garel, J. P., Mandel, P., Chavancy, G., and Daillie, J. (1970) FEBS Lett. 7, 327-329[CrossRef][Medline] [Order article via Infotrieve]
39. Suzuki, Y., and Brown, D. D. (1972) J. Mol. Biol. 63, 409-429[CrossRef][Medline] [Order article via Infotrieve]
40. Mita, K., Ichimura, S., Zama, M., and James, T. C. (1988) J. Mol. Biol. 203, 917-925[CrossRef][Medline] [Order article via Infotrieve]
41. Armour, J. A. L., Monckton, D. G., Neil, D. L., Tamaki, K., MacLeod, A., Allen, M., Crosier, M., and Jeffreys, A. J. (1993) Genome Anal. 7, 43-57
42. Beckwitt, R., Arcidiacono, S., and Stote, R. (1998) Insect Biochem. Mol. Biol. 28, 121-130[CrossRef][Medline] [Order article via Infotrieve]
43. Manning, R. F., and Gage, L. P. (1980) J. Biol. Chem. 255, 9451-9457[Abstract/Free Full Text]
44. Hayashi, C. Y., and Lewis, R. V. (2000) Science 287, 1477-1479[Abstract/Free Full Text]
45. Smith, G. P. (1976) Science 191, 528-535[Abstract/Free Full Text]
46. Dover, G. (1982) Nature 299, 111-117[CrossRef][Medline] [Order article via Infotrieve]
47. Meeds, T., Lockhard, E., and Livingston, B. T. (2001) J. Mol. Evol. 53, 180-190[CrossRef][Medline] [Order article via Infotrieve]
48. Termonia, Y. (1994) Macromolecules 27, 7378-7381[CrossRef]
49. Gatesy, J., Hayashi, C., Motriuk, D., Woods, J., and Lewis, R. (2001) Science 291, 2603-2605[Abstract/Free Full Text]


Copyright © 2002 by The American Society for Biochemistry and Molecular Biology, Inc.
Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg