CfT-I: an LTR-retrotransposon in Cladosporium fulvum, a fungal pathogen of tomato
Transcript of CfT-I: an LTR-retrotransposon in Cladosporium fulvum, a fungal pathogen of tomato
Mol Gen Genet (1992) 233 : 337-347
© Springer-Verlag 1992
Original articles
CfT-I: an LTR-retrotransposon in Cladosporium fulvum, a fungal pathogen of tomato
Mark T. McHale 1'2, lan N. Roberts 1'3, Stuart M. Noble 1, Christine Beaumont 1, Michael P. Whitehead 1, Devanshi Seth 1'4, and Richard P. Oliver 1
i Norwich Molecular Plant Pathology Group, University of East Anglia, School of Biological Sciences, Norwich NR4 7T J, UK
Received November 6, 1991
Summary. A retrotransposon from the fungal tomato pathogen Cladosporium fulvum (syn. Fulvia fulva) has been isolated and characterised. It is 6968 bp in length and bounded by identical long terminal repeats of 427 bp; 5 bp target-site duplications were found. Puta- tive first- and second-strand primer binding sites were identified. Three long open reading frames (ORFs) are predicted from the sequence. The first has homology to retroviral gag genes. The second includes sequences ho- mologous to protease, reverse transcriptase, RNAse H and integrase, in that order. Sequence comparisons of the predicted ORFs indicate that this element is closely re- lated to the gypsy class of LTR retrotransposons. Races of the pathogen exhibit polymorphisms in their comple- ment of at least 25 copies of the sequence. Virus-like particles which co-sediment with reverse transcriptase activity were observed in homogenates of the fungus. This is the first report of an LTR retrotransposon in a filamentous fungus.
Key words: Transposable elements - Leaf mould - Virus- like particles - Reverse transcriptase - Fulvia fulva
Introduction
Despite intensive investigation of the genetic structure of filamentous fungi, knowledge of transposable elements in this group of organisms in sparse. A LINE-like trans- poson was detected in an African strain of Neurospora crassa (Kinsey and Helber 1989) and a number of mobile introns have also been found (Michel and Lang 1985). Laboratory strains of Aspergillus nidulans and N. crassa appear to lack active transposons (Kinsey and Helber
Present addresses. 2 SmithKline Beecham, The Frythe, Welwyn, Herts, UK 3 Institute of Food Research Norwich, Colney Lane, Norwich, UK 4 Biotech International, Technology Park, Bentley, Western Aus- tralia Correspondence to. R.P. Oliver
1989; J. R. Kinghorn and C. Scazzocchio, personal com- munications), a feature that doubtless aided their contri- bution to classical genetics. In contrast, many phyto- pathogenic fungi display phenotypic instability, which is characteristic of organisms harbouring transposons. For example, many pathogens spontaneously produce sterile, non-pathogenic mycelium when cultured in vitro, a situa- tion that is often cured by passaging through the plant host to restore vigour. In the field, the rapid appearance of novel, virulent forms in response to resistant varieties of crops is all too frequent (see e.g. Dinoor et al. 1988 ; Lindhout et al. 1989). The molecular basis of these phenomena is unknown, in part because detailed genetic analysis of these species is in its infancy.
Retrotransposons are an otherwise ubiquitous class of mobile genetic elements that require a reverse transcrip- tion step to undergo replicative transposition (Boeke and Corces 1989). The 'life-cycle' of a retrotransposon is similar to that of retroviruses; however, retrotrans- posons do not produce infectious virus particles, instead they produce non-infectious virus-like particles that are intermediates during transposition (Shiba and Saigo 1983; Garfinkel et al. 1985; Mellor et al. 1985; Goreleva et al. 1989). Numerous virus-like particles have been observed in preparations of filamentous fungi (Buck 1986; Bozarth 1972): functions have been ascribed to only a handful.
The integrated copy of a retrotransposon undergoes transcription via host RNA polymerase II to produce a polyadenylated mRNA, which acts as both a template for translation and as a single-stranded genomic RNA in the mature virus-like particle (Fink et al. 1986). Transla- tion of the retrotransposon mRNA produces the struc- tural proteins of the gag (group-specific antigen) gene and several enzymatic activities (acid proteases, reverse transcriptase, RNase H, integrase) encoded by the pol (polymerase) gene. A third open reading frame (ORF) may encode a protein analogous to retroviral env (en- velope) genes.
LTR retrotransposons and retroviruses possess a sim- ilar genomic organisation and in certain cases share ex-
338
tensive DNA and amino acid sequence homology. LTR retrotransposons have been divided into two distinct groups on the basis of sequence comparisons (Xiong and Eickbush 1988; Doolittle et al. 1989). One such group is the gypsy group of retrotransposons. This class of retro- transposon exhibits the closer phylogenetic relationship to true retroviruses. The other group of LTR retrotrans- posons includes the eponymous copia from Drosophila melanogaster (Mount and Rubin 1985), Tyl from Sae- charomyces cerevisiae (Clare and Farabaugh 1985) and Tntl from Nicotiana tabacum (Grandbastien et al. 1988).
The gypsy group of LTR retrotransposons has a di- verse membership, comprising retroelements from Lilium henryi [del (Sentry and Smyth 1989; Smyth et al. 1989), Dictyostelium discoideum [D1RS-1] (Cappello et al. 1985), S. cerevisiae [Ty3] (Clark et al. 1988; Hansen et al. 1988), Schizosaecharomyces pombe (Levin et al. 1990) [Tfl, Tf2], several Drosophila species [gypsy, 412, 297, 17.6, TOM] (Marlor et al. 1986; Yuki et al. 1986; Inouye et al. 1986; Saigo et al 1984; Tanda et al. 1988) and a moth [TED] (Friesen and Nissen 1990). The plant viruses CAMV [cauliflower mosaic virus] (Gardner et al. 1981) and CERV [carnation etched ring virus] (Hull et al. 1986) are also related to this group. To our knowledge, LTR retrotransposons have never been reported in filamen- tous fungi.
In this paper we describe the isolation of an LTR retrotransposon from the genome of the filamentous phytopathogen Cladosporium fulvum, a biotrophic, non- obligate member of the Deuteromycetes (Fungi imper- fecti), the causative agent of tomato leaf mould. The disease is a model system for the study of plant-pathogen interactions. The pathogen exists in a number of physio- logical races, which exhibit a gene-for-gene relationship with cultivars of tomato carrying different resistance genes (de Wit 1977). Five avirulence genes have been identified (Lindhout 1989) and Avr9 has been cloned (van Kan et al. 1991). We have previously described the isola- tion of a 225 bp sequence, designated P5, from the genome of C. fulvum (McHale et al. 1989). P5 encoded a putative protein with striking homology to reverse transcriptase sequences from various retroelements. P5 was shown to be a moderately repetitive sequence, which hybridised to about six bands in pulsed field gel separa- tions of C. fulvum chromosomes (Talbot et al. 1991). In this paper we report the isolation and characterisation of cosmid clones containing DNA homologous to P5. We show that they contain full-length copies of a retrotrans- poson and suggest that these elements be designated CfT-1 (C. fulvum transposon).
ed pAN7-2, a vector based on pAN7-1 (Punt et al. 1987). Colony and Southern hybridisations used Hybond-N membranes (Amersham) and standard procedures (Hodgson and Fisk 1987; Maniatis et al. 1982).
Single- and double-stranded clones were sequenced using the Sequenase 2.0 kit (United States Biochemicals), the Pharmacia (Uppsala) T7 polymerase kit for manual electrophoresis or the Auto-read kit and an ALF-Phar- macia sequencing apparatus. Primers were synthesised on a Cyclone BioSearch synthesiser. Sequences were analysed using the DNASIS package (Pharmacia) and the NBRF and EMBL databases. Phylogenetic analysis used the PAUP 3.0 package (Swofford 1990). Alignments of amino acids were performed by eye.
For isolation of virus-like particles (VLPs), 5 x 103 condiospores per 50 ml of Czapeck's Dox liquid medium (Oxoid) were incubated at 25 ° C with rapid shaking (200 rpm) for 10 days. Cells from 10~12 g wet weight of mycelium were broken down by freezing in liquid nitro- gen and grinding in a pestle and mortar in 15 ml of extraction buffer (1 M MgSO4, 10 mM TRIS, pH 7.4, 10 mM CaCI2) with the aid of sand (E1-Sherbeini et al. 1984). The homogenate was incubated on ice for 20 rain and centrifuged at 9750 x g in a Sorvall SS34 rotor for 1 h. Three-millilitre aliquots of the supernatant were overlaid onto 1 ml of 45% w/v sucrose in PKE buffer (0.02 M NazHPO4, 0.15 M KC1, 0.01 M EDTA, pH 7.6). The samples were centrifuged in an SW41 Beckman rotor at 61000 x g overnight at 4 ° C. The supernatant was dis- carded and the pellet was resuspended in 1 ml of PKE and kept on ice. The samples were loaded onto a 10-80% w/v sucrose density gradient (made up in PKE buffer) and centrifuged in an SW41 rotor (130000 x g, 4 ° C, 4 h). One millilitre fractions were collected by pipetting gently from the top of the tube. The fractions, numbered 1 to 12, from the top to the bottom of the tube, were diluted 5 times in PKE and centrifuged at 150 000 x 9 at 4 ° C in a Beckman Ti65 rotor. The pellets were resuspended in a minimum volume of PKE and assayed for reverse transcriptase by the method of Garfinkel et al. (1985), with poly G as template and oligo dC as primer.
Aliquots from each fraction of the sucrose density gradient were extracted with one volume of chloroform to remove lipid and allowed to settle for 5 min onto carbon-coated plastic film on nickel grids. Samples were fixed for 5 min with 2.5% glutaraldehyde in phosphate buffer, pH 7.4; 2% (w/v) uranyl acetate was used for negative staining for 2 rain. The samples were rinsed in phosphate buffer in between the steps. Grids were exam- ined for VLPs in a JEOL 100C electron microscope.
Materials and methods Results and discussion
C. fulvum races were maintained on V8 juice agar (Harl- ing et al. 1988) plates at 25 ° C. Escherichia coli JM83, JM101, pUC19 and M13 mp 18 and 19 were used for subcloning and sequencing (Norrander et al. 1983). Fun- gal DNA was prepared according to Raeder and Broda (1985). The cosmid library was constructed by ligating DNA partially digested with Sau3A into BamHI-digest-
Isolation of full-length copies of CfT-1
A cosmid library was screened with the C. fulvum reverse transcriptase clone pNOMP5 (McHale et al. 1989). Nine hundred cosmid clones were screened, from which five positive clones were isolated. DNA from the clones was digested with BamHI or HindIII, and probed with
pNOMP5 DNA. BamHI digestion revealed that cosmids 691,712 and 786 each contained a 1.7 and 1.8 kb doublet of hybridising DNA. This doublet was reminiscent of BamHI-digested, C. fulvum genomic DNA probed with pNOMP5 (McHale et al. 1989). pNOMP5 hybridised to different BamHI bands in the remaining two cosmids, 3612 and 920 (data not shown). Southern hybridisation of HindIII-cut C. fulvum genomic DNA with pNOMP5 had revealed a multitude of high molecular weight bands, implying that intact copies of the P5 retrotransposon are not cut with this restriction enzyme (McHale et al. 1989). Similarly, pNOMP5 hybridised to high molecular weight bands in the HindIII-digested cosmid DNAs, indicating that they may contain full-length copies of the C. fuF rum retroelement. The cosmids which hybridised to pNOMP5 were subcloned by partially digesting with HindIII, religating and transforming into E. eoli. The smallest subclones that retained the 1.7 and 1.8 kb Barn- HI doublet were selected for detailed analysis. Subclones pNOM712:3 (15 kb) and pNOM691:125 (20 kb), were mapped with EcoRI, BamHI and HindIII.
Sequencing of CfT-1:712
The PstI, BamHI and EcoRI fragments of pNOM712:3 were subcloned into pUC18 (Norrander et al. 1983), thus generating a series of overlapping subclones that were representative of the whole of 712:3 (Fig. 1). The long terminal repeats were detected by probing digests of 712:3 with each of the subclones, searching for multiple hybridisations. This delimited the element to approxi- mately 7.4 kb. (Fig. 1). Many of the subclones were transferred to M13 mpl8. Both strands of the element were sequenced by subcloning and using specific primers (Fig. 2).
The Ion 9 terminal repeats
Examination of the LTRs revealed many features that are characteristic of retrotransposons (Fig. 3). The LTRs are 427 bp long and are identical. Sequencing the CfT-1:691 LTRs from the PstI sites outwards also gave the same identical sequences. Since LTR sequences are
l kb
P P BE X EBE PXB P X I I I I I III I11 I I
ILTRi ILTRI
I P3 I P31 I P13 I P6 [ B21 I B13 I B15 I
I E12 I I E21 P5
ORF1 ORF3 O R F 2
Fig. 1. Restriction map, subclones and open reading frames (ORFs) of the CJT- 1.712 region. Abbreviations of restriction enzymes used are as follows-PstI, P; EcoRI, E; BamHI, B; J(hoI, X. Subclones are referred to in the text with prefix pNOM. LTR, long terminal repeat
339
homogenised during replication (Boeke and Corces 1989) and would be expected to diverge once integrated into the genome, the identity of the LTR sequences places a limit on the time at which the cloned elements transposed. The presence of highly homologous se- quences as represented in the 691 and 712 subclones, at different genomic locations reinforces the view that this element has recently transposed.
Examination of the sequences flanking the LTRs in 691 and 712 did not permit an unequivocal assignment of the length of the target site duplication because of degeneracy of the sequences. However, comparison of the 691 and 712 sequences indicated that 5 bp target-site sequences were duplicated during integration. The two duplicated sequences° TATAG(712) and GTACC(691), bear no obvious relationship to each other. The first sequence is reminiscent of the TATA and ATAT target- sites of the gypsy retrotransposons, 17.6 297 and Beagle (Bingham and Zachar 1989). The ends of the LTRs exhibit a 5 bp perfect inverted repeat structure. The LTRs terminate in the characteristic sequences 5'TG...CA3'.
Transcription of retrotransposons starts and termi- nates within the LTR sequences. In some cases, se- quences within the LTRs have been shown to have the properties of enhancers (Errede et al. 1985). It is inte- resting to note that sequences in the CfT-1 LTR show striking homology to the Tyl enhancer (Xu and Boeke 1990) and also to enhancers identified in SV40 (Weber et al. 1984), yeast PH080 (Gilliquet et al. 1987) and N. crassa (Geever et al. 1983) (Fig. 3). Two sets of re- peated sequences lie on either side of the SV40 en- hancer-like region (Fig. 3).
Immediately 3' to the 5' LTR is the presumed primer binding site for first-strand reverse transcription. There is no TGG sequence, implying that an internal tRNA fragment rather than an intact tRNA is used for priming. Internal tRNA fragments are used for priming copia retrotransposition (Kikuchi et al. 1986). The CfT-1 primer binding site is complementary to conserved re- gions of tRNAs. The sequence can be imperfectly aligned with an internal region of a S. pombe serine tRNA (Kohli et al. 1984). Second-strand priming involves annealing to a polypurine tract immediately 5' to the 3' LTR. In CfT-1, the sequence A G A G A G G G G A T G G is found at this position.
Four PstI sites were found in CfT-1, one in each LTR and two in the internal, single copy region. Southern hybridisation of the subclone pNOMP5 to PstI-cut C. fulvum genomic DNA revealed just two very intense bands of 3.2 kb and 4.9 kb (Fig. 4). All other restriction enzymes, such as XhoI (lane 2), give a ladder of heteroge- neous bands in addition to the strong band predicted by the presence of two internal XhoI sites (Fig. 1) in 712, when hybridised to fragments of CfT- 1. This implies that all pNOMP5-related sequences have conserved PstI sites in the LTRs. The internal PstI sites are symmetrically arranged so either one, but not both, is absent in about half the genomic copies of the retrotransposon. The presence of conserved LTR sequences amongst all the CfT-1 sequences is evidence of an unusual degree of
76
164
252
340
428
516
TAAGAGACGGTAATAGACTATA~AAGGAG CTTAAGATAT CGGTATACG CACT~C=ATAAACAATAG OOAG OOATA
TAC C CTTATATAq~2GTTAC C CGTAATAATATAATATA~[fG%~gACGGAT CGGACACGAC CGAAGGAGCGT CAG C CAATCAATAG C C CG C
ACTAATGACAGG CG ~ C CACAG CTAGG CC~gGCACAG C CG CTG CAGAGTG C CTGGAC4I~ CCTC~AG~fG C CTGGAGTG CCTA~OO
TAGG CACAGTAC4] CACC~ATAGGCACGGTAC4~ CA~AAAGGAATGGA;~I'I'I'C tit ±~T~IT I'AGGAACG CATCGTAC CCITATC4~TT
CGAGGCG%~f CAAOOCACTGTT CAATAT CACTG'±'±'I~GAATACACCACAAC C CAACCTCTG COO CACI]9 CTG C±'±GGTc ±'I'I'ACAAGAC
GAGACAG OTACTG CI~9 CAT CACTCTC C CT CAC CITCCAT CGGCCACGAGACAAAACCCGATI~AG~T tT±'t~fGATCI'fGTGATA
T CA~AGGTTATAC CAAAC OOAACAGc ±TIGAACGC COO CAACGATGGGAGATCGACAGAGTGAC ~AACACC CGC CC C CAGTGA
Met Leu Set Met Pro Trp Leu Leu Ile Gln Ala Thr ASn
604 AGCA~CAGAGCCCGACGACOOCAGACCGCGAACGATG CTA TCA ATG CCT
Pro Gln Gly Trp Gln Pro Glu Phe Phe T!rr Gly Asp Arg Val
677 COO CAAGGT TGG CAA CCA GAG %~fC qTgC TAT GGC GAC CGA GTC
Met Asp.Met Tyr Phe Leu Phe Asn Set Met Thr Glu Asn Leu
743 ATG GAT ATGTAC~C q*fG~CAAC TCC ATG ACT GAG AAC OOC
Leu Arg Gly Arg Ala Gln His Trp Val Lys Pro Phe Leu Arg
809 ~q~ AGA GGA CGA GCA CAG CAT q~G GTC AAA CCA ~TC CI~ AC~
Asp ASn Ala ASp Gly Val Phe Lys Ser T!rr ASn His Leu Lys
875 GAC AAT GCTGAC GGAGTC %~TCAAG TCTTACAAC CAT OOCAAG
Val Ser Ash Glu Ile Ala Thr Ala Val Arg Val Ile Gln His
875 GTC TCG AAC GAG A_q~ GCC ACT GCC C4I~ CC/f GTT ATT CAG CAC
Glu Tyr Ala Ala Lys Phe Gln Glu Tyr Ala Gln Leu Thr ASp
941 GAA TAT GOO GCC AAG qTC CAG GAA TAC GCG CAA CTC ACC GAT
Met q~rr Arg Arg Gly Leu Lys Glu His Val Lys Asp Glu Leu
1073 Aq~Z TAT CGAAGA GGA CI~ AAG GAG CAT GTC AAA GAC GAG Gly Leu Gly ASp Leu Val Gln Val Thr Ile ASp Leu Asp ASp
1139 GGC ~TA GGC GAC OOA GTC CAA GTC ACG ATC GAT CI~ GAC GAC
Arg Arg Tyr ASp Set Lys Val Ser Gly Lys Ala Gly T!rr Thr
1205 CGA CGA TAC GAC TCC AAG GTA TOO GGAAAG GCC GGA TAC ACG
Gly Phe Ash asp ASh q~yr ASn Lys Pro Lys Asp Lys Pro Tyr
1271 GGT TTC AAC GAC AAC TAC AAC AAA CCC AAG GAC AAG CCG TAC
ASp Val Thr Glu Lys Gly Arg Lys Ile Arg Asn Ser Lys Gly
1337 GAC GTT ACC GAG AAA GGT CGC AAG ATC AGG AAC AGC AAG GGG
Glu Thr Arg Thr Cys Tyr Gly Cys Gly Lys Pro Gly His Ile
1403 GAA ACA CGA ACT TGC TAT GGA TGC GGC AAA COO GGC CAC ATA
Met Val Arg Arg Glu Gln Pha ASn Met Met Gln Arg Arg Thr
1469 ATG GTG CGC CGC GAA CAG ~C AAC ATG ATG CAA CGA CGA ACT
Ser Leu Gly Asp Ile ASp Tyr Thr Arg Arg Val Glu Glu Arg
1535 TOO CTG GGA GACATC GAC TAT ACT CGGAGA GTG GAG GAG AGA
Leu leu ASp pro Ser Gln Gly Arg gly Gly Cys His Gly Gln
1601 OOC CTG GAC CCC TCA CAA GGA OGA GGA GGA TGC CAT GGG C~
Leu Gln ASn Gln Pro Arg Gln Phe Asn Met Met Ala Arg Arg
1667 CTG CAG AAC CAG CCA CGA CAG TTC AAC ATG ATG GCA CGA AGA
Set Arg Asn Gly His Gly Ser Leu His Trp Arg Phe Cys T!rr
1733 TCT CGAAAT GGT CAC GGA AGC CI~ CAT TGG C~A%~fCTGT TAC
Set Ala Lys Set Gly Ala Gly Tyr Trp Pro Gln Gln Pro Arg 1799 TCC GCG AAA TCA GGA GCA GGG TAT TC49 CCT CAG CAA CCG CGA
Gln ASp Ala Thr Leu Gln Glu Val ASp Ile Asp Glu Set Cys
1865 CAA GAC GCC ACG OOC CAA GAA GTG GAC ATC GAC GAA TCC q~C
Glu ASn Gln ASp Pro Gly Trp ASn Gly Ser Trp Ser Pro Glu
1931 GAA AAT CAG GAC CCA GGA TGG AAC GGA TCG TC49 TCA CCG GAA
Thr Gly Leu ASp Thr Ile Glu GIu Asp Gln Asp Pro Glu Glu
1997 ACC GGG CTG GAT ACC ATC GAG GAA GAC CAG GAT CCG GAA GAA
Gly Glu Glu Set Asp Thr ASp ASp Asp ASn Glu Gln Met Thr
2063 GGA GAAGAGAGC GAC AOOGAC GAC GACAAC GAACAAATG ACC
Leu Tyr ASp Met Ile Ile His Leu Gln Arg Arg His Glu Glu
2129 CTC TAC GAC ATGATA An CAC OOC CAACGA AC~ CAT GAAGAA
Arg Met Leu His Ser Ile Glu Phe Asp Lys Thr Leu Asp Thr
2195 CGA ATG OOA CAC TCC ATT GAG '~±'±' GAC AAG ACA OOC GAC ACC
Pro Leu Met Glu Thr qg~r Glu ASh Leu Ala Thr Ala Val Thr
2261
2327
2394
2460
Fig. 2
13 TGG CTA CTC ATA CAG GCA ACG AAT
Lys Phe Asp Thr Trp Val Set Gln 35
AAG qrfC GAC ACT TGG GTC TCT CAG
Lys Pro Ile Phe Ala Ala ~hr Phe 57
AAG CCT ATC 'i'±'±' GCC ACC ACC TTC
Lys T!rr Leu Asp Set Asn Gly Glu 79
AAA TAC CTG GAT AGC AAC C~A GAA
His Ala Met Lys Ser VAI Phe Gly i01
CAT GCT ATG AAG AGC GTC '±'±',' GGT
Leu Thr Gln Lys Thr Set Thr Ala 123
ACT CAG AAG ACT TCC ACA GCC
Trp Asp Asp Glu Ala Leu Gln Val 145
TC4Z GAC GAC GAA GCA CTT CAA GTC
Met Arg ASp Gly Arg Lys Ile Asp 167
ATG AGG GAC GGT CGG AAG ATC GAT
Lys Leu T!rr Glu Arg Ala Met Glu 189
AAG CTC TAC GAG AGA GCT ATG GAA
Pro Gly Tyr Asp Asn Arg Asn Arg 211
CCA GGC TAC GAC AAC CGT AAC AGA
Tlrr Gly Pro Gln Pro Met Glu Leu 233
TAC GGA CCA CAG CCA ATG GAG CTC
Asn Arg Arg Pro Pro Set Set Arg 255
AAC AGA AGA CCC CCG A6~ TCA AGA
Ala Arg Asp Cys Arg Gly Lys Asn 277 GCA AGG GAC TGC CGC GGT AAG AAT
Set Lys Ser Glu Set Set Val Glu 299
AGT AAA TO0 GAG AGC A~T G%~f GAG Ala Set Asn Gly Set Thr GIu Pro 321
GCT AGC AAT GGA AGC ACG GAG COO
Gln Met Val Val Ile Pro Phe Glu 343 CAA ATG GTG GTC A~9 CCA TTC GAG
Val Asp Tyr Glu Ile Glu Set GIu 365
GTC GAC TAC GAA ATC GAG TCC GAA
Gln Asp Set Cys Gln Val His Tyr 387
CAA GAC TCG TGT CAG GT~ CAC TAC
Gly Thr Leu Gly Ala ~ His Arg 409
GGA ACC CTG GGA GCA ACA CAT AGG
phe Asp Asp Asp Gly Set Asp Lys 431
~C GAC GAC CAT GGA AGC GAC AAA
Pro Gln Glu Glu Set Glu Glu Thr 453
CCA CAG GAA GAA AC]r CAA GAA ACC
Set Ser Glu GIu ASp Set Ser Glu 475 TCA AGC GAA GAA GAT TCC TCC GAA
Phe Thr Val Asp Ala Pro Lys Lys 497
~C ACC GTA GAC GCA CCG AAG AAG
Phe Leu Pro Arg Ile Gly Gly Arg 519
qTC CTA CCG AGG ATA GGC GGA CGA
Leu Arg Gly Met Ala Trp Gly Tyr 541
CTA CGA GGC ATG GCC TGG GGA TAC
Glu Arg Pro Pro Ile Gly Set Arg 563
CCA CTC ATG GAA ACG ACC GAA AAC CTA GCA ACT GCT GTC ACA GAG CGA CCG CCG ATA GGA AGT AG
Met Ile Gly Thr Gly Tyr Leu Thr Pro Set Gly Thr Phe Val Ser Asn Glu Leu Arg Asn Met Val
ASh Asp Arg T!rr Arg Ile Pro His Thr Val Arg Asn Ile Arg Ile Lys Arg Ile Ala Gln His Gly
AAT GAT AGG TAC CGG ATA CCT CAC ACC GTC AGG AAC AZ~f CC]f ATC AAA CGA ~ GCG CAA CAT GGT Gln His Ala Arg Ala Leu Tyr Set Glu Thr Gln.Lys Ile Gln Glu Arg His Arg Val Gln Met Trp
Ala Ala Arg Thr Arg Thr Leu Leu Arg Ash Ala Glu Asp Pro Gly Thr Ser Ser Set q~r Asp Val
GCA GCA CGC ACG CGC ACT CTA CTC AGA AAC GCA GAA GAT CCA GGA ACG TCA TCG AGT ACA GAT GTG Thr Gln Lys Gly Set Arg Gln Val Gln Ala Glu Ala Set Arg His Glu Ala Arg Arg Set Thr Tyr
Asp Thr Glu Arg Leu Ala Pro Gly Ser Set Gly Gly Leu Glu Thr Arg Set Thr Thr Phe His Ile GAC ACA GAA AGG CTC GCG CCA GGT TCA AGC GGA GGC CTC GAG ACA CGA AGC ACG ACG q~C CAC ATA
Glu Gln Gly Lys Gly Set Tyr Pro Gln Glu Stop
585
22
607
44
629 66
639
341
2526
2592
2658
2724
2790
2856
2922
2988
3054
3120
3186
3252
3318
3384
3450
3516
3582
3648
3714
3780
3846
3912
3978
4 0 4 4
4110
4176
4242
4308
4374
4 4 4 0
4506
4572
4638
4704
Arg Thr Gly Lys Arg Ile Asn Ser Pro Gly Met Ser Asp Arg Arg Glu Leu Cys Asn Thr Glu Lys 88
CGA ACA GGG AAA AGG ATC AAC TCC CCA GGA ATG AGC GAC CGA AGG GAG ~TPA TGC AAC ACA GAA AAG
Ala Gln Ala Set Pro Thr Thr Thr Val Phe Arg Thr Lys Ile Ile Val Ash Gly His Lys Thr Asp ii0
GCT CAA GCA TCA CCA ACG ACA ACA GTG 'i'I'I' C~A ACC AAG ATr ATC GTC AAC GGG CAC AAG ACA GAT
Ala Met Ile Asp Set Gly Ala Ser Gly Ash Phe Ala Set Glu Set Phe Val Thr Arg Ash Arg Ile 132
GCC ATG ATA GAT TCC GGA GCG AGC GGG AAT '±'±'£ GCC TCC GAA TCA 'i'±'±' GTI' ACA AGA AAC AGG ATC
Ala Thr Cys Lys Lys Lye Glu Gly Tyr Glu Leu Ile Ala Val Asp Gly Ser Ser Leu Pro Set Val 154 GCT ACC TGT AAG AAG AAA GAG GGT TAT GAA ~ ATC GCG GTA GAC GGA TCA TCC ~PA CCC AGC GTG
GIu Arg Glu Thr Ile Pro Leu Pro Leu Ala Ile Gln Arg His His Glu Glu Ile Thr Leu Asp Val 176
GAG CGC GAG ACG ATA CCG C~A CCA CTT GCf ATC CAA CGG CAC CAC GAG GAG ~ ACC ~ GAT GTT
Thr Asp Met Ala Ser His ASp Ile Val Leu Gly Met Pro Trp Leu Arg Lys His Asn Pro Val Ile 198
AC9 GAT ATG GCC AGT CAT GAC ATC GTA CTA GGT ATG CCT TGG q~G AGA AAG CAC AAC CCA GTA ATC
Asp Trp Arg Arg Gly Val Leu Thr Phe Arg Glu Cys Glu Cys Val Ile Asp Ile Gln Pro Ala Gln 220 GAC TGG AGA AGA GGA GTA C9C ACA 'i'±'±' AGG GAA TGC GAA TGC GTT ATC GAC An CAG CCT GCG CAG
Trp Gln Arg Ser Leu Ala Asp Glu Ala Arg Lys Gln Leu ASn Arg Ile Gin Leu Ala Pro Thr Arg 242
TGG CAG CC/f TCA CTG GCA GAT GAG GCA AGG AAA CAG C~C AAC AGG ATA C/~ CTT GCG CCG ACT AGA
Arg Thr Glu Glu Pro Pro Set Thr Gly Thr Asp Thr Gly Val Gly Pro Pro Gly His Glu Val Thr 264
AGG ACA GAA GAA CCA CCT TCG ACC GGG ACA GAC AC~P GGC GTG GGG CCG CC~ GGT CAC GAA GTC ACT
Gly Ser Asp Gly Ser ASn Ala Pro Ser Lys Asp Thr Ash Ile Ser GIu Leu Set Ile Pro Lys Glu 286 GGA AGT GAT GGA AGT AAC GCA CCA TOG AAG GAT ACC AAC ATC TCA GAG ~I~ AGT Aq'P CCC AAG GAG
Tyr Arg Lys Trp Set Arg Leu Phe Glu Glu Glu Arg Gly Lys Asp Ala Leu Pro Lys His Gln Pro 308
TH/T CGC ~ ~ TOG CGA CTC TI'I' GAA GAG GAA AGA GGC AAG GAC GCC ~A CCT AAG CAC CAA CCA
Trp ASp His Lys Ile ASn Ile Gln Pro Gly Lys Glu Pro Pro Trp Gly Pro Leu Tyr Gln Met Ser 330
TGG GAT CAC AAG ATA AAC A~ CAG CCA GGG AAA GAG CCT CCA TGG GGA CCC CTA TAT CAA ATG TCT
Glu Lys Glu Leu Gln Thr Leu Arg Glu Trp Leu Lys Glu Lys Leu Ala Lys Gly Trp Ile Arg Arg 352 GAG AAA GAG CTA CAG ACC CTA OGA GAA TGG CI~ AAG GAG AAG CTA GCC AAA GGA TGG ATA CGA CGA
Ser Thr Set Set Ala Gly Thr Pro Cys Met Phe Val Pro Lys Ala Asn Gly Lys Leu Arg Leu Val 374
TCC ACC TCA AGT GCA GGA ACT CCA TGC ATG TTC G~ CCA AAA GCA AAC GC~ AAA CTA CGA CTC GTA
Gln Asp Tyr Arg Lys Leu Asn Glu Ile Thr Ile Lys Asn Arg Tyr Pro Leu Pro Asn Ile Glu Glu 396
CAA GAC TAC CGA ILAG qrfG AAC GAG ATC ACG ATC AAG AAC CGA TAT CCG CTA CCC AAC ATC GAA GAA
Ala Gln ASp Arg Leu Thr Gly Set Asp Tip Tyr Thr Lys Ile Asp Leu Arg Asp Ala Phe Tyr Ala 418 GCA CAA GAC AGA qTDA ACC GGA TCA GAC TGG TAC ACG AAG ~ GAC CTA CGA GAC GCC ~ TAT GCT
Ile Arg Met Ala Glu Gly Glu Glu Tip Lys Thr Ala Phe Arg Thr Arg qhyr Gly Leu Tyr Glu Phe 440
ATC CGA ATG GCA GAA GGA GAA GAA TGG AAA ACC GCT %~fC AGG ACA AGA TAC GGA C9C TAC GAA ~C
Leu Val Met Pro Met Gly Leu Thr ASn Ala Pro Ala Set Cys Gln ASp Leu Val Asn Glu Thr Leu 462
qq~3 GTC ATG CCA ATG GGA ~ ACC AAC GCA CCC GCA TCC TGC CAG GAC CTI ~ GTC AAC GAA ACA CI~
Arg Asp Leu Leu Asp Val Cys Val Val Ala Tyr Met Asp Asp Ile Leu Val Tyr Thr Lys Gly Set 484 AGA GAC CTA CTC GAC GTG TGC (19C GTT GCT TAC ATG GAC GAC ATA CTG GTC TAC ACA AAA GGA TCC
Leu Gln Glu His Thr Lys Gln Val Gln Asp Val Phe Glu Arg Leu Thr Lys Set Gly Phe Lys Thr 506
CTC CAG GAA CAT ACC AAG CAA G%~f CAA GAT GTG TTC GAA CGA CTC ACG AAG TCC GGA TTC AAG ACA
Ala Pro Glu Lys Cys Glu Phe His Lys Lys Glu Val Lys Phe Leu Gly Phe Ile Ile Set Thr Thr 528
GCA CCC GAG AAA TC-C GAA TTC CAC AAG AAA GAA GTC AAG '±'l'±' TTA GGC qq~f ATC ATC AGT ACA ACA
Gly Ile Thr Ile Asp Pro Ala Lys Thr Gln Ser Ile Arg Glu Trp Pro Glu Pro Lys Thr Val Lys 550 GGG ATA ACG ATC GAC CCT GCA AAG ACA CAG TCA ATC AGA GAA TGG CCA GAA CCG AAG ACA GTC AAG
Asp Val Gln Set Phe Leu Gly Leu Ala Asn Tyr Asn Arg Lys Phe Ile Lys Asp Tyr Set Lys Thr 572
GAT GTA CAG TCA TTC CI~ GGA CTC GCC AAC TAC AAC CGG AAA %vfC ATT AAG GAC TAT TCA AAG ACA
Ala Ala Pro Met Thr Met Leu Thr Arg Lys Asp Val ASn Trp Lys Trp Gly Lys GIu Gln Thr Glu 594
GCA GCA CCA ATG ACG A%~ CI~ ACA AGA AAA GAC GTC AAC TGG AAA TGG GGA AAA GAA CAG ACC GAA
Ala Phe Lys Arg Leu Lys Glu Gln Cys Ala Set Ala Pro Thr Leu Arg Leu Phe Asp Gly Set Lys 616 GCG ~PC AAA AGA CTC AAG GAA CAG TGC GCT TCA GCC CCA ACG CTr CGA CTA qTfC GAT GGT AGC AAG
GIu Val His Ile Glu Thr ASp Ala Ser ASp Met Ala Ile Gly Ala Cys Leu Thr Gln Thr His ASp 638
GAA GTC CAC ATC GAG ACC GAC GCG TCT GAT ATG GCA ATA GGC GCA TGT CTA ACA CAG ACA CAC GAT
Gly Lys Arg His Pro Val Ala T!rr Tlrr Set Arg Lys Met Thr Thr Ala Glu Gln Asn Tyr Asp Ile 660
GGG AAA AGA CAC CCA GTG GCC TAT TAT TCC CGG AAA ATG ACC ACA GCG GAA CAG AAC TAC GAC ATC
His Asp Lys Glu Leu Leu Ala Ile Val Ala Ala Met Gln His Tip Arg Val Tyr Val Glu Gly Pro 682 CAT GAC AA~ GAG CTT CTA GCC ATT G~ GCC GCC ATG CAA CAT TGG AGA GTG TAC GTC GAG GGC CCA
Pro Lys Leu Thr Ile Leu Set Asp His Lys ASh Leu Thr Tyr Phe Thr Thr Thr Lys Glu Leu Thr 704
CCG AAG q~gA ACG A~ CIT TCA GAC CAC AAG AAT CTC ACG TAC ~C ACG AC~ ACG AAG GAA c±'i ~ ACC
Arg Arg Gln Ala Arg Trp Ser Glu Leu Leu Gly Gln Tyr Lys Phe Glu Ile Lys Tyr Thr Pro Gly 726
CGA AGA CAA GCC CGC TGG TCG GAG CTG C~9 GGG CAG TAC AAG q'fC GAA ATC AAA TAC ACT CCA GGA
Thr Glu Asn Gly Pro Ala Asp Ala Leu Set Arg Arg Set Asp Tyr Met Glu Gly Lys Glu Pro Val 748 ACA GAG AAC GGC CCG GCA GAT GCG CTG AGC CGA AGA AGC GAT TAC ATG GAA GGA AAA GAA CCA GTG
Gln His Lys Ile Leu Lye Thr Ash Pro Asp Gly Set Leu Set Ala Ash Thr Arg Glu Phe Asn Asn 770
CAA CAC AAA ATA Cq~ AAG ACA AAC CCC GAC GGA AGT CTT AGT GCC AAC AOG AGA GAG ~ AAC AAC
Ile Val Arg Ile Leu Set Asp Lys Glu Glu Gln Phe Pro Ile Set Gln Gly Lys Tyr Gln Val Pro 792
Aq*9 @TA CGA ATC CTA AGC GAC AAG GAA GAA CAG q~gC CCT ATA TC~ CAA GGA AAG TAC CAA GTA CCA
Lys ASp Arg Glu Glu Glu Cys Ile Arg Gln His His ASp Glu Pro Thr Tyr Gly His Pro Gly Thr 814 AAG GAT CGA GAA GAA GAA TGC ATA CGA CAA CAT CAC GAC GAG CCA ACA TAC GGA CAT CCT GGA ACT
Set Lys Thr Val Asp Leu Ile Gln Arg Set Phe Set Phe Pro Gln Met Arg Leu Lys Val Leu Arg 836
342
4770
4836
4902
4968
5034
5100
5166
5232
5298
5364
5430
5505
5582
5648
5714
5780
5846
5912
5978
6044
6126
6213
6300
6387
6474
6561
6648
6735
6822
6909
6996
7083
7170
7257
7344
TCA AAG ACA GTC GAC CfA ATA CAG CGC AGC T�C TCA ffTC CCA CAG q~rr Ile Lys Lys Cys Val His Cys Gln Gln Asn Lys Ala Ala Arg
TAC ATC AAG AAA TGC GTA CAC ff~C CAA CAA AAC AAA GCT GCA CGG
Gln Phe Arg Thr Pro Pro Thr Lys Pro Trp Asp Glu Val Thr Met
CAG ff~C AGG ACA CCA CCA ACG AAA CCA TGG GAC GAG GIfT ACG ATG
Arg Ser Lys Asp Arg Val Thr Gly Gln Ala Tyr ASp Met Ile Leu
AGG TCA AAG GAT CGA GTC ACA C43A CAA GCC TAT GAC ATG ATA CTA
Lys ff~rr Ala His Phe Ile Pro Ala Ser Glu Ile ff~agr Thr Ala Glu
AAA TAT GCA CAC ~TC ATT CCT GCA TCA GAA ATA TAC ACT GCA GAG
Asp Arg Leu Ile Arg Tyr His Gly Phe Pro Glu Val Phe Ile Thr
GAC AGA ffff~ ATC CGA TAT CAC GGA ff~fC CCG GAA GTA ~C ~ ACA
Set ASn Tyr Trp Lys Thr Leu Met Gly Thr Ile Gly Ile Lys His
TCA AAC TAC TGG AAG ACG CTC ATG GGA ACG ~ GGA ATC AAG CAC
Pro Glu Thr ASp Gly Gln Thr Glu Arg Thr Asn Gln Thr Leu Glu
CCA GAG ACG GAT GGG CAA ACG GAA AGA ACG AAC CAG ACA CTC GAG
Asn Tyr Ala Gln Asp Asn Trp Val Set Leu Leu Pro Met Ala Gln AAC TAC GCA CAA GAC AAC TGG Gffrf TCA q~fA CTG CCA ~ C-CG CAG
Set Glu Thr Thr Set Thr Thr Pro Phe Met Arg Thr Leu Ala Arg
TCA GAG ACA ACT TCG ACG ACG CCA '±'±'±' ATG CGA ACT ffff~ GC~ AGG
Leu Asp Pro Thr His Arg Pro Arg Glu Gln Ser Stop
CTG GAT CCC ACC CAC AGG CCG AGA GAG CAA TCG TGA
AGA CTT AAG GT~ CTA CGC His Ala Lys Tyr Gly His Leu
CAC GCG AAA TAC C4~T CAC CTA
ASp Phe Ile Thr Lys Leu Pro
GAC q~2C ATT ACG AAA CTC CC~
Val Met Val ASp Arg Leu Thr
GTC ATG GTC GAC AGA CTC ACA
Gln Leu Gly Tyr Leu Val Leu
CAG CI~ C~A TAC CTC GTA CTG
Asp Arg ASp Lys Leu Phe Thr
GAC AGA GAC AAG CTC %aTC ACA
Lys Leu Ser Thr Ala Tyr His
AAG TI~ TCA ACA GCA TAC CAT
Gln Tyr Leu Arg His T!rr Ile
CAA TAC CTA CGG CAC TAC ATC
Ile Ala Leu Asn Asn His Lys
ATC GCA CTG AAC AAC CAC AAA
Thr Leu Thr Tyr Pro Glu His
ACC C�A ACC TAT CCG GAA CAC
CGACGGAAACTCT CAAGG~ CACAAAGAAG CAC~CA
Met Ala Pro Leu Leu Lys Glu
AGGCGATCGAAGATGCACAAC_AAAGACTGTC �CAACGACGGCAAGA~AAAA ATG GCA CCT C~G CTA AAA GAG
Gly Asp Lys Val Tlrr Leu Leu Thr Lys Asn Leu Lys Thr Arg Arg Gln Thr Lys Lys Leu Asp His
GC49 GAT AAG GTC TAT CTC CTC ACA AAA AAC CTG AAG ACA AGA AGA
Val Lys Val Gly Pro Phe Phe Ile Asp Lys Val Val Gly Pro Val
Gffrf AAG GTC GGA CCA 'I'I'I ~ 'i'I'I' 2~_TC GAC AAA GTC GTA GGA CCA GTC
Pro Asp Ala Lys Ile His Pro Val Phe His Ile Ser Lys Leu Glu CCT GAT GCG AAA ATC CAC CCG GTC TTC CAT ATC TCC AAG TTG GAA
Cys Gln Glu Set Phe His Phe Glu Pro Glu Ala Glu Asn Glu Phe
TGC CAG GAG AGC ~fC CAT TTC GAG CCG GAA GCA GAA AAC GAA '±'±'i'
Lys Lys Gly Gln Arg %marr Leu Val Lys Trp Lys Gly q~r Asp Glu
AAG AAG GGT CAG CGA TAC CffT GTC AAA TGG AAA GGA TAC GAC GAA
Arg Ile Ash Leu Ala ASh Cys Tyr Gln Leu Leu Arg Gln Phe Gln
AGA ~ AAT CTG GCG AAC TGC TAC CAG CIrf CI~ CGA CAG TTC CAG
Lys Gln Glu Ala Gln Glu Arg Arg Ala Ser Pro ASp Gln Thr Arg
AAA CAA GAA GCT CAG GAA CGG AGA GCA AGT CCG GAT CAG ACC AGA
Ala Arg Thr Lys Stop
Fig. 2. Sequence and deduced ORFs of CJT-1.712. The target-site duplications are in bold-type. The inverted repeats at the ends of the LTRs are underlined. The numbers at the left indicate nucleotide
CAG ACC AAG AAA CTG GAT CAC
Asn Tyr Arg Leu Arg Leu Pro
AAC TAC CGA CTA CGA CTA CCA
Pro Ala Asp Ale Glu Thr Pro
CCA GCG GAC GCC GAA ACA CCT
Glu Val Glu Lys Ile Leu Asp GAA GTC GAA AAA ~ CTA GAC
Set Glu Asn Thr Trp Glu Pro
TCA GAA AAT ACC ff~G GAA CCA
Lys Trp Arg Gln Asp Set Arg
AAG TGG CGC CAA GAT TCC CGG
Ser Arg Pro Lys Tyr Pro His
AGT CGT CCG AAA TAC CCT CAC
858
88O
9O2
924
946
968
990
i012
1034
1045
7
29
51
73
95
117
139
161
165
GCA AGG ACC /LAG TAG GAGGGGAAAGAGACAGCGAGC~AC~CCACGTCI~CGTCCGGAAGCTC~CAGCCrfC
CTCCACTG CGATCGCCTCGGTCTCTCGACGCTCTAGAAGCTCCAG CTCCc±'I'I C49AGACGGGCCTTCITfCT CCCAAG CGG CGGCCTC
TTCCTCCTGTAT CC9~ CGCACGAC~ CTTCCR~TATCcI'I'CGAAG CGAAGCCTC~TATG C f f r f ~ C T C C ~ ~
T CGACGATG CT C CGACAC~-I "I "i', "I'CTCGAGAGCCrCG CGCI~ACGGCG CAGACR]CGACCA-q-~CCG CCT~AAC~ACAqrfACA
GT ~CGA~T I'i'C c± T i~ZfACAGACACCACAC CGACCAGAC C CGACATGGACACGACAGACGACffrf CGT CT CG C C a A ~ A
CAGGGACGAACCATGATI~ CGCCCAGGG CITCG~I'I'I'±'CTCGGATTGACGAATG~A~ ~ ~ ~ ~ ~ C
ARX~ffff~AAGGAAAGAAGA~AG CCATGACGACCG CT CAACC CAAGGAAAC CT CGAffWA-DaT CATAAG CT CC<'~ACGAG CTAAG CTAGAG
AGGGG~ACGGATCGGACACGACCGAAGGAGC~TCAG CCAATCAATAG CCCGCACTAATGACAGGCG CGGC CCACAG CTAG
G CC~49 CACAG CCG~AGTG C~AGTG CCR~GAGTGCCTC~AGTGCCTACGGACTT~~~AT~ CA~ T ~ ' TI'Cc±'±'I GT2~'±TAC4~AACGCATCGTACCCTP~CGAGGCGTT CAACTCA~ C-AATA
TCACI~I'I-IC~AATACACCACAAC C CAACC�CTG CCTCACI~Cff~ ~ c±'±'±'ACAAGACGAGACAG CfACTG u'IwCATCACTCT
C C CTCACCI~CCAT CC~ C CACC~ACAAAAC CCG/ZPI~AG ~ T CIr9 CleAT ~ATCAff~ACaZTTATAC CAAAC CWAAC
~ATA~k~_%AC~CTCGAGG CCTTATAGAffWCTACTTCTACC~CTCTu'±'x'±'~ACfAGAAA.'±'I'±'ATACGCAAATACATAATAC CCCT
AATCCc ±' ± • I C=AOGTAAAG C CGACGACTAATAC~fAGGAGATAGAA~AC~fACACGG C3~A-q~2ATATACG CGG C C CS~fATAGACCrf
AC�I~AT CGG C~AqTfAffWACGAGCAAA~ATACTACGT CAC CGAC CP CGAATATAAT~ c ± TI 'AG C CTAC CC�TAG CTTATAG C
TATAAC CTAAACAT CI~C~fAGGAGATAGGCGASGTGGCA~TI'/~TCI~ CA~
positions, those at the right indicate numbering of amino acid residues in the respective open reading frames. The sequence has been submitted to EMBL as accession Z 11 866
sequence conservation. This may imply relatively recent acquisition of the CfT-1 elements within the C. fulvum genome.
The open readin9 frames
Three long ORFs, all on the same strand, were found when the sequence of CfT-1 was analysed (Fig. 1). The
first starts 99 bp from the 3' end of the 5' LTR. The predicted sequence of 639 amino acids has homology to 9a9 genes of retroviruses. In particular, amino acids 260-273 (Fig. 5) have the characteristics of a zinc-finger DNA-binding domain (C-X2-C-X�-C Covey 1986).
Retroelement genomes are typically transcribed into a long polycistronic mRNA which is translated into a polyprotein and, in turn, cleaved to give the final proteins
Length 427 bp
Duplicated sequences 712 TAATAATATAATATAG(113).(7084)TATAGAAAGGCTC at the insertion sites 691 TAACGGGCCTAGTACC . . . . . . . . . GTACCTTCTAGAA
Inverted repeats ( 114)TGTTACG CCTAACA(540)
Enhancer-like sequences and repeats (210) GAGTGCCTG
(241)
(252)
GAGTGCCTG GAGTGCCTG GAGTGCCTA Repeatl
GCCTACGGACTT GCCTATGTACTT SV40enhancerregion A
TAGGCACAG TAGGCACGA TAGGCACGG TAGGCACAG Repeat2
343
(461) TTCCA TTCCA Neurospora 5' enhancer TTCCA Tyl enhancer core
(528) ACCAAACCTAACAGCTTT ACCTCACCTAATGACTTT SV40 enhancer region B
(527) TACCA TACCA pho80 enhancer core
Primer binding sites First strand (539)5'.CAGCTTTGAACGCCCTCAA
Schizo~,,~cho~omyco=- I [ I I I I l ] l l pombe tRNA-Ser GUC AAACUUG
CU GAC. 5'
Second strand (6644) AGAGAGGGATGG
Fig. 3. Features of the long terminal repeats present in the Cladosporium fulvum transposon efT-1. Numbers refer to the nucleotide sequence in Fig. 2
1 2
4 .9
3.2
2 .7
Fig. 4. Hybridisation of pNOMP5 to PstI-(lane 1) and XhoI-(lane 2) digested C. fulvum genomic DNA, demonstrating the conservation of the PstI sites in the LTRs. The numbers indicate mol. wt. markers in kb
( K i n g s m a n and K i n g s m a n 1988). The t r ans l a t ion o f ret- rovi ra l and gypsy class r e t r o t r a n s p o s o n m R N A s requires r i b o s o m a l f rameshi f t ing f rom the gag O R F to the pol O R F . In CfT-1 , the pu ta t ive pol O R F over laps the gag O R F by 229 bases. The mos t l ikely pos i t i on for f rame- switching is the run o f 4As at pos i t i on 2534. The pol O R F extends for 1045 a m i n o acids. Sequences h o m o l o g o u s to p ro tease , reverse t ranscr ip tase , R N a s e H and in tegrase
Table 1. Distance matrix ,of reverse transcriptase sequences"
1 2 3 4 5 6 7 8 9 10 11
1 17.6 2 297 10 3 GYPSY 41 36 4 412 38 42 45 5 CAMV 46 47 47 48 6 DIRS 51 52 55 55 49 7 Cfr-1 43 44 51 51 52 8 HIV1 54 56 58 56 53 9 DEL 42 42 49 48 53
10 Ty3 37 37 39 47 43 11 TED 28 27 39 46 46 12 copia 67 66 63 66 65
51 59 56 55 45 61 53 50 58 47 53 43 57 48 63 64 67 67
44 63 67
The 79 amino acids identified by Poch et al. (1989) from the gypsy class retrotransposons, HIV-1, copia and CJT-1 were aligned and analysed using the PAUP program (Swofford 1990) " The numbers of non-identical amino acids between pairs of sequences were calculated and are displayed
are f o u n d in tha t order . Thus CfT-1 m a y possess all the enzymat i c act ivi t ies r equ i red for a u t o n o m o u s t rans- pos i t ion .
In each case the s t ronges t s imilar i t ies are to sequences o f r e t r o t r a n s p o s o n s in the gypsy g roup o f L T R re t ro-
344
Orf 1 Nucleic
CfT-I 258 Copia 230 Ty3-2 265 HIV-I 390
Off-2 Reverse
CfT-I 428 TED 402 17.6 319
Acid Binding Domain Orf-2 Protease Domain
RTCYGCGKPGHIARDCRG CfT-I 112 MIDSGASGNFA VKCHHCGREGHIKKDCFH Copia 290 VLDSGASDHLI RLCFTCKKEGHRLNECRA TED 38 LIDTGSTVNMT VKCFNCGKEGHTARNCRA 17.6 28 LIDTGSTVNMT
^ ^^ ^ ^ ^
Transcriptase Domain
KTAFRTRYGLYEFLVMPMGLTNAPASCQDLVNETLRDLLDVCWAYMDDIL KTAFNVEHGHFEFLRMPMGLKNSPSTFQRVMDNVLRGLQNNICLVYLDDII KTAFSTKHGHYEYLRMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIV ^^^^ ^ ^ ^^ ^^ ^ ^ ^^ ^ ^ ^^^
Orf-2 RNAse H Domain
CfT-I 557 NYDIHDKELLAIVAAM CfT-I TED 630 NYSTIEKELLAIVWAT TED 17.6 545 NYSTIEKELLAIVWAT Copia Gypsy 530 NYATNERELLAIVWAL HIV-I
^^ ^^^^6^ ^
Off-2 Endonuclease Domain
' 958 GIKHLSTAYH-PETDGQTERTNQTI 1031 KIIHHKTLPHSPSDNGNIERFHSTI 568 GISYHLTVPHTPQLNGVSERMIRTI 861 GIKQEFGIPYNPQSQGVVESMNKEL
^
Fig. 5. Alignments of ORF 1 and 2 sequences to retroelement protein sequences. Sequences were taken from Mount and Rubin 1985 (co- pia), Hansen et al. 1988 (TyJ); Ratner et al. 1985 (HIV-1); Friesen and Nissen 1990 (TED); Inouye et al. 1986 (17.6) and Marlor et al. 1986 (gypsy). The circumflex symbols indicate identi- cal amino acids. Numbers refer to coordinates in the deduced amino acid sequences
transposons (Doolittle et al. 1989). Poch et al. (1989) have identified a set of amino acids that are most con- served between reverse transcriptase and RNA-depen- dent D N A polymerases. Alignments of these amino acids from various retroelements with the homologous se- quences of CfT-1 were analysed phylogenetically using a parsimony algorithm (Swofford 1990). This analysis clearly demonstrates that CfT-1 lies within the enlarged gypsy group (Table 1) but failed to identify a convincing monophyletic tree for the gypsy group. Bootstrapping with a minimum of 80% confidence co-rooted all se- quences to HIV-1 with the exception of 17.6 and 297, two very closely related Drosophila retrotransposons (Inouye et al. 1986; Saigo et al. 1984). The amino acid sequences of CfT-1 are essentially equidistant from 17.6, 297, Del, TED and Ty3.
The third OR F starts 100 bp after pol ORF and ex- tends for 165 amino acids. A database search revealed no convincing homology. The relationship of this and other retrotransposon third ORFs to retroviral env genes in unclear.
The presence of strikingly homologous sequences, particularly as found in the pol ORF, in such highly diverse organisms as fungi, plants and insects may be regarded as primafacie evidence of horizontal gene trans- fer. The degree of homology is greater than would be expected of non-essential genes in organisms separated in evolution since ancestral divergence (Doolittle et al. 1989). An examination of codon usage within the CfT-1 open reading frames provides further evidence for the recent acquisition of these genes. The three ORFs of CfT-1 comprise a total of 1777 codons. The base usage frequency at the third wobble position is non-random, with A and C found to occur at frequencies of 32% and 29%, respectively; G is found at 24% of sites and T at 15% (Table 2). This pattern is found equally in ORFs 1 and 2 and to a lesser extent in the short ORF 3, and may be compared with the codon usage of the three sequenced C. fulvum cDNAs, Avr9, P1 and P17, as examples of "host" genes (van Kan et al. 1991; P.J.G.M. de Wit, personal communication). The third position bases are dominated by C (36%) followed by T (24%), G (21%) and A (19 %). This pattern of C > T > G > A is generally seen in the consensus of all filamentous fungal genes (Gurr et
Table 2. Third codon position bias in CJT-1 genes
A C G T No. codons
CJT-10RF1 28 32 25 15 645 CJT-10RF2 33 28 25 14 1045 CJT-10RF3 35 26 19 20 165 CJT-10FRs 1-3 32 29 24 15 1855 Cladosporium fulvum 19 36 21 24 381
cDNAs
The calculated percentages are shown of each base in the third position of codons for the three predicted CJT-10RFs, individually and together, and for the sum of the Avr9, P1 and P17 cDNAs (van Kan et al. 19991); P.J.G.M. de Wit, personal communication
al. 1987). This pattern contrasts markedly with the A > C > G > T pattern in the CfT- 1 open reading frames and suggests a recent extrachromosomal origin. Similar results were reported for the copia and 17.6 retrotrans- posons (Hansen et al. 1988). It is clear that a more widespread survey of the presence of retrotransposons and their codon usage is justified.
Race-specific polymorphisms
Evidence was sought that CfT- 1 might be responsible for some of the variation in pathotype between the physio- logical races of the fungus. For this purpose, southern blots of genomic D N A of various isolates, digested with BamHI and EeoRI, were probed with pNOMP3, the PstI fragment comprising the 3' half of the 5 'LTR and 2 kb of the 5' internal domain (Fig. 6). In each track, about 25 strong bands can be counted in addition to weaker bands corresponding to hybridisation to the 3' LTR. This finding, together with the frequency of positive cosmids, is consistent with a moderate copy number of about 25.
Race 4 differs from race 0 in that the putative avir~ ulence 4 gene is not expressed in race 4. It is intriguing to note that the race 4 hybridisation pattern (lanes 3 and 7) exhibits single extra bands when compared to race 0 (lanes 4 and 8). Race 2,4,5,9,11 differs in that all of the
23.1
9 .4
6 .6
1
4.4
2 3 4 5 6 7
Fig. 6. Hybridisation of pNOMP3 to EcoRI-(lanes 1-4) and Barn- HI-(lanes 5 8) digested genomic DNA. DNA from race 0 (lanes 4 and 8) is compared with race 4 (lanes 3 and 7), race 5 (lanes 2 and 6) and race 2,4,5,9,11 (lanes 1 and 5). Numbers to the left are mol. wt. markers in kb; Arrows highlight the polymorphisms between race 0 and race 4 DNA
Fig. 7. Transmission electron micrograph of virus-like particles (VLPs) in fraction 5 of the 10-80% sucrose gradient fractionation of a C. fulvum VLP preparation
345
known avirulences are unexpressed and about 5 extra bands are seen when compared to race 0 (lanes 1 and 5).
Virus-like particles as replicative intermediates durin9 C. fulvum retrotransposition
The retrotransposons Ty and copia encapsidate their genomic RNA and enzymatic pol functions into virus- like particles (VLPs) (Garfinkel et al. 1985; Mellor et al. 1985; Goreleva et al. 1989). In order to determine wheth- er CfT-1 is also encapsidated, VLP preparations were made from lysates of C. fulvum mycelium. The VLPs were separated on a 10-80% sucrose gradient, frac- tionated and pelleted. Each of the fractions was exam- ined for the presence of VLPs by electron microscopy. VLPs of between 40 and 50 nm in size and with electron- dense cores were repeatedly found in fraction 5 of the gradient (Fig. 7). The particles are strikingly similar to Ty particles observed in galactose-induced strains of yeast, containing the Gal-Ty element (Garfinkel et al. 1985). It is worth noting that Ty particles cannot be reliably detected in uninduced cells.
Each of the fractions of the sucrose gradient was assayed for reverse transcriptase (RT) activity by incor- poration of [Gt-32p]dCTP into polynucleotide, using oligo dC as primer and poly G as template (Fig. 8a). A peak of activity was found in fraction 5; coincidence of VLPs and RT activity was found in three separate experiments. The RT activity in the peak fraction was found to be dependent on Mg 2+, primer, and template, and was not supported by Mn 2 + ions (Table 3). Reverse transcriptase activity has not been demonstrated previously in filamen- tous fungi. The VLP fractions from the sucrose gradient were phenol-extracted and dot-blotted onto nitro- cellulose. The reverse transcriptase-encoding clone pNOMP5 hybridised primarily to the nucleic acid in fraction 5, containing the VLP peak (Fig. 8b). This fur-
16
rn 0
x
¢J
a
/.a. •
10
8
6 •
• .\/'J / 2 I--I 0 T I /i~q ~,~ 0 • •
0 1 2 3 4 5 6 7 8 9 10 11 12 VLP f rec t ions
b • o O 0 • • •
Fig. 8. a Reverse transcriptase activity in sucrose gradient fractions of C. fulvum VLP. b Dot-blot hybridisation of pNOMP5 to sucrose gradient fractions
346
T a b l e 3 . Reverse transcriptase activity (cpm) of fraction 5 from C. fulvum VLP sucrose gradient centrifugation
Complete 3804
- Mg z + 469 - Mg 2+ + Mn 2+ 187 - oligo dC 340 -po ly G 219 - enzyme 350
VLP, Virus-like particle The assays were performed methods
as decribed in the Materials and
ther suppor ts the view that CJT- l - re la ted elements are t ranscribed into R N A , which is packaged in VLPs tha t conta in reverse transcriptase.
C o n c l u s i o n
The results presented in this paper indicate tha t C.fulvum contains a modera t e number o f genomic elements with m a n y o f the conserved features o f re t ro t ransposons . These elements, which are represented by the c loned and sequenced 712 clone, have been designated CJT-1 ele- ments. The element appears to be expressed and pack- aged into VLP particles, bu t there is no direct evidence that the elements are still capable o f fur ther trans- posit ion. However , the un in te r rupted open reading frames and identical L T R s suggest tha t the c loned copy has t ransposed recently. This is the first repor t o f an L T R re t ro t ransposon in a f i lamentous fungus.
The origin o f the element is an intr iguing question. Its similarity to elements in diverse organisms and the biased c o d o n usage are suggestive o f hor izon ta l transfer. The role o f the element in p r o m o t i n g var ia t ion in pa thogen popula t ions , by cont r ibut ing to the b o o m - a n d - b u s t cycle whereby pa thogens rapidly overcome the resistance in new varieties o f crop plant is u n k n o w n but is under active study. The s tudy o f genetically uncharacter ised filamen- tous fungi is current ly severely hampered by a lack o f versatile genetic tools. The ability to use t ransposons to tag genes o f interest would be a significant advance.
Acknowledgements. We are grateful to A. Coddington for construct- ing the cosmid library, L. Flegg for technical assistance, Mark Coleman for helpful discussions and M. Oliver for help with the figures. This work was supported by grants to R.P.O. from the AFRC and the Gatsby Charitable foundation and from NERC for the sequencer and a studentship to M.T.M. from SERC.
R e f e r e n c e s
Bingham PM, Zachar Z (1989) Retrotransposons and the FB trans- poson from Drosophila melanogaster. In: Berg DE, Howe MM (eds) Mobile DNA. American Society for Microbiology, Wash- ington DC, pp 485-502
Boeke JD, Corces VG (1989) Transcription and reverse transcrip- tion of retrotransposons. Annu Rev Microbiol 43 : 403-434
Boeke JD (1989) Transposable elements in Saccharornyces cerevisiae. In: Berg DE, Howe MM (eds) Mobile DNA. Ameri- can Society for Microbiology, Washington DC, pp 403-434
Bozarth RF (1972) A new dimension in microbiology. In: Environ- mental health perspectives. United States Department of Health Education and Welfare, Washington, pp 23-39
Buck KW (1986) Fungal virology - an overview. In: Buck FW (ed) Fungal virology. CRC Press, Boca Raton, Florida, pp 1-84
Cappello J, Handelsman K, Lodish HF (1985) Sequence of Dictyo- stelium DIRS-I : An apparent retrotransposon with inverted terminal repeats and internal circle junction. Cell 43:105-115
Clare J, Frabaugh P (1985) Nucleotide sequence of a yeast Ty element: evidence for an unusual mechanism ofgene expression. Proc Natl Acad Sci USA 82:2829-2833
Clark D J, Bilanchone VW, Haywood L J, Dildine SL, Sandmeyer SB (1988) A yeast sigma composite element Ty3 has properties of a retrotransposon. J Biol Chem 263:1413-1423
Covey SN (1986) Amino acid sequence homology in 9ag region of reverse transcribing elements and the coat protein of canliflower mosaic virus. Nucleic Acids Res 14: 623-633
de Wit PJGM (1977) A light and scanning-electron microscope study of infection of tomato plants by virulent and avirulent races of Cladosporiumfulvum. Neth J Plant Pathol 83:109-122
Dinoor A, Eshed N, Nof E (1988) Puccinia coronata crown rust of oats and grasses. In: Sidhu GS (ed) Advances in plant pathol- ogy, vol 6. Academic press, New York, pp 333-344
Doolittle RF, Feng DF, Johnson MS, McClure MA (1989) Origins and the evolutionary relationships of retroviruses. Q Rev Biol 64:1-30
Dunsmuir P, Brorein WJ, Simon MA, Rubin GM (1980) Insertion of the Drosophila transposable element copia generates a 5 base pair duplication. Cell 21:575-579
E1-Sherbeini M, Tipper D J, Mitchell D J, Bostian KA (1984) Virus- like particle capsid proteins encoded by different L double- stranded RNAs of Saccharornyces cerevisial: their roles in main- tenance of M double-stranded killer plasmids. Mol Cell Biol 4:2818-2827
Errede B, Company M, Ferchak JD, Hutchinson CA III (1985) Activation regions in a yeast transposon have homology to mating type control sequences and to mammalian enhancers. Proc Natl Acad Sci USA 82:5423-5427
Fink GR, Boeke JD, Garfinkel DJ (1986) The mechanism and consequences of retrotransposition. Trends Genet 2:118-123
Friesen PD, Nissen MS (1990) Gene organisation and transcription of TED, a lepidopteran retrotransposon integrated within the baculovirus genome. Mol Cell Biol 10:3067-3077
Gardner RC, Howarth AJ, Hahn P, Brown-Leudi M, Shepherd RJ, Messing J (1981) The complete nucleotide sequence of an infec- tious clone of cauliflower mosaic virus by M13mp7 shotgun sequencing. Nucleic Acids Res. 9:2871-2887
Garfinkel D J, Boeke JD, Fink GR (1985) Ty element transposition: reverse transcriptase and virus-like particles. Cell 42:507-517
Geever RF, Case ME, Tyler BM, Buxton F, Giles NH (1983) Point mutations and DNA rearrangements 5' to the inducible oa-2 gene of Neurospora allow activator protein-independent tran- scription. Proc Natl Acad Sci USA 80:7298-7302
Gilliquet V, Legrain M, Hilger F (1987) Sequence of the region 5' to the negative regulatory gene PH080 of Saccharomyces cerevisiae. Nucleic Acids Res 15 : 5893
Goreleva TV, Resnik NL, Schuppe NG (1989) Retrotransposon transposition intermediates are encapsidated into virus-like particles. FEBS Lett 244:307-310
Grandbastien MA, Spielman A, Caboche M (1988) Tnt-l, a mobile retroviral-like element of tobacco isolated by plant cell genetics. Nature 337:376-380
Gurr SJ, Unkles SE, Kinghorn JR (1987) The structure and or- ganisation of nuclear genes of filamentous fungi. In: Kinghorn JR (ed) Gene structure in eukaryotic microbes. IRL Press, Washington, pp 117-127
Hansen LJ, Chalker DL, Sandmeyer SB (1988) Ty3, a yeast retro- transposon associated with tRNA genes has homology to ani- mal retroviruses. Mol Cell Biol 8:5245-5256
Harling R, Kenyon L, Lewis BG, Oliver RP, Turner JG, Cod- dington A (1988) Conditions for efficient isolation and regenera-
347
tion of protoplasts from Fulvia fulva. J Phytopathol 122:143-146
Hodgson CP, Fisk RZ (1987) Hybridization probe size control: optimised "oligolabelling". Nucleic Acid Res 5:6295-6296
Hull R, Sadler J, Longstaff M (1986) The sequence of the Carnation Etched Ring Virus DNA: comparison with cauliltower mosaic virus and retroviruses. EMBO J 5:3083-3090
Inouye S, Yuki S, Saigo K (1986) Complete nucleotide sequence and genome organisation of a Drosophila transposable element 297. Eur J Biochem 154:417-425
Kikuchi Y, Ando Y, Shiba T (1986) Unusual priming mechanism of RNA-directed DNA synthesis in copia retrovirus-like parti- cles of Drosophila. Nature 323: 824-826
Kingsman AJ, Kingsman SM (1988) Ty: a retroelement moving forward Cell 53 : 333-334
Kinsey JA, Helber J (1989) Isolation of a transposable element from Neurospora crassa. Proc Natl Acad Sci USA 86:1929-1933
Kohli J, Munz P, Aebi R, Amstutzt H, Gysler C, Heyer WD, Lehmann L, Schuchert P, Thuriaux P, Leupold U, Bell J, Gam- ulin V, Hottinger H, Pearson D, Soell D (1984) InteralMic and intergeneic conversion in three serine tRNA genes of Sehizosaccharomyces pomnbe. Cold Spring Harbor Symp Quant Biol 49:31-40
Levin HL, Weaver DC, Boeke JD (1990) Two related families of retrotransposans from Sehizosaceharomyees pombe. Mol Cell Biol 10:6791-6798
Lindhout P, Korta W, Cislik M, Vos I, Gerlagh T (1989) Further identification of races of CIadosporiumfulvum (Fulviafulva) on tomato originating from the Netherlands, France and Poland. Neth J Plant Pathol 95:43-148
Maniatis T, Fritsch EF, Sambrook J (1982) Molecular cloning: a laboratory manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York
Marlor R, Parkhurst S, Corces V (1986) The Drosophila melanogas- ter gypsy transposable element encodes putative gene products homologous to retroviral proteins. Mol Cell Biol 6:1129-1134
McHale MT, Roberts IN, Talbot NJ, Oliver RP (1989) Expression of reverse transcriptase genes in Fulviafulva. Mol Plant-Microbe Interact 4:165-168
Mellor J, Malim MH, Gull K, Tuite MF, McCready S, Dib- bayawan T, Kingsman SM, Kingsman AJ (1985) Reverse tran- scriptase activity and Ty RNA are associated with virus-like particles in yeast. Nature 318:583-586
Michel F, Lang BF (1985) Mitochondrial class II introns encode proteins related to the reverse transcriptases of retroviruses. Nature 316:641-642
Mount SM, Rubin GM (1985) Complete nucleotide sequence of the Drosophila transposable element copia: homology between co- pia and retroviral proteins. Mol Cell Biol 5:1630-1638
Norrander J, Kemp T, Messing J (1983) Construction of improved M13 vectors using oligodeoxynucleotide-directed mutagenesis. Gene 26:101-106
Poch O, Sauvaget I, Delarue M, Tordo N (1989) Identification of four conserved motifs among the RNA-dependent polymerase encoding elements. EMBO J 8:3867-3874
Punt P J, Oliver RP, Dingemanse MA, Pouwels PH, van den Hondel CAMJJ (1987) Transformation of Aspergillus based on the
hygromycin B resistance marker from Escherich& coli. Gene 56:117-124
Raeder U, Broda P (1985) Rapid preparation of DNA from fil- amentous fungi. Lett Appl Microbiol 1 : 17-20
Ratner L, Haseltine W, Patarca R, Livak K J, Starciach B, Josephs SF, Doran ER, Rafalski JA, Whitehorn EA, Baumeister K, Ivanoff L, Petteway SR, Pearson ML, Lautenberger JA, Papas TS, Ghrayeb J, Chang NT, Gallo RC, Wong-Stahl F (1985) Complete nucleotide sequence of the AIDS virus, HTLV-III. Nature 313:277-283
Saigo K, Kugimiya W, Matsuo Y, Inouye S, Yoshioka K, Yuki S (1984) The identification of the coding sequence for a reverse transcriptase-like enzyme in a transposable genetic element in Drosophila melanogaster. Nature 312: 659-661
Sentry JW, Smyth DR (1989) An element with LTRs and its variant arrangements in the genome of Lilium henryi. Mol Gen Genet 215: 349-354
Shiba T, Saigo K (1983) Retreovirus-like particles containing RNA homologous to the transposable element copia in Drosophila melanogaster. Nature 302:119-123
Smyth DR, Kalitsis P, Joseph JL, Sentry JW (1989) A plant retro- transposon from Lilium henryi is related to Ty3 of yeast and the gypsy group of Drosophila. Proc Natl Acad Sci USA 86:5015-5019
Swofford DL (1990) PAUP: Phylogenetic analysis using parsimory. Version 3.0. Computer program distributed by the Illinois Nat- ural History Survey, Champaign, Illinois
Talbot NJ, Coddington A, Oliver RP (1991) Pulsed field gel electro- phoresis reveals chromosome length polymorphisms between strains of Cladosporium fulvurn (syn. Fulvia fulva). Mol Gen Genet 229 : 267-272
Tanda S, Shrimpton AE, Ling-Ling C, Itayama H, Matsubayashi K, Saigo K, Tobari YN, Langley CH (1988) Retrovirus-like features and site specific insertions of a transposable element tom in Drosophila ananassae. Mol Gen Genet 214:405-411
van Kan JAL, Van den Ackerveken GFJM, De Wit PJGM (1991) Cloning and characterisation of a cDNA of avirulence gene avr9 of the fungal pathogen C. fulvumn; causal agent of tomato leaf mold. Mol Plant Microbe Interact 4: 52-60
Voytas DF, Ausubel FM (1988) A copia-like transposable element family in Arabidopsis thaliana. Nature 336:242-244
Weber F, De Villiers J, Schaffner W (1984) An SV40 "enhancer trap" incorporates exogenous enhancers or generates enhancers from its own sequences. Cell 36:983-992
Xiong Y, Eickbush HT (1988) Similarity of reverse transcriptase- like sequences of viruses, transposable elements and mitochon- drial introns. Mol Biol Evol 6:675-690
Xu H, Boeke JD (1990) Localisation of sequences required in cis for yeast Tyl element transposition near the long teminal repeats - analysis of mini-Tyl elements. Mol Cell Biol 10:2695-2702
Yuki S, Inouye Y, Ishimaru S, Saigo K (1986) Nucleotide sequence characterisation of a Drosophila retrotransposon 412. Eur J Biochem 158:403M10
C o m m u n i c a t e d by D.J . F i n n e g a n