|
|
|
|
Name: Tn5393 |
|
Family: Tn3 Group: Tn163 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Erwinia amylovora | Molecular Source: | plasmid pEa34 |
| | Date of Isolation: | 1993 |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 81 bp) | | GGGGTCGTTTGCGGGAGAGGGCGAAATCCTACGCTAAGGCTTTGGCCAACGATATTCTCCGGTAAGATTGATGTGTTCCCA |
IRR (Length: 40 bp) | | GGGGTCGTTTGCGGGAGGGGGCGGAATCCTACGCTAAGGC |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCGTTT GCGGGAGAGG GCGAAATCCT ACGCTAAGGC TTTGGCCAAC GATATTCTCC GGTAAGATTG ATGTGTTCCC AGGGGATAGG AGAAGTCGCT 100
TGATATCTAG TATGACGTCT GTCGCACCTG CTTGATCGCG GCCGCGATAG CTAGATCGCG TTGCTCCTCT TCTCCATCCG CGTTCCAAGC TGCGGAAAGG 200
CACCCATAAG CGTACGCCTG GTCGAGCAGG CGACGCGGAT CGACGTCCAG CGCACGAGAG AATGCGTCCG CCATCTGTGC AATGCGTCTA GGATCGAGAC 300
AAAGGTCGTC TCTGTCAGCC GGATCGTAGA ACATATTGGC GGCGCCAAAG CCCACTTCAC CGACCAGACC GACGGGATCT ATCACCAGCC AGCCGCGACT 400
GGAGAACATG ATGTTTTCAT GATGCAGATC GCCATGTAGC CCACGCAGTT CCGAGGCATT GCTCATCATT TGATCGGCTA TAATCGCCGC GTGGACGTAG 500
TCAGTTTGAC AACCTGCGTT TTGATCATCG CGCGCCCGCT GAAACAAAGC TGCAAAGCGA TCCCGGATCG GGAGAAGGGC AGAAGGCAGG GGTTCCTCAG 600
ATGCGGCATA CAGCTTCGCC ATTAGTTCCG CTGCAATTTC GGTCGCCTGG TAGTCGCCGT GCTCGGCAAC GATGTGAGAG AGCATTCGCT CCCCGGCATA 700
TTCGAGCAAC ATCAGATTGT TCTCACGACC GAGCAACCGG ACTGCTCCCC TCCCATTGCG CCATACCAGA TAGTCGGCCC CGCGCAGTTC ATCAGCAATG 800
TCTTCTATAG GTTTCAATCC CTTGACGATT GCAGGAGTCC CGTCTGGCAA TGAAACTTTC CAAACGAGGC TGGAAAAGGT GTCCGCAATG AGAACAGGTT 900
GCGAAACGTG CCAATGAGCA GGAAAAACAG GCGGCATGAA CATCAACCCC AAGTCAGAGG GTCCAATCGC AGATAGAAGG CAAGGCGTTC GCGGTCGGGG 1000
GCTTCGATCC CCAATACATT GAATAGGACA GCGAAGGCGC GCTCTGCTTC ATCTGGCGCT GCCCAGTTCT CTTCGGCGTT AGCAATCATG AGTGCCAAAT 1100
CGGCATAGCG ATCTGCTGTT CCGAGCCGCC CAAGGTCGAT CAGACCCGTG CATTGAAGAG TTTTAGGGTC CACCATGAAG TTCGGCATGC AGGGATCACC 1200
ATGGCAAACA ACCATATCGG TGCGCTCTTG GTCGAGCCGC ACCGGTAGCT CTCGTTCGAC ACGAGCCAAA AGATCGAGCT GCGGCGTACT CTTGTCCTCG 1300
TCCGGTAAGA AGTCGGGATT GACGGCATTG CGGGACACCA CATCAACGGC GCGTCCGAAC ATTCGCGACA GCCTGCGCTC AAACGGACAT TGATCAACCG 1400
ATAGGCTGTG AACAGCGCCA AGTTGCTGCC CCATTGACGG CCACGCTTTG AGCAAATCCG CTCCAGACAG ATCAGCCGCC GGTACTCCCG GAATTGCCGT 1500
TATCACCAAG CATGCACCCT CCTGTTCCTC CTGCCAGTTG ATCACCTCGG GGCAAGCCAC ACCTCGACCT TTGAGCCAAA TGAGGCGGTC ACGCTCTCCA 1600
GCGAGCTCAC CGCGGCGGGA AGCAGGTGCG ATTTTCGCGA AGGCATGCCC GTCACCACGT CGAAAAACAA AATCACCAGA TTCTCCGCCT CTGACAGGCA 1700
ACCAGTCAGA ATGCGATTCA CCAAAAAAAA TATTAGTTCG ATTCAATGGA GGTTCCTTCA GTTTTCTGAT GAAGCGCGGA GGTGGCTCAA CCTGCGAAAA 1800
GAAACGAGTT GCTATGGACT TGCACCGGTT GTGTTCCGGT CTATCTCTCA TTTTAAGCGG CTTTTTTCTC AAATGCCACC GGCGATTTCC AGCCGAGTGT 1900
TGAATGTCTT CGGCGTGGAT TATAAAAGCC ATTTATATAT TCGAAGATTG CAATCTCAAT ATCTCGCCTT GTTTGCCAGT GTCTGCGCCA AATCAACTCA 2000
GCCTTTAATG ATTTAAAGAA GCTTTCTACT GCGGAGTTAT CAAAACAATT GCCTTTCCCG CTCATGGACG GCAGCAATTG ATGTTTGAGC AGTAGCTTTT 2100
GATATTCATG AGCGCAATAT TGGCTCCCAC GGTCTGTGTG TTGAATACAA CCCGGTGGTG GTTTGCGTAA AGCCAACGCC ATATTCAGTG CCCTTAATGC 2200
AAGATCCTGC TTTAATCGAT CACCTGTTGC CCAGCCAATC ACGCGACGGG AATACAGGTC AAGGATAACA GCAAGATAGA CCCATCCTTC TCTGGTCCAA 2300
ACATAAGTGA TATCGCCTGC CCATTTCTGG TTGGGTGCGC TTGCGCTAAA GTCTTGTTTT AATAGGTTCG GTGCAATGTT GAAGGTATGA TGACTATCCG 2400
TTGTCCGTTT GAATTTACGC GTTCGAACAA CTGTAATGTT ATTCTGGCGC ATCAAACGTC CAACCCGACG CTGCCCAACC TGCAGGCCCA GCGCTTTCAA 2500
CTCTTCTGTC ATACGCGGCC TACCATAGCT CCCCAAACAC AACCGATGCT GCTCACGTAT ATGCGCTAGA AGTATAAGAT CACGACGCTG GCGCAGTGAT 2600
GGAGGACGGC GTTTCCATGC ACGTAAACCA CGATCTGTTA CGCCCATCAA ACGACATATG CGTGAACGTG AGAGAGAGCC ACGGTAATCC GTAATAAACT 2700
GAAATCTCAC AGCTTTTGTA CTGCGAAAAA TATTGCTGCC TTTTTTAATA TCTCCCTCTC CTCCCGAAGG ATACGGTTCT CTTTGCGTAA ACGTTCATTC 2800
TCACGCAGAA GATCAGTGTC TTGGGTAGGA ATTTTAGTTT CATCGGAAAT TGATGCGATC CATTTCCCAA GCGTGGAAAG CCCAATACTT AAATCTGACG 2900
CAACTTGACG GCGTGTTAAG CCACTAGTGA GTGCTATGCG AACTGCATCA CGCTTAAATT CATCACTATG CTTTAATGAC ATATATGGTC TCCTTGATAA 3000
CAAATAATGC TCTCAAAAAG ACCGGAACGA AACCGAGGCA AGTCCACTAC GTAAGTCCGA GAACATGCTT TCCATGGTCT CTGAGCTCGC CTTTGGGACC 3100
GACATATCGG TAGAGAGTGA CGCGCTCGAT GCCGAGTTCC TTGCAGAGAT CGGAAACTGA AGTATCGCGC TGGGCCATGG CGGCTTGCGC GAGACGCACC 3200
TGAGCTTTGG TGAGCGCGAA TTTTCGTCCG CCCTTGCGAC CGCGCGCTCT CGCGGAGGCG AGACCCGCCA TGGTGCGCTC TCGGATCAGA TCCCGCTCGA 3300
ACTCGGCCAA GGTGGCGAAG ATTCCGAACA CCATGCGACC GGACGCAGTC GTGGTGTCGA TCTGAGCGCC CTTTCCAGTC AGAACCCGCA GGCCGATCTT 3400
GCGGTCTGAC AGCTCCTTCA CCGTGTTGAC CAGATGGGCA AGCGATCGTC CGAGGCGATC GAGCTTCCAG ACCACCAGCA CATCGCCGTC ACGCAATGAC 3500
TTGAGGCAGG CAGTCAAGCC AGGGCGATCA TCACGACCGC CGGAAGCAAG ATCATCATAG ATATTGTCCC GTTCGACACC TGCGGCGCGC AAGGCGTCGT 3600
GCTGCAGGTC GAGAGACTGC GAGCCATCGG CTTTGGAGAC GCGGGCATAT CCGATCAGCA TGTATCACAA ACGTTGGTTT GAGGCGGCGC TTCGGCCACG 3700
ATTGCATTGA CCTCTGGAAA TGTATCTCAA CCAGCTTCAT AAACAAAGCG TCTTGAACGC TATCAGATTT TGAAAAAGGA ACATGTATGC CGCGTCGCGT 3800
CACTCTAACC GATCGGCAGA AAGACGCGCT GTTGCGCTTG CCGACTTCAC AGACGGATTT GCTCAAGCAC TATACGCTGA GTGATGAAGA CCTTGGGCAT 3900
ATCAGGCTGC GTCGGCGCGC TCACAACAGG TTCGGCTTCG CCCTGCAATT GTGTGTCCTG CGCTATCCCG GCCGGGTGCT GGCTCCAGGC GAACTGATCC 4000
CTGCAGAGGT CATCGAATTT ATCGGAGCGC AGCTTGGCCT GGGTGCCGAC GATCTCGTAG ACTATGCTGC CCGCGAGGAA ACACGGCACG AGCATCTTGC 4100
CGAGTTACGG GGGCTCTACG GCTTCCGCAC CTTCTCCGGA CGTGGTGCGA GCGAGCTGAA GGAATGGTTG TTCCGAGAAG CCGAGATGGC GGTGTCGAAC 4200
GAGGATATCG CCCGTCGCTT CGTAGCCGAG TGCCGACGCA CCCGCACTGT CCTTCCCGCG ACATCCACGA TCGAGCGGCT TTGTGCCGCG GCTCTCGTCG 4300
ATGCCGAGCG ACGCATCGAG ACGAGGATCG CCAGTCGGCT GCCTATGTCG ATCCGAGAAC AGTTGCTGGC ATTGCTCGAG GAGACGGCTG ATGATCGGGT 4400
GACCCGTTTT GTGTGGCTGC GCCAGTTCGA GCCTGGCTCG AACTCTTCGT CGGCCAACCG GCTGCTCGAC CGGCTCGAAT ATCTGCAACG CATCGATCTC 4500
CCCGAGGATC TGCTTGCCGG CGTTCCTGCC CATCGGGTGA CTCGTCTGCG CAGGCAGGGT GAACGGTATT ATGCCGACGG CATGCGCGAT CTCCCGGAGG 4600
ACAGGCGGCT TGCGATCTTG GCTGTTTGCG TCTCGGAATG GCAGGCGATG TTGGCCGACG CAGTGGTCGA AACCCACGAC CGGATCGTCG GCCGTCTCTA 4700
CCGTGCTTCG GAGCGTATTT GCCATGCAAA GGTCGCAGAC GAAGCGGGGG TGGTGCGTGA CACCCTGAAA TCCTTCGCCG AGATCGGGGG CGCCCTGGTC 4800
GATGCACAGG ATGATGGCCA GCCGCTGGGC GATGTCATCG CGAGTGGGTC AGGGTGGGAC GGCTTAAAAA CCCTTGTTGC AATGGCAACC AGGCTGACCG 4900
CCACCATGGC CGACGATCCG CTCAATCATG TGCTCGACGG TTATCACCGC TTCCGCCGAT ACGCTCCACG CATGTTGCGC CTGCTCGATC TGCGAGCTGC 5000
GCCCGTTGCA CTGCCGCTTC TGGAAGCGGT GACGGCCCTT CGTACCGGTT TGAACGATGC CGCGATGACC AGCTTCTTGC GGCCCAGCTC GAAATGGCAT 5100
CGCCACCTTC GGGCCCAGAG GGCTGGCGAC GCTCGCCTAT GGGAGATCGC GGTGCTGTTC CATCTGCGCG ATGCGTTCCG CTCCGGAGAT GTCTGGCTTA 5200
CTAGGTCCCG GCGCTATGGC GATCTGAAAC ACGCACTCGT TCCGGCACAA TCCATCGCGG AAGGCGGTCG TCTCGCTGTG CCATTGCGGC CGGAGGAATG 5300
GCTGGCAGAC CGGCAAGCTC GCCTCGACAT GCGGTTGCGC GACGTTGGCC GTGCCGCTCG CGCAGGCACG ATCCCGGGCG GGTCGATTGA AAACGGCGTT 5400
CTGCATATCG AGAAACTCGA AGCCGCCGCG CCGACAGGCG CCGAAGATCT GGTGCTCGAT CTCTACAAGC AGATCCCGCC CACGCGCATC ACCGATCTCC 5500
TGCTGGAGGT GGATGCGGCG ACCGGCTTCA CCGAAGCGTT CACCCATCTG CGCACAGGAG CACCCTGCGC TGACCGGATC GGGCTAATGA ACGTTATCTT 5600
GGCGGAAGGG ATCAACCTCG GCTTGCGCAA AATGGCGGAT CGGACAAACA CCCACACCTT CTGGGAATTG ATCCGCATTG GACGGTGGCA TGTCGAGGGC 5700
GAAGCCTATG ACCGGGCGCT GGCCATGGTG GTCGAGGCAC AGGCAGCGTT ACCCATGGCC CGGTTCTGGG GCATGGGCAC GTCGGCTTCG AGCGACGGAC 5800
AGTTCTTCGT CGCTACAGAG CAAGGTGAGG CCATGAACCT GGTCAACGCG AAATATGGCA ATACCCCGGG CCTGAAAGCC TATAGCCACG TCTCCGACCA 5900
ATATGCGCCG TTCGCAACCC AGGTGATTCC TGCAACGGCA AGCGAAGCGC CTTACATCCT CGATGGCCTG CTGATGAACG ATGCTGGACG CCATATCCGC 6000
GAGCAGTTCA CCGACACGGG CGGCTTCACC GATCACGTCT TTGCCGCATG TGCCATTCTC GGCTACCGGT TCGCTCCGCG CATCCGCGAC CTGCCATCCA 6100
AACGGCTCTA CGCGTTCAAT CCGTCGGCCG CCCCGGCGCA CCTGCGAGCG TTGATCGGCG GAAAGGTCAA CCAAGCCATG ATCGAGCGCA ATTGGCCCGA 6200
CATCCTGCGC ATCGCCGCCA CCATTGCTGC CGGGACCGTC GCGCCAAGCC AGATTCTGCG GAAACTCGCC TCCTATCCGC GGCAGAACGA GCTCGCGACA 6300
GCCCTGCGGG AAGTCGGTCG CGTCGAGCGC ACCCTGTTCA TGATCGACTG GATTCTGGAT GCCGAACTCC AACGGCGTGC CCAGATCGGG CTCAACAAAG 6400
GCGAACGTCA TCATGCGCTG AAGCGGGCAA TCAGCTTCCA CCGCCGCGGT GAAATCCGCG ACCGTTCCGC CGAAGGCCAG CATTACCGCA TCGCCGGCAT 6500
GAATCTGCTC GCCGCCATCA TCATCTTCTG GAACACCATG AAGCTCGGCG AGGTCGTTGC AAACCAGAAA CGCGATGGAA AGCTGCTATC GCCCGATCTC 6600
TTGGCCCATG TTTCGCCGCT CGGATGGGAA CACATCAATC TCACCGGAGA ATATCGCTGG CCAAAGCCTT AGCGTAGGAT TCCGCCCCCT CCCGCAAACG 6700
ACCCC
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
res |
3565-3786 |
222 |
TGTCCCGTTC GACACCTGCG GCGCGCAAGG CGTCGTGCTG CAGGTCGAGA GACTGCGAGC CATCGGCTTT GGAGACGCGG GCATATCCGA TCAGCATGTA TCACAAACGT TGGTTTGAGG CGGCGCTTCG GCCACGATTG CATTGACCTC TGGAAATGTA TCTCAACCAG CTTCATAAAC AAAGCGTCTT GAACGCTATC AGATTTTGAA AAAGGAACAT GT |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
APH(6)-Id (ARO:3002660) |
Tn5393 |
107-952 |
Passenger Gene |
Antibiotic Resistance |
- |
APH(3'')-Ib (ARO:3002639) |
Tn5393 |
943-1746 |
Passenger Gene |
Antibiotic Resistance |
- |
orfAB |
IS1133 |
1853-2982 |
Transposase |
|
- |
orfB |
IS1133 |
1853-2710 |
Transposase |
|
- |
orfA |
IS1133 |
2707-2982 |
Accessory Gene |
Regulator |
- |
tnpR |
Tn5393 |
3050-3661 |
Accessory Gene |
Resolvase |
- |
tnpA |
Tn5393 |
3787-6672 |
Transposase |
|
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
APH(6)-Id (ARO:3002660) |
APH(6)-Id |
Tn5393 |
846 |
107-952 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic inactivation (ARO:0001004) |
Target: | aminoglycoside antibiotic (ARO:0000016) |
Sequence Family: | APH(6) (ARO:3000151) |
Comment: | strB, orfI || strict match to reference sequence for ARO:3002660 (bitscore: 568) |
Protein Sequence:
|
MGLMFMPPVF PAHWHVSQPV LIADTFSSLV WKVSLPDGTP AIVKGLKPIE DIADELRGAD YLVWRNGRGA VRLLGRENNL MLLEYAGERM LSHIVAEHGD YQATEIAAEL MAKLYAASEE PLPSALLPIR DRFAALFQRA RDDQNAGCQT DYVHAAIIAD QMMSNASELR GLHGDLHHEN IMFSSRGWLV IDPVGLVGEV GFGAANMFYD PADRDDLCLD PRRIAQMADA FSRALDVDPR RLLDQAYAYG CLSAAWNADG EEEQRDLAIA AAIKQVRQTS Y
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
APH(3'')-Ib (ARO:3002639) |
APH(3'')-Ib |
Tn5393 |
804 |
943-1746 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic inactivation (ARO:0001004) |
Target: | aminoglycoside antibiotic (ARO:0000016) |
Sequence Family: | APH(3'') (ARO:3000127) |
Comment: | strA, orfH || perfect match to reference sequence for ARO: 3002639 |
Protein Sequence:
|
MNRTNIFFGE SHSDWLPVRG GESGDFVFRR GDGHAFAKIA PASRRGELAG ERDRLIWLKG RGVACPEVIN WQEEQEGACL VITAIPGVPA ADLSGADLLK AWPSMGQQLG AVHSLSVDQC PFERRLSRMF GRAVDVVSRN AVNPDFLPDE DKSTPQLDLL ARVERELPVR LDQERTDMVV CHGDPCMPNF MVDPKTLQCT GLIDLGRLGT ADRYADLALM IANAEENWAA PDEAERAFAV LFNVLGIEAP DRERLAFYLR LDPLTWG
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
orfAB |
OrfAB |
IS1133 |
1130 |
1853-2982 |
- |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | fusion protein from -1 programmed frameshifting between orfA and orfB |
Protein Sequence:
|
MSLKHSDEFK RDAVRIALTS GLTRRQVASD LSIGLSTLGK WIASISDETK IPTQDTDLLR ENERLRKENR ILREEREILK KAAIFFAVQK L*DFSLLRIT VALSHVHAYV V*WA*QIVVY VHGNAVLHHC ASVVILYF*R IYVSSIGCVW GAMVGRV*QK S*KRWACRLG SVGLDV*CAR ITLQLFERVN SNGQRIVIIP STLHRTY*NK TLAQAHPTRN GQAISLMFGP EKDGSILLLS LTCIPVA*LA GQQVID*SRI LH*GH*IWRW LYANHHRVVF NTQTVGANIA LMNIKSYCSN INCCRP*AGK AIVLITPQ*K ASLNH*RLS* FGADTGKQGE ILRLQSSNI* MAFIIHAEDI QHSAGNRRWH LRKKPL
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
orfB |
OrfB |
IS1133 |
858 |
1853-2710 |
- |
Class: | Transposase |
Function: | Nucleic acid binding (GO:0003676) |
Transpoase Chemistry: | DDE |
Comment: | frameshift required| 5'-end of tnp gene |
Protein Sequence:
|
MRFQFITDYR GSLSRSRICR LMGVTDRGLR AWKRRPPSLR QRRDLILLAH IREQHRLCLG SYGRPRMTEE LKALGLQVGQ RRVGRLMRQN NITVVRTRKF KRTTDSHHTF NIAPNLLKQD FSASAPNQKW AGDITYVWTR EGWVYLAVIL DLYSRRVIGW ATGDRLKQDL ALRALNMALA LRKPPPGCIQ HTDRGSQYCA HEYQKLLLKH QLLPSMSGKG NCFDNSAVES FFKSLKAELI WRRHWQTRRD IEIAIFEYIN GFYNPRRRHS TLGWKSPVAF EKKAA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
orfA |
OrfA |
IS1133 |
276 |
2707-2982 |
- |
Class: | Accessory Gene |
Sub Class: | Regulator |
Function: | transposase activity (GO:0004803) |
Comment: | orfA, together with orfB, encodes a fusion protein formed by translational frame-shifting that is the transposase for IS1133 |
Protein Sequence:
|
MSLKHSDEFK RDAVRIALTS GLTRRQVASD LSIGLSTLGK WIASISDETK IPTQDTDLLR ENERLRKENR ILREEREILK KAAIFFAVQK L
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
Tn5393 |
612 |
3050-3661 |
- |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Function: | recombinaseactivity (GO:0000150) |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MLIGYARVSK ADGSQSLDLQ HDALRAAGVE RDNIYDDLAS GGRDDRPGLT ACLKSLRDGD VLVVWKLDRL GRSLAHLVNT VKELSDRKIG LRVLTGKGAQ IDTTTASGRM VFGIFATLAE FERDLIRERT MAGLASARAR GRKGGRKFAL TKAQVRLAQA AMAQRDTSVS DLCKELGIER VTLYRYVGPK GELRDHGKHV LGLT
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
Tn5393 |
2886 |
3787-6672 |
+ |
Class: | Transposase |
Function: | transposase activity (GO:0004803) |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MPRRVTLTDR QKDALLRLPT SQTDLLKHYT LSDEDLGHIR LRRRAHNRFG FALQLCVLRY PGRVLAPGEL IPAEVIEFIG AQLGLGADDL VDYAAREETR HEHLAELRGL YGFRTFSGRG ASELKEWLFR EAEMAVSNED IARRFVAECR RTRTVLPATS TIERLCAAAL VDAERRIETR IASRLPMSIR EQLLALLEET ADDRVTRFVW LRQFEPGSNS SSANRLLDRL EYLQRIDLPE DLLAGVPAHR VTRLRRQGER YYADGMRDLP EDRRLAILAV CVSEWQAMLA DAVVETHDRI VGRLYRASER ICHAKVADEA GVVRDTLKSF AEIGGALVDA QDDGQPLGDV IASGSGWDGL KTLVAMATRL TATMADDPLN HVLDGYHRFR RYAPRMLRLL DLRAAPVALP LLEAVTALRT GLNDAAMTSF LRPSSKWHRH LRAQRAGDAR LWEIAVLFHL RDAFRSGDVW LTRSRRYGDL KHALVPAQSI AEGGRLAVPL RPEEWLADRQ ARLDMRLRDV GRAARAGTIP GGSIENGVLH IEKLEAAAPT GAEDLVLDLY KQIPPTRITD LLLEVDAATG FTEAFTHLRT GAPCADRIGL MNVILAEGIN LGLRKMADRT NTHTFWELIR IGRWHVEGEA YDRALAMVVE AQAALPMARF WGMGTSASSD GQFFVATEQG EAMNLVNAKY GNTPGLKAYS HVSDQYAPFA TQVIPATASE APYILDGLLM NDAGRHIREQ FTDTGGFTDH VFAACAILGY RFAPRIRDLP SKRLYAFNPS AAPAHLRALI GGKVNQAMIE RNWPDILRIA ATIAAGTVAP SQILRKLASY PRQNELATAL REVGRVERTL FMIDWILDAE LQRRAQIGLN KGERHHALKR AISFHRRGEI RDRSAEGQHY RIAGMNLLAA IIIFWNTMKL GEVVANQKRD GKLLSPDLLA HVSPLGWEHI NLTGEYRWPK P
|
|
Internal Transposable Elements (TE) |
|
|
TnCentral Accession |
TE Name |
Type |
Coordinates |
Length |
IS1133-Z12167 |
IS1133 |
Insertion Sequence |
1815-3046 |
1232 |
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
IRL |
Tn5393 |
1-40 |
GGGGTCGTTT GCGGGAGAGG GCGAAATCCT ACGCTAAGGC |
IRR |
IS1133 |
1815-1841 |
TGGACTTGCA CCGGTTGTGT TCCGGTC |
IRL |
IS1133 |
3020-3046 |
CTGGCCTTGC TTTGGCTCCG TTCAGGT |
IRRl |
Tn5393 |
6625-6705 |
ACCCTTGTGT AGTTAGAGTG GCCTCTTATA GCGACCGGTT TCGGAATCGC ATCCTAAGGC GGGGGAGGGC GTTTGCTGGG G |
|
References |
|
|
Chiou CS, Jones AL. Nucleotide sequence analysis of a transposon (Tn5393) carrying streptomycin resistance genes in Erwinia amylovora and other gram-negative bacteria. J Bacteriol. 1993 Feb;175(3):732-40. doi: 10.1128/jb.175.3.732-740.1993. PubMed ID: 8380801
| |
| | |
|
|