Transposon
Name: Tn402.4
Family: Tn402
Evidence of Transposition: no
 Host     

Host Organism:Morganella morganii nx_m63 Molecular Source:plasmid pNXM63-IMP
Place of Origin:Guangzhou, Guangdong, China Date of Isolation:2020
Other Geographic Information:Sun Yat-sen University

 Map     



 Terminal Inverted Repeats (IR)     


 Sequence     
DNA SequenceLength  6965 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GACACCCCAC GGATTCCAGA ACAAGGCGTG GCCACTCTGC CTGATGAGGC 200
TTGGGAGCGT GCGCGCCGTC GTGCGGAGAT CATCAGTCCG TTGGCGCAGT CGGAGACGGT CGGGCACGAA GCGGCCGATA TGGCGGCTCA GGCGCTGGGC 300
TTGTCTCGGC GCCAGGTATA CGTTCTGATC CGGCGTGCCC GGCAAGGCAG CGGCCTCGTG ACGGATCTGG TGCCCGGCCA GTCCGGTGGA GGTAAAGGTA 400
AGGGGCGCTT GCCGGAACCG GTCGAGCGCG TCATCCACGA GCTACTGCAA AAGCGGTTCC TGACCAAGCA GAAGCGCAGC CTAGCGGCCT TTCACCGCGA 500
AGTCACTCAG GTGTGCAAGG CTCAAAAACT GCGAGTGCCG GCGCGCAATA CCGTGGCCTT ACGGATCGCT AGCCTTGACC CGCGCAAGGT CATCCGCCGG 600
CGGGAAGGCC AGGATGCCGC TCGTGACCTA CAAGGTGTGG GCGGCGAGCC TCCTGCCGTG ACCGCGCCGC TGGAGCAGGT GCAGATAGAC CATACGGTCA 700
TCGACCTGAT CGTGGTCGAT GACCGCGACC GGCAACCTAT TGGCCGCCCG TACCTGACCC TCGCCATCGA CGTGTTCACC CGCTGCGTGC TCGGCATGGT 800
CGTCACGCTG GAAGCGCCGT CTGCCGTTTC GGTTGGCCTG TGCCTCGTGC ATGTCGCCTG CGACAAGCGC CCTTGGCTGG AAGGACTGAA CGTGGAAATG 900
GATTGGCAGA TGAGCGGCAA GCCCTTGCTG CTCTACCTAG ACAACGCGGC CGAGTTCAAG AGCGAGGCCC TGCGCCGGGG TTGCGAGCAG CATGGCATCC 1000
GGCTGGACTA TCGCCCGCTG GGACAGCCGC ACTATGGCGG CATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATT CACGACGAAC TGCCGGGAAC 1100
GACCTTCTCC AACCCTGACC AGCGCGGCGA CTACGATTCC GAAAACAAGG CCGCCCTGAC GCTGCGCGAG CTAGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTACCACG GTTCGGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAG GCCGTGGCGC GTGTCGGCGT ACCGGCCGTC GTCACACGCG 1300
CTACTTCGTT CCTGGTCGAT TTTCTGCCGA TCCTCCGGCG CACGCTGACC CGCACCGGCT TTGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATT GCGCGGCGTG AACGCTGGCC GTCCTTTCTG ATCCGGCGCG ATCCGCGCGA CATCAGCCGT ATCTGGGTCC TGGAACCGGA GGGACAGCAT 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CATCCGGCTG TCACCCTCTG GGAACAACGG CAGGCGCTGG CGAAACTGCG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC GGCGCTGTTC CGCATGATCG GCCAGATGCG TGAGATTGTG ACCAGCGCGC AGAAGGCCAC ACGCAAGGCG CGGCGTGACG CGGATCGCCG 1700
CCAGCACCTC AAGACATCAG CTCGGCCGGA CAAGCCCGTT CCGCCGGATA CGGATATTGC CGACCCGCAG GCAGACAACT TGCCACCCGC CAAACCGTTC 1800
GACCAGATTG AGGAGTGGTA GCCGTGGACG AATATCCCAT CATCGACCTG TCCCACCTGC TGCCGGCGGC CCAGGGCTTG GCCCGTCTTC CGGCGGACGA 1900
GCGCATCCAG CGCCTTCGCG CCGACCGCTG GATCGGCTAT CCGCGCGCAG TCGAGGCGCT GAACCGGCTG GAAGCCCTTT ATGCGTGGCC AAACAAGCAA 2000
CGCATGCCCA ACCTGCTGCT GGTTGGCCCG ACCAACAATG GCAAGTCGAT GATCGTCGAG AAGTTCCGCC GCACCCACCC GGCCAGCTCC GACGCCGACC 2100
AGGAGCACAT CCCGGTGTTG GTCGTGCAGA TGCCGTCCGA GCCGTCCGTG ATCCGCTTCT ACGTCGCGCT GCTCGCCGCG ATGGGCGCGC CGCTGCGCCC 2200
ACGCCCACGG TTGCCGGAAA TGGAGCAACT GGCTCTGGCA CTGCTGCGCA AGGTCGGCGT GCGCATGCTG GTGATCGACG AGCTGCACAA CGTGCTGGCC 2300
GGCAACAGCG TCAACCGCCG GGAATTCCTC AACCTGCTGC GCTTCCTCGG CAACGAACTG CGCATCCCGT TGGTTGGGGT AGGCACGCGC GACGCCTACC 2400
TAGCCATCCG CTCCGATGAC CAGTTGGAAA ATCGCTTCGA GCCGATGATG CTGCCGGTAT GGGAGGCCAA CGACGATTGC TGCTCACTGC TGGCCAGCTT 2500
CGCCGCTTCG CTCCCGCTGC GCCGGCCTTC CCCAATTGCC ACGCTGGACA TGGCTCGCTA CCTGCTCACA CGCAGCGAGG GCACCATAGG GGAACTGGCG 2600
CACTTGCTGA TGGCGGCGGC CATCGTCGCC GTGGAGAGCG GCGAGGAAGC GATCAACCAT CGCACACTCA GCATGGCCGT TTACACCGGA CCCAGCGAGC 2700
GGCGGCGGCA ATTCGAGCGG GAACTGATGT GAAGCCTGCG CCGCGCTGGC CGCTGCATCC CGCCCCGAAA GAAGGCGAGG CGCTGTCCTC ATGGCTCAAC 2800
CGCGTGGCCC TTTGCTATCA CATGGAGGAG CCCGACCTGC TGGAGCACGA TCTTGGTCAC GGCCAGGTCG ATGACCTGGA CACCGCGCCA CCACTCTCGC 2900
TGCTGGCGTT GCTTTCCCAG CGGAGCGGCA TCGAGCTGGA CCGGCTGCGC TGTATGAGTT TCGCCGGATG GGTGCCTTGG CTACTGGACA GCCTTGATGA 3000
CCAGATTCCA GACGCCTTGG AAACCTATGC GTTTCAGCTC TCGGTGTTGC TGCCAAGACT CCGCCGTAAG ACGCGATCCA TCACGAGCTG GCGTGCCTGG 3100
CTGCCCAGCC AGCCGATAAA CCGCGCCTGT CCGCTCTGCC TGAGCGATCC GGAGAACCAA GCCGTACTGC TCGCGTGGAA GCTGCCCCTG ATGCTGAGCT 3200
GCCCGCTGCA TGGCTGCTGG CTGGAATCCT ATTGGGGCGT GCCAGGGCGG TTTCTCGGCT GGGAGAACGC CGACGCCGAA CCGCGCACCG CCAGCGACGC 3300
GATTGCGGCG ATGGACCAGC GTACCTGGCA GGCACTGACA ACCGGTCACG TGGAGCTGCC GCGCCGACGC ATCCACGCCG GATTGTGGTT TCGACTTCTT 3400
CGCACGCTGC TCGATGAGCT GAACACCCCG CTTTCCGCGT GCGGAACCTG CGCGGGGTAT CCCCGCCAAG TCTGGGAAGG CTGCGGGCAT CCGCTGCGTG 3500
CTGGGCAAAG TCTGTGGCGA CCGTATGAAA CCCTGAATCC GATAGTACGG TTACAGATGC TGGAGGCGGC GGCAACGGCA ATCAGCTTGA TTGAGGTGAG 3600
GGACATCAGC CCGCCAGGCG AGCAGGCAAA GCTATTCTGG TCCGAGCCCC AAACCGGGTT CACCAGTGGC CTGCCGACGA AAGCGCCGAA GCCCGAGCCC 3700
ATCAATCACT GGCAGCGTGC AGTCCAGGCC ATCGACGAGG CCATCATTGA AGCGCGACAC AACCCCGAGA CGGCACGCTC GCTGTTCGCG TTGGCTTCCT 3800
ATGGTCGGCG CGATCCCGCT TCCTTGGAAC GGTTGCGCGC CACCTTCGTG AAGGAAGGCA TCCCGCCGGA ATTTCTGTCA CATTACCTGC CTGATGCACC 3900
CTTTGCATGT CTTAAACAAA ATGACGGGTT AAGTGACAAA TTTTGACGGA TAGAGCTTTC CGGCTCACAC TGTCACATAA TCGAACGTAT ACGTGACGGG 4000
TGAAAAGGTG CTGATCGGCT ACATGCGGGT ATCGAAGGCG GACGGATCCC AGTCCACCAA TTTGCAACGC GATGCGCTCA TCGCCGCTGG TGTGAGCCTT 4100
GCGCACCTTT ACGAGGATCT GGCCTCGGGC AGGCGCGATG ATCGCCCAGG GTTGGCTGCT TGCCTGAAGG CGCTTCGTGA AGGGGACACG CTGATCGTGT 4200
GGAAGCTCGA TCGGCTTGGC CGTGATCTGC GCCACCTGAT CAACACCGTG CACGACCTAA CTGCGCGTAG CGTGGGCCTG AAGGTCCTGA CCGGTCACGG 4300
TGCGGCGGTC GACACGACGA CTGCCGCCGG CAAGCTTGTG TTCGGTATTT TTGCCGCGCT GGCCGAGTTC GAGCGTGAGT TGATTTCCGA GCGAACAGTC 4400
GCTGGACTTA TCTCGGCGCG CGCTCGCGGC AGGAAAGGGG GGCGCCCCTT CAAGATGACC GCCGCCAAGC TACGCCTGGC GATGGCCAGC ATGGGGCAAC 4500
CGGAAACCAA GGTGGGCGAT CTCTGCGAAG AACTCGGGAT TACCCGGCAG ACGCTCTACC GGCACGTGTC GCCCAAGGGC GAACTGCGGC CAGACGGCGT 4600
AAAGCTGCTC TCCCTCGGTT CAGCCGCATA AATGGAGGCG ACCTGGAACG GGGCGCTGTT CAGTGCGGCA ACGATCCGAT TACCGGTGTC GACCCAGAGC 4700
AGCCGTAGAG CTTTTGGGAA AGCTGTCGTT CAACGCTAAG TTCAGCGGCA GTTTGTAAGT TGCGCGTTGT GGAATACTTT GCGAAGCAAA CCACAAAAGC 4800
GCAACTTACA AACTGTCCAG CCACGTAGTG GCGTGCTGCA ACGACTTGTT AGAAATTTAG TTGCTTGGTT TTGATGGTTT TTTACTTTCG TTTAACCCTT 4900
TAACCGCCTG CTCTAATGTA AGTTTCAAGA GTGATGCGTC TCCAACTTCA CTGTGACTTG GAACAACCAG TTTTGCCTTA CCATATTTGG ACTTTAATAA 5000
TTTGGCGGAC TTTGGCCAAG CTTCTATATT TGCGTCACCC AAATTGCCTA AACCGTACGG TTTAATAAAA CAACCACCGA ATAATATTTT CCTTTCAGGC 5100
AGCCAAACCA CTACGTTATC TGGAGTGTGT CCCGGGCCTG GATAAAAAAC TTCAATTTTA TTTTTAACTA GCCAATAGTT AACTCCGCTA AATGAATTTG 5200
TGGCTTGAAC CTTGCCGTCT TTTTTAAGCA GTTCATTTGT TAATTCAGAT GCATACGTGG GGATAGATCG AGAATTAAGC CACTCTATTC CGCCCGTGCT 5300
GTCGCTATGA AAATGAGAGG AAATACTGCC TTTTATTTTA TAGCCACGCT CCACAAACCA AGTGACTAAC TTTTCAGTAT CTTTAGCCGT AAATGGAGTG 5400
TCAATTAGAT AAGCCTCAGC ATTTACAAGA ACCACCAAAC CATGTTTAGG AACAACGCCC CACCCGTTAA CTTCTTCAAA CGAAGTATGA ACATAAACGC 5500
CTTCATCAAG CTTTTCAATT TTTAAATCTG GCAAAGACTC TGCTGCGGTA GCAATGCTGC AAAACAAAAA TATAAAGAAT ACAGATAACT TGCTCATACT 5600
TTTCCTTTTC TAACTTTGTT TTAGGGCGAC TGCCCTGCTG CGTAACATCG TTGCTGCTCC ATAACATCAA ACATCGACCC ACGGCGTAAC GCGCTTGCTG 5700
CTTGGATGCC CGAGGCATAG ACTGTACAAA AAAACAGTCA TAACAAGCCA TGAAAACCGC CACTGCGCCG TTACCACCGC TGCGTTCGGT CAAGGTTCTG 5800
GACCAGTTGC GTGAGCGCAT ACGCTACTTG CATTACAGCT TACGAACCGA ACAGGCTTAT GTCCACTGGG TTCGTGCCTT CATCCGTTTC CACGGTGTGC 5900
GTCACCCGGC AACCTTGGGC AGCAGCGAAG TCGAGGCATT TCTGTCCTGG CTGGCGAACG AGCGCAAGGT TTCGGTCTCC ACGCATCGTC AGGCATTGGC 6000
GGCCTTGCTG TTCTTCTACG GCAAGGTGCT GTGCACGGAT CTGCCCTGGC TTCAGGAGAT CGGAAGACCT CGGCCGTCGC GGCGCTTGCC GGTGGTGCTG 6100
ACCCCGGATG AAGTGGTTCG CATCCTCGGT TTTCTGGAAG GCGAGCATCG TTTGTTCGCC CAGCTTCTGT ATGGAACGGG CATGCGGATC AGTGAGGGTT 6200
TGCAACTGCG GGTCAAGGAT CTGGATTTCG ATCACGGCAC GATCATCGTG CGGGAGGGCA AGGGCTCCAA GGATCGGGCC TTGATGTTAC CCGAGAGCTT 6300
GGCACCCAGC CTGCGCGAGC AGCTGTCGCG TGCACGGGCA TGGTGGCTGA AGGACCAGGC CGAGGGCCGC AGCGGCGTTG CGCTTCCCGA CGCCCTTGAG 6400
CGGAAGTATC CGCGCGCCGG GCATTCCTGG CCGTGGTTCT GGGTTTTTGC GCAGCACACG CATTCGACCG ATCCACGGAG CGGTGTCGTG CGTCGCCATC 6500
ACATGTATGA CCAGACCTTT CAGCGCGCCT TCAAACGTGC CGTAGAACAA GCAGGCATCA CGAAGCCCGC CACACCGCAC ACCCTCCGCC ACTCGTTCGC 6600
GACGGCCTTG CTCCGCAGCG GTTACGACAT TCGAACCGTG CAGGATCTGC TCGGCCATTC CGACGTCTCT ACGACGATGA TTTACACGCA TGTGCTGAAA 6700
GTTGGCGGTG CCGGAGTGCG CTCACCGCTT GATGCGCTGC CGCCCCTCAC TAGTGAGAGG TAGGGCAGCG CAAGTCAATC CTGGCGGATT CACTACCCCT 6800
GCGCGAAGGC CATCGGTGCC GCATCGAACG GCCGGTTGCG GAAAGTCCTC CCTGCGTCCG CTGATGGCCG GCAGCAGCCC GTCGTTGCCT GATGGATCCA 6900
ACCCCTCCGC TGCTATAGTG CAGTCGGCTT CTGACGTTCA GTGCAGCCGT CTTCTGAAAA CGACA

 Recombination Sites     

Name Coordinates Gene Sequence
r6 3824-3837 14 TTGGAACGGT TGCG
r5 3870-3883 14 AATTTCTGTC ACAT
r4 3875-3888 14 CTGTCACATT ACCT
r3 3926-3939 14 GGGTTAAGTG ACAA
res 3967-4001 35 ACACTGTCAC ATAATCGAAC GTATACGTGA CGGGT
r2 3970-3983 14 CTGTCACATA ATCG
r1 3986-3999 14 CGTATACGTG ACGG
attI 5614-5669 56 CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tniA Tn402.4 142-1821 Transposase   +
tniB Tn402.4 1824-2732 Accessory Gene   +
tniQ Tn402.4 2729-3946 Accessory Gene Target Site Selection +
tniR Tn402.4 4008-4631 Accessory Gene Resolvase +
bla IMP-1 (ARO:3002192) Tn402.4 4857-5597 Passenger Gene Antibiotic Resistance -
intI1 Tn402.4 5750-6763 Integron Integrase Class 1 +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA TniA Tn402.4 1680 142-1821 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   can be extended upstream by 12 amino acids| identical to tniA (Tn1721 and In2)| 25% amino acid sequence identity to TnsB from Tn7
Protein Sequence:  
MATDTPRIPE QGVATLPDEA WERARRRAEI ISPLAQSETV GHEAADMAAQ ALGLSRRQVY VLIRRARQGS GLVTDLVPGQ SGGGKGKGRL PEPVERVIHE
LLQKRFLTKQ KRSLAAFHRE VTQVCKAQKL RVPARNTVAL RIASLDPRKV IRRREGQDAA RDLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDDRDRQPI
GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLVHVAC DKRPWLEGLN VEMDWQMSGK PLLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG
IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAVARVGV PAVVTRATSF LVDFLPILRR
TLTRTGFVID HIHYYADALK PWIARRERWP SFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR
EIVTSAQKAT RKARRDADRR QHLKTSARPD KPVPPDTDIA DPQADNLPPA KPFDQIEEW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniB TniB Tn402.4 909 1824-2732 +
Class:   Accessory Gene
Sequence Family:  ATP binding protein?
Comment:   identical to tniB (Tn1721)| similar function to Tn7 tnsC and MuB
Protein Sequence:  
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV
VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ
LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLMAAAIVAV ESGEEAINHR TLSMAVYTGP SERRRQFERE
LM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniQ TniQ Tn402.4 1218 2729-3946 +
Class:   Accessory Gene
Sub Class:   Target Site Selection
Comment:   identical to tniQ (Tn1721)|similar function to Tn7 tnsD?
Protein Sequence:  
MKPAPRWPLH PAPKEGEALS SWLNRVALCY HMEEPDLLEH DLGHGQVDDL DTAPPLSLLA LLSQRSGIEL DRLRCMSFAG WVPWLLDSLD DQIPDALETY
AFQLSVLLPR LRRKTRSITS WRAWLPSQPI NRACPLCLSD PENQAVLLAW KLPLMLSCPL HGCWLESYWG VPGRFLGWEN ADAEPRTASD AIAAMDQRTW
QALTTGHVEL PRRRIHAGLW FRLLRTLLDE LNTPLSACGT CAGYPRQVWE GCGHPLRAGQ SLWRPYETLN PIVRLQMLEA AATAISLIEV RDISPPGEQA
KLFWSEPQTG FTSGLPTKAP KPEPINHWQR AVQAIDEAII EARHNPETAR SLFALASYGR RDPASLERLR ATFVKEGIPP EFLSHYLPDA PFACLKQNDG
LSDKF

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniR TniR Tn402.4 624 4008-4631 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Comment:   resolution of cointegrates || Protein: ACE81792.1 || identical to tniR (Tn1721)
Protein Sequence:  
MLIGYMRVSK ADGSQSTNLQ RDALIAAGVS LAHLYEDLAS GRRDDRPGLA ACLKALREGD TLIVWKLDRL GRDLRHLINT VHDLTARSVG LKVLTGHGAA
VDTTTAAGKL VFGIFAALAE FERELISERT VAGLISARAR GRKGGRPFKM TAAKLRLAMA SMGQPETKVG DLCEELGITR QTLYRHVSPK GELRPDGVKL
LSLGSAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
bla IMP-1 (ARO:3002192) Bla IMP-1 Tn402.4 741 4857-5597 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic inactivation (ARO:0001004)
Target:   cephalosporin (ARO:0000032)||penem (ARO:3003706)||penam (ARO:3000008)||cephamycin (ARO:0000044)||carbapenem (ARO:0000020)
Sequence Family:  IMP beta-lactamase (ARO:3000020)
Comment:   100% identical to reference sequence for ARO:3002192
Protein Sequence:  
MSKLSVFFIF LFCSIATAAE SLPDLKIEKL DEGVYVHTSF EEVNGWGVVP KHGLVVLVNA EAYLIDTPFT AKDTEKLVTW FVERGYKIKG SISSHFHSDS
TGGIEWLNSR SIPTYASELT NELLKKDGKV QATNSFSGVN YWLVKNKIEV FYPGPGHTPD NVVVWLPERK ILFGGCFIKP YGLGNLGDAN IEAWPKSAKL
LKSKYGKAKL VVPSHSEVGD ASLLKLTLEQ AVKGLNESKK PSKPSN

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 Tn402.4 1014 5750-6763 +
Class:   Integron Integrase
Sub Class:   Class 1
Transpoase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 Tn402.4 9-27 TCAGAAGACG ACTGCACCA
repeat t2 Tn402.4 49-67 AACACGTCGG TCGAGGACT
repeat t3 Tn402.4 78-97 TCAGAAGTGA TCTGCACCAA
repeat t4 Tn402.4 110-128 TCAATACTCG TGTGCACCA
repeat i4 Tn402.4 6846-6864 AGGAGGGACG CAGGCGACT
repeat i3 Tn402.4 6874-6892 CGTCGGGCAG CAACGGACT
repeat i2 Tn402.4 6916-6934 ATCACGTCAG CCGAAGACT
IRi Tn402.4 6933-6965 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT