Transposon
Name: Tn402.5
Family: Tn402
Evidence of Transposition: no
 Host     

Molecular Source:pTL16
Place of Origin:Germany Date of Isolation:2018
Other Geographic Information:Rhizosphere Lettuce

 Map     



 Terminal Inverted Repeats (IR)     


 Sequence     
DNA SequenceLength  7085 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GACACCCCAC GGATTCCAGA ACAAGGCGTG GCCACTCTGC CTGATGAGGC 200
TTGGGAGCGT GCGCGCCGTC GTGCGGAGAT CATCAGTCCG TTGGCGCAGT CGGAGACGGT CGGGCACGAA GCGGCCGATA TGGCGGCTCA GGCGCTGGGC 300
TTGTCTCGGC GCCAGGTATA CGTTCTGATC CGGCGTGCCC GGCAAGGCAG CGGCCTCGTG ACGGATCTGG TGCCCGGCCA GTCCGGTGGA GGTAAAGGTA 400
AGGGGCGCTT GCCGGAACCG GTCGAGCGCG TCATCCACGA GCTACTGCAA AAGCGGTTCC TGACCAAGCA GAAGCGCAGC CTAGCGGCCT TTCACCGCGA 500
AGTCACTCAG GTGTGCAAGG CTCAAAAACT GCGAGTGCCG GCGCGCAATA CCGTGGCCTT ACGGATCGCT AGCCTTGACC CGCGCAAGGT CATCCGCCGG 600
CGGGAAGGCC AGGATGCCGC TCGTGACCTA CAAGGTGTGG GCGGCGAGCC TCCTGCCGTG ACCGCGCCGC TGGAGCAGGT GCAGATAGAC CATACGGTCA 700
TCGACCTGAT CGTGGTCGAT GACCGCGACC GGCAACCTAT TGGCCGCCCG TACCTGACCC TCGCCATCGA CGTGTTCACC CGCTGCGTGC TCGGCATGGT 800
CGTCACGCTG GAAGCGCCGT CTGCCGTTTC GGTTGGCCTG TGCCTCGTGC ATGTCGCCTG CGACAAGCGC CCTTGGCTGG AAGGACTGAA CGTGGAAATG 900
GATTGGCAGA TGAGCGGCAA GCCCTTGCTG CTCTACCTAG ACAACGCGGC CGAGTTCAAG AGCGAGGCCC TGCGCCGGGG TTGCGAGCAG CATGGCATCC 1000
GGCTGGACTA TCGCCCGCTG GGACAGCCGC ACTATGGCGG CATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATT CACGACGAAC TGCCGGGAAC 1100
GACCTTCTCC AACCCTGACC AGCGCGGCGA CTACGATTCC GAAAACAAGG CCGCCCTGAC GCTGCGCGAG CTAGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTACCACG GTTCGGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAG GCCGTGGCGC GTGTCGGCGT ACCGGCCGTC GTCACACGCG 1300
CTACTTCGTT CCTGGTCGAT TTTCTGCCGA TCCTCCGGCG CACGCTGACC CGCACCGGCT TTGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATT GCGCGGCGTG AACGCTGGCC GTCCTTTCTG ATCCGGCGCG ATCCGCGCGA CATCAGCCGT ATCTGGGTCC TGGAACCGGA GGGACAGCAT 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CATCCGGCTG TCACCCTCTG GGAACAACGG CAGGCGCTGG CGAAACTGCG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC GGCGCTGTTC CGCATGATCG GCCAGATGCG TGAGATTGTG ACCAGCGCGC AGAAGGCCAC ACGCAAGGCG CGGCGTGACG CGGATCGCCG 1700
CCAGCACCTC AAGACATCAG CTCGGCCGGA CAAGCCCGTT CCGCCGGATA CGGATATTGC CGACCCGCAG GCAGACAACT TGCCACCCGC CAAACCGTTC 1800
GACCAGATTG AGGAGTGGTA GCCGTGGACG AATATCCCAT CATCGACCTG TCCCACCTGC TGCCGGCGGC CCAGGGCTTG GCCCGTCTTC CGGCGGACGA 1900
GCGCATCCAG CGCCTTCGCG CCGACCGCTG GATCGGCTAT CCGCGCGCAG TCGAGGCGCT GAACCGGCTG GAAGCCCTTT ATGCGTGGCC AAACAAGCAA 2000
CGCATGCCCA ACCTGCTGCT GGTTGGCCCG ACCAACAATG GCAAGTCGAT GATCGTCGAG AAGTTCCGCC GCACCCACCC GGCCAGCTCC GACGCCGACC 2100
AGGAGCACAT CCCGGTGTTG GTCGTGCAGA TGCCGTCCGA GCCGTCCGTG ATCCGCTTCT ACGTCGCGCT GCTCGCCGCG ATGGGCGCGC CGCTGCGCCC 2200
ACGCCCACGG TTGCCGGAAA TGGAGCAACT GGCTCTGGCA CTGCTGCGCA AGGTCGGCGT GCGCATGCTG GTGATCGACG AGCTGCACAA CGTGCTGGCC 2300
GGCAACAGCG TCAACCGCCG GGAATTCCTC AACCTGCTGC GCTTCCTCGG CAACGAACTG CGCATCCCGT TGGTTGGGGT AGGCACGCGC GACGCCTACC 2400
TAGCCATCCG CTCCGATGAC CAGTTGGAAA ATCGCTTCGA GCCGATGATG CTGCCGGTAT GGGAGGCCAA CGACGATTGC TGCTCACTGC TGGCCAGCTT 2500
CGCCGCTTCG CTCCCGCTGC GCCGGCCTTC CCCAATTGCC ACGCTGGACA TGGCTCGCTA CCTGCTCACA CGCAGCGAGG GCACCATAGG GGAACTGGCG 2600
CACTTGCTGA TGGCGGCGGC CATCGTCGCC GTGGAGAGCG GCGAGGAAGC GATCAACCAT CGCACACTCA GCATGGCCGT TTACACCGGA CCCAGCGAGC 2700
GGCGGCGGCA ATTCGAGCGG GAACTGATGT GAAGCCTGCG CCGCGCTGGC CGCTGCATCC CGCCCCGAAA GAAGGCGAGG CGCTGTCCTC ATGGCTCAAC 2800
CGCGTGGCCC TTTGCTATCA CATGGAGGAG CCCGACCTGC TGGAGCACGA TCTTGGTCAC GGCCAGGTCG ATGACCTGGA CACCGCGCCA CCACTCTCGC 2900
TGCTGGCGTT GCTTTCCCAG CGGAGCGGCA TCGAGCTGGA CCGGCTGCGC TGTATGAGTT TCGCCGGATG GGTGCCTTGG CTACTGGACA GCCTTGATGA 3000
CCAGATTCCA GACGCCTTGG AAACCTATGC GTTTCAGCTC TCGGTGTTGC TGCCAAGACT CCGCCGTAAG ACGCGATCCA TCACGAGCTG GCGTGCCTGG 3100
CTGCCCAGCC AGCCGATAAA CCGCGCCTGT CCGCTCTGCC TGAGCGATCC GGAGAACCAA GCCGTACTGC TCGCGTGGAA GCTGCCCCTG ATGCTGAGCT 3200
GCCCGCTGCA TGGCTGCTGG CTGGAATCCT ATTGGGGCGT GCCAGGGCGG TTTCTCGGCT GGGAGAACGC CGACGCCGAA CCGCGCACCG CCAGCGACGC 3300
GATTGCGGCG ATGGACCAGC GTACCTGGCA GGCACTGACA ACCGGTCACG TGGAGCTGCC GCGCCGACGC ATCCACGCCG GATTGTGGTT TCGACTTCTT 3400
CGCACGCTGC TCGATGAGCT GAACACCCCG CTTTCCGCGT GCGGAACCTG CGCGGGGTAT CCCCGCCAAG TCTGGGAAGG CTGCGGGCAT CCGCTGCGTG 3500
CTGGGCAAAG TCTGTGGCGA CCGTATGAAA CCCTGAATCC GATAGTACGG TTACAGATGC TGGAGGCGGC GGCAACGGCA ATCAGCTTGA TTGAGGTGAG 3600
GGACATCAGC CCGCCAGGCG AGCAGGCAAA GCTATTCTGG TCCGAGCCCC AAACCGGGTT CACCAGTGGC CTGCCGACGA AAGCGCCGAA GCCCGAGCCC 3700
ATCAATCACT GGCAGCGTGC AGTCCAGGCC ATCGACGAGG CCATCATTGA AGCGCGACAC AACCCCGAGA CGGCACGCTC GCTGTTCGCG TTGGCTTCCT 3800
ATGGTCGGCG CGATCCCGCT TCCTTGGAAC GGTTGCGCGC CACCTTCGTG AAGGAAGGCA TCCCGCCGGA ATTTCTGTCA CATTACCTGC CTGATGCACC 3900
CTTTGCATGT CTTAAACAAA ATGACGGGTT AAGTGACAAA TTTTGACGGA TAGAGCTTTC CGGCTCACAC TGTCACATAA TCGAACGTAT ACGTGACGGG 4000
TGAAAAGGTG CTGATCGGCT ACATGCGGGT ATCGAAGGCG GACGGATCCC AGTCCACCAA TTTGCAACGC GATGCGCTCA TCGCCGCTGG TGTGAGCCTT 4100
GCGCACCTTT ACGAGGATCT GGCCTCGGGC AGGCGCGATG ATCGCCCAGG GTTGGCTGCT TGCCTGAAGG CGCTTCGTGA AGGGGACACG CTGATCGTGT 4200
GGAAGCTCGA TCGGCTTGGC CGTGATCTGC GCCACCTGAT CAACACCGTG CACGACCTAA CTGCGCGTAG CGTGGGCCTG AAGGTCCTGA CCGGTCACGG 4300
TGCGGCGGTC GACACGACGA CTGCCGCCGG CAAGCTTGTG TTCGGTATTT TTGCCGCGCT GGCCGAGTTC GAGCGTGAGT TGATTTCCGA GCGAACAGTC 4400
GCTGGACTTA TCTCGGCGCG CGCTCGCGGC AGGAAAGGGG GGCGCCCCTT CAAGATGACC GCCGCCAAGC TACGCCTGGC GATGGCCAGC ATGGGGCAAC 4500
CGGAAACCAA GGTGGGCGAT CTCTGCGAAG AACTCGGGAT TACCCGGCAG ACGCTCTACC GGCACGTGTC GCCCAAGGGC GAACTGCGGC CAGACGGCGT 4600
AAAGCTGCTC TCCCTCGGTT CAGCCGCATA AATGGAGGCG ACCTGGAACG GGGCGCTGTT CAGTGCGGCA ACGATCCGAT TACCGGTGTC GACCCAGAGC 4700
AGCCGTAGAG CTTTTGGGAA AGCTGTCGTT CAACGCCGAG TTCAGCGGCA GTTTTTAAGT TGTGATTTTA TGGAATACTT TTGCGCAGCA AAACCATAAA 4800
GCCGCGACTT AAAAACTGTC CAGCGCAGGC ACGAAGTGCT GGAGCGGTGC TGCAACGACT TGTTAGATGA CTGAGTTTAT TAGTGGGCAC TTGCTTTGGA 4900
AAGCAAGTTT AAAACTACTA CACCACTAAC TATGAGCCCC ATACCTACAA AGCCCCACGC ATCAAGCTTT TGCCCATGAA GCAACCAGGC AATGGCTGTA 5000
ATTATGACGA CGCCGAGTCC CGACCAGACT GCATAAGCAA CACCGACAGG GATGGATTTC AGAACCAGAG AAAGAAAATA AAATGCGATG CCATAACCGA 5100
TTATGACAAC GGCGGAAGGG GCAAGCTTAG TAAAGCCCTC GCTAGATTTT AATGCGGATG TTGCGATTAC TTCGCCAACT ATTGCGATAA CAAGAAAAAG 5200
CCAGCCTTTC ATGATATATC TCCCAATTTG TGTAGGGCTT ATTATGCACG CTTAAAAATA ATAAAAGCAG ACTTGACCTG ATAGTTTGGC TGTGAGCAAT 5300
TATGTGCTTA GTGCATCTAA CGCCTGAGTT CAGCCGCCGC CGTAGGCGGT CGGCTGCGAC GAAATGTTAG GGGCGCTGCT TGCGGAAGAA CCCTTTGACG 5400
ACCCAAATGG CTGCATACAA CAAAACTGCG AAAACAAGTG CGATGAAAAA GCCTTCTGGC CACTTTCCCG GCCCAGAAAC GTAAGCGGAG ACACCAAGAA 5500
CACCGACGAT GCCAAGGCAT AACCGAAGGT CAATGGGTAA ATTGACAAAA GGGAGGCCAG CAAGACTAGC TGTAACAACA GCAATGCCGA GAAAAATGCG 5600
GCCCATTTCG GGAGAATCAG GGGCACCAGT GGCGAACATG AACATGAGGA AGCCGGCGAC GGCAACGCAA AGCACCGCCC AGACAAGTTT CAACCGGTGA 5700
ATACCGTGAT GCATATCCAT GTATATACCC TAACTTTGTT TTAGGGCGAC TGCCCTGCTG CGTAACATCG TTGCTGCTCC ATAACATCAA ACATCGACCC 5800
ACGGCGTAAC GCGCTTGCTG CTTGGATGCC CGAGGCATAG ACTGTACAAA AAAACAGTCA TAACAAGCCA TGAAAACCGC CACTGCGCCG TTACCACCGC 5900
TGCGTTCGGT CAAGGTTCTG GACCAGTTGC GTGAGCGCAT ACGCTACTTG CATTACAGCT TACGAACCGA ACAGGCTTAT GTCCACTGGG TTCGTGCCTT 6000
CATCCGTTTC CACGGTGTGC GTCACCCGGC AACCTTGGGC AGCAGCGAAG TCGAGGCATT TCTGTCCTGG CTGGCGAACG AGCGCAAGGT TTCGGTCTCC 6100
ACGCATCGTC AGGCATTGGC GGCCTTGCTG TTCTTCTACG GCAAGGTGCT GTGCACGGAT CTGCCCTGGC TTCAGGAGAT CGGAAGACCT CGGCCGTCGC 6200
GGCGCTTGCC GGTGGTGCTG ACCCCGGATG AAGTGGTTCG CATCCTCGGT TTTCTGGAAG GCGAGCATCG TTTGTTCGCC CAGCTTCTGT ATGGAACGGG 6300
CATGCGGATC AGTGAGGGTT TGCAACTGCG GGTCAAGGAT CTGGATTTCG ATCACGGCAC GATCATCGTG CGGGAGGGCA AGGGCTCCAA GGATCGGGCC 6400
TTGATGTTAC CCGAGAGCTT GGCACCCAGC CTGCGCGAGC AGCTGTCGCG TGCACGGGCA TGGTGGCTGA AGGACCAGGC CGAGGGCCGC AGCGGCGTTG 6500
CGCTTCCCGA CGCCCTTGAG CGGAAGTATC CGCGCGCCGG GCATTCCTGG CCGTGGTTCT GGGTTTTTGC GCAGCACACG CATTCGACCG ATCCACGGAG 6600
CGGTGTCGTG CGTCGCCATC ACATGTATGA CCAGACCTTT CAGCGCGCCT TCAAACGTGC CGTAGAACAA GCAGGCATCA CGAAGCCCGC CACACCGCAC 6700
ACCCTCCGCC ACTCGTTCGC GACGGCCTTG CTCCGCAGCG GTTACGACAT TCGAACCGTG CAGGATCTGC TCGGCCATTC CGACGTCTCT ACGACGATGA 6800
TTTACACGCA TGTGCTGAAA GTTGGCGGTG CCGGAGTGCG CTCACCGCTT GATGCGCTGC CGCCCCTCAC TAGTGAGAGG TAGGGCAGCG CAAGTCAATC 6900
CTGGCGGATT CACTACCCCT GCGCGAAGGC CATCGGTGCC GCATCGAACG GCCGGTTGCG GAAAGTCCTC CCTGCGTCCG CTGATGGCCG GCAGCAGCCC 7000
GTCGTTGCCT GATGGATCCA ACCCCTCCGC TGCTATAGTG CAGTCGGCTT CTGACGTTCA GTGCAGCCGT CTTCTGAAAA CGACA

 Recombination Sites     

Name Coordinates Gene Sequence
r6 3824-3837 14 TTGGAACGGT TGCG
r5 3870-3883 14 AATTTCTGTC ACAT
r4 3875-3888 14 CTGTCACATT ACCT
r3 3926-3939 14 GGGTTAAGTG ACAA
res 3967-4001 35 ACACTGTCAC ATAATCGAAC GTATACGTGA CGGGT
r2 3970-3983 14 CTGTCACATA ATCG
r1 3986-3999 14 CGTATACGTG ACGG
attC qacE core 4728-4868 141 GTTCAACGCC GAGTTCAGCG GCAGTTTTTA AGTTGTGATT TTATGGAATA CTTTTGCGCA
GCAAAACCAT AAAGCCGCGA CTTAAAAACT GTCCAGCGCA GGCACGAAGT GCTGGAGCGG
TGCTGCAACG ACTTGTTAGA T
attI 5734-5789 56 CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tniA Tn402.5 142-1821 Transposase   +
tniB Tn402.5 1824-2732 Accessory Gene   +
tniQ Tn402.5 2729-3946 Accessory Gene Target Site Selection +
tniR Tn402.5 4008-4631 Accessory Gene Resolvase +
qacE (ARO:3005009) Tn402.5 4880-5212 Passenger Gene Antibiotic Resistance -
WP_046742232.1 Tn402.5 5367-5720 Passenger Gene Hypothetical -
intI1 Tn402.5 5870-6883 Integron Integrase Class 1 +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA TniA Tn402.5 1680 142-1821 +
Class:   Transposase
Transpoase Chemistry:   DDE
Comment:   can be extended upstream by 12 amino acids| identical to tniA (Tn1721 and In2)| 25% amino acid sequence identity to TnsB from Tn7
Protein Sequence:  
MATDTPRIPE QGVATLPDEA WERARRRAEI ISPLAQSETV GHEAADMAAQ ALGLSRRQVY VLIRRARQGS GLVTDLVPGQ SGGGKGKGRL PEPVERVIHE
LLQKRFLTKQ KRSLAAFHRE VTQVCKAQKL RVPARNTVAL RIASLDPRKV IRRREGQDAA RDLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDDRDRQPI
GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLVHVAC DKRPWLEGLN VEMDWQMSGK PLLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG
IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAVARVGV PAVVTRATSF LVDFLPILRR
TLTRTGFVID HIHYYADALK PWIARRERWP SFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR
EIVTSAQKAT RKARRDADRR QHLKTSARPD KPVPPDTDIA DPQADNLPPA KPFDQIEEW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniB TniB Tn402.5 909 1824-2732 +
Class:   Accessory Gene
Sequence Family:  ATP binding protein?
Comment:   identical to tniB (Tn1721)| similar function to Tn7 tnsC and MuB
Protein Sequence:  
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV
VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ
LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLMAAAIVAV ESGEEAINHR TLSMAVYTGP SERRRQFERE
LM

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniQ TniQ Tn402.5 1218 2729-3946 +
Class:   Accessory Gene
Sub Class:   Target Site Selection
Comment:   identical to tniQ (Tn1721)|similar function to Tn7 tnsD?
Protein Sequence:  
MKPAPRWPLH PAPKEGEALS SWLNRVALCY HMEEPDLLEH DLGHGQVDDL DTAPPLSLLA LLSQRSGIEL DRLRCMSFAG WVPWLLDSLD DQIPDALETY
AFQLSVLLPR LRRKTRSITS WRAWLPSQPI NRACPLCLSD PENQAVLLAW KLPLMLSCPL HGCWLESYWG VPGRFLGWEN ADAEPRTASD AIAAMDQRTW
QALTTGHVEL PRRRIHAGLW FRLLRTLLDE LNTPLSACGT CAGYPRQVWE GCGHPLRAGQ SLWRPYETLN PIVRLQMLEA AATAISLIEV RDISPPGEQA
KLFWSEPQTG FTSGLPTKAP KPEPINHWQR AVQAIDEAII EARHNPETAR SLFALASYGR RDPASLERLR ATFVKEGIPP EFLSHYLPDA PFACLKQNDG
LSDKF

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniR TniR Tn402.5 624 4008-4631 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Comment:   resolution of cointegrates || Protein: ACE81792.1 || identical to tniR (Tn1721)
Protein Sequence:  
MLIGYMRVSK ADGSQSTNLQ RDALIAAGVS LAHLYEDLAS GRRDDRPGLA ACLKALREGD TLIVWKLDRL GRDLRHLINT VHDLTARSVG LKVLTGHGAA
VDTTTAAGKL VFGIFAALAE FERELISERT VAGLISARAR GRKGGRPFKM TAAKLRLAMA SMGQPETKVG DLCEELGITR QTLYRHVSPK GELRPDGVKL
LSLGSAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacE (ARO:3005009) QacE Tn402.5 333 4880-5212 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   quaternary ammonium salts
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   SMR export pump||strict match to reference sequence for ARO:3005009 (bitscore:204)
Protein Sequence:  
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIVSGVVV
LNLLSKASAH

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
WP_046742232.1 WP_046742232.1 Tn402.5 354 5367-5720 -
Class:   Passenger Gene
Sub Class:   Hypothetical
Protein Sequence:  
MDMHHGIHRL KLVWAVLCVA VAGFLMFMFA TGAPDSPEMG RIFLGIAVVT ASLAGLPFVN LPIDLRLCLG IVGVLGVSAY VSGPGKWPEG FFIALVFAVL
LYAAIWVVKG FFRKQRP

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 Tn402.5 1014 5870-6883 +
Class:   Integron Integrase
Sub Class:   Class 1
Transpoase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 Tn402.5 9-27 TCAGAAGACG ACTGCACCA
repeat t2 Tn402.5 49-67 AACACGTCGG TCGAGGACT
repeat t3 Tn402.5 78-97 TCAGAAGTGA TCTGCACCAA
repeat t4 Tn402.5 110-128 TCAATACTCG TGTGCACCA
repeat i4 Tn402.5 6966-6984 AGGAGGGACG CAGGCGACT
repeat i3 Tn402.5 6994-7012 CGTCGGGCAG CAACGGACT
repeat i2 Tn402.5 7036-7054 ATCACGTCAG CCGAAGACT
IRi Tn402.5 7053-7085 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT

 References     

Shintani M, Nour E, Elsayed T, Blau K, Wall I, Jechalke S, Spröer C, Bunk B, Overmann J, Smalla K. Plant Species-Dependent Increased Abundance and Diversity of IncP-1 Plasmids in the Rhizosphere: New Insights Into Their Role and Ecology. Front Microbiol. 2020 Nov 27;11:590776. doi: 10.3389/fmicb.2020.590776. eCollection 2020. PubMed ID: 33329469