Transposon
Name: Tn5501.5       (Synonyms: Tn5501.13)
Family: Tn3        Group: Tn3000
Evidence of Transposition: Yes
 Host     

Host Organism:Uncultured bacterium Molecular Source:plasmid pGNB1
Place of Origin:Bielefeld-Heepen, Germany Date of Isolation:2007
Other Geographic Information:activated sludge compartment of a wastewater treatment plant

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp)GGGGTTCTAAGCCGGAACCGCCGAAAATTCCGTCAGCC
IRR (Length: 38 bp)GGGGTTCTAAGCCAGAACCGCCGAAATTTCCGTCATCC

 Sequence     
DNA SequenceLength  6298 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTTCTAA GCCGGAACCG CCGAAAATTC CGTCAGCCGA TCAACGTGGC TTGTCCCGCG CCCGGTCGAT GGGGTAGACC CAACAGTCGT GTCACTAGCC 100
GCCATTTCGA TCACGGCAAT GCCAGCCGGA CGTCACGTCC AGATTGTTCC GGTCTGGATG AGGCCGACTG ACGTCTCGGA TGACGGGTGG CATACAACTG 200
CTGTGAGTCC TGCAGGGGGG CAGCTGCCTG ACCGGACGGC GAGCATCAGC CCATCTCATG TATTAGTCAT GTCAGCTTTG ACACTGCGCA CGCGACGGCA 300
CCCGACCCGT TGCAGACCCC CAGACATATG GAAAGCTGAC GCTCAACGTG GAGTTAGCCG GCGCGGCGCG GCTTCATCGC GCAGCGTCCG TGTTGATGGA 400
TGGGTTAGAA CTCTCTGGGG CACACATTTG AGAAAGACTA TTTTTCAGCC AACGCAAAAC CCGCGTTGCC AGATGCAACC ACTGAGACAA ATACAAACGG 500
CACTGGACTA GTATTTAGAG CGCCATGCAC TTGGCCAGGT TTTGCTATGG CAATATCTCC AGCCTTTAGA TGAGCGACTT TGCCGCCTCC TTGGTAATAC 600
TCAGCCTCTC CAGAGATAAC CGTCCAAGTA TCTTGGCCGT CCGGGTGAAC ATGAGCGGTT ATTTCTTGCC CAGGATGGGC ATGCCAAACA ACAACTGCTG 700
AGTCCTTGGT TTCCAGAACC ACTGAACGAA TTGGTTCGCC GTCAGACGGA CGAATATATT CTGTTACCGA AAATATTCTT GATTCCATTT CCATATCCAC 800
TCCAAAAATC AATTTGGCTT GAGAAGTTAT TACAGAACAA CTGTATCCGA GAGTTCTACG TTGCCGATAA CCGGCCCGAA ACGGCGGCGC AACGCGCCGC 900
TGTTGGGGGT CCGTGTTCAT CGGCGGGTTA GACCTGTGCT TGCGTAAGAC GAATATCGAC TTCTCGGTCG ACGTATTTCC ATTCCTCTGA CGCACGCAGT 1000
TCGGAAAGCA GCCTGTTGAA CGGCTTTGTT GCACAAGTCG AAAAAAGTCG ATATGACCCC TACCCCAGAC GCAGCTACGC CACAGCAGGC ACCGCCACCG 1100
TTGTGGGGCA CCCAAGTTGC GTGAATCGGT TGAGCAAGGC CACCCGCACA TGCAACTCTG TCACCTGACG CTCGAATGTG CGGGCCATCA CCCGCTCGCC 1200
CAGGCGCTTG AAGCAATGCA TCTTGGTCTC CACCAGACTC CTGCGGTGGT AGCCGCTCCA CTTCTTCCAG ATACGCCAAC CCAGCCTCTT GCACGCCAGC 1300
ACCGCTTCGT TACGCACCAT GGCCCCAGGC GTTTGGGACT TCCACACCTG GGCGTTCTTG CGCGGCGGGA TGATCGCTTG TGCCCCACGT TCCATGATCG 1400
CCGCATGGCA CGCCTTGGTG TCGTATGCAC CGTCGGTACT CACACTGGCG ATCGGTTCAC CCGGTGGTAT CTGCCCCAGC AGCCCAGGCA GCATGGGCGC 1500
GTCGCCCACG CTGTTGTCTG TGACTTCAAT GGCCCGTATC TCCAGCGTGC TGGCATCAAT ACCCAGGTGC ACTTTGCGCC ACTGGCGCCG GTACTCTGCC 1600
CCGTGTTTCT TGCGTTTCCA CTCCCCCTCG CCCAGGAACT TGATGCCGGT GCTGTCCACC AGCAGGTTCA GCGCGCTGGT AGAGGCTCGG TAGGGCAGTT 1700
GCACCTGCAG TGTCTTTTGG CGTCGGCTCA CGGTGCTGAA GTCGGGCACC CTCCAATCCA GGTCTGCCAG GTGCAGCAGG CTTTGCACCA GTCCCAGGCT 1800
TTGGCGCAGG GCCAAACCAA AAAGGCATTT GATGCTCAGG CAGAACTGGA TGGCGGCGTC GCTGAACACG TGCTGCCTAC CCCGCTTGCC GCTGGCGGGT 1900
GCGTACCACT GCATGTCTTT GTCCAGCCAG ATCGCCAGCG AGCCCCGGGC TTTGAGCGCT GCGTTGTACG CTGCCCAGTT GGTGGTCTTG TAGCGGGGCT 2000
TGGCGCTCTT GGATTCACTC ATGTCCCGAG GCTAACAGTT CGGGGGGCAG GGTTTGTGCA ACAGAGCCGC TGCGATGTAG TCCCACACAT CGGCACGGTC 2100
TTGCTGCGCT TCGGGCGTCC AAACAACCCT CACGCCTGGC TCGCCACACT GGCACGCCGT GCGGCGAATT CGGCCTCAAC TTCATCGTTC GACCGCCCCA 2200
ATCCAGCGCG CATCGAAGCC CGGCCGGCTT CGACCTTGCG GCGCAGGAAC TCGTCGTACT CGCGCGACTC GCGCTGGCGC TGAACGAACT CGCGCATCAG 2300
CTCGCGCAGC ACTTGCGACG CCGGGCGATG GGCCGCCTCG GCTTCGGCCA TAAACTCGGC GCGCAACTCA GGCTCCAGCT TCATCGTGAA AACGGCTTGT 2400
TTTGACATGA TCGGGGCCTC CTGCCACTTG ATACTAACAA AGTATATACG CCGTCATTAC TAAGCGCTAT TCACAGAACG CTGCAAGGCG GGCGTGCGCT 2500
AGGCCAAGGC CTGTCGGAAA ACATTTGTTT TTCGACAGGC CTTCAACGGT CCTCTGCACC AACCTCCGAG TGGCCGCAAA ATTGTGCGGA AAACTCTGTC 2600
GCCAGACGCT ACCATACGGA AACCTCGTCT TAATGGTTTT CCGCTTATGT TGGTAGGTTA CATGCGCGTG TCGTCGGACT CCGACCGCCA GAGCACGAAC 2700
TTGCAGCGCG ATGCGCTGCT CGCCGTCGGC GTCGATGCGC GGCATCTGTT CGAGGATCAT GCTTCCGGCG CGAAGGACGA CCGCGCGGGC CTGGCGCGGG 2800
CGCTCGAATT CGTTCGCCCT GGCGACGTGT TGGTCGTGTG GAAGCTCGAC CGGCTCGGCC GTTCGTTGTC GCACTTGCTC GCCATCGTGA CCTCGCTCAA 2900
GAAAAAGCAG GTGGCGTTCC GCTCGCTGAC GGAGAACCTG GATACCACGA CGCCCTCGGG CGAGTTTCTG TTCCAGGTGT TCGGCGCGCT CGCGCAGTAC 3000
GAACGCGCCT TGATCCAGGA ACGTGTCGTC GCCGGTCTGG CTGCCGCCCG CAAACGCGGC CGGATCGGCG GCCGGCCGCA GGCGATCACC GGCGAGAAGC 3100
TGGAGGCCAT CGTCGCTGCG CTCGATGGCG GCATGTCCAA GGCGGCGGTG TGCCGCAACT TCGGCGTCAA GCGAACCACG CTGATCGAGA CCCTGGCACG 3200
GGTTGGTTGG ACGGGCTCTC GTGGAGCGTC ATCGCGATGA CGACCAAGAG CGAACGATTG ACCGTCCTGT CGGACGCCGA GCAGGAAGCC CTGTACGGCC 3300
TGCCGGACTT CGACGACGCC CAGCGGCTGG AATACTTGGC GTTGACTGAA ACCGAACTGG CGCTCGCCAG CAGCCGGCCT GGTCTCCATG CCCAGGTCTA 3400
TTGCATCTTG CAGATCGGTT ACTTCAAGGC CAAGCATGCC TTCTTCCGCT TCGACTGGAG TGAGGTCGAG CACGATTGCG CCTTCGTGCT GAGCCGCTAC 3500
TTCCACGGCG AGTCCTTCGA GCACAAGCCA ATCTCCAAGC ACGAGCACTA CACCCAGCGC GAGTGGATTG CCGATCTGTT CGGCTACCGG CCGTGGGCGG 3600
CCGAGTTCCT GGCGCAGCTC GCGCAGCAGG CCGCGCAGAC CGTGCGGCGC GACGTGATGC CGGGGTTCAT CGCCGCCGAG CTGATCGTCT GGCTAAACGA 3700
GCACAAGATC ATCCGGCCCG GCTATACCAC CCTGCAAGAG CTGGTGAGCG AAGCCCTGTC CGCCGAGCGT CGGCGGCTGG CTGGCCTGCT GTCGGAAGTG 3800
TTGGACGAAT CGGCCAAGGC CGCGCTGGGT CGGCTTCTAG TGCGTGACGA CACCCTGTCG CAATTGGCGG CGCTCAAGCA GGACGCCAAG GACTTTGGCT 3900
GGCGTCAGAT GGCCCGCGAA CGCGAAAAGC GCGCCACGCT GGAGCCGCTG CACCGGATCG CCAAGGCGCT GCTGCCCAAG CTCGGCGTCT CGCAGCAGAA 4000
TCTGCTGTAC TACGCCAGCC TGGCGAACTT CTACACCGTC CACGATCTAC GCAACCTGAA GGCCGATCAG ACCTACCTCT ACCTGCTTTG CTATGCCTGG 4100
GTGCGCTACC GGCAGCTTTC CGACAACCTG GTCGATGCGA TGGCCTACCA CATGAAGCAG TTGGAGGACG AAAGCAGTGC GGGCGCAAAG CAATCCTTTG 4200
TCGCCGAGCA GGTGCGCCGT CAGCAAGACA CACCGCAGGT CGGCCGCCTG CTGTCGCTTT ACATCGACGA CAGCGTGCCC GATCCCACGC CGTTCGGCGA 4300
TGTGCGCCAG CGCGCCTACA AAATCATGCC CCGCGATACG CTGCAAACCA CCGCGCAGCG CATGAGCGTG AAGCCGGTGA GCAAGCTGGC TTTGCACTGG 4400
CAGGCGGTGG ACGGCCTGGC TGAGCGCATC CGCCGCCATC TTCGGCCGCT GTATGTCGCG CTCGACCTCG CTGGCACTGA TCCGGGCAGC CCGTGGCTCG 4500
TGGCGCTGGC CTGGGCCAAG GACGTGTTCG CCAAACAGCA GCGCCTATCG CAACGGCCGC TCGCCGAATG TCCAGCGGCC ACGCTGCCGA AACGCTTGCG 4600
ACCGTACCTG CTGACCTTCG ATGCCGATGG CAAGCCGACG GACCTGCATG CCGACCGCTA CGAGTTCTGG CTGTACCGCC AGGTCAGGAA GCGCTTCCAG 4700
TCGGGTGAAC TCTACCTCGA CGACAGCTTG CAGCACCGGC ATTTTTCCGA CGAGCTGGTT TCGCTGGATG AGAAGGCCGC CGTGCTGGCG CAGATCGACA 4800
TCCCGTTCCT GCGGCAGCCA CTCGATGCCC AGCTCGATGC GCTCGCGACC GAGCTGCGCG CTCAGTGGCT GGCCTTCAAC CGCGAGCTGA AGCAGGGCAA 4900
GCTGACGCAC CTAGAATACG ACAAGGACAC GCAGAAGCTG ACATGGCGCA AGCCCAAGGG CGAGAACCAG AAGGCGCGCG AGAAGGCGTT CTACGAGCAA 5000
CTGCCGTTCT GCGACGTGGC CGACGTGTTC CGCTTCGTCA ACGGCCAGTG CCAGTTCCTG TCGGCGCTGA CGCCTTTGCA GCCGCGCTAT GCGAAGAAGG 5100
TCGCCGACGC CGACAGCCTG ATGGCGGTCA TCATCGCGCA GGCGATGAAC CACGGCAACC AGGTCATGGC ACGCACCAGC GACATCCCGT ACCACGTGCT 5200
GGAGAGCGCC TACCAACAGT ACCTGCGCCA CGCAACGCTG CACGCGGCCA ACGACTGCAT CAGCAACGCC ATCGCCGCGC TGCCGATCTT CCCGTACTAC 5300
TCGTTCGACC TCGATGCACT GTACGGTGCC GTCGATGGTC AGAAATTCGG CGTCGAGCGG CCGACCGTGA AAGCGCGCCA CTCGCGCAAA TACTTTGGGC 5400
GCGGCAAGGG CGTGGTCGCC TACACGCTGC TGTGCAACCA CGTGCCGCTC AACGGCTACC TGATCGGCGC GCACGATTAC GAGGCCCATC ACGTGTTCGA 5500
CATCTGGTAT CGCAACACGT CGGACATCGT GCCGACCGCG ATCACCGGCG ACATGCACAG CGTCAACAAG GCCAACTTCG CTATCCTGCA CTGGTTCGGC 5600
CTGCGTTTCG AGCCGCGCTT CACCGACCTT GGCGATCAGT TGAAGGAACT CTACAGTGCC GACGATCCGG CGCTGTACGA TCAGTGCCTG ATCCGGCCGG 5700
CCGGGAGAAT CGACCGCGAT CTCATAGTCA GCGAGAAGCC GAACCTCGAC CAGATTGTCG CCACGCTCGG ACTGAAGGAG ATGACGCAGG GCACGCTGAT 5800
CCGCAAGCTA TGCACCTACA CCGCGCCGAA CCCCACGCGG CGCGCGGTGT TCGAGTTCGA CAAGCTCATC CGCAGCATCT ACACGCTGCG CTACCTGCGC 5900
GATCCGCAAC TGGAGCGCAA CGTTCACCGC TCACAGAACC GCATCGAGTC CTATCACCAG CTACGCTCAA CCATCGCCCA GGTCGGCGGC AAGAAGGAAT 6000
TGACCGGGCG CACCGACATC GAAATTGAGA TCAGCAACCA GTGCGCCAGG CTGATCGCCA ACGCGGTCAT CTTCTACAAC TCGGCCATCC TCTCGCGGCT 6100
GCTGATGAAG TACGAGGCGA GCGGCAACGC CAAGGCGCAC GCTCTCCTGA CCCAGATATC GCCGGCGGCC TGGCGGCACA TCCTGCTGAA CGGGCATTAC 6200
ACCTTCCAGA GCGACGGCAA GATGATCGAC CTGGATGCGC TCGTGGCGGG GCTGGAGCTG GGATGACGGA AATTTCGGCG GTTCTGGCTT AGAACCCC

 Recombination Sites     

Name Coordinates Gene Sequence
res 2509-2639 131 GCCTGTCGGA AAACATTTGT TTTTCGACAG GCCTTCAACG GTCCTCTGCA CCAACCTCCG
AGTGGCCGCA AAATTGTGCG GAAAACTCTG TCGCCAGACG CTACCATACG GAAACCTCGT
CTTAATGGTT T
res_site_I 2509-2537 29 GCCTGTCGGA AAACATTTGT TTTTCGACA
res_site_II 2571-2614 44 TGGCCGCAAA ATTGTGCGGA AAACTCTGTC GCCAGACGCT ACCA
res_site_III 2615-2639 25 TACGGAAACC TCGTCTTAAT GGTTT

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
cupin2 Tn5501.5 438-920 Passenger Gene Other -
tnp ISGNB1.1 1075-2022 Transposase   -
parD Tn5501.5 2130-2408 Passenger Gene Antitoxin -
tnpR Tn5501.5 2662-3240 Accessory Gene Resolvase +
tnpA Tn5501.5 3237-6266 Transposase   +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
cupin2 Cupin2 Tn5501.5 483 438-920 -
Class:   Passenger Gene
Sub Class:   Other
Protein Sequence:  
MNTDPQQRRV APPFRAGYRQ RRTLGYSCSV ITSQAKLIFG VDMEMESRIF SVTEYIRPSD GEPIRSVVLE TKDSAVVVWH AHPGQEITAH VHPDGQDTWT
VISGEAEYYQ GGGKVAHLKA GDIAIAKPGQ VHGALNTSPV PFVFVSVVAS GNAGFALAEK

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp ISGNB1.1 948 1075-2022 -
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MSESKSAKPR YKTTNWAAYN AALKARGSLA IWLDKDMQWY APASGKRGRQ HVFSDAAIQF CLSIKCLFGL ALRQSLGLVQ SLLHLADLDW RVPDFSTVSR
RQKTLQVQLP YRASTSALNL LVDSTGIKFL GEGEWKRKKH GAEYRRQWRK VHLGIDASTL EIRAIEVTDN SVGDAPMLPG LLGQIPPGEP IASVSTDGAY
DTKACHAAIM ERGAQAIIPP RKNAQVWKSQ TPGAMVRNEA VLACKRLGWR IWKKWSGYHR RSLVETKMHC FKRLGERVMA RTFERQVTEL HVRVALLNRF
TQLGCPTTVA VPAVA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
parD ParD Tn5501.5 279 2130-2408 -
Class:   Passenger Gene
Sub Class:   Antitoxin
Sequence Family:  parD (PDB:4Q2U)
Comment:   RelB
Protein Sequence:  
MSKQAVFTMK LEPELRAEFM AEAEAAHRPA SQVLRELMRE FVQRQRESRE YDEFLRRKVE AGRASMRAGL GRSNDEVEAE FAARRASVAS QA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR Tn5501.5 579 2662-3240 +
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MRVSSDSDRQ STNLQRDALL AVGVDARHLF EDHASGAKDD RAGLARALEF VRPGDVLVVW KLDRLGRSLS HLLAIVTSLK KKQVAFRSLT ENLDTTTPSG
EFLFQVFGAL AQYERALIQE RVVAGLAAAR KRGRIGGRPQ AITGEKLEAI VAALDGGMSK AAVCRNFGVK RTTLIETLAR VGWTGSRGAS SR

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA Tn5501.5 3030 3237-6266 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MTTKSERLTV LSDAEQEALY GLPDFDDAQR LEYLALTETE LALASSRPGL HAQVYCILQI GYFKAKHAFF RFDWSEVEHD CAFVLSRYFH GESFEHKPIS
KHEHYTQREW IADLFGYRPW AAEFLAQLAQ QAAQTVRRDV MPGFIAAELI VWLNEHKIIR PGYTTLQELV SEALSAERRR LAGLLSEVLD ESAKAALGRL
LVRDDTLSQL AALKQDAKDF GWRQMARERE KRATLEPLHR IAKALLPKLG VSQQNLLYYA SLANFYTVHD LRNLKADQTY LYLLCYAWVR YRQLSDNLVD
AMAYHMKQLE DESSAGAKQS FVAEQVRRQQ DTPQVGRLLS LYIDDSVPDP TPFGDVRQRA YKIMPRDTLQ TTAQRMSVKP VSKLALHWQA VDGLAERIRR
HLRPLYVALD LAGTDPGSPW LVALAWAKDV FAKQQRLSQR PLAECPAATL PKRLRPYLLT FDADGKPTDL HADRYEFWLY RQVRKRFQSG ELYLDDSLQH
RHFSDELVSL DEKAAVLAQI DIPFLRQPLD AQLDALATEL RAQWLAFNRE LKQGKLTHLE YDKDTQKLTW RKPKGENQKA REKAFYEQLP FCDVADVFRF
VNGQCQFLSA LTPLQPRYAK KVADADSLMA VIIAQAMNHG NQVMARTSDI PYHVLESAYQ QYLRHATLHA ANDCISNAIA ALPIFPYYSF DLDALYGAVD
GQKFGVERPT VKARHSRKYF GRGKGVVAYT LLCNHVPLNG YLIGAHDYEA HHVFDIWYRN TSDIVPTAIT GDMHSVNKAN FAILHWFGLR FEPRFTDLGD
QLKELYSADD PALYDQCLIR PAGRIDRDLI VSEKPNLDQI VATLGLKEMT QGTLIRKLCT YTAPNPTRRA VFEFDKLIRS IYTLRYLRDP QLERNVHRSQ
NRIESYHQLR STIAQVGGKK ELTGRTDIEI EISNQCARLI ANAVIFYNSA ILSRLLMKYE ASGNAKAHAL LTQISPAAWR HILLNGHYTF QSDGKMIDLD
ALVAGLELG

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
ISGNB1.1-EF628291 ISGNB1.1 Insertion Sequence 1022-2068 1047

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
IRR ISGNB1.1 1022-1036 GGCTTTGTTG CACAA
IRL ISGNB1.1 2054-2068 AACACGTTGT CTCGG

 References     

Schlüter A, Krahn I, Kollin F, Bönemann G, Stiens M, Szczepanowski R, Schneiker S, Pühler A. IncP-1-beta plasmid pGNB1 isolated from a bacterial community from a wastewater treatment plant mediates decolorization of triphenylmethane dyes. Appl Environ Microbiol. 2007 Oct;73(20):6345-50. doi: 10.1128/AEM.01177-07. Epub 2007 Aug 3. PubMed ID: 17675426