|
|
|
|
Name: Tn5501.5 (Synonyms: Tn5501.13) |
|
Family: Tn3 Group: Tn3000 |
|
Evidence of Transposition: Yes |
|
|
Host |
|
|
Host Organism: | Uncultured bacterium | Molecular Source: | plasmid pGNB1 |
Place of Origin: | Bielefeld-Heepen, Germany | Date of Isolation: | 2007 |
| | Other Geographic Information: | activated sludge compartment of a wastewater treatment plant |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 38 bp) | | GGGGTTCTAAGCCGGAACCGCCGAAAATTCCGTCAGCC |
IRR (Length: 38 bp) | | GGGGTTCTAAGCCAGAACCGCCGAAATTTCCGTCATCC |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTTCTAA GCCGGAACCG CCGAAAATTC CGTCAGCCGA TCAACGTGGC TTGTCCCGCG CCCGGTCGAT GGGGTAGACC CAACAGTCGT GTCACTAGCC 100
GCCATTTCGA TCACGGCAAT GCCAGCCGGA CGTCACGTCC AGATTGTTCC GGTCTGGATG AGGCCGACTG ACGTCTCGGA TGACGGGTGG CATACAACTG 200
CTGTGAGTCC TGCAGGGGGG CAGCTGCCTG ACCGGACGGC GAGCATCAGC CCATCTCATG TATTAGTCAT GTCAGCTTTG ACACTGCGCA CGCGACGGCA 300
CCCGACCCGT TGCAGACCCC CAGACATATG GAAAGCTGAC GCTCAACGTG GAGTTAGCCG GCGCGGCGCG GCTTCATCGC GCAGCGTCCG TGTTGATGGA 400
TGGGTTAGAA CTCTCTGGGG CACACATTTG AGAAAGACTA TTTTTCAGCC AACGCAAAAC CCGCGTTGCC AGATGCAACC ACTGAGACAA ATACAAACGG 500
CACTGGACTA GTATTTAGAG CGCCATGCAC TTGGCCAGGT TTTGCTATGG CAATATCTCC AGCCTTTAGA TGAGCGACTT TGCCGCCTCC TTGGTAATAC 600
TCAGCCTCTC CAGAGATAAC CGTCCAAGTA TCTTGGCCGT CCGGGTGAAC ATGAGCGGTT ATTTCTTGCC CAGGATGGGC ATGCCAAACA ACAACTGCTG 700
AGTCCTTGGT TTCCAGAACC ACTGAACGAA TTGGTTCGCC GTCAGACGGA CGAATATATT CTGTTACCGA AAATATTCTT GATTCCATTT CCATATCCAC 800
TCCAAAAATC AATTTGGCTT GAGAAGTTAT TACAGAACAA CTGTATCCGA GAGTTCTACG TTGCCGATAA CCGGCCCGAA ACGGCGGCGC AACGCGCCGC 900
TGTTGGGGGT CCGTGTTCAT CGGCGGGTTA GACCTGTGCT TGCGTAAGAC GAATATCGAC TTCTCGGTCG ACGTATTTCC ATTCCTCTGA CGCACGCAGT 1000
TCGGAAAGCA GCCTGTTGAA CGGCTTTGTT GCACAAGTCG AAAAAAGTCG ATATGACCCC TACCCCAGAC GCAGCTACGC CACAGCAGGC ACCGCCACCG 1100
TTGTGGGGCA CCCAAGTTGC GTGAATCGGT TGAGCAAGGC CACCCGCACA TGCAACTCTG TCACCTGACG CTCGAATGTG CGGGCCATCA CCCGCTCGCC 1200
CAGGCGCTTG AAGCAATGCA TCTTGGTCTC CACCAGACTC CTGCGGTGGT AGCCGCTCCA CTTCTTCCAG ATACGCCAAC CCAGCCTCTT GCACGCCAGC 1300
ACCGCTTCGT TACGCACCAT GGCCCCAGGC GTTTGGGACT TCCACACCTG GGCGTTCTTG CGCGGCGGGA TGATCGCTTG TGCCCCACGT TCCATGATCG 1400
CCGCATGGCA CGCCTTGGTG TCGTATGCAC CGTCGGTACT CACACTGGCG ATCGGTTCAC CCGGTGGTAT CTGCCCCAGC AGCCCAGGCA GCATGGGCGC 1500
GTCGCCCACG CTGTTGTCTG TGACTTCAAT GGCCCGTATC TCCAGCGTGC TGGCATCAAT ACCCAGGTGC ACTTTGCGCC ACTGGCGCCG GTACTCTGCC 1600
CCGTGTTTCT TGCGTTTCCA CTCCCCCTCG CCCAGGAACT TGATGCCGGT GCTGTCCACC AGCAGGTTCA GCGCGCTGGT AGAGGCTCGG TAGGGCAGTT 1700
GCACCTGCAG TGTCTTTTGG CGTCGGCTCA CGGTGCTGAA GTCGGGCACC CTCCAATCCA GGTCTGCCAG GTGCAGCAGG CTTTGCACCA GTCCCAGGCT 1800
TTGGCGCAGG GCCAAACCAA AAAGGCATTT GATGCTCAGG CAGAACTGGA TGGCGGCGTC GCTGAACACG TGCTGCCTAC CCCGCTTGCC GCTGGCGGGT 1900
GCGTACCACT GCATGTCTTT GTCCAGCCAG ATCGCCAGCG AGCCCCGGGC TTTGAGCGCT GCGTTGTACG CTGCCCAGTT GGTGGTCTTG TAGCGGGGCT 2000
TGGCGCTCTT GGATTCACTC ATGTCCCGAG GCTAACAGTT CGGGGGGCAG GGTTTGTGCA ACAGAGCCGC TGCGATGTAG TCCCACACAT CGGCACGGTC 2100
TTGCTGCGCT TCGGGCGTCC AAACAACCCT CACGCCTGGC TCGCCACACT GGCACGCCGT GCGGCGAATT CGGCCTCAAC TTCATCGTTC GACCGCCCCA 2200
ATCCAGCGCG CATCGAAGCC CGGCCGGCTT CGACCTTGCG GCGCAGGAAC TCGTCGTACT CGCGCGACTC GCGCTGGCGC TGAACGAACT CGCGCATCAG 2300
CTCGCGCAGC ACTTGCGACG CCGGGCGATG GGCCGCCTCG GCTTCGGCCA TAAACTCGGC GCGCAACTCA GGCTCCAGCT TCATCGTGAA AACGGCTTGT 2400
TTTGACATGA TCGGGGCCTC CTGCCACTTG ATACTAACAA AGTATATACG CCGTCATTAC TAAGCGCTAT TCACAGAACG CTGCAAGGCG GGCGTGCGCT 2500
AGGCCAAGGC CTGTCGGAAA ACATTTGTTT TTCGACAGGC CTTCAACGGT CCTCTGCACC AACCTCCGAG TGGCCGCAAA ATTGTGCGGA AAACTCTGTC 2600
GCCAGACGCT ACCATACGGA AACCTCGTCT TAATGGTTTT CCGCTTATGT TGGTAGGTTA CATGCGCGTG TCGTCGGACT CCGACCGCCA GAGCACGAAC 2700
TTGCAGCGCG ATGCGCTGCT CGCCGTCGGC GTCGATGCGC GGCATCTGTT CGAGGATCAT GCTTCCGGCG CGAAGGACGA CCGCGCGGGC CTGGCGCGGG 2800
CGCTCGAATT CGTTCGCCCT GGCGACGTGT TGGTCGTGTG GAAGCTCGAC CGGCTCGGCC GTTCGTTGTC GCACTTGCTC GCCATCGTGA CCTCGCTCAA 2900
GAAAAAGCAG GTGGCGTTCC GCTCGCTGAC GGAGAACCTG GATACCACGA CGCCCTCGGG CGAGTTTCTG TTCCAGGTGT TCGGCGCGCT CGCGCAGTAC 3000
GAACGCGCCT TGATCCAGGA ACGTGTCGTC GCCGGTCTGG CTGCCGCCCG CAAACGCGGC CGGATCGGCG GCCGGCCGCA GGCGATCACC GGCGAGAAGC 3100
TGGAGGCCAT CGTCGCTGCG CTCGATGGCG GCATGTCCAA GGCGGCGGTG TGCCGCAACT TCGGCGTCAA GCGAACCACG CTGATCGAGA CCCTGGCACG 3200
GGTTGGTTGG ACGGGCTCTC GTGGAGCGTC ATCGCGATGA CGACCAAGAG CGAACGATTG ACCGTCCTGT CGGACGCCGA GCAGGAAGCC CTGTACGGCC 3300
TGCCGGACTT CGACGACGCC CAGCGGCTGG AATACTTGGC GTTGACTGAA ACCGAACTGG CGCTCGCCAG CAGCCGGCCT GGTCTCCATG CCCAGGTCTA 3400
TTGCATCTTG CAGATCGGTT ACTTCAAGGC CAAGCATGCC TTCTTCCGCT TCGACTGGAG TGAGGTCGAG CACGATTGCG CCTTCGTGCT GAGCCGCTAC 3500
TTCCACGGCG AGTCCTTCGA GCACAAGCCA ATCTCCAAGC ACGAGCACTA CACCCAGCGC GAGTGGATTG CCGATCTGTT CGGCTACCGG CCGTGGGCGG 3600
CCGAGTTCCT GGCGCAGCTC GCGCAGCAGG CCGCGCAGAC CGTGCGGCGC GACGTGATGC CGGGGTTCAT CGCCGCCGAG CTGATCGTCT GGCTAAACGA 3700
GCACAAGATC ATCCGGCCCG GCTATACCAC CCTGCAAGAG CTGGTGAGCG AAGCCCTGTC CGCCGAGCGT CGGCGGCTGG CTGGCCTGCT GTCGGAAGTG 3800
TTGGACGAAT CGGCCAAGGC CGCGCTGGGT CGGCTTCTAG TGCGTGACGA CACCCTGTCG CAATTGGCGG CGCTCAAGCA GGACGCCAAG GACTTTGGCT 3900
GGCGTCAGAT GGCCCGCGAA CGCGAAAAGC GCGCCACGCT GGAGCCGCTG CACCGGATCG CCAAGGCGCT GCTGCCCAAG CTCGGCGTCT CGCAGCAGAA 4000
TCTGCTGTAC TACGCCAGCC TGGCGAACTT CTACACCGTC CACGATCTAC GCAACCTGAA GGCCGATCAG ACCTACCTCT ACCTGCTTTG CTATGCCTGG 4100
GTGCGCTACC GGCAGCTTTC CGACAACCTG GTCGATGCGA TGGCCTACCA CATGAAGCAG TTGGAGGACG AAAGCAGTGC GGGCGCAAAG CAATCCTTTG 4200
TCGCCGAGCA GGTGCGCCGT CAGCAAGACA CACCGCAGGT CGGCCGCCTG CTGTCGCTTT ACATCGACGA CAGCGTGCCC GATCCCACGC CGTTCGGCGA 4300
TGTGCGCCAG CGCGCCTACA AAATCATGCC CCGCGATACG CTGCAAACCA CCGCGCAGCG CATGAGCGTG AAGCCGGTGA GCAAGCTGGC TTTGCACTGG 4400
CAGGCGGTGG ACGGCCTGGC TGAGCGCATC CGCCGCCATC TTCGGCCGCT GTATGTCGCG CTCGACCTCG CTGGCACTGA TCCGGGCAGC CCGTGGCTCG 4500
TGGCGCTGGC CTGGGCCAAG GACGTGTTCG CCAAACAGCA GCGCCTATCG CAACGGCCGC TCGCCGAATG TCCAGCGGCC ACGCTGCCGA AACGCTTGCG 4600
ACCGTACCTG CTGACCTTCG ATGCCGATGG CAAGCCGACG GACCTGCATG CCGACCGCTA CGAGTTCTGG CTGTACCGCC AGGTCAGGAA GCGCTTCCAG 4700
TCGGGTGAAC TCTACCTCGA CGACAGCTTG CAGCACCGGC ATTTTTCCGA CGAGCTGGTT TCGCTGGATG AGAAGGCCGC CGTGCTGGCG CAGATCGACA 4800
TCCCGTTCCT GCGGCAGCCA CTCGATGCCC AGCTCGATGC GCTCGCGACC GAGCTGCGCG CTCAGTGGCT GGCCTTCAAC CGCGAGCTGA AGCAGGGCAA 4900
GCTGACGCAC CTAGAATACG ACAAGGACAC GCAGAAGCTG ACATGGCGCA AGCCCAAGGG CGAGAACCAG AAGGCGCGCG AGAAGGCGTT CTACGAGCAA 5000
CTGCCGTTCT GCGACGTGGC CGACGTGTTC CGCTTCGTCA ACGGCCAGTG CCAGTTCCTG TCGGCGCTGA CGCCTTTGCA GCCGCGCTAT GCGAAGAAGG 5100
TCGCCGACGC CGACAGCCTG ATGGCGGTCA TCATCGCGCA GGCGATGAAC CACGGCAACC AGGTCATGGC ACGCACCAGC GACATCCCGT ACCACGTGCT 5200
GGAGAGCGCC TACCAACAGT ACCTGCGCCA CGCAACGCTG CACGCGGCCA ACGACTGCAT CAGCAACGCC ATCGCCGCGC TGCCGATCTT CCCGTACTAC 5300
TCGTTCGACC TCGATGCACT GTACGGTGCC GTCGATGGTC AGAAATTCGG CGTCGAGCGG CCGACCGTGA AAGCGCGCCA CTCGCGCAAA TACTTTGGGC 5400
GCGGCAAGGG CGTGGTCGCC TACACGCTGC TGTGCAACCA CGTGCCGCTC AACGGCTACC TGATCGGCGC GCACGATTAC GAGGCCCATC ACGTGTTCGA 5500
CATCTGGTAT CGCAACACGT CGGACATCGT GCCGACCGCG ATCACCGGCG ACATGCACAG CGTCAACAAG GCCAACTTCG CTATCCTGCA CTGGTTCGGC 5600
CTGCGTTTCG AGCCGCGCTT CACCGACCTT GGCGATCAGT TGAAGGAACT CTACAGTGCC GACGATCCGG CGCTGTACGA TCAGTGCCTG ATCCGGCCGG 5700
CCGGGAGAAT CGACCGCGAT CTCATAGTCA GCGAGAAGCC GAACCTCGAC CAGATTGTCG CCACGCTCGG ACTGAAGGAG ATGACGCAGG GCACGCTGAT 5800
CCGCAAGCTA TGCACCTACA CCGCGCCGAA CCCCACGCGG CGCGCGGTGT TCGAGTTCGA CAAGCTCATC CGCAGCATCT ACACGCTGCG CTACCTGCGC 5900
GATCCGCAAC TGGAGCGCAA CGTTCACCGC TCACAGAACC GCATCGAGTC CTATCACCAG CTACGCTCAA CCATCGCCCA GGTCGGCGGC AAGAAGGAAT 6000
TGACCGGGCG CACCGACATC GAAATTGAGA TCAGCAACCA GTGCGCCAGG CTGATCGCCA ACGCGGTCAT CTTCTACAAC TCGGCCATCC TCTCGCGGCT 6100
GCTGATGAAG TACGAGGCGA GCGGCAACGC CAAGGCGCAC GCTCTCCTGA CCCAGATATC GCCGGCGGCC TGGCGGCACA TCCTGCTGAA CGGGCATTAC 6200
ACCTTCCAGA GCGACGGCAA GATGATCGAC CTGGATGCGC TCGTGGCGGG GCTGGAGCTG GGATGACGGA AATTTCGGCG GTTCTGGCTT AGAACCCC
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
res |
2509-2639 |
131 |
GCCTGTCGGA AAACATTTGT TTTTCGACAG GCCTTCAACG GTCCTCTGCA CCAACCTCCG AGTGGCCGCA AAATTGTGCG GAAAACTCTG TCGCCAGACG CTACCATACG GAAACCTCGT CTTAATGGTT T |
res_site_I |
2509-2537 |
29 |
GCCTGTCGGA AAACATTTGT TTTTCGACA |
res_site_II |
2571-2614 |
44 |
TGGCCGCAAA ATTGTGCGGA AAACTCTGTC GCCAGACGCT ACCA |
res_site_III |
2615-2639 |
25 |
TACGGAAACC TCGTCTTAAT GGTTT |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
cupin2 |
Tn5501.5 |
438-920 |
Passenger Gene |
Other |
- |
tnp |
ISGNB1.1 |
1075-2022 |
Transposase |
|
- |
parD |
Tn5501.5 |
2130-2408 |
Passenger Gene |
Antitoxin |
- |
tnpR |
Tn5501.5 |
2662-3240 |
Accessory Gene |
Resolvase |
+ |
tnpA |
Tn5501.5 |
3237-6266 |
Transposase |
|
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
cupin2 |
Cupin2 |
Tn5501.5 |
483 |
438-920 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Protein Sequence:
|
MNTDPQQRRV APPFRAGYRQ RRTLGYSCSV ITSQAKLIFG VDMEMESRIF SVTEYIRPSD GEPIRSVVLE TKDSAVVVWH AHPGQEITAH VHPDGQDTWT VISGEAEYYQ GGGKVAHLKA GDIAIAKPGQ VHGALNTSPV PFVFVSVVAS GNAGFALAEK
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnp |
Tnp |
ISGNB1.1 |
948 |
1075-2022 |
- |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MSESKSAKPR YKTTNWAAYN AALKARGSLA IWLDKDMQWY APASGKRGRQ HVFSDAAIQF CLSIKCLFGL ALRQSLGLVQ SLLHLADLDW RVPDFSTVSR RQKTLQVQLP YRASTSALNL LVDSTGIKFL GEGEWKRKKH GAEYRRQWRK VHLGIDASTL EIRAIEVTDN SVGDAPMLPG LLGQIPPGEP IASVSTDGAY DTKACHAAIM ERGAQAIIPP RKNAQVWKSQ TPGAMVRNEA VLACKRLGWR IWKKWSGYHR RSLVETKMHC FKRLGERVMA RTFERQVTEL HVRVALLNRF TQLGCPTTVA VPAVA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
parD |
ParD |
Tn5501.5 |
279 |
2130-2408 |
- |
Class: | Passenger Gene |
Sub Class: | Antitoxin |
Sequence Family: | parD (PDB:4Q2U) |
Comment: | RelB |
Protein Sequence:
|
MSKQAVFTMK LEPELRAEFM AEAEAAHRPA SQVLRELMRE FVQRQRESRE YDEFLRRKVE AGRASMRAGL GRSNDEVEAE FAARRASVAS QA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
Tn5501.5 |
579 |
2662-3240 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MRVSSDSDRQ STNLQRDALL AVGVDARHLF EDHASGAKDD RAGLARALEF VRPGDVLVVW KLDRLGRSLS HLLAIVTSLK KKQVAFRSLT ENLDTTTPSG EFLFQVFGAL AQYERALIQE RVVAGLAAAR KRGRIGGRPQ AITGEKLEAI VAALDGGMSK AAVCRNFGVK RTTLIETLAR VGWTGSRGAS SR
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
Tn5501.5 |
3030 |
3237-6266 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MTTKSERLTV LSDAEQEALY GLPDFDDAQR LEYLALTETE LALASSRPGL HAQVYCILQI GYFKAKHAFF RFDWSEVEHD CAFVLSRYFH GESFEHKPIS KHEHYTQREW IADLFGYRPW AAEFLAQLAQ QAAQTVRRDV MPGFIAAELI VWLNEHKIIR PGYTTLQELV SEALSAERRR LAGLLSEVLD ESAKAALGRL LVRDDTLSQL AALKQDAKDF GWRQMARERE KRATLEPLHR IAKALLPKLG VSQQNLLYYA SLANFYTVHD LRNLKADQTY LYLLCYAWVR YRQLSDNLVD AMAYHMKQLE DESSAGAKQS FVAEQVRRQQ DTPQVGRLLS LYIDDSVPDP TPFGDVRQRA YKIMPRDTLQ TTAQRMSVKP VSKLALHWQA VDGLAERIRR HLRPLYVALD LAGTDPGSPW LVALAWAKDV FAKQQRLSQR PLAECPAATL PKRLRPYLLT FDADGKPTDL HADRYEFWLY RQVRKRFQSG ELYLDDSLQH RHFSDELVSL DEKAAVLAQI DIPFLRQPLD AQLDALATEL RAQWLAFNRE LKQGKLTHLE YDKDTQKLTW RKPKGENQKA REKAFYEQLP FCDVADVFRF VNGQCQFLSA LTPLQPRYAK KVADADSLMA VIIAQAMNHG NQVMARTSDI PYHVLESAYQ QYLRHATLHA ANDCISNAIA ALPIFPYYSF DLDALYGAVD GQKFGVERPT VKARHSRKYF GRGKGVVAYT LLCNHVPLNG YLIGAHDYEA HHVFDIWYRN TSDIVPTAIT GDMHSVNKAN FAILHWFGLR FEPRFTDLGD QLKELYSADD PALYDQCLIR PAGRIDRDLI VSEKPNLDQI VATLGLKEMT QGTLIRKLCT YTAPNPTRRA VFEFDKLIRS IYTLRYLRDP QLERNVHRSQ NRIESYHQLR STIAQVGGKK ELTGRTDIEI EISNQCARLI ANAVIFYNSA ILSRLLMKYE ASGNAKAHAL LTQISPAAWR HILLNGHYTF QSDGKMIDLD ALVAGLELG
|
|
Internal Transposable Elements (TE) |
|
|
TnCentral Accession |
TE Name |
Type |
Coordinates |
Length |
ISGNB1.1-EF628291 |
ISGNB1.1 |
Insertion Sequence |
1022-2068 |
1047 |
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
IRR |
ISGNB1.1 |
1022-1036 |
GGCTTTGTTG CACAA |
IRL |
ISGNB1.1 |
2054-2068 |
AACACGTTGT CTCGG |
|
References |
|
|
Schlüter A, Krahn I, Kollin F, Bönemann G, Stiens M, Szczepanowski R, Schneiker S, Pühler A. IncP-1-beta plasmid pGNB1 isolated from a bacterial community from a wastewater treatment plant mediates decolorization of triphenylmethane dyes. Appl Environ Microbiol. 2007 Oct;73(20):6345-50. doi: 10.1128/AEM.01177-07. Epub 2007 Aug 3. PubMed ID: 17675426
| |
| | |
|
|