|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Internal Transposable Elements | |
|
|
Internal Repeats | |
|
|
References | |
|
|
|
|
|
|
|
|
|
Name: Tn5501.1 |
|
Family: Tn3 Group: Tn3000 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Acidovorax sp. RAC01 | Molecular Source: | chromosome |
Place of Origin: | USA | Date of Isolation: | 2016 |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 38 bp) | | GGGGTTCTAAGCCGGAACCGCCGAAAATTCCGTCAGCC |
IRR (Length: 38 bp) | | GGGGTTCTAAGCCAGAACCGCCGAAATTTCCGTCATCC |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTTCTAA GCCGGAACCG CCGAAAATTC CGTCAGCCGA TCAACGTGGC TTGTCCCGCG CCCGGTCGAT GGGGTAGACC CAACAGTCGT GTCACTAGCC 100
GCCATTTCGA TCACGGCAAT GCCAGCCGGA CGTCACGTCC AGATTGTTCC GGTCTGGATG AGGCCGACTG ACGTCTCGGA TGACGGGTGG CATACAACTG 200
CTGTGAGTCC TGCAGGGGGG CAGCTGCCTG ACCGGACGGC GAGCATCAGC CCATCTCATG TATTAGTCAT GTCAGCTTTG ACACTGCGCA CGCGACGGCA 300
CCCGACCCAT TGCAGCCGGT CATCTTTCAA ACACCAGCTG CCGAACTTAG GCCGACAGCC GCCAACTGCC CCGCGCTGAA TTGACTAGCG TTCGCTCCAT 400
GAAGTTTGAG TGGCCATTGA TTTCATTCGA CGGGGGTACA AGCGGGACGC CCTCAGTCCC CAACAATTTT CCGCTCTCGG ATCAGCTTGC CGAATCGCAC 500
ACCGTCCTCC AGCGCTTTGG CATGGAATTG AGCGGGCGTC ATCGGTGCTG GCTCGCCACC GAGTGCCGCG ATGCGCTCGC GTGGACCGGG CAAGGCAAGA 600
GCCTTGTTGA TCTCGCGGTT GAGCCGATTA ACGAAGTCAG CAGGCGTGCC GGCGGGGGCG TACAGCCCGA ACACGGTGTC CGCATCGAAG CCTTTTAGGC 700
CGACCTCGTC GAGCGTCGGC ACATCCGGGA ACAAGGGCGA GCGCTTCAAA CTCCCAACCG CCAGCAGCTT GAGTTTGCCG GCCTTGATGT GCGGCATCGC 800
CACGCCGGGA TCGAATAGGA AATCCAGCTG TCCGGCCAGT AGATCAGTCA ATGCCGGGGC TGCACCCCGA TAGGGGACGT GCGTGGCGAA CACGTTCGCC 900
TGCGACTTCA TCATCTCTGC CGCCAGGTGC GGAGAACTGC CATTGCCGGG AGAGCCGAAT GACAGCTTGC CCGGCCTGGC CTTGAGTTGT GCAAGAAATT 1000
CCTTGGCATC CTTCGCGGGA AAGCCCGGCG ATGTGACCAG GAACACGAGC ACCCGTGCAA CCGCAGCCAC GGGCTCCAAA TCTTTGGAAG GGTCGAACGA 1100
CATTTTGGGA TAGATGTGCG GATTGACCGA CACCATGCCT TCCGAACTCA TCAGCAAGGT GTAGCCGTCG GGCGCCGCGC GGGCCACTGC CTCCCCGCCG 1200
ATGTTGCCGC CCGCGCCGCT CCGGTTCTCC ACAACCACGG GCTGGCCCAA GGCCTCCTGC AAGGACTGCG TGACTGAGCG TGCGATCTGG TCCGCGGCGC 1300
CACCGGGTGG AAAGTTGACG ATGATCTTGA CTGGTTTAGA CGGTCATGCT TGGCCCAATG CCAGGCATGG CACCACCAGG GACAGCGCCG CGACGGCTGC 1400
CATCACCGCG CGCCTGCCAG CCTGCGCGCG GGGGCGGCAG GCAAAACCTA ATTTGTTCAT GGGTGTCTCC AGGTGTGTAT ATGCGCGACG TGTGGCCGGA 1500
ACGTGGCGTA GCCACTCCCG GCCGGTCTCA GGCGCCAAGG GAACGCGCCG GACTCAGTCC AGCGAAATGC CGCGTTCCTT GATCAGGCGT GGCCAGAACT 1600
GGCCTTCCTG GGCCCAGTGC TTTTGCATGG TCGCGGCGTC GGAGGGCGTC GGCTCAATGC CCATGGTGGC GAACTGGGTC CGCACGGCAT CCGACTGGAT 1700
GGCCTTTTGC AGCTCGGTGC TGAGCTTGGC CACGATCTCC TTGGGTGTGG ACGACGGAAC GACGAGGCCC TGCCACGCTA CGGCCTCGGT ATCTGTGTAG 1800
CCCAACTCCA TCAGCGTGGG CACATCCGGC ATGGATGCCA GGCGCGACTT GGAGAAGGTG GCAAGCGGCT TGAGCTTGCC GGCCTTCATC ATTTGCATGC 1900
CAGCAGCCGA GTCCACCACC ATCAGCGGCA ACTGCCCACC CACGACGTCC TGGATCGCTG GCGCCGCGCC GCGGTAGGCC ACATGCACAA GGTCTACGTT 2000
GTTGCGGGCC TTGAGCATCT CCATGGCCAG GTGGTGCGGG CTACCGACGC CTGGCGTGGC AAAGCTGAGC TTGCCTGGGT TGGCCTTGGC GGCCGCGATC 2100
AGTGCCTTCA CATCGCTATA GCCAGCATTC GGGTTGACGG CCAACAGCAA CGGGAACCGC GCCATCATGC CAATGGACGT GAAGTCCTTC TGCGGGTCGT 2200
ACGAAAGCTT CTTGAACAGG GCCGTGTTGA ACACAAGCGT GCCGTTGTCG GCCGTGAATA CGGTGTAACC GTCACCCGGC GACTTGGCCA CGTTCTCGGC 2300
GCCAATCACC GTGGCACCGC CTGGCTTGTT GTCGATCAGC ACTGGCTGGC CGAGCTGTGT GGACAGTTGT GAGCCCACTG CACGCCCCAG CGCATCCGAG 2400
CCACCGCCGG CCGCGTAGGC CACGACCCAG CGCACAGGCT TCGCGGGAAA GTTGTCGGCG TGGGCTGCGC CAAGGTAACT GCTGGCCACG AAGGTGGCTG 2500
CCATCAAGAG GGAACGTCGG GTCGTCATGT TGTCTCCGTT AGTTGAGGAA GGGAAAGTGT TTTATTGAGC GGCTGGCGTC GCTTAGCCGG TTTGGCGTGC 2600
CTGAACCAAC CGCACGATCT GCAGTAGCGT GCGTGTGCAG GGCACAGGAA CGCCAAAGGT CACTGCAGCG GCCACCACCG CGCCGTTGAT GAACTCGATT 2700
TCGGTCGGCC TGCCGGCCAG CACGTCCTGC AGCATTGAGG GCTTGTGGCC CACATGGCGT GCGATGGCGT CGGCCACGCG CGCCTTGCAG GCATCTGCGT 2800
GGACCGCGAT GCCCTGCGCC TGCGCGACGG CCAGCACCTC GTCGACCACG GCGAACGCCA GGGCCCGACC GTCTTCGATC GCGCCAAGTT GGTCCACCGT 2900
GCACTGGGTC GCGGTGCAAA TGCTGTTGAG GGCAGCGTTG AAGGCCACCT TCTCCCAGAT GGCGGTCCAC ACCGCGGCAT CGGCGGTACA GGCTAGACCG 3000
GCTTCGTCAA GTGCCTGCGC CACGGTGGCC ACGAAAGGCC GATCCTGCCC GTCCGCTGAC ATCAGCCGCA CCACGCCCTG GCCGTGCGAG TGCACATGGC 3100
CGGGCCCCAC CAGGTCGGCA GGCCAGGTGG TCACGCCGAT CAGGATGCGC TCCGCGGAAA CGTGCTGGCC GATGACCTCC ACGTTGCCCA GGCCGTTCTG 3200
CAGCGACAGC ACATAGGTCT GGGGGCCGAT CAAGCGGGCC ACGCCAGCCA GGGCGCTCGC TGTGTGCAGG GTCTTGGTGA ACACCAGCAC CAGATCCGGC 3300
TGGCCTTCGG CCTGTTCCGG TCGGCAGGCG CGCAGGGCGC GGATGTTGCG CTCACCCCGG TCGTTGTGCA GACGCAGTCC TTGCGACTGG ATGGCCTGCA 3400
GGTGGGCGGC ATTCACATCG ACCAGCGTCA CATCGTGACC CTGCTCGGCC AACAAGCCAC CAAACAGCGA TCCCATGGCG CCAGCGCCTA CGACTGTGAT 3500
CTTCATGGGG TCCACCTGTC AGAACGGGTA GTGCCGCTCG GTGGTCTGCA CCGTGATCCA GCGCAGCTCG GTGAATGCGT CGATGCCGGC GCGGCCGCCG 3600
AAGTGGCCAT ATCCGGACTC CTTCACGCCG CCGAAGGGCA TCTGAGGTTC GTCGTGCACG GTCGGCGCGT TGATGTGGCA GATGCCCGAC TCGATGCGCT 3700
GGGCCACGTT CCAGGCCCGG GCGACATCGC GGCTGAAGAC CGCAGCCGAC AGGCCGAAGG CGTTGTCGTT GGCGCAAGCA ACGGCTGCAT CGACGCCTTC 3800
CACGCGCACG ATGGGCTTGA CCGGGCCGAA GCTCTCTTCC TGGTAGATGC GCATGGTGGG CGTCACGTGG TCGAGCAGCG TAGCCGGCAT CAGCGTGGAG 3900
TCGGCCTTGC CACCGCAGAC CAGCGTGGCG CCCTTGGCCA GCGCGTCGTC GATCAGCGCG TTGCAGCGCT CCACCGTACG CATGTCGACC ACCGAGCCCA 4000
GCACCACCGG ACCCTTGCGC GGATCGCCCA GCGGCAGCGC CTGTGCCTTG GCGGCCAGCT TGGCGACGAA GGCGTCGGCG ACCCTCGTGT CCACGATGAT 4100
GCGTTCGGTG GACATGCAGA TCTGGCCCGA ATTGGCGAAG GCGCCGAAAG CCGCGCCGTT CACGGCGGCG TCCAGGTCCG CATCCTCCAG CACCAGCAGG 4200
GGGGCCTTGC CGCCCAGTTC GAGCACGGCC GGCTTGAGGT GCTGCGCGCA CAGCGCGGCG ATGATCCGGC CCACGTGCGT GGAACCGGTG AAGTTGACGC 4300
GGCGCACGGC AGGGTGGGCG ACCATGGCCT CGACCACGGC GCCGGCGTCG GCCGGGTCGT TGGTCACGAA GTTCACCACG CCCGGCGGCA GGCCGGCCTC 4400
CTGCAGCGCC TCGATGATGA GGCCGTGCGT GGCGGGCGAC AGCTCCGAGC CCTTGAGCAC GACGGTGTTG CCGCAGGCCA GTGGCGTCGC CACCGCGCGC 4500
ACACCAAGGA TCACCGGCGC ATTCCACGGT GCCATGCCCA GTACCACGCC GGCGGCCTGG CGCACGCCCA TGGCCAGGCT GCCCGGTACG TCGGAGGGGA 4600
TGACCTCACC GCCGACCTGG GTGGTCAGCG AAGCAGCTTC CTGCAGCATG CCTGCGGCCA CGTGCACATT GAAGCCCGCC CAGATGGCCG AGCTACCGGT 4700
CTCCGCTGCC ATGGCGGCGG TGAAGGCATC AGCCTTGGCT TCCAGTGCCG CGGCGGCCTT CAGCAGCAGG CCGCGGCGGG CGCCGGGGCC CATGCGCGAC 4800
CAGGCCGGGA AGGCGCGCTT GGCGGCCTCG ACCGCGCGCA CGGCATCGGC CACCGAGGCC GCCGGGGCGC GCGTGGCGAC CTCGCCGTCC AGCGGGTTGC 4900
GCCGCTCGAA GAAGCGTTCG TGGCTGGCCA GGCACTTTTC GCCGTTGATG AGAAGTTGGA TCTGCTGCAT GTGTGTCTCC ACGGGTTGAA ATCAAGGGCG 5000
CGCGGCGGTC ATGCCACGAC GAGTTCGATC AGGAGCGGAC CAGCGTGGCT CAGCGCACCT GCCAGCGCAT CACGCAGGCC CTCGGGCTGC TCGACACGCA 5100
GGCCCGGCAG GCCGTGGCCC TGTGCCAGCG CCACGAAGTC CAGGCCTTGC AAGTCGGTGC CTTCCGGCTT GTCGCCTAGC GCGTAGCCAA ACACGGGGGC 5200
GAAGTCCTGC AGCGCGGCGT AGCGCTGGTT GTTCAGGATC ACGAAGGTGA TGGGCAACTG CAGCTTGGCG GCCGTGAACA GGGCCTGGAT CGCGTACAGG 5300
CTGGAGCCGT CGCCAATCAG CGCGATGACG CGGCCGGGCA AGCCCAGCTT CTGACGGCCC AGCGCCACAC CCACTGCGGC CGGCATGCCA TGCCCCAGAC 5400
CACCGCTGGC CATGGTGTAG AAGGAGTCGG CGCCGCGCAT CGGCAGGCAG GCCTGCATGA CGCCGCGCGA GCCGGGCGCT TCCTCGACCA CCACGTCATC 5500
AGGGCCGCGT GCCGCGTCCA GCGCCTGCAG CGCGAAGGCC ACGGACATCG GCAGGGACGG CTCGGCACGC GCCGCCGGCC GGCGGGGGTC AGGTGCCCTG 5600
CGCGCCGCAG CCGGTGCCGG CGCGGCCAGC AGCGCGGCGA CGGCCAGGTG CACGCTGGAG ACGATGCCCG TGCCCGCGGG GGTCCACGCC GCCATCTGCG 5700
GGTCGTCGAC GATCTGCCAC AGGTCGGCGC CGGGCGGCAT GTGCGGGCCT TGGCCTTCCA CGTGGTAGCT GAAGGCCGGC GCCCCCAGCA CGACGACGAG 5800
GTCGTGGCCG GCCAGCGCTT CGACGATGCG CTCGCGCATC GCGGCCAGGA AGCCAGCAAA CAAGGGGTGT TGCTCCGGAA AGCCGCAGCG CGCGCTCATC 5900
GGGGCGGTGT AGACGGCCGC CTGGTGGCGC TCAGCCAGCG CCACGACGGC ATCGACGGCG CCGTCGCGGT CCACGCCCGC GCCGACCACA AAGGCCGGGC 6000
GCAGGCTGCC GTCCAGCGCC TGCGCCAGCT GCGCCAGCAT GGCGGGGTCC GGCGCCGTCG CGTGGCTCAC CCTGCGGGCG GGCACCCAGT CGGCCGGGCG 6100
GTCCCAGTCG TCCGCAGGAA TGGAGATGAA CACCGGTCCG CACGGCGCCT GCATGGCGAT CTGGTAGGCG CGCAGCAGCG CCAGCGGCAC GTCTTCCGCG 6200
CGCGCCGGCT CGATGGCCCA CTTCACGTAG GGGCGCGGCA GCTCGGTGGC CTGGCTGGCC GACAGGAACG GGTCGAAGGG CAGGATGGCG CGCGACTGCT 6300
GGCCGGCCGT GACGAGCATC GGCGTCTGGT TGCGAAACGC GGTGAAGATG TTGCCCATGG CGTGCCCGAC GCCGGCCGCC GAATGCAGGT TGACCATGGC 6400
GGCATTGCGC GTGACCTGCG CGTACCCGTC GGCCATGCCG ACCACCACCG ACTCCTGCAG GCCCAGCACG TAGCGCATGT CCTTGGGGAA GTCGCGGAAC 6500
ATGGGCAGTT CGGTCGAGCC GGGGTTGCCG AAAACGGTGT CAATGCCGAG GTCCTGCATC AGGCGCAGGA AAGCCTCGCG CACGGTCGGA CGGTTGCTGG 6600
AGGTGGACAT GGTGTTGCAG GCGCTGGTGT CGAAGTGCCT GTAGTGTCGA AGCACCTTGA CATTTGGGCA ACAATATGGA CCTACTAAGT GATATTGGAT 6700
AAACCAATAA ATGGAACAGC TCGACCTGAA CCTGTTACGC GTGTTCGACG TGGTCTACCG CACCCGCAAC GTGAGCCGCG CGGCCGAGCT GCTCGGCATG 6800
TCGCAACCGT CCACGAGCCA GGCGCTGACC CGGCTGCGTT TGGCGCTGGG TGATCCGCTG TTCGAGCGCA TACGCGGCGG TGTGCGGCCC ACGCCGCGCG 6900
CTGAAGATCT GGCCCGCTCC GTCCAGTCAG GGTTGGCGCA AATCGAGGCT GGCCTCGCGT CGGAGAACGC ATTTGATCCG GCACGGTCCT CCGCCGAGCT 7000
TCGCATCCAT CTCACCGACA TTGGCGAGGC GCGCTTCCTT CCGCCGCTGA TGGGCGAGAT GAGGCGTCTC GCGCCAGGCA TTCAGCTACG GGCCAGAGCC 7100
TGGCCGCAGC AGGAAATTGG CGCGGCGCTG GACAGCGGCG ACATCCACCT GGCCATCGGT TTTCTGCCCG AGCTCACCAC CTCGGCCCAT GCGAAGCTCT 7200
TGACAGATCG TTATGTCGTG CTGCTGCGCG CAGACCACCC CGTGGCGCAG CAAGCCAGGC TGACACTGCG CAAGATGCAG TCTCTCGACT GGATTGCGGT 7300
GCGTTCGCAC ACGCAAACGC TGCAGATGCT GCGGGCGGCC AACCTGGACT GCAAGGTCAC GCTGACGACC TCGACCTTCT TGGCGCTGCC CGACATCGTG 7400
AAGAATACCG ATCTGGCCGT ACTGATGCCG CAGCAGATCG CGCGGGAGTT CATGCCGGCT AACCGTCTCA AGTTGCTCGA CTTCGACCTG CCGTCCAACG 7500
ACTTCACTGT ATCTGTCCAC TGGAGCCGGC GCCATGCGCA TTCGCCCTTG GTGAGGTGGA GCCGCGAGGT ACTGCTGCGA CTGTTTCAAC GTGACTAAAA 7600
CAATTTGGCT GCGGGGCGTA TTGGGAACTC CGAGCGGGTT GCCTTGAAAG TTGCGAAAGT CGTGGACGGT AGGGCTTGCA TAGCGCACCG GCGCTTGGGG 7700
TAACGAGCGG CCGCTTTGCC CGAAATCGGA ACTTTGGGTG TCAATGACCA GTGACAGCTC CTGGCCGCCT GCCGCCGACC GCGGTCTGGC TCGAAAGCAG 7800
TCTGTCAACG CGCCGTTTCA TTGTTGCCAT GATCATATCT TAATCGCGCA CAGGCGGCCA CAGGCGGGCA GTATGAACCA GCGTCAGTAT CCACACCGTT 7900
TCGCCGTCGA TCTGATACAC CAGGCGATAG CTTTCGTGCG GGATCAACTC GCGGGTCCCG GGAATCTTTC CCGGCTTGCC CAGCATGGGG TGCTGGATCA 8000
AGCGGGCGGC CGCGTCGCTG AAAATCTCAT CCATCCGGGC CGCCGCGCGC GGATTGTCGG CTGCGATGTA GTCCCACACA TCGGCACGGT CTTGCTGCGC 8100
TTCGGGCGTC CAAACAACCC TCACGCCTGG CTCGCCACAC TGGCACGCCG TGCGGCGAAT TCGGCCTCAA CTTCATCGTT CGACCGCCCC AATCCAGCGC 8200
GCATCGAAGC CCGGCCGGCT TCGACCTTGC GGCGCAGGAA CTCGTCGTAC TCGCGCGACT CGCGCTGGCG CTGAACGAAC TCGCGCATCA GCTCGCGCAG 8300
CACTTGCGAC GCCGGGCGAT GGGCCGCCTC GGCTTCGGCC ATAAACTCGG CGCGCAACTC AGGCTCCAGC TTCATCGTGA AAACGGCTTG TTTTGACATG 8400
ATCGGGGCCT CCTGCCACTT GATACTAACA AAGTATATAC GCCGTCATTA CTAAGCGCTA TTCACAGAAC GCTGCAAGGC GGGCGTGCGC TAGGCCAAGG 8500
CCTGTCGGAA AACATTTGTT TTTCGACAGG CCTTCAACGG TCCTCTGCAC CAACCTCCGA GTGGCCGCAA AATTGTGCGG AAAACTCTGT CGCCAGACGC 8600
TACCATACGG AAACCTCGTC TTAATGGTTT TCCGCTTATG TTGGTAGGTT ACATGCGCGT GTCGTCGGAC TCCGACCGCC AGAGCACGAA CTTGCAGCGC 8700
GATGCGCTGC TCGCCGTCGG CGTCGATGCG CGGCATCTGT TCGAGGATCA TGCTTCCGGC GCGAAGGACG ACCGCGCGGG CCTGGCGCGG GCGCTCGAAT 8800
TCGTTCGCCC TGGCGACGTG TTGGTCGTGT GGAAGCTCGA CCGGCTCGGC CGTTCGTTGT CGCACTTGCT CGCCATCGTG ACCTCGCTCA AGAAAAAGCA 8900
GGTGGCGTTC CGCTCGCTGA CGGAGAACCT GGATACCACG ACGCCCTCGG GCGAGTTTCT GTTCCAGGTG TTCGGCGCGC TCGCGCAGTA CGAACGCGCC 9000
TTGATCCAGG AACGTGTCGT CGCCGGTCTG GCTGCCGCCC GCAAACGCGG CCGGATCGGC GGCCGGCCGC AGGCGATCAC CGGCGAGAAG CTGGAGGCCA 9100
TCGTCGCTGC GCTCGATGGC GGCATGTCCA AGGCGGCGGT GTGCCGCAAC TTCGGCGTCA AGCGAACCAC GCTGATCGAG ACCCTGGCAC GGGTTGGTTG 9200
GACGGGCTCT CGTGGAGCGT CATCGCGATG ACGACCAAGA GCGAACGATT GACCGTCCTG TCGGACGCCG AGCAGGAAGC CCTGTACGGC CTGCCGGACT 9300
TCGACGACGC CCAGCGGCTG GAATACTTGG CGTTGACTGA AACCGAACTG GCGCTCGCCA GCAGCCGGCC TGGTCTCCAT GCCCAGGTCT ATTGCATCTT 9400
GCAGATCGGT TACTTCAAGG CCAAGCATGC CTTCTTCCGC TTCGACTGGA GTGAGGTCGA GCACGATTGC GCCTTCGTGC TGAGCCGCTA CTTCCACGGC 9500
GAGTCCTTCG AGCACAAGCC AATCTCCAAG CACGAGCACT ACACCCAGCG CGAGTGGATT GCCGATCTGT TCGGCTACCG GCCGTGGGCG GCCGAGTTCC 9600
TGGCGCAGCT CGCGCAGCAG GCCGCGCAGA CCGTGCGGCG CGACGTGATG CCGGGGTTCA TCGCCGCCGA GCTGATCGTC TGGCTAAACG AGCACAAGAT 9700
CATCCGGCCC GGCTATACCA CCCTGCAAGA GCTGGTGAGC GAAGCCCTGT CCGCCGAGCG TCGGCGGCTG GCTGGCCTGC TGTCGGAAGT GTTGGACGAA 9800
TCGGCCAAGG CCGCGCTGGG TCGGCTTCTA GTGCGTGACG ACACCCTGTC GCAATTGGCG GCGCTCAAGC AGGACGCCAA GGACTTTGGC TGGCGTCAGA 9900
TGGCCCGCGA ACGCGAAAAG CGCGCCACGC TGGAGCCGCT GCACCGGATC GCCAAGGCGC TGCTGCCCAA GCTCGGCGTC TCGCAGCAGA ATCTGCTGTA 10000
CTACGCCAGC CTGGCGAACT TCTACACCGT CCACGATCTA CGCAACCTGA AGGCCGATCA GACCTACCTC TACCTGCTTT GCTATGCCTG GGTGCGCTAC 10100
CGGCAGCTTT CCGACAACCT GGTCGATGCG ATGGCCTACC ACATGAAGCA GTTGGAGGAC GAAAGCAGTG CGGGCGCAAA GCAATCCTTT GTCGCCGAGC 10200
AGGTGCGCCG TCAGCAAGAC ACACCGCAGG TCGGCCGCCT GCTGTCGCTT TACATCGACG ACAGCGTGCC CGATCCCACG CCGTTCGGCG ATGTGCGCCA 10300
GCGCGCCTAC AAAATCATGC CCCGCGATAC GCTGCAAACC ACCGCGCAGC GCATGAGCGT GAAGCCGGTG AGCAAGCTGG CTTTGCACTG GCAGGCGGTG 10400
GACGGCCTGG CTGAGCGCAT CCGCCGCCAT CTTCGGCCGC TGTATGTCGC GCTCGACCTC GCTGGCACTG ATCCGGGCAG CCCGTGGCTC GTGGCGCTGG 10500
CCTGGGCCAA GGACGTGTTC GCCAAACAGC AGCGCCTATC GCAACGGCCG CTCGCCGAAT GTCCAGCGGC CACGCTGCCG AAACGCTTGC GACCGTACCT 10600
GCTGACCTTC GATGCCGATG GCAAGCCGAC GGACCTGCAT GCCGACCGCT ACGAGTTCTG GCTGTACCGC CAGGTCAGGA AGCGCTTCCA GTCGGGTGAA 10700
CTCTACCTCG ACGACAGCTT GCAGCACCGG CATTTTTCCG ACGAGCTGGT TTCGCTGGAT GAGAAGGCCG CCGTGCTGGC GCAGATCGAC ATCCCGTTCC 10800
TGCGGCAGCC ACTCGATGCC CAGCTCGATG CGCTCGCGAC CGAGCTGCGC GCTCAGTGGC TGGCCTTCAA CCGCGAGCTG AAGCAGGGCA AGCTGACGCA 10900
CCTAGAATAC GACAAGGACA CGCAGAAGCT GACATGGCGC AAGCCCAAGG GCGAGAACCA GAAGGCGCGC GAGAAGGCGT TCTACGAGCA ACTGCCGTTC 11000
TGCGACGTGG CCGACGTGTT CCGCTTCGTC AACGGCCAGT GCCAGTTCCT GTCGGCGCTG ACGCCTTTGC AGCCGCGCTA TGCGAAGAAG GTCGCCGACG 11100
CCGACAGCCT GATGGCGGTC ATCATCGCGC AGGCGATGAA CCACGGCAAC CAGGTCATGG CACGCACCAG CGACATCCCG TACCACGTGC TGGAGAGCGC 11200
CTACCAACAG TACCTGCGCC ACGCAACGCT GCACGCGGCC AACGACTGCA TCAGCAACGC CATCGCCGCG CTGCCGATCT TCCCGTACTA CTCGTTCGAC 11300
CTCGATGCAC TGTACGGTGC CGTCGATGGT CAGAAATTCG GCGTCGAGCG GCCGACCGTG AAAGCGCGCC ACTCGCGCAA ATACTTTGGG CGCGGCAAGG 11400
GCGTGGTCGC CTACACGCTG CTGTGCAACC ACGTGCCGCT CAACGGCTAC CTGATCGGCG CGCACGATTA CGAGGCCCAT CACGTGTTCG ACATCTGGTA 11500
TCGCAACACG TCGGACATCG TGCCGACCGC GATCACCGGC GACATGCACA GCGTCAACAA GGCCAACTTC GCTATCCTGC ACTGGTTCGG CCTGCGTTTC 11600
GAGCCGCGCT TCACCGACCT TGGCGATCAG TTGAAGGAAC TCTACAGTGC CGACGATCCG GCGCTGTACG ATCAGTGCCT GATCCGGCCG GCCGGGAGAA 11700
TCGACCGCGA TCTCATAGTC AGCGAGAAGC CGAACCTCGA CCAGATTGTC GCCACGCTCG GACTGAAGGA GATGACGCAG GGCACGCTGA TCCGCAAGCT 11800
ATGCACCTAC ACCGCGCCGA ACCCCACGCG GCGCGCGGTG TTCGAGTTCG ACAAGCTCAT CCGCAGCATC TACACGCTGC GCTACCTGCG CGATCCGCAA 11900
CTGGAGCGCA ACGTTCACCG CTCACAGAAC CGCATCGAGT CCTATCACCA GCTACGCTCA ACCATCGCCC AGGTCGGCGG CAAGAAGGAA TTGACCGGGC 12000
GCACCGACAT CGAAATTGAG ATCAGCAACC AGTGCGCCAG GCTGATCGCC AACGCGGTCA TCTTCTACAA CTCGGCCATC CTCTCGCGGC TGCTGATGAA 12100
GTACGAGGCG AGCGGCAACG CCAAGGCGCA CGCTCTCCTG ACCCAGATAT CGCCGGCGGC CTGGCGGCAC ATCCTGCTGA ACGGGCATTA CACCTTCCAG 12200
AGCGACGGCA AGATGATCGA CCTGGATGCG CTCGTGGCGG GGCTGGAGCT GGGATGACGG AAATTTCGGC GGTTCTGGCT TAGAACCCC
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
res |
8500-8630 |
131 |
GCCTGTCGGA AAACATTTGT TTTTCGACAG GCCTTCAACG GTCCTCTGCA CCAACCTCCG AGTGGCCGCA AAATTGTGCG GAAAACTCTG TCGCCAGACG CTACCATACG GAAACCTCGT CTTAATGGTT T |
res_site_I |
8500-8528 |
29 |
GCCTGTCGGA AAACATTTGT TTTTCGACA |
res_site_II |
8562-8605 |
44 |
TGGCCGCAAA ATTGTGCGGA AAACTCTGTC GCCAGACGCT ACCA |
res_site_III |
8606-8630 |
25 |
TACGGAAACC TCGTCTTAAT GGTTT |
|
ORFs |
|
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
ttt receptor |
Ttt receptor |
Tn5501.1 |
786 |
453-1238 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Protein Sequence:
|
VVVENRSGAG GNIGGEAVAR AAPDGYTLLM SSEGMVSVNP HIYPKMSFDP SKDLEPVAAV ARVLVFLVTS PGFPAKDAKE FLAQLKARPG KLSFGSPGNG SSPHLAAEMM KSQANVFATH VPYRGAAPAL TDLLAGQLDF LFDPGVAMPH IKAGKLKLLA VGSLKRSPLF PDVPTLDEVG LKGFDADTVF GLYAPAGTPA DFVNRLNREI NKALALPGPR ERIAALGGEP APMTPAQFHA KALEDGVRFG KLIRERKIVG D
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tricarboxylate transport |
Tricarboxylate transport |
Tn5501.1 |
975 |
1554-2528 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Protein Sequence:
|
MTTRRSLLMA ATFVASSYLG AAHADNFPAK PVRWVVAYAA GGGSDALGRA VGSQLSTQLG QPVLIDNKPG GATVIGAENV AKSPGDGYTV FTADNGTLVF NTALFKKLSY DPQKDFTSIG MMARFPLLLA VNPNAGYSDV KALIAAAKAN PGKLSFATPG VGSPHHLAME MLKARNNVDL VHVAYRGAAP AIQDVVGGQL PLMVVDSAAG MQMMKAGKLK PLATFSKSRL ASMPDVPTLM ELGYTDTEAV AWQGLVVPSS TPKEIVAKLS TELQKAIQSD AVRTQFATMG IEPTPSDAAT MQKHWAQEGQ FWPRLIKERG ISLD
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
2-dehydropantoate 2-reductase |
2-dehydropantoate 2-reductase |
Tn5501.1 |
933 |
2583-3515 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Protein Sequence:
|
VDPMKITVVG AGAMGSLFGG LLAEQGHDVT LVDVNAAHLQ AIQSQGLRLH NDRGERNIRA LRACRPEQAE GQPDLVLVFT KTLHTASALA GVARLIGPQT YVLSLQNGLG NVEVIGQHVS AERILIGVTT WPADLVGPGH VHSHGQGVVR LMSADGQDRP FVATVAQALD EAGLACTADA AVWTAIWEKV AFNAALNSIC TATQCTVDQL GAIEDGRALA FAVVDEVLAV AQAQGIAVHA DACKARVADA IARHVGHKPS MLQDVLAGRP TEIEFINGAV VAAAVTFGVP VPCTRTLLQI VRLVQARQTG
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
salicylaldehyde dehydrogenase |
Salicylaldehyde dehydrogenase |
Tn5501.1 |
1464 |
3519-4982 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Protein Sequence:
|
VETHMQQIQL LINGEKCLAS HERFFERRNP LDGEVATRAP AASVADAVRA VEAAKRAFPA WSRMGPGARR GLLLKAAAAL EAKADAFTAA MAAETGSSAI WAGFNVHVAA GMLQEAASLT TQVGGEVIPS DVPGSLAMGV RQAAGVVLGM APWNAPVILG VRAVATPLAC GNTVVLKGSE LSPATHGLII EALQEAGLPP GVVNFVTNDP ADAGAVVEAM VAHPAVRRVN FTGSTHVGRI IAALCAQHLK PAVLELGGKA PLLVLEDADL DAAVNGAAFG AFANSGQICM STERIIVDTR VADAFVAKLA AKAQALPLGD PRKGPVVLGS VVDMRTVERC NALIDDALAK GATLVCGGKA DSTLMPATLL DHVTPTMRIY QEESFGPVKP IVRVEGVDAA VACANDNAFG LSAAVFSRDV ARAWNVAQRI ESGICHINAP TVHDEPQMPF GGVKESGYGH FGGRAGIDAF TELRWITVQT TERHYPF
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
mdlC |
MdlC |
Tn5501.1 |
1602 |
5009-6610 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Comment: | Benzoylformate decarboxylase |
Protein Sequence:
|
MSTSSNRPTV REAFLRLMQD LGIDTVFGNP GSTELPMFRD FPKDMRYVLG LQESVVVGMA DGYAQVTRNA AMVNLHSAAG VGHAMGNIFT AFRNQTPMLV TAGQQSRAIL PFDPFLSASQ ATELPRPYVK WAIEPARAED VPLALLRAYQ IAMQAPCGPV FISIPADDWD RPADWVPARR VSHATAPDPA MLAQLAQALD GSLRPAFVVG AGVDRDGAVD AVVALAERHQ AAVYTAPMSA RCGFPEQHPL FAGFLAAMRE RIVEALAGHD LVVVLGAPAF SYHVEGQGPH MPPGADLWQI VDDPQMAAWT PAGTGIVSSV HLAVAALLAA PAPAAARRAP DPRRPAARAE PSLPMSVAFA LQALDAARGP DDVVVEEAPG SRGVMQACLP MRGADSFYTM ASGGLGHGMP AAVGVALGRQ KLGLPGRVIA LIGDGSSLYA IQALFTAAKL QLPITFVILN NQRYAALQDF APVFGYALGD KPEGTDLQGL DFVALAQGHG LPGLRVEQPE GLRDALAGAL SHAGPLLIEL VVA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
lysR family |
LysR family |
Tn5501.1 |
888 |
6711-7598 |
+ |
Class: | Passenger Gene |
Sub Class: | Other |
Protein Sequence:
|
MEQLDLNLLR VFDVVYRTRN VSRAAELLGM SQPSTSQALT RLRLALGDPL FERIRGGVRP TPRAEDLARS VQSGLAQIEA GLASENAFDP ARSSAELRIH LTDIGEARFL PPLMGEMRRL APGIQLRARA WPQQEIGAAL DSGDIHLAIG FLPELTTSAH AKLLTDRYVV LLRADHPVAQ QARLTLRKMQ SLDWIAVRSH TQTLQMLRAA NLDCKVTLTT STFLALPDIV KNTDLAVLMP QQIAREFMPA NRLKLLDFDL PSNDFTVSVH WSRRHAHSPL VRWSREVLLR LFQRD
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
parE |
ParE |
Tn5501.1 |
285 |
7840-8124 |
- |
Class: | Passenger Gene |
Sub Class: | Toxin |
Target: | DNA gyrase |
Sequence Family: | ParE_toxin (Pfam:PF05016) |
Protein Sequence:
|
VRVVWTPEAQ QDRADVWDYI AADNPRAAAR MDEIFSDAAA RLIQHPMLGK PGKIPGTREL IPHESYRLVY QIDGETVWIL TLVHTARLWP PVRD
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
parD |
ParD |
Tn5501.1 |
279 |
8121-8399 |
- |
Class: | Passenger Gene |
Sub Class: | Antitoxin |
Sequence Family: | parD (PDB:4Q2U) |
Comment: | RelB |
Protein Sequence:
|
MSKQAVFTMK LEPELRAEFM AEAEAAHRPA SQVLRELMRE FVQRQRESRE YDEFLRRKVE AGRASMRAGL GRSNDEVEAE FAARRASVAS QA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
Tn5501.1 |
579 |
8653-9231 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MRVSSDSDRQ STNLQRDALL AVGVDARHLF EDHASGAKDD RAGLARALEF VRPGDVLVVW KLDRLGRSLS HLLAIVTSLK KKQVAFRSLT ENLDTTTPSG EFLFQVFGAL AQYERALIQE RVVAGLAAAR KRGRIGGRPQ AITGEKLEAI VAALDGGMSK AAVCRNFGVK RTTLIETLAR VGWTGSRGAS SR
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
Tn5501.1 |
3030 |
9228-12257 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MTTKSERLTV LSDAEQEALY GLPDFDDAQR LEYLALTETE LALASSRPGL HAQVYCILQI GYFKAKHAFF RFDWSEVEHD CAFVLSRYFH GESFEHKPIS KHEHYTQREW IADLFGYRPW AAEFLAQLAQ QAAQTVRRDV MPGFIAAELI VWLNEHKIIR PGYTTLQELV SEALSAERRR LAGLLSEVLD ESAKAALGRL LVRDDTLSQL AALKQDAKDF GWRQMARERE KRATLEPLHR IAKALLPKLG VSQQNLLYYA SLANFYTVHD LRNLKADQTY LYLLCYAWVR YRQLSDNLVD AMAYHMKQLE DESSAGAKQS FVAEQVRRQQ DTPQVGRLLS LYIDDSVPDP TPFGDVRQRA YKIMPRDTLQ TTAQRMSVKP VSKLALHWQA VDGLAERIRR HLRPLYVALD LAGTDPGSPW LVALAWAKDV FAKQQRLSQR PLAECPAATL PKRLRPYLLT FDADGKPTDL HADRYEFWLY RQVRKRFQSG ELYLDDSLQH RHFSDELVSL DEKAAVLAQI DIPFLRQPLD AQLDALATEL RAQWLAFNRE LKQGKLTHLE YDKDTQKLTW RKPKGENQKA REKAFYEQLP FCDVADVFRF VNGQCQFLSA LTPLQPRYAK KVADADSLMA VIIAQAMNHG NQVMARTSDI PYHVLESAYQ QYLRHATLHA ANDCISNAIA ALPIFPYYSF DLDALYGAVD GQKFGVERPT VKARHSRKYF GRGKGVVAYT LLCNHVPLNG YLIGAHDYEA HHVFDIWYRN TSDIVPTAIT GDMHSVNKAN FAILHWFGLR FEPRFTDLGD QLKELYSADD PALYDQCLIR PAGRIDRDLI VSEKPNLDQI VATLGLKEMT QGTLIRKLCT YTAPNPTRRA VFEFDKLIRS IYTLRYLRDP QLERNVHRSQ NRIESYHQLR STIAQVGGKK ELTGRTDIEI EISNQCARLI ANAVIFYNSA ILSRLLMKYE ASGNAKAHAL LTQISPAAWR HILLNGHYTF QSDGKMIDLD ALVAGLELG
|
|
|