|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Internal Transposable Elements | |
|
|
|
|
|
|
|
|
|
|
|
|
|
Name: Tn6112 |
|
Family: Tn402 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Pseudomonas11BF10 | | |
Place of Origin: | Sydney, Australia | Date of Isolation: | 2011 |
| | Other Geographic Information: | Uncooked, wild-caught Eastern King Prawn Penaeusplebejus digestive tract |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GACACCCCAC GGATTCCAGA ACAAGGCGTG GCCACTCTGC CTGATGAGGC 200
TTGGGAGCGT GCGCGCCGTC GTGCGGAGAT CATCAGTCCG TTGGCGCAGT CGGAGACGGT CGGGCACGAA GCGGCCGATA TGGCGGCTCA GGCGCTGGGC 300
TTGTCTCGGC GCCAGGTATA CGTTCTGATC CGGCGTGCCC GGCAAGGCAG CGGCCTCGTG ACGGATCTGG TGCCCGGCCA GTCCGGTGGA GGTAAAGGTA 400
AGGGGCGCTT GCCGGAACCG GTCGAGCGCG TCATCCACGA GCTACTGCAA AAGCGGTTCC TGACCAAGCA GAAGCGCAGC CTAGCGGCCT TTCACCGCGA 500
AGTCACTCAG GTGTGCAAGG CTCAAAAACT GCGAGTGCCG GCGCGCAATA CCGTGGCCTT ACGGATCGCT AGCCTTGACC CGCGCAAGGT CATCCGCCGG 600
CGGGAAGGCC AGGATGCCGC TCGTGACCTA CAAGGTGTGG GCGGCGAGCC TCCTGCCGTG ACCGCGCCGC TGGAGCAGGT GCAGATAGAC CATACGGTCA 700
TCGACCTGAT CGTGGTCGAT GACCGCGACC GGCAACCTAT TGGCCGCCCG TACCTGACCC TCGCCATCGA CGTGTTCACC CGCTGCGTGC TCGGCATGGT 800
CGTCACGCTG GAAGCGCCGT CTGCCGTTTC GGTTGGCCTG TGCCTCGTGC ATGTCGCCTG CGACAAGCGC CCTTGGCTGG AAGGACTGAA CGTGGAAATG 900
GATTGGCAGA TGAGCGGCAA GCCCTTGCTG CTCTACCTAG ACAACGCGGC CGAGTTCAAG AGCGAGGCCC TGCGCCGGGG TTGCGAGCAG CATGGCATCC 1000
GGCTGGACTA TCGCCCGCTG GGACAGCCGC ACTATGGCGG CATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATT CACGACGAAC TGCCGGGAAC 1100
GACCTTCTCC AACCCTGACC AGCGCGGCGA CTACGATTCC GAAAACAAGG CCGCCCTGAC GCTGCGCGAG CTAGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTACCACG GTTCGGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAG GCCGTGGCGC GTGTCGGCGT ACCGGCCGTC GTCACACGCG 1300
CTACTTCGTT CCTGGTCGAT TTTCTGCCGA TCCTCCGGCG CACGCTGACC CGCACCGGCT TTGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATT GCGCGGCGTG AACGCTGGCC GTCCTTTCTG ATCCGGCGCG ATCCGCGCGA CATCAGCCGT ATCTGGGTCC TGGAACCGGA GGGACAGCAT 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CATCCGGCTG TCACCCTCTG GGAACAACGG CAGGCGCTGG CGAAACTGCG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC GGCGCTGTTC CGCATGATCG GCCAGATGCG TGAGATTGTG ACCAGCGCGC AGAAGGCCAC ACGCAAGGCG CGGCGTGACG CGGATCGCCG 1700
CCAGCACCTC AAGACATCAG CTCGGCCGGA CAAGCCCGTT CCGCCGGATA CGGATATTGC CGACCCGCAG GCAGACAACT TGCCACCCGC CAAACCGTTC 1800
GACCAGATTG AGGAGTGGTA GCCGTGGACG AATATCCCAT CATCGACCTG TCCCACCTGC TGCCGGCGGC CCAGGGCTTG GCCCGTCTTC CGGCGGACGA 1900
GCGCATCCAG CGCCTTCGCG CCGACCGCTG GATCGGCTAT CCGCGCGCAG TCGAGGCGCT GAACCGGCTG GAAGCCCTTT ATGCGTGGCC AAACAAGCAA 2000
CGCATGCCCA ACCTGCTGCT GGTTGGCCCG ACCAACAATG GCAAGTCGAT GATCGTCGAG AAGTTCCGCC GCACCCACCC GGCCAGCTCC GACGCCGACC 2100
AGGAGCACAT CCCGGTGTTG GTCGTGCAGA TGCCGTCCGA GCCGTCCGTG ATCCGCTTCT ACGTCGCGCT GCTCGCCGCG ATGGGCGCGC CGCTGCGCCC 2200
ACGCCCACGG TTGCCGGAAA TGGAGCAACT GGCTCTGGCA CTGCTGCGCA AGGTCGGCGT GCGCATGCTG GTGATCGACG AGCTGCACAA CGTGCTGGCC 2300
GGCAACAGCG TCAACCGCCG GGAATTCCTC AACCTGCTGC GCTTCCTCGG CAACGAACTG CGCATCCCGT TGGTTGGGGT AGGCACGCGC GACGCCTACC 2400
TAGCCATCCG CTCCGATGAC CAGTTGGAAA ATCGCTTCGA GCCGATGATG CTGCCGGTAT GGGAGGCCAA CGACGATTGC TGCTCACTGC TGGCCAGCTT 2500
CGCCGCTTCG CTCCCGCTGC GCCGGCCTTC CCCAATTGCC ACGCTGGACA TGGCTCGCTA CCTGCTCACA CGCAGCGAGG GCACCATAGG GGAACTGGCG 2600
CACTTGCTGA TGGCGGCGGC CATCGTCGCC GTGGAGAGCG GCGAGGAAGC GATCAACCAT CGCACACTCA GCATGGCCGT TTACACCGGA CCCAGCGAGC 2700
GGCGGCGGCA ATTCGAGCGG GAACTGATGT GAAGCCTGCG CCGCGCTGGC CGCTGCATCC CGCCCCGAAA GAAGGCGAGG CGCTGTCCTC ATGGCTCAAC 2800
CGCGTGGCCC TTTGCTATCA CATGGAGGAG CCCGACCTGC TGGAGCACGA TCTTGGTCAC GGCCAGGTCG ATGACCTGGA CACCGCGCCA CCACTCTCGC 2900
TGCTGGCGTT GCTTTCCCAG CGGAGCGGCA TCGAGCTGGA CCGGCTGCGC TGTATGAGTT TCGCCGGATG GGTGCCTTGG CTACTGGACA GCCTTGATGA 3000
CCAGATTCCA GACGCCTTGG AAACCTATGC GTTTCAGCTC TCGGTGTTGC TGCCAAGACT CCGCCGTAAG ACGCGATCCA TCACGAGCTG GCGTGCCTGG 3100
CTGCCCAGCC AGCCGATAAA CCGCGCCTGT CCGCTCTGCC TGAGCGATCC GGAGAACCAA GCCGTACTGC TCGCGTGGAA GCTGCCCCTG ATGCTGAGCT 3200
GCCCGCTGCA TGGCTGCTGG CTGGAATCCT ATTGGGGCGT GCCAGGGCGG TTTCTCGGCT GGGAGAACGC CGACGCCGAA CCGCGCACCG CCAGCGACGC 3300
GATTGCGGCG ATGGACCAGC GTACCTGGCA GGCACTGACA ACCGGTCACG TGGAGCTGCC GCGCCGACGC ATCCACGCCG GATTGTGGTT TCGACTTCTT 3400
CGCACGCTGC TCGATGAGCT GAACACCCCG CTTTCCGCGT GCGGAACCTG CGCGGGGTAT CCCCGCCAAG TCTGGGAAGG CTGCGGGCAT CCGCTGCGTG 3500
CTGGGCAAAG TCTGTGGCGA CCGTATGAAA CCCTGAATCC GATAGTACGG TTACAGATGC TGGAGGCGGC GGCAACGGCA ATCAGCTTGA TTGAGGTGAG 3600
GGACATCAGC CCGCCAGGCG AGCAGGCAAA GCTATTCTGG TCCGAGCCCC AAACCGGGTT CACCAGTGGC CTGCCGACGA AAGCGCCGAA GCCCGAGCCC 3700
ATCAATCACT GGCAGCGTGC AGTCCAGGCC ATCGACGAGG CCATCATTGA AGCGCGACAC AACCCCGAGA CGGCACGCTC GCTGTTCGCG TTGGCTTCCT 3800
ATGGTCGGCG CGATCCCGCT TCCTTGGAAC GGTTGCGCGC CACCTTCGTG AAGGAAGGCA TCCCGCCGGA ATTTCTGTCA CATTACCTGC CTGATGCACT 3900
CTTTGCATGT CTTAAACAAA ATGACGGGTT AAGTGACAAA TTTTGACGGA TAGAGCTTTC CGGCTCACAC TGTCACATAA TCGAACGTAT ACGTGACGGG 4000
TGAAAAGGTG CTGATCGGCT ACATGCGGGT ATCGAAGGCG GACGGATCCC AGTCCACCAA TTTGCAACGC GATGCGCTCA TCGCCGCTGG TGTGAGCCTT 4100
GCGCACCTTT ACGAGGATCT GGCCTCGGGC AGGCGCGATG ATCGCCCAGG GTTGGCTGCT TGCCTGAAGG CGCTTCGTGA AGGGGACACG CTGATCGTGT 4200
GGAAGCTCGA TCGGCTTGGC CGTGATCTGC GCCACCTGAT CAACACCGTG CACGACCTAA CTGCGCGTAG CGTGGGCCTG AAGGTCCTGA CCGGTCACGG 4300
TGCGGCGGTC GACACGACGA CTGCCGCCGG CAAGCTTGTG TTCGGTATTT TTGCCGCGCT GGCCGAGTTC GAGCGTGAGT TGATTTCCGA GCGAACAGTC 4400
GCTGGACTTA TCTCGGCGCG CGCTCGCGGC AGGAAAGGGG GGCGCCCCTT CAAGATGACC GCCGCCAAGC TACGCCTGGC GATGGCCAGC ATGGGGCAAC 4500
CGGAAACCAA GGTGGGCGAT CTCTGCGAAG AACTCGGGAT TACCCGGCAG ACGCTCTACC GGCACGTGTC GCCCAAGGGC GAACTGCGGC CAGACGGCGT 4600
AAAGCTGCTC TCCCTCGGTT CAGCCGCATA AATGGAGGCG ACCTGGAACG GGGCGCTGTT CAGTGCGGCA ACGATCCGAT TACCGGTGTC GACCCAGAGC 4700
AGCCGTAGAG CTTTTGGGAA AGCTGTCGTT CAACGTGGAA GTGAGCGGAC TGCGCAGCTT TTCGCGCAGG TCCGCTCGAC TGTAGGGTTG GACATTTACG 4800
GTCTACCTTG CATGTGAAGT TACTGGTCTT TGGTGGAGTG TGAGCGTTGG CGCAGCCGCG TGAGAAACAA CGGCCAGCCA GGGACCAACT GAAAAACTAT 4900
AACTCCTACA AAGCTGATGC CGCCTAGCCA GCGGAACCAC AAGTTTGACA TGCGGAGAAT CATGACCAGG GCAAAAAGTG CGAAGAGCAG AATTTGCCAT 5000
TTACCGAACT TTGCGAAGTA GGCACTTCGG CGCAAGTGTG CTTTGTCTAG GACGGGTTCA CCTGCCTCGG TAACGAGCTC AAGGCGTTGG CTTGAAAGCG 5100
CTTGCTCAAA GTGCTGAGAA ATATATGGGC CAACCTCGCC AATCGACCAC AGTTTGATTG GTTCATTTGC CACCCCATCG ATATATAGCA GGTAAAACTT 5200
CGGCGCTGAG TTCATGAATG TCTAACGTGC AGCTAACCGG GCTGGCGCGG CTTGCCGCGA CAGGTCCGTG TTGAGCTGAA TGTTAGGGCT CATCCCTGGC 5300
CCTAGGATTT CCGGAGTTTG GTCGCTGCCG CTTGCGCCGT CGGATTGGGC GTAGGCTCTC CGACCTTCCG GAGCGTGTTG CGATCCGTAT GGTGGAACGG 5400
CTCCGCCTGG CCTGGGATGA TCAACACAGT GTCCTGTTCG TAGGACCACG TTCCATCGAC ATTGATCGCA ACCGTGATGA TGTAGCTCTT CGTTCTGAAC 5500
GCATATTCCA AGAACGGGTT TGACACGATC CCGTTTGTTT CGGACCCGCG GACAGCCTCG AGGCGAAACG TTCTCGCATC GGCCTCGGCA CGGCCGACGG 5600
CCATCGCTGC CTGTCCGCGC GGTATCGAAA GCGTCTGGAT CACGGTGCCC GTCGCCGGTT CCCATAGCCA ATAGCCGACT TGGTCGTGGA ATGTCTCGAC 5700
GTCATCAGGC TTGACGATCC GCGTGTGGTA CCTCAGTCCG TAGAACAATT GCGGCCCATT GGTCTGTGCA TCGATCGGCT GCAACTCCAT GTGTTCGATG 5800
AAAGCCTGCT TTTCGGGGCC GTTGGGCTTC GGATTGACGT CGAGGCCGTG CGTGCCCACC CATATGCCTG CCATGCCGGT GAGCGGTCCC AGATTGGCGA 5900
GCGTATCGAC TGCCGGCTCA GGTTCCGTGT AGATGTCTTT GGGGATGTTT TCCATAGTTG ATTCCAGCGG GCCTAACTTT GTTTTAGGGC GACTGCCCTG 6000
CTGCGTAACA TCGTTGCTGC TCCATAACAT CAAACATCGA CCCACGGCGT AACGCGCTTG CTGCTTGGAT GCCCGAGGCA TAGACTGTAC AAAAAAACAG 6100
TCATAACAAG CCATGAAAAC CGCCACTGCG CCGTTACCAC CGCTGCGTTC GGTCAAGGTT CTGGACCAGT TGCGTGAGCG CATACGCTAC TTGCATTACA 6200
GCTTACGAAC CGAACAGGCT TATGTCCACT GGGTTCGTGC CTTCATCCGT TTCCACGGTG TGCGTCACCC GGCAACCTTG GGCAGCAGCG AAGTCGAGGC 6300
ATTTCTGTCC TGGCTGGCGA ACGAGCGCAA GGTTTCGGTC TCCACGCATC GTCAGGCATT GGCGGCCTTG CTGTTCTTCT ACGGCAAGGT GCTGTGCACG 6400
GATCTGCCCT GGCTTCAGGA GATCGGAAGA CCTCGGCCGT CGCGGCGCTT GCCGGTGGTG CTGACCCCGG ATGAAGTGGT TCGCATCCTC GGTTTTCTGG 6500
AAGGCGAGCA TCGTTTGTTC GCCCAGCTTC TGTATGGAAC GGGCATGCGG ATCAGTGAGG GTTTGCAACT GCGGGTCAAG GATCTGGATT TCGATCACGG 6600
CACGATCATC GTGCGGGAGG GCAAGGGCTC CAAGGATCGG GCCTTGATGT TACCCGAGAG CTTGGCACCC AGCCTGCGCG AGCAGCTGTC GCGTGCACGG 6700
GCATGGTGGC TGAAGGACCA GGCCGAGGGC CGCAGCGGCG TTGCGCTTCC CGACGCCCTT GAGCGGAAGT ATCCGCGCGC CGGGCATTCC TGGCCGTGGT 6800
TCTGGGTTTT TGCGCAGCAC ACGCATTCGA CCGATCCACG GAGCGGTGTC GTGCGTCGCC ATCACATGTA TGACCAGACC TTTCAGCGCG CCTTCAAACG 6900
TGCCGTAGAA CAAGCAGGCA TCACGAAGCC CGCCACACCG CACACCCTCC GCCACTCGTT CGCGACGGCC TTGCTCCGCA GCGGTTACGA CATTCGAACC 7000
GTGCAGGATC TGCTCGGCCA TTCCGACGTC TCTACGACGA TGATTTACAC GCATGTGCTG AAAGTTGGCG GTGCCGGAGT GCGCTCACCG CTTGATGCGC 7100
TGCCGCCCCT CACTAGTGAG AGGTAGGGCA GCGCAAGTCA ATCCTGGCGG ATTCACTACC CCTGCGCGAA GGCCATCGGT GCCGCATCGA ACGGCCGGTT 7200
GCGGAAAGTC CTCCCTGCGT CCGCTGATGG CCGGCAGCAG CCCGTCGTTG CCTGATGGAT CCAACCCCTC CGCTGCTATA GTGCAGTCGG CTTCTGACGT 7300
TCAGTGCAGC CGTCTTCTGA AAACGACA
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
r6 |
3824-3837 |
14 |
TTGGAACGGT TGCG |
r5 |
3870-3883 |
14 |
AATTTCTGTC ACAT |
r4 |
3875-3888 |
14 |
CTGTCACATT ACCT |
r3 |
3926-3939 |
14 |
GGGTTAAGTG ACAA |
res |
3967-4001 |
35 |
ACACTGTCAC ATAATCGAAC GTATACGTGA CGGGT |
r2 |
3970-3983 |
14 |
CTGTCACATA ATCG |
r1 |
3986-3999 |
14 |
CGTATACGTG ACGG |
attC orf10 core |
4734-4793 |
60 |
CGTGGAAGTG AGCGGACTGC GCAGCTTTTC GCGCAGGTCC GCTCGACTGT AGGGTTGGAC
|
attC orf9 core |
5226-5276 |
51 |
CGTGCAGCTA ACCGGGCTGG CGCGGCTTGC CGCGACAGGT CCGTGTTGAG C |
attI |
5977-6032 |
56 |
CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
tniA |
Tn6112 |
142-1821 |
Transposase |
|
+ |
tniB |
Tn6112 |
1824-2732 |
Accessory Gene |
|
+ |
tniQ |
Tn6112 |
2729-3946 |
Accessory Gene |
Target Site Selection |
+ |
tniR |
Tn6112 |
4008-4631 |
Accessory Gene |
Resolvase |
+ |
orf10 |
Tn6112 |
4820-5215 |
Passenger Gene |
Hypothetical |
- |
FABP family protein |
Tn6112 |
5302-5955 |
Passenger Gene |
Other |
- |
intI1 |
Tn6112 |
6113-7126 |
Integron Integrase |
Class 1 |
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniA |
TniA |
Tn6112 |
1680 |
142-1821 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | can be extended upstream by 12 amino acids| identical to tniA (Tn1721 and In2)| 25% amino acid sequence identity to TnsB from Tn7 |
Protein Sequence:
|
MATDTPRIPE QGVATLPDEA WERARRRAEI ISPLAQSETV GHEAADMAAQ ALGLSRRQVY VLIRRARQGS GLVTDLVPGQ SGGGKGKGRL PEPVERVIHE LLQKRFLTKQ KRSLAAFHRE VTQVCKAQKL RVPARNTVAL RIASLDPRKV IRRREGQDAA RDLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDDRDRQPI GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLVHVAC DKRPWLEGLN VEMDWQMSGK PLLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAVARVGV PAVVTRATSF LVDFLPILRR TLTRTGFVID HIHYYADALK PWIARRERWP SFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR EIVTSAQKAT RKARRDADRR QHLKTSARPD KPVPPDTDIA DPQADNLPPA KPFDQIEEW
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniB |
TniB |
Tn6112 |
909 |
1824-2732 |
+ |
Class: | Accessory Gene |
Sequence Family: | ATP binding protein? |
Comment: | identical to tniB (Tn1721)| similar function to Tn7 tnsC and MuB |
Protein Sequence:
|
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLMAAAIVAV ESGEEAINHR TLSMAVYTGP SERRRQFERE LM
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniQ |
TniQ |
Tn6112 |
1218 |
2729-3946 |
+ |
Class: | Accessory Gene |
Sub Class: | Target Site Selection |
Comment: | identical to tniQ (Tn1721)|similar function to Tn7 tnsD? |
Protein Sequence:
|
MKPAPRWPLH PAPKEGEALS SWLNRVALCY HMEEPDLLEH DLGHGQVDDL DTAPPLSLLA LLSQRSGIEL DRLRCMSFAG WVPWLLDSLD DQIPDALETY AFQLSVLLPR LRRKTRSITS WRAWLPSQPI NRACPLCLSD PENQAVLLAW KLPLMLSCPL HGCWLESYWG VPGRFLGWEN ADAEPRTASD AIAAMDQRTW QALTTGHVEL PRRRIHAGLW FRLLRTLLDE LNTPLSACGT CAGYPRQVWE GCGHPLRAGQ SLWRPYETLN PIVRLQMLEA AATAISLIEV RDISPPGEQA KLFWSEPQTG FTSGLPTKAP KPEPINHWQR AVQAIDEAII EARHNPETAR SLFALASYGR RDPASLERLR ATFVKEGIPP EFLSHYLPDA LFACLKQNDG LSDKF
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniR |
TniR |
Tn6112 |
624 |
4008-4631 |
+ |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Comment: | resolution of cointegrates || Protein: ACE81792.1 || identical to tniR (Tn1721) |
Protein Sequence:
|
MLIGYMRVSK ADGSQSTNLQ RDALIAAGVS LAHLYEDLAS GRRDDRPGLA ACLKALREGD TLIVWKLDRL GRDLRHLINT VHDLTARSVG LKVLTGHGAA VDTTTAAGKL VFGIFAALAE FERELISERT VAGLISARAR GRKGGRPFKM TAAKLRLAMA SMGQPETKVG DLCEELGITR QTLYRHVSPK GELRPDGVKL LSLGSAA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
orf10 |
Orf10 |
Tn6112 |
396 |
4820-5215 |
- |
Class: | Passenger Gene |
Sub Class: | Hypothetical |
Protein Sequence:
|
MNSAPKFYLL YIDGVANEPI KLWSIGEVGP YISQHFEQAL SSQRLELVTE AGEPVLDKAH LRRSAYFAKF GKWQILLFAL FALVMILRMS NLWFRWLGGI SFVGVIVFQL VPGWPLFLTR LRQRSHSTKD Q
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
FABP family protein |
FABP family protein |
Tn6112 |
654 |
5302-5955 |
- |
Class: | Passenger Gene |
Sub Class: | Other |
Protein Sequence:
|
MENIPKDIYT EPEPAVDTLA NLGPLTGMAG IWVGTHGLDV NPKPNGPEKQ AFIEHMELQP IDAQTNGPQL FYGLRYHTRI VKPDDVETFH DQVGYWLWEP ATGTVIQTLS IPRGQAAMAV GRAEADARTF RLEAVRGSET NGIVSNPFLE YAFRTKSYII TVAINVDGTW SYEQDTVLII PGQAEPFHHT DRNTLRKVGE PTPNPTAQAA ATKLRKS
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
intI1 |
IntI1 |
Tn6112 |
1014 |
6113-7126 |
+ |
Class: | Integron Integrase |
Sub Class: | Class 1 |
Transpoase Chemistry: | Tyrosine |
Sequence Family: | Class 1 Integron Tyrosine Integrase |
Protein Sequence:
|
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER
|
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
repeat t1 |
Tn6112 |
9-27 |
TCAGAAGACG ACTGCACCA |
repeat t2 |
Tn6112 |
49-67 |
AACACGTCGG TCGAGGACT |
repeat t3 |
Tn6112 |
78-97 |
TCAGAAGTGA TCTGCACCAA |
repeat t4 |
Tn6112 |
110-128 |
TCAATACTCG TGTGCACCA |
repeat i4 |
Tn6112 |
7209-7227 |
AGGAGGGACG CAGGCGACT |
repeat i3 |
Tn6112 |
7237-7255 |
CGTCGGGCAG CAACGGACT |
repeat i2 |
Tn6112 |
7279-7297 |
ATCACGTCAG CCGAAGACT |
IRi |
Tn6112 |
7296-7328 |
CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT |
|
References |
|
|