ISAeme5
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP007567 | ND | Aeromonas media | Aeromonas media WS |
DNA section
IS Length : 2438 bp
Ends
IR Length : 15/18
IRL : GTAAGCGCCCTTGAATCGGCATCTTTTTTTCCCAAAATCAGCTCGTCACA
IRR : GTAAGCGCCGCTAAATCGACCTACTTGCCTTCGTTGTATTGCGCCACTTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCCGGCGATG | ATCCGCTG | CGATAACGGC | 8 |
DNA sequence
GTAAGCGCCCTTGAATCGGCATCTTTTTTTCCCAAAATCAGCTCGTCACACTTGAAATCCATTACCTCTTCGCTGGATTTTCACTATGACCACTCAAGAA
CGTGCTGATTTTTGGCAGCAGCAGATCACTGCCTGGCTTGACTCCGGCCTCTCCGGCCACGCCTTTTGCAAAAGCCAGGCACTGGTCTATCACCAGTTTG
CTTACTGGCGTAAAAAGCTCGAACAGCCTGTAGACTCGGACTCTCTTCCTGGCTTTGTCAGAGTAGCCATGCCTGCTCCTGTCGGGACCGAAGTCGGCTT
GACGCTCACCTTACCCAACGGGATCACTCTCTCCGGATTACATCCGGGCAATATCGATTTGCTCGGCGCTATCATGAGGCAGTTGTAATGCGCCCTCGTT
ACTTGCGTCCGGCACTGACCATGCCGCAGATCTACCTCTATCGGGCTCCTATCGATTTTCGTAAACAGGCCCACGGCCTGGCGGCGCTGGTCGAGCAGCA
ACTGGGGCACAACCCGTTCACCGGTGCCCTCTATGCCTTTACCAATCGCCGCCGCAACAAGATCAAGTGTCTGATGTGGGAGGACAACGGCTTTGTGCTC
TATTACAAGGCGCTGGCCGAGGAGAAATTCAAGTGGCCATCGTCTGCCGATGAGCTACTCTCCCTCTCTTCCGAGCAGATAAACTGGTTGCTCGATGGCT
ACGATATCTCCTTGATGCAGGGCCATAAAACACTGCATTATGAGACGCTTTAATACCGCCAACCTCGTGCTTTACCGCCTTGTTTTTGTTATAATTTCCC
CATGAAAATGACGCTGAAAAACCTCACCCCATCACCTGATCTCAGTGGCCTTAGCGCCGCTGAGTTGTTGGCGGTTATTGCGGGTTTTCAGCAACAGTTG
GCGTTAAAAGAAGAGGCTATCCAGCGGCGTGACGCCCATATCCTGTTACTCGAAGAGCTGCTGCGTCTGCGCAGGATCCAGCGCTTTGCCGCCAGCAGCG
AGAAACTCCATCAGCTCCAGCTCTTTGATGAGGCAGAGCTGGAAGCGGACATGGACGCCCTGCTGGCCCAGTTACCCGATGACCTGCCGCAAACTGCCGA
AGCAAAGGCCAAGCCGCGTCAACGTGGCTTCTCCGCCTCACTGCTGCGCGAGCGCATCGAACTCACCCTCAGCGATGAGCAGAAAGCCGGGGCCAGCAAG
GTGTTCTTTACCAAGGTCAAAGAGGAGTTGCAGTTCATTCCGGCTCAACTCAAGGTGCTGGAGATCTGGCAAGAAAAGGCGGTGTTCGAGCGTGATGGCG
AAGAGGTGATCCTCGCGGCCAACCGCCCCGTACACCCGCTGGGCAAATGTATCGCCACGCCGTCATTGCTGGGCTACATCATTACCTCTAAGTATGCCGA
TGGCCTGCCACTGTACCGTCTGGAGCAGATGTTCAAGCGCCTAGGGCAGGAGGTGAGCCGCACCAGCATGGCCCACTGGATCATCCGGCTGGATGAGGTG
TTCCAGCCATTGATGAACCTGCTGCGCGAGGAGCAGAACCACGCGACGTATCTGCAAGCCGACGAGACGCGGATCCAGGTGCTCAAAGAAGAGGGAAAAA
CTGCGCAATCGGACAAATGGATGTGGGTGACCCGCGGTGGTCCGCCGGGCCGCTCCTCTGTGCTATTTGCCTACGATCCGTCGCGCGCCGGAAGCGTTCC
TGTGCGTCTGCTGGAGGGCTTTAGTGGCATACTGCAGGCCGATGGTTATTCCGGCTATAGCCAGGTGTGCAAAGAGAGCGGCCTGACACGGATTGGCTGC
TGGGATCATGCTCGGCGCAAATTCATCGAAGCGACCCGAGCTGCACCTAAGGGTAAAGACAAAGGTAAGAGCAAAGCCAGTACTGGCCTGGCCGATGTAG
CATTGGGGTACATAGGTAAACTCTATGCCATCGAGCGGGAGCAGAAGGAGCGCAGTGATGCCGAGCGTTATCAGGCACGGCAGACACGTAGCATGCCCCT
GTTGGCGGAGCTCAAAACCTGGCTGGACAATAACGTCGGCAAGGTGATGAAAGGCTCGCTGACTCGACAGGCGATGGAATATACGCTGGGGCAATGGCCC
CATCTGGTGGGTTACTGCGTGCGGGGAGATCTGCACATCAGCAATATCCTGGCGGAGAACGCAATCCGCCCGTTCGCCGTGGGGCGCAAGGCCTGGTTGT
TCGCCGACAGCGCGCAAGGCGCCAAGGCCAGCGCGACCTGCTACTCGTTGCTGGAGACAGCCAAAGCCAATGACCTGGAGCCATCAGCGTATATCAACTA
TGTGCTGGCGCAGATCGGCGAGGCCGATAGCCTGGAAAAACTCGAAGCCCTGCTACCCTGGAACGTCCCGCTGGAGCCCATTGCAAAAAAAGTGGCGCAA
TACAACGAAGGCAAGTAGGTCGATTTAGCGGCGCTTAC
CGTGCTGATTTTTGGCAGCAGCAGATCACTGCCTGGCTTGACTCCGGCCTCTCCGGCCACGCCTTTTGCAAAAGCCAGGCACTGGTCTATCACCAGTTTG
CTTACTGGCGTAAAAAGCTCGAACAGCCTGTAGACTCGGACTCTCTTCCTGGCTTTGTCAGAGTAGCCATGCCTGCTCCTGTCGGGACCGAAGTCGGCTT
GACGCTCACCTTACCCAACGGGATCACTCTCTCCGGATTACATCCGGGCAATATCGATTTGCTCGGCGCTATCATGAGGCAGTTGTAATGCGCCCTCGTT
ACTTGCGTCCGGCACTGACCATGCCGCAGATCTACCTCTATCGGGCTCCTATCGATTTTCGTAAACAGGCCCACGGCCTGGCGGCGCTGGTCGAGCAGCA
ACTGGGGCACAACCCGTTCACCGGTGCCCTCTATGCCTTTACCAATCGCCGCCGCAACAAGATCAAGTGTCTGATGTGGGAGGACAACGGCTTTGTGCTC
TATTACAAGGCGCTGGCCGAGGAGAAATTCAAGTGGCCATCGTCTGCCGATGAGCTACTCTCCCTCTCTTCCGAGCAGATAAACTGGTTGCTCGATGGCT
ACGATATCTCCTTGATGCAGGGCCATAAAACACTGCATTATGAGACGCTTTAATACCGCCAACCTCGTGCTTTACCGCCTTGTTTTTGTTATAATTTCCC
CATGAAAATGACGCTGAAAAACCTCACCCCATCACCTGATCTCAGTGGCCTTAGCGCCGCTGAGTTGTTGGCGGTTATTGCGGGTTTTCAGCAACAGTTG
GCGTTAAAAGAAGAGGCTATCCAGCGGCGTGACGCCCATATCCTGTTACTCGAAGAGCTGCTGCGTCTGCGCAGGATCCAGCGCTTTGCCGCCAGCAGCG
AGAAACTCCATCAGCTCCAGCTCTTTGATGAGGCAGAGCTGGAAGCGGACATGGACGCCCTGCTGGCCCAGTTACCCGATGACCTGCCGCAAACTGCCGA
AGCAAAGGCCAAGCCGCGTCAACGTGGCTTCTCCGCCTCACTGCTGCGCGAGCGCATCGAACTCACCCTCAGCGATGAGCAGAAAGCCGGGGCCAGCAAG
GTGTTCTTTACCAAGGTCAAAGAGGAGTTGCAGTTCATTCCGGCTCAACTCAAGGTGCTGGAGATCTGGCAAGAAAAGGCGGTGTTCGAGCGTGATGGCG
AAGAGGTGATCCTCGCGGCCAACCGCCCCGTACACCCGCTGGGCAAATGTATCGCCACGCCGTCATTGCTGGGCTACATCATTACCTCTAAGTATGCCGA
TGGCCTGCCACTGTACCGTCTGGAGCAGATGTTCAAGCGCCTAGGGCAGGAGGTGAGCCGCACCAGCATGGCCCACTGGATCATCCGGCTGGATGAGGTG
TTCCAGCCATTGATGAACCTGCTGCGCGAGGAGCAGAACCACGCGACGTATCTGCAAGCCGACGAGACGCGGATCCAGGTGCTCAAAGAAGAGGGAAAAA
CTGCGCAATCGGACAAATGGATGTGGGTGACCCGCGGTGGTCCGCCGGGCCGCTCCTCTGTGCTATTTGCCTACGATCCGTCGCGCGCCGGAAGCGTTCC
TGTGCGTCTGCTGGAGGGCTTTAGTGGCATACTGCAGGCCGATGGTTATTCCGGCTATAGCCAGGTGTGCAAAGAGAGCGGCCTGACACGGATTGGCTGC
TGGGATCATGCTCGGCGCAAATTCATCGAAGCGACCCGAGCTGCACCTAAGGGTAAAGACAAAGGTAAGAGCAAAGCCAGTACTGGCCTGGCCGATGTAG
CATTGGGGTACATAGGTAAACTCTATGCCATCGAGCGGGAGCAGAAGGAGCGCAGTGATGCCGAGCGTTATCAGGCACGGCAGACACGTAGCATGCCCCT
GTTGGCGGAGCTCAAAACCTGGCTGGACAATAACGTCGGCAAGGTGATGAAAGGCTCGCTGACTCGACAGGCGATGGAATATACGCTGGGGCAATGGCCC
CATCTGGTGGGTTACTGCGTGCGGGGAGATCTGCACATCAGCAATATCCTGGCGGAGAACGCAATCCGCCCGTTCGCCGTGGGGCGCAAGGCCTGGTTGT
TCGCCGACAGCGCGCAAGGCGCCAAGGCCAGCGCGACCTGCTACTCGTTGCTGGAGACAGCCAAAGCCAATGACCTGGAGCCATCAGCGTATATCAACTA
TGTGCTGGCGCAGATCGGCGAGGCCGATAGCCTGGAAAAACTCGAAGCCCTGCTACCCTGGAACGTCCCGCTGGAGCCCATTGCAAAAAAAGTGGCGCAA
TACAACGAAGGCAAGTAGGTCGATTTAGCGGCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
303 bp | 100 aa | 86 | 388 | + | No |
AG : IS66 TnpA
ORF sequence :
MTTQERADFWQQQITAWLDSGLSGHAFCKSQALVYHQFAYWRKKLEQPVDSDSLPGFVRVAMPAPVGTEVGLTLTLPNGITLSGLHPGNIDLLGAIMRQL
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
366 bp | 121 aa | 388 | 753 | + | No |
AG : IS66 TnpB
ORF sequence :
MRPRYLRPALTMPQIYLYRAPIDFRKQAHGLAALVEQQLGHNPFTGALYAFTNRRRNKIKCLMWEDNGFVLYYKALAEEKFKWPSSADELLSLSSEQINW
LLDGYDISLMQGHKTLHYETL
LLDGYDISLMQGHKTLHYETL
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1617 bp | 538 aa | 802 | 2418 | + | No |
Chemistry : DDE
ORF sequence :
MKMTLKNLTPSPDLSGLSAAELLAVIAGFQQQLALKEEAIQRRDAHILLLEELLRLRRIQRFAASSEKLHQLQLFDEAELEADMDALLAQLPDDLPQTAE
AKAKPRQRGFSASLLRERIELTLSDEQKAGASKVFFTKVKEELQFIPAQLKVLEIWQEKAVFERDGEEVILAANRPVHPLGKCIATPSLLGYIITSKYAD
GLPLYRLEQMFKRLGQEVSRTSMAHWIIRLDEVFQPLMNLLREEQNHATYLQADETRIQVLKEEGKTAQSDKWMWVTRGGPPGRSSVLFAYDPSRAGSVP
VRLLEGFSGILQADGYSGYSQVCKESGLTRIGCWDHARRKFIEATRAAPKGKDKGKSKASTGLADVALGYIGKLYAIEREQKERSDAERYQARQTRSMPL
LAELKTWLDNNVGKVMKGSLTRQAMEYTLGQWPHLVGYCVRGDLHISNILAENAIRPFAVGRKAWLFADSAQGAKASATCYSLLETAKANDLEPSAYINY
VLAQIGEADSLEKLEALLPWNVPLEPIAKKVAQYNEGK
AKAKPRQRGFSASLLRERIELTLSDEQKAGASKVFFTKVKEELQFIPAQLKVLEIWQEKAVFERDGEEVILAANRPVHPLGKCIATPSLLGYIITSKYAD
GLPLYRLEQMFKRLGQEVSRTSMAHWIIRLDEVFQPLMNLLREEQNHATYLQADETRIQVLKEEGKTAQSDKWMWVTRGGPPGRSSVLFAYDPSRAGSVP
VRLLEGFSGILQADGYSGYSQVCKESGLTRIGCWDHARRKFIEATRAAPKGKDKGKSKASTGLADVALGYIGKLYAIEREQKERSDAERYQARQTRSMPL
LAELKTWLDNNVGKVMKGSLTRQAMEYTLGQWPHLVGYCVRGDLHISNILAENAIRPFAVGRKAWLFADSAQGAKASATCYSLLETAKANDLEPSAYINY
VLAQIGEADSLEKLEALLPWNVPLEPIAKKVAQYNEGK
Blast result :
Comments
ISAeme5 is 45% (orfA) aa similar to ISVme1 , 72% (orfB) to ISAba24, and 58% (orfC : the transposase) to ISPpu13.
IS_pep_1 is not in UniProt, IS_pep_2 is B224_2172, and IS_pep_3 is B224_2171.
IS_pep_1 is not in UniProt, IS_pep_2 is B224_2172, and IS_pep_3 is B224_2171.
References
[1] Pfeiffer F. (2015) Direct submission
[2] Chai B., Wang H. and Chen X. (2012) J. Bacteriol. 194 (23), 6693-6694
[2] Chai B., Wang H. and Chen X. (2012) J. Bacteriol. 194 (23), 6693-6694