ISPa120
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP047592.1 | ND | Pseudomonas aeruginosa | Pseudomonas aeruginosa INP-43 |
DNA section
IS Length : 2880 bp
Ends
IR Length : 20/24
IRL : GTAAGCGTCCGGCGAACTCACCTTCCGAACTCCAGCATTAAGGGCGATGG
IRR : GTAAGCGTACGGCCAATACACCTTTACTTAGGTTGAGCCGTATCAGCAAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GACTCGTAGTCT | CCTGAAAA | GACCGGCGGCTGGC | 8 |
DNA sequence
GTAAGCGTCCGGCGAACTCACCTTCCGAACTCCAGCATTAAGGGCGATGGTGTCCGCGTATTTTTAAGCGGACGCGATCACCCTTATCGCCAGGATTTCC
ATGCGCCAACGAAGCTCTTACCCGAAACCGTTCAAAGCCCAGGTCGTTCAGGAATGCCTGCAACCTGGGGCAACGGTGTCCAGTGTCGCCATCAGCCACG
GCATCAACGCCAATGTCATCCGCAAATGGCTGACGCTTTATCGAGACCAGCCCGTACCAGCCTCGTTACCAGCCTTTGTCCCGCTGAAGGCCACCCCTAA
ACGGCCAGCCGAAACGTCAGTGCTCATTGAACTGCCCATGGCCGGGCAAATGATCACGGTGAAATGGCCAGCCTCAGATCCCGAGGGCTGCGCGCAATTC
ATCCGGGCTTTCGCTCGATGATCCGCATCGATGCGATCTGGCTAGCCACCGAACCGATGGACATGCGCGCCGGCACCGAGACGGCATTAGCCCGGGTAAT
TGCGGTGTTCGGTGCGGCGAAGCCGCACTGCGCTTATCTGTTCGCCAATCGCCGGGCTAACCGAATGAAAGTGCTGGTGCACGATGGCGTGGGCATCTGG
CTTGCCGCGCGTCGACTGAACCAAGGCAAGTTCCACTGGCCCGGCATTCGCCATGGCTGCGAGGTCGAACTCGACAGCGAACAACTCCAGGCCTTGGTGC
TGGGCCTGCCGTGGCAGCGCGTCGGCACAGGCGGTGTGATCAGCATGCTGTAAACGCCGGCCATTTGGCTGGTCTACTTGTCGTCGCATCGCGCCTGCTC
TGGCACAATCGGCGGCATGACTTCCTCGCCCAACCTTGACCAGATGACCCCGGAACAGCTTCGTGCCTTGGCGGCACAGGCGTTGCAGTTGCAATCCCAG
GTCGAGGCGATGAGCAGGAAAATCCGCAACAATGAAACCCTCATCGAACAGTTCAAGTTCGAAATCGCTCTGCTCAAACGCCACAAGTTTGCCAAGCGCA
GCGAGCAAATCAGTTCGGCGCAAGGCAGCTTGCTGGATGACCTGCTCGACACCGACCTTGAAGCTATCGAGGCCGAGCTGAAACAACTCCTTCCAGCTTC
GCCACAAGCCGAGCCACGGCAATCCCCGAAACGTTCGCCATTGCCGCCGCAGTTCCCGCGCACGGTGATTCGCCACGAACCTGAAAATACCCAATGCGCC
TGCGGCTGCCAACTTCAACGCATCGGCGAAGACGTCAGCGAGAAGCTGGATTACACGCCGGGCGTGTTTACCGTCGAGCAACATGTGAGGGGCAAATGGG
CCTGCCGTCAGTGCGAAACCCTGATCCAGGCGCCGGTGCCAGCCCAGGTTATTGATAAAGGCATCCCGACCGCAGGTTTGTTGGCCCACGTGATGGTGGC
CAAGTTTGCCGATCACTTGCCGCTGTACAGACAGGAAAAAATCTTTGGCCGCGCCGGGCTGCCAATTGCCCGCTCGACCCTGGCGCAGTGGGTCGGACAA
ACTGGCGTGCGGCTTCAGCCACTGGTCGATGCACTGCGTGAAGCCGTGCTGAACCAGGACGTGATCCACGCCGATGAAACACCGGTGCAAATGCTTGCAC
CAGGCGAGAAGAAAACCCACCGGGTCTATGTCTGGGCCTACAGCACGACGCCGTTTTCGGCGCTCAAAGCGGTGGTTTACGACTTCAGCCCAAGCCGTGC
CGGAGAACATGCACGCAACTTCCTAGGCGACTGGAATGGCAAGCTGGTCTGCGACGACTTCGCTGGATACAAGGCCGGTTTTGAACAAGGCATCACTGAA
ATCGGCTGCATGGCTCATGCTCGCCGCAAGTTCTTCGACCTGCATGTCGCTAACAAAAGCCAACTGGCCGAACAGGCGCTGCACTCAATTGGCGGTTTGT
ACGAGGTTGAACGCCAGGCTCGGGACATGAGCAACGAAGACCGTTGGCGAATACGTCAGGAAATGGCGGTACCGATCAGCAAAACACTGCATGACTGGAT
GTTGGCCCAGCGCGACCTGGTGCCCAACGGCTCGGCCACAGCTAAAGCCCTCGACTACAGCCTGAAACGCTGGGGAGCGCTGACGCGCTACCTGGACGAT
GGGGCTGTGCCCATCGACAACAATCAGGTGGAGAACCAGATACGGCCGTGGGCGCTCGGACGCTCGAACTGGTTATTTGCCGGATCGCTGCGCAGTGGCA
AACGAGCAGCAGCTATCATGAGCCTGATCCAGTCCGCTCGCATGAACGGGCATGATCCGTATGCCTACCTGAAGGACGTGCTAACTCGCCTGCCGACGTT
ACGGTCGAAAGACATCAGCCAGTTGCTGCCGCATCAGTGGGTACAGATCTAGCTTATGTGATCTATTGTCCCCTGTGAAACATATATAAATCATACTTGG
TAGGTGAGGGAAATGGATATTCGCCTGGAGATTTTAGCGCTTGAACAGCTGTTGCTAGAGCCGGAATCGAGAAAGAATGATCGACTGCTTAAACAGCTGC
TTACCGAAGACTTCGTTGAATTTGGAGCTATCGGCAAAAGCTGGACGAAAGCGGAGGTGATCGTGGGACTAAAATCCCAGACTTGGATCAAAAGGACAAT
CGAGGATTTCAAACTGCGTGTGCTTGCAGATGGTGTCGCGTTAGCAACGTACCGATGCCGTCATCAAAATGCTAATGGCGATGAGTCGTTATCAATGCGT
AGCTCTGTTTGGAAAACCTACGAAGATGGTTGGCACATGGTGTTTCACCAAGGCACGAGGGTCTCCGAGTAGATGTCGGTACCAAAGCCAATGTGCATGA
GGCGGTACCTCACACATATGAATCGATTCTTTTGCTGATACGGCTCAACCTAAGTAAAGGTGTATTGGCCGTACGCTTAC
ATGCGCCAACGAAGCTCTTACCCGAAACCGTTCAAAGCCCAGGTCGTTCAGGAATGCCTGCAACCTGGGGCAACGGTGTCCAGTGTCGCCATCAGCCACG
GCATCAACGCCAATGTCATCCGCAAATGGCTGACGCTTTATCGAGACCAGCCCGTACCAGCCTCGTTACCAGCCTTTGTCCCGCTGAAGGCCACCCCTAA
ACGGCCAGCCGAAACGTCAGTGCTCATTGAACTGCCCATGGCCGGGCAAATGATCACGGTGAAATGGCCAGCCTCAGATCCCGAGGGCTGCGCGCAATTC
ATCCGGGCTTTCGCTCGATGATCCGCATCGATGCGATCTGGCTAGCCACCGAACCGATGGACATGCGCGCCGGCACCGAGACGGCATTAGCCCGGGTAAT
TGCGGTGTTCGGTGCGGCGAAGCCGCACTGCGCTTATCTGTTCGCCAATCGCCGGGCTAACCGAATGAAAGTGCTGGTGCACGATGGCGTGGGCATCTGG
CTTGCCGCGCGTCGACTGAACCAAGGCAAGTTCCACTGGCCCGGCATTCGCCATGGCTGCGAGGTCGAACTCGACAGCGAACAACTCCAGGCCTTGGTGC
TGGGCCTGCCGTGGCAGCGCGTCGGCACAGGCGGTGTGATCAGCATGCTGTAAACGCCGGCCATTTGGCTGGTCTACTTGTCGTCGCATCGCGCCTGCTC
TGGCACAATCGGCGGCATGACTTCCTCGCCCAACCTTGACCAGATGACCCCGGAACAGCTTCGTGCCTTGGCGGCACAGGCGTTGCAGTTGCAATCCCAG
GTCGAGGCGATGAGCAGGAAAATCCGCAACAATGAAACCCTCATCGAACAGTTCAAGTTCGAAATCGCTCTGCTCAAACGCCACAAGTTTGCCAAGCGCA
GCGAGCAAATCAGTTCGGCGCAAGGCAGCTTGCTGGATGACCTGCTCGACACCGACCTTGAAGCTATCGAGGCCGAGCTGAAACAACTCCTTCCAGCTTC
GCCACAAGCCGAGCCACGGCAATCCCCGAAACGTTCGCCATTGCCGCCGCAGTTCCCGCGCACGGTGATTCGCCACGAACCTGAAAATACCCAATGCGCC
TGCGGCTGCCAACTTCAACGCATCGGCGAAGACGTCAGCGAGAAGCTGGATTACACGCCGGGCGTGTTTACCGTCGAGCAACATGTGAGGGGCAAATGGG
CCTGCCGTCAGTGCGAAACCCTGATCCAGGCGCCGGTGCCAGCCCAGGTTATTGATAAAGGCATCCCGACCGCAGGTTTGTTGGCCCACGTGATGGTGGC
CAAGTTTGCCGATCACTTGCCGCTGTACAGACAGGAAAAAATCTTTGGCCGCGCCGGGCTGCCAATTGCCCGCTCGACCCTGGCGCAGTGGGTCGGACAA
ACTGGCGTGCGGCTTCAGCCACTGGTCGATGCACTGCGTGAAGCCGTGCTGAACCAGGACGTGATCCACGCCGATGAAACACCGGTGCAAATGCTTGCAC
CAGGCGAGAAGAAAACCCACCGGGTCTATGTCTGGGCCTACAGCACGACGCCGTTTTCGGCGCTCAAAGCGGTGGTTTACGACTTCAGCCCAAGCCGTGC
CGGAGAACATGCACGCAACTTCCTAGGCGACTGGAATGGCAAGCTGGTCTGCGACGACTTCGCTGGATACAAGGCCGGTTTTGAACAAGGCATCACTGAA
ATCGGCTGCATGGCTCATGCTCGCCGCAAGTTCTTCGACCTGCATGTCGCTAACAAAAGCCAACTGGCCGAACAGGCGCTGCACTCAATTGGCGGTTTGT
ACGAGGTTGAACGCCAGGCTCGGGACATGAGCAACGAAGACCGTTGGCGAATACGTCAGGAAATGGCGGTACCGATCAGCAAAACACTGCATGACTGGAT
GTTGGCCCAGCGCGACCTGGTGCCCAACGGCTCGGCCACAGCTAAAGCCCTCGACTACAGCCTGAAACGCTGGGGAGCGCTGACGCGCTACCTGGACGAT
GGGGCTGTGCCCATCGACAACAATCAGGTGGAGAACCAGATACGGCCGTGGGCGCTCGGACGCTCGAACTGGTTATTTGCCGGATCGCTGCGCAGTGGCA
AACGAGCAGCAGCTATCATGAGCCTGATCCAGTCCGCTCGCATGAACGGGCATGATCCGTATGCCTACCTGAAGGACGTGCTAACTCGCCTGCCGACGTT
ACGGTCGAAAGACATCAGCCAGTTGCTGCCGCATCAGTGGGTACAGATCTAGCTTATGTGATCTATTGTCCCCTGTGAAACATATATAAATCATACTTGG
TAGGTGAGGGAAATGGATATTCGCCTGGAGATTTTAGCGCTTGAACAGCTGTTGCTAGAGCCGGAATCGAGAAAGAATGATCGACTGCTTAAACAGCTGC
TTACCGAAGACTTCGTTGAATTTGGAGCTATCGGCAAAAGCTGGACGAAAGCGGAGGTGATCGTGGGACTAAAATCCCAGACTTGGATCAAAAGGACAAT
CGAGGATTTCAAACTGCGTGTGCTTGCAGATGGTGTCGCGTTAGCAACGTACCGATGCCGTCATCAAAATGCTAATGGCGATGAGTCGTTATCAATGCGT
AGCTCTGTTTGGAAAACCTACGAAGATGGTTGGCACATGGTGTTTCACCAAGGCACGAGGGTCTCCGAGTAGATGTCGGTACCAAAGCCAATGTGCATGA
GGCGGTACCTCACACATATGAATCGATTCTTTTGCTGATACGGCTCAACCTAAGTAAAGGTGTATTGGCCGTACGCTTAC
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
321 bp | 106 aa | 101 | 421 | + | No |
AG : IS66 TnpA
ORF sequence :
MRQRSSYPKPFKAQVVQECLQPGATVSSVAISHGINANVIRKWLTLYRDQPVPASLPAFVPLKATPKRPAETSVLIELPMAGQMITVKWPASDPEGCAQF
IRAFAR
IRAFAR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
336 bp | 111 aa | 418 | 753 | + | No |
AG : IS66 TnpB
ORF sequence :
MIRIDAIWLATEPMDMRAGTETALARVIAVFGAAKPHCAYLFANRRANRMKVLVHDGVGIWLAARRLNQGKFHWPGIRHGCEVELDSEQLQALVLGLPWQ
RVGTGGVISML
RVGTGGVISML
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1536 bp | 511 aa | 817 | 2352 | + | No |
Chemistry : DDE
ORF sequence :
MTSSPNLDQMTPEQLRALAAQALQLQSQVEAMSRKIRNNETLIEQFKFEIALLKRHKFAKRSEQISSAQGSLLDDLLDTDLEAIEAELKQLLPASPQAEP
RQSPKRSPLPPQFPRTVIRHEPENTQCACGCQLQRIGEDVSEKLDYTPGVFTVEQHVRGKWACRQCETLIQAPVPAQVIDKGIPTAGLLAHVMVAKFADH
LPLYRQEKIFGRAGLPIARSTLAQWVGQTGVRLQPLVDALREAVLNQDVIHADETPVQMLAPGEKKTHRVYVWAYSTTPFSALKAVVYDFSPSRAGEHAR
NFLGDWNGKLVCDDFAGYKAGFEQGITEIGCMAHARRKFFDLHVANKSQLAEQALHSIGGLYEVERQARDMSNEDRWRIRQEMAVPISKTLHDWMLAQRD
LVPNGSATAKALDYSLKRWGALTRYLDDGAVPIDNNQVENQIRPWALGRSNWLFAGSLRSGKRAAAIMSLIQSARMNGHDPYAYLKDVLTRLPTLRSKDI
SQLLPHQWVQI
RQSPKRSPLPPQFPRTVIRHEPENTQCACGCQLQRIGEDVSEKLDYTPGVFTVEQHVRGKWACRQCETLIQAPVPAQVIDKGIPTAGLLAHVMVAKFADH
LPLYRQEKIFGRAGLPIARSTLAQWVGQTGVRLQPLVDALREAVLNQDVIHADETPVQMLAPGEKKTHRVYVWAYSTTPFSALKAVVYDFSPSRAGEHAR
NFLGDWNGKLVCDDFAGYKAGFEQGITEIGCMAHARRKFFDLHVANKSQLAEQALHSIGGLYEVERQARDMSNEDRWRIRQEMAVPISKTLHDWMLAQRD
LVPNGSATAKALDYSLKRWGALTRYLDDGAVPIDNNQVENQIRPWALGRSNWLFAGSLRSGKRAAAIMSLIQSARMNGHDPYAYLKDVLTRLPTLRSKDI
SQLLPHQWVQI
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
360 bp | 119 aa | 2413 | 2772 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MDIRLEILALEQLLLEPESRKNDRLLKQLLTEDFVEFGAIGKSWTKAEVIVGLKSQTWIKRTIEDFKLRVLADGVALATYRCRHQNANGDESLSMRSSVW
KTYEDGWHMVFHQGTRVSE
KTYEDGWHMVFHQGTRVSE
Blast result :
Comments
ISPa120 is 97% (transposase) aa similar to ISPpu14
References
1] Dongsheng Zhou (2020) Direct submission.
2] Garcia-Contreras,R., Perez-Eretza,B., Jasso-Chavez,R., Lira-Silva,E., Roldan-Sanchez,J.A., Gonzalez-Valdez,A., Soberon-Chavez,G., Coria-Jimenez,R., Martinez-Vazquez,M., Alcaraz,L.D., Maeda,T. and Wood,T.K. (2015) Pathog Dis 73 (6), ftv040
2] Garcia-Contreras,R., Perez-Eretza,B., Jasso-Chavez,R., Lira-Silva,E., Roldan-Sanchez,J.A., Gonzalez-Valdez,A., Soberon-Chavez,G., Coria-Jimenez,R., Martinez-Vazquez,M., Alcaraz,L.D., Maeda,T. and Wood,T.K. (2015) Pathog Dis 73 (6), ftv040