ISPa97
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
EU595745 | ND | Pseudomonas aeruginosa | Pseudomonas aeruginosa Pseudomonas aeruginosa PACS171b |
DNA section
IS Length : 2361 bp
Ends
IR Length : 23/29
IRL : GTAACCGTCCGGGCAACACACCTCCCCGGTTCTTCCAGCTAGCGGCATGG
IRR : GTAAGCGAACGGGCATCACACCTTGCCGGCGGGTACCCAGTTATGCGGCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCACTCTATCGGCG | GACCAGGG | GAGCCGTCCCCAT | 8 |
DNA sequence
GTAACCGTCCGGGCAACACACCTCCCCGGTTCTTCCAGCTAGCGGCATGGTGTCCACCTGTTTTTGGTGGACACCATTTCAAGCCCTTTGCAGAGGATCT
CCCGTGCCAGGCCAACGCCGCTCCTATCCCAAATCCTTCAAGGCCCAGATCGTCGAGGAGTGTACCCAGCCCGGCGCCTCGGTTGCCGGCGTGGCCTTGA
GCCACGGCCTTAACGCCAATCTCGTGCACAAGTGGATTCGCCGCCAGCAGGCACAGCTGCCCGCGGTACCCTCAGGTTTTATTCCAATTCCTCTCGTCCC
GAGCGGGCCGGCTACCCCGAGTGCGGCCGACAGGGCCATCCAGATCGCCATTCCCCATCGGACCGGCAAACTGTCGGTGCAGTGGCCGGGCAACGACCCT
GAAGGCTGCGCCCGCTTCCTGCGCGAGCTGCTGAAGTGATCCGCATCGAGCTCATCTGGCTCGCCACCGAGCCGCTGGACATGCGCGCCGGCACTGAAAC
CGCGTTGGCCCGGGTGGTTCAAGTGTTCGGTGCGGCGCAGCCGCACTGTGCCTACCTGTTCACCAACAAGCGCGCCAATCGGATGAAGGTGCTGGTGCAC
GATGGCTTCGGCATCTGGCTGGCCGCTCGCAGGCTCAATCGAGGGCGCTTCGTCTGGTCGGGCAATTGGCAGGGACAGCAAGTGGAACTCAACCCCGAGC
AGTTGCAGGCCCTGGTGATCGGCCTGCCCTGGCAGCGTCTTGGGCCCGATGCCGAGATCAGGCTCCTGTAGCCGACGTGCAATTAGGCCAATGGCCTATC
GCCGAGGGCGCCGGGATTCCTCACACTCGACGCCATGAACCCTGCCGCTTTCGATCAGCTGACACCCGAGCAACTGCGCCAACTGGCCGCGCAGTTGAGT
CAGCGCGTCGATCACTTGGAAACGCTCAACCAGCGGTTGAACCACGAACTCGCCGTGCTGCGGCGCCACCGCTTCGCCCGCCGCAGCGAGCAGCTCAATG
CCGACCAGCTCAATCTGCTCGACGAGATGATCGATGCCGACATCGCCGCCATCGAGGCAGAGCTTGAGGCGGCCCGACCAGAGTCCGCCAAGCGCGAGCT
ACGGCAGCAGCCCAAGCGAGCGCCTCTGCCGGCCGAACTGCCGCGCACGCTGATACTTCACGAGCCGGACAGCACCCAGTGTGCCTGCGGCTGCCAGCTC
AAGCGCATCGGCGAAGACGTCAGCGAGAAGCTCGACTACACCCCGGGCACCTTCACCGTCGAGCGGCATATCCGTGGCAAGTGGGTCTGCGCCGCGTGTG
AAACGTTGATCCAGGCCCCAGTGCCTGCGCAGGTGATCGACAAGGGCATCCCGACTGCGGGGCTGCTGGCTCAGGTCATGGTGGCCAAGTTCGCCGATCA
CCTGCCGCTGTACCGCCAGGAAAAGATCTTCGCCCGCGCCGGGCTGGCCATCGCGCGTTCCACCCTAGCGCAATGGGTCGGCGCTTGCGGCGTCCAGCTG
CAACCTCTGGTCGATGCCCTGCGCGACTGCCTGCTCCAGCAGGACTTCATCCTCGCCGACGAAACCCCGGTGCAGATGCTCGCCCCGGGCACGAAGAAGA
CCCAGCGGGCTTACGTCTGGGCCTATGCACCCAGCCCCTTCGCCGACCTCAAGGCCGTGGTCTACGACTTCAGGCCGAGCCGGGCTGGCGAGCACGCGCG
CAGCTTCCTGGGCGACTGGCAAGGCAAACTGGTCTGCGACGACTTCGCCGGCTACAAGGCCAGCTTCGAGCAAGGCGTGACCGAGATCGGCTGCATGGCC
CATGCACGGCGCAAGTTCTTCGACCTGCATGCTGCCAACCAGAGCCAGCTGGCCGAGCAGGCCCTCCAGTACATCGGTCAGCTCTACGATGTGGAACGCG
AAGGGCGAGAGCTGCTCGCCGAACAGCGACGGCAACTGCGCCAGGATAAAGCCAAGTCGATCATCGATGGCCTGCATAGCTGGATACTTGGGCAGCGGCA
GAAGGTGCCGGAGGGCAGCGCGATCGCCAAAACACTCGACTACAGCCTCAAGCGTTGGGCAGCGTTGGTGCGCTACCTGGATGACGGCAACCTACCCATC
GACAACAACTGGATCGAGAACCAGATCCGCCCCTGGGCCCTGGGACGCGCCAACTGGCTGTTTGCCGGCTCGCTACGCAGTGGCCAGCGTGGCGCAGCCT
TGATGACGCTGATCCAGTCAGCCCGCCTGAACGGGCACGATCCGTACGCCTACCTGAAGGACGTGCTCATGCGCCTGCCGACGCAGAAGGCCAGCGCCCT
GGCCGAGCTGCTGCCGCATAACTGGGTACCCGCCGGCAAGGTGTGATGCCCGTTCGCTTAC
CCCGTGCCAGGCCAACGCCGCTCCTATCCCAAATCCTTCAAGGCCCAGATCGTCGAGGAGTGTACCCAGCCCGGCGCCTCGGTTGCCGGCGTGGCCTTGA
GCCACGGCCTTAACGCCAATCTCGTGCACAAGTGGATTCGCCGCCAGCAGGCACAGCTGCCCGCGGTACCCTCAGGTTTTATTCCAATTCCTCTCGTCCC
GAGCGGGCCGGCTACCCCGAGTGCGGCCGACAGGGCCATCCAGATCGCCATTCCCCATCGGACCGGCAAACTGTCGGTGCAGTGGCCGGGCAACGACCCT
GAAGGCTGCGCCCGCTTCCTGCGCGAGCTGCTGAAGTGATCCGCATCGAGCTCATCTGGCTCGCCACCGAGCCGCTGGACATGCGCGCCGGCACTGAAAC
CGCGTTGGCCCGGGTGGTTCAAGTGTTCGGTGCGGCGCAGCCGCACTGTGCCTACCTGTTCACCAACAAGCGCGCCAATCGGATGAAGGTGCTGGTGCAC
GATGGCTTCGGCATCTGGCTGGCCGCTCGCAGGCTCAATCGAGGGCGCTTCGTCTGGTCGGGCAATTGGCAGGGACAGCAAGTGGAACTCAACCCCGAGC
AGTTGCAGGCCCTGGTGATCGGCCTGCCCTGGCAGCGTCTTGGGCCCGATGCCGAGATCAGGCTCCTGTAGCCGACGTGCAATTAGGCCAATGGCCTATC
GCCGAGGGCGCCGGGATTCCTCACACTCGACGCCATGAACCCTGCCGCTTTCGATCAGCTGACACCCGAGCAACTGCGCCAACTGGCCGCGCAGTTGAGT
CAGCGCGTCGATCACTTGGAAACGCTCAACCAGCGGTTGAACCACGAACTCGCCGTGCTGCGGCGCCACCGCTTCGCCCGCCGCAGCGAGCAGCTCAATG
CCGACCAGCTCAATCTGCTCGACGAGATGATCGATGCCGACATCGCCGCCATCGAGGCAGAGCTTGAGGCGGCCCGACCAGAGTCCGCCAAGCGCGAGCT
ACGGCAGCAGCCCAAGCGAGCGCCTCTGCCGGCCGAACTGCCGCGCACGCTGATACTTCACGAGCCGGACAGCACCCAGTGTGCCTGCGGCTGCCAGCTC
AAGCGCATCGGCGAAGACGTCAGCGAGAAGCTCGACTACACCCCGGGCACCTTCACCGTCGAGCGGCATATCCGTGGCAAGTGGGTCTGCGCCGCGTGTG
AAACGTTGATCCAGGCCCCAGTGCCTGCGCAGGTGATCGACAAGGGCATCCCGACTGCGGGGCTGCTGGCTCAGGTCATGGTGGCCAAGTTCGCCGATCA
CCTGCCGCTGTACCGCCAGGAAAAGATCTTCGCCCGCGCCGGGCTGGCCATCGCGCGTTCCACCCTAGCGCAATGGGTCGGCGCTTGCGGCGTCCAGCTG
CAACCTCTGGTCGATGCCCTGCGCGACTGCCTGCTCCAGCAGGACTTCATCCTCGCCGACGAAACCCCGGTGCAGATGCTCGCCCCGGGCACGAAGAAGA
CCCAGCGGGCTTACGTCTGGGCCTATGCACCCAGCCCCTTCGCCGACCTCAAGGCCGTGGTCTACGACTTCAGGCCGAGCCGGGCTGGCGAGCACGCGCG
CAGCTTCCTGGGCGACTGGCAAGGCAAACTGGTCTGCGACGACTTCGCCGGCTACAAGGCCAGCTTCGAGCAAGGCGTGACCGAGATCGGCTGCATGGCC
CATGCACGGCGCAAGTTCTTCGACCTGCATGCTGCCAACCAGAGCCAGCTGGCCGAGCAGGCCCTCCAGTACATCGGTCAGCTCTACGATGTGGAACGCG
AAGGGCGAGAGCTGCTCGCCGAACAGCGACGGCAACTGCGCCAGGATAAAGCCAAGTCGATCATCGATGGCCTGCATAGCTGGATACTTGGGCAGCGGCA
GAAGGTGCCGGAGGGCAGCGCGATCGCCAAAACACTCGACTACAGCCTCAAGCGTTGGGCAGCGTTGGTGCGCTACCTGGATGACGGCAACCTACCCATC
GACAACAACTGGATCGAGAACCAGATCCGCCCCTGGGCCCTGGGACGCGCCAACTGGCTGTTTGCCGGCTCGCTACGCAGTGGCCAGCGTGGCGCAGCCT
TGATGACGCTGATCCAGTCAGCCCGCCTGAACGGGCACGATCCGTACGCCTACCTGAAGGACGTGCTCATGCGCCTGCCGACGCAGAAGGCCAGCGCCCT
GGCCGAGCTGCTGCCGCATAACTGGGTACCCGCCGGCAAGGTGTGATGCCCGTTCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
393 bp | 130 aa | 47 | 439 | + | No |
AG : IS66 TnpA
ORF sequence :
MVSTCFWWTPFQALCRGSPVPGQRRSYPKSFKAQIVEECTQPGASVAGVALSHGLNANLVHKWIRRQQAQLPAVPSGFIPIPLVPSGPATPSAADRAIQI
AIPHRTGKLSVQWPGNDPEGCARFLRELLK
AIPHRTGKLSVQWPGNDPEGCARFLRELLK
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
336 bp | 111 aa | 436 | 771 | + | No |
AG : IS66 TnpB
ORF sequence :
MIRIELIWLATEPLDMRAGTETALARVVQVFGAAQPHCAYLFTNKRANRMKVLVHDGFGIWLAARRLNRGRFVWSGNWQGQQVELNPEQLQALVIGLPWQ
RLGPDAEIRLL
RLGPDAEIRLL
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1512 bp | 503 aa | 835 | 2346 | + | No |
Chemistry : DDE
ORF sequence :
MNPAAFDQLTPEQLRQLAAQLSQRVDHLETLNQRLNHELAVLRRHRFARRSEQLNADQLNLLDEMIDADIAAIEAELEAARPESAKRELRQQPKRAPLPA
ELPRTLILHEPDSTQCACGCQLKRIGEDVSEKLDYTPGTFTVERHIRGKWVCAACETLIQAPVPAQVIDKGIPTAGLLAQVMVAKFADHLPLYRQEKIFA
RAGLAIARSTLAQWVGACGVQLQPLVDALRDCLLQQDFILADETPVQMLAPGTKKTQRAYVWAYAPSPFADLKAVVYDFRPSRAGEHARSFLGDWQGKLV
CDDFAGYKASFEQGVTEIGCMAHARRKFFDLHAANQSQLAEQALQYIGQLYDVEREGRELLAEQRRQLRQDKAKSIIDGLHSWILGQRQKVPEGSAIAKT
LDYSLKRWAALVRYLDDGNLPIDNNWIENQIRPWALGRANWLFAGSLRSGQRGAALMTLIQSARLNGHDPYAYLKDVLMRLPTQKASALAELLPHNWVPA
GKV
ELPRTLILHEPDSTQCACGCQLKRIGEDVSEKLDYTPGTFTVERHIRGKWVCAACETLIQAPVPAQVIDKGIPTAGLLAQVMVAKFADHLPLYRQEKIFA
RAGLAIARSTLAQWVGACGVQLQPLVDALRDCLLQQDFILADETPVQMLAPGTKKTQRAYVWAYAPSPFADLKAVVYDFRPSRAGEHARSFLGDWQGKLV
CDDFAGYKASFEQGVTEIGCMAHARRKFFDLHAANQSQLAEQALQYIGQLYDVEREGRELLAEQRRQLRQDKAKSIIDGLHSWILGQRQKVPEGSAIAKT
LDYSLKRWAALVRYLDDGNLPIDNNWIENQIRPWALGRANWLFAGSLRSGQRGAALMTLIQSARLNGHDPYAYLKDVLMRLPTQKASALAELLPHNWVPA
GKV
Blast result :
Comments
ISPa97 is 84% aa similar to ISPsy43.
References
1] Dongsheng Zhou (2020) Direct submission.
2] Hayden,H.S., Gillett,W., Saenphimmachak,C., Lim,R., Zhou,Y., Jacobs,M.A., Chang,J., Rohmer,L., D'Argenio,D.A., Palmieri,A., Levy,R., Haugen,E., Wong,G.K., Brittnacher,M.J., Burns,J.L., Miller,S.I., Olson,M.V. and Kaul,R. (2008) Genomics 91 (6), 530-537.
2] Hayden,H.S., Gillett,W., Saenphimmachak,C., Lim,R., Zhou,Y., Jacobs,M.A., Chang,J., Rohmer,L., D'Argenio,D.A., Palmieri,A., Levy,R., Haugen,E., Wong,G.K., Brittnacher,M.J., Burns,J.L., Miller,S.I., Olson,M.V. and Kaul,R. (2008) Genomics 91 (6), 530-537.