ISPa77
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Pseudomonas aeruginosa | Pseudomonas aeruginosa Pseudomonas aeruginosa ST233 |
DNA section
IS Length : 2597 bp
Ends
IR Length : 26/34
IRL : TGCGGATTCCACGGTCATCTGGCCACCCATTCCATGAGCATCTGACCACC
IRR : TGCGGATTCCACGCCATTCGGACACTCAGCCCACGCTGATCCGGACACCT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGAATTTCGACAGGCCGTAGGCG | ATGGC | GATGGCCGACATCGCGACGCCC | 5 |
DNA sequence
TGCGGATTCCACGGTCATCTGGCCACCCATTCCATGAGCATCTGACCACCGATTCCACGGTGATCCGGCCACCCATTCCACGCGCATCCGGCCACTGATT
CCACGGCCATCCGGCCACTCAACCGGGCAGGCAGTAACGCAGGATTTCTTCACTACCATCGACCTCTTTTTCGAAGCAGAGAGGTCGTCGTGGAGCGTTT
ATCCATGCGTAAGATTCGCGAAGTACTACGTCTCAAGTTCGAGGTCGGACTATCGGCTCGCCAGATTGCGGTCAGCGTGCAGGTCGGTCGTGTCACCGTC
GGCGACTACCTCAATCGTTTTGCCGCCAGCGGTCTCAGTTGGCCCTGTTCGTTGTCCGATGCCGAGTTGGAGCAGCAACTGTTCCCGCCGGCCCCGGCGG
TTGCCAGTGAGAAGCGGCCTTTACCCGATTGGGCATGGGTGCATGCCGAACTGCGCCGCCCCGGGGTGACCCTGGCGCTGCTCTGGCAGGAGTACCGCCT
GAGCCAGCCTCAGGGCTTTCAGTACAGCTGGTTCTGTGAGCACTACCGAGCCTGGCAGGGCAAGTTGGACGTGGTGATGCGTCAGGAGCACCGCGTCGGC
GAGAAGCTGTTCGTCGACTATGCCGGCCAGACGGTGCCGGTTATCGATCGCCACAGCGGCGAGATCCGCCAGGCGCAGGTGTTCGTCGCGGTGCTCGGCG
CGTCCAGCTACACCTTCGCCGAAGCCACCTGGTCGCAGCAGCTGCCGGACTGGCTAGGCTCCCATGCCCGCTGCTTCGCCTTCCTCGGCGGCGTGCCGGA
GATCGTGGTGCCGGACAACCTGCGCAGCGCGGTGAGCAAGAGTCACCGCTACGAGCCGGACATCAACCCCAGCTACCGCGATCTGGCCGAGCACTATGGC
GTGGCGGTGGTGCCGGCGCGGGCACGCAAACCGCGCGACAAGGCCAAGGCCGAAGTCGGCGTGCAGGTGGTCGAGCGTTGGATCCTCGCCGCACTGAGGA
ATCGGCAGTTCTTCTCCCTGGATGAACTCAACACGGCCATCGCCGGGCTGCTGGAGCGGCTCAACCAACGCCCGTTCAAGAAGCTGCCGGGCTCCCGGCA
GTCGGCCTTCGACAGCCTGGATCGTCCGGCGCTGCGCCCCCTGCCGGAGCAACCCTACGTCTACGCCGAGTGGAAGAAGGCGCGGGTGCACATCGACTAC
CACGTCGAGGTCGATGGGCATTACTACTCGGTGCCGTATCAACTGGTGAAGAAGCAGCTGGAGGTGCGCCTGACGGCGCGCACCGTCGAGTTTTTCCACG
CCAACCAGCGAGTGGCCAGCCACCTGCGCTCAATGCACAAGGGCAGGCACAGCACGCAGGCCGAGCACATGCCCAAGAGCCATCGCGAGCATGCCGAGTG
GACGCCGCAACGGCTGATCCGCTGGGCCGAGCAGACCGGGCCGAACACCGCCGGCGTGATCCGGCACATCCTCGAACGGCGCATCCATCCGCAGCAGGGC
TACCGGGCCTGCCTGGGCATCCTGCGCCTGGGTAAAACCCACGGTGAGGCGCGTCTGGAGTTGGCCTGCCGTCGCGCCATCAGCCTCGGCACGTGCAGCT
ACAAGAGCCTCGAATCGATCCTGCGCCAGGGGCTGGAAAACCTGCCGCTAGCTCAAACCAACCTGCCGCTGCTGCCGGACGACCACGCCAACCTGCGCGG
ATCCGCCTACTACCACTGACCCCAAGGAATCCCACCATGCTGCCCCATCCGACCCTGGACAAGCTGCAAACCCTGCGCCTGCACGGCATGCTCAAGGCGC
TGAATGAACAACTGAAAACCCCGGACATCGACAGCCTGAGCTTCGAAGAACGCCTCGGCCTGCTGGTCGACCGCGAGCTGACCGAACGCGATGACAAGCG
CCTGAGCAGCCGCCTGCGCCAGGCCCGGCTCAAGCACAACGCCTGCCTCGAAGACATCGACTACCGCAGCCCGCGCGGACTGGATAAGGCGCTGATCTTG
CAACTGAGCAGTGGTCAGTGGCTGCGCGACGGCCTCAACCTGATCATCGGCGGCCCCACCGGTGTCGGTAAAACCTGGCTGGCCTGCGCCCTGGCCCACC
AGGCCTGCCGGGAGGGCTACAGCGTGCGCTACCTGCGCCTGCCACGTTTGCTGGAAGAACTGGGTCTGGCCCATGGCGACGGCCGCTTCGCCAAGCTGAT
GAGCAGCTACGCCAAGACCGACCTGCTGATCCTCGACGACTGGGGCCTGGCCCCGTTCACCGGCGAGCAACGGCGCGACATGCTGGAGCTACTGGACGAC
CGTTACGGCCAGCGCTCGACCATCGTCACCAGCCAGATGCCGGTGGACAACTGGCACGAACTGATCGGCGATCCGACCCTGGCCGATGCCATCCTCGACC
GCCTGGTGCACAACGCTTATCGGATCAATCTGAAGGGTGAATCAATGCGCAAACGGACGCAGAAATTGACGACGCCAGCCAACCCGGACTAACAATGCCA
CCCCTGCGTCGCTGCGCTCCGACTGCCTGTCCGAATGAGCGTGGAACAGGTGTCCGGATCAGCGTGGGCTGAGTGTCCGAATGGCGTGGAATCCGCA
CCACGGCCATCCGGCCACTCAACCGGGCAGGCAGTAACGCAGGATTTCTTCACTACCATCGACCTCTTTTTCGAAGCAGAGAGGTCGTCGTGGAGCGTTT
ATCCATGCGTAAGATTCGCGAAGTACTACGTCTCAAGTTCGAGGTCGGACTATCGGCTCGCCAGATTGCGGTCAGCGTGCAGGTCGGTCGTGTCACCGTC
GGCGACTACCTCAATCGTTTTGCCGCCAGCGGTCTCAGTTGGCCCTGTTCGTTGTCCGATGCCGAGTTGGAGCAGCAACTGTTCCCGCCGGCCCCGGCGG
TTGCCAGTGAGAAGCGGCCTTTACCCGATTGGGCATGGGTGCATGCCGAACTGCGCCGCCCCGGGGTGACCCTGGCGCTGCTCTGGCAGGAGTACCGCCT
GAGCCAGCCTCAGGGCTTTCAGTACAGCTGGTTCTGTGAGCACTACCGAGCCTGGCAGGGCAAGTTGGACGTGGTGATGCGTCAGGAGCACCGCGTCGGC
GAGAAGCTGTTCGTCGACTATGCCGGCCAGACGGTGCCGGTTATCGATCGCCACAGCGGCGAGATCCGCCAGGCGCAGGTGTTCGTCGCGGTGCTCGGCG
CGTCCAGCTACACCTTCGCCGAAGCCACCTGGTCGCAGCAGCTGCCGGACTGGCTAGGCTCCCATGCCCGCTGCTTCGCCTTCCTCGGCGGCGTGCCGGA
GATCGTGGTGCCGGACAACCTGCGCAGCGCGGTGAGCAAGAGTCACCGCTACGAGCCGGACATCAACCCCAGCTACCGCGATCTGGCCGAGCACTATGGC
GTGGCGGTGGTGCCGGCGCGGGCACGCAAACCGCGCGACAAGGCCAAGGCCGAAGTCGGCGTGCAGGTGGTCGAGCGTTGGATCCTCGCCGCACTGAGGA
ATCGGCAGTTCTTCTCCCTGGATGAACTCAACACGGCCATCGCCGGGCTGCTGGAGCGGCTCAACCAACGCCCGTTCAAGAAGCTGCCGGGCTCCCGGCA
GTCGGCCTTCGACAGCCTGGATCGTCCGGCGCTGCGCCCCCTGCCGGAGCAACCCTACGTCTACGCCGAGTGGAAGAAGGCGCGGGTGCACATCGACTAC
CACGTCGAGGTCGATGGGCATTACTACTCGGTGCCGTATCAACTGGTGAAGAAGCAGCTGGAGGTGCGCCTGACGGCGCGCACCGTCGAGTTTTTCCACG
CCAACCAGCGAGTGGCCAGCCACCTGCGCTCAATGCACAAGGGCAGGCACAGCACGCAGGCCGAGCACATGCCCAAGAGCCATCGCGAGCATGCCGAGTG
GACGCCGCAACGGCTGATCCGCTGGGCCGAGCAGACCGGGCCGAACACCGCCGGCGTGATCCGGCACATCCTCGAACGGCGCATCCATCCGCAGCAGGGC
TACCGGGCCTGCCTGGGCATCCTGCGCCTGGGTAAAACCCACGGTGAGGCGCGTCTGGAGTTGGCCTGCCGTCGCGCCATCAGCCTCGGCACGTGCAGCT
ACAAGAGCCTCGAATCGATCCTGCGCCAGGGGCTGGAAAACCTGCCGCTAGCTCAAACCAACCTGCCGCTGCTGCCGGACGACCACGCCAACCTGCGCGG
ATCCGCCTACTACCACTGACCCCAAGGAATCCCACCATGCTGCCCCATCCGACCCTGGACAAGCTGCAAACCCTGCGCCTGCACGGCATGCTCAAGGCGC
TGAATGAACAACTGAAAACCCCGGACATCGACAGCCTGAGCTTCGAAGAACGCCTCGGCCTGCTGGTCGACCGCGAGCTGACCGAACGCGATGACAAGCG
CCTGAGCAGCCGCCTGCGCCAGGCCCGGCTCAAGCACAACGCCTGCCTCGAAGACATCGACTACCGCAGCCCGCGCGGACTGGATAAGGCGCTGATCTTG
CAACTGAGCAGTGGTCAGTGGCTGCGCGACGGCCTCAACCTGATCATCGGCGGCCCCACCGGTGTCGGTAAAACCTGGCTGGCCTGCGCCCTGGCCCACC
AGGCCTGCCGGGAGGGCTACAGCGTGCGCTACCTGCGCCTGCCACGTTTGCTGGAAGAACTGGGTCTGGCCCATGGCGACGGCCGCTTCGCCAAGCTGAT
GAGCAGCTACGCCAAGACCGACCTGCTGATCCTCGACGACTGGGGCCTGGCCCCGTTCACCGGCGAGCAACGGCGCGACATGCTGGAGCTACTGGACGAC
CGTTACGGCCAGCGCTCGACCATCGTCACCAGCCAGATGCCGGTGGACAACTGGCACGAACTGATCGGCGATCCGACCCTGGCCGATGCCATCCTCGACC
GCCTGGTGCACAACGCTTATCGGATCAATCTGAAGGGTGAATCAATGCGCAAACGGACGCAGAAATTGACGACGCCAGCCAACCCGGACTAACAATGCCA
CCCCTGCGTCGCTGCGCTCCGACTGCCTGTCCGAATGAGCGTGGAACAGGTGTCCGGATCAGCGTGGGCTGAGTGTCCGAATGGCGTGGAATCCGCA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1515 bp | 504 aa | 205 | 1719 | + | No |
Chemistry : DDE
ORF sequence :
MRKIREVLRLKFEVGLSARQIAVSVQVGRVTVGDYLNRFAASGLSWPCSLSDAELEQQLFPPAPAVASEKRPLPDWAWVHAELRRPGVTLALLWQEYRLS
QPQGFQYSWFCEHYRAWQGKLDVVMRQEHRVGEKLFVDYAGQTVPVIDRHSGEIRQAQVFVAVLGASSYTFAEATWSQQLPDWLGSHARCFAFLGGVPEI
VVPDNLRSAVSKSHRYEPDINPSYRDLAEHYGVAVVPARARKPRDKAKAEVGVQVVERWILAALRNRQFFSLDELNTAIAGLLERLNQRPFKKLPGSRQS
AFDSLDRPALRPLPEQPYVYAEWKKARVHIDYHVEVDGHYYSVPYQLVKKQLEVRLTARTVEFFHANQRVASHLRSMHKGRHSTQAEHMPKSHREHAEWT
PQRLIRWAEQTGPNTAGVIRHILERRIHPQQGYRACLGILRLGKTHGEARLELACRRAISLGTCSYKSLESILRQGLENLPLAQTNLPLLPDDHANLRGS
AYYH
QPQGFQYSWFCEHYRAWQGKLDVVMRQEHRVGEKLFVDYAGQTVPVIDRHSGEIRQAQVFVAVLGASSYTFAEATWSQQLPDWLGSHARCFAFLGGVPEI
VVPDNLRSAVSKSHRYEPDINPSYRDLAEHYGVAVVPARARKPRDKAKAEVGVQVVERWILAALRNRQFFSLDELNTAIAGLLERLNQRPFKKLPGSRQS
AFDSLDRPALRPLPEQPYVYAEWKKARVHIDYHVEVDGHYYSVPYQLVKKQLEVRLTARTVEFFHANQRVASHLRSMHKGRHSTQAEHMPKSHREHAEWT
PQRLIRWAEQTGPNTAGVIRHILERRIHPQQGYRACLGILRLGKTHGEARLELACRRAISLGTCSYKSLESILRQGLENLPLAQTNLPLLPDDHANLRGS
AYYH
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
756 bp | 251 aa | 1737 | 2492 | + | No |
AG : IS21 helper
ORF sequence :
MLPHPTLDKLQTLRLHGMLKALNEQLKTPDIDSLSFEERLGLLVDRELTERDDKRLSSRLRQARLKHNACLEDIDYRSPRGLDKALILQLSSGQWLRDGL
NLIIGGPTGVGKTWLACALAHQACREGYSVRYLRLPRLLEELGLAHGDGRFAKLMSSYAKTDLLILDDWGLAPFTGEQRRDMLELLDDRYGQRSTIVTSQ
MPVDNWHELIGDPTLADAILDRLVHNAYRINLKGESMRKRTQKLTTPANPD
NLIIGGPTGVGKTWLACALAHQACREGYSVRYLRLPRLLEELGLAHGDGRFAKLMSSYAKTDLLILDDWGLAPFTGEQRRDMLELLDDRYGQRSTIVTSQ
MPVDNWHELIGDPTLADAILDRLVHNAYRINLKGESMRKRTQKLTTPANPD
Blast result :
Comments
ISPa77 is 99% aa similar to IS1491 (ORFA : transposase) and 97% aa similar to ISUnCu3 (ORFB : helper of transposition).
References
1] Benoit Valot (2017) Direct GenBank submission.