ISPa108
- Family IS21
- Group
Isoform Synonym(s)
| Accession number | Transposition | Origin | Host |
|---|---|---|---|
| | ND | Pseudomonas aeruginosa | Pseudomonas aeruginosa RW109 |
DNA section
IS Length : 2642 bp
Ends
IR Length : 39/49
IRL : TGCGGATTTCGGAGCATCGTGACCGGCCGTTTCGGTTGATCGTGACCGGT
IRR : TGCGGATTTCGGTGAACGTGACCGAGCGTTTCGCTAATACGTGACCGGCT
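The orientation of the two ends listed above can be checked against the DNA sequence of this entry. The sketch below is illustrative only: the names `LEFT_END`, `RIGHT_END`, `reverse_complement`, and `shared_prefix_length` are hypothetical helpers, and the two end strings were copied by hand from the first and last 50 nt of the 2642 bp sequence in this record. It suggests that IRR is reported as the reverse complement of the right end, so that both repeats read outside-in.

```python
# Inverted repeats as listed in this entry.
IRL = "TGCGGATTTCGGAGCATCGTGACCGGCCGTTTCGGTTGATCGTGACCGGT"
IRR = "TGCGGATTTCGGTGAACGTGACCGAGCGTTTCGCTAATACGTGACCGGCT"

# First and last 50 nt of the element, copied from the DNA sequence
# in this record (assumption: copied without transcription errors).
LEFT_END = "TGCGGATTTCGGAGCATCGTGACCGGCCGTTTCGGTTGATCGTGACCGGT"
RIGHT_END = "AGCCGGTCACGTATTAGCGAAACGCTCGGTCACGTTCACCGAAATCCGCA"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    complement = str.maketrans("ACGT", "TGCA")
    return seq.translate(complement)[::-1]


def shared_prefix_length(a: str, b: str) -> int:
    """Count identical leading positions (ungapped) of two strings."""
    count = 0
    for x, y in zip(a, b):
        if x != y:
            break
        count += 1
    return count


irl_matches_left_end = LEFT_END == IRL
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR
terminal_identity = shared_prefix_length(IRL, IRR)

print("IRL equals left end:", irl_matches_left_end)
print("IRR equals reverse complement of right end:", irr_matches_right_end)
print("Identical terminal positions (ungapped):", terminal_identity)
```

The ungapped prefix count is only a rough indicator. The "IR Length" value in this entry presumably comes from a gapped alignment, which this sketch does not attempt to reproduce.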
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCCCGACTCG | GCAATTCC | CTTCTTGGGC | 8 |
TTGGTGTGGCTCCT | GTTCGC | CATATTTTCGTGTG | 6 |
DNA sequence
TGCGGATTTCGGAGCATCGTGACCGGCCGTTTCGGTTGATCGTGACCGGTCATTTCGCTAACGCGTGACCGCTCATTTCGGTAGCAACGTGACCGATTTT
CCGCCTGCTCCGAAACAGGCGGTCACGGCTTACCGAAATCGCCGGTCACGGCTTAGCGAAAGCCTTCCTCTTCGTTGCGCATGACCTGATGCACCGCCAC
CCTCGCCCGATTTCGGGAGACGAGGGATGGCGGCGCCGCGAGTAGCCATGCGAAACATCAAAGAATGTCTGCGCCTCAAGCTCGAGGCCGGTTTGTCCCA
CGAGAAGATCGCCCGCGCCTTGCAGTTGTCCAAGGGCGTGGTCAGCAAGTACGTTACGGCGGCCCGGGTGGCCGGGCTGGACTGGCCGATGTTGGCAGCT
ATGGATGAGGCCACGTTGGCAGCCGCCTTGTTTGCGCCGACGCCACCGGGCAAGCCGCGCGGTGAGCGGGTACTGCCCGATGTGCTGAGCATCCACCGCG
AGCTGCGGCGCAAGGGCGTGACCTTGCAGCTGCTGTGGGAGGAGTATCTCGCCGCGCATGCGGGGCAGCCGACCTACCGCTACACCCAGTTCGTCGAGCA
CTACCGGCGTTACGCCCAGACGCTCAAACGTTCGATGCGCCAGGTGCACCGGGCGGGCGAGAAGCTGTTTATCGACTATGCCGGCCCGACGCTGCCGGTG
GTCGATCCAGACACCGGCGAAGTGCGCCGGGCGCATATCTTCGTCGCCGCCCTGGGCGCCTCGAATTACACCTATGCCTGCGCGACGCCGGGCGAAACCC
AAGTGGACTGGCTGACTTCGCTGGGCCAGGCGTTGACCTACTTCGGCGGCGTGCCGGAGATGGTCGTGCCGGACAATCCGCGCGCCCTGGTCGCGCTGCC
GGATCGTTACGAACCGGGCCTGAACCGAGCGGCGCTGGAGTGCGCCCGCCATTACGACACCGTGATGCTGCCGGCGCGGCCACGCAAACCCCAGGACAAG
GCCAAGGCCGAGGTGGCGGTGCAGGTGGTCGAGCGTTGGATCATGGCGCGCCTGCGCCACCGGCAGTTCTTCAGCCTGCATGCGCTTAACCAGGCCATCG
CCGAGCTGCTGGAGGATCTGAATCGGCGCCCGTTCAAGCGGCTCGATGGCTGCCGGCGCGACTGGTTCGAGCGCCTGGATCGGCCGGCCTTGCGAGCGCT
GCCGGTGCATCCCTACGAGGTCGCCACCTTCAAGCGCTGCAAGGTCAGCATCGACTACCACATCGAGGTCAATGGCAGCTTCTACAGCGTGCCCTCCGCC
CTGGCCCGGCAGAGCGTCGAGGTGCGGCTGACGGCGCACACCCTGGAAGTGCTGCATGGCAACCGACGGGTGGCCAGCCATGTGCTGCTGGGCCGTCGCG
GCGCCTACAGCACGCAACGCGAGCACATGCCCGCGGCGCACCAGGCGCATCGCGAATGGACGCCGCAACGCCTGCTCGACTGGGGCGAGCGGATCGGCCC
CTACACGCGCCAACTGATCGATCACCAGCTGACCCACAAGCCGCACCCGGAGATGGGCTACCGCGCCTGCCTTGGCTTGCTCTCGCTGGCCCGCCGCTAC
GGCAATGCACGGCTGGAAGCCGCCGCCGAGCGGGCCGTCCAACTGCGTGCCTTCACCGGGCGCAGCGTGCGCAACCTGCTCCAGCAAGGCCTGGATCAGC
AGCCGCTGCCCCAGCGTGCGGCTGCAACGGCCTTACCCGAGCACCACGAAAACGTCCGTGGCGCCGACTACTACCAACCCCCGCAACAGGAGCTATTCGA
TGATGCCGCAACACACCCTGAATCAACTGCACCAGCTACGCCTGGACGGCATGGCTCGCGCACTGGAGGAACAATGGACGCTGCCGGGCAGCCACAGCCT
GAGCTTCGATGAACGCCTCGGGCTGCTGCTCGACCGCGAACTGGCCTGGCGTGACAACCAGCGCCTGGTGCGACTGCGCAAGAAGGCCAAGCTCAAGTAC
GCCAACGCCTGCCTGGAAGATCTCGACCGCCGACCCGGCCGCGCCCTGGACGAACGCCTGATCGCCAGCCTGGCCGGCGGCGACTGGATCCGCCAGCAGC
ACAACCTGCTGCTGACCGGGCCGACCGGCGCCGGCAAGACCTGGCTGGCCTGCGCGCTGGGTAACCAAGCCTGCCGCCAGGGCTACAGCACCCTGTATCT
GCGCACCCCGCGCCTGCTGGAGCAACTGCGTATCGCCCACGGCGACGGCAGCTTCGGGCGCACCCTGCAACAGCTGGCCAAGGTCGACGTCCTGGTGCTG
GACGACTGGGCGCTGGCCCCGTTGGAGGAAGGGGCTCGGCATGACCTGCTGGAAGTGATCGACGACCGCGCCGGCAACCGCTCCACCATCCTGACCAGCC
AACTGCCGCTCGAACACTGGCATGGCTGGATCAACGACCCGACCCTGGCCGACGCCATCCTCGACCGCCTGGTGCACAACGCTTACCGACTGACGATGAA
GGGCGAGTCACTGCGCCGGAAAAAAGCCGAGGAACAAACCGCATCGTGACCGATGCGATTACAATCCAAAACCCGCGCAACCGGGGTGGAAGAGCCGGTC
ACGTATTAGCGAAACGCTCGGTCACGTTCACCGAAATCCGCA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1686 | 561 | 227 | 1912 | + | No |
Chemistry : DDE
ORF sequence :
MAAPRVAMRNIKECLRLKLEAGLSHEKIARALQLSKGVVSKYVTAARVAGLDWPMLAAMDEATLAAALFAPTPPGKPRGERVLPDVLSIHRELRRKGVTL
QLLWEEYLAAHAGQPTYRYTQFVEHYRRYAQTLKRSMRQVHRAGEKLFIDYAGPTLPVVDPDTGEVRRAHIFVAALGASNYTYACATPGETQVDWLTSLG
QALTYFGGVPEMVVPDNPRALVALPDRYEPGLNRAALECARHYDTVMLPARPRKPQDKAKAEVAVQVVERWIMARLRHRQFFSLHALNQAIAELLEDLNR
RPFKRLDGCRRDWFERLDRPALRALPVHPYEVATFKRCKVSIDYHIEVNGSFYSVPSALARQSVEVRLTAHTLEVLHGNRRVASHVLLGRRGAYSTQREH
MPAAHQAHREWTPQRLLDWGERIGPYTRQLIDHQLTHKPHPEMGYRACLGLLSLARRYGNARLEAAAERAVQLRAFTGRSVRNLLQQGLDQQPLPQRAAA
TALPEHHENVRGADYYQPPQQELFDDAATHPESTAPATPGRHGSRTGGTMDAAGQPQPELR
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
699 | 232 | 1851 | 2549 | + | No |
AG : IS21 helper
ORF sequence :
MARALEEQWTLPGSHSLSFDERLGLLLDRELAWRDNQRLVRLRKKAKLKYANACLEDLDRRPGRALDERLIASLAGGDWIRQQHNLLLTGPTGAGKTWLA
CALGNQACRQGYSTLYLRTPRLLEQLRIAHGDGSFGRTLQQLAKVDVLVLDDWALAPLEEGARHDLLEVIDDRAGNRSTILTSQLPLEHWHGWINDPTLA
DAILDRLVHNAYRLTMKGESLRRKKAEEQTAS
Blast result :
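The ORF coordinates in this entry can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a span that includes the stop codon (which is consistent with the lengths listed here). The `Orf` class and the helper names are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int      # 1-based, inclusive
    end: int        # 1-based, inclusive
    length_bp: int  # as listed in the entry
    length_aa: int  # as listed in the entry


def span_bp(orf: Orf) -> int:
    """Nucleotide span implied by the begin/end coordinates."""
    return orf.end - orf.begin + 1


def expected_aa(orf: Orf) -> int:
    """Residue count, assuming the span ends with a stop codon."""
    return span_bp(orf) // 3 - 1


# Values copied from the ORF tables of this entry.
ORFS = [
    Orf("ORF 1", 227, 1912, 1686, 561),
    Orf("ORF 2", 1851, 2549, 699, 232),
]

for orf in ORFS:
    consistent = (
        span_bp(orf) == orf.length_bp
        and span_bp(orf) % 3 == 0
        and expected_aa(orf) == orf.length_aa
    )
    print(orf.name, "coordinates consistent:", consistent)

# Overlap between the 3' end of ORF 1 and the 5' end of ORF 2.
overlap_bp = ORFS[0].end - ORFS[1].begin + 1
# Reading-frame offset of ORF 2 relative to ORF 1 (0 means same frame).
frame_offset = (ORFS[1].begin - ORFS[0].begin) % 3

print("Overlap (bp):", overlap_bp)
print("Frame offset of ORF 2 relative to ORF 1:", frame_offset)
```

With the listed coordinates, the two ORFs overlap by 62 bp and ORF 2 lies in a different reading frame from ORF 1.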
Comments
The ISPa108 transposase shares 94% amino acid similarity with the transposase of IS1474.
References
[1] Xiaoyuan Jiang (2019) Direct submission.
[2] Green, A. (2017) Direct GenBank submission.