ISPssa2
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
LT629787 | ND | Pseudomonas salegens | Pseudomonas salegens CECT 8338 |
DNA section
IS Length : 1984 bp
Ends
IR Length : 22/28
IRL : TGTCACCGCCGGAGTAATACTGACCCACCAACGTCGACTTAAAATTGACC
IRR : TGTCAACGACGGTTCAAAACTGACCCACTTATCGGCGATATCGTCGGTTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GATAAACGGAAAAGC | GATC | CCGGCAAAGTTGG | 4 |
DNA sequence
TGTCACCGCCGGAGTAATACTGACCCACCAACGTCGACTTAAAATTGACCCACCTGAAGCAAAATGGCCGCCTAATCAAATGACCAGAGCGGCGGCCGAT
GTTGACTCAGGAGAGACTCGTGGAAATTCATGTATTACATGGCCAAGGCAACAGCATTCGCGCCATCGCAAAGCAGCTCGGAGTGTCGCGCAATACGGTG
CGGAGGTATCTGCGGGATCTGACGGTTGTACCTAGCTATCCGGACCGGGCTGCTCGCCCAGCAAAGCTAGAGCCCTACAAAACCTATCTGTTAGCACGGA
TCGAGGCGGCCAAACCGCACTGGATACCGGCAACGGTTCTGTTCAGCGAGATTCAGGATCGTGGCTATGCTGGCGGAATCACTCAGCTCAAGAACTATCT
CGCCGAGTTCAAGACGGCCGTGCCTGATCCGGTAGTTCGTTTCGAGACGCCTCCAGGCAAGCAAATGCAGGTCGATTTCACGACCATCTCTCGCCACCGG
CGCACGATCAAAGCTTTCGTCGCCACGCTGGGATATAGCCGCGCTACCTACGTCCGGTTTTCCGAGCATGAGCGTCAGGATGACTGGCTGCGCGGGATCG
AGGAGGCATGCCATTACTTCGGCGGGGTACCCCAGGAGATCCTGTTCGACAACGCCAAGACCATCATGATCGAGCGTGATGCTTACGGCGAAGGCCAGCA
TCGTTGGAATGCGCAACTGCTGGCGACTGCGCGTGATTATGGCTTTATCGCCCGCGCTTGCCGACCCTACAGAGCTCGCACCAAGGGCAAGGTCGAGCGC
TTCAATGGTTACTTGAAAAACAGCTTCATCACGCCCCTGGCAGCCACTCTGAATCAGGCTGGGCTTCGGCTGGATGTGGCCACGGCCAATGCCCATATAG
GTCCCTGGCTCGAGCGTGTGGCGCACCAGCGTATCCATGGCACCACCGGCATCAAGCCTCAGATATTGCTGGATCAGGAGCGCTTCCAGCTCAGGCCATT
GCCACGGCGGGCAAAACGGGGCTTAGAGCACATACCCGCCGCCGCACGGCCTGTGCCCAGGGAGAGCTTCCAGCATCCGCTGAGCACCTATGATCGGCTG
CTGGAGGCACGTCCATGAACCTGCAACATCAGCGTATCCAACACGCCTGCGCGAACCTAAAGCTCGACACGTTAGCCAGCGAATGGTCAGCCATGGCCGA
CCGTTGCGCCTCGCAAGAAGACACCTTGGCCGACTTCCTTGAGCAGCTATTGCGCTTGGAACTGGACGCACGGTCCCTGCGCTCCCGTGAGACCTTACTC
AAGTTTGCCGGCTTCCCTGGGCGCAAGCTCTTCGAGGACTATGACTTTAAATTCGCCAGCGGAGCGCCGCGCAAACAGCTGAACGAACTGACAAGCTTGG
CCTTCGTAGAGCGAGCAGAGAATGTCGTACTCCTCGGCCCCAGTGGCGTTGGCAAGAGTCATCTCGCCATCAGCCTAGGTCACAAAGCCGTCGCTCAAGG
CATCAAGACACGCTTCATCGCCGCAGCCGATCTGATGCTGCAACTGGCAACAGCCCGCAAACAGGAACGCCTAGAGCAATACTTGAAGCGCAGCGTGCTA
GCTCCGCGGCTGCTGATCATCGACGAGATCGGCTATCTGCCGTTTGGCCGCGAAGAAGCCAATCTGTTCTTCAATGTCATCGCCAAGCGGTACGAGCAAG
GCAGTGTCATCGTAACCAGTAACCTGCCGTTCTCGCAGTGGTCCCATGCCTTCGCCGATGACACAACCTTGACGGCAGCGCTGTTAGACCGGTTGTTACA
CCATGCGCATATCGTGCAGATCCGTGGCGAAAGCTACCGCTTGAAGGACAAGAACGCTGCGGGTATCGCGCCGGTCTCGGGCAGCAGCCAAGGTCTATTA
ACCAACAATTAATCGCCAAGGGTGGGTCAGTTTTAAACCGACGATATCGCCGATAAGTGGGTCAGTTTTGAACCGTCGTTGACA
GTTGACTCAGGAGAGACTCGTGGAAATTCATGTATTACATGGCCAAGGCAACAGCATTCGCGCCATCGCAAAGCAGCTCGGAGTGTCGCGCAATACGGTG
CGGAGGTATCTGCGGGATCTGACGGTTGTACCTAGCTATCCGGACCGGGCTGCTCGCCCAGCAAAGCTAGAGCCCTACAAAACCTATCTGTTAGCACGGA
TCGAGGCGGCCAAACCGCACTGGATACCGGCAACGGTTCTGTTCAGCGAGATTCAGGATCGTGGCTATGCTGGCGGAATCACTCAGCTCAAGAACTATCT
CGCCGAGTTCAAGACGGCCGTGCCTGATCCGGTAGTTCGTTTCGAGACGCCTCCAGGCAAGCAAATGCAGGTCGATTTCACGACCATCTCTCGCCACCGG
CGCACGATCAAAGCTTTCGTCGCCACGCTGGGATATAGCCGCGCTACCTACGTCCGGTTTTCCGAGCATGAGCGTCAGGATGACTGGCTGCGCGGGATCG
AGGAGGCATGCCATTACTTCGGCGGGGTACCCCAGGAGATCCTGTTCGACAACGCCAAGACCATCATGATCGAGCGTGATGCTTACGGCGAAGGCCAGCA
TCGTTGGAATGCGCAACTGCTGGCGACTGCGCGTGATTATGGCTTTATCGCCCGCGCTTGCCGACCCTACAGAGCTCGCACCAAGGGCAAGGTCGAGCGC
TTCAATGGTTACTTGAAAAACAGCTTCATCACGCCCCTGGCAGCCACTCTGAATCAGGCTGGGCTTCGGCTGGATGTGGCCACGGCCAATGCCCATATAG
GTCCCTGGCTCGAGCGTGTGGCGCACCAGCGTATCCATGGCACCACCGGCATCAAGCCTCAGATATTGCTGGATCAGGAGCGCTTCCAGCTCAGGCCATT
GCCACGGCGGGCAAAACGGGGCTTAGAGCACATACCCGCCGCCGCACGGCCTGTGCCCAGGGAGAGCTTCCAGCATCCGCTGAGCACCTATGATCGGCTG
CTGGAGGCACGTCCATGAACCTGCAACATCAGCGTATCCAACACGCCTGCGCGAACCTAAAGCTCGACACGTTAGCCAGCGAATGGTCAGCCATGGCCGA
CCGTTGCGCCTCGCAAGAAGACACCTTGGCCGACTTCCTTGAGCAGCTATTGCGCTTGGAACTGGACGCACGGTCCCTGCGCTCCCGTGAGACCTTACTC
AAGTTTGCCGGCTTCCCTGGGCGCAAGCTCTTCGAGGACTATGACTTTAAATTCGCCAGCGGAGCGCCGCGCAAACAGCTGAACGAACTGACAAGCTTGG
CCTTCGTAGAGCGAGCAGAGAATGTCGTACTCCTCGGCCCCAGTGGCGTTGGCAAGAGTCATCTCGCCATCAGCCTAGGTCACAAAGCCGTCGCTCAAGG
CATCAAGACACGCTTCATCGCCGCAGCCGATCTGATGCTGCAACTGGCAACAGCCCGCAAACAGGAACGCCTAGAGCAATACTTGAAGCGCAGCGTGCTA
GCTCCGCGGCTGCTGATCATCGACGAGATCGGCTATCTGCCGTTTGGCCGCGAAGAAGCCAATCTGTTCTTCAATGTCATCGCCAAGCGGTACGAGCAAG
GCAGTGTCATCGTAACCAGTAACCTGCCGTTCTCGCAGTGGTCCCATGCCTTCGCCGATGACACAACCTTGACGGCAGCGCTGTTAGACCGGTTGTTACA
CCATGCGCATATCGTGCAGATCCGTGGCGAAAGCTACCGCTTGAAGGACAAGAACGCTGCGGGTATCGCGCCGGTCTCGGGCAGCAGCCAAGGTCTATTA
ACCAACAATTAATCGCCAAGGGTGGGTCAGTTTTAAACCGACGATATCGCCGATAAGTGGGTCAGTTTTGAACCGTCGTTGACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1020 bp | 339 aa | 99 | 1118 | + | No |
Chemistry : DDE
ORF sequence :
MLTQERLVEIHVLHGQGNSIRAIAKQLGVSRNTVRRYLRDLTVVPSYPDRAARPAKLEPYKTYLLARIEAAKPHWIPATVLFSEIQDRGYAGGITQLKNY
LAEFKTAVPDPVVRFETPPGKQMQVDFTTISRHRRTIKAFVATLGYSRATYVRFSEHERQDDWLRGIEEACHYFGGVPQEILFDNAKTIMIERDAYGEGQ
HRWNAQLLATARDYGFIARACRPYRARTKGKVERFNGYLKNSFITPLAATLNQAGLRLDVATANAHIGPWLERVAHQRIHGTTGIKPQILLDQERFQLRP
LPRRAKRGLEHIPAAARPVPRESFQHPLSTYDRLLEARP
LAEFKTAVPDPVVRFETPPGKQMQVDFTTISRHRRTIKAFVATLGYSRATYVRFSEHERQDDWLRGIEEACHYFGGVPQEILFDNAKTIMIERDAYGEGQ
HRWNAQLLATARDYGFIARACRPYRARTKGKVERFNGYLKNSFITPLAATLNQAGLRLDVATANAHIGPWLERVAHQRIHGTTGIKPQILLDQERFQLRP
LPRRAKRGLEHIPAAARPVPRESFQHPLSTYDRLLEARP
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
798 bp | 265 aa | 1115 | 1912 | + | No |
AG : IS21 helper
ORF sequence :
MNLQHQRIQHACANLKLDTLASEWSAMADRCASQEDTLADFLEQLLRLELDARSLRSRETLLKFAGFPGRKLFEDYDFKFASGAPRKQLNELTSLAFVER
AENVVLLGPSGVGKSHLAISLGHKAVAQGIKTRFIAAADLMLQLATARKQERLEQYLKRSVLAPRLLIIDEIGYLPFGREEANLFFNVIAKRYEQGSVIV
TSNLPFSQWSHAFADDTTLTAALLDRLLHHAHIVQIRGESYRLKDKNAAGIAPVSGSSQGLLTNN
AENVVLLGPSGVGKSHLAISLGHKAVAQGIKTRFIAAADLMLQLATARKQERLEQYLKRSVLAPRLLIIDEIGYLPFGREEANLFFNVIAKRYEQGSVIV
TSNLPFSQWSHAFADDTTLTAALLDRLLHHAHIVQIRGESYRLKDKNAAGIAPVSGSSQGLLTNN
Blast result :
Comments
ISPssa2 is 76% (transposase) aa similar to ISAeme17.
Identified within an integron that contains a putatively non-functional ISPssa1 isoform.
Identified within an integron that contains a putatively non-functional ISPssa1 isoform.
References
1] Sarah Sonbol (2021) Direct submission.