ISAar44
- Family IS3
- Group IS3
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
| ND | Arthrobacter arilaitensis | Arthrobacter arilaitensis RE117 |
DNA section
IS Length : 1599 bp
Ends
IR Length : 20/38
IRL : CCCTGTCTGTCAGGCTGGAGTGAGACCACTGATTCATAGCAAGTAAACCT
IRR : CGGAAATTTTCAAGTCAAGTGAGACCAATGAGTGTTAACCAGCTTGCATT
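The IRR above is listed in the same orientation as the IRL, so it should correspond to the reverse complement of the right end of the element. A minimal sketch of that check, assuming the first and last lines of the DNA sequence in this record are used as the two ends (the names `LEFT_END`, `RIGHT_END`, and `reverse_complement` are illustrative, not part of the record):

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Terminal sequences as listed in the "Ends" section of this record.
IRL = "CCCTGTCTGTCAGGCTGGAGTGAGACCACTGATTCATAGCAAGTAAACCT"
IRR = "CGGAAATTTTCAAGTCAAGTGAGACCAATGAGTGTTAACCAGCTTGCATT"

# First and last lines of the DNA sequence section (copied from this record).
LEFT_END = (
    "CCCTGTCTGTCAGGCTGGAGTGAGACCACTGATTCATAGCAAGTAAACCT"
    "CCCCTCAGACAGGGCGAGAACAGGTACAACCGCCATCATGAGCATCGATT"
)
RIGHT_END = (
    "CCGAAGCGGCGTGGATCAATGAGCCGGTGAAGCGGTTGGAAGGACAAGAA"
    "ATGCAAGCTGGTTAACACTCATTGGTCTCACTTGACTTGAAAATTTCCG"
)

# The IRL should be the first 50 nt of the element, and the reverse
# complement of the IRR should be its last 50 nt.
print("IRL matches left end: ", LEFT_END.startswith(IRL))
print("IRR matches right end:", RIGHT_END.endswith(reverse_complement(IRR)))
```

This only confirms the orientation of the listed ends. It does not reproduce the "20/38" IR length, which presumably comes from an alignment of the two ends that the record does not show.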
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGTCCGCGCC | | AGCCGGCCTG | 0 |
CATATTCCTG | TGG | CGGTGACGTG | 3 |
GTCAGTACGC | TGG | GGCTACCGGT | 3 |
ACATCTGCCG | TTT | CGATGGAGCG | 3 |
DNA sequence
CCCTGTCTGTCAGGCTGGAGTGAGACCACTGATTCATAGCAAGTAAACCTCCCCTCAGACAGGGCGAGAACAGGTACAACCGCCATCATGAGCATCGATT
CCATCCACCCCACCCCGGGGAACATGCCGGAGAATGGAACTATGGACTCCATCGATTCGCGCCCAGAACAGCCCCGCCGCAGGAGCATCAGCCCAGCCCA
GAAACTGGCTTACCTCGACGGCTACGAACAAGCGATCCAAACCGGCGAGGGCCGAGGCTACCTGCGCGCCAACGCCTTGTATTCCTCGCAGATCGTTGAA
TGGCGCAGGCTACGCGACGCCGGAGTCCTCGGAGATTCAACCAGCAACAGCTTCCCGGCCCGGTCCAGTAGCCGGCTGAGCAAGGAACAAGCGGAAATCG
CCCGACTGAAAAAGCAGCTAGCTGCCAACGAGCAGAAGCTTGCAACCACCCAAACCGCGTTGGAGATCATGGGAAAAGCACACGCGCTCTTGGAACAAAT
CTCGAAGAGCGCGGATTCCAAATAGCAGCCCAGGACCTCCTGTCCCACGTTTATGAACAACTGGTCACCGGCGGGGTATCAACACGTCAGGCCAGCTCGC
TGGCCGGGGTCAGTAGGGCGACGATGAACCGCCGCCAAAACGCTGCCCGCGCAGGGCGAGCACTGCGGGCTACCGCTCCTCGCCCGGCACCAGCGAACAA
GCTCACCGCGCAGGAAGAAACCACGATTCTGGCAACGCTGAACAGCGAACGGTTCGTGGACCAAGCCCCAGAGCAGATCCACGCCACGCTGCTGTCTGAA
GGGACATATCTGTGCTCGGTATCGAGCATGTATCGGTTGCTGCGGAGGGCGAAGCAAGTGGCTGAACGCCGTCGACAGGCCCGTCATCCAGCACGAAAGG
TCCCCGAGCTGGTCGCGGATCAGCCTGGAGAGGTGTTCACTTGGGATATCACTAAGCTGGCCGGCCCGACGAAGGGCGTGTATTTCGACGCGTACGTGAT
GATTGATATCTATTCGCGCTATATCGTCGGTTGCCAGGTCCATACTCGTGAATCCGGGGAGCTGGCGCGTGATTTCATTGCCGGGGTGTTCGCCAAAGAC
CAGGTGCCGAAAGTTGTGCATGCCGACCGCGGCACCTCGATGACCTCGAAACCCGTGGCGGCTTTGTTGGCGGACTTGGACGTGCTCAAATCGCATTCGC
GGCCGAAGGTCAGCAATGACAACCCGTTCAGTGAGGCGTGGTTCAAGACCTTGAAGTATCTTCCGACATTCCCGCAGCGGTTTGGATCACTTGTTGATGC
CCGGGCTTTCATGGACCGGTTTGTTCAGTCGTATAACGGGCACCATCGCCATTCGGGGATTGGGTTCCATACGCCTGCTGATGTTCATTTCGGGATGACT
GGGCATGTTGATGACCAGAGGTTGGCTGCATTGCAGAGGGCTTGGGATGAGCATCCTGAGCGTTTTGGGCGGCGCAGGTTGCCGAAGAAGCTTCAGATGC
CCGAAGCGGCGTGGATCAATGAGCCGGTGAAGCGGTTGGAAGGACAAGAAATGCAAGCTGGTTAACACTCATTGGTCTCACTTGACTTGAAAATTTCCG
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
438 bp | 145 aa | 88 | 525 | + | No |
Description : First part of the transposase
ORF sequence :
MSIDSIHPTPGNMPENGTMDSIDSRPEQPRRRSISPAQKLAYLDGYEQAIQTGEGRGYLRANALYSSQIVEWRRLRDAGVLGDSTSNSFPARSSSRLSKE
QAEIARLKKQLAANEQKLATTQTALEIMGKAHALLEQISKSADSK
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1194 bp | 397 aa | 372 | 1565 | + | No |
Description : Second part of the transposase
ORF sequence :
PAEQGTSGNRPTEKAASCQRAEACNHPNRVGDHGKSTRALGTNLEERGFQIAAQDLLSHVYEQLVTGGVSTRQASSLAGVSRATMNRRQNAARAGRALRA
TAPRPAPANKLTAQEETTILATLNSERFVDQAPEQIHATLLSEGTYLCSVSSMYRLLRRAKQVAERRRQARHPARKVPELVADQPGEVFTWDITKLAGPT
KGVYFDAYVMIDIYSRYIVGCQVHTRESGELARDFIAGVFAKDQVPKVVHADRGTSMTSKPVAALLADLDVLKSHSRPKVSNDNPFSEAWFKTLKYLPTF
PQRFGSLVDARAFMDRFVQSYNGHHRHSGIGFHTPADVHFGMTGHVDDQRLAALQRAWDEHPERFGRRRLPKKLQMPEAAWINEPVKRLEGQEMQAG
Blast result :
ORF 3
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1478 bp | 492 aa | 88 | 1565 | + | Yes |
Chemistry : DDE
ORF sequence :
MSIDSIHPTPGNMPENGTMDSIDSRPEQPRRRSISPAQKLAYLDGYEQAIQTGEGRGYLRANALYSSQIVEWRRLRDAGVLGDSTSNSFPARSSSRLSKE
QAEIARLKKQLAANEQKLATTQTALEIMGKSTRALGTNLEERGFQIAAQDLLSHVYEQLVTGGVSTRQASSLAGVSRATMNRRQNAARAGRALRATAPRP
APANKLTAQEETTILATLNSERFVDQAPEQIHATLLSEGTYLCSVSSMYRLLRRAKQVAERRRQARHPARKVPELVADQPGEVFTWDITKLAGPTKGVYF
DAYVMIDIYSRYIVGCQVHTRESGELARDFIAGVFAKDQVPKVVHADRGTSMTSKPVAALLADLDVLKSHSRPKVSNDNPFSEAWFKTLKYLPTFPQRFG
SLVDARAFMDRFVQSYNGHHRHSGIGFHTPADVHFGMTGHVDDQRLAALQRAWDEHPERFGRRRLPKKLQMPEAAWINEPVKRLEGQEMQAG
Blast result :
Comments
ISAar44 shares 56% amino acid similarity with ISGur10.
There are four full-length copies of ISAar44 in the Arthrobacter arilaitensis genome.
The ISAar44 ends are not typical of this IS family. An additional fragment is also present at each extremity of the IRs (as for ISSde10).
The third ORF is a putative ORFAB transposase reconstructed in silico by a possible -1 frameshift.
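The coordinates listed in the Protein section can be checked against the proposed -1 frameshift with simple arithmetic. The sketch below assumes 1-based inclusive coordinates, that each DNA length includes the stop codon, and that a -1 frameshift re-reads one nucleotide (so the fused ORF spans one nucleotide fewer than a whole number of codons). The function names are illustrative only.

```python
def orf_length_bp(begin: int, end: int) -> int:
    """Length in nucleotides of an ORF given 1-based inclusive coordinates."""
    return end - begin + 1


def reading_frame(begin: int) -> int:
    """Reading frame (0, 1 or 2) of an ORF relative to position 1 of the IS."""
    return (begin - 1) % 3


# Coordinates as listed in this record.
ORF1 = (88, 525)     # first part of the transposase
ORF2 = (372, 1565)   # second part of the transposase
ORF3 = (88, 1565)    # fusion ORF reconstructed in silico

# Protein lengths, assuming the stop codon is counted in the DNA length.
orf1_aa = orf_length_bp(*ORF1) // 3 - 1
orf2_aa = orf_length_bp(*ORF2) // 3 - 1

# For the fusion, a -1 frameshift reads one nucleotide twice, so one extra
# nucleotide is effectively translated.
orf3_aa = (orf_length_bp(*ORF3) + 1) // 3 - 1

# A -1 shift from the ORF 1 frame should land in the ORF 2 frame.
shifted_frame = (reading_frame(ORF1[0]) - 1) % 3

print("ORF 1:", orf_length_bp(*ORF1), "bp,", orf1_aa, "aa")
print("ORF 2:", orf_length_bp(*ORF2), "bp,", orf2_aa, "aa")
print("ORF 3:", orf_length_bp(*ORF3), "bp,", orf3_aa, "aa")
print("Frames consistent with -1 shift:", shifted_frame == reading_frame(ORF2[0]))
```

Under these assumptions the computed lengths agree with the values in the ORF tables (438 bp / 145 aa, 1194 bp / 397 aa, 1478 bp / 492 aa), and ORF 2 lies in the frame reached by a -1 shift from ORF 1.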
References
[1] ISfinder annotation (2009).