ISAzs20
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_013854 | ND | Azospirillum sp. | Azospirillum sp. B510 plasmid pAB510b Azospirillum sp. B510 plasmid pAB510f Azospirillum sp. B510 Azospirillum sp. B510 plasmid pAB510c |
DNA section
IS Length : 2811 bp
Ends
IR Length : 16/22
IRL : GTACCCCTCCGCCGAGCGCCGCCTTGTGACCGAGCGGGGAAGCTGAATCA
IRR : GTACGCATCCGGTGAGTGACGCGGGTGCGGATCGAATCGTTCAGCGTGCT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ACGCTATAGC | GCCTAATTCA | 0 | |
CCCGGAGCCG | ACAAGGTG | CGCCGCTTCA | 8 |
TCGGGTGATC | GCTCTATCGG | 0 | |
CGCGTGCTTC | CCACCACCAG | 0 |
DNA sequence
GTACCCCTCCGCCGAGCGCCGCCTTGTGACCGAGCGGGGAAGCTGAATCATTGTGCCTGTCCCTATGGCTATCCCTATGGCAGTCCCTATGACTATCCAA
CGCGTCGAGGTGATCACGGGCCAGGAGCGGCGGCGGCAGTTCAGCGACGAGGAGAAGCTGCGGCTGGTCGAAGAGGCGTTCCAGCCGGGCGTCAAGGCGA
CCGAAGTCGCCCGGCGCCTGGGCGTGGACGTCAGCCTGCTGTACCGCTGGCGCCGCCAGTTCTTCGGTCAGCAGCCCCGGCTGCCCGCCTTCATGCCGAT
CACCGTCGCCACCGACGCTCCGGCACCGGAGGAGGTGGCAGAGCCGACAGCGGCGCCAGCGGCCCCACCAGCCGGTCTCATCGAGGTCGAATTCGCGACG
GCGCGCTTGCGCATCACCGGCCCGGTCGATCCGGCCCTGGTCGGCACGGTGATCGCCGCGCTGTCGGGACGGTCGGCATGATCCCGGTCCCCTCGGGCGT
TCGGGTCTGGCTGGCGGGCGGGGTCACCGACATGCGCTGCGGGATGAACTCTCTGGCGCTGAAGGTCCAGGAGGGTCTTGGCCGCGATCCCCATGCCGGC
GATTTATACGTCTTCCGTGGACGTCGCGGTGACATGGTGAAGTGCCTGTGGCATGACGGGCTCGGCATGTCGCTGTACGCCAAGAGGCTTGAACGGGGCC
GTTTCATCTGGCCCAGCCCGGCCAGCGGAGCCGTGGCCATTTCCGCGTCCCAGTTCGCGTATCTGCTCGACGCCATCGACTGGCGCAATCCGCAGCAGAC
TTGGAGACCGCGCTCGGCCGGATAGGCTGCGGCACAGTGAATCAACCCGCCGCTCTCCGGCGGGAAGAGCGAGGATCGGTGCGGGTTTCCGGGTACAATC
CGGCCCCATGGACGCCCTGCCCGACACCATCGACGCCCTGCGCGCCGCGCTGATCGAAGCGCGTGGCCGGGCTGCGTTGGCCGAGGCGGACGCCGCCCAG
GCGCGGGCCGAACGGTCCGGCGACCAGGCGCTGATCGCCACCCTGAAGCTCCAGATCGAGAAGCTCCAGCGCGACCTCTACGGCCGGCGCTCCGAACGGA
CCTCCCGACTGCTCGGCCAGCTCGAATTCCAGTTGGAGGAGGCGCAGGCAAGCGTCGGCGAGGACGATCTGGCGGCGGAACAGGTGGCCGAGGCGACCGG
CGCGGCCCGCATCACCCGCAAGGCGCCGTCGCGCAAGCCGTTCCCGGCGCACCTGCCGCGCGAGCGCGTCGTGATCCCGGCGCCGGCGGTGTGCCCATGC
TGCGGCTCGACCCGGCTGTGCAAGCTGGGTGAAAGCGTGACCGAGACGGCCGAGCGCATTCCCGCGCGGTGGAAAATCATCCAGACGGTGCGCGAGAAAT
TCTCCTGCCGGGACTGCGAGACGATCAGCCAGCCGCCGGCACCGTTCCACACCACGCCGCGCGGCTGGGCGGGGCCGAACCTGCTGGCCACGCTGCTGTT
CGAGAAGTTCGGCCAGCATCAGCCGCTGAACCGCCAGTGCGAGCGCTTCGCCAAGGAGGGCATGGAGATCAGCCTGTCCACCGCGGCCGACCAGGTGGGG
GCGGCCTGCGGCGTGCTGAAGCCGCTGCTCGACCGGCTCGCAGCGCATGTGCTGGCGGCGGAGCGGTTGCACGGCGAGCCCGAAGGGCCAGCGAAGCTAC
GACACGACCGTGCCGGTGCTGGCGAAGGGCAAGACCGACACCGGCCGCATCTGGGTCTATGTGCGCGACGATCGCCCCTTCGCGGGCGCGGCGGCACCGG
CGGCGCTGTTCCACTACTCCCGCGACCGTGGCGGCGGGCATCCCGAGGCACATCTGGCCGGTTGGGCCGGAGTGCTGCAGGCCGACGCCTATGCCGGATA
CAACCGCCTCTACGACGCGAGCCGCCAGCCGGAGCCCGTGGCCGAGGTCCTGTGTTGGGCGCACGCGCGTCGGAAATTCTTCGAACTCGCCGATATCGCC
GCCAACAAGCGGCGCGGCAAGGGGGCGCCCCCGATTTCTCCGCTGGCGCTGGAGGCGGTGCGGCGCATCGACCCGCTGTTCGACATCGAGCGCGAAGCCC
TCGGGCGCTCGGCAGCCGACCGTCTGGCGGTGCGCACAGAGCTGTCCAAGCCCCAGGTCGAGGAACTGGAAAACTGGATGCGAACGGCCCGGGCCGGGAT
GTCGAAGCACGCGCCGGTGGCCAAGGCGATGGACTACATGCTGACCCGCTGGGAAGGCTTCACCCGCTTCCTGCGTGACGGACGGGTCTGCCTGACCAAC
AATGCTGCCGAACGAGCGTTGCGTGGAATCGCCCTTGGCAGGAAGGCATGGCTATTCTGCGGATCGGATCGAGGCGGGCAGCGCGCCGCCGCCATGTACA
GCCTGATCGTGACGGCAAAGATGAACGACATCGATCCCCAGGCGTGGCTGGCCGACGTCCTGGCCCGCATCAACGATCTGCCGCAGACCAAGCTGCACGA
ACTGCTGCCCTGGGAATGGAAGCGGCTGCACGAGGCGACCACGGCGACCTGAGGAACCATGGCCAGAACCCGCAACCTCGTGACCATGGAGCGCGTCGCC
GAGATCCTCGGCGAGGACGTCGAATGGCTGATCGACATTGCCATCGAGCTGGAGCCGGAGGACGGTTGCCTGGCCGTGTTCGGCCCAGGCGAGCAGTGGT
TCCATGCCCTGACCGAAGATGGCGTCGAAAGCCTTAAGGAACTCATCCAAATCCACCGAGCAGCACGCTGAACGATTCGATCCGCACCCGCGTCACTCAC
CGGATGCGTAC
CGCGTCGAGGTGATCACGGGCCAGGAGCGGCGGCGGCAGTTCAGCGACGAGGAGAAGCTGCGGCTGGTCGAAGAGGCGTTCCAGCCGGGCGTCAAGGCGA
CCGAAGTCGCCCGGCGCCTGGGCGTGGACGTCAGCCTGCTGTACCGCTGGCGCCGCCAGTTCTTCGGTCAGCAGCCCCGGCTGCCCGCCTTCATGCCGAT
CACCGTCGCCACCGACGCTCCGGCACCGGAGGAGGTGGCAGAGCCGACAGCGGCGCCAGCGGCCCCACCAGCCGGTCTCATCGAGGTCGAATTCGCGACG
GCGCGCTTGCGCATCACCGGCCCGGTCGATCCGGCCCTGGTCGGCACGGTGATCGCCGCGCTGTCGGGACGGTCGGCATGATCCCGGTCCCCTCGGGCGT
TCGGGTCTGGCTGGCGGGCGGGGTCACCGACATGCGCTGCGGGATGAACTCTCTGGCGCTGAAGGTCCAGGAGGGTCTTGGCCGCGATCCCCATGCCGGC
GATTTATACGTCTTCCGTGGACGTCGCGGTGACATGGTGAAGTGCCTGTGGCATGACGGGCTCGGCATGTCGCTGTACGCCAAGAGGCTTGAACGGGGCC
GTTTCATCTGGCCCAGCCCGGCCAGCGGAGCCGTGGCCATTTCCGCGTCCCAGTTCGCGTATCTGCTCGACGCCATCGACTGGCGCAATCCGCAGCAGAC
TTGGAGACCGCGCTCGGCCGGATAGGCTGCGGCACAGTGAATCAACCCGCCGCTCTCCGGCGGGAAGAGCGAGGATCGGTGCGGGTTTCCGGGTACAATC
CGGCCCCATGGACGCCCTGCCCGACACCATCGACGCCCTGCGCGCCGCGCTGATCGAAGCGCGTGGCCGGGCTGCGTTGGCCGAGGCGGACGCCGCCCAG
GCGCGGGCCGAACGGTCCGGCGACCAGGCGCTGATCGCCACCCTGAAGCTCCAGATCGAGAAGCTCCAGCGCGACCTCTACGGCCGGCGCTCCGAACGGA
CCTCCCGACTGCTCGGCCAGCTCGAATTCCAGTTGGAGGAGGCGCAGGCAAGCGTCGGCGAGGACGATCTGGCGGCGGAACAGGTGGCCGAGGCGACCGG
CGCGGCCCGCATCACCCGCAAGGCGCCGTCGCGCAAGCCGTTCCCGGCGCACCTGCCGCGCGAGCGCGTCGTGATCCCGGCGCCGGCGGTGTGCCCATGC
TGCGGCTCGACCCGGCTGTGCAAGCTGGGTGAAAGCGTGACCGAGACGGCCGAGCGCATTCCCGCGCGGTGGAAAATCATCCAGACGGTGCGCGAGAAAT
TCTCCTGCCGGGACTGCGAGACGATCAGCCAGCCGCCGGCACCGTTCCACACCACGCCGCGCGGCTGGGCGGGGCCGAACCTGCTGGCCACGCTGCTGTT
CGAGAAGTTCGGCCAGCATCAGCCGCTGAACCGCCAGTGCGAGCGCTTCGCCAAGGAGGGCATGGAGATCAGCCTGTCCACCGCGGCCGACCAGGTGGGG
GCGGCCTGCGGCGTGCTGAAGCCGCTGCTCGACCGGCTCGCAGCGCATGTGCTGGCGGCGGAGCGGTTGCACGGCGAGCCCGAAGGGCCAGCGAAGCTAC
GACACGACCGTGCCGGTGCTGGCGAAGGGCAAGACCGACACCGGCCGCATCTGGGTCTATGTGCGCGACGATCGCCCCTTCGCGGGCGCGGCGGCACCGG
CGGCGCTGTTCCACTACTCCCGCGACCGTGGCGGCGGGCATCCCGAGGCACATCTGGCCGGTTGGGCCGGAGTGCTGCAGGCCGACGCCTATGCCGGATA
CAACCGCCTCTACGACGCGAGCCGCCAGCCGGAGCCCGTGGCCGAGGTCCTGTGTTGGGCGCACGCGCGTCGGAAATTCTTCGAACTCGCCGATATCGCC
GCCAACAAGCGGCGCGGCAAGGGGGCGCCCCCGATTTCTCCGCTGGCGCTGGAGGCGGTGCGGCGCATCGACCCGCTGTTCGACATCGAGCGCGAAGCCC
TCGGGCGCTCGGCAGCCGACCGTCTGGCGGTGCGCACAGAGCTGTCCAAGCCCCAGGTCGAGGAACTGGAAAACTGGATGCGAACGGCCCGGGCCGGGAT
GTCGAAGCACGCGCCGGTGGCCAAGGCGATGGACTACATGCTGACCCGCTGGGAAGGCTTCACCCGCTTCCTGCGTGACGGACGGGTCTGCCTGACCAAC
AATGCTGCCGAACGAGCGTTGCGTGGAATCGCCCTTGGCAGGAAGGCATGGCTATTCTGCGGATCGGATCGAGGCGGGCAGCGCGCCGCCGCCATGTACA
GCCTGATCGTGACGGCAAAGATGAACGACATCGATCCCCAGGCGTGGCTGGCCGACGTCCTGGCCCGCATCAACGATCTGCCGCAGACCAAGCTGCACGA
ACTGCTGCCCTGGGAATGGAAGCGGCTGCACGAGGCGACCACGGCGACCTGAGGAACCATGGCCAGAACCCGCAACCTCGTGACCATGGAGCGCGTCGCC
GAGATCCTCGGCGAGGACGTCGAATGGCTGATCGACATTGCCATCGAGCTGGAGCCGGAGGACGGTTGCCTGGCCGTGTTCGGCCCAGGCGAGCAGTGGT
TCCATGCCCTGACCGAAGATGGCGTCGAAAGCCTTAAGGAACTCATCCAAATCCACCGAGCAGCACGCTGAACGATTCGATCCGCACCCGCGTCACTCAC
CGGATGCGTAC
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
393 bp | 130 aa | 89 | 481 | + | No |
AG : IS66 TnpA
ORF sequence :
MTIQRVEVITGQERRRQFSDEEKLRLVEEAFQPGVKATEVARRLGVDVSLLYRWRRQFFGQQPRLPAFMPITVATDAPAPEEVAEPTAAPAAPPAGLIEV
EFATARLRITGPVDPALVGTVIAALSGRSA
EFATARLRITGPVDPALVGTVIAALSGRSA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
348 bp | 115 aa | 478 | 825 | + | No |
AG : IS66 TnpB
ORF sequence :
MIPVPSGVRVWLAGGVTDMRCGMNSLALKVQEGLGRDPHAGDLYVFRGRRGDMVKCLWHDGLGMSLYAKRLERGRFIWPSPASGAVAISASQFAYLLDAI
DWRNPQQTWRPRSAG
DWRNPQQTWRPRSAG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1645 bp | 548 aa | 908 | 2552 | + | Yes |
Chemistry : DDE
ORF sequence :
MDALPDTIDALRAALIEARGRAALAEADAAQARAERSGDQALIATLKLQIEKLQRDLYGRRSERTSRLLGQLEFQLEEAQASVGEDDLAAEQVAEATGAA
RITRKAPSRKPFPAHLPRERVVIPAPAVCPCCGSTRLCKLGESVTETAERIPARWKIIQTVREKFSCRDCETISQPPAPFHTTPRGWAGPNLLATLLFEK
FGQHQPLNRQCERFAKEGMEISLSTAADQVGAACGVLKPLLDRLAAHVLAAERLHGEPPKGQRSYDTTVPVLAKGKTDTGRIWVYVRDDRPFAGAAAPAA
LFHYSRDRGGGHPEAHLAGWAGVLQADAYAGYNRLYDASRQPEPVAEVLCWAHARRKFFELADIAANKRRGKGAPPISPLALEAVRRIDPLFDIEREALG
RSAADRLAVRTELSKPQVEELENWMRTARAGMSKHAPVAKAMDYMLTRWEGFTRFLRDGRVCLTNNAAERALRGIALGRKAWLFCGSDRGGQRAAAMYSL
IVTAKMNDIDPQAWLADVLARINDLPQTKLHELLPWEWKRLHEATTAT
RITRKAPSRKPFPAHLPRERVVIPAPAVCPCCGSTRLCKLGESVTETAERIPARWKIIQTVREKFSCRDCETISQPPAPFHTTPRGWAGPNLLATLLFEK
FGQHQPLNRQCERFAKEGMEISLSTAADQVGAACGVLKPLLDRLAAHVLAAERLHGEPPKGQRSYDTTVPVLAKGKTDTGRIWVYVRDDRPFAGAAAPAA
LFHYSRDRGGGHPEAHLAGWAGVLQADAYAGYNRLYDASRQPEPVAEVLCWAHARRKFFELADIAANKRRGKGAPPISPLALEAVRRIDPLFDIEREALG
RSAADRLAVRTELSKPQVEELENWMRTARAGMSKHAPVAKAMDYMLTRWEGFTRFLRDGRVCLTNNAAERALRGIALGRKAWLFCGSDRGGQRAAAMYSL
IVTAKMNDIDPQAWLADVLARINDLPQTKLHELLPWEWKRLHEATTAT
Blast result :
Comments
ISAzs20 is 84% (ORFC : the transposase) aa similar to ISAli10. The transposase is reconstructed in silico, there is a non programmed frameshift inside.
References
1] ISfinder annotation (2010).
2] Kaneko,T., Minamisawa,K., Isawa,T., Nakatsukasa,H., Mitsui,H., Kawaharada,Y., Nakamura,Y., Watanabe,A., Kawashima,K., Ono,A., Shimizu,Y., Takahashi,C., Minami,C., Fujishiro,T., Kohara,M., Katoh,M., Nakazaki,N., Nakayama,S., Yamada,M., Tabata,S. and Sato,S. (2010) DNA Res. 17 (1), 37-50.
2] Kaneko,T., Minamisawa,K., Isawa,T., Nakatsukasa,H., Mitsui,H., Kawaharada,Y., Nakamura,Y., Watanabe,A., Kawashima,K., Ono,A., Shimizu,Y., Takahashi,C., Minami,C., Fujishiro,T., Kohara,M., Katoh,M., Nakazaki,N., Nakayama,S., Yamada,M., Tabata,S. and Sato,S. (2010) DNA Res. 17 (1), 37-50.