ISAzo19
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_006513 | ND | Azoarcus sp. | Aromatoleum aromaticum EbN1 Azoarcus sp. EbN1 Azoarcus sp. EbN1 plasmid 2 |
DNA section
IS Length : 2423 bp
Ends
IR Length : 23/26
IRL : GTAAGCGGTAACTGACCGGCGTTATTCCAGGGCACGTGATCGACGTGGAC
IRR : GTAAGCGGTAACTGACGAGCGTTCTTGACTACTTCGAGATCCCGAAGAGA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCGATGCCTT | CGTGCAGG | TGCTGCAAGT | 8 |
CCTTGACCTC | GGTCCGGC | GGCTGCGGAT | 8 |
AAGTTGATAC | GGGCTGCC | GATTGAAAGT | 8 |
DNA sequence
GTAAGCGGTAACTGACCGGCGTTATTCCAGGGCACGTGATCGACGTGGACAATTGCCTCCAGGGTCAACAATGGAGGGATCGTTCATGGCAGGGGTGCTC
AAGCGCAAATGGCGCCGGCGCAGCCGAGAGGAGTGGCAGGAGGTGTTCGCCCGCCACGGTTCGAGCGGGCTGAGCGTCACGGCATTCTGCGCGCACGAGT
CGATCAGCGTCTCGAGCTTTCAGCGCTGGCGTGCGATCGTCGGGCCGGTGTGCGGCAAGGCCGTCGCGGTGGGGCCGGCTCGGCAAGAGGCGTTCGTCGA
TCTGGGAGTGCTGGGTTCGGGCGGCGCTTCGCGCTGGGAGTTGAAGCTTGACCTGGGTGACGGCGTGGTGCTGCACCTGGTGCGCGGCTGATGTTCTTTC
CCGAAGGCGCGGTGCGGGTGCATCTGTACGGGCGCCCGGTCGACATGCGCAAATCCTTCGACGGCTTGTATGCGCTGGCGCGCCACGGCGTGGGACAGGA
TCCGTTGTCGGGGCATCTGTTCGTGTTCATCAACCGGCGCGCGACCCAGATGAAGGTGTTGTACTGGGACCGCAGCGGGTTTTGCATCTGGGCCAAGCGG
CTCGAGTCGGGTCGCTTCGTGTCGGACTGGTCGCGGGCGACGAGCCGCGAGATGGACTGGACGGGGCTGAAGCTGTTGCTCGAAGGCATCGAGCCGGCAC
GCTTCAAAAAGCGCTTCGCGCTGCCCGCAAGTCACCGCAAACCCGCATGAATGCTGGCTTTTTTGGTAAAATGACGCCATGCCTCCGACGCCTTTGACGC
AGCTGCCGACCCGTGAAGAAGCCGCCCGGTGGACGGCCGATCAGGTCGTCGAACTGGCGGGCGCGCATCTGGCGCAGCAGCGCCAGTTCGAGGTCATCAA
AGGCGAATGCGAGGCGATGCGCCACCAGCTCGACTGGTTCCGGCGCCAGCTCTTCGGGGCGAAGAGCGAGAAGCGCCTCGTGGACGCCAACCCGCATCAG
ATGAGCCTGGGCGAGTTGCCGGTGCCCGAATCCTCGCCGCCGCCCCCCGGTCAGGACATTGCCGCCCACACCCGCCGGGCCCGCACCAGCGACTGCGCCA
AGGGCGACGCATCGGCGCTGTTCTTCGACGAGGCCCGCGTGCCGGTCGAGACCATCGAGGTGCCCAACCCCGAGGCCGAGGGGCTCGCCCCCGACCAGTT
CGAGGTGATCGGCGAAAAGGTCAGTCACCGCCTGGCGCAGCGCCCGGGCAGCTACGTGATCCTCAAGTACGTGCGTCCGGTGATCAAGCGCCTCGACACC
CAGGCGATTTCCTGCCCGGCGGCACCCGGGGGGGTGATTCGCGGCAGCCGCGCCGATGTGAGCTTCGTCGCCGGGCTCATCACCGACAAGTTCTGCTACC
ACCAGCCGCTCTACCGCCAGCACCAGCGGCTCGGCGACAATGGCATCAGGGTGTCGCGCCCGTGGCTCACGCAGCTCACCCACGGGGCGCTCGCGCTGCT
CGAACCGATCTTTACGGCCCAACTCGACTCGATCCGCGCGAGCCGCATCAAGGCGATGGACGAGACCCCGATCAAGGCCGGCCGTGCGGGCCCCGGCAAG
ATGAAGGGCGGCTACTTCTGGCCGGTCTATGGCGAACGTGACGAGATCTGTTTCGTCTATCGCGAGAGCCGCAAGGCGCAGCACATCGAGCAGATCCTGG
GCACCGACCCGCCGCCCGAGGGGGCGGTGCTGCTCACCGATGGCTATGGCGCCTACGAACGCTATGCCGAGAAGTGCGGGCTCACGCACGCGCAATGCTG
GGCGCACTCGAGGAGGAAGTTCTTCGACGCCCAGTCGGTCGAGCCCGAGCGCGCGGGCCGGGCGCTGGAGATGATCGGCAAGCTCTATGCGGTCGAAAAA
CGCATCCGCGAGGCCACGCTCGTCGGCGAGGCGTGCCGGGCGTACCGGATCGAGCATGCACAGCCCGTGGTCCACGAGTTCTTCGCCTGGGTCGATGCCC
AGTTCGACACCCACGGCCTGTTGCCCAGCTCGCCGCTCACCACCGCCATGGCTTATGTGCGGGAGCGCCGGGCCGCCCTCGAGGTGTACCTGCGCGATCC
TGAAGTGGCGATCGACACAAACCATCTCGAGCGGGCGCTACGCGTGGTGCCGATGGGTCGTCGCAACTGGCTCTTTTGCTGGACCGAGGTCGGCGCCAAA
TACGTCGGTATCGCGCAGAGCCTGATCGCCACCTGCCGGCTGCACGACATCGATCCGTACGACTATCTGGTCGATGTGCTGCAGCGTGTCGGCCAACACC
CCGCTGCCGACGTCGCGCAACTCACCCCCCGTCTGTGGAAGCAGCACTTCGCTGCCAACCCGCTGCGATCCGATCTCTTCGGGATCTCGAAGTAGTCAAG
AACGCTCGTCAGTTACCGCTTAC
AAGCGCAAATGGCGCCGGCGCAGCCGAGAGGAGTGGCAGGAGGTGTTCGCCCGCCACGGTTCGAGCGGGCTGAGCGTCACGGCATTCTGCGCGCACGAGT
CGATCAGCGTCTCGAGCTTTCAGCGCTGGCGTGCGATCGTCGGGCCGGTGTGCGGCAAGGCCGTCGCGGTGGGGCCGGCTCGGCAAGAGGCGTTCGTCGA
TCTGGGAGTGCTGGGTTCGGGCGGCGCTTCGCGCTGGGAGTTGAAGCTTGACCTGGGTGACGGCGTGGTGCTGCACCTGGTGCGCGGCTGATGTTCTTTC
CCGAAGGCGCGGTGCGGGTGCATCTGTACGGGCGCCCGGTCGACATGCGCAAATCCTTCGACGGCTTGTATGCGCTGGCGCGCCACGGCGTGGGACAGGA
TCCGTTGTCGGGGCATCTGTTCGTGTTCATCAACCGGCGCGCGACCCAGATGAAGGTGTTGTACTGGGACCGCAGCGGGTTTTGCATCTGGGCCAAGCGG
CTCGAGTCGGGTCGCTTCGTGTCGGACTGGTCGCGGGCGACGAGCCGCGAGATGGACTGGACGGGGCTGAAGCTGTTGCTCGAAGGCATCGAGCCGGCAC
GCTTCAAAAAGCGCTTCGCGCTGCCCGCAAGTCACCGCAAACCCGCATGAATGCTGGCTTTTTTGGTAAAATGACGCCATGCCTCCGACGCCTTTGACGC
AGCTGCCGACCCGTGAAGAAGCCGCCCGGTGGACGGCCGATCAGGTCGTCGAACTGGCGGGCGCGCATCTGGCGCAGCAGCGCCAGTTCGAGGTCATCAA
AGGCGAATGCGAGGCGATGCGCCACCAGCTCGACTGGTTCCGGCGCCAGCTCTTCGGGGCGAAGAGCGAGAAGCGCCTCGTGGACGCCAACCCGCATCAG
ATGAGCCTGGGCGAGTTGCCGGTGCCCGAATCCTCGCCGCCGCCCCCCGGTCAGGACATTGCCGCCCACACCCGCCGGGCCCGCACCAGCGACTGCGCCA
AGGGCGACGCATCGGCGCTGTTCTTCGACGAGGCCCGCGTGCCGGTCGAGACCATCGAGGTGCCCAACCCCGAGGCCGAGGGGCTCGCCCCCGACCAGTT
CGAGGTGATCGGCGAAAAGGTCAGTCACCGCCTGGCGCAGCGCCCGGGCAGCTACGTGATCCTCAAGTACGTGCGTCCGGTGATCAAGCGCCTCGACACC
CAGGCGATTTCCTGCCCGGCGGCACCCGGGGGGGTGATTCGCGGCAGCCGCGCCGATGTGAGCTTCGTCGCCGGGCTCATCACCGACAAGTTCTGCTACC
ACCAGCCGCTCTACCGCCAGCACCAGCGGCTCGGCGACAATGGCATCAGGGTGTCGCGCCCGTGGCTCACGCAGCTCACCCACGGGGCGCTCGCGCTGCT
CGAACCGATCTTTACGGCCCAACTCGACTCGATCCGCGCGAGCCGCATCAAGGCGATGGACGAGACCCCGATCAAGGCCGGCCGTGCGGGCCCCGGCAAG
ATGAAGGGCGGCTACTTCTGGCCGGTCTATGGCGAACGTGACGAGATCTGTTTCGTCTATCGCGAGAGCCGCAAGGCGCAGCACATCGAGCAGATCCTGG
GCACCGACCCGCCGCCCGAGGGGGCGGTGCTGCTCACCGATGGCTATGGCGCCTACGAACGCTATGCCGAGAAGTGCGGGCTCACGCACGCGCAATGCTG
GGCGCACTCGAGGAGGAAGTTCTTCGACGCCCAGTCGGTCGAGCCCGAGCGCGCGGGCCGGGCGCTGGAGATGATCGGCAAGCTCTATGCGGTCGAAAAA
CGCATCCGCGAGGCCACGCTCGTCGGCGAGGCGTGCCGGGCGTACCGGATCGAGCATGCACAGCCCGTGGTCCACGAGTTCTTCGCCTGGGTCGATGCCC
AGTTCGACACCCACGGCCTGTTGCCCAGCTCGCCGCTCACCACCGCCATGGCTTATGTGCGGGAGCGCCGGGCCGCCCTCGAGGTGTACCTGCGCGATCC
TGAAGTGGCGATCGACACAAACCATCTCGAGCGGGCGCTACGCGTGGTGCCGATGGGTCGTCGCAACTGGCTCTTTTGCTGGACCGAGGTCGGCGCCAAA
TACGTCGGTATCGCGCAGAGCCTGATCGCCACCTGCCGGCTGCACGACATCGATCCGTACGACTATCTGGTCGATGTGCTGCAGCGTGTCGGCCAACACC
CCGCTGCCGACGTCGCGCAACTCACCCCCCGTCTGTGGAAGCAGCACTTCGCTGCCAACCCGCTGCGATCCGATCTCTTCGGGATCTCGAAGTAGTCAAG
AACGCTCGTCAGTTACCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
306 bp | 101 aa | 86 | 391 | + | No |
AG : IS66 TnpA
ORF sequence :
MAGVLKRKWRRRSREEWQEVFARHGSSGLSVTAFCAHESISVSSFQRWRAIVGPVCGKAVAVGPARQEAFVDLGVLGSGGASRWELKLDLGDGVVLHLVR
G
G
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
360 bp | 119 aa | 391 | 750 | + | No |
AG : IS66 TnpB
ORF sequence :
MFFPEGAVRVHLYGRPVDMRKSFDGLYALARHGVGQDPLSGHLFVFINRRATQMKVLYWDRSGFCIWAKRLESGRFVSDWSRATSREMDWTGLKLLLEGI
EPARFKKRFALPASHRKPA
EPARFKKRFALPASHRKPA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1617 bp | 538 aa | 779 | 2395 | + | No |
Chemistry : DDE
ORF sequence :
MPPTPLTQLPTREEAARWTADQVVELAGAHLAQQRQFEVIKGECEAMRHQLDWFRRQLFGAKSEKRLVDANPHQMSLGELPVPESSPPPPGQDIAAHTRR
ARTSDCAKGDASALFFDEARVPVETIEVPNPEAEGLAPDQFEVIGEKVSHRLAQRPGSYVILKYVRPVIKRLDTQAISCPAAPGGVIRGSRADVSFVAGL
ITDKFCYHQPLYRQHQRLGDNGIRVSRPWLTQLTHGALALLEPIFTAQLDSIRASRIKAMDETPIKAGRAGPGKMKGGYFWPVYGERDEICFVYRESRKA
QHIEQILGTDPPPEGAVLLTDGYGAYERYAEKCGLTHAQCWAHSRRKFFDAQSVEPERAGRALEMIGKLYAVEKRIREATLVGEACRAYRIEHAQPVVHE
FFAWVDAQFDTHGLLPSSPLTTAMAYVRERRAALEVYLRDPEVAIDTNHLERALRVVPMGRRNWLFCWTEVGAKYVGIAQSLIATCRLHDIDPYDYLVDV
LQRVGQHPAADVAQLTPRLWKQHFAANPLRSDLFGISK
ARTSDCAKGDASALFFDEARVPVETIEVPNPEAEGLAPDQFEVIGEKVSHRLAQRPGSYVILKYVRPVIKRLDTQAISCPAAPGGVIRGSRADVSFVAGL
ITDKFCYHQPLYRQHQRLGDNGIRVSRPWLTQLTHGALALLEPIFTAQLDSIRASRIKAMDETPIKAGRAGPGKMKGGYFWPVYGERDEICFVYRESRKA
QHIEQILGTDPPPEGAVLLTDGYGAYERYAEKCGLTHAQCWAHSRRKFFDAQSVEPERAGRALEMIGKLYAVEKRIREATLVGEACRAYRIEHAQPVVHE
FFAWVDAQFDTHGLLPSSPLTTAMAYVRERRAALEVYLRDPEVAIDTNHLERALRVVPMGRRNWLFCWTEVGAKYVGIAQSLIATCRLHDIDPYDYLVDV
LQRVGQHPAADVAQLTPRLWKQHFAANPLRSDLFGISK
Blast result :
Comments
ISAzo19 is 56%(orfA), 79%(orfB), 73%(orfC) aa similar to ISCARN15.
References
1] Kuhner,S., Wohlbrand,L., Fritz,I., Wruck,W., Hultschig,C., Hufnagel,P., Kube,M., Reinhardt,R. and Rabus,R. (2005) J. Bacteriol. 187 (4), 1493-1503.
2] Rabus,R., Kube,M., Heider,J., Beck,A., Heitmann,K., Widdel,F. and Reinhardt,R. (2005) Arch. Microbiol. 183 (1), 27-36.
3] PROSCIENCE (2004) Direct Submission GenBank.
4] ISfinder annotation (2008)
2] Rabus,R., Kube,M., Heider,J., Beck,A., Heitmann,K., Widdel,F. and Reinhardt,R. (2005) Arch. Microbiol. 183 (1), 27-36.
3] PROSCIENCE (2004) Direct Submission GenBank.
4] ISfinder annotation (2008)