ISAzo21
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_006513 | ND | Azoarcus sp. | Aromatoleum aromaticum EbN1 Azoarcus sp. EbN1 |
DNA section
IS Length : 2423 bp
Ends
IR Length : 23/26
IRL : GTAAGCGGTAACTGACCGGCGTTATTCCTGGGCCCGTGATCGACGTGGAC
IRR : GTAAGCGGTAACTGACGAGCGTTCTTGACTACTTCGAGATCCCGAAGAGA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ACGACTGCCC | GGATATTC | AGCATGCCTT | 8 |
GCCCACGGCC | GGCCGCAC | TCCTCGGCCG | 8 |
DNA sequence
GTAAGCGGTAACTGACCGGCGTTATTCCTGGGCCCGTGATCGACGTGGACAATTGCCTCCAGGCTCAGCAATGGAGGGATCGTTCATGGCAGGAGTGCTC
AAGCGCAATTGGCGCCGGCGCAGCCGAGAGGAGTGGCAGGAGGTTTTCGCCCGACACGGTTCGAGTGGGCTGAGCGTCACGGCATTCTGCGCGCGCGAGT
CGATCAGCGTCTCGAGCTTTCAGCGCTGGCGAGCGACGGTCGGTCCGGTGTGCGGTAAGGCAGTCGCTGGCGGGGCGACTCGGAAAGAGGCCTTCGTTGA
TCTGGGAGTGCTGGGGTCGGGCGGTACTTCGCGCTGGGAGTTGAAGCTTGATCTGGGTGACGGCGTCGTGCTGCATCTGGTGCGCGGCTGATGTTCTTTC
CCGAAGGCGCGGTGCGGGTGCATCTGTACGGGCGCCCGGTCGATATGCGCAAATCCTTCGACGGTTTGTATGCGCTCGCCCGCCACGGCGTGGGACAGGA
TCCGTTGTCGGGGCATCTGTTCGTGTTCATCAGCCGCCGCGCGACCCAGATGAAGGTCCTGTACTGGGATCGCAGCGGCTTTTGCATCTGGGCCAAGCGG
CTCGAGTCGGGCCGTTTCGTGTCGGATTGGTCGCGGGTGGCCACCCGCGAGATGGACTGGACGGGGCTGAAACTGCTGCTCGAGGGCATCGAGCCGGCAC
GCTTCAAAAAGCGCTTTGCGCTTCCAGAAAGTAACCGCAAACCCGTATGAATGCTGGCGTTTTTGGTAAAATGACGCCATGCCTCAGACGCCTTTGACGC
AGCTGCCGACCCGTGAAGAAGCCGCCCAGTGGACGGCCGATCAGGTCGTCGAACTGGCGGGCGCGCACTTGGCGCAGCAGCGCCAGTTCGAGGTCATCAA
AGGCGAATTCGAGGCGATGCGCCACCAGCTCGACTGGTTCCGTCGGCAGCTCTTCGGCCAGAAGAGCGAGAAGCGCATCGTGGACGCCAACCCACATCAG
ATGAGTCTGGGCGAGTTGCCGGTGCCCGGATCCTCGCCGCCGCCCCCCGCCCAGGACATTGCCGCCCATACCCGACGGGCCCGGACCAGCGACTGCGCCA
AGGGCGACGAGTCGGCGCTGTTCTTCGACGAGGCCCGGGTGCCGGTCGAGACGATCGAGGTGCCCAACCCCGAGGCCGAGGGGCTCGCTCCCGGTCAGTT
CGAGGTGATCGGTGAGAAGACGAGTTTCCGGCTCGCGCAGCGCCCGGGCAGCTACGTCATCCTCAAATACGTGCGCCCGGTGATCAAGCTTCGCAACACG
CAGCTCATCTCCTGCCCCGCGGCGCCCAGGGGTGTCATCGAGGGTAGCCGCGCTGATGTGAGCTTTGTCGTCGGGCTCATCACGGACAAGTTCTGCTATC
ACCAGCCGCTCTACCGCCAGCACCAGCGACTGGGGGACAATGGCATCAGGGTGTCGCGCCCGTGGCTCACGCAGCTCACCCATAGCGCGCTCGCGCTGCT
CGAACCCGTCTTCACCGCACAGCTCGGCTCGATTCGGCTGTCGCGGGTCAAGGCGATGGACGAGACCCCGATCAAGGCCGGCCGTGCTGGCCCCGGCAAG
ATGAAAGGCGGCTATTTCTGGCCGGTCTATGGCGAACGTGACGAGATCTGTTTCGTCTATCACGAGAGCCGCAAGGCGCAGCACATCGAGCAGATCCTGG
GCACTGACCCGCCCCCCGAGGGGGCGGTGCTCCTCACCGATGGCTATGGAGTCTACGAACGCTATGCCGAGAAGTGCGGACTCACGCACGCTCAATGCTG
GGCGCATTCGAGGAGGAAGTTCTTCGACGCCCAGTCGGTCGAGCCCGAGCGTGCGGGCCGGGCGCTGGAGATGATCGGCAAGCTCTATGCAGTCGAGAAA
CGCATCCGCGAGGCCACGCTCGTCGGCGAGGCGAGCCGGGCGTACCGGATCGAGCATGCACAGCCCGTGGTCCACGAGTTCTTCGCCTGGGTCGATGCCC
AGTTCGACACCCACGGCTTGCTGCCCAGTTCGCCGCTCACCACCGCCATGGCTTATGTGCGGGAGCGCCGTGCCGCCCTCGAGGTGTACCTGCGCGACCC
CGAAGTGTCGATCGACACAAACCATCTCGAGCGGGCGTTGCGCGTGGTGCCCATGGGACGTCGCAACTGGCTCTTTTGCTGGACCGAGGTCGGCGCCAAA
TACGTCGGCATCGCGCAGAGCCTGATCGCCACCTGCCGGCTGCACGACATCGATCCGTACGAATATTTGGTCGATGTGCTGCAGCGTGTCGGCCAACACC
CCGCTGCCGACGTCGCGCAACTCACCCCCCGTCTGTGGAAGCAGCACTTCGCTGCCAACCCGCTGCGATCCGATCTCTTCGGGATCTCGAAGTAGTCAAG
AACGCTCGTCAGTTACCGCTTAC
AAGCGCAATTGGCGCCGGCGCAGCCGAGAGGAGTGGCAGGAGGTTTTCGCCCGACACGGTTCGAGTGGGCTGAGCGTCACGGCATTCTGCGCGCGCGAGT
CGATCAGCGTCTCGAGCTTTCAGCGCTGGCGAGCGACGGTCGGTCCGGTGTGCGGTAAGGCAGTCGCTGGCGGGGCGACTCGGAAAGAGGCCTTCGTTGA
TCTGGGAGTGCTGGGGTCGGGCGGTACTTCGCGCTGGGAGTTGAAGCTTGATCTGGGTGACGGCGTCGTGCTGCATCTGGTGCGCGGCTGATGTTCTTTC
CCGAAGGCGCGGTGCGGGTGCATCTGTACGGGCGCCCGGTCGATATGCGCAAATCCTTCGACGGTTTGTATGCGCTCGCCCGCCACGGCGTGGGACAGGA
TCCGTTGTCGGGGCATCTGTTCGTGTTCATCAGCCGCCGCGCGACCCAGATGAAGGTCCTGTACTGGGATCGCAGCGGCTTTTGCATCTGGGCCAAGCGG
CTCGAGTCGGGCCGTTTCGTGTCGGATTGGTCGCGGGTGGCCACCCGCGAGATGGACTGGACGGGGCTGAAACTGCTGCTCGAGGGCATCGAGCCGGCAC
GCTTCAAAAAGCGCTTTGCGCTTCCAGAAAGTAACCGCAAACCCGTATGAATGCTGGCGTTTTTGGTAAAATGACGCCATGCCTCAGACGCCTTTGACGC
AGCTGCCGACCCGTGAAGAAGCCGCCCAGTGGACGGCCGATCAGGTCGTCGAACTGGCGGGCGCGCACTTGGCGCAGCAGCGCCAGTTCGAGGTCATCAA
AGGCGAATTCGAGGCGATGCGCCACCAGCTCGACTGGTTCCGTCGGCAGCTCTTCGGCCAGAAGAGCGAGAAGCGCATCGTGGACGCCAACCCACATCAG
ATGAGTCTGGGCGAGTTGCCGGTGCCCGGATCCTCGCCGCCGCCCCCCGCCCAGGACATTGCCGCCCATACCCGACGGGCCCGGACCAGCGACTGCGCCA
AGGGCGACGAGTCGGCGCTGTTCTTCGACGAGGCCCGGGTGCCGGTCGAGACGATCGAGGTGCCCAACCCCGAGGCCGAGGGGCTCGCTCCCGGTCAGTT
CGAGGTGATCGGTGAGAAGACGAGTTTCCGGCTCGCGCAGCGCCCGGGCAGCTACGTCATCCTCAAATACGTGCGCCCGGTGATCAAGCTTCGCAACACG
CAGCTCATCTCCTGCCCCGCGGCGCCCAGGGGTGTCATCGAGGGTAGCCGCGCTGATGTGAGCTTTGTCGTCGGGCTCATCACGGACAAGTTCTGCTATC
ACCAGCCGCTCTACCGCCAGCACCAGCGACTGGGGGACAATGGCATCAGGGTGTCGCGCCCGTGGCTCACGCAGCTCACCCATAGCGCGCTCGCGCTGCT
CGAACCCGTCTTCACCGCACAGCTCGGCTCGATTCGGCTGTCGCGGGTCAAGGCGATGGACGAGACCCCGATCAAGGCCGGCCGTGCTGGCCCCGGCAAG
ATGAAAGGCGGCTATTTCTGGCCGGTCTATGGCGAACGTGACGAGATCTGTTTCGTCTATCACGAGAGCCGCAAGGCGCAGCACATCGAGCAGATCCTGG
GCACTGACCCGCCCCCCGAGGGGGCGGTGCTCCTCACCGATGGCTATGGAGTCTACGAACGCTATGCCGAGAAGTGCGGACTCACGCACGCTCAATGCTG
GGCGCATTCGAGGAGGAAGTTCTTCGACGCCCAGTCGGTCGAGCCCGAGCGTGCGGGCCGGGCGCTGGAGATGATCGGCAAGCTCTATGCAGTCGAGAAA
CGCATCCGCGAGGCCACGCTCGTCGGCGAGGCGAGCCGGGCGTACCGGATCGAGCATGCACAGCCCGTGGTCCACGAGTTCTTCGCCTGGGTCGATGCCC
AGTTCGACACCCACGGCTTGCTGCCCAGTTCGCCGCTCACCACCGCCATGGCTTATGTGCGGGAGCGCCGTGCCGCCCTCGAGGTGTACCTGCGCGACCC
CGAAGTGTCGATCGACACAAACCATCTCGAGCGGGCGTTGCGCGTGGTGCCCATGGGACGTCGCAACTGGCTCTTTTGCTGGACCGAGGTCGGCGCCAAA
TACGTCGGCATCGCGCAGAGCCTGATCGCCACCTGCCGGCTGCACGACATCGATCCGTACGAATATTTGGTCGATGTGCTGCAGCGTGTCGGCCAACACC
CCGCTGCCGACGTCGCGCAACTCACCCCCCGTCTGTGGAAGCAGCACTTCGCTGCCAACCCGCTGCGATCCGATCTCTTCGGGATCTCGAAGTAGTCAAG
AACGCTCGTCAGTTACCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
306 bp | 101 aa | 86 | 391 | + | No |
AG : IS66 TnpA
ORF sequence :
MAGVLKRNWRRRSREEWQEVFARHGSSGLSVTAFCARESISVSSFQRWRATVGPVCGKAVAGGATRKEAFVDLGVLGSGGTSRWELKLDLGDGVVLHLVR
G
G
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
360 bp | 119 aa | 391 | 750 | + | No |
AG : IS66 TnpB
ORF sequence :
MFFPEGAVRVHLYGRPVDMRKSFDGLYALARHGVGQDPLSGHLFVFISRRATQMKVLYWDRSGFCIWAKRLESGRFVSDWSRVATREMDWTGLKLLLEGI
EPARFKKRFALPESNRKPV
EPARFKKRFALPESNRKPV
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1617 bp | 538 aa | 779 | 2395 | + | No |
Chemistry : DDE
ORF sequence :
MPQTPLTQLPTREEAAQWTADQVVELAGAHLAQQRQFEVIKGEFEAMRHQLDWFRRQLFGQKSEKRIVDANPHQMSLGELPVPGSSPPPPAQDIAAHTRR
ARTSDCAKGDESALFFDEARVPVETIEVPNPEAEGLAPGQFEVIGEKTSFRLAQRPGSYVILKYVRPVIKLRNTQLISCPAAPRGVIEGSRADVSFVVGL
ITDKFCYHQPLYRQHQRLGDNGIRVSRPWLTQLTHSALALLEPVFTAQLGSIRLSRVKAMDETPIKAGRAGPGKMKGGYFWPVYGERDEICFVYHESRKA
QHIEQILGTDPPPEGAVLLTDGYGVYERYAEKCGLTHAQCWAHSRRKFFDAQSVEPERAGRALEMIGKLYAVEKRIREATLVGEASRAYRIEHAQPVVHE
FFAWVDAQFDTHGLLPSSPLTTAMAYVRERRAALEVYLRDPEVSIDTNHLERALRVVPMGRRNWLFCWTEVGAKYVGIAQSLIATCRLHDIDPYEYLVDV
LQRVGQHPAADVAQLTPRLWKQHFAANPLRSDLFGISK
ARTSDCAKGDESALFFDEARVPVETIEVPNPEAEGLAPGQFEVIGEKTSFRLAQRPGSYVILKYVRPVIKLRNTQLISCPAAPRGVIEGSRADVSFVVGL
ITDKFCYHQPLYRQHQRLGDNGIRVSRPWLTQLTHSALALLEPVFTAQLGSIRLSRVKAMDETPIKAGRAGPGKMKGGYFWPVYGERDEICFVYHESRKA
QHIEQILGTDPPPEGAVLLTDGYGVYERYAEKCGLTHAQCWAHSRRKFFDAQSVEPERAGRALEMIGKLYAVEKRIREATLVGEASRAYRIEHAQPVVHE
FFAWVDAQFDTHGLLPSSPLTTAMAYVRERRAALEVYLRDPEVSIDTNHLERALRVVPMGRRNWLFCWTEVGAKYVGIAQSLIATCRLHDIDPYEYLVDV
LQRVGQHPAADVAQLTPRLWKQHFAANPLRSDLFGISK
Blast result :
Comments
ISAzo21 is 93%(orfA), 97%(orfB), 96%(orfC) aa similar to ISAzo19.
References
1] Kuhner,S., Wohlbrand,L., Fritz,I., Wruck,W., Hultschig,C., Hufnagel,P., Kube,M., Reinhardt,R. and Rabus,R. (2005) J. Bacteriol. 187 (4), 1493-1503.
2] Rabus,R., Kube,M., Heider,J., Beck,A., Heitmann,K., Widdel,F. and Reinhardt,R. (2005) Arch. Microbiol. 183 (1), 27-36.
3] PROSCIENCE (2004) Direct Submission GenBank.
4] ISfinder annotation (2008)
2] Rabus,R., Kube,M., Heider,J., Beck,A., Heitmann,K., Widdel,F. and Reinhardt,R. (2005) Arch. Microbiol. 183 (1), 27-36.
3] PROSCIENCE (2004) Direct Submission GenBank.
4] ISfinder annotation (2008)