ISAzo15
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_006513 | ND | Azoarcus sp. | Aromatoleum aromaticum EbN1 Azoarcus sp. EbN1 |
DNA section
IS Length : 2441 bp
Ends
IR Length : 20/21
IRL : GTAAGCGTCCGCCGCTCCCACCTGTAACAGCCCCCCGGTTTTGCCGACAC
IRR : GTAAGCGTCCGCCGCTTCCACGCCCTCTTACGCTTGCGCAGACTCTTTCT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGCTGTAGAC | GATCATCG | AGCCCTCCTT | 8 |
CTCACGCTAC | GACCTATC | TGCGTAACCG | 8 |
TGAGGGTATT | ACTGGTAG | TGGTAGCGAC | 8 |
GCAGCAACGA | CCAGTACA | AGGTCGCGAT | 8 |
DNA sequence
GTAAGCGTCCGCCGCTCCCACCTGTAACAGCCCCCCGGTTTTGCCGACACTCCTGATTTTCACCCGGAGGTCGTGATGAGCACACAATCGGAGCGCGTTG
CGCGCTGGCGCGAGCACGTGGAGCGGTGGCGCCGCAGCGGTCAGACGCAGGCGGCCTACAGCGCCGCGCATGGCGTGAGCAAGAAGTCACTGGGGTACTG
GATTCGGCGGTCGCGCCACGAGTCAGCGCGCGAGGCGGATTCGGTCCTGACGCTGGTGGCGGCGCGTCCTGTCGGCGTTGCGCCGCCACGGCAAGCGGGG
GATGTGCTGTCGCTGTGCAGCCCGTCGGGCTGGCGCCTGCAGTTCGGTGCGCTGCCGCCGGCGCCGTGGCTCGCCGAGGTGCTCGCGCACGGGGCGACGT
GATGATTGCGCCGGAGCAGATCTGGCTCGCGGTCGAGCCGATCGACATGCGTCTGGGCATCGACGGCCTGTCGGCACGGCTGCAGAACAGCCTGGGCCGC
GCGCCGTGCGATGGCAGTGCGTACGCCTTCATCAACCGGGCCCGCACGCGCGTGAAGGTGCTGCGGTGGGATGGCACCGGAGTGTGGCTGAGTCAGCGGC
GTCTGCATCGGGGCAGCTTCAGCTGGCCCGCTGCCGACACGGCGGTGTTCGCGCTGTCGGCCGAGCAGTGGCAGTGGCTCGTGGCGGGGGTCGAGTGGCA
GCGCCTGAGTGCCGCCGCACCGGCTCACTGGCGGGTGTGACCGGTGTGCCGGCAGCGTCGACGATCCGCCACGTAAAAATTCCTGAAACCCTTGTGGATG
CTGGGATTCAGCGCCTTCATCGGGTATAATTCCGGCATGGATTTCGCCGCCGAACTGACCGCTTTCGACCTGCCGCCCGCGCTCGCGCAGCAGGTTCAGC
GGTGGGTCGCGCAGGCGGCCGACGTGGCCCGCCTGGAAGCCGAGCTCAAGCTGAGCAAGCTCAAGATCGAGGCCCTGGTCCACGAGATCGCCACCCTCAA
GCGCCTGCGCTTTGGCGCGCGCAGCGAGACGCTGCCGGCGGGCATGAAGGACTTGTTCGACGAGACGCTCGCGGCCGATCTGGCCGCGTGCGAAGCCCGG
CTCGAGGCGCTGCGCGACGCCGCGGCATCCGAAGCCGAAGCGTCTCCGAAGGCCCCGCCCGAGCGCCCCCGCGCCGGCCGCCCGCCGTTGCCGGCACACC
TCGAGCGCATCGAACACCGCCACGAGCCCGAATCCTGCAGCTGCGCCGCGTGCGGGCAGGATCTGGTCAAGATCGGCGAGGACGTCTCCGAGCAGCTCGA
CATCATCCCGGCCAAGTTCTTCGTGCATCGCCACATCCGTCCGCAGTACGCCTGCCGGCACTGCGAGACGGTCTCGACCGCCCCCGTGCCGGCGGCGGTC
ATCGACGGCGGCCTGGCCGCGCCGGGGCTGCTGGCGTGGGTGACGGTGAGCAAGTTCGTCGATCATCTGCCGCTGTACCGGCTCGAACAGATTGCCCGGC
GCAGCGAGGTGGCGCTACCGCGCTCGACACAGTCCGAGTGGATCGGGCGTATCGGGGTGGCGCTGTCGCCCCTCTACGCTCGACTGGTCGAGCATCTGTT
GGCGGGTACGGTGCTGCATGCGGACGAAACCCCCGTCGAGCAACTCGATCCGGGGCGCGGCAAGACGAAGCGGGCGTATCTGTGGGCCTACCGCAGCAAT
ACGCTCGGCGCCGACCCGCCCATCATCCTCTTCGATTACCAGCCCGGGCGCGGCGGGCAATATCCCCAGGCCTTCCTGAAAGGCTGGAAGGGCATGCTCA
TGGTCGATGACTACGCGGGGTACAAGGCGCTGCTGGGGGGCGACATCGGCGAGTTGGCCTGCATGGCCCATGCAAGACGCAAGTACTTCGAACTGCATCA
GGCCAACAAGAGCCCGGTGGCGGCCGAGGCGTTGCGGCGCATCGGCGAATTGTATGCGCTCGAAGAGCAGGCGCGCGACGTCTCGATCGAGGCGCGCGCC
GAACTGCGCGCGCAGTACGCCCGCCCGCGTCTGGAGGCGATGTACCTGTGGCTCGTGCAGACCCGCAAGACCGTGGCCGATGGCGCGGCGCTGGCGCGCG
CCATCGACTACAGCTTGAAGCGCTGGCCGGCGCTCGCGCGTTACGCGAGCCGTGGCGACTGGCCGATCGACAATAATCCAATCGAGAACGCCATCCGCCC
GATCGCTCTGGGCAAGAAAAATTGGATGTTTGCGGGTTCCGAAGCCGCCGGCAAGCGGGCCGCGGTGATTCAGTCGCTGCTCGCCACCGCACGCGCCAAT
GGCTTCGAGCCCCTGGCGTGGCTCTCCGACACCCTCGAGAAGCTGCCGGCCTGGCCCAACAGCCGCATCGACGAATTGCTGCCGATCAGAAAGAAAGAGT
CTGCGCAAGCGTAAGAGGGCGTGGAAGCGGCGGACGCTTAC
CGCGCTGGCGCGAGCACGTGGAGCGGTGGCGCCGCAGCGGTCAGACGCAGGCGGCCTACAGCGCCGCGCATGGCGTGAGCAAGAAGTCACTGGGGTACTG
GATTCGGCGGTCGCGCCACGAGTCAGCGCGCGAGGCGGATTCGGTCCTGACGCTGGTGGCGGCGCGTCCTGTCGGCGTTGCGCCGCCACGGCAAGCGGGG
GATGTGCTGTCGCTGTGCAGCCCGTCGGGCTGGCGCCTGCAGTTCGGTGCGCTGCCGCCGGCGCCGTGGCTCGCCGAGGTGCTCGCGCACGGGGCGACGT
GATGATTGCGCCGGAGCAGATCTGGCTCGCGGTCGAGCCGATCGACATGCGTCTGGGCATCGACGGCCTGTCGGCACGGCTGCAGAACAGCCTGGGCCGC
GCGCCGTGCGATGGCAGTGCGTACGCCTTCATCAACCGGGCCCGCACGCGCGTGAAGGTGCTGCGGTGGGATGGCACCGGAGTGTGGCTGAGTCAGCGGC
GTCTGCATCGGGGCAGCTTCAGCTGGCCCGCTGCCGACACGGCGGTGTTCGCGCTGTCGGCCGAGCAGTGGCAGTGGCTCGTGGCGGGGGTCGAGTGGCA
GCGCCTGAGTGCCGCCGCACCGGCTCACTGGCGGGTGTGACCGGTGTGCCGGCAGCGTCGACGATCCGCCACGTAAAAATTCCTGAAACCCTTGTGGATG
CTGGGATTCAGCGCCTTCATCGGGTATAATTCCGGCATGGATTTCGCCGCCGAACTGACCGCTTTCGACCTGCCGCCCGCGCTCGCGCAGCAGGTTCAGC
GGTGGGTCGCGCAGGCGGCCGACGTGGCCCGCCTGGAAGCCGAGCTCAAGCTGAGCAAGCTCAAGATCGAGGCCCTGGTCCACGAGATCGCCACCCTCAA
GCGCCTGCGCTTTGGCGCGCGCAGCGAGACGCTGCCGGCGGGCATGAAGGACTTGTTCGACGAGACGCTCGCGGCCGATCTGGCCGCGTGCGAAGCCCGG
CTCGAGGCGCTGCGCGACGCCGCGGCATCCGAAGCCGAAGCGTCTCCGAAGGCCCCGCCCGAGCGCCCCCGCGCCGGCCGCCCGCCGTTGCCGGCACACC
TCGAGCGCATCGAACACCGCCACGAGCCCGAATCCTGCAGCTGCGCCGCGTGCGGGCAGGATCTGGTCAAGATCGGCGAGGACGTCTCCGAGCAGCTCGA
CATCATCCCGGCCAAGTTCTTCGTGCATCGCCACATCCGTCCGCAGTACGCCTGCCGGCACTGCGAGACGGTCTCGACCGCCCCCGTGCCGGCGGCGGTC
ATCGACGGCGGCCTGGCCGCGCCGGGGCTGCTGGCGTGGGTGACGGTGAGCAAGTTCGTCGATCATCTGCCGCTGTACCGGCTCGAACAGATTGCCCGGC
GCAGCGAGGTGGCGCTACCGCGCTCGACACAGTCCGAGTGGATCGGGCGTATCGGGGTGGCGCTGTCGCCCCTCTACGCTCGACTGGTCGAGCATCTGTT
GGCGGGTACGGTGCTGCATGCGGACGAAACCCCCGTCGAGCAACTCGATCCGGGGCGCGGCAAGACGAAGCGGGCGTATCTGTGGGCCTACCGCAGCAAT
ACGCTCGGCGCCGACCCGCCCATCATCCTCTTCGATTACCAGCCCGGGCGCGGCGGGCAATATCCCCAGGCCTTCCTGAAAGGCTGGAAGGGCATGCTCA
TGGTCGATGACTACGCGGGGTACAAGGCGCTGCTGGGGGGCGACATCGGCGAGTTGGCCTGCATGGCCCATGCAAGACGCAAGTACTTCGAACTGCATCA
GGCCAACAAGAGCCCGGTGGCGGCCGAGGCGTTGCGGCGCATCGGCGAATTGTATGCGCTCGAAGAGCAGGCGCGCGACGTCTCGATCGAGGCGCGCGCC
GAACTGCGCGCGCAGTACGCCCGCCCGCGTCTGGAGGCGATGTACCTGTGGCTCGTGCAGACCCGCAAGACCGTGGCCGATGGCGCGGCGCTGGCGCGCG
CCATCGACTACAGCTTGAAGCGCTGGCCGGCGCTCGCGCGTTACGCGAGCCGTGGCGACTGGCCGATCGACAATAATCCAATCGAGAACGCCATCCGCCC
GATCGCTCTGGGCAAGAAAAATTGGATGTTTGCGGGTTCCGAAGCCGCCGGCAAGCGGGCCGCGGTGATTCAGTCGCTGCTCGCCACCGCACGCGCCAAT
GGCTTCGAGCCCCTGGCGTGGCTCTCCGACACCCTCGAGAAGCTGCCGGCCTGGCCCAACAGCCGCATCGACGAATTGCTGCCGATCAGAAAGAAAGAGT
CTGCGCAAGCGTAAGAGGGCGTGGAAGCGGCGGACGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
327 bp | 108 aa | 76 | 402 | + | No |
AG : IS66 TnpA
ORF sequence :
MSTQSERVARWREHVERWRRSGQTQAAYSAAHGVSKKSLGYWIRRSRHESAREADSVLTLVAARPVGVAPPRQAGDVLSLCSPSGWRLQFGALPPAPWLA
EVLAHGAT
EVLAHGAT
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
339 bp | 112 aa | 402 | 740 | + | No |
AG : IS66 TnpB
ORF sequence :
MIAPEQIWLAVEPIDMRLGIDGLSARLQNSLGRAPCDGSAYAFINRARTRVKVLRWDGTGVWLSQRRLHRGSFSWPAADTAVFALSAEQWQWLVAGVEWQ
RLSAAAPAHWRV
RLSAAAPAHWRV
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1578 bp | 525 aa | 837 | 2414 | + | No |
Chemistry : DDE
ORF sequence :
MDFAAELTAFDLPPALAQQVQRWVAQAADVARLEAELKLSKLKIEALVHEIATLKRLRFGARSETLPAGMKDLFDETLAADLAACEARLEALRDAAASEA
EASPKAPPERPRAGRPPLPAHLERIEHRHEPESCSCAACGQDLVKIGEDVSEQLDIIPAKFFVHRHIRPQYACRHCETVSTAPVPAAVIDGGLAAPGLLA
WVTVSKFVDHLPLYRLEQIARRSEVALPRSTQSEWIGRIGVALSPLYARLVEHLLAGTVLHADETPVEQLDPGRGKTKRAYLWAYRSNTLGADPPIILFD
YQPGRGGQYPQAFLKGWKGMLMVDDYAGYKALLGGDIGELACMAHARRKYFELHQANKSPVAAEALRRIGELYALEEQARDVSIEARAELRAQYARPRLE
AMYLWLVQTRKTVADGAALARAIDYSLKRWPALARYASRGDWPIDNNPIENAIRPIALGKKNWMFAGSEAAGKRAAVIQSLLATARANGFEPLAWLSDTL
EKLPAWPNSRIDELLPIRKKESAQA
EASPKAPPERPRAGRPPLPAHLERIEHRHEPESCSCAACGQDLVKIGEDVSEQLDIIPAKFFVHRHIRPQYACRHCETVSTAPVPAAVIDGGLAAPGLLA
WVTVSKFVDHLPLYRLEQIARRSEVALPRSTQSEWIGRIGVALSPLYARLVEHLLAGTVLHADETPVEQLDPGRGKTKRAYLWAYRSNTLGADPPIILFD
YQPGRGGQYPQAFLKGWKGMLMVDDYAGYKALLGGDIGELACMAHARRKYFELHQANKSPVAAEALRRIGELYALEEQARDVSIEARAELRAQYARPRLE
AMYLWLVQTRKTVADGAALARAIDYSLKRWPALARYASRGDWPIDNNPIENAIRPIALGKKNWMFAGSEAAGKRAAVIQSLLATARANGFEPLAWLSDTL
EKLPAWPNSRIDELLPIRKKESAQA
Blast result :
Comments
ISAzo15 orfA is 44% aa similar to ISDpr4, orfB and orfC are respectively 61% and 64% aa similar to ISThsp3.
References
1] Kuhner,S., Wohlbrand,L., Fritz,I., Wruck,W., Hultschig,C., Hufnagel,P., Kube,M., Reinhardt,R. and Rabus,R. (2005) J. Bacteriol. 187 (4), 1493-1503.
2] Rabus,R., Kube,M., Heider,J., Beck,A., Heitmann,K., Widdel,F. and Reinhardt,R. (2005) Arch. Microbiol. 183 (1), 27-36.
3] PROSCIENCE (2004) Direct Submission GenBank.
4] ISfinder annotation (2008)
2] Rabus,R., Kube,M., Heider,J., Beck,A., Heitmann,K., Widdel,F. and Reinhardt,R. (2005) Arch. Microbiol. 183 (1), 27-36.
3] PROSCIENCE (2004) Direct Submission GenBank.
4] ISfinder annotation (2008)