ISAfe4
- Family IS66
- Group
Isoform Synonym(s)
| Accession number | Transposition | Origin | Host |
|---|---|---|---|
| | ND | Acidithiobacillus ferrooxidans | Acidithiobacillus ferrooxidans ATCC 23270 |
DNA section
IS Length : 2477 bp
Ends
IR Length : 17/22
IRL : GTAAGCGTCTAAGCAACCCACCTTCCCAAATTTCCTGACCTGACCGATCC
IRR : GTAAACGTTCAGCCAACCCACCCTTGCGGGTTGGCGCCAATCAGGCTAAT
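The "17/22" figure can be reproduced from the two IR strings. The sketch below assumes that it means 17 identical positions in an ungapped comparison of the first 22 bases of IRL and IRR, and that IRR is written as the reverse complement of the right end of the IS. The helper names `ir_identity` and `reverse_complement` are illustrative and not part of the record.

```python
# IR strings copied from the record; both read from the IS terminus inward.
IRL = "GTAAGCGTCTAAGCAACCCACCTTCCCAAATTTCCTGACCTGACCGATCC"
IRR = "GTAAACGTTCAGCCAACCCACCCTTGCGGGTTGGCGCCAATCAGGCTAAT"


def ir_identity(left, right, span):
    """Count identical positions in the first `span` bases (ungapped)."""
    return sum(a == b for a, b in zip(left[:span], right[:span]))


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


if __name__ == "__main__":
    # Assumption: "17/22" = identical positions / compared positions.
    print(f"IR identity: {ir_identity(IRL, IRR, 22)}/22")
    # IRR as it would read on the top strand, at the right end of the IS.
    print("Right end (top strand):", reverse_complement(IRR))
```

The reverse complement of IRR should coincide with the last 50 bases of the DNA sequence given in the DNA section, which is a quick way to confirm the orientation convention.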
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCGTAGCCCTGTA | GGAACTGC | GCACCCGTGACGA | 8 |
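One way to read the insertion-site table: the 8 bp direct repeat is the target sequence that is present once in the empty site and duplicated on both sides of the inserted element. The sketch below rebuilds both contexts under that assumption. The function names are hypothetical, and `is_seq` stands for the full IS sequence from the DNA section.

```python
# Values copied from the insertion-site table.
LEFT_FLANK = "GCGTAGCCCTGTA"
DIRECT_REPEAT = "GGAACTGC"
RIGHT_FLANK = "GCACCCGTGACGA"


def empty_site():
    """Target region before insertion (assumed: a single copy of the repeat)."""
    return LEFT_FLANK + DIRECT_REPEAT + RIGHT_FLANK


def occupied_site(is_seq):
    """Target region after insertion: the IS flanked by two copies of the repeat."""
    return LEFT_FLANK + DIRECT_REPEAT + is_seq + DIRECT_REPEAT + RIGHT_FLANK


if __name__ == "__main__":
    print("DR length:", len(DIRECT_REPEAT))
    print("Empty site:", empty_site())
    # "<IS>" is a placeholder for the 2477 bp element.
    print("Occupied site:", occupied_site("<IS>"))
```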
DNA sequence
GTAAGCGTCTAAGCAACCCACCTTCCCAAATTTCCTGACCTGACCGATCCTGATCTGCATTCAGCGCGGAGGGATGGTGATGAAGAAAGAAGAGAAAGTC
GCTTATTGGCGGCAGCAGGTAGAGGGATTTCAGGCGAGCGGACAGTCGGTCAAGAATTATTGTGCGCAGGCGGGGATCGCCGTAGCCACGCTGCATTACT
GGCGCAAACGCTTTGCCGATTCTGTACAAACGCCGTTGCTGTCCGCCGTGCCCGAAGGGTTTTTGCCGGTAACTCTGGCCGCGCCGGTCAGGACGCCTGT
TTCACCGGTGGAAATTCACCTGTTATCCGGTCGTAGCCTGAAGCTTGCGGCGCCGGTAGATACCAGCTGGCTCCAGACCTTGGTGCAGATTCTGGAAAGA
CCATGTGGCTAAATTCGAGCAGCCGGATCTGGCTCGCGGCAGCGCCCGTAGATATGCGGTTAGGCTTTGATGGCCTGGCGGCCAAGGTACAGGGGGTATT
GGCTGCCGATCCTTTTTGCGGTCATGCTTTTGTCTTTCGCAACCGCCGAGGGGATCGTCTGAAATTGTTACTGTGGGATGGACTGGGTTTCTGGCTGGTG
TATCGCCGTCTGGATCAGGGACGACTCCATTGGCCCCGCGCCGATGCCGGTGCCCTGGAACTCTCCGCTGCGCAGTGGGCGATGCTGGTAGAGGGGCGCC
CGTGGACGCCGTTACCGAGCCTGGAAAAATGCACGCCAAAACTGCTGTAACTAGCGGGAATTTCTTGTATAAATAGCCCAGTTCCGGTACTATATTCCGA
TGGTTTCAACCTCACACACACTGCTTCCCACCAGCCCGGACGCCCTGCGCGATCTGGTGCTGAGCCTGCTGCAACAGCAGGAGGAACAGGAAGCGGAACG
CACCCGGATCATCGCCCAACAGCAAGCGGCCATTGCCCTGCGCGATGAAACCATCGCCCGCCTGGAAAGCACCATCGCCAAACTGCAGCGCTGGCGATTC
GGACGCCGTTCGGAAAAGCTCTCTCCCGACCAGATCAGTCTTTGGGAAGAAGCGCTGGATACCGAAATCGCGGCGATGGAAAGCCTCCTCGAAACCGTTC
TGGAAGACAGTGCGGCCGTGACCGCTGGCCGTGCAGAGGGAGCAGCCGCCGACGCGCAGACCGTAACGGTCCCCGCCCGTCCCGTACGCCGGCACCCCGG
CCGCATGGCGATCCCCGCCCATCTGCCCCGGGTAGAAGTACGTCACGATCCCAAGACTTGCACTTGCGCGCAATGTGGGGGTCCACTTGAAACGGTCGGG
GAAGAGATCAGCGAAAAACTGGATTATATCCCCGGACGCTTCCAGGTGATTCGCCATATCCGCCCCAAACTGGCCTGCCGCCCCTGCGGTACCCTCGAAA
GTCCGGCATTACCGGCACAGGTGATTGACAAGGGCTTGCCCACGGCGCGCCTAGTGGCCCATGTGATGACGGCGAAACACGTGGATCACCTGCCTTTGTA
CCGGCAGGGAACCCAGTACCAGCGAGCGGGCGTACCGATCTCTCGCGCCACTCTCTGCAGCTGGCTGGGTCAGGGCGAATACTGGATCAGCATTCTCGCC
GAAGCCTGCAAAATGGCCTTGCTGGAAGGAAAGATTCTGCACGCCGACGAGACGCCCTTGCCCGTCCTCAACCCCGGCAGCGGCAAGACGGATAAAGCCT
ATCTCTGGGTGTATCGCAGCCAGGCGGATGCCCCGCATCCCATCGTGGTCTTTGATTATGCCCCGGACCGTAAGGGGATCCACGCGCAGCACTTTCTGGG
TGACTGGCAAGGCATCCTCCAGACCGATGACTATGGGGGGTATGATGCCCTCTACCGCAAGGAACAGATCATCGAAGCGGGATGCTGGGCGCATGTCCGC
CGCCACTTCTATGACGTGGAACAGCGGGGTCCCAGTCCGGTAGCCCAGAAGGCCCTGGCCTGGATCGTCAAACTCTACGCCATCGAAGCGGAGATCAAGG
AATCTCTACCAGATCAGAAAGTCGCCGCCCGGCAGCAGCGTGCCGGTCCCCTGCTGGAAGCCTTCCATGCCTGGCTCACGGAAACCCAGATGCAGGTGGC
GCCGCAAAGTGGCATCGCTAAAGCCATCGGCTATGCCCTCAACCGTTGGAAAGCACTGACGCTCTACCTCGAAGAAGGACAACTCAGCATCGATAATAAT
CCGGTGGAGCGAGCGCTGCGGGGCGTGGCCATTGGTCGCAAGAATTTTTTATTTGTTGGAAATGATGCCGGTGGCGAGCGTGCCGCGTCCTTCTACAGCA
TCATCGAAACGTGCAAACTCAACGGCATCGAGCCCTTCGCGTACCTCTGTGACGTGCTCGAAAAGCTCCCGACCTGGCCCAACAAAAAACTCCACGAACT
CTTGCCGTGGAACTGGAAAAAATCTGCATTAGCCTGATTGGCGCCAACCCGCAAGGGTGGGTTGGCTGAACGTTTAC
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
333 | 110 | 80 | 412 | + | No |
AG : IS66 TnpA
ORF sequence :
MKKEEKVAYWRQQVEGFQASGQSVKNYCAQAGIAVATLHYWRKRFADSAQTPLLSAVPEGFLPVTLAAPVRTPVSPVEIHLLSGRSLKLAAAVDTSWLQT
LVQILERPCG
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
348 | 115 | 403 | 750 | + | No |
AG : IS66 TnpB
ORF sequence :
MWLNSSSRIWLAAAPVDMRLGFDGLAAKVQGVLAADPFCGHAFVFRNRRGDRLKLLLWDGLGFWLVYRRLDQGRLHWPRADAGALELSAAQWAMLVEGRP
WTPLPTLEKCTPKLL
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1638 | 545 | 800 | 2437 | + | No |
Chemistry : DDE
ORF sequence :
MISAPQTPLPTSPDALRDLVLSLLQQQEEQEAERTRIIAQQQAAIALRDETIARLESTIAKLQRWRFGRRSEKLSPDQISLWEESLDTEIAAMESLLETV
LEDSAAVTAGRAEGAAADAQTVTVPARPVRRHPGRMAIPAHLPRVEVRHDPKTCTCAQCGGPLETVGEEISEKLDYIPGRFQVIRHIRPKLACRPCGTIE
SPALPAQVIDKGLPTARLVAHVMTAKHVDHLPLYRQGTQYQRAGVPISRATLCSWLGQGEYWISLLAEACKMALLEGKILHADETPLPVLNPGSGKTDKA
YLWVYRSQADAPHPVVVFDYAPDRKGIHAQHFLGDWQGILQTDDYGGYDALYRKEQIIEAGCWAHVRRHFYDVEQRGPSPVAQKALAWIVKLYAIEAEIK
ESLPDQKVAARQQRAGPLLEAFHAWLTETQMQVAPKSGIAKAMAYALNRWKALTLYLEEGQLSIDNNPVERALRGVAIGRKNFLFVGNDAGGERAASFYS
IIETCKLNGVEPFAYLCDVLEKLPTWPNKRLHELLPWNWKKTALP
Blast result :
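The ORF coordinates can be cross-checked against the stated lengths. The sketch below assumes 1-based inclusive Begin/End positions and that the nucleotide length includes the stop codon, so the protein length is one less than the codon count. `check_orf` and `overlap_bp` are illustrative helpers, not part of the record.

```python
# (name, length in bp, length in aa, begin, end), copied from the ORF tables.
ORFS = [
    ("ORF 1", 333, 110, 80, 412),
    ("ORF 2", 348, 115, 403, 750),
    ("ORF 3", 1638, 545, 800, 2437),
]
IS_LENGTH = 2477  # from "IS Length" in the DNA section


def check_orf(length_bp, length_aa, begin, end, is_length=IS_LENGTH):
    """Return True if coordinates, bp length and aa length agree."""
    span_ok = end - begin + 1 == length_bp            # 1-based inclusive span
    codons_ok = length_bp % 3 == 0 and length_bp // 3 - 1 == length_aa
    in_bounds = 1 <= begin <= end <= is_length
    return span_ok and codons_ok and in_bounds


def overlap_bp(first_end, second_begin):
    """Number of bases shared by two consecutive ORFs (0 if none)."""
    return max(0, first_end - second_begin + 1)


if __name__ == "__main__":
    for name, bp, aa, begin, end in ORFS:
        status = "consistent" if check_orf(bp, aa, begin, end) else "inconsistent"
        print(name, status)
    print("ORF 1 / ORF 2 overlap (bp):", overlap_bp(412, 403))
```

Under these assumptions all three ORFs are internally consistent, and ORF 1 and ORF 2 share a short overlap while ORF 3 starts downstream of ORF 2.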
Comments
There are 3 copies (and one truncated copy) of ISAfe4 in Acidithiobacillus ferrooxidans ATCC 23270.
At the amino acid level, ISAfe4 is 45% similar to ISDpr1 (ORFA), and 72% (ORFB) and 59% (ORFC) similar to ISPre3.
References
[1] Robert DeBoy (2005) Direct submission.
[2] ISfinder annotation (2008)