ISAar36
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Arthrobacter arilaitensis | Arthrobacter arilaitensis RE117 |
DNA section
IS Length : 2571 bp
Ends
IR Length : 34/48
IRL : TGAATATTCGACTCTTTGGGGGCCACTGAAATTGTCTTTGGGAGCCATTA
IRR : TGAATATTCGTAGGTCTGGGTACCACCAGATATTGCCTTTGGGGACCACA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCTCAAAGCA | CCGCG | TACCCACGTC | 5 |
DNA sequence
TGAATATTCGACTCTTTGGGGGCCACTGAAATTGTCTTTGGGAGCCATTACCGCAAATCGTGATTATTGTGCTCGGGTGCCACCTATGGCTTCATTGGGG
ACCACAAAGTAAGCTCCTTTCACTATGTGAGGGCCACATGGTGAAAGGAGCCATTTTCAATGGTACGTAAAATCCAGACGAAGCCAATCCTGAGGCTTCG
ATCCGAAGGATTCTCGCAAAGGTCGATCTCCACATCCCAGGGATTCTCCCGTCGTAGCGTTGCTGCCGTCTTCGAAGCAGCAGACCGCGAACAACTTGAC
TGGGAGGCAGCCCGAGACATCCCTGAGTCCACTGTCTACGACAGATTATTCCCAGGCAGAGGAACCCACACCAGCGTCTTCGTCCAACCTGATTGGGCAC
GTGTGCACCAAGAACTGGCGAAGGTCGGCGTGACGCTCAAGCTTCTCCACAGCGAATACGTTGACCAGCATTTCCGGCCTGATACGCCCACGATGGGCTA
CGACCGTTTCTGCAAGACTTACCAACGCTTTGTGTTGGAGTCTAATGTAGCTTCCCGCGTCGAACACAAAGCGGGGATGAGCGTGGAAGTGGACTGGTCG
GGACCGACCATGACACTCACAGATCCGCTGACCGGGGCACCTTCCAAGGTGTATTTGTTCGTCGCTTGCTTGCCATTCAGCCGGTACGCATTCGTCGAAT
CGACCTTGAACATGAACCAGGATTCCTGGCTCAGAGCGCACGTGAGCATGTTCGAAGCCTTCACGGGCAGCGTCCCGCGGATCGTGCCGGACAACCTGAA
AACAGGCGTGACCCGCCATCCAACCGAAGGTGAGATCGTGCTCAACGATGCCTACCGGCACATGGCGGCGCACTACAGTGCTGCTGTGCTGCCAGGACGG
GTACGCAAGCCGAAAGACAAAGCCAGCGTAGAGAACACCGTTGGACATATCGCTACCTGGGTCATCGCCGGTTTGCGGCATAGCACCTATACCAGCTTGA
ACGAATTACGCCAGGCAATCCGCGAACGGGTCCACGCCTACAACGCGCAGCCTTTTCAGAAGCGTGCTGGCTCGCGTACCAGCGTGTTCCGCGAACAAGA
GCAGCCTCTGCTGCATCCCTTGCCAGCCGTGCCATATGTCATCAGCACCTGGGTTTACGGGCGCAAAGTCGCGAAGAACAGCTACGTCTCGTACAAGCGG
AACTATTATTCGGTTCCGGTCGCCCATCTCGGGGCCAGCGTGGACTTGCGCGTCACTGACACGGTGCTGGAAATCTTCAAAGGACATCAGCGGTTGGGTA
GCCATGTGTTGCTGTCCGCGCAGAGTGTGAACCAGTACCAAACGAATGATTCGGATATCCCAGCAGAACACCGCTTCACTCAATGGGATCCGCAGCGAGT
GCGGGAGTGGGCGCAACGGTGTGGAGTGCAGACTTTGGAAGTGGTTGACCGGATTTTTGCTGCAGTCCAGGTTCAGGAACAAGGGATCAACCCTGCATTA
GCGGTGCTGCGGTTAAGCCGTAAGTACAGTTCAGACAGGTTGGACGCAGCATGCAGAATCGCTTTGGAAAGTGCTATCAACTCGCCGCGATACGCCCATT
TGGAGCCGATCTTGAAGACTGGGCAGGACAAGAACCTCATTGCTGAAGTCCCTGTTGTTGCGGATTCTGGTGGCTACGTCCGTGGTAGCGCTTACTACGA
CGGAGGACAGCGATGAGCGGTTTGGATCTGGAGACCAAAAGAAAACTGCGCGAGATGGGTGCGGTTGAATTGCTGCATGCGGTCGAAGCGCAGGACGAGT
CATTAAGCATGAGCCTAAGGTTCAATGAGCGGATGCGGATGGCCGTGGACGAAGCACATTCGGCCTACACCACGGGACGCGTGGGAGGGCTGGTTCGGCG
GGCCAAGCTTCGTTACCCCGATGCTGACCTGCGTACCTTGGATTTCGTCGAAGAGCGTGGCCTAGACCAAACCACTTTGGCTTCGTTGGGCAGTTGCGGT
TTCATCGCCCAGAACCATAACGTGGTTTTCCAAGGCTTCACTGGCTCAGGGAAGTCCTATCTGGGATGCGCGTTGGCTAAGCAGGCGTGCCGTCATCAGA
TTCGTACTTTTTACGTACGGATGCCGGACTTGGAAGAAGAATGGGTTCAAGTTCAGGACAAGCCGTTGGGCGCTTCGAAATTCTTGAAGAAATATGGGTC
GTACACGCTGCTGGTCATTGACGAGTGGCTGTTGGATCGACCTGATGGCGATTTCCTTCGAATGCTTCTGGAGCTGATGGAACGCCGTTATGGAACATCA
TCGACGGTGTTCTGCACCCAGTATCCGAAGAAGGATTGGCATCAGCGGCTCGGTTCCGGGGTTCATGCAGACGCGATCATGGATCGGATCATTCACAACA
CGACGTGGTTCGAGACCGGGACGTACAACATGCGTGAACAGTTGAGCTCTTCGTAACTAGGAATGCTGTTGGGGGCTGGTGGTTCCCTTCCGAGAGATCG
CTGGCTCCCAACGGCAATACTTGTGGTCCCCAAAGGCAATATCTGGTGGTACCCAGACCTACGAATATTCA
ACCACAAAGTAAGCTCCTTTCACTATGTGAGGGCCACATGGTGAAAGGAGCCATTTTCAATGGTACGTAAAATCCAGACGAAGCCAATCCTGAGGCTTCG
ATCCGAAGGATTCTCGCAAAGGTCGATCTCCACATCCCAGGGATTCTCCCGTCGTAGCGTTGCTGCCGTCTTCGAAGCAGCAGACCGCGAACAACTTGAC
TGGGAGGCAGCCCGAGACATCCCTGAGTCCACTGTCTACGACAGATTATTCCCAGGCAGAGGAACCCACACCAGCGTCTTCGTCCAACCTGATTGGGCAC
GTGTGCACCAAGAACTGGCGAAGGTCGGCGTGACGCTCAAGCTTCTCCACAGCGAATACGTTGACCAGCATTTCCGGCCTGATACGCCCACGATGGGCTA
CGACCGTTTCTGCAAGACTTACCAACGCTTTGTGTTGGAGTCTAATGTAGCTTCCCGCGTCGAACACAAAGCGGGGATGAGCGTGGAAGTGGACTGGTCG
GGACCGACCATGACACTCACAGATCCGCTGACCGGGGCACCTTCCAAGGTGTATTTGTTCGTCGCTTGCTTGCCATTCAGCCGGTACGCATTCGTCGAAT
CGACCTTGAACATGAACCAGGATTCCTGGCTCAGAGCGCACGTGAGCATGTTCGAAGCCTTCACGGGCAGCGTCCCGCGGATCGTGCCGGACAACCTGAA
AACAGGCGTGACCCGCCATCCAACCGAAGGTGAGATCGTGCTCAACGATGCCTACCGGCACATGGCGGCGCACTACAGTGCTGCTGTGCTGCCAGGACGG
GTACGCAAGCCGAAAGACAAAGCCAGCGTAGAGAACACCGTTGGACATATCGCTACCTGGGTCATCGCCGGTTTGCGGCATAGCACCTATACCAGCTTGA
ACGAATTACGCCAGGCAATCCGCGAACGGGTCCACGCCTACAACGCGCAGCCTTTTCAGAAGCGTGCTGGCTCGCGTACCAGCGTGTTCCGCGAACAAGA
GCAGCCTCTGCTGCATCCCTTGCCAGCCGTGCCATATGTCATCAGCACCTGGGTTTACGGGCGCAAAGTCGCGAAGAACAGCTACGTCTCGTACAAGCGG
AACTATTATTCGGTTCCGGTCGCCCATCTCGGGGCCAGCGTGGACTTGCGCGTCACTGACACGGTGCTGGAAATCTTCAAAGGACATCAGCGGTTGGGTA
GCCATGTGTTGCTGTCCGCGCAGAGTGTGAACCAGTACCAAACGAATGATTCGGATATCCCAGCAGAACACCGCTTCACTCAATGGGATCCGCAGCGAGT
GCGGGAGTGGGCGCAACGGTGTGGAGTGCAGACTTTGGAAGTGGTTGACCGGATTTTTGCTGCAGTCCAGGTTCAGGAACAAGGGATCAACCCTGCATTA
GCGGTGCTGCGGTTAAGCCGTAAGTACAGTTCAGACAGGTTGGACGCAGCATGCAGAATCGCTTTGGAAAGTGCTATCAACTCGCCGCGATACGCCCATT
TGGAGCCGATCTTGAAGACTGGGCAGGACAAGAACCTCATTGCTGAAGTCCCTGTTGTTGCGGATTCTGGTGGCTACGTCCGTGGTAGCGCTTACTACGA
CGGAGGACAGCGATGAGCGGTTTGGATCTGGAGACCAAAAGAAAACTGCGCGAGATGGGTGCGGTTGAATTGCTGCATGCGGTCGAAGCGCAGGACGAGT
CATTAAGCATGAGCCTAAGGTTCAATGAGCGGATGCGGATGGCCGTGGACGAAGCACATTCGGCCTACACCACGGGACGCGTGGGAGGGCTGGTTCGGCG
GGCCAAGCTTCGTTACCCCGATGCTGACCTGCGTACCTTGGATTTCGTCGAAGAGCGTGGCCTAGACCAAACCACTTTGGCTTCGTTGGGCAGTTGCGGT
TTCATCGCCCAGAACCATAACGTGGTTTTCCAAGGCTTCACTGGCTCAGGGAAGTCCTATCTGGGATGCGCGTTGGCTAAGCAGGCGTGCCGTCATCAGA
TTCGTACTTTTTACGTACGGATGCCGGACTTGGAAGAAGAATGGGTTCAAGTTCAGGACAAGCCGTTGGGCGCTTCGAAATTCTTGAAGAAATATGGGTC
GTACACGCTGCTGGTCATTGACGAGTGGCTGTTGGATCGACCTGATGGCGATTTCCTTCGAATGCTTCTGGAGCTGATGGAACGCCGTTATGGAACATCA
TCGACGGTGTTCTGCACCCAGTATCCGAAGAAGGATTGGCATCAGCGGCTCGGTTCCGGGGTTCATGCAGACGCGATCATGGATCGGATCATTCACAACA
CGACGTGGTTCGAGACCGGGACGTACAACATGCGTGAACAGTTGAGCTCTTCGTAACTAGGAATGCTGTTGGGGGCTGGTGGTTCCCTTCCGAGAGATCG
CTGGCTCCCAACGGCAATACTTGTGGTCCCCAAAGGCAATATCTGGTGGTACCCAGACCTACGAATATTCA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1557 bp | 518 aa | 160 | 1716 | + | No |
Chemistry : DDE
ORF sequence :
MVRKIQTKPILRLRSEGFSQRSISTSQGFSRRSVAAVFEAADREQLDWEAARDIPESTVYDRLFPGRGTHTSVFVQPDWARVHQELAKVGVTLKLLHSEY
VDQHFRPDTPTMGYDRFCKTYQRFVLESNVASRVEHKAGMSVEVDWSGPTMTLTDPLTGAPSKVYLFVACLPFSRYAFVESTLNMNQDSWLRAHVSMFEA
FTGSVPRIVPDNLKTGVTRHPTEGEIVLNDAYRHMAAHYSAAVLPGRVRKPKDKASVENTVGHIATWVIAGLRHSTYTSLNELRQAIRERVHAYNAQPFQ
KRAGSRTSVFREQEQPLLHPLPAVPYVISTWVYGRKVAKNSYVSYKRNYYSVPVAHLGASVDLRVTDTVLEIFKGHQRLGSHVLLSAQSVNQYQTNDSDI
PAEHRFTQWDPQRVREWAQRCGVQTLEVVDRIFAAVQVQEQGINPALAVLRLSRKYSSDRLDAACRIALESAINSPRYAHLEPILKTGQDKNLIAEVPVV
ADSGGYVRGSAYYDGGQR
VDQHFRPDTPTMGYDRFCKTYQRFVLESNVASRVEHKAGMSVEVDWSGPTMTLTDPLTGAPSKVYLFVACLPFSRYAFVESTLNMNQDSWLRAHVSMFEA
FTGSVPRIVPDNLKTGVTRHPTEGEIVLNDAYRHMAAHYSAAVLPGRVRKPKDKASVENTVGHIATWVIAGLRHSTYTSLNELRQAIRERVHAYNAQPFQ
KRAGSRTSVFREQEQPLLHPLPAVPYVISTWVYGRKVAKNSYVSYKRNYYSVPVAHLGASVDLRVTDTVLEIFKGHQRLGSHVLLSAQSVNQYQTNDSDI
PAEHRFTQWDPQRVREWAQRCGVQTLEVVDRIFAAVQVQEQGINPALAVLRLSRKYSSDRLDAACRIALESAINSPRYAHLEPILKTGQDKNLIAEVPVV
ADSGGYVRGSAYYDGGQR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
744 bp | 247 aa | 1713 | 2456 | + | No |
AG : IS21 helper
ORF sequence :
MSGLDLETKRKLREMGAVELLHAVEAQDESLSMSLRFNERMRMAVDEAHSAYTTGRVGGLVRRAKLRYPDADLRTLDFVEERGLDQTTLASLGSCGFIAQ
NHNVVFQGFTGSGKSYLGCALAKQACRHQIRTFYVRMPDLEEEWVQVQDKPLGASKFLKKYGSYTLLVIDEWLLDRPDGDFLRMLLELMERRYGTSSTVF
CTQYPKKDWHQRLGSGVHADAIMDRIIHNTTWFETGTYNMREQLSSS
NHNVVFQGFTGSGKSYLGCALAKQACRHQIRTFYVRMPDLEEEWVQVQDKPLGASKFLKKYGSYTLLVIDEWLLDRPDGDFLRMLLELMERRYGTSSTVF
CTQYPKKDWHQRLGSGVHADAIMDRIIHNTTWFETGTYNMREQLSSS
Blast result :
Comments
ISAar36 orfA (Transposase) is 54% aa similar to ISSpu5, and orfB (helper of transposition) is 53% aa similar to ISgur9.
ISAar36 was reconstructed in silico by deletion of ISAar4 sequence and one of the direct repeat generated by its insertion.
ISAar36 was reconstructed in silico by deletion of ISAar4 sequence and one of the direct repeat generated by its insertion.
References
1] ISfinder annotation (2009)