ISAar33
- Family IS21
- Group
Isoform Synonym(s)
| Accession number | Transposition | Origin | Host |
|---|---|---|---|
| | ND | Arthrobacter arilaitensis | Arthrobacter arilaitensis RE117 |
DNA section
IS Length : 2628 bp
Ends
IR Length : 45/50
IRL : TGAATATTTGACTCTTTGAGAGCCACCAATATTGCGATTCAGAGCCGCCC
IRR : TGAATATTTGACTCTTTGAGAGCCACCAATATTCCCACGCAGAGCCACCC
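The "45/50" figure can be reproduced by comparing IRL and IRR position by position. The sketch below is only an illustration, not the method used by the database: the function names are hypothetical, and `RIGHT_END` is the last 50 nt of the DNA sequence listed in this record, copied by hand. It assumes that IRR is written as the reverse complement of the right end, which the final check confirms for this entry.

```python
# Inverted repeats as listed in this record.
IRL = "TGAATATTTGACTCTTTGAGAGCCACCAATATTGCGATTCAGAGCCGCCC"
IRR = "TGAATATTTGACTCTTTGAGAGCCACCAATATTCCCACGCAGAGCCACCC"

# Last 50 nt of the element, top strand, copied from the DNA sequence below.
RIGHT_END = "GGGTGGCTCTGCGTGGGAATATTGGTGGCTCTCAAAGAGTCAAATATTCA"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_identical(a: str, b: str) -> int:
    """Count positions at which two equal-length sequences agree."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x == y for x, y in zip(a, b))


matches = count_identical(IRL, IRR)
print(f"IR identity: {matches}/{len(IRL)}")

# IRR should equal the reverse complement of the element's right end.
print("IRR matches right end:", reverse_complement(RIGHT_END) == IRR)
```

An ungapped comparison is enough here because both repeats are 50 nt long; repeats of unequal length would need an alignment instead.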
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATGGTGCGCA | TGAA | GAACCAAAAT | 4 |
DNA sequence
TGAATATTTGACTCTTTGAGAGCCACCAATATTGCGATTCAGAGCCGCCCCGTCGAAATGACGTTATTGTCCGTCAGAGCCACCCAATATTCTCCTTCAG
AGCCACCCTGTATAAACTCCTCCCACTACGCAACAGCCTGCGTGGTGGAAGGAGCCATTTTCAATGGTACGAAAGATCAAAGCGAAGCGAATACTGCAGC
TTCGCGCCGAGGGCTTGTCAGGACGCACCATCGCGTCCACCCAAGGCGTTTCCCGCAATAGCGTCGCAGAAGTCCTCAACGCTGCCGCCGCCAACTCCAG
CAGCTGGGACGAACTCAAGGACCTCACCGAAGACAAAGTCTACGAACTCCTGTTCCCTGGCCGCAGCGAGCACGAAAGCGTGTTCGCCCAACCGGACTGG
CCCACCGTGCATAAGGAACTGGCCAAAGTCGGCACCAACCTCAAGCTCCTGCACGGCGAATACTCCGATCAGGCCAAAGCCACCGGCGCAGCATTCATGG
GTTACGACCGGTTCTGCAAGTCCTACCAACGCTACGTCCTGGAACACGGGGCCACCTCACGAGTGGAGCATAAATCCGGTGTCAGCGTCGAAGTGGACTG
GTCAGGTCCAACCATGGCCCTGCATGACCCGGTCACTGGCCGACGATCCACGGTGTATTTGTTCGTCGCCTGCCTGCCGTTCAGCCGGTACGCGTTCGTT
GAGCCCTGCCTCGATATGAAGCAGGAATCCTGGATGCTATGCCACGCAGCGATGTTCAACGCTTTCGGCGGCAGCGTGCCGCGGATCGTGTGCGACAACC
TGAAAACCGGGGTGATCAAGCACCCGGCGGAAGGGGAGATTATCCTCAACGATGCCTACCGTCACCTCGCCGAGCACTATTCTTCCGCGATCCTGCCAGG
CAGAGTCCGCAGGCCAAAAGACAAATCCAGCGTGGAAAATACCGTGGGTCATGTGGCTACGTGGGTGATCGCTTCACTACGCCATGAAAAGTTCACCACG
CTTGATTCCCTGCGTGCGGCCATCTATCGGCAGGCAGAAGCCTATAACGCGCAACCCTTCCAGAAGCGTGCCGGCTCCCGCCAGAGTGTTTTCCGCGAGG
AAGAGCAGCCACTGCTGCGTCCGCTCCCAGTGGTGCCCTACGAGATCAGCACGTGGGTCCACGGACGCAAGGTCGCCAGGAATAGCTACGTGACGTGGAA
GAAGAACTTCTACTCCGTGCCGCTCAAACACGTCGGCGCCACCGTCGACCTGAGGATCACGTCCAAGACCCTGGAAGTCTATCTTCAGTCCCAACGGCTG
AGCAGTCACCTCTTGTGCAATGCGGCCACGGTGAATCAGTACCGGACCAATGATTCCGATATCCCACCAGAACGCAAATACCGCTCGTGGGACCCCAAAC
GGCTCCGCGGATGGGCTCACCGCATCGGCCCGAACACGGTGCAGGTGATCGATAAGATCTTCATTTCCGTGCCGGTTGCCGAGCAAGGAATCAACCCCGC
CCTGGCGGTGCTGCGCTTGTCTTCAAAATACAGTCCACAACGACTGGAGGACGCGTGCTTCGTCGCATTGCAGTCACGGATCCGCTCCCCACGCTACGGG
CATTTGCATCCGATTTTGCATAACAGGCAGGACGAGAACTGGAAAGAAGCGACCCTCCCACCAGCAGCTGAAGACAGCACTGGATACGTACGAGGCAGCG
ACTACTACGCAAGGAAGACCCGATGAACCTGAACGCCGAAACCAAGCGCAAACTCCGTGAGATGGCCGCAGGTGACCTGCTCACGGCATTCGAATCCCAG
GACGATGTGGTGAGCATGTCATTAAGTATCGAAGCACGGATCGAGCTGGCCGTGGACCAAGCCCACGGGCTATTTCTGAATACGAAGTCCCACGGTCTAA
TACGTCGGGCTAAGCTCCGTTATCCGCAGGCTGACTTGCGAAAAGTGGACCGGGTGGAAGAGCGGGGCCTGAACCAGCCGCTGCTGGCCCAGTTGGCGAC
CTGCCATTTCATTGAGCTCAACAGGAATCTGGTTTTCCAGGGTTTTACGGGGTCAGGCAAGTCGTATCTGGCGTGCGCCGTGGCGAAGCAGGCCCGTGTT
CACAGCTACCGTACCGGATACGTGCGGATGCCGGACCTCGAAGAAGAATGGCAGCAGGCCGCCAACAAGCCGTTGGGCGTGCAAAAGCTGTTGCGCAGGT
ATGCCAATTACAGTGCTCTGGTATTGGACGAATGGCTTTTGGAGCCACCGAGTGGAGAGTTCCTGAGTTTCATCTTCGAGCTCATGGAGCGAAGGTACGA
CGCCGCGTCGACTATTTTCTGCACTCAGTACAAGCAGGCCGATTGGCATGCCCGGTTGGGTGCTGGTGCGTTGGCCGACGCAATCATGGACCGCATCGTG
CACAACACGATCTGGATTGAGACCGGTGGGTTCAACATGCGCGAGGCGAACATGAGTGCTGGGCACTAAGCGCATCAAGCGTCCACGGTGAGATTCCTTG
TTCAGAGCCGCCGGTCAACCGTCAAGGCTGGACGGTGGCTCTGAACGAGAATATTGGGTGGCTCCGAACCGCAATAACGGGTGGCTCTGCGTGGGAATAT
TGGTGGCTCTCAAAGAGTCAAATATTCA
Protein section
ORF number : 2
ORF 1
Length (nt) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1563 bp | 520 aa | 164 | 1726 | + | No |
Chemistry : DDE
ORF sequence :
MVRKIKAKRILQLRAEGLSGRTIASTQGVSRNSVAEVLNAAAANSSSWDELKDLTEDKVYELLFPGRSEHESVFAQPDWPTVHKELAKVGTNLKLLHGEY
SDQAKATGAAFMGYDRFCKSYQRYVLEHGATSRVEHKSGVSVEVDWSGPTMALHDPVTGRRSTVYLFVACLPFSRYAFVEPCLDMKQESWMLCHAAMFNA
FGGSVPRIVCDNLKTGVIKHPAEGEIILNDAYRHLAEHYSSAILPGRVRRPKDKSSVENTVGHVATWVIASLRHEKFTTLDSLRAAIYRQAEAYNAQPFQ
KRAGSRQSVFREEEQPLLRPLPVVPYEISTWVHGRKVARNSYVTWKKNFYSVPLKHVGATVDLRITSKTLEVYLQSQRLSSHLLCNAATVNQYRTNDSDI
PPERKYRSWDPKRLRGWAHRIGPNTVQVIDKIFISVPVAEQGINPALAVLRLSSKYSPQRLEDACFVALQSRIRSPRYGHLHPILHNRQDENWKEATLPP
AAEDSTGYVRGSDYYARKTR
Blast result :
ORF 2
Length (nt) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
747 bp | 248 aa | 1723 | 2469 | + | No |
AG : IS21 helper
ORF sequence :
MNLNAETKRKLREMAAGDLLTAFESQDDVVSMSLSIEARIELAVDQAHGLFLNTKSHGLIRRAKLRYPQADLRKVDRVEERGLNQPLLAQLATCHFIELN
RNLVFQGFTGSGKSYLACAVAKQARVHSYRTGYVRMPDLEEEWQQAANKPLGVQKLLRRYANYSALVLDEWLLEPPSGEFLSFIFELMERRYDAASTIFC
TQYKQADWHARLGAGALADAIMDRIVHNTIWIETGGFNMREANMSAGH
Blast result :
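The ORF tables above can be cross-checked with simple coordinate arithmetic. The following is a minimal sketch under two assumptions that are not stated in the record but are consistent with its numbers: Begin and End are 1-based inclusive positions on the IS sequence, and End includes the stop codon. The `Orf` class and its property names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive, stop codon included

    @property
    def length_bp(self) -> int:
        return self.end - self.begin + 1

    @property
    def length_aa(self) -> int:
        # One codon is the stop codon and encodes no residue.
        return self.length_bp // 3 - 1


orf1 = Orf("ORF 1", 164, 1726)
orf2 = Orf("ORF 2", 1723, 2469)

for orf in (orf1, orf2):
    print(f"{orf.name}: {orf.length_bp} bp, {orf.length_aa} aa")

# Number of nucleotides shared by the end of ORF 1 and the start of ORF 2.
overlap = orf1.end - orf2.begin + 1
# Reading-frame offset of ORF 2 relative to ORF 1 (0 would mean same frame).
frame_offset = (orf2.begin - orf1.begin) % 3
print(f"overlap: {overlap} bp, frame offset: {frame_offset}")
```

With the coordinates in this record the two ORFs overlap by 4 bp and lie in different reading frames; in the listed DNA sequence, positions 1723 to 1726 read ATGA, so the ATG of ORF 2 overlaps the TGA stop codon of ORF 1.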
Comments
ISAar33 orfA (transposase) is 51% aa similar to ISPre4, and orfB (helper of transposition) is 52% aa similar to ISRme9.
References
[1] ISfinder annotation (2009)