ISArsp15
- Family IS30
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
MH067967 | ND | Arthrobacter sp. | Arthrobacter sp. ANT_H19B pA19BH1 |
DNA section
IS Length : 1862 bp
Ends
IR Length : 19/26
IRL : GGATTCTATTGATCGAAGCAACGCCTATTTCTAGGTGGTTTGCAATGTTT
IRR : GGATTTCAGTGGTCGACGCAACGGATGAGTGTTTTTCGGGGGCGTGTGCC
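The "19/26" figure can be reproduced with a minimal sketch, assuming it means 19 identical positions in an ungapped comparison of the first 26 bases of IRL and IRR (the record does not state how the value was derived). The sketch also checks that IRR, as printed, is the reverse complement of the element's right end; `RIGHT_END` is simply the last 62 bp copied from the DNA sequence section, and the function names are illustrative.

```python
# Inverted repeats as listed in the record.
IRL = "GGATTCTATTGATCGAAGCAACGCCTATTTCTAGGTGGTTTGCAATGTTT"
IRR = "GGATTTCAGTGGTCGACGCAACGGATGAGTGTTTTTCGGGGGCGTGTGCC"

# Last 62 bp of the element, copied from the DNA sequence section.
RIGHT_END = "CTCCAAACAGCGGGCACACGCCCCCGAAAAACACTCATCCGTTGCGTCGACCACTGAAATCC"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return dna.translate(COMPLEMENT)[::-1]


def ir_matches(left: str, right: str, span: int) -> int:
    """Count identical positions over the first `span` bases (no gaps)."""
    return sum(a == b for a, b in zip(left[:span], right[:span]))


if __name__ == "__main__":
    # Assumption: "19/26" = identical positions / compared positions.
    print(f"IR identity: {ir_matches(IRL, IRR, 26)}/26")
    # IRR is written 5'->3' reading inward from the right end.
    print("IRR matches right end:", reverse_complement(RIGHT_END[-50:]) == IRR)
```

An ungapped comparison is the simplest reading of the figure; a gapped alignment could give a different count.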
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGATTCTATTGATCGAAGCAACG | GACCCACGCG | GGATTTCAGTGGTCGACGCAACG | 10 |
DNA sequence
GGATTCTATTGATCGAAGCAACGCCTATTTCTAGGTGGTTTGCAATGTTTCGTGCTTCTTCGTCAGCAGCAGGCGTCCGGTTTTTCGCCTCAATCAAGGA
AGGGCGCGGTCTCAAACCCTCCGCCCGCGACGCCGGCATCGACAAGGAGGTCGGGTATCGCTGGCTTCGCGAGAAGTACCTGCACCTGCGCCGGGCCGGC
AAGACGCCCGCCGAGACAACCGCCGTACTCGGGTTCACCACATCCCGGTTGCTGGCTTGGGAGGCCGACGTCGATCGCAGTGATGATCGGCACCATCTGC
GTGTCGACCGCGACGAAGAGGCCGCGTTCTGGGCGTGCTTCGAGGACAGCCAGGGCACGAAGGAAGCAGTGATAGCTGCGGGCGTGAGCCGGTCGACCGG
GTATCGGTGGATCGACAAACGCTTCAACCAGCTGCGGCGCGCGGGCGTCACTCTCCGGCGATGCCAAACCCAGTTGCAGCTCACGGATGACCGCACACAG
AGCCTCGAAGAACGACGGCTGCTCCGGCTGCGAAGAGACGCGGCTGCCGCCGCGGCCGCCCGACGTGAAGCCGCGCGGTCCTCCGGGCGCTACGCCGACC
GGGTCCTGGTGGGCGAGCTAACAGCGGGCCGGCAGCGGCTGAGGCTGCGCAATGAGAGGTATTGGCAGCTGATGCGTGACGGGCTGAGCAACGCGGAGGC
TTGCAGACTATTGGGCATGCATCGAGCCTCTGGCACCCAGATTCGTCAAGCCACCAAGTACCAGATCCCTCGTCTTCCCGGCCCGCGGGAAACACTCGGG
CGCTACCTGGACGCGCGCGAGCGGCTGCAGATCGCGGACTTGTTGCGGCTGGGGCACTCAATGCGCCAGATCGCGGCTGAACTGGGACGACAACCGTCGA
CCATCTCTCGGGAGCTGGGCCGGCACCGAAAAGCCGGGGGTCACTACCTGCCCGCGACGGCCGACCACGACGCGCGCCTGCAACGCGCCCGACCCAAAAT
GCCCAAGCTGGTTGCCAGCGCGAAACTGCGGCTTCTGGTGCAGCGAAAACTGAACCGGTGCTGGTCACCAGACGAGATCTGCGGCTGGATGAGGAAGGAG
TTCCCTGATGATCAGACGATGCGGCTCTGCCCGGAGACGATCTACCGGGCTCTGCTGCTCCGCGAGGGCCAGGGCCTGCACAAACGCTTCTCCGTGAAGC
TGCGCACCGGTCGGCGCATCCGCAAGAGCCGCTGGCGCCGACGAATCGGACGCGGATCAGCGATCATCAACATGACGATGATCGATCAGCGCCCCGCCGA
GGTCGAAGACCGGGAACAGGCCGGCCACTGGGAAGGCGACCTCATCGTCGGTCTCGGATCCGTCTCCGCGATGATGACTCTCCGCGAACGAAAGACCCAG
TACGGCATCATCGTGAACCTGCCCCTGGACCACACCGCCGCGAGCGTCAACGCGGCCGCCATCGCTGCGTTCGCAACCCTGCCGCCGCACCTGAAGCGAA
CCCTGACCTGGGACCAGGGAGTCGAGATGGCCTGGCACGAGAAGCTCACCCTCGCCACCGGAGTCCCGGTCTACTTCGCCGAACGCTCCAGCCCCTGGCA
GCGCGGCGCCAACGAGAACTTCAACGGGCTGGCCCGCCAGTACTTCCCCAAGGGCACCAACCTCGCCGTTCACAGCAGCGAGCACGTCGCCCATGTCATG
CGCGAGCTCAACGAACGGCCTCGGAAAACCCTGGGTTACGACACCCCCGCAGCCCGCCTACAGGCCGAACGCGACGCGCCGTCCGCCGCCGTGCGATAGC
CTCCAAACAGCGGGCACACGCCCCCGAAAAACACTCATCCGTTGCGTCGACCACTGAAATCC
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1755 | 584 | 45 | 1799 | + | No |
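The coordinates in the ORF 1 row can be cross-checked against the lengths and the first residues of the ORF sequence. The sketch below assumes 1-based inclusive coordinates, a stop codon included in the nucleotide span, and the standard genetic code (the record does not name a translation table). `FIRST_100` is the first line of the DNA sequence section; the helper names are illustrative.

```python
# Coordinates from the ORF 1 row (assumed 1-based, inclusive).
BEGIN, END = 45, 1799

# First 100 bp of the element, copied from the DNA sequence section.
FIRST_100 = (
    "GGATTCTATTGATCGAAGCAACGCCTATTTCTAGGTGGTTTGCAATGTTT"
    "CGTGCTTCTTCGTCAGCAGCAGGCGTCCGGTTTTTCGCCTCAATCAAGGA"
)

# Standard genetic code in TCAG order (assumption: not stated in the record).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def orf_lengths(begin: int, end: int) -> tuple:
    """Return (nucleotides, amino acids), excluding the stop codon from the aa count."""
    bp = end - begin + 1
    if bp % 3 != 0:
        raise ValueError("ORF span is not a whole number of codons")
    return bp, bp // 3 - 1


def translate(dna: str) -> str:
    """Translate complete codons of an uppercase A/C/G/T string."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


if __name__ == "__main__":
    print(orf_lengths(BEGIN, END))
    # First six codons starting at position 45 (index 44 in 0-based terms).
    print(translate(FIRST_100[BEGIN - 1:BEGIN - 1 + 18]))
```

Under these assumptions the span gives 1755 bp and 584 aa, and the first six codons translate to MFRASS, which matches the start of the ORF sequence.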
Chemistry : DDE
ORF sequence :
MFRASSSAAGVRFFASIKEGRGLKPSARDAGIDKEVGYRWLREKYLHLRRAGKTPAETTAVLGFTTSRLLAWEADVDRSDDRHHLRVDRDEEAAFWACFE
DSQGTKEAVIAAGVSRSTGYRWIDKRFNQLRRAGVTLRRCQTQLQLTDDRTQSLEERRLLRLRRDAAAAAAARREAARSSGRYADRVLVGELTAGRQRLR
LRNERYWQLMRDGLSNAEACRLLGMHRASGTQIRQATKYQIPRLPGPRETLGRYLDARERLQIADLLRLGHSMRQIAAELGRQPSTISRELGRHRKAGGH
YLPATADHDARLQRARPKMPKLVASAKLRLLVQRKLNRCWSPDEICGWMRKEFPDDQTMRLCPETIYRALLLREGQGLHKRFSVKLRTGRRIRKSRWRRR
IGRGSAIINMTMIDQRPAEVEDREQAGHWEGDLIVGLGSVSAMMTLRERKTQYGIIVNLPLDHTAASVNAAAIAAFATLPPHLKRTLTWDQGVEMAWHEK
LTLATGVPVYFAERSSPWQRGANENFNGLARQYFPKGTNLAVHSSEHVAHVMRELNERPRKTLGYDTPAARLQAERDAPSAAVR
Comments
ISArsp15 shares 63% amino acid similarity with ISLxc3.
References
[1] Romaniuk, K. (2018) Direct submission.