ISThsp12
- Family IS1634
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Thiomonas sp. | Thiomonas sp. 3As |
DNA section
IS Length : 2004 bp
Ends
IR Length : 18/19
IRL : GGCCACCCAAGTTCGACACCGCTCCGCCGATTTTCCCCGCTCCAAGGCGC
IRR : GGCCACCCGAGTTCGACACTTCAAGAGCGTAAGTCATTGATTTGACGAGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCAACCGGCGCCAGGGTCGGT | AGGGTGGCGCAGTTTGGCTGGGG | 0 |
DNA sequence
GGCCACCCAAGTTCGACACCGCTCCGCCGATTTTCCCCGCTCCAAGGCGCAAGAAGCTCATTGTTGACAGGGGTTTGGCACATCGAGAGCGCGTGGTCCA
GACGACATTTGTAGTGGCACGTTTAATGAGTGATGTAGTTCCTTGTTTCTCGCTGTTGCGCTACACTTGCGCCATGTTTCTCAAGCTCACGAAGTCTGGC
GGCCGGCATTACGCGCAGCTCGTCGCGTCCTTCCGCAACGAGGCCGGCCAGCCGCGCCAGCGCACCATCTGCACGCTGGGCCGGCTGGAGCCCGGCGGTG
ATGTCGACAAGCTGATCGCCTCGTTGCAGCGGGCGCGGGGCCTGGACGCCGCCCCCGCCGGCAACCCGCTCGACGGCCTGCGTTTTGAGGGCAGCCGCTG
TGCCGGCGATGTGTGGGCGCTGTGGCAACTGTGGCGCACCCTGGGCTTCGATGGCCTGGCCACCGCCTGGCGTGGCTCGCGGGTCGAGGTCGATGTGCTG
GGGTGCCTGCGCGCCATGGTCTTCAACCGCCTGTGCGACACCGGCAGCAAGCTCGGCGTGCTGCGCTGGCTGGAGACGGTGGCGTTGCCGCTGGACTTCG
GGTTCGCCGACGGGCCGCCGCAGCACCAGCACCTGCTGCGCGCCATGGACGTCATCGACGAACACAGCGATGCGCTTGGCGCACGACTGGCCACGCTCAT
GCGCCCCCTGATCGACGAAGACTTGTCAGTGGTGTTCTACGACCTCACCACCGTCGAGGTCGCCGGCGAGGCGGTGGTGGCTGACGACGTGCGCGCCTAC
GGGATGAGCAAGTCAGGGATGGTGGCGCGGCAGTTCATGCTGTCGCTGGTGCAGACCGCTGAAGGCCTGCCGATCGCCCATGAGGTGCACCCGGGCAACA
TCGCCGAGGCCAAGACACTGCTGCCGATGATCAAGTCGCTGCTGGCGCGCTACCCGCTCAAGCGTGTGGTGCTGGTGGCCGACCGTGGGCTGCTGAGCGT
GAACAACCTCGACGAGTTGGCCACGCTGCAGGCCACGCTGGCCGGCGAGGGGCGCGCCGTGAGCCTGGAGTACGTGCTGGCGGTGCCGACAGCACGCTAT
GGGGACTTCGCCGAGGCGCTGCGCACCTCGGCCAGCAGCCAGCCTGCCGACCAGCCGTGGGTGGCCGAGAGCACTTGGCAGTCATCCAGCAAGTCGTCGT
CCAAGTCGACGTCACCATCAGCGCCCATCCAGCGCCGACTGGTCATCGCCCACGACCCCGAGGTGGCCCGGCGGCGCACCCAGGCGCGCAACAAGGCGAT
CACGGAGCTGCTGGCCCTGGGTGAGCAGTGGGGCGGCAAGCTCGATGCGCAGGACGCTGGCGCAAGCCGCCGCGGCCGCCCGCTGTCGGACTCGGGTGCG
AAGGCGCGCTTCTATCACGCGGCCCAGGACGCGAGCCTGGCCCACCTGATCAAGGTCGATCTGCAGGCCGAGGCCTTCAGCTTCCACGTCGACGAGGACA
AGCAGCGCGACCTTGAACTGCTCGACGGCAAGCTGCTGCTGGTGACCAACACCGACGCGCCAGCGGTCGAGGTGGTGCAGCGCTACAAGAGCCTGGCCGA
CATCGAGCGAGGCTTCCGGGTGCTGAAGTCCGACATCGAGATCGGTCCGGTGTACCACCGGCTGCCAAAGCGCATTCGCGCGCACGCGCTGGTGTGCTTC
ATGGCGCTGATCCTGTACCGGGTGATGCGCATGCGGCTAAAGGCCAACGACCACGATGAGTCACCGATCCGCCTGCTGCAGCAACTGCAGCGCATCCATC
AGCAGACCGTGCGCACCGCCGACGGGCAAGCTCTGCACGGGCTGACCGAAATGACGCCGACGCAAAAGTCGCTGTTCTCCGTTCTCCAACTGCCAATGCC
GACGCCTGCCGCACTCTCCAAGCCCGTTTTGTAGTCTGCGCCTTTGACCATCGTCCTCGTCAAATCAATGACTTACGCTCTTGAAGTGTCGAACTCGGGT
GGCC
GACGACATTTGTAGTGGCACGTTTAATGAGTGATGTAGTTCCTTGTTTCTCGCTGTTGCGCTACACTTGCGCCATGTTTCTCAAGCTCACGAAGTCTGGC
GGCCGGCATTACGCGCAGCTCGTCGCGTCCTTCCGCAACGAGGCCGGCCAGCCGCGCCAGCGCACCATCTGCACGCTGGGCCGGCTGGAGCCCGGCGGTG
ATGTCGACAAGCTGATCGCCTCGTTGCAGCGGGCGCGGGGCCTGGACGCCGCCCCCGCCGGCAACCCGCTCGACGGCCTGCGTTTTGAGGGCAGCCGCTG
TGCCGGCGATGTGTGGGCGCTGTGGCAACTGTGGCGCACCCTGGGCTTCGATGGCCTGGCCACCGCCTGGCGTGGCTCGCGGGTCGAGGTCGATGTGCTG
GGGTGCCTGCGCGCCATGGTCTTCAACCGCCTGTGCGACACCGGCAGCAAGCTCGGCGTGCTGCGCTGGCTGGAGACGGTGGCGTTGCCGCTGGACTTCG
GGTTCGCCGACGGGCCGCCGCAGCACCAGCACCTGCTGCGCGCCATGGACGTCATCGACGAACACAGCGATGCGCTTGGCGCACGACTGGCCACGCTCAT
GCGCCCCCTGATCGACGAAGACTTGTCAGTGGTGTTCTACGACCTCACCACCGTCGAGGTCGCCGGCGAGGCGGTGGTGGCTGACGACGTGCGCGCCTAC
GGGATGAGCAAGTCAGGGATGGTGGCGCGGCAGTTCATGCTGTCGCTGGTGCAGACCGCTGAAGGCCTGCCGATCGCCCATGAGGTGCACCCGGGCAACA
TCGCCGAGGCCAAGACACTGCTGCCGATGATCAAGTCGCTGCTGGCGCGCTACCCGCTCAAGCGTGTGGTGCTGGTGGCCGACCGTGGGCTGCTGAGCGT
GAACAACCTCGACGAGTTGGCCACGCTGCAGGCCACGCTGGCCGGCGAGGGGCGCGCCGTGAGCCTGGAGTACGTGCTGGCGGTGCCGACAGCACGCTAT
GGGGACTTCGCCGAGGCGCTGCGCACCTCGGCCAGCAGCCAGCCTGCCGACCAGCCGTGGGTGGCCGAGAGCACTTGGCAGTCATCCAGCAAGTCGTCGT
CCAAGTCGACGTCACCATCAGCGCCCATCCAGCGCCGACTGGTCATCGCCCACGACCCCGAGGTGGCCCGGCGGCGCACCCAGGCGCGCAACAAGGCGAT
CACGGAGCTGCTGGCCCTGGGTGAGCAGTGGGGCGGCAAGCTCGATGCGCAGGACGCTGGCGCAAGCCGCCGCGGCCGCCCGCTGTCGGACTCGGGTGCG
AAGGCGCGCTTCTATCACGCGGCCCAGGACGCGAGCCTGGCCCACCTGATCAAGGTCGATCTGCAGGCCGAGGCCTTCAGCTTCCACGTCGACGAGGACA
AGCAGCGCGACCTTGAACTGCTCGACGGCAAGCTGCTGCTGGTGACCAACACCGACGCGCCAGCGGTCGAGGTGGTGCAGCGCTACAAGAGCCTGGCCGA
CATCGAGCGAGGCTTCCGGGTGCTGAAGTCCGACATCGAGATCGGTCCGGTGTACCACCGGCTGCCAAAGCGCATTCGCGCGCACGCGCTGGTGTGCTTC
ATGGCGCTGATCCTGTACCGGGTGATGCGCATGCGGCTAAAGGCCAACGACCACGATGAGTCACCGATCCGCCTGCTGCAGCAACTGCAGCGCATCCATC
AGCAGACCGTGCGCACCGCCGACGGGCAAGCTCTGCACGGGCTGACCGAAATGACGCCGACGCAAAAGTCGCTGTTCTCCGTTCTCCAACTGCCAATGCC
GACGCCTGCCGCACTCTCCAAGCCCGTTTTGTAGTCTGCGCCTTTGACCATCGTCCTCGTCAAATCAATGACTTACGCTCTTGAAGTGTCGAACTCGGGT
GGCC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1761 bp | 586 aa | 174 | 1934 | + | No |
Chemistry : DDE
ORF sequence :
MFLKLTKSGGRHYAQLVASFRNEAGQPRQRTICTLGRLEPGGDVDKLIASLQRARGLDAAPAGNPLDGLRFEGSRCAGDVWALWQLWRTLGFDGLATAWR
GSRVEVDVLGCLRAMVFNRLCDTGSKLGVLRWLETVALPLDFGFADGPPQHQHLLRAMDVIDEHSDALGARLATLMRPLIDEDLSVVFYDLTTVEVAGEA
VVADDVRAYGMSKSGMVARQFMLSLVQTAEGLPIAHEVHPGNIAEAKTLLPMIKSLLARYPLKRVVLVADRGLLSVNNLDELATLQATLAGEGRAVSLEY
VLAVPTARYGDFAEALRTSASSQPADQPWVAESTWQSSSKSSSKSTSPSAPIQRRLVIAHDPEVARRRTQARNKAITELLALGEQWGGKLDAQDAGASRR
GRPLSDSGAKARFYHAAQDASLAHLIKVDLQAEAFSFHVDEDKQRDLELLDGKLLLVTNTDAPAVEVVQRYKSLADIERGFRVLKSDIEIGPVYHRLPKR
IRAHALVCFMALILYRVMRMRLKANDHDESPIRLLQQLQRIHQQTVRTADGQALHGLTEMTPTQKSLFSVLQLPMPTPAALSKPVL
GSRVEVDVLGCLRAMVFNRLCDTGSKLGVLRWLETVALPLDFGFADGPPQHQHLLRAMDVIDEHSDALGARLATLMRPLIDEDLSVVFYDLTTVEVAGEA
VVADDVRAYGMSKSGMVARQFMLSLVQTAEGLPIAHEVHPGNIAEAKTLLPMIKSLLARYPLKRVVLVADRGLLSVNNLDELATLQATLAGEGRAVSLEY
VLAVPTARYGDFAEALRTSASSQPADQPWVAESTWQSSSKSSSKSTSPSAPIQRRLVIAHDPEVARRRTQARNKAITELLALGEQWGGKLDAQDAGASRR
GRPLSDSGAKARFYHAAQDASLAHLIKVDLQAEAFSFHVDEDKQRDLELLDGKLLLVTNTDAPAVEVVQRYKSLADIERGFRVLKSDIEIGPVYHRLPKR
IRAHALVCFMALILYRVMRMRLKANDHDESPIRLLQQLQRIHQQTVRTADGQALHGLTEMTPTQKSLFSVLQLPMPTPAALSKPVL
Blast result :
Comments
ISThsp12 is 65% aa similar to ISAzo14.
References
1] ISfinder annotation (2009)