ISThsp7
- Family IS1634
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Thiomonas sp. | Thiomonas sp. 3As |
DNA section
IS Length : 1920 bp
Ends
IR Length : 12/13
IRL : CTGAGTTCGACAGTTCGAAGCCGTTGCGCAGCCACAAACAATCGCCGGGG
IRR : CCGAGTTCGACACCAACTGTCGTAAGTGCTTGATCTGTCTATGGCAGGTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCCGAGAGGTCGCCC | GCCC | GCCGCCTCAGTGGCCA | 4 |
DNA sequence
CTGAGTTCGACAGTTCGAAGCCGTTGCGCAGCCACAAACAATCGCCGGGGTCCTATGAATCAATCACTTGCAAGCGCTCAGAAGGGCGTTTTCAGATTTC
TTTGTAGTGCCAATTTACTACCGTGAAGTAGTTCATCTGCGCGTGCGCAGCGCTATACTTGGTGCATGTTCCTCAAGATCTCCTCCTCGGGCGGTCACCG
CTACGTTCGGCTGGTGGAGTCGTTCCGCAACGCGGATGGCCAACCCCGGCAACGCACGATCGCAACCCTCGGGCGTCTGGACGAACAGGGCGGACCGCTC
GATGCCCTGCTGGGGGCCTTGCTGCGGGCCAAGGGTCGGCCTTCGGGCGACAGTGATCCGTCTCAGGTGCGCTTCGAGTCGGCGCTGACGTTGGGCGATG
TTTGGGCCCTGCACGCCTTGTGGCACGAACTCGGATTCCATGGCCTGGGCGCCATCTTCCGGCGCGCCCGGTTCACCACTGCGGTCGAGCACGCGATCCG
TGTGATGGTCTTCAACCGACTGTGCGACCCAGACTCCAAGCTGGGCGTGCTGCGCTGGCTGCAGACCGTCAGCATGCCCGGCATCGACGTGGACAAGCTG
ACGCATCAGCACTTGCTGCGCAGCATGGATGCCCTGATGGACCACCAGCAAGCCGTCGACGACTGCGTGGCCCAGTTGCTGCGCCCGCTCATTGATGAAG
ACCTCTCGGTGGTCTTTTACGATCTGACCACCATCCGCGCCGAAGGTCTCAGCCAGCAAGACGGTGATGTACGCCACTTCGGCCTGTCCAAGGAGGGCGT
GATCGCCCGGCAGTTCCTGCTGGGCGTGGTGCAGACAGCCGACGGCATGCCGATCTTCCACGAGGTGTTCGACGGCAACGCCGCCGAGGCGCCGACCCTG
GAGCCCACCTTGAAGAAGGTGCTCTCGCGCTACCCGCACATCAGGCGCCTGGTGGTGGTGGCCGACCGCGGGCTACTTTCGCTTGACAACATCGAGGCGT
TGTCCAAGTTGCATGGGGCTGGAGATCGGCCGCTGGAGTTCATCCTGGCGGTGCCGGGGCGACGCTACGGTGAGTTTGTCGATCTTCTCGAGCCCATGAG
CGAGCGTGCTGCCCAGGCCAACCAGGAGATCGTCGAAGAGGCGCAGTGGCAAGGCCATCGCCTGGTGGTGGTCCACAGCCCCGTGCGAGCGACCGAGCAA
ACCCAGGAGCGCCTTGCACGCATCCATGCCTTGCAGCAACGCGCCGATCAGTTGGCCGGCAAGCTCGACGCCCAGGACGAAGGCAAAGTCCAGCGCGGGC
GCAAGCTCTCGGATTCCGGCGCCAAGGCCCGCTTCTTCCACGAAGTCAGCGACGCCCGCCTGGCGCGCATCATCAAGGTGGACCTGCAGTCCGATCTGTT
CACCTATGCGATCGATGAAACCGCACTCGCGCGAGCGCAACTCATGGACGGCAAGCTGATGCTGGTCACCAATGTCCAGGATCTGAGCCCGGCCGAGGTG
GTGCGGCGCTACAAGTCGCTGGCCGACATCGAGCGTGGCTTCAAGGTGCTCAAGTCCGAGATCGAGATCGCCCCGGTGTTCCACCGCCGGCCCGAGCGCA
TCAAGGCCCACGCCAGCCTGTGCTTCATCGCGCTGATCCTGTACCGCGTCATGCGCCAGCGCCTCAAACTCGCCAGCAGCGAACTGTCTCCAGAAACCGC
CCTGGCCGACCTGCGCCGCATCCAGCGCCACACCGTGCGCATCGACAGCGGCGCCCCCATCCACGGCATCTCCACCATCCAACCTCGCCAGGCCGATGTC
CTGGCCGCACTCAACATCAAAAAACCCACCCAAGACACCCAACTGCCCCTGCTGTAGTGGCAATTTCAACAACCTGCCATAGACAGATCAAGCACTTACG
ACAGTTGGTGTCGAACTCGG
TTTGTAGTGCCAATTTACTACCGTGAAGTAGTTCATCTGCGCGTGCGCAGCGCTATACTTGGTGCATGTTCCTCAAGATCTCCTCCTCGGGCGGTCACCG
CTACGTTCGGCTGGTGGAGTCGTTCCGCAACGCGGATGGCCAACCCCGGCAACGCACGATCGCAACCCTCGGGCGTCTGGACGAACAGGGCGGACCGCTC
GATGCCCTGCTGGGGGCCTTGCTGCGGGCCAAGGGTCGGCCTTCGGGCGACAGTGATCCGTCTCAGGTGCGCTTCGAGTCGGCGCTGACGTTGGGCGATG
TTTGGGCCCTGCACGCCTTGTGGCACGAACTCGGATTCCATGGCCTGGGCGCCATCTTCCGGCGCGCCCGGTTCACCACTGCGGTCGAGCACGCGATCCG
TGTGATGGTCTTCAACCGACTGTGCGACCCAGACTCCAAGCTGGGCGTGCTGCGCTGGCTGCAGACCGTCAGCATGCCCGGCATCGACGTGGACAAGCTG
ACGCATCAGCACTTGCTGCGCAGCATGGATGCCCTGATGGACCACCAGCAAGCCGTCGACGACTGCGTGGCCCAGTTGCTGCGCCCGCTCATTGATGAAG
ACCTCTCGGTGGTCTTTTACGATCTGACCACCATCCGCGCCGAAGGTCTCAGCCAGCAAGACGGTGATGTACGCCACTTCGGCCTGTCCAAGGAGGGCGT
GATCGCCCGGCAGTTCCTGCTGGGCGTGGTGCAGACAGCCGACGGCATGCCGATCTTCCACGAGGTGTTCGACGGCAACGCCGCCGAGGCGCCGACCCTG
GAGCCCACCTTGAAGAAGGTGCTCTCGCGCTACCCGCACATCAGGCGCCTGGTGGTGGTGGCCGACCGCGGGCTACTTTCGCTTGACAACATCGAGGCGT
TGTCCAAGTTGCATGGGGCTGGAGATCGGCCGCTGGAGTTCATCCTGGCGGTGCCGGGGCGACGCTACGGTGAGTTTGTCGATCTTCTCGAGCCCATGAG
CGAGCGTGCTGCCCAGGCCAACCAGGAGATCGTCGAAGAGGCGCAGTGGCAAGGCCATCGCCTGGTGGTGGTCCACAGCCCCGTGCGAGCGACCGAGCAA
ACCCAGGAGCGCCTTGCACGCATCCATGCCTTGCAGCAACGCGCCGATCAGTTGGCCGGCAAGCTCGACGCCCAGGACGAAGGCAAAGTCCAGCGCGGGC
GCAAGCTCTCGGATTCCGGCGCCAAGGCCCGCTTCTTCCACGAAGTCAGCGACGCCCGCCTGGCGCGCATCATCAAGGTGGACCTGCAGTCCGATCTGTT
CACCTATGCGATCGATGAAACCGCACTCGCGCGAGCGCAACTCATGGACGGCAAGCTGATGCTGGTCACCAATGTCCAGGATCTGAGCCCGGCCGAGGTG
GTGCGGCGCTACAAGTCGCTGGCCGACATCGAGCGTGGCTTCAAGGTGCTCAAGTCCGAGATCGAGATCGCCCCGGTGTTCCACCGCCGGCCCGAGCGCA
TCAAGGCCCACGCCAGCCTGTGCTTCATCGCGCTGATCCTGTACCGCGTCATGCGCCAGCGCCTCAAACTCGCCAGCAGCGAACTGTCTCCAGAAACCGC
CCTGGCCGACCTGCGCCGCATCCAGCGCCACACCGTGCGCATCGACAGCGGCGCCCCCATCCACGGCATCTCCACCATCCAACCTCGCCAGGCCGATGTC
CTGGCCGCACTCAACATCAAAAAACCCACCCAAGACACCCAACTGCCCCTGCTGTAGTGGCAATTTCAACAACCTGCCATAGACAGATCAAGCACTTACG
ACAGTTGGTGTCGAACTCGG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1692 bp | 563 aa | 166 | 1857 | + | No |
Chemistry : DDE
ORF sequence :
MFLKISSSGGHRYVRLVESFRNADGQPRQRTIATLGRLDEQGGPLDALLGALLRAKGRPSGDSDPSQVRFESALTLGDVWALHALWHELGFHGLGAIFRR
ARFTTAVEHAIRVMVFNRLCDPDSKLGVLRWLQTVSMPGIDVDKLTHQHLLRSMDALMDHQQAVDDCVAQLLRPLIDEDLSVVFYDLTTIRAEGLSQQDG
DVRHFGLSKEGVIARQFLLGVVQTADGMPIFHEVFDGNAAEAPTLEPTLKKVLSRYPHIRRLVVVADRGLLSLDNIEALSKLHGAGDRPLEFILAVPGRR
YGEFVDLLEPMSERAAQANQEIVEEAQWQGHRLVVVHSPVRATEQTQERLARIHALQQRADQLAGKLDAQDEGKVQRGRKLSDSGAKARFFHEVSDARLA
RIIKVDLQSDLFTYAIDETALARAQLMDGKLMLVTNVQDLSPAEVVRRYKSLADIERGFKVLKSEIEIAPVFHRRPERIKAHASLCFIALILYRVMRQRL
KLASSELSPETALADLRRIQRHTVRIDSGAPIHGISTIQPRQADVLAALNIKKPTQDTQLPLL
ARFTTAVEHAIRVMVFNRLCDPDSKLGVLRWLQTVSMPGIDVDKLTHQHLLRSMDALMDHQQAVDDCVAQLLRPLIDEDLSVVFYDLTTIRAEGLSQQDG
DVRHFGLSKEGVIARQFLLGVVQTADGMPIFHEVFDGNAAEAPTLEPTLKKVLSRYPHIRRLVVVADRGLLSLDNIEALSKLHGAGDRPLEFILAVPGRR
YGEFVDLLEPMSERAAQANQEIVEEAQWQGHRLVVVHSPVRATEQTQERLARIHALQQRADQLAGKLDAQDEGKVQRGRKLSDSGAKARFFHEVSDARLA
RIIKVDLQSDLFTYAIDETALARAQLMDGKLMLVTNVQDLSPAEVVRRYKSLADIERGFKVLKSEIEIAPVFHRRPERIKAHASLCFIALILYRVMRQRL
KLASSELSPETALADLRRIQRHTVRIDSGAPIHGISTIQPRQADVLAALNIKKPTQDTQLPLL
Blast result :
Comments
ISThsp7 is 75% aa similar to ISAzo6.
References
1] Stephanie Weiss (2008) Direct submission.