ISThsp8
- Family IS1634
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
 | ND | Thiomonas sp. | Thiomonas sp. 3As |
DNA section
IS Length : 1929 bp
Ends
IR Length : 24/32
IRL : CCGGAGTTCGACAGTTTGAGTCAAAAATTGGCTGAAAACGCCGACCCGGC
IRR : CCGGAGTTCGACAGTTACGGTCGTAAGTACGCGATATCGTTAGGGTGAGC
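The "IR Length : 24/32" field can be checked against the two ends listed above. The sketch below is illustrative only: it assumes the notation means "identical positions / compared span", and the helper name `count_matches` is hypothetical, not part of any database tooling.

```python
# Terminal sequences as listed in the record. IRR is given in the same
# outside-to-inside orientation as IRL, so the two can be compared directly.
IRL = "CCGGAGTTCGACAGTTTGAGTCAAAAATTGGCTGAAAACGCCGACCCGGC"
IRR = "CCGGAGTTCGACAGTTACGGTCGTAAGTACGCGATATCGTTAGGGTGAGC"


def count_matches(a: str, b: str, span: int) -> int:
    """Count identical positions over the first `span` bases of a and b."""
    return sum(1 for x, y in zip(a[:span], b[:span]) if x == y)


SPAN = 32  # assumption: the second number in "24/32" is the compared span
matches = count_matches(IRL, IRR, SPAN)
print(f"{matches}/{SPAN} positions match")
```

With the sequences above this prints `24/32 positions match`, which agrees with the IR Length field under the stated assumption.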
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGCGACAACC | TGACAC | ACCCGCGCCG | 6 |
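The insertion-site row can be read as a target-site duplication: the 6 bp direct repeat is present once in the empty target and on both sides of the element after insertion. The following is a minimal sketch of that layout, assuming the flanks listed lie outside the direct repeats; the function names and the placeholder element are hypothetical.

```python
# Values copied from the insertion-site table.
LEFT_FLANK = "CGCGACAACC"
DIRECT_REPEAT = "TGACAC"
RIGHT_FLANK = "ACCCGCGCCG"


def empty_site() -> str:
    """Target before insertion: the repeat sequence is present once."""
    return LEFT_FLANK + DIRECT_REPEAT + RIGHT_FLANK


def occupied_site(element: str) -> str:
    """Target after insertion: the element is bracketed by two repeat copies."""
    return LEFT_FLANK + DIRECT_REPEAT + element + DIRECT_REPEAT + RIGHT_FLANK


# Placeholder standing in for the 1929 bp element (not the real sequence).
placeholder = "N" * 1929
growth = len(occupied_site(placeholder)) - len(empty_site())
print(growth)  # element length plus one extra copy of the direct repeat
```

Under these assumptions the locus grows by the element length plus the DR length.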
DNA sequence
CCGGAGTTCGACAGTTTGAGTCAAAAATTGGCTGAAAACGCCGACCCGGCCTTGACGGAACAAAGGGTTAGCCTCGTTTTTGGAAAAGTACGTGTGGTGG
CACGTTTTTCCACATTGTGAGGCTGTTGCCATTGGCAGATGGGTTGGTGTATTGTGGCGGCATGTTCATCAAGATCACCACCTCCGGCGGTCGCCGCTAC
GTCCAGCTCGTGGAGTCCTACCGGGATGAGAGCGGGCGGGTGAAGAAACGCACGGTGGCCACCCTGGGGCGCGCCGACCAAGCTGGCAGCCAGCTTGAGA
GCGTGATCCGCGGCCTGCAGCGACTTCGCAGCGACACGCCGCCGGAGGCCGGCGCGGTGGCGACAGTGGCCGACGTGCGGTTCGAGTCCAGCCGCGCCCT
GGGCGACGTATGGGCGCTGACCGAGCTGTGGGACGAATTGGGATTTGGCGAGCTGCGCCGGGTTTTTGGTCGCACGCGCAGCCGCATCGACGTCGAGGCG
CTGGTGCGCCTGATGGTGCTCAACCGCCTGTGCGACCCGACGTCCAAGCTCGGCGTGCTGCGCTGGTTGCAGACGGTGGCGCTGCCGCGCTTTGCGCCCA
AGGAGGTGACGCATCAGCACCTGCTGCGGGCGATGGATGCGCTGGTCGAACACCAGGACTGCGTCCAGGCGGCGTTGGCCGGGCTGCTGCGCCCGCTGAT
CGACCAGGATCTGTCGGTGGTGTTCTACGACATGACCACCATCGGCGTGGAGGGCCAGACCGAGTTGGCCGGCGACCTGCGCCAGTTCGGCCTGTCCAAG
GACGGGGGGATGCGTCGCCAGTTCATGCTCGGCGTGGTGCAGACGGCCGAGGGACTGCCGCTGACGCATCGGGTCTGGGAAGGGAACACGGCCGAGGCAC
CGACGCTCAGCACCGTGGTCCAGGAGGTTCTGGCGCTGTACCCGGTCAAGCGCGTGGTGCTGGTGGCCGATCGCGGTCTGCTCAGCCTGGACAATCTGGA
GTGGCTGCGCGGGCAACGCGTGGGCGGCACCGAGCAGCCACTGGAGTTCATCCTGGCCGTGCCGGGGCGGCGTTATGCCGAATTCTCGGAGATTCTGGAG
CCGATCCACGCGCAGCGCTGCGCGGCAGCCACCCGCGAGGTCCTCGGCGAGACCCGCTGGCAGGACCTGCGCCTGGTCTGGGCCCACGATCCGCGCCGCG
CCCTGGAACAAACCGCAGCGCGCCGCCAGCATATCGGAGAACTCACGCAGGAAGCCCAGCAGCGTGCTGGCAAGCTTGATGCGCAAGAAGGCGGCACCGC
CTTCAGAGGCCGGCGCCTGAGCGACTCCGGTGCTAAGGCCTGGCTGTATCGCGCGGTCTCCGAGGCGCATCTGGGCTCGATCATCAAGGTCGACCTGCAA
TCGGATCTGTTCACCTACGTCATCGACGACAAGGCGCTGGCGCGTGCAGAACTGGGCGACGGCAAGCTGTTGTTGGTGACCAATGTGCCAGAGTTGTCGG
CGCTTGAGGTTCTGCGACGCTACAAGAGCCTGGCCGACATCGAGCGCGGCTTCAAGATTCTCAAATCCGAGATCGAGATCGCCCCGGTGTTCCACCGCTT
GCCCGACCGCATCCGTGCCCATGCGCTGATCTGCTTCATCGCTCTGGTGCTGTACCGCGTCATGCGCATGCGCCTCAAGGACGCCGGCTCCAAGCTGTCG
CCCGAACGTGCACTCGAGCAACTGCGACGCATCCAATACCACCAGGTCCACCTCGGCGGCGAGCGCCGCGACGGCACGTCCTCGCTCAGCGACGCCGACC
ACGCCATCCTCCAAGGACTCAAGCTTCCGAAACCCGCCATCGACGACCAACTCGCACTCCTGTAGGGGCACGTTTCAACGCTCACCCTAACGATATCGCG
TACTTACGACCGTAACTGTCGAACTCCGG
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1704 | 567 | 162 | 1865 | + | No |
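The ORF lengths in the table follow from its coordinates. The short check below is a sketch that assumes Begin and End are 1-based, inclusive positions and that the End coordinate includes the stop codon; the variable names are illustrative.

```python
# Coordinates copied from the ORF table (assumed 1-based and inclusive).
BEGIN, END = 162, 1865

orf_bp = END - BEGIN + 1             # nucleotides spanned by the ORF
codons, remainder = divmod(orf_bp, 3)
protein_aa = codons - 1              # assumption: the last codon is the stop

print(orf_bp, codons, remainder, protein_aa)
```

Under these assumptions the result is 1704 bp, a whole number of codons, and 567 aa, matching the values listed for ORF 1.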
Chemistry : DDE
ORF sequence :
MFIKITTSGGRRYVQLVESYRDESGRVKKRTVATLGRADQAGSQLESVIRGLQRLRSDTPPEAGAVATVADVRFESSRALGDVWALTELWDELGFGELRR
VFGRTRSRIDVEALVRLMVLNRLCDPTSKLGVLRWLQTVALPRFAPKEVTHQHLLRAMDALVEHQDCVQAALAGLLRPLIDQDLSVVFYDMTTIGVEGQT
ELAGDLRQFGLSKDGGMRRQFMLGVVQTAEGLPLTHRVWEGNTAEAPTLSTVVQEVLALYPVKRVVLVADRGLLSLDNLEWLRGQRVGGTEQPLEFILAV
PGRRYAEFSEILEPIHAQRCAAATREVLGETRWQDLRLVWAHDPRRALEQTAARRQHIGELTQEAQQRAGKLDAQEGGTAFRGRRLSDSGAKAWLYRAVS
EAHLGSIIKVDLQSDLFTYVIDDKALARAELGDGKLLLVTNVPELSALEVLRRYKSLADIERGFKILKSEIEIAPVFHRLPDRIRAHALICFIALVLYRV
MRMRLKDAGSKLSPERALEQLRRIQYHQVHLGGERRDGTSSLSDADHAILQGLKLPKPAIDDQLALL
Blast result :
Comments
ISThsp8 shares 76% amino acid similarity with ISCARN23.
References
[1] Stephanie Weiss (2008) Direct submission.