ISThsp10
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
| | ND | Thiomonas sp. | Thiomonas sp. 3As |
DNA section
IS Length : 2501 bp
Ends
IR Length : 26/30
IRL : TGTTGATTTCCATCCAGAACTGACCCAGTAACCCCCAGGATTTCCATCTA
IRR : TGTTGATTTCCATGCAAACCTGACCCACTATTTTCATCTCGATCTGACCC
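The "26/30" figure can be reproduced from the two terminal sequences above. The sketch below assumes that it means 26 identical positions over the first 30 bp, and that IRR is displayed as the reverse complement of the element's right end. The helper names and the `RIGHT_END` constant (the last 50 bases of the DNA sequence in this record) are illustrative, not part of the ISfinder annotation.

```python
# Terminal sequences copied from the Ends section of this record.
IRL = "TGTTGATTTCCATCCAGAACTGACCCAGTAACCCCCAGGATTTCCATCTA"
IRR = "TGTTGATTTCCATGCAAACCTGACCCACTATTTTCATCTCGATCTGACCC"
# Last 50 bases of the listed DNA sequence (top strand, 5' to 3').
RIGHT_END = "GGGTCAGATCGAGATGAAAATAGTGGGTCAGGTTTGCATGGAAATCAACA"


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def identical_positions(a, b, span):
    """Count positions that carry the same base within the first `span` bases."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


matches = identical_positions(IRL, IRR, 30)
print(f"{matches}/30 identical positions")

# Assumption check: IRR as listed is the reverse complement of the right end.
print(reverse_complement(RIGHT_END) == IRR)
```

Under these assumptions the count comes out at 26 of 30, and the reverse-complement check passes for the sequence as listed.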
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGGCTCCGTCAGGCCAGTG | | CCTACGTCGCAGCCTGTGGG | 0 |
TCAAATCAGATGCTCCGAA | | CCAATCCCCAAAAATCCATC | 0 |
CGCCTGCGGTCGATCTGCG | | TGCAGGCCGATGAAAGCCGG | 0 |
DNA sequence
TGTTGATTTCCATCCAGAACTGACCCAGTAACCCCCAGGATTTCCATCTAAATCTGACCCACGTACTAACCCTAACCTGCTGCTTTGTTTAGCAGCAGGA
GACCAGGAGTGATAGACGTGGCAACACTGAGTGTTATCAGACGCTGTGCCCTGCGCGAGCAGATGTCCATCCGCGAGATCTCCCGCCGCACCGGCCTTTC
TCGCAACACCATCCGCAAGTACCTGCGCGCAGGCGAGTCCGAACCGCACTACGCCAAGCGGGTTAGCCCCAGCAAGCTCGATCCTTTCGCTGGCAAACTC
GCTGGCTGGCTCAAGGCCGAGGCTGGCCGATCACGCAAGCAGCGGCGCACCGTCAAGCAGATGCACGCGGATCTACAGACCCTGGGCTACCCGGGCTCCT
ACAACCGCGTCGCGGCCTTCGTCCGGCTTTGGCACGAGGAGCGCCTGGTGGCCCAGCAGACTACGGGCCGTGGCACCTTCGTGCCCCTGACCTTCGGCCC
AGGTGAGGCCTTTCAGTTTGACTGGAGTGAGGACTGGGCGGTTATTGCCGGAGTGCGCACCAAGCTGCAGGTGGCGCACTTCAAGCTCAGCCACAGCCGG
GCGTTCTACCTGCGCGCCTATCTGCTGCAGACCCACGAGATGCTGTTTGACGCGCACAACCACGCCTTTGCGGTGTTCGGTGGCGTGCCGCGCCGGGGTA
TCTATGACAACATGCGCACCGCTGTGGATCGGGTTCGCAGGGGTAAGGAACGTGATGTCAACGCACGCTTTGCGGCCATGGCCAGCCACTTCTTGTTCGA
GGCCGAGTTCTGCAACCCCGCCTCCGGCTGGGAGAAGGGCCAGGTGGAGAAGAATGTGTGTGATGCGCGCCACCGGCTGTGGCAAGTCGTGCCGGCCTTC
CCGGCCCTCGACGATCTCAACGTCTGGCTGGAACAACGCTGCCAGGCGCTATGGCGCGAGATTGCGCATGGCAAGCTGCCGGGTACCGTAGCTCACGTCT
GGGCCCAGGAGCGGGCCGCCTTGATGCCGGTGCCACGGCCCTTCGATGGCTTTGTGGAACACACCAAGCGGGTCTCGCCCACCTGCCTGATCCACTTCGA
GCGCAACCGCTACAGCGTGCCAGCGCCGTAGGCCAATCGGCCTGTGAGCCTGCGGGTCTACGCGGATCGCCTGGTTGTCGCGGCCGAGGGCCAGATCGTC
TGTGAGCATCAGCGCCTGATCGAGCGCAACCACCATGGCGCTGGCCAGACCGTCTACGACTGGCGCCACTACCTGGCGGTGCTGCAGCGCAAGCCCGGGG
CCCTGCGCAACGGTGCTCCGTTCCTGGAGCTGCCTGGAGCCTTCAAGCGACTGCAGGCTGCACTCGTGAAACAGCCGGGTGGAGATAGGGAGATGGTGGA
GGTTATGGCGCTGGTGCTGCACCACGACGAGCAGGCTGTGCTGGCGGCGGTGGAACTGGCCTTGGAGGCCGGAGCCGCCAGCAAGACCCACATCCTGAAT
GTCTTGCACCGCTTGCTCGATGGCAAGCCGGCCCCGGCGCCTGTCACCTCTCCCCAGGCACTCAAGCTGTCTGTGGAACCCCAGGCCAACGTGCTTCGCT
ATGACCAACTGAGGGAGGTCCGCTATGCGTCATGACCCTGCCATCGCTTCCATCGTGATCATGCTGCGCGAGCTCAAGATGCACGGCATGGCCCAGGCGG
TCACTGAACTGGCCGAGCAAGGTGCTCCCGCTTTCGAGGCAGCCCAGCCGATCCTGTCGCAGTTACTCAAAGCGGAGACCGCCGAGCGAGAAGTGCGCTC
GGTGGCCTACCAGTTGAAGGGGGCAAGATTCCCGGCGTATCGGGACCTGGCAGGCTTTGACTTCTCGCACAGCGAAGTGAACGAAGCCCTGGTGCGCCAG
TTGCATCGGTGCGCGTTCCTGGAGGATGCCAACAACGTGGTCCTCGTGGGCGGGCCCGGCACTGGCAAGACCCACATTGCCACAGCCCTCGGTGTGCAGG
CCATTGAGCACCATCACCGCAGGGTTCGGTTCTTGTCCACGGTGGAGCTGGTCAATGCATTGGAGCAGGAGAAGGCCAACGGCAAGTCCGGACAGTTGGC
CCACCGGCTGGCCTATGCCGATCTGGTGATTCTCGATGAGCTGGGTTACCTGCCGTTCAGCGCTTCAGGCGGTGCCTTGCTGTTTCACCTGCTGTCCAAG
CTGTACGAGCGCACCAGTGTCGTGATCACCACCAACCTGAGCTTCAGCGAATGGGCCGGCGTGTTCGGGGATGCCAAAATGACCACGGCACTGCTAGACC
GGCTCACGCATCACTGCCATATTCTGGAAACCGGTAACGACAGCTTCCGGTTCAAAAACAGCTCTGCACAGCAACCCCCACAGACGACCAAGAAGGAGAA
GATACCCAAGAACTTATCCACAACGTGAGCTTCAAGTTCACAAACAAGGGTGGGTCAGATCGAGATGAAAATAGTGGGTCAGGTTTGCATGGAAATCAAC
A
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1527 | 508 | 109 | 1635 | + | No |
Chemistry : DDE
ORF sequence :
VIDVATLSVIRRCALREQMSIREISRRTGLSRNTIRKYLRAGESEPHYAKRVSPSKLDPFAGKLAGWLKAEAGRSRKQRRTVKQMHADLQTLGYPGSYNR
VAAFVRLWHEERLVAQQTTGRGTFVPLTFGPGEAFQFDWSEDWAVIAGVRTKLQVAHFKLSHSRAFYLRAYLLQTHEMLFDAHNHAFAVFGGVPRRGIYD
NMRTAVDRVRRGKERDVNARFAAMASHFLFEAEFCNPASGWEKGQVEKNVCDARHRLWQVVPAFPALDDLNVWLEQRCQALWREIAHGKLPGTVAHVWAQ
ERAALMPVPRPFDGFVEHTKRVSPTCLIHFERNRYSVPAP*ANRPVSLRVYADRLVVAAEGQIVCEHQRLIERNHHGAGQTVYDWRHYLAVLQRKPGALR
NGAPFLELPGAFKRLQAALVKQPGGDREMVEVMALVLHHDEQAVLAAVELALEAGAASKTHILNVLHRLLDGKPAPAPVTSPQALKLSVEPQANVLRYDQ
LREVRYAS
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
804 | 268 | 1625 | 2428 | + | No |
AG : IS21 helper
ORF sequence :
MRHDPAIASIVIMLRELKMHGMAQAVTELAEQGAPAFEAAQPILSQLLKAETAEREVRSVAYQLKGARFPAYRDLAGFDFSHSEVNEALVRQLHRCAFLE
DANNVVLVGGPGTGKTHIATALGVQAIEHHHRRVRFLSTVELVNALEQEKANGKSGQLAHRLAYADLVILDELGYLPFSASGGALLFHLLSKLYERTSVV
ITTNLSFSEWAGVFGDAKMTTALLDRLTHHCHILETGNDSFRFKNSSAQQPPQTTKKEKIPKNLSTT
Blast result :
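The Begin/End coordinates of the two ORFs can be cross-checked against the listed nucleotide lengths. This is a minimal sketch, assuming the coordinates are 1-based and inclusive on the 2501 bp element; the function and variable names are illustrative.

```python
# ORF coordinates copied from the two ORF tables in this record.
ORF1 = (109, 1635)
ORF2 = (1625, 2428)


def orf_length(begin, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - begin + 1


def overlap(a, b):
    """Number of bases shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


len1 = orf_length(*ORF1)
len2 = orf_length(*ORF2)
shared = overlap(ORF1, ORF2)
# Reading-frame offset of ORF 2 relative to ORF 1 (0 means same frame).
frame_offset = (ORF2[0] - ORF1[0]) % 3

print(len1, len2, shared, frame_offset)
```

With these coordinates the lengths agree with the tables (1527 bp and 804 bp). The start of ORF 2 overlaps the end of ORF 1 by 11 bp, and the two ORFs are in different reading frames.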
Comments
ISThsp10 is 89% (ORFA) and 91% (ORFB) similar to IS1600 at the amino acid level.
There is an in-frame stop codon at position 341 of the transposase in each of the 3 copies.
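The stop codon mentioned in the comments can be located directly in the ORF 1 translation listed in this record. The sketch below assumes that the asterisk in the ORF sequence marks the stop codon and that position 341 is a 1-based amino acid position; the variable names are illustrative.

```python
# ORF 1 translation copied from this record, one string per 100-residue line.
ORF1_PROTEIN = (
    "VIDVATLSVIRRCALREQMSIREISRRTGLSRNTIRKYLRAGESEPHYAKRVSPSKLDPFAGKLAGWLKAEAGRSRKQRRTVKQMHADLQTLGYPGSYNR"
    "VAAFVRLWHEERLVAQQTTGRGTFVPLTFGPGEAFQFDWSEDWAVIAGVRTKLQVAHFKLSHSRAFYLRAYLLQTHEMLFDAHNHAFAVFGGVPRRGIYD"
    "NMRTAVDRVRRGKERDVNARFAAMASHFLFEAEFCNPASGWEKGQVEKNVCDARHRLWQVVPAFPALDDLNVWLEQRCQALWREIAHGKLPGTVAHVWAQ"
    "ERAALMPVPRPFDGFVEHTKRVSPTCLIHFERNRYSVPAP*ANRPVSLRVYADRLVVAAEGQIVCEHQRLIERNHHGAGQTVYDWRHYLAVLQRKPGALR"
    "NGAPFLELPGAFKRLQAALVKQPGGDREMVEVMALVLHHDEQAVLAAVELALEAGAASKTHILNVLHRLLDGKPAPAPVTSPQALKLSVEPQANVLRYDQ"
    "LREVRYAS"
)

# 1-based positions of every '*' (stop) symbol in the translation.
stop_positions = [i + 1 for i, aa in enumerate(ORF1_PROTEIN) if aa == "*"]

print(len(ORF1_PROTEIN), stop_positions)
```

For the sequence as listed, this reports a single internal stop at position 341 in a translation of 508 symbols, which matches the comment and the length given in the ORF 1 table.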
References
[1] ISfinder annotation