ISThma1
- Family ISNCY
- Group ISDol1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AFWV01000002 | ND | Thiocapsa marina | Thiocapsa marina 5811 |
DNA section
IS Length : 1901 bp
Ends
IR Length : 19/26
IRL : GTGCCTGAACAGAAACCACGTTTTCTATTTCCGAACAGTGGGTTATAATA
IRR : GTGCTCGACCGAAAACCCCGTTTTTTCCCGCCCCGGCACCTCGCCCTGAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GAGGCATTCT | TAACTA | CCCATACGTC | 6 |
CCCTTAGTAT | TAATTA | TAACCTCTTT | 6 |
CCGATTGCTC | AAGCTA | GATTGGAGTA | 6 |
DNA sequence
GTGCCTGAACAGAAACCACGTTTTCTATTTCCGAACAGTGGGTTATAATATGTCCCCATGATCGAAGATCATGCCGATTTTCGATCGAGAGGGGCAGAGA
AAACGCTAACCTATTGATTTCCTGATGACTCGAAGGCGATTTCGGCCGATTTTCGCCGGGGCATGCCATGAGTCTTCATGAACGTCGTGATTCGAGCCAC
TATGCGCAACGTGATCGATCCGCAATTGAAACTCGGCGAAGAGGATATCGCCGCCATCGCTCTGGATCCGCGCTCGCGCGACGATATCCCGCAGCTCTTG
CGTGGACTGCAGCACATCTACACCACCCCGCAGGTGCGTGAGCCGGTGTTCGCGATCCTCGCCGAGCTGCGCCCGCCGGGCAGCGGCGAGGACGGCAAGG
CGAGCCCCGAGACCGGCCGGCCCGGTCTGTCGCAGTGGGCGATCCTGGTGCTCGGCGTGCTGCGCTTGGGACTGAACGCCGATTACGACCGCATTCTCGA
GCTGGCCAATCAGCACACCACTCTGCGCAAGATGCTCGGCCATGCCGACTGGGCCGACGACACGACCTACAATCTGCAGACGCTCAAGGACAACCTGAGC
CTGTTCACGCCCGAGCTGCTCGAGCGCATCAACCAAGCGCTGGTGGGTGCCGGCCATGCCTTGGTAAAAAAAAGTCCGGACGATTCCCTCGCGGTGCGCT
GTGACTCCTTCGTGGTCGAGACCCACGTGCATTATCCGACCGACATCAACCTGCTCTTCGATGCGGTGCGCAAGGCCATCGAGACCAGCGCCGGATTGTG
CGAGGACGCCGGCCTGAGCGATTGGCGGCAAAGCGCCTACAACGTGCGCTGCCTGAAGAAAGCCTATCGGCGGGCGCAAACCCTCAAGCACTCCACGGCG
CAGGATCCGGAGAAACGGGCCGCACGGCGTATCGAGATCGAAGCGGCCCATGCCGCCTACCTGGACCTGGCCGGGGGCTTTCTGATGCGGGCGCGCGAGA
CGCGCATCCGCCTGCACCTGATCGCCGCGTTGCCGGACGTCCAGCTCGCGCCGCTCGATGCCTTCATCGCCCACGGCGAGCGGCAGATCGACCAGATCCG
CCGCCGCGTGCTGTGCGGTCAGACCATCCCGCATGCCGAGAAGGTCTTCTCGATCTTCCAACCCCATACCGAGTGGATCAGCAAAGGCAAGGCCGGCGTG
CCGGTGGAGCTGGGTCTGCGCGTGGCGATCGGCGAAGATCAGCACGGCTTCATCCTGCACCATCGGGTCATGGAGCGCATCACCGACGATCAGGTCGCCG
TTCCCTTGGTGGAAGAGATCGTGGCGCGCTTCCCGGCGGTCGGGAGCGTCAGCATGGACAAGGGCTTTCACAGCCCCGCCAACCAACGCGCCCTGGCCGA
GGTGATCGACTTTCCGGTGCTGCCGAAGAAGGGCAAGTGCTCGGCCGCGGAGCGCGAACGCGAAGGCGATCCGCGCTTCATCCAGCTGCGCCGCAAACAC
TCCGCCGTGGAGTCCGCCATCAACGCGCTGGAGGTCCACGGGCTCGACCGCTGTCGCGATCACGGGATCGACGGCTTCAAGCGCTATGTCGCCTTGGCGG
TGGTGGCGCGCAACCTCCAGCGCATCGGTACGCTGCTGCTGGCGCAGGAGGCCGAGGAGGCGCGGCGCGAGCGGGAACGACAGCGCCGCCGACGCGCTGC
TTGACCCTTTCGATCGGATCCGGTGCCGACCCCCACGGGAGTGGTGCGCCTGCGGAATGCCGGTGATGCGAGTCATTCGCATTAATCCGGTTTAATTCCA
CGAATCCGCATCGAATGCGGCTCGATTGAATACGCTGGCCCCGGCCCGCCATTCAGGGCGAGGTGCCGGGGCGGGAAAAAACGGGGTTTTCGGTCGAGCA
C
AAACGCTAACCTATTGATTTCCTGATGACTCGAAGGCGATTTCGGCCGATTTTCGCCGGGGCATGCCATGAGTCTTCATGAACGTCGTGATTCGAGCCAC
TATGCGCAACGTGATCGATCCGCAATTGAAACTCGGCGAAGAGGATATCGCCGCCATCGCTCTGGATCCGCGCTCGCGCGACGATATCCCGCAGCTCTTG
CGTGGACTGCAGCACATCTACACCACCCCGCAGGTGCGTGAGCCGGTGTTCGCGATCCTCGCCGAGCTGCGCCCGCCGGGCAGCGGCGAGGACGGCAAGG
CGAGCCCCGAGACCGGCCGGCCCGGTCTGTCGCAGTGGGCGATCCTGGTGCTCGGCGTGCTGCGCTTGGGACTGAACGCCGATTACGACCGCATTCTCGA
GCTGGCCAATCAGCACACCACTCTGCGCAAGATGCTCGGCCATGCCGACTGGGCCGACGACACGACCTACAATCTGCAGACGCTCAAGGACAACCTGAGC
CTGTTCACGCCCGAGCTGCTCGAGCGCATCAACCAAGCGCTGGTGGGTGCCGGCCATGCCTTGGTAAAAAAAAGTCCGGACGATTCCCTCGCGGTGCGCT
GTGACTCCTTCGTGGTCGAGACCCACGTGCATTATCCGACCGACATCAACCTGCTCTTCGATGCGGTGCGCAAGGCCATCGAGACCAGCGCCGGATTGTG
CGAGGACGCCGGCCTGAGCGATTGGCGGCAAAGCGCCTACAACGTGCGCTGCCTGAAGAAAGCCTATCGGCGGGCGCAAACCCTCAAGCACTCCACGGCG
CAGGATCCGGAGAAACGGGCCGCACGGCGTATCGAGATCGAAGCGGCCCATGCCGCCTACCTGGACCTGGCCGGGGGCTTTCTGATGCGGGCGCGCGAGA
CGCGCATCCGCCTGCACCTGATCGCCGCGTTGCCGGACGTCCAGCTCGCGCCGCTCGATGCCTTCATCGCCCACGGCGAGCGGCAGATCGACCAGATCCG
CCGCCGCGTGCTGTGCGGTCAGACCATCCCGCATGCCGAGAAGGTCTTCTCGATCTTCCAACCCCATACCGAGTGGATCAGCAAAGGCAAGGCCGGCGTG
CCGGTGGAGCTGGGTCTGCGCGTGGCGATCGGCGAAGATCAGCACGGCTTCATCCTGCACCATCGGGTCATGGAGCGCATCACCGACGATCAGGTCGCCG
TTCCCTTGGTGGAAGAGATCGTGGCGCGCTTCCCGGCGGTCGGGAGCGTCAGCATGGACAAGGGCTTTCACAGCCCCGCCAACCAACGCGCCCTGGCCGA
GGTGATCGACTTTCCGGTGCTGCCGAAGAAGGGCAAGTGCTCGGCCGCGGAGCGCGAACGCGAAGGCGATCCGCGCTTCATCCAGCTGCGCCGCAAACAC
TCCGCCGTGGAGTCCGCCATCAACGCGCTGGAGGTCCACGGGCTCGACCGCTGTCGCGATCACGGGATCGACGGCTTCAAGCGCTATGTCGCCTTGGCGG
TGGTGGCGCGCAACCTCCAGCGCATCGGTACGCTGCTGCTGGCGCAGGAGGCCGAGGAGGCGCGGCGCGAGCGGGAACGACAGCGCCGCCGACGCGCTGC
TTGACCCTTTCGATCGGATCCGGTGCCGACCCCCACGGGAGTGGTGCGCCTGCGGAATGCCGGTGATGCGAGTCATTCGCATTAATCCGGTTTAATTCCA
CGAATCCGCATCGAATGCGGCTCGATTGAATACGCTGGCCCCGGCCCGCCATTCAGGGCGAGGTGCCGGGGCGGGAAAAAACGGGGTTTTCGGTCGAGCA
C
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1527 bp | 508 aa | 178 | 1704 | + | No |
Chemistry : DDE
ORF sequence :
MNVVIRATMRNVIDPQLKLGEEDIAAIALDPRSRDDIPQLLRGLQHIYTTPQVREPVFAILAELRPPGSGEDGKASPETGRPGLSQWAILVLGVLRLGLN
ADYDRILELANQHTTLRKMLGHADWADDTTYNLQTLKDNLSLFTPELLERINQALVGAGHALVKKSPDDSLAVRCDSFVVETHVHYPTDINLLFDAVRKA
IETSAGLCEDAGLSDWRQSAYNVRCLKKAYRRAQTLKHSTAQDPEKRAARRIEIEAAHAAYLDLAGGFLMRARETRIRLHLIAALPDVQLAPLDAFIAHG
ERQIDQIRRRVLCGQTIPHAEKVFSIFQPHTEWISKGKAGVPVELGLRVAIGEDQHGFILHHRVMERITDDQVAVPLVEEIVARFPAVGSVSMDKGFHSP
ANQRALAEVIDFPVLPKKGKCSAAEREREGDPRFIQLRRKHSAVESAINALEVHGLDRCRDHGIDGFKRYVALAVVARNLQRIGTLLLAQEAEEARRERE
RQRRRRAA
ADYDRILELANQHTTLRKMLGHADWADDTTYNLQTLKDNLSLFTPELLERINQALVGAGHALVKKSPDDSLAVRCDSFVVETHVHYPTDINLLFDAVRKA
IETSAGLCEDAGLSDWRQSAYNVRCLKKAYRRAQTLKHSTAQDPEKRAARRIEIEAAHAAYLDLAGGFLMRARETRIRLHLIAALPDVQLAPLDAFIAHG
ERQIDQIRRRVLCGQTIPHAEKVFSIFQPHTEWISKGKAGVPVELGLRVAIGEDQHGFILHHRVMERITDDQVAVPLVEEIVARFPAVGSVSMDKGFHSP
ANQRALAEVIDFPVLPKKGKCSAAEREREGDPRFIQLRRKHSAVESAINALEVHGLDRCRDHGIDGFKRYVALAVVARNLQRIGTLLLAQEAEEARRERE
RQRRRRAA
Blast result :
Comments
ISThma1 is 72% aa similar to ISSymo1.
References
1] ISfinder annotation (2016)
2] Lucas,S., Han,J., Cheng,J.-F., Goodwin,L., Pitluck,S., Peters,L., Land,M.L., Hauser,L., Vogl,K., Liu,Z., Imhoff,J., Thiel,V., Frigaard,N.-U., Bryant,D. and Woyke,T.J. (2011) Direct submission GenBank.
2] Lucas,S., Han,J., Cheng,J.-F., Goodwin,L., Pitluck,S., Peters,L., Land,M.L., Hauser,L., Vogl,K., Liu,Z., Imhoff,J., Thiel,V., Frigaard,N.-U., Bryant,D. and Woyke,T.J. (2011) Direct submission GenBank.