ISSoEn4
- Family ISL3
- Group
- Isoform
- Synonym(s) ISsope4
Accession number | Transposition | Origin | Host |
---|---|---|---|
AM921792 | ND | Primary endosymbiont | Primary endosymbiont of Sitophilus oryzae |
DNA section
IS Length : 1712 bp
Ends
IR Length : 24
IRL : GGGTCTTCCCCTGTTGTGGTGGCTAAGGGCATTATGATGGCAGTCTTGAT
IRR : GGGTCTTCCCCTGTTGTGGTGGCTCAATCGACAGTTCTGCATTTTTACTT
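The listed IR length can be cross-checked against the two 50 bp end sequences. The sketch below is illustrative only: the helper names `reverse_complement` and `shared_prefix_length` are hypothetical, and `RIGHT_END` is assumed to be the last 50 bases of the DNA sequence given in this record. It counts how many leading positions IRL and IRR share, and checks that IRR reads as the reverse complement of the right end.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq))


def shared_prefix_length(a: str, b: str) -> int:
    """Count identical positions from the start until the first mismatch."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


# End sequences copied from this record (50 bp each).
IRL = "GGGTCTTCCCCTGTTGTGGTGGCTAAGGGCATTATGATGGCAGTCTTGAT"
IRR = "GGGTCTTCCCCTGTTGTGGTGGCTCAATCGACAGTTCTGCATTTTTACTT"

# Assumption: the last 50 bases of the DNA sequence in this record.
RIGHT_END = "AAGTAAAAATGCAGAACTGTCGATTGAGCCACCACAACAGGGGAAGACCC"

ir_length = shared_prefix_length(IRL, IRR)
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR

print("perfectly matching terminal positions:", ir_length)
print("IRR is the reverse complement of the right end:", irr_matches_right_end)
```

This only measures the uninterrupted match from the tips, so the count should agree with the IR Length field only when the repeats are perfect. Imperfect repeats would need an alignment-based comparison instead.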
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AAAACACTGG | TCATTTAT | ACAGACTTAA | 8 |
GAAAGACTGT | ATAAACAA | ACAGGGGACG | 8 |
GCCAAAGTGT | ACATTTGT | ACATTATCAT | 8 |
TTTGTCCTGT | AGATTGAA | ACAGACATTG | 8 |
TTTGTCCTGT | AGATTGAA | ACAGACATTG | 8 |
CTGTATATAT | AT | GCATGTACAG | 2 |
ATAAGCATGG | AAATAAAA | ACACCGCCAC | 8 |
GTTGAATTAT | | TGCTTTGTTGA | 0 |
TGATTACTGG | TATTGTAT | CCAGTAGTTAT | 8 |
TTGTTATTAT | AT | AAATAAACAGT | 2 |
GTACAGTCAT | | ACAGTTCATAC | 0 |
TCAAGGCTGT | AAATAGAA | ACAGTACTCGT | 8 |
GTGTTACTGT | ATAAAAAC | ACAGTATCCGG | 8 |
GTCTGCATTA | | CTGTATTATCA | 0 |
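The direct repeat lengths in the insertion site table can be tallied with a short script. This is a minimal sketch: the tuple layout and the name `INSERTION_SITES` are assumptions made for illustration, and rows with no direct repeat are entered with an empty string.

```python
from collections import Counter

# (left flank, direct repeat, right flank), copied from the table above.
# Rows listed with a DR length of 0 are given an empty direct repeat.
INSERTION_SITES = [
    ("AAAACACTGG", "TCATTTAT", "ACAGACTTAA"),
    ("GAAAGACTGT", "ATAAACAA", "ACAGGGGACG"),
    ("GCCAAAGTGT", "ACATTTGT", "ACATTATCAT"),
    ("TTTGTCCTGT", "AGATTGAA", "ACAGACATTG"),
    ("TTTGTCCTGT", "AGATTGAA", "ACAGACATTG"),
    ("CTGTATATAT", "AT", "GCATGTACAG"),
    ("ATAAGCATGG", "AAATAAAA", "ACACCGCCAC"),
    ("GTTGAATTAT", "", "TGCTTTGTTGA"),
    ("TGATTACTGG", "TATTGTAT", "CCAGTAGTTAT"),
    ("TTGTTATTAT", "AT", "AAATAAACAGT"),
    ("GTACAGTCAT", "", "ACAGTTCATAC"),
    ("TCAAGGCTGT", "AAATAGAA", "ACAGTACTCGT"),
    ("GTGTTACTGT", "ATAAAAAC", "ACAGTATCCGG"),
    ("GTCTGCATTA", "", "CTGTATTATCA"),
]

# Tally how many insertion sites carry a direct repeat of each length.
dr_length_counts = Counter(len(dr) for _, dr, _ in INSERTION_SITES)

for length, count in sorted(dr_length_counts.items(), reverse=True):
    print(f"DR length {length}: {count} site(s)")
```

The tally depends only on the length of the direct repeat string in each row, so it can be compared directly with the DR Length column.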
DNA sequence
GGGTCTTCCCCTGTTGTGGTGGCTAAGGGCATTATGATGGCAGTCTTGATTTTTGAGGAGTCTGCCATGGACGAAAAGTCCCTCTATGCCCATATCCTTA
ACCTGTCCGCACCGTGGCAGGTACAATCCCTTTCTCTTAATGAAAAATCTGGATCAGTGACGGTAATTGTCGGCATTGCCGAGCACACACAACTGGCCTG
CCCAACCTGCGGTAAATCCTGCTCCATACATGATCACCGGCGTCGTAAATGGCGTCACCTCGATACCTGTCAGTTCACCACGCTGGTTGAGGCTGATGTC
CCCCGCGTTGACTGCCCCGAGCACGGTTGCCAGACACTGCCTGTTCCCTGGGCAGGGTCAGGCAGCCGCTACACCTTGTTGTTCGAAGCCTTTGTTCTTT
CATGGCTGAAAGTCAGCACCGTGGATGCTGTCAGAAAGCAGCTCAAACTCAGTTGGAATGCCGTTGAAGGCATCATGATGCGCGCAGTCAAACGAGGCTT
GGCCCGGATAAAACAACCCTTATCCGCCCGTTACCTCTGCGTGGATGAAGTCGGGTTCAAAAAAGGACACCAGTACGTCACCGTTATCTCTGACAGGCAG
GGACGCGCTTTGCAACTGACCGACGACCGCGGTGTAGAAAGCCTTGCCAGTTATCTGCGCAGCCTGAGAGATCACCAGCTTGATGAGATAAAAACGCTGT
CTATGGACATGAACATGGCCTATATCAGTGCAGCCCGCATCCATCTTCCCAATGCCGTCGATAAAATTGCTTTCGATCACTTCCATGTGGCAAAAATGTT
GTGCGCCGTCGTTGATAAAACCCGTCAGGGTGAGATGAAACAGATCCCGTCGTCAGACAGGAAAGACGCCCACCGCTCACGCTATCTATGGTTTTACAGC
AAACAAAATCGCCTCGGGTGCTGGGCAGAGAGGTTAGAAGTTGCCCGGCTGATGTTACCCCAAACGAGCCAGTGCTGGGTAATGAAAGAGCTTGCTCGCG
ATCTGTGGCACCGCCGCTATGACAATCATAGCCGTAAGCTGTGGCAGGAATGGATGGCGATGGCTAAAGACACCGGCATACCGCTCATGGCCAGCATTGC
CCGCATGGTGGCAAAACGCCTTTACGGCATTCTGAATGCAATGAAAAACCGGGTATCAAATGGGAATGCGGAGTCCCTGAACAGCAAAATACGGCTGTTC
AGGATCAAGTCACAGGGCTTCAGGAACAAAGAACGTTTCAAGCTGGGCGTAATGTTCCACTATGGGAAACTAAATATGAATTTTTGAGCAATCATTTAAG
AGAGTTATAAAATTTTTTTTGCTGGTGATGTTGTATTCGCTCGTCATTCTCCTTTTCTTTTTCCTTTTGCCACATATTACTGCATCCCGGATCGCTAGGG
GTAAGACAATATTGAGTTTTATCAACTTGAGATTCACTTTTCTCCCAAGATAATTTTGGCGTTACCTCTGGTGACAGTGAATAGTCTTTCTTATCAGTGT
CATACATCATATTTGGTGAGGTCGATGCTGCAACACCATGAAGCAGTAAGATAAATGGGGTATATGCATATTTTTTTAAGCATGACATGAACGTTATTCC
TTTAAACGTAAATATGTTTGACATCCATACAGAAGTCCAAGTTCCAGCCAAAAGAATAAGTAAAGTAAAAATGCAGAACTGTCGATTGAGCCACCACAAC
AGGGGAAGACCC
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1221 bp | 406 aa | 67 | 1287 | + | No |
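The ORF coordinates can be sanity-checked with simple arithmetic and a partial translation. The sketch below makes several assumptions: coordinates are 1-based and inclusive, the ORF ends with a stop codon that is not counted in the amino acid length, and the standard codon assignments apply. `FIRST_LINE` holds only the first 100 bases of the DNA sequence from this record, and the names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
# Compact standard codon table: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate complete codons from the start of dna, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# ORF coordinates from the table above (assumed 1-based, inclusive).
begin, end = 67, 1287
orf_bp = end - begin + 1   # nucleotides spanned by the ORF
orf_aa = orf_bp // 3 - 1   # codons minus the assumed terminal stop codon

# First 100 bases of the DNA sequence in this record.
FIRST_LINE = (
    "GGGTCTTCCCCTGTTGTGGTGGCTAAGGGCATTATGATGGCAGTCTTGAT"
    "TTTTGAGGAGTCTGCCATGGACGAAAAGTCCCTCTATGCCCATATCCTTA"
)
n_term = translate(FIRST_LINE[begin - 1:])

print(orf_bp, "bp;", orf_aa, "aa; N-terminus:", n_term)
```

Only the N-terminal residues are translated here, which is enough to confirm the reading frame against the start of the ORF sequence. Translating the full ORF would need the complete DNA sequence.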
Chemistry : Unknown
ORF sequence :
MDEKSLYAHILNLSAPWQVQSLSLNEKSGSVTVIVGIAEHTQLACPTCGKSCSIHDHRRRKWRHLDTCQFTTLVEADVPRVDCPEHGCQTLPVPWAGSGS
RYTLLFEAFVLSWLKVSTVDAVRKQLKLSWNAVEGIMMRAVKRGLARIKQPLSARYLCVDEVGFKKGHQYVTVISDRQGRALQLTDDRGVESLASYLRSL
RDHQLDEIKTLSMDMNMAYISAARIHLPNAVDKIAFDHFHVAKMLCAVVDKTRQGEMKQIPSSDRKDAHRSRYLWFYSKQNRLGCWAERLEVARLMLPQT
SQCWVMKELARDLWHRRYDNHSRKLWQEWMAMAKDTGIPLMASIARMVAKRLYGILNAMKNRVSNGNAESLNSKIRLFRIKSQGFRNKERFKLGVMFHYG
KLNMNF
Comments
ISSoEn4 shows 65% amino acid similarity to ISShes3. There are 14 complete copies in the genome.
References
[1] ISfinder submission (2011)
[2] Kelly Oakeson (2011) Direct submission
[3] Gil, R., Belda, E., Gosalbes, M.J., Delaye, L., Vallier, A., Vincent-Monegat, C., Heddi, A., Silva, F.J., Moya, A. and Latorre, A. (2008) Int. Microbiol. 11 (1), 41-48.