ISSoEn3
- Family IS21
- Group
Isoform Synonym(s) ISsope3
Accession number | Transposition | Origin | Host |
---|---|---|---|
AM921791 | ND | Primary endosymbiont | Primary endosymbiont of Sitophilus oryzae |
DNA section
IS Length : 2600 bp
Ends
IR Length : 33/48
IRL : TGCTGGTTCCGGTCACATCCGATCACTGATTCCGATTTCACCCGATCACT
IRR : TGCAGATTCCGACGTATCCGATCAGCTGTTCCGGCGACATCCGATCAGTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TTCCCGTTCA | CCCAT | CATCTCGCCC | 5 |
TTTACGCCCG | CCACG | GCGGTAAAAT | 5 |
CCGCTACAAT | ACATCAAGGC | 0 | |
GCCTTTCGTT | GCCCG | CAGGGCGTTA | 5 |
CTTTGCCATG | AACAGCCGGC | 0 | |
GCTGTGGTGG | CTGCCCGCGT | 0 | |
GAGGCAGTGT | ATACTCGCGT | 0 | |
GTCCGGCAGG | ACAGC | CGCATCATCA | 5 |
TGCCTGAGCG | GTACT | GGCGGCCAGG | 5 |
TTCTAACCTA | CCTAC | CCCTAACTAC | 5 |
TTCATTTTCG | GCCAAC | CACATACCTA | 6 |
GTCGTCCGCC | GCTTT | TCTGCGCGGG | 5 |
GCGTTACTGG | CACGGCACCT | 0 | |
TGCGGCTGGA | CCGGT | GGCACTGCAC | 5 |
TTTGATATGT | GAGCCTGTTT | 0 | |
CTTCGATGTT | GATGC | CCAGCGCCAG | 5 |
TTCTAAACAG | TAATCAGATC | 0 | |
GGGGATTACG | GGGGGGGGGG | CGCGCTTCGC | 10 |
GGTGACTGCG | CCAGC | CACGACAGCA | 5 |
GCCCAGCGCC | AGGAA | CACAGATTTG | 5 |
ATCCGCCTGT | GAGGCCTGTT | 0 |
DNA sequence
TGCTGGTTCCGGTCACATCCGATCACTGATTCCGATTTCACCCGATCACTAATTCTGATTTCATCCGATCACTGATTCCGGTCGCCCGATCAGCGATTCC
GATTCTGTCCGATCGCTCATCTTCTGTTCCGCCATACTCTGGAGACTTTTAGCTTCCGGGGGCATGGCATGGCACGTAAAAAAAAGAAAGCGAGAACGGA
AATGTGCATCTATATTAATGTCTTACGTATGAAATTCGAGCAGCGTCGCTCGAATCGCACTATCGCAGCAGCGCTCGGCATAGGCTGTACTACCGTGCAC
GATATCCTCGGCCGATTCACGGTAGTTAACCTGGTCTGGCCATTGCCGGCGGAACTGTCCCCCGTCGACCTCGACCGCCTGCTCTATCCCGGCAAATCCG
GAAAAGTTATCAATACATTACCCAGCTGGCTTGATATCGATACCGAGTTAAGCCGCAAGGGCATGACCAAGCAGCTGCTCTGGATGGAATATCAGTCCGC
CGTGGGCGGTGATGCCCTCGGTTACTCACAGTTTTGTGCACTGTTCCGTGACTGGAAAAAGAAGCAGAGGCGTTCTATGCGCATGGAGCACAAGGCTGGC
GAAAAGCTCTTCATCAACTTCTGTGGCCCCACCGTACCTATCGTCAACCCTGCGACCAGTAGCGTACGCCAGGTCGCTATCTTCGTCGCTGCCATGGGCG
TGTCAGGCTATGCGTATATCGAAGCCTGCGAAAGCCAGGACATGGCATCGTGGCTCAACGCCAATAGCCGCTGTCTGCACTTCATGGGTGGGGTTCCGGA
GCTGATGATACCTGATAATCTGCGCAGCGCTGTCAGCACCCCTGACCGCTATGAGCCGGTCATAAACCAGAGCTACCAGGCGCTGGCAAATCACTATGAG
ACAGTGGTGCTACCGGCGCGCCCGAGAAAACCGAAAGACAAGGCGAAGGCAGAATCAACTGTGCAGCTGGTAGAACGCTGGGTTTTGGCCCGGTTGCGTA
AACGTAGGTTCTACTCGCTGCCCGAACTCAACCAGGTGATACGAGAACTCAATCATGAGTTGAATCTGCGCCCGATGCGTCATTACGGCGGACAAAGTCG
CCTTGAACGCTTCGAGCAGCTGGACAAACCGGCTCTTGGGCCTCTACCGCCCACACAATGGGAATACAGTGAGTATCTCGTTGCCCGAGTGGGACCTGAT
TACCACATAGACTACGGCAAAAACTGGTACTCGGTGCCGCATCCGCTGGTTGGCGAGCGCGTTGACGTCATCGCCACCCAACGGCTGGTGCAAATCCACC
ATAAGGGCGTCTGCGTGGCTACGCACCCTCGCAGCGATAACGCCTATAGGCACACGACTCAGGCGGCGCACATGCCGGCTAACCATAAGGGGCAGAGTCA
GTGGACGCCGGAAAGGCTGTGCAGTTGGGCGCTGTCGGTGGGTGTGTGCACACTGAAAGTGGTCGAGTCCATCCAAAAGAGCAAAGCCCATCCGGAGCAG
GCTTACCGCTCCGTGCTGGGGCTACTCAATCTGCAACGGCGCTATGAGACGACGCGACTGGAGAAGGCCTGCGTGCTGGCGTTGGAGAAAGGGTGCATTA
ACCGCTCTTTCATAGCCAACGTATTGAAACACGGTCGTGAAAGTGAGGTCACCCAGGACGGAGCCGGCGTATCAATGCTGGTTCACGAAAACCTCCGAGG
TCCGGACAGTTATCACTAAGGAGAATAAATATGGATACACTGTTAATGGCTCTGCGAGAGCTGAAGTTGTCGGCAATGGTCCAGGCGTTGGAGACGCAAC
GCGAACTCCCGGGGAGTTATGGGGAGCTGGGGTTCGAGGAGCGGTTGTCGCTGATGGTAGAAGCGGAAAATTTACATAGAAAAAACAACCACATATACCG
TATTCGACGGCAATCGCAAATGCGCTTGCAGGCAAAACCGGAAGATATCCGCTATATCCCTAGCCGAGGAGTGACACCGGAACAGATGCGAGATCTGCTA
GGGGGACAATATCTGAAATATCAGAAAAGCATACTCATCACGGGGCCGACAGGTACGGGCAAAACCTGGCTCAGTTGTGCGCTTAGTGAGCAGGCATGCC
GGCAGCAATATAGCGTGCGTTACTGGCGAGTGGGTCGGTTGCTGGCCCATCTTCACCAGTGTCAGGTAGACGGGACCTATCTAAAACAGCTTAAGCAGTT
AGAAAAAATAGAGTTACTGATCTTGGACGATGTGGGCCTAGAATCAATAAGTCCGATGCAGGCAACGATGCTGTTGGAGGTGATGGAAGATCGCTACGAC
AAAAGCAGCAGCATCCTGATCAGTCAACTGCCGGTGAAAAAATGGTATGGACTGATAGAAAACCCCACGACAGCTGACGCGTTACTCGATCGGTCAGTAC
ACCCCAGCTATAGACTGGAACTTAAAGGCGAATCACTACGCAAAGAGCAAGGAGTAGCCAGCACAGGAAAAATAGACTAAACCCGAGTCAGAAGATGAGC
GAACACGTGATCGAATATCACTGGAATGGGTGATCGGAAAATATCGGAATAACTGATCGGATGTCGCCGGAACAGCTGATCGGATACGTCGGAATCTGCA
GATTCTGTCCGATCGCTCATCTTCTGTTCCGCCATACTCTGGAGACTTTTAGCTTCCGGGGGCATGGCATGGCACGTAAAAAAAAGAAAGCGAGAACGGA
AATGTGCATCTATATTAATGTCTTACGTATGAAATTCGAGCAGCGTCGCTCGAATCGCACTATCGCAGCAGCGCTCGGCATAGGCTGTACTACCGTGCAC
GATATCCTCGGCCGATTCACGGTAGTTAACCTGGTCTGGCCATTGCCGGCGGAACTGTCCCCCGTCGACCTCGACCGCCTGCTCTATCCCGGCAAATCCG
GAAAAGTTATCAATACATTACCCAGCTGGCTTGATATCGATACCGAGTTAAGCCGCAAGGGCATGACCAAGCAGCTGCTCTGGATGGAATATCAGTCCGC
CGTGGGCGGTGATGCCCTCGGTTACTCACAGTTTTGTGCACTGTTCCGTGACTGGAAAAAGAAGCAGAGGCGTTCTATGCGCATGGAGCACAAGGCTGGC
GAAAAGCTCTTCATCAACTTCTGTGGCCCCACCGTACCTATCGTCAACCCTGCGACCAGTAGCGTACGCCAGGTCGCTATCTTCGTCGCTGCCATGGGCG
TGTCAGGCTATGCGTATATCGAAGCCTGCGAAAGCCAGGACATGGCATCGTGGCTCAACGCCAATAGCCGCTGTCTGCACTTCATGGGTGGGGTTCCGGA
GCTGATGATACCTGATAATCTGCGCAGCGCTGTCAGCACCCCTGACCGCTATGAGCCGGTCATAAACCAGAGCTACCAGGCGCTGGCAAATCACTATGAG
ACAGTGGTGCTACCGGCGCGCCCGAGAAAACCGAAAGACAAGGCGAAGGCAGAATCAACTGTGCAGCTGGTAGAACGCTGGGTTTTGGCCCGGTTGCGTA
AACGTAGGTTCTACTCGCTGCCCGAACTCAACCAGGTGATACGAGAACTCAATCATGAGTTGAATCTGCGCCCGATGCGTCATTACGGCGGACAAAGTCG
CCTTGAACGCTTCGAGCAGCTGGACAAACCGGCTCTTGGGCCTCTACCGCCCACACAATGGGAATACAGTGAGTATCTCGTTGCCCGAGTGGGACCTGAT
TACCACATAGACTACGGCAAAAACTGGTACTCGGTGCCGCATCCGCTGGTTGGCGAGCGCGTTGACGTCATCGCCACCCAACGGCTGGTGCAAATCCACC
ATAAGGGCGTCTGCGTGGCTACGCACCCTCGCAGCGATAACGCCTATAGGCACACGACTCAGGCGGCGCACATGCCGGCTAACCATAAGGGGCAGAGTCA
GTGGACGCCGGAAAGGCTGTGCAGTTGGGCGCTGTCGGTGGGTGTGTGCACACTGAAAGTGGTCGAGTCCATCCAAAAGAGCAAAGCCCATCCGGAGCAG
GCTTACCGCTCCGTGCTGGGGCTACTCAATCTGCAACGGCGCTATGAGACGACGCGACTGGAGAAGGCCTGCGTGCTGGCGTTGGAGAAAGGGTGCATTA
ACCGCTCTTTCATAGCCAACGTATTGAAACACGGTCGTGAAAGTGAGGTCACCCAGGACGGAGCCGGCGTATCAATGCTGGTTCACGAAAACCTCCGAGG
TCCGGACAGTTATCACTAAGGAGAATAAATATGGATACACTGTTAATGGCTCTGCGAGAGCTGAAGTTGTCGGCAATGGTCCAGGCGTTGGAGACGCAAC
GCGAACTCCCGGGGAGTTATGGGGAGCTGGGGTTCGAGGAGCGGTTGTCGCTGATGGTAGAAGCGGAAAATTTACATAGAAAAAACAACCACATATACCG
TATTCGACGGCAATCGCAAATGCGCTTGCAGGCAAAACCGGAAGATATCCGCTATATCCCTAGCCGAGGAGTGACACCGGAACAGATGCGAGATCTGCTA
GGGGGACAATATCTGAAATATCAGAAAAGCATACTCATCACGGGGCCGACAGGTACGGGCAAAACCTGGCTCAGTTGTGCGCTTAGTGAGCAGGCATGCC
GGCAGCAATATAGCGTGCGTTACTGGCGAGTGGGTCGGTTGCTGGCCCATCTTCACCAGTGTCAGGTAGACGGGACCTATCTAAAACAGCTTAAGCAGTT
AGAAAAAATAGAGTTACTGATCTTGGACGATGTGGGCCTAGAATCAATAAGTCCGATGCAGGCAACGATGCTGTTGGAGGTGATGGAAGATCGCTACGAC
AAAAGCAGCAGCATCCTGATCAGTCAACTGCCGGTGAAAAAATGGTATGGACTGATAGAAAACCCCACGACAGCTGACGCGTTACTCGATCGGTCAGTAC
ACCCCAGCTATAGACTGGAACTTAAAGGCGAATCACTACGCAAAGAGCAAGGAGTAGCCAGCACAGGAAAAATAGACTAAACCCGAGTCAGAAGATGAGC
GAACACGTGATCGAATATCACTGGAATGGGTGATCGGAAAATATCGGAATAACTGATCGGATGTCGCCGGAACAGCTGATCGGATACGTCGGAATCTGCA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1551 bp | 516 aa | 169 | 1719 | + | No |
Chemistry : DDE
ORF sequence :
MARKKKKARTEMCIYINVLRMKFEQRRSNRTIAAALGIGCTTVHDILGRFTVVNLVWPLPAELSPVDLDRLLYPGKSGKVINTLPSWLDIDTELSRKGMT
KQLLWMEYQSAVGGDALGYSQFCALFRDWKKKQRRSMRMEHKAGEKLFINFCGPTVPIVNPATSSVRQVAIFVAAMGVSGYAYIEACESQDMASWLNANS
RCLHFMGGVPELMIPDNLRSAVSTPDRYEPVINQSYQALANHYETVVLPARPRKPKDKAKAESTVQLVERWVLARLRKRRFYSLPELNQVIRELNHELNL
RPMRHYGGQSRLERFEQLDKPALGPLPPTQWEYSEYLVARVGPDYHIDYGKNWYSVPHPLVGERVDVIATQRLVQIHHKGVCVATHPRSDNAYRHTTQAA
HMPANHKGQSQWTPERLCSWALSVGVCTLKVVESIQKSKAHPEQAYRSVLGLLNLQRRYETTRLEKACVLALEKGCINRSFIANVLKHGRESEVTQDGAG
VSMLVHENLRGPDSYH
KQLLWMEYQSAVGGDALGYSQFCALFRDWKKKQRRSMRMEHKAGEKLFINFCGPTVPIVNPATSSVRQVAIFVAAMGVSGYAYIEACESQDMASWLNANS
RCLHFMGGVPELMIPDNLRSAVSTPDRYEPVINQSYQALANHYETVVLPARPRKPKDKAKAESTVQLVERWVLARLRKRRFYSLPELNQVIRELNHELNL
RPMRHYGGQSRLERFEQLDKPALGPLPPTQWEYSEYLVARVGPDYHIDYGKNWYSVPHPLVGERVDVIATQRLVQIHHKGVCVATHPRSDNAYRHTTQAA
HMPANHKGQSQWTPERLCSWALSVGVCTLKVVESIQKSKAHPEQAYRSVLGLLNLQRRYETTRLEKACVLALEKGCINRSFIANVLKHGRESEVTQDGAG
VSMLVHENLRGPDSYH
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
750 bp | 249 aa | 1731 | 2480 | + | No |
AG : IS21 helper
ORF sequence :
MDTLLMALRELKLSAMVQALETQRELPGSYGELGFEERLSLMVEAENLHRKNNHIYRIRRQSQMRLQAKPEDIRYIPSRGVTPEQMRDLLGGQYLKYQKS
ILITGPTGTGKTWLSCALSEQACRQQYSVRYWRVGRLLAHLHQCQVDGTYLKQLKQLEKIELLILDDVGLESISPMQATMLLEVMEDRYDKSSSILISQL
PVKKWYGLIENPTTADALLDRSVHPSYRLELKGESLRKEQGVASTGKID
ILITGPTGTGKTWLSCALSEQACRQQYSVRYWRVGRLLAHLHQCQVDGTYLKQLKQLEKIELLILDDVGLESISPMQATMLLEVMEDRYDKSSSILISQL
PVKKWYGLIENPTTADALLDRSVHPSYRLELKGESLRKEQGVASTGKID
Blast result :
Comments
There are 45 complete copies in the genome.
ISSoEn3 is 70%(ORFA) and 77%(ORFB) aa similar to ISSso4.
ISSoEn3 is 70%(ORFA) and 77%(ORFB) aa similar to ISSso4.
References
1] ISfinder submission (2011)
2] Kelly Oakeson (2011) Direct submission
3] Gil,R., Belda,E., Gosalbes,M.J., Delaye,L., Vallier,A., Vincent-Monegat,C., Heddi,A., Silva,F.J., Moya,A. and Latorre,A.(2008) Int. Microbiol. 11 (1), 41-48.
2] Kelly Oakeson (2011) Direct submission
3] Gil,R., Belda,E., Gosalbes,M.J., Delaye,L., Vallier,A., Vincent-Monegat,C., Heddi,A., Silva,F.J., Moya,A. and Latorre,A.(2008) Int. Microbiol. 11 (1), 41-48.