ISGsp3
- Family ISLre2
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_012793 | ND | Geobacillus sp. | Anoxybacillus flavithermus WK1 Geobacillus kaustophilus HTA426 Geobacillus sp. WCH70 Geobacillus kaustophilus HTA426 plasmid pHTA426 Geobacillus thermoleovorans CCB_US3_UF5 |
DNA section
IS Length : 1519 bp
Ends
IR Length : 20/29
IRL : GTGTGAGTAAAATATTTGTGGGTGACATTCAGGAATGATTCCCCGACGAG
IRR : GAGTCAGTCAAGACTTTGTGGAGAACATTTTTTCGGTTCACTACGGGCGT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATCCTCTAGG | CGCGTGGGTG | TAATATCTTG | 10 |
ACCGCCGTTT | CTAAGGCTTA | TTGCCTGTTG | 10 |
ACCGCCGTTT | CTAAGGCTTA | TTGCCTGTTG | 10 |
CATACCGCCC | AGCGTCCTA | TTTCCACTTT | 9 |
GTGGATTTGC | ATACTTGAC | CTCCTAGTTGC | 9 |
TATTGAACAA | AGATGAAAT | TTAGAGGTGGA | 9 |
TGGGCGCAAA | AAAATGCGA | GCGACCATTGA | 9 |
CACTCTTAAT | CTTGTATAA | TAACTGCTTG | 9 |
ACGGCGGCTT | GGCTTTTCG | TTCATCATTG | 9 |
CAGCAAGTTG | TTTGCCTTT | TTGTTCATTG | 9 |
TTGTCAGCTA | ATGTTCACA | CAATCGTTTT | 9 |
TTTTTGTTTG | CTGGCTCGC | TTTTTACGAG | 9 |
GCGCTATCCC | CCATTTGGA | TTCGTAGTGG | 9 |
CAAAGGATTT | TTGTAGATT | GGTTGGTTTT | 9 |
AACCGGCTCT | TCACCACAA | AATGTACTTG | 9 |
AATCTGTCCC | ATTGCGGAA | AATACAGTTG | 9 |
CGTCCTTTTT | CCTCATGCC | ATGTGATTTG | 9 |
GCCTTGGGAC | GGCCTTGCC | GCTTCATTTG | 9 |
ATCAGAAAAG | CAAGCCATG | CAATCTCTCC | 9 |
AGAAGGATAT | TGTAGAAAA | ATATACATTG | 9 |
CAGCTGCCAT | CGTGAGCGG | AAACACGGTG | 9 |
CATGGGCAAT | CGATGACAC | CATCATCTTG | 9 |
TTTTTGTTTT | CCGGTTCAC | TTTTTACGAG | 9 |
TCGCATAGTA | TGACAATGT | TTTCTTTTTG | 9 |
GTTTATTTTT | TTCTTTTAA | AATATGGTTG | 9 |
CGCAGCGAAT | ACAGCTCAT | AAACTTGTTG | 9 |
CGTTTCTAAA | CTTCCCGAA | GATAGCGACA | 9 |
ACGGCCGTCC | TGGTTTTAT | TGACGTATTG | 9 |
TACAGCGAGA | CAGGTTGGT | TGGGTCATTT | 9 |
TACAGCGAGA | CAGGTTGGT | TGGGTCATTT | 9 |
CTTTGGTCAA | ACCATTTAG | CTGTTGTTTG | 9 |
ATTGCTCAAT | CATTATTAT | GATTTCCTTG | 9 |
CGCCCTTTTT | TGCATACAT | ATATTGTTTG | 9 |
TTTGGATCCA | GACCCAGGG | GATGAAGATG | 9 |
CTAGACAACA | ACCGTCCAT | GTAGGAGTCG | 9 |
TTTCACAGTG | AAATCCCCC | TTTATTGTTG | 9 |
TCCCTTTTCT | TTTCCTCAA | TTTAGGAGTG | 9 |
CTTTTTCCGG | AGACAATAC | GGATGGCTAG | 9 |
CACGGATGGA | ATAAATAAA | TAAAAACTTG | 9 |
GAGGATGGAC | CGCGAGGCG | CTTTCGGTTG | 9 |
TCGGCAAGAT | TTCTTTTCC | CACCAGGTTG | 9 |
CAAATCCATC | ATGGGTACT | AAAAAAACAT | 9 |
CTTAGAAAAG | ATCCAGAA | AACCGTTCTT | 8 |
TGGAAATGCT | ATTCGTTC | CGTGACAGCA | 8 |
ATGTACTCAA | ACCATCTCTT | 0 | |
GAAAGTCCAC | GAATGGTAAT | 0 | |
TTATTGTTTA | CTTCTGCCAG | 0 | |
TCGTTATAGG | ATTATTATAT | 0 | |
CAAAGAAACC | TATAATAATC | 0 | |
AAATCTTGAA | TCGATGCATA | 0 | |
GGTTCAAGCA | CTTCAAGCAA | 0 | |
TTTCTTTTCC | CAACGCCATG | 0 | |
TCATTATTAT | TTCTTTTCCC | 0 | |
TTTCTTTTCC | CAACGCCATG | 0 | |
TTAGGTATAG | CTTGTGTAAC | 0 |
DNA sequence
GTGTGAGTAAAATATTTGTGGGTGACATTCAGGAATGATTCCCCGACGAGGGTTGCGAACTTTTGTTCGCGACGCTCGTCGGGGCCACCAAGCGAAGCGC
GGTAGAAAAAAGCAAATGTCATGCAAAAAGGACTCTCTCCCTGCTATGATGGTGATGACCAACATCCATAAAACAGGAGGGAGAGAGTCCATGAAACATC
TTACCACAGAATGGCCTTTATTAAAAGAGCTGGAGGAACAATTAGTCAGAACTCTTCAAAAGGTGTTCGCTGTCTTGTTGGCGGCCCTTTTGGAGGAGAT
TGATCAACAACTGGCGGAAGCGCGGGACAAGCGCCGGTATCAGCTGAAAGACAAACGGCCGACCACGATCCAAACGCTGTTTGGAGAAGTGACGTTTCGA
CGGAACTACTACTATGATCGGCAGGCGGGGGCGTATACCTTCTTGCTGGATGCCGAACTGGGCTTTGATGGAGCGCAGTCGATCAGCCCTTGCCTCGAGG
AAACGGCGGTCGAGTTGGCCGTAGAGTGCTCTTCCTACCGCAAAGCAGCCCGTACGTTGGAGTCGATCGTGGGGTATGCGGTCCTAAGCCACGAGGCGAT
TCGCCAACTGGTGCTGGAGGCCCCTGTCTCGCTGCACCACCCTGTTTCCCAACGGCACGGCCGAGTGCTGTTTGTGGAGGCGGATGGGCTGTTCATTTCC
CGCCAGGGGAAAGGGAAACGGGCGAAAGAAGAGAAAATCCTGGCGGTTCACGAGGGATGGAAACGAAACGGTTCGCAGCTCGAGCTCGTGAACCGGCGCC
ACTACCTCCATGAAGGGGAGGGAGACGTGTGGGAACGGTTCGAAGAGTGGCTGATGAACGAATATGCCTATGATCCGTGCCGGGACCTGTTGATCATCAA
CGGCGACGCGGCGTCGTGGATCACGGCCTGCCGGGAGTATTTTGGGAAGCGGGCGTGCTTTCAGCTGGATCGATTTCATGTGGCGCGGGAGCTGCGTCAG
TGTCTGTCCGGCCATCCGCGTTGGCGGGAGGTGCGGAAGAAGCTGGCGAAACAAGACGAAGAGGGGCTTCTCGTGGAGCTGAACAGCGCGGTCGGCACGT
TGGAGGACGAAGCGAAAGAGAAGCAGATGGCTGCCATGATCCGCCGGATCGAGTCGATGCCGGGATGCATCCGGGACTATCGGGAGTGGCTGTCGGAGCA
AGGGGTGGAGACGACCGGCATGCGTCCGATGGGCCACGCCGAGAGCGTGATGAGCCGGTTTGCGCATCGGGTGAAATCCCGCCGCAGCTGGAAAGACCAA
GGGCTTCGGGCGTTTCTGAGGGCGATGGCGGCCCGAATCGACGGGATTTGGCGGAGAAATGGGCAGTTGGTGGAGGAAGAAGAGACCCGAACGGCGGCCT
CGGCCTCAACAAAGTCCAAGCGGATCGAACAGGCCAAACGGAAGGCCGGACGGTTATGGGCAGATGTGGTGCGTCAGAATCTACCGTGTCTGCAGCGGTC
ATCCGGGACACCGATCCATCAAGCGTTGTCGGCGCTCCGGGATGGTGGTTGGGTGTAAAAAAATGGAATATCGTATCATCGCCTCAAGATGAGGGATGAG
AGTCCGAAAGCGCTTACGATATGAATTCCTGAAAATGGTTCGCTAACTAGTGATTTACAACGCCCGTAGTGAACCGAAAAAATGTTCTCCACAAAGTCTT
GACTGACTC
GGTAGAAAAAAGCAAATGTCATGCAAAAAGGACTCTCTCCCTGCTATGATGGTGATGACCAACATCCATAAAACAGGAGGGAGAGAGTCCATGAAACATC
TTACCACAGAATGGCCTTTATTAAAAGAGCTGGAGGAACAATTAGTCAGAACTCTTCAAAAGGTGTTCGCTGTCTTGTTGGCGGCCCTTTTGGAGGAGAT
TGATCAACAACTGGCGGAAGCGCGGGACAAGCGCCGGTATCAGCTGAAAGACAAACGGCCGACCACGATCCAAACGCTGTTTGGAGAAGTGACGTTTCGA
CGGAACTACTACTATGATCGGCAGGCGGGGGCGTATACCTTCTTGCTGGATGCCGAACTGGGCTTTGATGGAGCGCAGTCGATCAGCCCTTGCCTCGAGG
AAACGGCGGTCGAGTTGGCCGTAGAGTGCTCTTCCTACCGCAAAGCAGCCCGTACGTTGGAGTCGATCGTGGGGTATGCGGTCCTAAGCCACGAGGCGAT
TCGCCAACTGGTGCTGGAGGCCCCTGTCTCGCTGCACCACCCTGTTTCCCAACGGCACGGCCGAGTGCTGTTTGTGGAGGCGGATGGGCTGTTCATTTCC
CGCCAGGGGAAAGGGAAACGGGCGAAAGAAGAGAAAATCCTGGCGGTTCACGAGGGATGGAAACGAAACGGTTCGCAGCTCGAGCTCGTGAACCGGCGCC
ACTACCTCCATGAAGGGGAGGGAGACGTGTGGGAACGGTTCGAAGAGTGGCTGATGAACGAATATGCCTATGATCCGTGCCGGGACCTGTTGATCATCAA
CGGCGACGCGGCGTCGTGGATCACGGCCTGCCGGGAGTATTTTGGGAAGCGGGCGTGCTTTCAGCTGGATCGATTTCATGTGGCGCGGGAGCTGCGTCAG
TGTCTGTCCGGCCATCCGCGTTGGCGGGAGGTGCGGAAGAAGCTGGCGAAACAAGACGAAGAGGGGCTTCTCGTGGAGCTGAACAGCGCGGTCGGCACGT
TGGAGGACGAAGCGAAAGAGAAGCAGATGGCTGCCATGATCCGCCGGATCGAGTCGATGCCGGGATGCATCCGGGACTATCGGGAGTGGCTGTCGGAGCA
AGGGGTGGAGACGACCGGCATGCGTCCGATGGGCCACGCCGAGAGCGTGATGAGCCGGTTTGCGCATCGGGTGAAATCCCGCCGCAGCTGGAAAGACCAA
GGGCTTCGGGCGTTTCTGAGGGCGATGGCGGCCCGAATCGACGGGATTTGGCGGAGAAATGGGCAGTTGGTGGAGGAAGAAGAGACCCGAACGGCGGCCT
CGGCCTCAACAAAGTCCAAGCGGATCGAACAGGCCAAACGGAAGGCCGGACGGTTATGGGCAGATGTGGTGCGTCAGAATCTACCGTGTCTGCAGCGGTC
ATCCGGGACACCGATCCATCAAGCGTTGTCGGCGCTCCGGGATGGTGGTTGGGTGTAAAAAAATGGAATATCGTATCATCGCCTCAAGATGAGGGATGAG
AGTCCGAAAGCGCTTACGATATGAATTCCTGAAAATGGTTCGCTAACTAGTGATTTACAACGCCCGTAGTGAACCGAAAAAATGTTCTCCACAAAGTCTT
GACTGACTC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1368 bp | 455 aa | 191 | 1558 | + | No |
Chemistry : DDE
ORF sequence :
MKHLTTEWPLLKELEEQLVRTLQKVFAVLLAALLEEIDQQLAEARDKRRYQLKDKRPTTIQTLFGEVTFRRNYYYDRQAGAYTFLLDAELGFDGAQSISP
CLEETAVELAVECSSYRKAARTLESIVGYAVLSHEAIRQLVLEAPVSLHHPVSQRHGRVLFVEADGLFISRQGKGKRAKEEKILAVHEGWKRNGSQLELV
NRRHYLHEGEGDVWERFEEWLMNEYAYDPCRDLLIINGDAASWITACREYFGKRACFQLDRFHVARELRQCLSGHPRWREVRKKLAKQDEEGLLVELNSA
VGTLEDEAKEKQMAAMIRRIESMPGCIRDYREWLSEQGVETTGMRPMGHAESVMSRFAHRVKSRRSWKDQGLRAFLRAMAARIDGIWRRNGQLVEEEETR
TAASASTKSKRIEQAKRKAGRLWADVVRQNLPCLQRSSGTPIHQALSALRDGGWV
CLEETAVELAVECSSYRKAARTLESIVGYAVLSHEAIRQLVLEAPVSLHHPVSQRHGRVLFVEADGLFISRQGKGKRAKEEKILAVHEGWKRNGSQLELV
NRRHYLHEGEGDVWERFEEWLMNEYAYDPCRDLLIINGDAASWITACREYFGKRACFQLDRFHVARELRQCLSGHPRWREVRKKLAKQDEEGLLVELNSA
VGTLEDEAKEKQMAAMIRRIESMPGCIRDYREWLSEQGVETTGMRPMGHAESVMSRFAHRVKSRRSWKDQGLRAFLRAMAARIDGIWRRNGQLVEEEETR
TAASASTKSKRIEQAKRKAGRLWADVVRQNLPCLQRSSGTPIHQALSALRDGGWV
Blast result :
Comments
References