IS5376
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
X67861 | Y | Bacillus stearothermophilus | Bacillus stearothermophilus T124 Bacillus stearothermophilus CU21 Bacillus stearothermophilus FH112 |
DNA section
IS Length : 2107 bp
Ends
IR Length : 39/50
IRL : TGTTAAAGCCGATGATAAAATCCCCAATATAGCCGGAATAAAATTCCCCA
IRR : TGTCAAGGCCGATTATTTTTTCCCCAAAATCGCCGGTTTAAAATTCCCCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
NNNNNNNNNN | TGAAT | NNNNNNNNNN | 5 |
DNA sequence
TGTTAAAGCCGATGATAAAATCCCCAATATAGCCGGAATAAAATTCCCCACTTACAATAGAACCATAGTGTCAACGAGAGGAGCTATGGTTCATGATTAC
GAGAGGGGAATTTTTTATGATCAAAGAGATGTATGAAAGGGGAATGAGTATTTCCGATATTGCGAGGGAATTGGGGATCGATCGGAAAACCGTCCGAAAA
TATATTCACTCCCCCAATCCTCCTTCCAAATCCAAGCGAAAACAAAGAAAAAGCAAGTTAGATCCATTTAAGCCGTATCTTCAAAAACGAATGTTAGAAG
ATGGGGTGTTTAATAGCGAAAAGTTGTTTTTTGAAATTCGACAACAGGGCTATACGGGAGGAAAGACGATTTTAAAAGACTATATGAAACCTTTCCGAGA
GACGGCGAAAAAGAAATACACCGTTCGTTATGAAACGCTTCCTGGCGAACAAATGCAAGTCGATTGGAAAGAAGTTGGGGAGGTCGTGATCGAAGGGAAA
AAAGTCAAGTTATCGCTATTTGTGGCCACGTTAGGCTATTCGCGGATGAAATACGCGGTATTTACGACCAGCCAGGATCAGGAACACTTAATGGAATGCC
TGATTCAGAGCTTCAAGTACTTTGGCGGGGTTCCGAAGAAGGTGTTATTTGACAATATGAAGACCGTTACAGACGGGCGAGAACAAGGAGTGGTGAAATG
GAATCAACGATTTTCCGAATTTGCGAGTTACTATGGATTTATTCCAAAAGTATGCCGGCCTTACCGGGCCCAGACAAAGGGAAAAGTCGAACGAGCCATT
CAGTATATCATGGATCACTTCTATGTGGGGACAGCGTTTGAAAGCATCGAGGAATTAAATTTCCTTCTCCATCGTTGGCTCGATCAAGTGGCGAATCGGA
AGCCAAACGCCACTACCGGTATTTCTCCGCAAGAGCGTTGGGCCGAGGAGTCACTCAAGCCTCTTCCGTTGAAAGATTACGATACGAGCTATCTTTCCTA
TCGGAAGGTGCATTGGGATGGCAGTTTCTCCTACAAAGGGGAACAATGGCTCTTATCGGCGGAGTATGCGGGCAAAGAAATTCTGGTGAAGGAGCGATTA
AATGGAGATATTCGATTGTACTTTCGAGGGGAGGAGATTTCTCACGTGGACCAACAGAAAAAAGTGATTTCATTCGCCGAAAAAATAAAAAAGAAACAAA
CGGAAATGGCCGCCACCATTTCGCCTGTTTCGGTGGAAGTGGATACTCGTCCATTGTCCGTTTATGACGCATTCCTGCGAGGGGAAAGCTCATGAAAGAA
CGAATACACGAGTATTGCCACCGACTCCATTTGCCTGTCATGGCGGAACGATGGTCCGCCATGGCAGAATACGCCTCTACTCATAATATATCATATTCAG
AGTTTTTATTCCGCTTATTAGAGGCAGAAATCGTCGAAAAACAGGCACGATCGATCCAAACGCTCATCAAGCTGTCCAAACTGCCGTATCGCAAGACGAT
CGATACGTTTGATTTTACCGCGCAGCCTTCGGTGGATGAGCGCCGGATTCGAGAACTGCTTACGTTGTCCTTTATTGACCGGAAAGAAAATATCCTCTTT
CTCGGTCCACCGGGTATTGGGAAGACACATCTGGCAATTTCGATTGGAATGGAGGCGATCGCAAGAGGATATAAAACGTATTTTATTACCGCTCACGATT
TGGTCAATCAGTTAAGAAGAGCCGACCAGGAAGGAAAGTTGGAGAAAAAGCTTCGTGTCTTTGTGAAGCCAACCGTTCTCATTATTGATGAAATGGGGTA
TCTAAAACTGGACCCGAACAGCGCTCATTACTTATTTCAAGTGATCGCCCGGCGGTACGAGCATGCCCCGATTATCCTCACCTCCAACAAAAGCTTTGGG
GAATGGGGAGAAATCGTGGGAGACTCGGTTTTGGCGACAGCGATGTTAGATCGATTACTGCATCATTCCATCATTTTCAACCTAAAGGGGGAAAGCTATC
GATTACGGGAAAAGAGGCTCCAAGAAGAAAAACAGAAGGATCAATGAAAGGTCCTTCTGGGGAATTTTAAACCGGCGATTTTGGGGAAAAAATAATCGGC
CTTGACA
GAGAGGGGAATTTTTTATGATCAAAGAGATGTATGAAAGGGGAATGAGTATTTCCGATATTGCGAGGGAATTGGGGATCGATCGGAAAACCGTCCGAAAA
TATATTCACTCCCCCAATCCTCCTTCCAAATCCAAGCGAAAACAAAGAAAAAGCAAGTTAGATCCATTTAAGCCGTATCTTCAAAAACGAATGTTAGAAG
ATGGGGTGTTTAATAGCGAAAAGTTGTTTTTTGAAATTCGACAACAGGGCTATACGGGAGGAAAGACGATTTTAAAAGACTATATGAAACCTTTCCGAGA
GACGGCGAAAAAGAAATACACCGTTCGTTATGAAACGCTTCCTGGCGAACAAATGCAAGTCGATTGGAAAGAAGTTGGGGAGGTCGTGATCGAAGGGAAA
AAAGTCAAGTTATCGCTATTTGTGGCCACGTTAGGCTATTCGCGGATGAAATACGCGGTATTTACGACCAGCCAGGATCAGGAACACTTAATGGAATGCC
TGATTCAGAGCTTCAAGTACTTTGGCGGGGTTCCGAAGAAGGTGTTATTTGACAATATGAAGACCGTTACAGACGGGCGAGAACAAGGAGTGGTGAAATG
GAATCAACGATTTTCCGAATTTGCGAGTTACTATGGATTTATTCCAAAAGTATGCCGGCCTTACCGGGCCCAGACAAAGGGAAAAGTCGAACGAGCCATT
CAGTATATCATGGATCACTTCTATGTGGGGACAGCGTTTGAAAGCATCGAGGAATTAAATTTCCTTCTCCATCGTTGGCTCGATCAAGTGGCGAATCGGA
AGCCAAACGCCACTACCGGTATTTCTCCGCAAGAGCGTTGGGCCGAGGAGTCACTCAAGCCTCTTCCGTTGAAAGATTACGATACGAGCTATCTTTCCTA
TCGGAAGGTGCATTGGGATGGCAGTTTCTCCTACAAAGGGGAACAATGGCTCTTATCGGCGGAGTATGCGGGCAAAGAAATTCTGGTGAAGGAGCGATTA
AATGGAGATATTCGATTGTACTTTCGAGGGGAGGAGATTTCTCACGTGGACCAACAGAAAAAAGTGATTTCATTCGCCGAAAAAATAAAAAAGAAACAAA
CGGAAATGGCCGCCACCATTTCGCCTGTTTCGGTGGAAGTGGATACTCGTCCATTGTCCGTTTATGACGCATTCCTGCGAGGGGAAAGCTCATGAAAGAA
CGAATACACGAGTATTGCCACCGACTCCATTTGCCTGTCATGGCGGAACGATGGTCCGCCATGGCAGAATACGCCTCTACTCATAATATATCATATTCAG
AGTTTTTATTCCGCTTATTAGAGGCAGAAATCGTCGAAAAACAGGCACGATCGATCCAAACGCTCATCAAGCTGTCCAAACTGCCGTATCGCAAGACGAT
CGATACGTTTGATTTTACCGCGCAGCCTTCGGTGGATGAGCGCCGGATTCGAGAACTGCTTACGTTGTCCTTTATTGACCGGAAAGAAAATATCCTCTTT
CTCGGTCCACCGGGTATTGGGAAGACACATCTGGCAATTTCGATTGGAATGGAGGCGATCGCAAGAGGATATAAAACGTATTTTATTACCGCTCACGATT
TGGTCAATCAGTTAAGAAGAGCCGACCAGGAAGGAAAGTTGGAGAAAAAGCTTCGTGTCTTTGTGAAGCCAACCGTTCTCATTATTGATGAAATGGGGTA
TCTAAAACTGGACCCGAACAGCGCTCATTACTTATTTCAAGTGATCGCCCGGCGGTACGAGCATGCCCCGATTATCCTCACCTCCAACAAAAGCTTTGGG
GAATGGGGAGAAATCGTGGGAGACTCGGTTTTGGCGACAGCGATGTTAGATCGATTACTGCATCATTCCATCATTTTCAACCTAAAGGGGGAAAGCTATC
GATTACGGGAAAAGAGGCTCCAAGAAGAAAAACAGAAGGATCAATGAAAGGTCCTTCTGGGGAATTTTAAACCGGCGATTTTGGGGAAAAAATAATCGGC
CTTGACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1203 bp | 400 aa | 93 | 1295 | + | No |
Chemistry : DDE
ORF sequence :
MITRGEFFMIKEMYERGMSISDIARELGIDRKTVRKYIHSPNPPSKSKRKQRKSKLDPFKPYLQKRMLEDGVFNSEKLFFEIRQQGYTGGKTILKDYMKP
FRETAKKKYTVRYETLPGEQMQVDWKEVGEVVIEGKKVKLSLFVATLGYSRMKYAVFTTSQDQEHLMECLIQSFKYFGGVPKKVLFDNMKTVTDGREQGV
VKWNQRFSEFASYYGFIPKVCRPYRAQTKGKVERAIQYIMDHFYVGTAFESIEELNFLLHRWLDQVANRKPNATTGISPQERWAEESLKPLPLKDYDTSY
LSYRKVHWDGSFSYKGEQWLLSAEYAGKEILVKERLNGDIRLYFRGEEISHVDQQKKVISFAEKIKKKQTEMAATISPVSVEVDTRPLSVYDAFLRGESS
FRETAKKKYTVRYETLPGEQMQVDWKEVGEVVIEGKKVKLSLFVATLGYSRMKYAVFTTSQDQEHLMECLIQSFKYFGGVPKKVLFDNMKTVTDGREQGV
VKWNQRFSEFASYYGFIPKVCRPYRAQTKGKVERAIQYIMDHFYVGTAFESIEELNFLLHRWLDQVANRKPNATTGISPQERWAEESLKPLPLKDYDTSY
LSYRKVHWDGSFSYKGEQWLLSAEYAGKEILVKERLNGDIRLYFRGEEISHVDQQKKVISFAEKIKKKQTEMAATISPVSVEVDTRPLSVYDAFLRGESS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
756 bp | 251 aa | 1292 | 2047 | + | No |
AG : IS21 helper
ORF sequence :
MKERIHEYCHRLHLPVMAERWSAMAEYASTHNISYSEFLFRLLEAEIVEKQARSIQTLIKLSKLPYRKTIDTFDFTAQPSVDERRIRELLTLSFIDRKEN
ILFLGPPGIGKTHLAISIGMEAIARGYKTYFITAHDLVNQLRRADQEGKLEKKLRVFVKPTVLIIDEMGYLKLDPNSAHYLFQVIARRYEHAPIILTSNK
SFGEWGEIVGDSVLATAMLDRLLHHSIIFNLKGESYRLREKRLQEEKQKDQ
ILFLGPPGIGKTHLAISIGMEAIARGYKTYFITAHDLVNQLRRADQEGKLEKKLRVFVKPTVLIIDEMGYLKLDPNSAHYLFQVIARRYEHAPIILTSNK
SFGEWGEIVGDSVLATAMLDRLLHHSIIFNLKGESYRLREKRLQEEKQKDQ
Blast result :
Comments
At least four potential 23-bp repeats have been identified (J. Mahillon) at the ends of IS5376:
L1: TAAAGCCGATGATAAAATCCCCA (4-26)
L2: TATAGCCGGAATAAAATTCCCCA (28-52)
R1: CAAGGCCGATTATTTTTTCCCCA (2082-2104) CS
R2: AATCGCCGGTTTAAAATTCCCCA (2058-2080) CS
L1: TAAAGCCGATGATAAAATCCCCA (4-26)
L2: TATAGCCGGAATAAAATTCCCCA (28-52)
R1: CAAGGCCGATTATTTTTTCCCCA (2082-2104) CS
R2: AATCGCCGGTTTAAAATTCCCCA (2058-2080) CS
References
1] Xu, K., He, Z.-Q., Mao, Y.-M., Sheng, R.-Q., and Sheng, Z.-Y. (1993) Plasmid 29, 1-9.