IS232
- Family IS21
- Group
Isoform Synonym(s) IR2, IR2150, IS232A, IS232B, IS232C
Accession number | Transposition | Origin | Host |
---|---|---|---|
M38370 | Y | Bacillus thuringiensis | Bacillus thuringiensis subsp. Tolworthi Bacillus thuringiensis subsp. thuringiensis HD2 plasmid p81.87kb Bacillus thuringiensis subsp. morrisoni Bacillus thuringiensis subsp. thuringiensis berliner 1715 plasmid p65kb Bacillus thuringiensis subsp. aizawai 7.29 plasmid pBT45 Bacillus thuringiensis subsp. darmstadiensis Bacillus thuringiensis subsp. galleriae Bacillus thuringiensis subsp. kurstaki HD1 plasmid p66kb Bacillus thuringiensis subsp. kurstaki HD1-Dipel Bacillus thuringiensis subsp. kurstaki HD73 plasmid p75kb Bacillus thuringiensis subsp. kurstaki HD244 Bacillus thuringiensis subsp. sotto Bacillus thuringiensis subsp. thuringiensis F Bacillus thuringiensis subsp. thuringiensis HD120 Bacillus thuringiensis subsp. thuringiensis HD290 |
DNA section
IS Length : 2184 bp
Ends
IR Length : 48/67
IRL : GTATAAATGCTAACTTAAATATGTACATTAACGCTTGAATAAATATGTAC
IRR : GTGTAAATGTCAAGATAAACATGTACATTTTCGCTTGTTTAAGCATGTAC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
GTATAAATGCTAACTTAAATATGTACATTAACGCTTGAATAAATATGTACATTTTCAATTGGATTCTCCATACTGACTCTGAGGTGGTTAGTATGTATAT
TAAGCTAGATATTCAAACAGAATTTGAGGTTAAAAGTCTTTCAGACTTACCAAATTTTAAAAAACTAATGGGGAACTTAAAAATGAAGATAAATAAAAGT
CAATTAGCCAGAGAATTGAATGTGGATCGGCGTACCATAGATAAGTATTTGAATGGTTTTACACCAAAAGGGACAAAAAATAAAACATCGAAAATCGATA
CATATTATGAAGTGATTGCAGCTCTTTTATCTAGTGATTCTAAACAAATCTTCTACTACAAACGAGTGTTATGGCAGTATCTAACAGACAATCACGGTTT
AAAATGTTCACAGTCTGCATTTCGTGCTTATATTAATAGAAAGCCTGAATTTAGAACATATTTTGATGAGGGGAAGCGTATTTTATCAGGTCATTCAGTG
GGTGTCCGTTATGAGACACCTCCAGGAGAACAAGCTCAATTAGATTGGAAAGAAAGCATACGATTTGAAACCAAAAGTGGCGAAATCGTATATGTGAATG
TAGCTGTACTTTTATTGTCCTACTCGAGATTTAAAGTTTTTCATTTGAATATTTCAAAATCACAAAGTGTTTTAATGTCATTTATGACAGAGGCATTTGA
AATGTTTGGTGGTGTACCGAAGGTAATTGTCACGGATAATATGAAGACGGTAATGGATGAAGCTCGAACAGAACACTTTACAGGAACGATTAACAATAAG
TTTGCCCAATTCGCTCAAGATTTTGGATTTAAGGTACAACCTTGTATCGCAGGGCGACCAAATACCAAAGGGAAAGTAGAAGCGCCAATGAAACTTCTAG
ACGAAATTCATACTTATCAAGGAAGATTCACTTTTGAAGAATTACATGAATTTGTGCAAAAATTATGTGCAAGAATTAATCAAACATTTCATCAAGGGAC
TGGTAAGATTCCAGTGTTTGCCCTAAAACAGGAAAAGAATCTCCTACAACCACTCCCGAAGAGCGCGATAAGAGATTCCTATATGATTAAGCATAAACTT
GTAAAAGTTAATACATCAGGCATGATATCTTACAAATCGAATCAATACTCAGTTCCAGCTGAATATCAAGGTAAAACCGTCGGTTTACAAGTATATGATA
ATCAAATATATGTTTATCATAACATGAAGTTAATTGTACAACATAAAATCAGCCAATCTAAGCTCAATTATAAAGAAGAACATTATAAAAAAGCATTGGC
TAAGTCACTACCTAAATATCCGAACATTGACAATTTGGCGAAACAAAATTTATCAGTAATTGGTGAGGTATATAGAAATGAAGAATAGCTATCAACAATT
AACAACAAACCTAGAGTATTTAAAATTAAAACAAATGGCACAACATTTAGGTGACGTAGTCGATTTTAGCATTAATAATGAATTATCCTTCGTAGAGACA
CTTGTTAAACTGACAAACTATGAGATTGATGTACGAGAACAAAATATGATTCATTCTATGGTGAAAATGGGCGCATTTCCTCATAGAAAGGAGGTTGATG
AGTTTGATTTCGAATTCCAGCCGAGTATTAATAAACAACAAATCTTAGATTTTATTTCTCTACGTTTCTTAGAGCAACAAGAAAACATAGTATTTTTAGG
ACCTAGTGGTGTTGGTAAGACCCATTTGGCCACGTCTATTGGTATAGCAGCAGCTAAAAAGCGAACAAGTACTTATTTTATTAAATGTCATGATTTACTT
CAAAATTTAAAACGTGCCAAGATTGAGAATCGCCTAGAATCTCGTTTAAAGCACTATACAAAATACAAATTACTTATTATTGATGAAATTGGGTACTTGC
CTATTGATCCGGAGGATGCAAAATTATTCTTTCAATTAATCGATATGCGTTATGAAAAGCGTAGTACCATCCTAACGACCAATATCAACTTCAAGTCTTG
GGACGAAGTATTCCAGGACCCTAAACTCGCCAATGCCATACTAGATCGTGTCTTACATCATGCCACGGTGGTCAGTATTGTAGGACAATCCTATCGAATT
AAAGATCATTTTAGCAAAGAAAATGATTGATTTTGTACATGCTTAAACAAGCGAAAATGTACATGTTTATCTTGACATTTACAC
TAAGCTAGATATTCAAACAGAATTTGAGGTTAAAAGTCTTTCAGACTTACCAAATTTTAAAAAACTAATGGGGAACTTAAAAATGAAGATAAATAAAAGT
CAATTAGCCAGAGAATTGAATGTGGATCGGCGTACCATAGATAAGTATTTGAATGGTTTTACACCAAAAGGGACAAAAAATAAAACATCGAAAATCGATA
CATATTATGAAGTGATTGCAGCTCTTTTATCTAGTGATTCTAAACAAATCTTCTACTACAAACGAGTGTTATGGCAGTATCTAACAGACAATCACGGTTT
AAAATGTTCACAGTCTGCATTTCGTGCTTATATTAATAGAAAGCCTGAATTTAGAACATATTTTGATGAGGGGAAGCGTATTTTATCAGGTCATTCAGTG
GGTGTCCGTTATGAGACACCTCCAGGAGAACAAGCTCAATTAGATTGGAAAGAAAGCATACGATTTGAAACCAAAAGTGGCGAAATCGTATATGTGAATG
TAGCTGTACTTTTATTGTCCTACTCGAGATTTAAAGTTTTTCATTTGAATATTTCAAAATCACAAAGTGTTTTAATGTCATTTATGACAGAGGCATTTGA
AATGTTTGGTGGTGTACCGAAGGTAATTGTCACGGATAATATGAAGACGGTAATGGATGAAGCTCGAACAGAACACTTTACAGGAACGATTAACAATAAG
TTTGCCCAATTCGCTCAAGATTTTGGATTTAAGGTACAACCTTGTATCGCAGGGCGACCAAATACCAAAGGGAAAGTAGAAGCGCCAATGAAACTTCTAG
ACGAAATTCATACTTATCAAGGAAGATTCACTTTTGAAGAATTACATGAATTTGTGCAAAAATTATGTGCAAGAATTAATCAAACATTTCATCAAGGGAC
TGGTAAGATTCCAGTGTTTGCCCTAAAACAGGAAAAGAATCTCCTACAACCACTCCCGAAGAGCGCGATAAGAGATTCCTATATGATTAAGCATAAACTT
GTAAAAGTTAATACATCAGGCATGATATCTTACAAATCGAATCAATACTCAGTTCCAGCTGAATATCAAGGTAAAACCGTCGGTTTACAAGTATATGATA
ATCAAATATATGTTTATCATAACATGAAGTTAATTGTACAACATAAAATCAGCCAATCTAAGCTCAATTATAAAGAAGAACATTATAAAAAAGCATTGGC
TAAGTCACTACCTAAATATCCGAACATTGACAATTTGGCGAAACAAAATTTATCAGTAATTGGTGAGGTATATAGAAATGAAGAATAGCTATCAACAATT
AACAACAAACCTAGAGTATTTAAAATTAAAACAAATGGCACAACATTTAGGTGACGTAGTCGATTTTAGCATTAATAATGAATTATCCTTCGTAGAGACA
CTTGTTAAACTGACAAACTATGAGATTGATGTACGAGAACAAAATATGATTCATTCTATGGTGAAAATGGGCGCATTTCCTCATAGAAAGGAGGTTGATG
AGTTTGATTTCGAATTCCAGCCGAGTATTAATAAACAACAAATCTTAGATTTTATTTCTCTACGTTTCTTAGAGCAACAAGAAAACATAGTATTTTTAGG
ACCTAGTGGTGTTGGTAAGACCCATTTGGCCACGTCTATTGGTATAGCAGCAGCTAAAAAGCGAACAAGTACTTATTTTATTAAATGTCATGATTTACTT
CAAAATTTAAAACGTGCCAAGATTGAGAATCGCCTAGAATCTCGTTTAAAGCACTATACAAAATACAAATTACTTATTATTGATGAAATTGGGTACTTGC
CTATTGATCCGGAGGATGCAAAATTATTCTTTCAATTAATCGATATGCGTTATGAAAAGCGTAGTACCATCCTAACGACCAATATCAACTTCAAGTCTTG
GGACGAAGTATTCCAGGACCCTAAACTCGCCAATGCCATACTAGATCGTGTCTTACATCATGCCACGGTGGTCAGTATTGTAGGACAATCCTATCGAATT
AAAGATCATTTTAGCAAAGAAAATGATTGATTTTGTACATGCTTAAACAAGCGAAAATGTACATGTTTATCTTGACATTTACAC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1296 bp | 431 aa | 93 | 1388 | + | No |
Chemistry : DDE
ORF sequence :
MYIKLDIQTEFEVKSLSDLPNFKKLMGNLKMKINKSQLARELNVDRRTIDKYLNGFTPKGTKNKTSKIDTYYEVIAALLSSDSKQIFYYKRVLWQYLTDN
HGLKCSQSAFRAYINRKPEFRTYFDEGKRILSGHSVGVRYETPPGEQAQLDWKESIRFETKSGEIVYVNVAVLLLSYSRFKVFHLNISKSQSVLMSFMTE
AFEMFGGVPKVIVTDNMKTVMDEARTEHFTGTINNKFAQFAQDFGFKVQPCIAGRPNTKGKVEAPMKLLDEIHTYQGRFTFEELHEFVQKLCARINQTFH
QGTGKIPVFALKQEKNLLQPLPKSAIRDSYMIKHKLVKVNTSGMISYKSNQYSVPAEYQGKTVGLQVYDNQIYVYHNMKLIVQHKISQSKLNYKEEHYKK
ALAKSLPKYPNIDNLAKQNLSVIGEVYRNEE
HGLKCSQSAFRAYINRKPEFRTYFDEGKRILSGHSVGVRYETPPGEQAQLDWKESIRFETKSGEIVYVNVAVLLLSYSRFKVFHLNISKSQSVLMSFMTE
AFEMFGGVPKVIVTDNMKTVMDEARTEHFTGTINNKFAQFAQDFGFKVQPCIAGRPNTKGKVEAPMKLLDEIHTYQGRFTFEELHEFVQKLCARINQTFH
QGTGKIPVFALKQEKNLLQPLPKSAIRDSYMIKHKLVKVNTSGMISYKSNQYSVPAEYQGKTVGLQVYDNQIYVYHNMKLIVQHKISQSKLNYKEEHYKK
ALAKSLPKYPNIDNLAKQNLSVIGEVYRNEE
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
753 bp | 250 aa | 1378 | 2130 | + | No |
AG : IS21 helper
ORF sequence :
MKNSYQQLTTNLEYLKLKQMAQHLGDVVDFSINNELSFVETLVKLTNYEIDVREQNMIHSMVKMGAFPHRKEVDEFDFEFQPSINKQQILDFISLRFLEQ
QENIVFLGPSGVGKTHLATSIGIAAAKKRTSTYFIKCHDLLQNLKRAKIENRLESRLKHYTKYKLLIIDEIGYLPIDPEDAKLFFQLIDMRYEKRSTILT
TNINFKSWDEVFQDPKLANAILDRVLHHATVVSIVGQSYRIKDHFSKEND
QENIVFLGPSGVGKTHLATSIGIAAAKKRTSTYFIKCHDLLQNLKRAKIENRLESRLKHYTKYKLLIIDEIGYLPIDPEDAKLFFQLIDMRYEKRSTILT
TNINFKSWDEVFQDPKLANAILDRVLHHATVVSIVGQSYRIKDHFSKEND
Blast result :
Comments
IS232A displays long IR (L: 1-67 and R: 2118-2184/CS) that contains internal DR of 22 bp long. These DR are present three times at the left end and twice at the right end. The two ORF overlap by 11 bp (3 aa).
L: GTATAAATGCTAACTTAAATATGTACATTAACGCTTGAATAAATATGTACATTTTCAATTGGATTCT
|| |||||| || |||| ||||||||| |||||| ||| ||||||| ||||| ||||
R: GTGTAAATGTCAAGATAAACATGTACATTTTCGCTTGTTTAAGCATGTACAAAATCAATCATTTTCT
L1: TAAATATGTACATTAACGCTTG (16-37)
R1: TAAACATGTACATTTTCGCTTG (2148-2169) CS
L2: TAAATATGTACATTTTCAATTGGATTCT (40-67)
R2: TAAGCATGTACAAAATCAATCATTTTCT (2118-2145) CS
L3: TTAGTATGTATATTAAGCTAGATATTCA (88-115), ATG underlined.
This situation is similar to that found in IS1326:
L1: GAGTTGCATCTAAAATTGACCC (5-26)
L2: GATTTGCGTCGAAATTTGACCC (39-60)
L3: GATATTGAGCGCAATTCGACGC (122-143)
R1: GATTTGCATTGAATTTTGACCC (2421-2442) CS
R2: GATTTGCACCCAAATTTGACCC (2445-2466) CS
The 572-bp DNA sequence of IS232B from file M77344 is identical to that of IS232A, although differences must exist between the two ISs (restriction maps different).
July 13 2012 : the file IS232 replace the IS232A, IS232B and IS232C files ; IS232B and IS232C were partial sequences.
L: GTATAAATGCTAACTTAAATATGTACATTAACGCTTGAATAAATATGTACATTTTCAATTGGATTCT
|| |||||| || |||| ||||||||| |||||| ||| ||||||| ||||| ||||
R: GTGTAAATGTCAAGATAAACATGTACATTTTCGCTTGTTTAAGCATGTACAAAATCAATCATTTTCT
L1: TAAATATGTACATTAACGCTTG (16-37)
R1: TAAACATGTACATTTTCGCTTG (2148-2169) CS
L2: TAAATATGTACATTTTCAATTGGATTCT (40-67)
R2: TAAGCATGTACAAAATCAATCATTTTCT (2118-2145) CS
L3: TTAGTATGTATATTAAGCTAGATATTCA (88-115), ATG underlined.
This situation is similar to that found in IS1326:
L1: GAGTTGCATCTAAAATTGACCC (5-26)
L2: GATTTGCGTCGAAATTTGACCC (39-60)
L3: GATATTGAGCGCAATTCGACGC (122-143)
R1: GATTTGCATTGAATTTTGACCC (2421-2442) CS
R2: GATTTGCACCCAAATTTGACCC (2445-2466) CS
The 572-bp DNA sequence of IS232B from file M77344 is identical to that of IS232A, although differences must exist between the two ISs (restriction maps different).
July 13 2012 : the file IS232 replace the IS232A, IS232B and IS232C files ; IS232B and IS232C were partial sequences.
References