ISEc49
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004431 | ND | Escherichia coli | Escherichia coli CFT073 Shigella boydii CDC 3083-94 |
DNA section
IS Length : 2766 bp
Ends
IR Length : 19/27
IRL : GTGTCGTTACGGGGAAGTCATCTTCACAGGGATCTTCGGTCAGTTAGTAT
IRR : GTAAGCTTCCGGGGAACTCATATTGACGCGTTTTTCAGGCGCGCGTCATC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGATCGTTCG | TATGCGCC | GCCGATGACG | 8 |
AATTTGCCGA | AATTTTGA | TAATCAGTTC | 8 |
DNA sequence
GTGTCGTTACGGGGAAGTCATCTTCACAGGGATCTTCGGTCAGTTAGTATTACCGACATCCATTCACCAACTTACCGGAGATAATCGATGTCAAAACCCC
GATGGACCATTGAACAGAAAAGGCAGCACGTTGCTGCCTGGCGCGCCAGTGGTCTTACACGTCAGCAATACTGTGAACTCAACGATATTCCCTTCACATC
ACTTCGCGAATGGCCGCAGGATGTCGCAAAAGCTGAACGCCGGGCAAATGAACCGACCGTTCTTCCCGTTCATATTGCGCCACCACTGCATGCTGATGTA
CCACAGCCCGTCACGAATGAGCCAGTCACGCTCTTCCTTCCCGGTGGAGTACGGATGTGCTGTCAGCCGTCACAACTTACCGATGTCTTCAGGGCACTCA
AATATGCTCAGTCCTGATAATGTCTTCATCGCAATTAAACCTGTTGACATGCGCCGGGGCATCGACTCACTGACGCAGTATATACAGGATGAACTCCGGT
CGACATGGCATGAGGGAGCCGCCTTCGTCTTTGTTAACAAAGCCCGTTCGCGTATCAAAGTTCTCCGGTGGGATAAACACGGGGTGTGGTTGTGTACCCG
CCGTCTGCACAAAGGCAGCTTCCGCTGGCCACGTGCAAATGACGCTGCCTGGCACCTCACTCCCGACGAGTTTAACTGGCTGATTGCCGGTGTTGACTGG
CAGCAGGTTAAGGGACATGACCTGACGAAATGGGTCTGGCAGAATGAACCTGAACTGCGCCCTGAAAACACGCAAAATACACTGCTAACTCAGTGAAAAA
TAATGAATATCCGTATCTGGAGTGGTATACTCCCCTGTATGGATATCTCCGCTCTCAACACCACGAATGACATCGAAAAACTGCGTGCTATGGCACTTGC
CATGGTACAAGAAGTCATGTCGGAGAATGCCGAAAAAGAGCGGGAATTACTGGAGAAAAGCCGGCGCATCCAGCTTCTGGAAGAAATGCTGAAACTGGTT
CGTCAACAGCGCTTCGGAAAAAAATGTGGAACGCTGGCTGGTATGCAACGCTCCCTGTTCGAAGAGGATGTTGATGCCGATATCGCCGCGCTTACCGCAC
ATCTGGATAAACTGCTCCCGCAATCCCCTGAAGAAGACGAAAAAGCGTCCCGTTCACGCCCGATACGCAAACCCTTACCGGTTCATCTTCCACGGGTGGA
AAAAATTATCCAGCCGGACACTGACCATTGCCCTGAATGTGACGAGCCGCTGCACTATATCCGCGATGCGGTGAGTGAAAAGCTGGAGTATATTCCCGCT
CACTTTGTGGTGAACCGTTATGTCCGTCCACAATACAGTTGTCCCTGTTGCCAGAAGGTGTTCAGCGGTGAAATGCCGGCACATATCCTCCCGAAAAGTG
CCGTTGAGCCATCAGTCATCGCACAGGTGATCATCAATAAATACGGTGACCACCTGCCTCTGTATCGCCAGCAACAGGTCTTTGCCCGTTCAGATGTCGG
GCTGCCCGTCAGTTCGATGGCTGACATGGTTGGCGCGGCGGGTGCCGCATTATCTCCCCTGGCGGCGTTACTCCATCGCGAGTTGATAAACCGTCCGGTG
GTGCATGCAGATGAGACTACCCTGAAGATCCTGAACACGAAGAAAGGCGGTAAATCCTGCTCCGGTTATCTGTGGGCATACGTCAGTGGAGAAAGGACGG
GACCGTCAGTTGTGTGCTTCGACTGCCGGACCGGACGTAGCCATGAGTATCCTGAAAACTGGCTTCAGGGCTGGGGCGGGACGCTGGTTGTCGACGGACA
TAAAGCTTACCGGACTCTGGCAAACAAAGTGCCGGAGATCACGCTGGCCGGATGCTGGGCCCATGCCCGCAGGGGCTTCGCCGACCTGTATAAAATCAGT
AAAGATCCACGGGCTGCCATAGCCGTGAAGAAAATCGCGGGGTTGTACCGTCTTGAGAAGAAGATCAGTAGCCGCCCCGTGGAAAAAATCCGCCAGTGGC
GACAGCGTTATGCCCGTCCGATACTGGAAGAACTGTGGTCATGGCTTGAAGAGCAGGAACCGCAATGTTCTCCGGGAAGGGCATTACACAAAGCCATTGC
CTATGCGCTGTCTCATCGCGTGGAACTGAGCCGCTTCCTGGAAGATGGTGCGGTGCCGCTGGATAATAATGTGTGTGAACGGGCCATCAAAAACGTGGTT
CTGGGCAGAAAATCGTGGCTGTTCGCCGGTTCGCAGATGGCGGGAGAACGCGCCGCGCAAATAATGAGCTTGCTGGAAACCGCGAAACGCAACGGTCTGG
AGCCGCATGCCTGGTTGACAGACGTCCTGATGCGTCTGCCGGAGTGGCCGGAGGAGCGACTGGCAGAGTTGCTGCCTCTTGAGGGATTTACCTTCTCCGG
GTGAGTGATACCTGCCGTCAGGTGTTCGTGCACCGGGCCATAACCTGCAGTCGGGAGTTGAACTCCTGACGGCAGGAAATGAGCCAGACCAGCAACCACG
CGGGCTGACAGCGAATAAGAACCTGAATCAGCCCGGAGTCATTCCCGGGTCAGTACATCATCACCTGCTATTAATCCCCGAGTCAGTACTTCGCGGCCTG
ACGCGATCACCCTACTCAGTACTTCAGAGCCCGCCTCCGGTACCTGCCTGGCGATCGCCCCTGTGCAACAAAATAAATCAACCGTTATCAAACCACGATC
GTTCGTATGCGCCGCCGATGACGCGCGCCTGAAAAACGCGTCAATATGAGTTCCCCGGAAGCTTAC
GATGGACCATTGAACAGAAAAGGCAGCACGTTGCTGCCTGGCGCGCCAGTGGTCTTACACGTCAGCAATACTGTGAACTCAACGATATTCCCTTCACATC
ACTTCGCGAATGGCCGCAGGATGTCGCAAAAGCTGAACGCCGGGCAAATGAACCGACCGTTCTTCCCGTTCATATTGCGCCACCACTGCATGCTGATGTA
CCACAGCCCGTCACGAATGAGCCAGTCACGCTCTTCCTTCCCGGTGGAGTACGGATGTGCTGTCAGCCGTCACAACTTACCGATGTCTTCAGGGCACTCA
AATATGCTCAGTCCTGATAATGTCTTCATCGCAATTAAACCTGTTGACATGCGCCGGGGCATCGACTCACTGACGCAGTATATACAGGATGAACTCCGGT
CGACATGGCATGAGGGAGCCGCCTTCGTCTTTGTTAACAAAGCCCGTTCGCGTATCAAAGTTCTCCGGTGGGATAAACACGGGGTGTGGTTGTGTACCCG
CCGTCTGCACAAAGGCAGCTTCCGCTGGCCACGTGCAAATGACGCTGCCTGGCACCTCACTCCCGACGAGTTTAACTGGCTGATTGCCGGTGTTGACTGG
CAGCAGGTTAAGGGACATGACCTGACGAAATGGGTCTGGCAGAATGAACCTGAACTGCGCCCTGAAAACACGCAAAATACACTGCTAACTCAGTGAAAAA
TAATGAATATCCGTATCTGGAGTGGTATACTCCCCTGTATGGATATCTCCGCTCTCAACACCACGAATGACATCGAAAAACTGCGTGCTATGGCACTTGC
CATGGTACAAGAAGTCATGTCGGAGAATGCCGAAAAAGAGCGGGAATTACTGGAGAAAAGCCGGCGCATCCAGCTTCTGGAAGAAATGCTGAAACTGGTT
CGTCAACAGCGCTTCGGAAAAAAATGTGGAACGCTGGCTGGTATGCAACGCTCCCTGTTCGAAGAGGATGTTGATGCCGATATCGCCGCGCTTACCGCAC
ATCTGGATAAACTGCTCCCGCAATCCCCTGAAGAAGACGAAAAAGCGTCCCGTTCACGCCCGATACGCAAACCCTTACCGGTTCATCTTCCACGGGTGGA
AAAAATTATCCAGCCGGACACTGACCATTGCCCTGAATGTGACGAGCCGCTGCACTATATCCGCGATGCGGTGAGTGAAAAGCTGGAGTATATTCCCGCT
CACTTTGTGGTGAACCGTTATGTCCGTCCACAATACAGTTGTCCCTGTTGCCAGAAGGTGTTCAGCGGTGAAATGCCGGCACATATCCTCCCGAAAAGTG
CCGTTGAGCCATCAGTCATCGCACAGGTGATCATCAATAAATACGGTGACCACCTGCCTCTGTATCGCCAGCAACAGGTCTTTGCCCGTTCAGATGTCGG
GCTGCCCGTCAGTTCGATGGCTGACATGGTTGGCGCGGCGGGTGCCGCATTATCTCCCCTGGCGGCGTTACTCCATCGCGAGTTGATAAACCGTCCGGTG
GTGCATGCAGATGAGACTACCCTGAAGATCCTGAACACGAAGAAAGGCGGTAAATCCTGCTCCGGTTATCTGTGGGCATACGTCAGTGGAGAAAGGACGG
GACCGTCAGTTGTGTGCTTCGACTGCCGGACCGGACGTAGCCATGAGTATCCTGAAAACTGGCTTCAGGGCTGGGGCGGGACGCTGGTTGTCGACGGACA
TAAAGCTTACCGGACTCTGGCAAACAAAGTGCCGGAGATCACGCTGGCCGGATGCTGGGCCCATGCCCGCAGGGGCTTCGCCGACCTGTATAAAATCAGT
AAAGATCCACGGGCTGCCATAGCCGTGAAGAAAATCGCGGGGTTGTACCGTCTTGAGAAGAAGATCAGTAGCCGCCCCGTGGAAAAAATCCGCCAGTGGC
GACAGCGTTATGCCCGTCCGATACTGGAAGAACTGTGGTCATGGCTTGAAGAGCAGGAACCGCAATGTTCTCCGGGAAGGGCATTACACAAAGCCATTGC
CTATGCGCTGTCTCATCGCGTGGAACTGAGCCGCTTCCTGGAAGATGGTGCGGTGCCGCTGGATAATAATGTGTGTGAACGGGCCATCAAAAACGTGGTT
CTGGGCAGAAAATCGTGGCTGTTCGCCGGTTCGCAGATGGCGGGAGAACGCGCCGCGCAAATAATGAGCTTGCTGGAAACCGCGAAACGCAACGGTCTGG
AGCCGCATGCCTGGTTGACAGACGTCCTGATGCGTCTGCCGGAGTGGCCGGAGGAGCGACTGGCAGAGTTGCTGCCTCTTGAGGGATTTACCTTCTCCGG
GTGAGTGATACCTGCCGTCAGGTGTTCGTGCACCGGGCCATAACCTGCAGTCGGGAGTTGAACTCCTGACGGCAGGAAATGAGCCAGACCAGCAACCACG
CGGGCTGACAGCGAATAAGAACCTGAATCAGCCCGGAGTCATTCCCGGGTCAGTACATCATCACCTGCTATTAATCCCCGAGTCAGTACTTCGCGGCCTG
ACGCGATCACCCTACTCAGTACTTCAGAGCCCGCCTCCGGTACCTGCCTGGCGATCGCCCCTGTGCAACAAAATAAATCAACCGTTATCAAACCACGATC
GTTCGTATGCGCCGCCGATGACGCGCGCCTGAAAAACGCGTCAATATGAGTTCCCCGGAAGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
330 bp | 109 aa | 88 | 417 | + | No |
AG : IS66 TnpA
ORF sequence :
MSKPRWTIEQKRQHVAAWRASGLTRQQYCELNDIPFTSLREWPQDVAKAERRANEPTVLPVHIAPPLHADVPQPVTNEPVTLFLPGGVRMCCQPSQLTDV
FRALKYAQS
FRALKYAQS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
393 bp | 130 aa | 404 | 796 | + | No |
AG : IS66 TnpB
ORF sequence :
MLSPDNVFIAIKPVDMRRGIDSLTQYIQDELRSTWHEGAAFVFVNKARSRIKVLRWDKHGVWLCTRRLHKGSFRWPRANDAAWHLTPDEFNWLIAGVDWQ
QVKGHDLTKWVWQNEPELRPENTQNTLLTQ
QVKGHDLTKWVWQNEPELRPENTQNTLLTQ
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1602 bp | 533 aa | 803 | 2404 | + | No |
Chemistry : DDE
ORF sequence :
MNIRIWSGILPCMDISALNTTNDIEKLRAMALAMVQEVMSENAEKERELLEKSRRIQLLEEMLKLVRQQRFGKKCGTLAGMQRSLFEEDVDADIAALTAH
LDKLLPQSPEEDEKASRSRPIRKPLPVHLPRVEKIIQPDTDHCPECDEPLHYIRDAVSEKLEYIPAHFVVNRYVRPQYSCPCCQKVFSGEMPAHILPKSA
VEPSVIAQVIINKYGDHLPLYRQQQVFARSDVGLPVSSMADMVGAAGAALSPLAALLHRELINRPVVHADETTLKILNTKKGGKSCSGYLWAYVSGERTG
PSVVCFDCRTGRSHEYPENWLQGWGGTLVVDGHKAYRTLANKVPEITLAGCWAHARRGFADLYKISKDPRAAIAVKKIAGLYRLEKKISSRPVEKIRQWR
QRYARPILEELWSWLEEQEPQCSPGRALHKAIAYALSHRVELSRFLEDGAVPLDNNVCERAIKNVVLGRKSWLFAGSQMAGERAAQIMSLLETAKRNGLE
PHAWLTDVLMRLPEWPEERLAELLPLEGFTFSG
LDKLLPQSPEEDEKASRSRPIRKPLPVHLPRVEKIIQPDTDHCPECDEPLHYIRDAVSEKLEYIPAHFVVNRYVRPQYSCPCCQKVFSGEMPAHILPKSA
VEPSVIAQVIINKYGDHLPLYRQQQVFARSDVGLPVSSMADMVGAAGAALSPLAALLHRELINRPVVHADETTLKILNTKKGGKSCSGYLWAYVSGERTG
PSVVCFDCRTGRSHEYPENWLQGWGGTLVVDGHKAYRTLANKVPEITLAGCWAHARRGFADLYKISKDPRAAIAVKKIAGLYRLEKKISSRPVEKIRQWR
QRYARPILEELWSWLEEQEPQCSPGRALHKAIAYALSHRVELSRFLEDGAVPLDNNVCERAIKNVVLGRKSWLFAGSQMAGERAAQIMSLLETAKRNGLE
PHAWLTDVLMRLPEWPEERLAELLPLEGFTFSG
Blast result :
Comments
ISEc49 is 58%(orfA) aa similar to ISVsp4, 72%(orfB) and 58%(orfC) aa similar to ISAzo15.
References
1] ISfinder annotation (2012)
2] Welch,R.A., Burland,V., Plunkett,G. III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L., Donnenberg,M.S. and Blattner,F.R. (2002) Proc. Natl. Acad. Sci. U.S.A. 99 (26), 17020-17024
2] Welch,R.A., Burland,V., Plunkett,G. III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L., Donnenberg,M.S. and Blattner,F.R. (2002) Proc. Natl. Acad. Sci. U.S.A. 99 (26), 17020-17024