ISEc49

  • Family IS66
  • Group
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
NC_004431 ND Escherichia coli
Escherichia coli CFT073
Shigella boydii CDC 3083-94
DNA section
IS Length : 2766 bp

Ends


IR Length : 19/27

IRL : GTGTCGTTACGGGGAAGTCATCTTCACAGGGATCTTCGGTCAGTTAGTAT
IRR : GTAAGCTTCCGGGGAACTCATATTGACGCGTTTTTCAGGCGCGCGTCATC

Insertion site


Left flankDirect repeatRight flankDR Length
CGATCGTTCGTATGCGCCGCCGATGACG8
AATTTGCCGAAATTTTGATAATCAGTTC8

DNA sequence

GTGTCGTTACGGGGAAGTCATCTTCACAGGGATCTTCGGTCAGTTAGTATTACCGACATCCATTCACCAACTTACCGGAGATAATCGATGTCAAAACCCC
GATGGACCATTGAACAGAAAAGGCAGCACGTTGCTGCCTGGCGCGCCAGTGGTCTTACACGTCAGCAATACTGTGAACTCAACGATATTCCCTTCACATC
ACTTCGCGAATGGCCGCAGGATGTCGCAAAAGCTGAACGCCGGGCAAATGAACCGACCGTTCTTCCCGTTCATATTGCGCCACCACTGCATGCTGATGTA
CCACAGCCCGTCACGAATGAGCCAGTCACGCTCTTCCTTCCCGGTGGAGTACGGATGTGCTGTCAGCCGTCACAACTTACCGATGTCTTCAGGGCACTCA
AATATGCTCAGTCCTGATAATGTCTTCATCGCAATTAAACCTGTTGACATGCGCCGGGGCATCGACTCACTGACGCAGTATATACAGGATGAACTCCGGT
CGACATGGCATGAGGGAGCCGCCTTCGTCTTTGTTAACAAAGCCCGTTCGCGTATCAAAGTTCTCCGGTGGGATAAACACGGGGTGTGGTTGTGTACCCG
CCGTCTGCACAAAGGCAGCTTCCGCTGGCCACGTGCAAATGACGCTGCCTGGCACCTCACTCCCGACGAGTTTAACTGGCTGATTGCCGGTGTTGACTGG
CAGCAGGTTAAGGGACATGACCTGACGAAATGGGTCTGGCAGAATGAACCTGAACTGCGCCCTGAAAACACGCAAAATACACTGCTAACTCAGTGAAAAA
TAATGAATATCCGTATCTGGAGTGGTATACTCCCCTGTATGGATATCTCCGCTCTCAACACCACGAATGACATCGAAAAACTGCGTGCTATGGCACTTGC
CATGGTACAAGAAGTCATGTCGGAGAATGCCGAAAAAGAGCGGGAATTACTGGAGAAAAGCCGGCGCATCCAGCTTCTGGAAGAAATGCTGAAACTGGTT
CGTCAACAGCGCTTCGGAAAAAAATGTGGAACGCTGGCTGGTATGCAACGCTCCCTGTTCGAAGAGGATGTTGATGCCGATATCGCCGCGCTTACCGCAC
ATCTGGATAAACTGCTCCCGCAATCCCCTGAAGAAGACGAAAAAGCGTCCCGTTCACGCCCGATACGCAAACCCTTACCGGTTCATCTTCCACGGGTGGA
AAAAATTATCCAGCCGGACACTGACCATTGCCCTGAATGTGACGAGCCGCTGCACTATATCCGCGATGCGGTGAGTGAAAAGCTGGAGTATATTCCCGCT
CACTTTGTGGTGAACCGTTATGTCCGTCCACAATACAGTTGTCCCTGTTGCCAGAAGGTGTTCAGCGGTGAAATGCCGGCACATATCCTCCCGAAAAGTG
CCGTTGAGCCATCAGTCATCGCACAGGTGATCATCAATAAATACGGTGACCACCTGCCTCTGTATCGCCAGCAACAGGTCTTTGCCCGTTCAGATGTCGG
GCTGCCCGTCAGTTCGATGGCTGACATGGTTGGCGCGGCGGGTGCCGCATTATCTCCCCTGGCGGCGTTACTCCATCGCGAGTTGATAAACCGTCCGGTG
GTGCATGCAGATGAGACTACCCTGAAGATCCTGAACACGAAGAAAGGCGGTAAATCCTGCTCCGGTTATCTGTGGGCATACGTCAGTGGAGAAAGGACGG
GACCGTCAGTTGTGTGCTTCGACTGCCGGACCGGACGTAGCCATGAGTATCCTGAAAACTGGCTTCAGGGCTGGGGCGGGACGCTGGTTGTCGACGGACA
TAAAGCTTACCGGACTCTGGCAAACAAAGTGCCGGAGATCACGCTGGCCGGATGCTGGGCCCATGCCCGCAGGGGCTTCGCCGACCTGTATAAAATCAGT
AAAGATCCACGGGCTGCCATAGCCGTGAAGAAAATCGCGGGGTTGTACCGTCTTGAGAAGAAGATCAGTAGCCGCCCCGTGGAAAAAATCCGCCAGTGGC
GACAGCGTTATGCCCGTCCGATACTGGAAGAACTGTGGTCATGGCTTGAAGAGCAGGAACCGCAATGTTCTCCGGGAAGGGCATTACACAAAGCCATTGC
CTATGCGCTGTCTCATCGCGTGGAACTGAGCCGCTTCCTGGAAGATGGTGCGGTGCCGCTGGATAATAATGTGTGTGAACGGGCCATCAAAAACGTGGTT
CTGGGCAGAAAATCGTGGCTGTTCGCCGGTTCGCAGATGGCGGGAGAACGCGCCGCGCAAATAATGAGCTTGCTGGAAACCGCGAAACGCAACGGTCTGG
AGCCGCATGCCTGGTTGACAGACGTCCTGATGCGTCTGCCGGAGTGGCCGGAGGAGCGACTGGCAGAGTTGCTGCCTCTTGAGGGATTTACCTTCTCCGG
GTGAGTGATACCTGCCGTCAGGTGTTCGTGCACCGGGCCATAACCTGCAGTCGGGAGTTGAACTCCTGACGGCAGGAAATGAGCCAGACCAGCAACCACG
CGGGCTGACAGCGAATAAGAACCTGAATCAGCCCGGAGTCATTCCCGGGTCAGTACATCATCACCTGCTATTAATCCCCGAGTCAGTACTTCGCGGCCTG
ACGCGATCACCCTACTCAGTACTTCAGAGCCCGCCTCCGGTACCTGCCTGGCGATCGCCCCTGTGCAACAAAATAAATCAACCGTTATCAAACCACGATC
GTTCGTATGCGCCGCCGATGACGCGCGCCTGAAAAACGCGTCAATATGAGTTCCCCGGAAGCTTAC
Protein section
ORF number : 3

 

ORF 1
LengthBeginEndStrandFusion ORF
330 bp109 aa88417+No
ORF function : Accessory Gene
AG : IS66 TnpA

ORF sequence :

MSKPRWTIEQKRQHVAAWRASGLTRQQYCELNDIPFTSLREWPQDVAKAERRANEPTVLPVHIAPPLHADVPQPVTNEPVTLFLPGGVRMCCQPSQLTDV
FRALKYAQS

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
393 bp130 aa404796+No
ORF function : Accessory Gene
AG : IS66 TnpB

ORF sequence :

MLSPDNVFIAIKPVDMRRGIDSLTQYIQDELRSTWHEGAAFVFVNKARSRIKVLRWDKHGVWLCTRRLHKGSFRWPRANDAAWHLTPDEFNWLIAGVDWQ
QVKGHDLTKWVWQNEPELRPENTQNTLLTQ

 

Blast result :
ORF 3
LengthBeginEndStrandFusion ORF
1602 bp533 aa8032404+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MNIRIWSGILPCMDISALNTTNDIEKLRAMALAMVQEVMSENAEKERELLEKSRRIQLLEEMLKLVRQQRFGKKCGTLAGMQRSLFEEDVDADIAALTAH
LDKLLPQSPEEDEKASRSRPIRKPLPVHLPRVEKIIQPDTDHCPECDEPLHYIRDAVSEKLEYIPAHFVVNRYVRPQYSCPCCQKVFSGEMPAHILPKSA
VEPSVIAQVIINKYGDHLPLYRQQQVFARSDVGLPVSSMADMVGAAGAALSPLAALLHRELINRPVVHADETTLKILNTKKGGKSCSGYLWAYVSGERTG
PSVVCFDCRTGRSHEYPENWLQGWGGTLVVDGHKAYRTLANKVPEITLAGCWAHARRGFADLYKISKDPRAAIAVKKIAGLYRLEKKISSRPVEKIRQWR
QRYARPILEELWSWLEEQEPQCSPGRALHKAIAYALSHRVELSRFLEDGAVPLDNNVCERAIKNVVLGRKSWLFAGSQMAGERAAQIMSLLETAKRNGLE
PHAWLTDVLMRLPEWPEERLAELLPLEGFTFSG

 

Blast result :
Comments
ISEc49 is 58%(orfA) aa similar to ISVsp4, 72%(orfB) and 58%(orfC) aa similar to ISAzo15.
References
1] ISfinder annotation (2012)
2] Welch,R.A., Burland,V., Plunkett,G. III, Redford,P., Roesch,P., Rasko,D., Buckles,E.L., Liou,S.R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L., Donnenberg,M.S. and Blattner,F.R. (2002) Proc. Natl. Acad. Sci. U.S.A. 99 (26), 17020-17024