ISEc53
- Family ISL3
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
HG941718 | ND | Escherichia coli | Escherichia coli O25b:H4-ST131 EC958 |
DNA section
IS Length : 1885 bp
Ends
IR Length : 19/23
IRL : GGGTCTTCCCCTGTTGTGGTGGGTGGTACATAATTGAGCGATTTTCAAAC
IRR : GGGTCTTCCCCGATCATGGTGGGAAGACTCAGTACGCCATATTCAGTTTC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCGATGGATA | TATAT | ACACTTTGCC | 5 |
DNA sequence
GGGTCTTCCCCTGTTGTGGTGGGTGGTACATAATTGAGCGATTTTCAAACACCACACAGTACCTTTATGGTGTCTCTTTTTACATGAGAGGGAATTCTTA
TGAGTGTGCCGCAAACAAAAGCTGAACTGCTTTTAGCTATTGATAAAAATTTTAGTAAATTAATTAGTTACCTCAACACAATCCCACCAGAAATTACTTC
AGATAAATCAATGGACGGACACGCCAAAGGAACGGAGATGAGTGTTCGTGATCTCGTTTCGTATCTGCTTGGATGGAATGCTCTTGTTGTAAAGTGGATC
GCTTCTGATGCTAAAGGTCTGCCTGTCGATTTTCCGGAAACTGGCTATAAATGGAATCAGCTTGGCCTTCTTGCTCAAAAATTTTACTCAGATTACAGTG
AGTTAAGTTATGAGTTGTTAGTAGCTGAACTTCAGACTGTAAAAAATGAGATTGTGAACCTTATTAATGATCGTACCGATGATATTTTGTATGGAAGACC
ATGGTACACAAAATGGACGATGGGGAGAATGATCTCATTTAACACATCTTCGCCTTACGCCAACGCTAATGGAAGATTAAGAAAGTGGGCAAAAAATAAT
AATATCAGTTTAAAGTAAGCTTCAGAATGAAATTATATGGACGAAAAATCCCTCTATGCCCATATCCTTAACCTGACTGCACCATGGCAGGTTAAATCCC
TTACCCTCGATGAAAATGCAGGTTCCGTTACTGTTACAGTCGGAATTGCTGAAAATACTCAGTTAACCTGTCCGACCTGCAGGAAATCCTGTTCTGTTCA
CGATCACCGACATCGTAAATGGCGCCACCTTGATACCTGCCAGTTCATGACATTAGTAGAAGCCGATGTTCCCCGCGTTATGTGCCCGGAGCATGGCTGC
CAGACTCTGCCTGTACCGTGGGCAGGTTCCGGCAGTCGGTACACTCTGCTGTTCGAATCGTTCGTGCTCTCATGGCTTAAAATCAGCACCGTTGATGCGG
TCAGAAAACAACTTAAACTTAGCTGGAATGCCGTTGACGGCATCATGACCCGCGCGGTTAAGCGAGGCCTGTCGCGGATTAAAAAGCCTTTATCAGTGCG
TCATATGAATGTAGACGAAGTCGCCTTTAAAAAAGGGCATAGGTATATAACCGTGGTATCTGATCGCGACGGGCGGGCACTGGCATTAACCGATGATCGT
GGCACAGAGAGTCTTGCCAGCTATCTCCGTTCGCTTACTGACAGTCAGTTGTTGGCCATCAAAACACTGTCGATGGACATGAATGCGGGCTATATAAGAG
CAGCGCGTATCCATTTACCCAATGCGGTCGAGAAAATCGCCTTCGATCGCTTCCATGTGGCGAAGCAACTGGGCGAAGTGGTTGATAAAACCCGCCAGAA
TGAACATCCGCACCTCCCTGTTGAAAGTCGTCGTCAGGCCAAAGGTACCCGCTTCCTGTGGCAGTACAGCGACAAATGGATGACTGAGTCCCGGCAGGAA
AAGCTGATGTGGTTGCGGGAACAGATGCAACAGACAAGCCAGTGCTGGACACTGAAAGAGCTGGCAAAAAATATCTGGGATCGCCCCTGGAGCACAGAAC
GCAGGAATGACTGGTTGCAGTGGATATCGCTGGCGTCTGAATGTGATGTGCCGATGATGAAAAACGCAGCGAAAACCATTAAAAAACGGTTATACGGAAT
ACTGAATGCAATGCGTCATCGCGTCTCGAATGGAAATGCGGAGGCGCTGAACAGCAAGATCAGACTGCTGAGGATAAAGGCCAGGGGATACCGAAACCGG
GAACGCTTTAAACTGGGAGTTATGTTCCACTATGGGAAACTGAATATGGCGTACTGAGTCTTCCCACCATGATCGGGGAAGACCC
TGAGTGTGCCGCAAACAAAAGCTGAACTGCTTTTAGCTATTGATAAAAATTTTAGTAAATTAATTAGTTACCTCAACACAATCCCACCAGAAATTACTTC
AGATAAATCAATGGACGGACACGCCAAAGGAACGGAGATGAGTGTTCGTGATCTCGTTTCGTATCTGCTTGGATGGAATGCTCTTGTTGTAAAGTGGATC
GCTTCTGATGCTAAAGGTCTGCCTGTCGATTTTCCGGAAACTGGCTATAAATGGAATCAGCTTGGCCTTCTTGCTCAAAAATTTTACTCAGATTACAGTG
AGTTAAGTTATGAGTTGTTAGTAGCTGAACTTCAGACTGTAAAAAATGAGATTGTGAACCTTATTAATGATCGTACCGATGATATTTTGTATGGAAGACC
ATGGTACACAAAATGGACGATGGGGAGAATGATCTCATTTAACACATCTTCGCCTTACGCCAACGCTAATGGAAGATTAAGAAAGTGGGCAAAAAATAAT
AATATCAGTTTAAAGTAAGCTTCAGAATGAAATTATATGGACGAAAAATCCCTCTATGCCCATATCCTTAACCTGACTGCACCATGGCAGGTTAAATCCC
TTACCCTCGATGAAAATGCAGGTTCCGTTACTGTTACAGTCGGAATTGCTGAAAATACTCAGTTAACCTGTCCGACCTGCAGGAAATCCTGTTCTGTTCA
CGATCACCGACATCGTAAATGGCGCCACCTTGATACCTGCCAGTTCATGACATTAGTAGAAGCCGATGTTCCCCGCGTTATGTGCCCGGAGCATGGCTGC
CAGACTCTGCCTGTACCGTGGGCAGGTTCCGGCAGTCGGTACACTCTGCTGTTCGAATCGTTCGTGCTCTCATGGCTTAAAATCAGCACCGTTGATGCGG
TCAGAAAACAACTTAAACTTAGCTGGAATGCCGTTGACGGCATCATGACCCGCGCGGTTAAGCGAGGCCTGTCGCGGATTAAAAAGCCTTTATCAGTGCG
TCATATGAATGTAGACGAAGTCGCCTTTAAAAAAGGGCATAGGTATATAACCGTGGTATCTGATCGCGACGGGCGGGCACTGGCATTAACCGATGATCGT
GGCACAGAGAGTCTTGCCAGCTATCTCCGTTCGCTTACTGACAGTCAGTTGTTGGCCATCAAAACACTGTCGATGGACATGAATGCGGGCTATATAAGAG
CAGCGCGTATCCATTTACCCAATGCGGTCGAGAAAATCGCCTTCGATCGCTTCCATGTGGCGAAGCAACTGGGCGAAGTGGTTGATAAAACCCGCCAGAA
TGAACATCCGCACCTCCCTGTTGAAAGTCGTCGTCAGGCCAAAGGTACCCGCTTCCTGTGGCAGTACAGCGACAAATGGATGACTGAGTCCCGGCAGGAA
AAGCTGATGTGGTTGCGGGAACAGATGCAACAGACAAGCCAGTGCTGGACACTGAAAGAGCTGGCAAAAAATATCTGGGATCGCCCCTGGAGCACAGAAC
GCAGGAATGACTGGTTGCAGTGGATATCGCTGGCGTCTGAATGTGATGTGCCGATGATGAAAAACGCAGCGAAAACCATTAAAAAACGGTTATACGGAAT
ACTGAATGCAATGCGTCATCGCGTCTCGAATGGAAATGCGGAGGCGCTGAACAGCAAGATCAGACTGCTGAGGATAAAGGCCAGGGGATACCGAAACCGG
GAACGCTTTAAACTGGGAGTTATGTTCCACTATGGGAAACTGAATATGGCGTACTGAGTCTTCCCACCATGATCGGGGAAGACCC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
519 bp | 172 aa | 100 | 618 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MSVPQTKAELLLAIDKNFSKLISYLNTIPPEITSDKSMDGHAKGTEMSVRDLVSYLLGWNALVVKWIASDAKGLPVDFPETGYKWNQLGLLAQKFYSDYS
ELSYELLVAELQTVKNEIVNLINDRTDDILYGRPWYTKWTMGRMISFNTSSPYANANGRLRKWAKNNNISLK
ELSYELLVAELQTVKNEIVNLINDRTDDILYGRPWYTKWTMGRMISFNTSSPYANANGRLRKWAKNNNISLK
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1221 bp | 406 aa | 637 | 1857 | + | No |
Chemistry : Unknow
ORF sequence :
MDEKSLYAHILNLTAPWQVKSLTLDENAGSVTVTVGIAENTQLTCPTCRKSCSVHDHRHRKWRHLDTCQFMTLVEADVPRVMCPEHGCQTLPVPWAGSGS
RYTLLFESFVLSWLKISTVDAVRKQLKLSWNAVDGIMTRAVKRGLSRIKKPLSVRHMNVDEVAFKKGHRYITVVSDRDGRALALTDDRGTESLASYLRSL
TDSQLLAIKTLSMDMNAGYIRAARIHLPNAVEKIAFDRFHVAKQLGEVVDKTRQNEHPHLPVESRRQAKGTRFLWQYSDKWMTESRQEKLMWLREQMQQT
SQCWTLKELAKNIWDRPWSTERRNDWLQWISLASECDVPMMKNAAKTIKKRLYGILNAMRHRVSNGNAEALNSKIRLLRIKARGYRNRERFKLGVMFHYG
KLNMAY
RYTLLFESFVLSWLKISTVDAVRKQLKLSWNAVDGIMTRAVKRGLSRIKKPLSVRHMNVDEVAFKKGHRYITVVSDRDGRALALTDDRGTESLASYLRSL
TDSQLLAIKTLSMDMNAGYIRAARIHLPNAVEKIAFDRFHVAKQLGEVVDKTRQNEHPHLPVESRRQAKGTRFLWQYSDKWMTESRQEKLMWLREQMQQT
SQCWTLKELAKNIWDRPWSTERRNDWLQWISLASECDVPMMKNAAKTIKKRLYGILNAMRHRVSNGNAEALNSKIRLLRIKARGYRNRERFKLGVMFHYG
KLNMAY
Blast result :
Comments
ISEc53 is 83% aa (ORFB : the transposase) similar to ISSoEn4.
The first ORF is a passenger gene annotated as hypothetical protein.
The first ORF is a passenger gene annotated as hypothetical protein.
References
1] Totsika,M., Beatson,S.A., Sarkar,S., Phan,M.D., Petty,N.K., Bachmann,N., Szubert,M., Sidjabat,H.E., Paterson,D.L., Upton,M. and Schembri,M.A.(2011) PLoS ONE 6 (10), E26578