ISEc47
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AIFX01000063 | ND | Escherichia coli | Escherichia coli DEC6C |
DNA section
IS Length : 2541 bp
Ends
IR Length : 15/22
IRL : GTAAGCGTAAAGCCAATGCCGTCTGTAACACCTGCTCCTTGCAGACTAAA
IRR : GTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGTACTA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGCCGGTTGC | TAAAATGG | TGATATTGGG | 8 |
DNA sequence
GTAAGCGTAAAGCCAATGCCGTCTGTAACACCTGCTCCTTGCAGACTAAATTAGAGCTCCTTCTAAATTAGACGGAGTTCTATTGATGGATAAACCTACA
GACTGGCGCTCCGGAACCCGCCGGATATTTTCTAATGAATTTAAACTTCATATGGTTGAACTGGCTTCGAAACCAAATGCCAATGTTGCACAACTGGCCC
GGGAACATGGCGTTGATAACAACCTGATTTTTAAATGGCTACGCCTCTGGCAAAGAGAAGGACGTATTTCTCGTAGAATGCCTCCAACTATTGTAGGCCC
TACAGTATCACAATCTTTTCCGGCCTCTCCGACTCTGGTTCCAGTGGAACTTATCGACACCCCGCGCTGTGCTACAGATGCTCCTGCTCCGGGGGCATTA
TCAGTTGCTTGTGCAGCTTCCTGCCATGTGGAATTCCATTACGGTAAAATGATGCTGGAAAATCCTTCACCAGAGCTGCTCACGGTGTTGATCCGTGAGC
TGACCGGGAGGGGACGATGATTTCACTCCCATCAGGTACCCGTATCTGGCTCGTTGCCGGCGTTACTGATATGCGTAAATCCTTCAACGGTCTGGGGGAG
CAGATACAGCATGTGCTGGATGATAACCCCTTCTCCGGTCACCTGTTTATCTTCCGTGGCCGACGGGGAGACACGATTAAAATCCTGTGGGCTGATGCTG
ATGGTCTGTGCCTGTTCACCAAACGCCTTGAGGAAGGTCAGTTTATCTGGCCTGCGGTGCGTGACGGTAAGATATCCATTACCCGCTCGCAACTGGCAAT
GCTCCTCGATAAGCTGGACTGGCGTCAGCCAAAAACATCCCGCCTTAACGCACTGACAATGTTGTAAAAAACGCCGGGCCGGATTATAAAAACGGCCATG
AGTCAGAAATACCTCATTCGCATCGCTGAGCTGGAAAGGCTGCTCTCTGAGCAGGCTGAAGCCCTCCGTCAGAAAGACCAGCAACTGAGTCTGGTTGAAG
AGACGGAGGCCTTCCTGCGCTCTGCACTGGCACGTGCCGAAGAAAAGATCGAAGAAGATGAGCGAGAAATAGAGCATCTGCGGGCTCAGATAGAAAAACT
GCGCCGGATGCTGTTCGGAACCCGTTCTGAAAAACTGCGTCGTGAGGTTGAACAGGCTGAAGCCCTGCTGAAACAACGCGAGCAGGAAAGCGATCGTTAC
AGTGGGCGTGAGGATGACCCGCTGGTTCCCCGCCAGTTGCGACAGTCGCGCCATCGTCGTCCGTTACCGGCACATCTCCCCCGTGAAATATACCGCCTGG
AGCCTGAAGAAAGCTGTTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAGTCAGCGCAGAACAACTGGAACTGGTGAGCAGCGCCCTGAAAGT
GATCCGCACAGAACGGGTAAAAAAAGCCTGTACAAAATGTGACTGCATCGTTGAAGCACCGGCGCCGTCCCGCCCGATAGAGCGTGGTATCGCGGGCCCC
GGATTACTTGCCCGCGTGTTAACGGGAAAATACTGCGAACACCTGCCACTGTATCGTCAGAGTGAAATCTTTGCCCGACAGGGTGTCGAACTGAGCCGTG
CATTACTCTCCAACTGGGTTGACGCGTGCTGCCAGTTAATGACGCCGCTGAATGATGCCCTGTACCGTTATGTGATGAATACCCGCAAGCTTCACACTGA
CGACACACCGGTAAAGGTACTGGCACCGGGCCTGAAAAAGACGAAAACAGGGCGCATCTGGACGTATGTCCGGGATGATCGCAATGCGGGTTCGTCATCT
CCTCCGGCGGTCTGGTTCGCGTACTCATCGAACCGGCAGGGGAAACACCCGGAGCAACACCTCCGCCCCTTCCGGGGTATCCTGCAGGCGGATGCGTTCA
CAGGTTATGACAGGCTGTTCAGTGCAGAACGTGAAGGTAGTGCGCTGACAGAAGTTGCGTGCTGGGCTCATGCCCGGAGAAAAATCCACGATGTATACAT
CAGCAGCAAAAGTGCGACGGCAGAAGAAGCCCTGAAGCGAATCAGTGAACTGTACGCCATCGAGGATGAAATACGGGGATTACCAGAGTCAGAGCGTCTT
GCAGCCAGGCAGCAGCGAAGCAAAGCGTTACTGACGTCGCTGCATGAATGGATGGTGGAGAAGAATGGCACGCTGTCGAAAAAATCCAGACTGGGCGAAG
CGTTCAGCTATGTACTGAATCAGTGGGACGCCCTCTGTTATTACAGTGATGACGGTCTGGCGGAGGTGGACAATAACACAGCGGAAAGAGCGCTTCGTGC
AGTCTGTCTCGGAAAGAAAAATTACGTGTTCTTCGGTAGCGATCACGGCGGCGAGCGTGGAGCACTGCTGTACGGGCTGATCGGCACCTGCCGTCTGAAC
GGTATCGATCCGGAAGCGTATCTGCGCCATATTCTGAGCGTACTGCCGGAATGGCCCTCCAACCGTGTTGACGAACTCCTGCCATGGAACGTAGTACTCA
CCAATAAATAAGCGTCAATACGGTGCTCCGTTGACGCTTAC
GACTGGCGCTCCGGAACCCGCCGGATATTTTCTAATGAATTTAAACTTCATATGGTTGAACTGGCTTCGAAACCAAATGCCAATGTTGCACAACTGGCCC
GGGAACATGGCGTTGATAACAACCTGATTTTTAAATGGCTACGCCTCTGGCAAAGAGAAGGACGTATTTCTCGTAGAATGCCTCCAACTATTGTAGGCCC
TACAGTATCACAATCTTTTCCGGCCTCTCCGACTCTGGTTCCAGTGGAACTTATCGACACCCCGCGCTGTGCTACAGATGCTCCTGCTCCGGGGGCATTA
TCAGTTGCTTGTGCAGCTTCCTGCCATGTGGAATTCCATTACGGTAAAATGATGCTGGAAAATCCTTCACCAGAGCTGCTCACGGTGTTGATCCGTGAGC
TGACCGGGAGGGGACGATGATTTCACTCCCATCAGGTACCCGTATCTGGCTCGTTGCCGGCGTTACTGATATGCGTAAATCCTTCAACGGTCTGGGGGAG
CAGATACAGCATGTGCTGGATGATAACCCCTTCTCCGGTCACCTGTTTATCTTCCGTGGCCGACGGGGAGACACGATTAAAATCCTGTGGGCTGATGCTG
ATGGTCTGTGCCTGTTCACCAAACGCCTTGAGGAAGGTCAGTTTATCTGGCCTGCGGTGCGTGACGGTAAGATATCCATTACCCGCTCGCAACTGGCAAT
GCTCCTCGATAAGCTGGACTGGCGTCAGCCAAAAACATCCCGCCTTAACGCACTGACAATGTTGTAAAAAACGCCGGGCCGGATTATAAAAACGGCCATG
AGTCAGAAATACCTCATTCGCATCGCTGAGCTGGAAAGGCTGCTCTCTGAGCAGGCTGAAGCCCTCCGTCAGAAAGACCAGCAACTGAGTCTGGTTGAAG
AGACGGAGGCCTTCCTGCGCTCTGCACTGGCACGTGCCGAAGAAAAGATCGAAGAAGATGAGCGAGAAATAGAGCATCTGCGGGCTCAGATAGAAAAACT
GCGCCGGATGCTGTTCGGAACCCGTTCTGAAAAACTGCGTCGTGAGGTTGAACAGGCTGAAGCCCTGCTGAAACAACGCGAGCAGGAAAGCGATCGTTAC
AGTGGGCGTGAGGATGACCCGCTGGTTCCCCGCCAGTTGCGACAGTCGCGCCATCGTCGTCCGTTACCGGCACATCTCCCCCGTGAAATATACCGCCTGG
AGCCTGAAGAAAGCTGTTGCCCGGAGTGTGGCGGTGAGCTGGATTATCTGGGGGAAGTCAGCGCAGAACAACTGGAACTGGTGAGCAGCGCCCTGAAAGT
GATCCGCACAGAACGGGTAAAAAAAGCCTGTACAAAATGTGACTGCATCGTTGAAGCACCGGCGCCGTCCCGCCCGATAGAGCGTGGTATCGCGGGCCCC
GGATTACTTGCCCGCGTGTTAACGGGAAAATACTGCGAACACCTGCCACTGTATCGTCAGAGTGAAATCTTTGCCCGACAGGGTGTCGAACTGAGCCGTG
CATTACTCTCCAACTGGGTTGACGCGTGCTGCCAGTTAATGACGCCGCTGAATGATGCCCTGTACCGTTATGTGATGAATACCCGCAAGCTTCACACTGA
CGACACACCGGTAAAGGTACTGGCACCGGGCCTGAAAAAGACGAAAACAGGGCGCATCTGGACGTATGTCCGGGATGATCGCAATGCGGGTTCGTCATCT
CCTCCGGCGGTCTGGTTCGCGTACTCATCGAACCGGCAGGGGAAACACCCGGAGCAACACCTCCGCCCCTTCCGGGGTATCCTGCAGGCGGATGCGTTCA
CAGGTTATGACAGGCTGTTCAGTGCAGAACGTGAAGGTAGTGCGCTGACAGAAGTTGCGTGCTGGGCTCATGCCCGGAGAAAAATCCACGATGTATACAT
CAGCAGCAAAAGTGCGACGGCAGAAGAAGCCCTGAAGCGAATCAGTGAACTGTACGCCATCGAGGATGAAATACGGGGATTACCAGAGTCAGAGCGTCTT
GCAGCCAGGCAGCAGCGAAGCAAAGCGTTACTGACGTCGCTGCATGAATGGATGGTGGAGAAGAATGGCACGCTGTCGAAAAAATCCAGACTGGGCGAAG
CGTTCAGCTATGTACTGAATCAGTGGGACGCCCTCTGTTATTACAGTGATGACGGTCTGGCGGAGGTGGACAATAACACAGCGGAAAGAGCGCTTCGTGC
AGTCTGTCTCGGAAAGAAAAATTACGTGTTCTTCGGTAGCGATCACGGCGGCGAGCGTGGAGCACTGCTGTACGGGCTGATCGGCACCTGCCGTCTGAAC
GGTATCGATCCGGAAGCGTATCTGCGCCATATTCTGAGCGTACTGCCGGAATGGCCCTCCAACCGTGTTGACGAACTCCTGCCATGGAACGTAGTACTCA
CCAATAAATAAGCGTCAATACGGTGCTCCGTTGACGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
435 bp | 144 aa | 86 | 520 | + | No |
AG : IS66 TnpA
ORF sequence :
MDKPTDWRSGTRRIFSNEFKLHMVELASKPNANVAQLAREHGVDNNLIFKWLRLWQREGRISRRMPPTIVGPTVSQSFPASPTLVPVELIDTPRCATDAP
APGALSVACAASCHVEFHYGKMMLENPSPELLTVLIRELTGRGR
APGALSVACAASCHVEFHYGKMMLENPSPELLTVLIRELTGRGR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
351 bp | 116 aa | 517 | 867 | + | No |
AG : IS66 TnpB
ORF sequence :
MISLPSGTRIWLVAGVTDMRKSFNGLGEQIQHVLDDNPFSGHLFIFRGRRGDTIKILWADADGLCLFTKRLEEGQFIWPAVRDGKISITRSQLAMLLDKL
DWRQPKTSRLNALTML
DWRQPKTSRLNALTML
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1614 bp | 537 aa | 898 | 2511 | + | No |
Chemistry : DDE
ORF sequence :
MSQKYLIRIAELERLLSEQAEALRQKDQQLSLVEETEAFLRSALARAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKLRREVEQAEALLKQREQESDR
YSGREDDPLVPRQLRQSRHRRPLPAHLPREIYRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAG
PGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPLNDALYRYVMNTRKLHTDDTPVKVLAPGLKKTKTGRIWTYVRDDRNAGSS
SPPAVWFAYSSNRQGKHPEQHLRPFRGILQADAFTGYDRLFSAEREGSALTEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESER
LAARQQRSKALLTSLHEWMVEKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEVDNNTAERALRAVCLGKKNYVFFGSDHGGERGALLYGLIGTCRL
NGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTNK
YSGREDDPLVPRQLRQSRHRRPLPAHLPREIYRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAG
PGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPLNDALYRYVMNTRKLHTDDTPVKVLAPGLKKTKTGRIWTYVRDDRNAGSS
SPPAVWFAYSSNRQGKHPEQHLRPFRGILQADAFTGYDRLFSAEREGSALTEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESER
LAARQQRSKALLTSLHEWMVEKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEVDNNTAERALRAVCLGKKNYVFFGSDHGGERGALLYGLIGTCRL
NGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTNK
Blast result :
Comments
References
1] ISfinder annotation (2012)
2] Rasko,D., Redman,J., Daugherty,S.C., Chibucos,M.C., Tallon,L., Sadzewicz,L., Jones,K., Santana-Cruz,I. and Liu,X. (2012) Direct submission to Genbank.
2] Rasko,D., Redman,J., Daugherty,S.C., Chibucos,M.C., Tallon,L., Sadzewicz,L., Jones,K., Santana-Cruz,I. and Liu,X. (2012) Direct submission to Genbank.