IS679
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_002142 | ND | Escherichia coli | Escherichia coli O111:H- DNA, genomic island GEI2.04 Escherichia coli B171 plasmid pB171 Escherichia coli O111:H- DNA, genomic island GEI4.36 Escherichia coli O111:H- DNA, genomic island GEI4.52 Escherichia coli O111:H- DNA, genomic island GEI3.10 |
DNA section
IS Length : 2704 bp
Ends
IR Length : 17/25
IRL : GTAAGCGCATCATTTAAACCGTCTTTCGCCTCCCTTTCCTGTTTCCGATA
IRR : GTAAGCGGCTCGCCAGAACCGTATTGATATTTACTGAGAGGTCAGATCAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGTTAATCCT | GGATGATC | TGGAGCCTCT | 8 |
TTCGGATGTT | GTTAATCT | CAACCGACGT | 8 |
GAATGGAAGA | GTGGAGAC | TGCAGACAGG | 8 |
CAGCCAGGCC | GTTGAAGC | CATTTCTCAT | 8 |
CTTACCACAC | ACTGTATC | TGCTTTTTAT | 8 |
ACCACTGGTA | TAACAAGCAC | 0 |
DNA sequence
GTAAGCGCATCATTTAAACCGTCTTTCGCCTCCCTTTCCTGTTTCCGATACTAATGTCCATTTTCGCAGTAAAAGGACATTTAAGATGAATGCACGAAAA
GCGGTACTGGCAGATAATCCAGAATTGATCCTGCGTGTGCTACAGCTGAGATTTGACGAGTCACTGTCGTACCCGCGCATTTCTGCGCAGACTGGTGTCA
GCAAAACCGCCATTTTTTCTCTGGTGAGGCGATTTCACCAGGTATTCACTGACTGGCCTCTTTCCGGTGAATATTCCTGCGGGCAACTGGCCCGGGCTCT
TTTCCCGGGGCGATACCCTTCAGCCCCGACCGTGACTCAGCCTGTGAAAGCAGAGAAACCCCGCCGGAACCGATTTTCACCGGAGTTTAAATGGAGACTG
GTTCAGCAAACTCTTTTACCCGGTGCCTGTGTCGCACAAATAGCTCGAGAGAATGGAATTAACGATAACCTGCTCTTTAACTGGCGGCATCTCTGGCGTA
ACGGTGGCCTGCAACCGCCCGGCGAACATGAAACATCGCTACTTCCCGTGACGTTAACTCCGGAGCCGGATAATAAAATCCCGGCACCAGTGCAGATACC
TGAACAGATAAATACACTGCCAGACAGTCTGTGCTGCGAGCTGGTTCTGCCGGCCGGAACTCTCAGGCTGAAAGGTGAACTGACACCGGCGTTATTACAG
ACACTTATCCGCGAAATGAAAGGGAGCAGCCACTGATGATATCTCTCCCTGCCGGTTCGCGTATCTGGCTGGTTGCCGGTATCACCGATATGCGAAATGG
CTTTAACGGCCTGGCCTCAAAAGTTCAGAACGTCTTGAAGGATGACCCGTTCTCCGGGCATCTGTTCATCTTCCGCGAACGCCGGGGTGACCAGATAAAA
GTGCTGTGGGCTGACAGTGACGGACTGTGCCTCTTCACCAAACGCCTGGAGCGGGGCCGCTTCGTCTGGCCGGTCACCCGCGATGGAAAGGTTCACCTTA
CTCCGGCTCAGTTGTCCATGCTTCTCGAAGGCATCGACTGGAAGCACCCGAAACGAACGGAACGCGCTGGCATCCGCATATAACCCGTTGTAAAGTGAGG
ATATGGACACCTCACTTGCTCATGAGAATGCCCGCCTGCGGGCACTGTTGCAGACGCAACAGGACACCATCCGCCAGATGGCGAAATACAACCGCCTGCT
CTCACAGCGGGTGGCGGCTTATGCTTCCGAAATCAACCGGCTGAAGGCGCTGGTTGCGAAACTGCAGCGCATGCAGTTCGGTAAAAGCTCAGAAAAACTT
CGCGCTAAAACCGAACGGCAGATACAGGAAGCACAGGAGCGAATCAGCGCACTTCAGGAAGAAATGGCGGAAACGCTGGGTGAGCAATATGACCCGGTAC
TGCCATCCGCCCTGCGCCAGTCTTCAGCCCGTAAACCGTTACCGGCCTCACTTCCCCGTGAAACCCGGGTCATCCGGCCGGAAGAGGAATGCTGCCCGGC
CTGTGGTGGTGACCTTAGTCCTCTGGGGTGTGATGTGTCAGAGCAACTGGAGCTTATCAGCAGCGCCTTTAAGGTTATCGAAACACAACGTCCGAAACTG
GCCTGTTGTCGGTGCGACCATATCGTGCAGGCAACAGTACCTTCAAAACCCATTGCACGCAGTTATGCCGGAGCGGGGCTTCTGGCCCATGTTGTTACCG
GGAAATATGCAGACCATCTGCCGTTATACCGCCAGTCAGAAATATACCGTCGCCAGGGCGTGGAGCTGAGCCGCGCCACGCTGGGGCGCTGGACAGGTGC
CGTTGCTGAACTGCTGGAGCCGCTGTATGACGTCCTGCGCCAGTATGTGCTGATGCCCGGTAAAGTCCATGCTGATGATATCCCCGTCCCGGTCCAGGAG
CCGGGCAGCGGTAAAACCCGGACCGCCCGGCTGTGGGTCTACGTCCGTGATGACCGTAACGCCGGTTCGGAAATGCCCCCGGCGGTCTGGTTCGCGTACT
CACCGGACCGGAAAGGTATCCATCCACAAAATCATCTGGCCGGTTACAGCGGTGTGCTTCAGGCCGATGCTTACGGTGGTTACCGGGTGGTATACGAATC
CGGCAGAATAACGGAAGCCGCGTGTATGGCTCATGCCCGGAGAAAAATCCACGATGTGCATGCAAGAGTGCCCACCGACATCACCACGGAAGCCCTGCAG
CGTATCGGTGAACTGTATGCCATAGAGGCAGAAGTCCGGGGATGTACAGCAGAACAGCGTCTGGCGGCAAGAAAAGCCAGAGCTGCGCCACTGATGCAGA
CACTGTATGACTGGATACAGACTCAGATGAAAACACTGTCGAGTCACTCGGATACGGTAAAAGCGTTCGCATACCTGGTGAAACAGTGGGACGGGCTGAA
CGTGTACTGCAGTAATGGCTGGGTGGAAATCGACAACAACATCGCAGAGAACGCCTTACGGGGAGTGGCCGTAGGCCGGAAAAACTGGCTGTTCGCGGGT
TCCGACAGCGGTGGCGAACATGCGGCGGTGTTGTACTCGCTGATCGGCACATGCCGTCTGAACAATGTGGAACCAGAAAAATGGCTGCGTTACGTCATTG
AGCATATCCAGGATTGGCCGGCAAACCGGGTACGCGATCTGTTGCCCTGGAAAGTTGATCTGACCTCTCAGTAAATATCAATACGGTTCTGGCGAGCCGC
TTAC
GCGGTACTGGCAGATAATCCAGAATTGATCCTGCGTGTGCTACAGCTGAGATTTGACGAGTCACTGTCGTACCCGCGCATTTCTGCGCAGACTGGTGTCA
GCAAAACCGCCATTTTTTCTCTGGTGAGGCGATTTCACCAGGTATTCACTGACTGGCCTCTTTCCGGTGAATATTCCTGCGGGCAACTGGCCCGGGCTCT
TTTCCCGGGGCGATACCCTTCAGCCCCGACCGTGACTCAGCCTGTGAAAGCAGAGAAACCCCGCCGGAACCGATTTTCACCGGAGTTTAAATGGAGACTG
GTTCAGCAAACTCTTTTACCCGGTGCCTGTGTCGCACAAATAGCTCGAGAGAATGGAATTAACGATAACCTGCTCTTTAACTGGCGGCATCTCTGGCGTA
ACGGTGGCCTGCAACCGCCCGGCGAACATGAAACATCGCTACTTCCCGTGACGTTAACTCCGGAGCCGGATAATAAAATCCCGGCACCAGTGCAGATACC
TGAACAGATAAATACACTGCCAGACAGTCTGTGCTGCGAGCTGGTTCTGCCGGCCGGAACTCTCAGGCTGAAAGGTGAACTGACACCGGCGTTATTACAG
ACACTTATCCGCGAAATGAAAGGGAGCAGCCACTGATGATATCTCTCCCTGCCGGTTCGCGTATCTGGCTGGTTGCCGGTATCACCGATATGCGAAATGG
CTTTAACGGCCTGGCCTCAAAAGTTCAGAACGTCTTGAAGGATGACCCGTTCTCCGGGCATCTGTTCATCTTCCGCGAACGCCGGGGTGACCAGATAAAA
GTGCTGTGGGCTGACAGTGACGGACTGTGCCTCTTCACCAAACGCCTGGAGCGGGGCCGCTTCGTCTGGCCGGTCACCCGCGATGGAAAGGTTCACCTTA
CTCCGGCTCAGTTGTCCATGCTTCTCGAAGGCATCGACTGGAAGCACCCGAAACGAACGGAACGCGCTGGCATCCGCATATAACCCGTTGTAAAGTGAGG
ATATGGACACCTCACTTGCTCATGAGAATGCCCGCCTGCGGGCACTGTTGCAGACGCAACAGGACACCATCCGCCAGATGGCGAAATACAACCGCCTGCT
CTCACAGCGGGTGGCGGCTTATGCTTCCGAAATCAACCGGCTGAAGGCGCTGGTTGCGAAACTGCAGCGCATGCAGTTCGGTAAAAGCTCAGAAAAACTT
CGCGCTAAAACCGAACGGCAGATACAGGAAGCACAGGAGCGAATCAGCGCACTTCAGGAAGAAATGGCGGAAACGCTGGGTGAGCAATATGACCCGGTAC
TGCCATCCGCCCTGCGCCAGTCTTCAGCCCGTAAACCGTTACCGGCCTCACTTCCCCGTGAAACCCGGGTCATCCGGCCGGAAGAGGAATGCTGCCCGGC
CTGTGGTGGTGACCTTAGTCCTCTGGGGTGTGATGTGTCAGAGCAACTGGAGCTTATCAGCAGCGCCTTTAAGGTTATCGAAACACAACGTCCGAAACTG
GCCTGTTGTCGGTGCGACCATATCGTGCAGGCAACAGTACCTTCAAAACCCATTGCACGCAGTTATGCCGGAGCGGGGCTTCTGGCCCATGTTGTTACCG
GGAAATATGCAGACCATCTGCCGTTATACCGCCAGTCAGAAATATACCGTCGCCAGGGCGTGGAGCTGAGCCGCGCCACGCTGGGGCGCTGGACAGGTGC
CGTTGCTGAACTGCTGGAGCCGCTGTATGACGTCCTGCGCCAGTATGTGCTGATGCCCGGTAAAGTCCATGCTGATGATATCCCCGTCCCGGTCCAGGAG
CCGGGCAGCGGTAAAACCCGGACCGCCCGGCTGTGGGTCTACGTCCGTGATGACCGTAACGCCGGTTCGGAAATGCCCCCGGCGGTCTGGTTCGCGTACT
CACCGGACCGGAAAGGTATCCATCCACAAAATCATCTGGCCGGTTACAGCGGTGTGCTTCAGGCCGATGCTTACGGTGGTTACCGGGTGGTATACGAATC
CGGCAGAATAACGGAAGCCGCGTGTATGGCTCATGCCCGGAGAAAAATCCACGATGTGCATGCAAGAGTGCCCACCGACATCACCACGGAAGCCCTGCAG
CGTATCGGTGAACTGTATGCCATAGAGGCAGAAGTCCGGGGATGTACAGCAGAACAGCGTCTGGCGGCAAGAAAAGCCAGAGCTGCGCCACTGATGCAGA
CACTGTATGACTGGATACAGACTCAGATGAAAACACTGTCGAGTCACTCGGATACGGTAAAAGCGTTCGCATACCTGGTGAAACAGTGGGACGGGCTGAA
CGTGTACTGCAGTAATGGCTGGGTGGAAATCGACAACAACATCGCAGAGAACGCCTTACGGGGAGTGGCCGTAGGCCGGAAAAACTGGCTGTTCGCGGGT
TCCGACAGCGGTGGCGAACATGCGGCGGTGTTGTACTCGCTGATCGGCACATGCCGTCTGAACAATGTGGAACCAGAAAAATGGCTGCGTTACGTCATTG
AGCATATCCAGGATTGGCCGGCAAACCGGGTACGCGATCTGTTGCCCTGGAAAGTTGATCTGACCTCTCAGTAAATATCAATACGGTTCTGGCGAGCCGC
TTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
651 bp | 216 aa | 86 | 736 | + | No |
AG : IS66 TnpA
ORF sequence :
MNARKAVLADNPELILRVLQLRFDESLSYPRISAQTGVSKTAIFSLVRRFHQVFTDWPLSGEYSCGQLARALFPGRYPSAPTVTQPVKAEKPRRNRFSPE
FKWRLVQQTLLPGACVAQIARENGINDNLLFNWRHLWRNGGLQPPGEHETSLLPVTLTPEPDNKIPAPVQIPEQINTLPDSLCCELVLPAGTLRLKGELT
PALLQTLIREMKGSSH
FKWRLVQQTLLPGACVAQIARENGINDNLLFNWRHLWRNGGLQPPGEHETSLLPVTLTPEPDNKIPAPVQIPEQINTLPDSLCCELVLPAGTLRLKGELT
PALLQTLIREMKGSSH
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
348 bp | 115 aa | 736 | 1083 | + | No |
AG : IS66 TnpB
ORF sequence :
MISLPAGSRIWLVAGITDMRNGFNGLASKVQNVLKDDPFSGHLFIFRERRGDQIKVLWADSDGLCLFTKRLERGRFVWPVTRDGKVHLTPAQLSMLLEGI
DWKHPKRTERAGIRI
DWKHPKRTERAGIRI
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1572 bp | 523 aa | 1103 | 2674 | + | No |
Chemistry : DDE
ORF sequence :
MDTSLAHENARLRALLQTQQDTIRQMAKYNRLLSQRVAAYASEINRLKALVAKLQRMQFGKSSEKLRAKTERQIQEAQERISALQEEMAETLGEQYDPVL
PSALRQSSARKPLPASLPRETRVIRPEEECCPACGGDLSPLGCDVSEQLELISSAFKVIETQRPKLACCRCDHIVQATVPSKPIARSYAGAGLLAHVVTG
KYADHLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDVLRQYVLMPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS
PDRKGIHPQNHLAGYSGVLQADAYGGYRVVYESGRITEAACMAHARRKIHDVHARVPTDITTEALQRIGELYAIEAEVRGCTAEQRLAARKARAAPLMQT
LYDWIQTQMKTLSSHSDTVKAFAYLVKQWDGLNVYCSNGWVEIDNNIAENALRGVAVGRKNWLFAGSDSGGEHAAVLYSLIGTCRLNNVEPEKWLRYVIE
HIQDWPANRVRDLLPWKVDLTSQ
PSALRQSSARKPLPASLPRETRVIRPEEECCPACGGDLSPLGCDVSEQLELISSAFKVIETQRPKLACCRCDHIVQATVPSKPIARSYAGAGLLAHVVTG
KYADHLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDVLRQYVLMPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSEMPPAVWFAYS
PDRKGIHPQNHLAGYSGVLQADAYGGYRVVYESGRITEAACMAHARRKIHDVHARVPTDITTEALQRIGELYAIEAEVRGCTAEQRLAARKARAAPLMQT
LYDWIQTQMKTLSSHSDTVKAFAYLVKQWDGLNVYCSNGWVEIDNNIAENALRGVAVGRKNWLFAGSDSGGEHAAVLYSLIGTCRLNNVEPEKWLRYVIE
HIQDWPANRVRDLLPWKVDLTSQ
Blast result :
Comments
IS679 is respectively 70%, 99% and 98% aa similar to orfA, orfB and orfC of ISCro1.
References
1] Tobe,T., Hayashi,T., Han,C.G., Schoolnik,G.K., Ohtsubo,E. and Sasakawa,C. (1999) Infect. Immun. 67 (10), 5455-5462.