ISEc12
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
DQ269483 | Y | Escherichia coli | Escherichia coli APEC O1 |
DNA section
IS Length : 2581 bp
Ends
IR Length : 20/24
IRL : TGCGTATTTTCATGAAAGGAGATCACTCAATAACTTCCATCGAGATCGGG
IRR : TGCGTATTACCGTGAAGGAGATCGGTGAGTAACATCGATGGAGATCGGTT
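The two terminal repeats listed above can be compared programmatically. The sketch below is illustrative only: the helper names `reverse_complement` and `shared_prefix_length` are hypothetical, and `RIGHT_END` is simply the last 50 bases of the DNA sequence given in this record. It suggests that IRR is reported as the reverse complement of the right terminus, so that both repeats read outward-in and can be compared position by position.

```python
# Hypothetical helper names; all sequences are copied from this record.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def shared_prefix_length(a: str, b: str) -> int:
    """Count identical positions from the start until the first mismatch (no gaps)."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


IRL = "TGCGTATTTTCATGAAAGGAGATCACTCAATAACTTCCATCGAGATCGGG"
IRR = "TGCGTATTACCGTGAAGGAGATCGGTGAGTAACATCGATGGAGATCGGTT"

# Last 50 bases of the element as listed under "DNA sequence".
RIGHT_END = "AACCGATCTCCATCGATGTTACTCACCGATCTCCTTCACGGTAATACGCA"

print(reverse_complement(RIGHT_END) == IRR)  # True
print(shared_prefix_length(IRL, IRR))        # 8
```

The ungapped prefix count is only a quick check of the perfectly conserved tips; the "20/24" figure in the record presumably comes from an alignment that allows a gap, which this sketch does not attempt to reproduce.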
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATCCTTCATA | CTCCT | GCCGGTCAGA | 5 |
DNA sequence
TGCGTATTTTCATGAAAGGAGATCACTCAATAACTTCCATCGAGATCGGGTAATAACATTTGAACAGATCGCTGAATAACATCGATGGAGATCACTTTTG
ACTCATTTTGTTATTCAGTGATCTCCATCAATGTTATTGGAACTTCACAGGTGTGTTGATCTGTATCTTTTGCCATTCCGGTAAAGGATACCTATGCCAA
CAGTTCCAATTTCTATGAGAAAACTTAAAGAAATTCTTAGGCTTAAATACGGTGTTGGACTCAGCCATCGACAAATTGGTCGTAGTCTTGCAATCTCCCC
TTCCGTTGTATCCAGATATGCTAATCGGGCGGCTCAACTTGGCATAAAGCAGTGGCCCTTACCTACAGGATGGGATGATACAAAACTAAAACATGCGTTC
CTTCAGACCCAGGTTAAGATGAAGAAGCACTCTCTGCCTGACTGGGCTACAGTACACCGGGAACTGCGTAATAAATGCGTGACGCTGCAGCTACTCTGGG
AAGAATACTGTGAGCGTAATCCAGGCGGTTTTTACAGCTATAACCATTACTGCCGGATGTACCGTGAATGGCTCAAAACCACTTCACCATCAATGCGTCA
GGTACATAAAGCTGGCGAAAAACTTTTCGTTGATTACTGTGGACCTACCGTTGGCGTTACCGACCCTGAGACCGGAGAAATAAGAACTGCTCAGGTCATC
GTAGCTGTTCTCGGGGCATCAAGTTACACATGGGCAGAGGCCACCTGGTCTCAGCAGCTTGAAGACTGGGTGATGAGTCATGTTCGCTGCTTCCAGTGGT
TGGGTGGCGTTCCTGAACTTGTTGTTCCGGACAATCTGAAAAGCGCCACATCCAGGGCATGTAAGTATGATCCTGACGTTAACCCTACCTACCAGCAGAT
GCTTGAGCATTATAATGTCGCAGTTTTGCCTGCGCGGCCACGTAAACCGAAAGATAAAGCCAAAGCTGAAGTTGGCGTTCAGGTTGTTGAACGCTGGATC
ATGGCCCGAATCAGGCATGAGATCTTCTACAGCCTTGCATCGCTTAATCAGCGCATTCGGGAGTTGCTGGAAAGACTGAATAACAAAATAATGCAGAAGT
TGGGTTATTCACGTGCAGAACTCTTCATCCAGCTTGATAAACCCGCACTGAAGCCTCTTCCTGAAGCCAGTTACAGTTACACCCTGGTGAAGAAAGTCAG
AGTTCATGCCGATTACCACGTGGAAATCGACAAACATTACTACTCGGTTCCATGTTCGCTGTTAGGCCAGCAACTGGAAGCATGGATCTCCGGAGAACTG
GTAAGACTCTTCAATCAGGGGCAGGAGGTTGCTGTGCACCCGCGCAAGCGTACTTATGGCTACAGTACCCGCAACGAGCACATGCCTGAAGCTCATCGAC
AGCATGCCACCTGGACGCCAGAGCGTCTTCTGGAATGGGCGGGGCACATAGGCAGTGAAACTCATAGTTATGTGCTTCATATACTGAACTCTCGTCCACA
TCCGGAACAAAGCTATCGCTTCTGCCTTGGACTCCTGAACCTTCATAAAAAATACAGTAAAGCCAGACTTAATGCAGCATGTGCAAGAGCTCTGAAAACA
AAGGTATGGCGTCTGTCAGGTATTAAATCGATCCTGGAAAAAGGTCTGGATAAACAACCTGTTCAGGATCCAAAACCAGATCTGTTATCCACGATGGAAC
ACGAAAACGTACGCGGCAGTGAGTATTACCACTGATACGGGATCCAATGATGAATCATCTTTACGAACAACTGACCGCACTTAAACTCACCGGCTTCCGT
GATGCGCTTAAAAAGCAACTTGCTCAGCCGGGCACATACCAGGAGCTGGGCTTCGAAGAACGCCTGTCATTACTGACAGCAGAAGAACTAACCTGCCGTG
AAAACAGGAAGGCAGAGCGTCTGATCAAACATGCACGGTTCAGACTTAATGCTGAGTTATCAAAGCTGGATTATCGTAACAATAGAGGGCTGGACAGGGC
CCTCATCCGTTCACTCAGTCAGGGAAACTGGTTAACCCTGAAACAAAATATTTTACTGACCGGGGCCACCGGCAGCGGTAAAACGTTCCTGGCATGTGCA
CTTGGTCATAATGCCTGCCGACAGGGATACAAGGTCTACTATTATCGCCTTAAAGCGCTGATGGAACAGTGCTATCAGGGGCATGCTGATGGAAGATACA
GCAAACTTTTGACCAGGCTGAATAATAGCGATCTGCTGCTTCTGGATGACTGGGGGCTGGAACCTCTCTCATCAGAACAGCGTAGCGACCTGCTGGAAAT
AGTGGATCTGATGTACCAACGAGGCTCAATCATCGTAGTGAGCCAGTTGCCGGTGGAAAACTGGTACAAAATGATCGGAGACTCCACACATGCGGATGCC
ATCCTAGATCGACTGGTTCATGGCAGTATCAAGATCGAACTTAAAGGAGAATCAATGCGGAAAATACAATCTCCGTTGACCGAAGGAGATCAGTGAAGGT
AATTTAAAAACGGTTCTGTGAAAGTGACACGAACCGATCTCCATCGATGTTACTCACCGATCTCCTTCACGGTAATACGCA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1542 bp | 513 aa | 194 | 1735 | + | No |
Chemistry : DDE
ORF sequence :
MPTVPISMRKLKEILRLKYGVGLSHRQIGRSLAISPSVVSRYANRAAQLGIKQWPLPTGWDDTKLKHAFLQTQVKMKKHSLPDWATVHRELRNKCVTLQL
LWEEYCERNPGGFYSYNHYCRMYREWLKTTSPSMRQVHKAGEKLFVDYCGPTVGVTDPETGEIRTAQVIVAVLGASSYTWAEATWSQQLEDWVMSHVRCF
QWLGGVPELVVPDNLKSATSRACKYDPDVNPTYQQMLEHYNVAVLPARPRKPKDKAKAEVGVQVVERWIMARIRHEIFYSLASLNQRIRELLERLNNKIM
QKLGYSRAELFIQLDKPALKPLPEASYSYTLVKKVRVHADYHVEIDKHYYSVPCSLLGQQLEAWISGELVRLFNQGQEVAVHPRKRTYGYSTRNEHMPEA
HRQHATWTPERLLEWAGHIGSETHSYVLHILNSRPHPEQSYRFCLGLLNLHKKYSKARLNAACARALKTKVWRLSGIKSILEKGLDKQPVQDPKPDLLST
MEHENVRGSEYYH
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
750 bp | 249 aa | 1747 | 2496 | + | No |
AG : IS21 helper
ORF sequence :
MMNHLYEQLTALKLTGFRDALKKQLAQPGTYQELGFEERLSLLTAEELTCRENRKAERLIKHARFRLNAELSKLDYRNNRGLDRALIRSLSQGNWLTLKQ
NILLTGATGSGKTFLACALGHNACRQGYKVYYYRLKALMEQCYQGHADGRYSKLLTRLNNSDLLLLDDWGLEPLSSEQRSDLLEIVDLMYQRGSIIVVSQ
LPVENWYKMIGDSTHADAILDRLVHGSIKIELKGESMRKIQSPLTEGDQ
Blast result :
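The ORF tables above can be sanity-checked with a little arithmetic. The following is a minimal sketch, assuming that Begin and End are 1-based inclusive coordinates and that each span includes the stop codon (an assumption, though one consistent with the lengths given); `orf_lengths` is a hypothetical helper name.

```python
def orf_lengths(begin: int, end: int) -> tuple:
    """Return (length in bp, length in aa) for 1-based inclusive coordinates.

    Assumes the span ends with a stop codon, which is not counted as a residue.
    """
    bp = end - begin + 1
    if bp % 3 != 0:
        raise ValueError("ORF span is not a multiple of three")
    return bp, bp // 3 - 1


print(orf_lengths(194, 1735))   # (1542, 513), matching ORF 1
print(orf_lengths(1747, 2496))  # (750, 249), matching ORF 2
```

Under the same assumption, the two reading frames are separated by 11 bp (positions 1736 to 1746).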
Comments
ISEc12 is 65% (ORF1) and 67% (ORF2) aa similar to ISPpu7.
Eight copies of ISEc12 are found within the genome of avian pathogenic E. coli (APEC) strain O1, with two of these occurring on plasmids.
ISEc12 was isolated from a turkey clinically diagnosed with avian colibacillosis.
File updated : 2014-08-27 : ends extended on each side and direct repeat sequence added.
References
[1] Kariyawasam, S., Johnson, T.J., and Nolan, L.K. Pap Operon of an Avian Pathogenic Escherichia coli Strain APEC O1 is Located on a Novel Pathogenicity Island. Infect. Immun. (Will be published in January 2006).