ISCro1

  • Family IS66
  • Group
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
ND Citrobacter rodentium
Citrobacter rodentium ICC168
DNA section
IS Length : 2699 bp

Ends


IR Length : 16/22

IRL : GTAAGCGTCTCATTTAAACCGTCTGGTCTGTTTCCTCCGGCTCCACAAAA
IRR : GTAAGCGGCTCGCCAGAACCGTATTGATATTTACTGAGAGCTCAGATCAA

Insertion site


Left flankDirect repeatRight flankDR Length
GTTTCGGATAAGCCTCGTCCTGTGGGCTAATACTCT8

DNA sequence

GTAAGCGTCTCATTTAAACCGTCTGGTCTGTTTCCTCCGGCTCCACAAAAATAATGTCCATCATTTTTAGTGGACACTATCGTATGGAATACCGGACCTG
GATTACTGAAGCTTTACGCCTTCACTTCGAAGAACATTTACCTCGGGTTGTGGCCGGACGTCGCCTGGGTGTACCAAAATCAACAGTTTGTAGTATGTTC
GTGCGCTTTCGGAGAGCTGGCCTTTCGTGGCCTTTGCCCGCAGGCATGTCGGAGCAGGAACTTGATGCCTGCCTTTACGGACAATTTTCCACGGTACCAG
TCGTACGTCCTGAAAGCACCGTTATATCCGAAACCCCCGTGGTAAAAAAACGTCCCCGGCGGCCCAACTTCCCTTATGAGTTTAAAATCGCCTTAGTGGA
GCAGTCACTGCAGCCCGGAGCCTGTGTGGCGCAGATCGCCCGGGAAAACGGAATCAACGATAACCTGCTCTTCAACTGGCGCCATCAATACCGGAAAGGT
GGCCTGCTGCCTTCCGGAAAAAATATGCCGGCACTGCTTCCCGTGACGTTAACGCCGGAGCCGGATAATAAAATCCCGGCCCCCGCACAGGAACCAGAGC
AGATAAATACACCGTCCGACAGTCTGTGTTGTGAGCTGGTTCTGCCGGCCGGAACTCTCAGGCTTAAAGGTAAACTGACGCCGGCGTTATTACAGACACT
TATCCGCGAAATAAAAGGGAGCAGCCACTGATGATATCTCTCCCTGCAGGTTCGCGTATCTGGCTGGTTGCAGGTATCACCGATATGCGAAATGGCTTTA
ACGGCCTGGCATCAAAAGTTCAGAACGTCCTGAAGGATGACCCGTTCTCCGGACACCTGTTCATCTTCCGCGGACGCCGGGGTGACCAGATAAAAGTGTT
GTGGGCTGACAGTGACGGACTGTGCCTCTTCACCAAACGCCTGGAGCGGGGCCGCTTCATCTGGCCAGTCACCCGTGACGGCAAGGTGCACCTTACTCCG
GCTCAGTTATCCATGCTTCTTGAAGGTATCAACTGGAAGCACCCGAAACGAACGGAACGCGCTGGAATCCGCATATAACCCGTTGTAAAGTGAGGATATG
GACACCTCACTTGCTCATGAGAACGCCCGCCTGCGGGCACTGTTGCAGACGCAACAGGACACCATCCGCCAGATGGCCGAATACAACCGCCTGCTCTCAC
AGCGGGTGGCGGCTTATGCTTCCGAAATCAACCGGCTGAAGGCGCTGGTTGCGAAACTGCAACGTATGCAGTTCGGTAAAAGCTCAGAAAAACTTCGCGC
AAAAACCGAACGGCAGATACAGGATGCACAGGAGAGAATCAGCGCACTTCAGGAAGAAATGGCTGAAACGCTGGGTGAGCAATATGACCCGGCACTGCCA
TCCGCCCTGCGCCAGTCTTCAGCCCGTAAACCGTTACCGGCCTCACTTCCCCGTGAAACCCGGGTTATCCGGCCGGAAGAGGAATGCTGTCCTGCCTGTG
GTGGTGAACTCAGTTCTCTGGGATGTGATGTGTCAGAGCAACTGGAGCTTATCAGCAGCGCCTTTAAGGTTATCGAAACACAACGTCCGAAACTGGCCTG
TTGCCGGTGCGACCATATCGTGCAGGCACCAGTACCTTCAAAACCCATTGCACGCAGTTATGCCGGAGCGGGGCTTCTGGCCCATGTTGTCACCGGGAAA
TATGCAGACCATCTGCCGTTATACCGCCAGTCAGAAATATACCGTCGTCAGGGAGTGGAGCTGAGCCGTGCCACACTGGGGCGCTGGACAGGTGCTGTTG
CTGAACTGCTGGAGCCGCTGTATGACGTCCTGCGCCAGTATGTGCTGATGCCCGGTAAAGTCCATGCTGATGATATCCCCGTCCCGGTCCAGGAGCCGGG
CAGCGGTAAAACCCGGACAGCCCGGCTGTGGGTCTACGTCCGTGATGACCGTAACGCCGGTTCACAGATGCCCCCGGCGGTCTGGTTCGCGTACAGCCCG
GACCGGAAAGGCATACATCCACAGAATCACCTGTCCGGTTACAGCGGAGTGCTTCAGGCCGATGCTTACGGTGGCTACCGGGCGTTATACGAATCCGGCA
GAATAACGGAAGCCGCGTGTATGGCCCATGCCCGGAGAAAAATCCACGATGTGCATGCAAGAGCGCCAACCGATATCACCACGGAAGCCCTGCAGCGTAT
CGGTGAACTGTATGCCATCGAAGCAGAAGTCCGGGGATGTTCAGCAGAACAGCGTCTGGCGGCAAGAAAAGCCAGAGCTGCGTCACTGATGCAGTCACTG
TATGACTGGATACAGACTCAGATGAAAACACTGTCGCGTCACTCGGATACGGCAAAAGCGTTCGCATACCTGCTGAAACAGTGGGATAGCCTGAACGTGT
ACTGCAGTAATGGCTGGGTGGAAATCGACAACAACATCGCAGAGAACGCCTTAAGGGGAGTGGCCGTAGGCCGGAAAAACTGGCTGTTCGCGGGTTCTGA
CAGCGGTGGCGAACATGCGGCGGTGTTGTACTCGCTGATCGGCACATGCCGTCTGAACAATGTGGAGCCAGAAAAATGGCTGCGTTACGTCATTGAGCAT
ATCCAGGACTGGCCGGCAAATCGGGTACGCGATCTGTTGCCCTGGAAAGTTGATCTGAGCTCTCAGTAAATATCAATACGGTTCTGGCGAGCCGCTTAC
Protein section
ORF number : 3

 

ORF 1
LengthBeginEndStrandFusion ORF
678 bp225 aa54731+No
ORF function : Accessory Gene
AG : IS66 TnpA

ORF sequence :

MSIIFSGHYRMEYRTWITEALRLHFEEHLPRVVAGRRLGVPKSTVCSMFVRFRRAGLSWPLPAGMSEQELDACLYGQFSTVPVVRPESTVISETPVVKKR
PRRPNFPYEFKIALVEQSLQPGACVAQIARENGINDNLLFNWRHQYRKGGLLPSGKNMPALLPVTLTPEPDNKIPAPAQEPEQINTPSDSLCCELVLPAG
TLRLKGKLTPALLQTLIREIKGSSH

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
348 bp115 aa7311078+No
ORF function : Accessory Gene
AG : IS66 TnpB

ORF sequence :

MISLPAGSRIWLVAGITDMRNGFNGLASKVQNVLKDDPFSGHLFIFRGRRGDQIKVLWADSDGLCLFTKRLERGRFIWPVTRDGKVHLTPAQLSMLLEGI
NWKHPKRTERAGIRI

 

Blast result :
ORF 3
LengthBeginEndStrandFusion ORF
1571 bp523 aa10982668+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MDTSLAHENARLRALLQTQQDTIRQMAEYNRLLSQRVAAYASEINRLKALVAKLQRMQFGKSSEKLRAKTERQIQDAQERISALQEEMAETLGEQYDPAL
PSALRQSSARKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVSEQLELISSAFKVIETQRPKLACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVTG
KYADHLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDVLRQYVLMPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSQMPPAVWFAYS
PDRKGIHPQNHLSGYSGVLQADAYGGYRALYESGRITEAACMAHARRKIHDVHARAPTDITTEALQRIGELYAIEAEVRGCSAEQRLAARKARAASLMQS
LYDWIQTQMKTLSRHSDTAKAFAYLLKQWDSLNVYCSNGWVEIDNNIAENALRGVAVGRKNWLFAGSDSGGEHAAVLYSLIGTCRLNNVEPEKWLRYVIE
HIQDWPANRVRDLLPWKVDLSSQ

 

Blast result :
Comments
21 identical copies in C. rodentium chromosome, plus 3 remnants. An example of a complete ISCro1 element is found in the C. rodentium genome sequence at co-ordinates 2992292-2994990.
ISCro1 is 54% (ORFA) aa similar to ISSfl3, 72% (ORFB) to ISBcen14 and 72% (ORFC) to ISEc8.
References
1] Nicola Petty (2007) Direct submission.