ISSba11
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009052 | ND | Shewanella baltica | Shewanella baltica OS155 |
DNA section
IS Length : 2394 bp
Ends
IR Length : 44/54
IRL : TATTGCGCGACAATTACCCTGACCGGTTCAGCGACAATTAGAATGGCCGG
IRR : TATAGCGCGTCAATTAGCTTGGCCGATCCAGCGACAATTAACTTGGCCGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGGTCAGGGC | GAAGGCAATT | 0 | |
CAGCGAAGGC | GAACGTGATT | 0 | |
TATGAACGTG | CAGGGCGTAC | 0 | |
CGTTGTCCTT | GACGACTGAC | 0 | |
TTGGCAAGGC | GACAACTGCT | 0 | |
CTGCATCGGG | AACATACCAT | 0 | |
ACCCTTAGAC | GTAGGAGCTT | 0 | |
TTGTAACGTG | AACGGGTGAT | 0 | |
CAAACCCATC | ACCATCGAAA | 0 | |
CTTACACACT | CACGTTACTG | 0 | |
TGTGCTTCCA | CGGC | TTTGCCATTT | 4 |
AAACCTAACA | CATC | ATAATCAGTG | 4 |
ATCAAATACC | CACTAT | GCTGGATTTA | 6 |
ATTTGAAAGT | ACCATT | ACTTCCTGAA | 6 |
TTACTTGAAT | GACTTC | AATATTTTCT | 6 |
TACATTGTCC | GACGAC | TAAAACGACA | 6 |
TCATAATTCA | TAGCCC | CAACAACCAT | 6 |
GGGGCAACAA | AGACAT | TTACTAAAAA | 6 |
ACATTTCAAC | TGTTCT | TCCCTTCACA | 6 |
GCTTAGCAAC | AGCAAG | AGAATGGGTG | 6 |
ATCGAATTAA | GAAGGC | GTAATTATAC | 6 |
TTTAACTTCC | CTATAG | ACACTAGAGC | 6 |
TTGCAACTAT | CCCTTA | AGTAATTATA | 6 |
TCAAAATTTG | AACGTT | ACACTAAACG | 6 |
ACCACGGTAT | AAAGAC | TCAAAGCGAT | 6 |
TCTACCCATG | ATCCCC | TGACCAAGTT | 6 |
AACAAGGTAT | CACCTT | AGAAGAATCT | 6 |
CTGTAGAAAC | GGTGTC | TATGACATCT | 6 |
TGCATCATTA | AGCGAG | CAAGTTTATT | 6 |
GCGTCTGTGA | CACCTG | ATAAACTGAG | 6 |
AAGTGTATCA | CTACGGG | CTTAGTGACT | 7 |
GAAAGGCTAA | GTGGGTG | GGATTTATGA | 7 |
GAAAGTCAAA | CCACTAC | GCTGGCAGCG | 7 |
CTTTTATTTC | GTCTCAG | GCTAAATCTT | 7 |
DNA sequence
TATTGCGCGACAATTACCCTGACCGGTTCAGCGACAATTAGAATGGCCGGTTGATCTGATACATTCTGGACAAAGATTCTAAGGGAGTTCTTTGTGCCAG
GTCATCGAATTACAGATCAACAAATAAGGCTATTTATGTCTAAACGAAAAGATCACCTTCAAGTTACTGCGGCGAGCAAAGCTGGCATTTCGGAACGCTC
TGCAAGACGAATTGAATCCGGCCAGCGACAATTAGGCCCATCAAAACCTCGAAACTACCGTACTCGTACCGACCCCCTAGAACTTGTATGGGACCCCGTC
GTTTTACCACTCCTACAACTTTCTGACACTATTACTCCTGTTGGCGTGTTTGATTATCTTTGCGAAGAATATTCAGATGTTTTCGCTGTAACGCTAAGGC
GAACCCTTGAACGACGTATTCAAAAATGGCGACAGATAAATGGCCAAGATAAAGAGGTTATTTTTCGCCAAGTTAAAGAATTCGGTCAGTTGGGGATCAT
GGATTTCACTTGGGCGGATTTCATTGTCACGATTAGAGGAACAACTCTTAAGCATCGATTTTTTAACTATCGATTACCTGCAAGTGGTTGGAGTTACGTT
CAAGTCGTATACGGCGGAGAAAGCTTCGTAGCTGTTGCTACAGGCTTACAAAATGCCTTTGAACAATCAAATGGTGTTCCACAGGAAGTCAGAACGGATA
GCTTGAGCGCTGCTTATAAAAACCATTCCAACGAAACATTATTTACTGAACGATTTTCAGAATTATCGATTCATTATGGTTTCAAACCTTCTAAAAATAA
CACCGGCATTGCCCATGAAAATGGTGCCATTGAAAGCGCCAACAATCATCTAAAAAACCAAATCCGACAAGCTTTGGCTATTCGTGGTTCAAGTGACTTT
GATAGCATTGACGAGTATGAAACATTTATTGATGACGTCGTCCAAAGACGTAATCGCCGTATTATGGCACTTCTCATCGATGAGCAACGACAATTACAAC
CCTTACCTAAATTTGAAAGTGTTAATTACGAAATTTACCCAGTAAAAGTATCGAGCACCAGTACGTTTCAGTTAAAACGAGTGACTTATTCAGTACCATC
CAGACTCGTCGGTGCAACATTGCGCGTGCATCTTTTCGATAAAAAATTAGATATCTATTGCCATGGAGTGCACACCGCCACGCTCACTCGCGTACATGCG
TCAGCAAATAATCGCGGTCATCAAATTAATTATCGTCACTTAATCGGTGCACTGATGAAAAAACCTCGAGCATTCAGGGGGGGCCAATGGCGAGACCAAC
TGCTTCCAAATGAAGACTATCGCCAAATATGGAAGAATGTCGATGCTCTACTAAGTGCTGATGAAGCAAGTCTTTATATGGTTAGGTTACTCAATATCGC
CAGTAAATCAGATCGTGAAGAAGCGGTAGGAAGATTCGTTCTCGAAGGGATAAATCTAGGGCAACGGCCAAACATAGTTGACTGTGAAGAACGTTTTTTA
AAAGACGAAGAGTGGGAATTTAACCCTAAAGTACAACAACATAGCTTAGCGTCTTATCAGCAAATTCTTGACGAGGTGAATGAATATGTCAGTTGAAACG
TTGCCTATTATTCTCAAAGAGCTTCGACTAGTGAGCTTACTTCCACATTGGCAACCGTTGGCTGAGAAAGCACGAGAGCAGCATTGGCCGGTGGAGCGTT
ACTTAGCTGAATTATGCCAATTAGAACTCAGTTGCAGAGAACAAAAGCGATTGCACCGAGGCTTAAAAGAAGCAACGTTACCAATAGGTAAATATCTTGA
CACCTATGATTTTAGCGAAGTTGAAGGCTTATCCAAGAAGCAAGTCTGGCATTTAGCCGATAATGCTGAGTGGTTGAAAACAGGAGATAATATCTTGCTG
TTCGGTGCCAGTGGTTTAGGTAAAACACATATTGCCGCTGGACTTGGTTATCGTCTTGTAGAGCAAGGGCATAGAGTTAAATTTATGAGTGCGAGCCTAC
TTGTGCAGCACCTGCAAAAAGCGAAAGAAGAGCTGAGATTGCCAGAAGCCCTAGTCAAATTGGACAGATTTGCAGTGTTAATCTTAGACGATCTAGGCTA
CGTGCAAAAAAGTACAGAAGAAACTAGCGTCTTGTTCGAGCTTATCGCGCATCGTTATGAAAGATATAGCCTAATAATAACCTCAAACCAATCATTCGAA
GATTGGGATAAGTTATTCAGTGACACAGTGATGACCGTAGCTGCAATCGATAGGCTGATCCACCACGCGAAAATTTTGCAATGCAAAGGAGAAAGCTACA
GGCGAAAAGAAGCCCAAAACAAACTAAATTAAACCGAGTCTCAACCGGCCAAGTTAATTGTCGCTGGATCGGCCAAGCTAATTGACGCGCTATA
GTCATCGAATTACAGATCAACAAATAAGGCTATTTATGTCTAAACGAAAAGATCACCTTCAAGTTACTGCGGCGAGCAAAGCTGGCATTTCGGAACGCTC
TGCAAGACGAATTGAATCCGGCCAGCGACAATTAGGCCCATCAAAACCTCGAAACTACCGTACTCGTACCGACCCCCTAGAACTTGTATGGGACCCCGTC
GTTTTACCACTCCTACAACTTTCTGACACTATTACTCCTGTTGGCGTGTTTGATTATCTTTGCGAAGAATATTCAGATGTTTTCGCTGTAACGCTAAGGC
GAACCCTTGAACGACGTATTCAAAAATGGCGACAGATAAATGGCCAAGATAAAGAGGTTATTTTTCGCCAAGTTAAAGAATTCGGTCAGTTGGGGATCAT
GGATTTCACTTGGGCGGATTTCATTGTCACGATTAGAGGAACAACTCTTAAGCATCGATTTTTTAACTATCGATTACCTGCAAGTGGTTGGAGTTACGTT
CAAGTCGTATACGGCGGAGAAAGCTTCGTAGCTGTTGCTACAGGCTTACAAAATGCCTTTGAACAATCAAATGGTGTTCCACAGGAAGTCAGAACGGATA
GCTTGAGCGCTGCTTATAAAAACCATTCCAACGAAACATTATTTACTGAACGATTTTCAGAATTATCGATTCATTATGGTTTCAAACCTTCTAAAAATAA
CACCGGCATTGCCCATGAAAATGGTGCCATTGAAAGCGCCAACAATCATCTAAAAAACCAAATCCGACAAGCTTTGGCTATTCGTGGTTCAAGTGACTTT
GATAGCATTGACGAGTATGAAACATTTATTGATGACGTCGTCCAAAGACGTAATCGCCGTATTATGGCACTTCTCATCGATGAGCAACGACAATTACAAC
CCTTACCTAAATTTGAAAGTGTTAATTACGAAATTTACCCAGTAAAAGTATCGAGCACCAGTACGTTTCAGTTAAAACGAGTGACTTATTCAGTACCATC
CAGACTCGTCGGTGCAACATTGCGCGTGCATCTTTTCGATAAAAAATTAGATATCTATTGCCATGGAGTGCACACCGCCACGCTCACTCGCGTACATGCG
TCAGCAAATAATCGCGGTCATCAAATTAATTATCGTCACTTAATCGGTGCACTGATGAAAAAACCTCGAGCATTCAGGGGGGGCCAATGGCGAGACCAAC
TGCTTCCAAATGAAGACTATCGCCAAATATGGAAGAATGTCGATGCTCTACTAAGTGCTGATGAAGCAAGTCTTTATATGGTTAGGTTACTCAATATCGC
CAGTAAATCAGATCGTGAAGAAGCGGTAGGAAGATTCGTTCTCGAAGGGATAAATCTAGGGCAACGGCCAAACATAGTTGACTGTGAAGAACGTTTTTTA
AAAGACGAAGAGTGGGAATTTAACCCTAAAGTACAACAACATAGCTTAGCGTCTTATCAGCAAATTCTTGACGAGGTGAATGAATATGTCAGTTGAAACG
TTGCCTATTATTCTCAAAGAGCTTCGACTAGTGAGCTTACTTCCACATTGGCAACCGTTGGCTGAGAAAGCACGAGAGCAGCATTGGCCGGTGGAGCGTT
ACTTAGCTGAATTATGCCAATTAGAACTCAGTTGCAGAGAACAAAAGCGATTGCACCGAGGCTTAAAAGAAGCAACGTTACCAATAGGTAAATATCTTGA
CACCTATGATTTTAGCGAAGTTGAAGGCTTATCCAAGAAGCAAGTCTGGCATTTAGCCGATAATGCTGAGTGGTTGAAAACAGGAGATAATATCTTGCTG
TTCGGTGCCAGTGGTTTAGGTAAAACACATATTGCCGCTGGACTTGGTTATCGTCTTGTAGAGCAAGGGCATAGAGTTAAATTTATGAGTGCGAGCCTAC
TTGTGCAGCACCTGCAAAAAGCGAAAGAAGAGCTGAGATTGCCAGAAGCCCTAGTCAAATTGGACAGATTTGCAGTGTTAATCTTAGACGATCTAGGCTA
CGTGCAAAAAAGTACAGAAGAAACTAGCGTCTTGTTCGAGCTTATCGCGCATCGTTATGAAAGATATAGCCTAATAATAACCTCAAACCAATCATTCGAA
GATTGGGATAAGTTATTCAGTGACACAGTGATGACCGTAGCTGCAATCGATAGGCTGATCCACCACGCGAAAATTTTGCAATGCAAAGGAGAAAGCTACA
GGCGAAAAGAAGCCCAAAACAAACTAAATTAAACCGAGTCTCAACCGGCCAAGTTAATTGTCGCTGGATCGGCCAAGCTAATTGACGCGCTATA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1503 bp | 500 aa | 94 | 1596 | + | No |
Chemistry : DDE
ORF sequence :
MPGHRITDQQIRLFMSKRKDHLQVTAASKAGISERSARRIESGQRQLGPSKPRNYRTRTDPLELVWDPVVLPLLQLSDTITPVGVFDYLCEEYSDVFAVT
LRRTLERRIQKWRQINGQDKEVIFRQVKEFGQLGIMDFTWADFIVTIRGTTLKHRFFNYRLPASGWSYVQVVYGGESFVAVATGLQNAFEQSNGVPQEVR
TDSLSAAYKNHSNETLFTERFSELSIHYGFKPSKNNTGIAHENGAIESANNHLKNQIRQALAIRGSSDFDSIDEYETFIDDVVQRRNRRIMALLIDEQRQ
LQPLPKFESVNYEIYPVKVSSTSTFQLKRVTYSVPSRLVGATLRVHLFDKKLDIYCHGVHTATLTRVHASANNRGHQINYRHLIGALMKKPRAFRGGQWR
DQLLPNEDYRQIWKNVDALLSADEASLYMVRLLNIASKSDREEAVGRFVLEGINLGQRPNIVDCEERFLKDEEWEFNPKVQQHSLASYQQILDEVNEYVS
LRRTLERRIQKWRQINGQDKEVIFRQVKEFGQLGIMDFTWADFIVTIRGTTLKHRFFNYRLPASGWSYVQVVYGGESFVAVATGLQNAFEQSNGVPQEVR
TDSLSAAYKNHSNETLFTERFSELSIHYGFKPSKNNTGIAHENGAIESANNHLKNQIRQALAIRGSSDFDSIDEYETFIDDVVQRRNRRIMALLIDEQRQ
LQPLPKFESVNYEIYPVKVSSTSTFQLKRVTYSVPSRLVGATLRVHLFDKKLDIYCHGVHTATLTRVHASANNRGHQINYRHLIGALMKKPRAFRGGQWR
DQLLPNEDYRQIWKNVDALLSADEASLYMVRLLNIASKSDREEAVGRFVLEGINLGQRPNIVDCEERFLKDEEWEFNPKVQQHSLASYQQILDEVNEYVS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
745 bp | 248 aa | 1588 | 2332 | + | No |
AG : IS21 helper
ORF sequence :
MSVETLPIILKELRLVSLLPHWQPLAEKAREQHWPVERYLAELCQLELSCREQKRLHRGLKEATLPIGKYLDTYDFSEVEGLSKKQVWHLADNAEWLKTG
DNILLFGASGLGKTHIAAGLGYRLVEQGHRVKFMSASLLVQHLQKAKEELRLPEALVKLDRFAVLILDDLGYVQKSTEETSVLFELIAHRYERYSLIITS
NQSFEDWDKLFSDTVMTVAAIDRLIHHAKILQCKGESYRRKEAQNKL
DNILLFGASGLGKTHIAAGLGYRLVEQGHRVKFMSASLLVQHLQKAKEELRLPEALVKLDRFAVLILDDLGYVQKSTEETSVLFELIAHRYERYSLIITS
NQSFEDWDKLFSDTVMTVAAIDRLIHHAKILQCKGESYRRKEAQNKL
Blast result :
Comments
There are 33 intact and 3 degenerate copies of ISSba11 encoded in the S. baltica OS155 genome. All but 2 copies are encoded on the chromosome with the remaining to are encoded in plasmid 2. Orf1 has 40% identity with ISEc10 and Orf2 has 51% identity with ISAli13.
References
1] Romine, M. F. (2008) Direct submission