ISPpr4
- Family IS4
- Group IS50
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_006370 | ND | Photobacterium profundum | Photobacterium profundum SS9 |
DNA section
IS Length : 1400 bp
Ends
IR Length : 19/21
IRL : CATAGGTATCTACACAAAATGGTTACGAATTAACCAATCGGCCAAGAGAA
IRR : CATAGGTATCTACACAGATTGAGCAAAAAACGAACTGGCTATTTGCCTCA
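The 19/21 figure can be reproduced by comparing the two terminal sequences position by position. The sketch below is illustrative only: the helper names `ir_identity` and `reverse_complement` are hypothetical, the 21-base window is taken from the IR Length field, and `RIGHT_END` is the last 21 bases of the DNA sequence given in this record. It assumes that IRR is listed as the reverse complement of the right end, which the second check supports.

```python
# Terminal sequences copied from the Ends section of this record.
IRL = "CATAGGTATCTACACAAAATGGTTACGAATTAACCAATCGGCCAAGAGAA"
IRR = "CATAGGTATCTACACAGATTGAGCAAAAAACGAACTGGCTATTTGCCTCA"

# Last 21 bases of the element, copied from the DNA sequence section.
RIGHT_END = "CAATCTGTGTAGATACCTATG"


def ir_identity(left, right, span):
    """Count matching positions over the first `span` bases of each end."""
    matches = sum(1 for a, b in zip(left[:span], right[:span]) if a == b)
    return matches, span


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


matches, span = ir_identity(IRL, IRR, 21)
print(f"IR identity: {matches}/{span}")  # prints "IR identity: 19/21"

# IRR as listed equals the reverse complement of the element's right end.
print(reverse_complement(RIGHT_END) == IRR[:21])  # prints "True"
```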
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GATCTGAAGG | GTTATGAG | AGTAAGCTTG | 8 |
AAACTTAACG | GCTAATAGC | CCAATAAATA | 9 |
ACTTAACTTA | CCCAGAAG | GCTGGTTAGC | 8 |
GATTTAGTAG | CTTATGTG | CTTTTAAAAT | 8 |
TATTAAAGAC | CCTAAAN | CCGATAAATC | 7 |
GTTTTGAGCG | CAGTATTGC | ACCAAAAGAC | 9 |
ACGATAAAGA | GTCTCTGGG | TGCTTTCTTG | 9 |
AAGTATGCCT | CCCTAGAAG | CCTTACTAGA | 9 |
TCGTTTCAAT | CTTCTGGC | AGATATGATC | 8 |
CGACATACTT | GCTATAGC | AGCAATATAA | 8 |
TTTTGCTCGA | CTGATAGC | AGATAAGCTC | 8 |
CGACAGCAGT | CCCCATCAC | CTCAGGATGA | 9 |
GACTAAGCAG | CCTCTAGG | AGCAAAAGGT | 8 |
CAATTGTACC | CTTCTGTG | ATTATTGATG | 8 |
ATTTGATATG | GCTCTAAG | AGAGCGTTGT | 8 |
CCGTATAACC | CTTATGCTG | AGCAGATGAA | 9 |
TATATCATTT | CTGTATGGG | AGCTAAGAGA | 9 |
GAGCAGAGTA | GTGATAGG | CTTAATAATT | 8 |
TGAAACGAGC | NNNNTNAGG | GATTCAATCT | 9 |
AGGCTATGAC | | GTTTTACAGA | 0 |
AAGTGCCTTG | CTTAGAGC | CCAACAAATT | 8 |
TTTCGGAAGT | CTCCTAAC | CCCTCAAGTC | 8 |
CTGCCATAAT | | GTTATTGCCT | 0 |
CTGCTAGGGC | | CCTTCAAGCT | 0 |
GGTTGGATCT | GATAGAAG | AGACAATCTA | 8 |
TTCCAAACCA | CCAATAGAC | CATAAAAGCG | 9 |
GTCCCATAAC | | CCTATATACC | 0 |
TAGATGTAAT | GCTAAAAG | TGCCTAGAGG | 8 |
CCTCTAACCA | CGTAGAGC | CTTTTGTCTC | 8 |
GTCCTTGAAC | | GTTAGAGGCG | 0 |
AAACACCAAG | GCTATAAG | TGCCTAAAGC | 8 |
TGCCTAAAGC | | CTCCCGAAGG | 0 |
TACTGTACCT | GCCATAAC | TGGAACCAAT | 8 |
TATATACCAA | GCCTTAAAC | CGCAATAACC | 9 |
TTACTAAGTC | GCTTTAAG | TGATAATTAA | 8 |
CTCTGATACC | NNTAAAGN | GGGCTATTTG | 8 |
ATTGGTAAGC | GCTCATCAG | TGGTTAAATA | 9 |
CAGCCAGATG | | CCTTTACCTA | 0 |
TAGTTAAATC | | CTTATAACCG | 0 |
CACCTTAACC | GCCATAAG | TGGTGAAGAA | 8 |
CGGAAGACCC | CCAATTAGG | TGATGATAGA | 9 |
GCGCTGGTTT | GTTATGAG | CTTATTAAGC | 8 |
GTTTTAATCG | GTTATGTG | GGCTTTCGTA | 8 |
TTTCTGATGT | GTCCTGCAC | TGACGAAAAT | 9 |
GATGGAAAGC | CCTATTAGG | GGTTTCACAG | 9 |
ATTGTTGGGG | GCCTGTAGG | TGTTCAATCT | 9 |
ACTCCCGAAG | | GTTAACGGTA | 0 |
TTACTGACAA | GGTAGAAG | ACCTTCAGAA | 8 |
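The DR Length column in the table above can be cross-checked against the direct repeat strings and tallied by length. The following is a minimal sketch, not part of the original record: `parse_row` and `tally_dr_lengths` are hypothetical helper names, and only the first three rows of the table are included as sample input. Ambiguous bases (N) are counted like any other character.

```python
from collections import Counter

# First three rows of the insertion site table; further rows can be added as-is.
ROWS = [
    "GATCTGAAGG | GTTATGAG | AGTAAGCTTG | 8 |",
    "AAACTTAACG | GCTAATAGC | CCAATAAATA | 9 |",
    "ACTTAACTTA | CCCAGAAG | GCTGGTTAGC | 8 |",
]


def parse_row(line):
    """Split one 'Left flank | Direct repeat | Right flank | DR Length |' row."""
    cells = [c.strip() for c in line.strip().rstrip("|").split("|")]
    if len(cells) != 4:
        raise ValueError(f"expected 4 cells, got {len(cells)}: {line!r}")
    left, dr, right, length = cells
    return left, dr, right, int(length)


def tally_dr_lengths(rows):
    """Check each DR string against its stated length and count rows per length."""
    tally = Counter()
    for line in rows:
        _, dr, _, length = parse_row(line)
        if len(dr) != length:
            raise ValueError(f"DR {dr!r} has length {len(dr)}, table says {length}")
        tally[length] += 1
    return tally


print(dict(tally_dr_lengths(ROWS)))
```

Rows with a DR length of 0 have an empty direct repeat cell, so they pass the same length check as long as the empty cell is kept in place.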
DNA sequence
CATAGGTATCTACACAAAATGGTTACGAATTAACCAATCGGCCAAGAGAAGGGAACTGATCTAAGGATCAACGATCAAGAGAGTTAACTTTCATTTCATT
AGTATTGACGATGACTCTTTTTGAACAACATCAATTACCCTGTATCCTTGAATCAAGATTATCTAAGCGTTATCAGACCCTTATAATGGAACACATGACA
GTTAATTCTAGCAATGCACCAGGTGTAGAATCTCTTCGCCACCACACACAATCATGGGCATCGACACAAGCAACATGGCGTTTTTATCATAATGAGGATG
TGACTTTTCCTATGCTAAGTGGCCCGATGCTGGGACTTGCTCGTTCTGGTGTGAAAGAAAGTCAAAGTCGATATGTATTAATGGCTCATGATTGGTGCCA
TATCAATTTCGCTAAACATCATAGTAAGTTAGATAAAACTAAGATGTCACACGCTCTCGATGTTGGCTACGAACTGCAAGCGTCTTTATTGGTAGACGCA
AATACTGGCGCACCCATTGCTCCAGCAGGTCTTAACTTACTGACAAGCAACGGTATTTATCAATGCCGAAGCCAAGAGTTACAACCCAAGCAAAGTCACC
TAGATTCACTCTTTGACAGCATTCATTGGCAAGAACAATTAGATTTAGACAAGCCCCTGGTGCATGTTGTTGATAGAGAAGCAGATTCAGCGAAAGACTT
AAGACGTTTAGGCTCAGTTCACTGGCTAACTCGAACTAAAAAAGGCTCAACGTTCCGTCACGAAGGTCAGTTTAAAACGGCTGAAATCATCAGTCGAACA
ATCTCCCCAGACTTGAAAGGTGTTATTTCTCTTCGAGGTAAAGAGGGCTATTTGTTTGTTGGTGAAACGACTGTTGAGTTACACCGGAAATCAGAAAAGC
TAGCGTCAGCGGCGCCCACCTGTCGCTTTGTTATGAGCCTGGTCACGGATGATGAAGGTAAAGAGCTAGCAAGATGGTATCTGCTGTCTAACGTGTTGGA
TGTTGATGCAACAGAGATTGCAACGTGGTATTGCCATCGCTGGAATATTGAATCTTGGTTTAAGTTATTGAAGTCAGATGGTCATCAGTTAGAAAAATGG
CAGCAAACTACTGCGGAGTCAATATTAAAGCGTCTGATCACAGCCAGTGTTGCAACGACGTTGATATTTAAGCTTTATTCGGACAGCTTGGATGAAGCTA
ATGAATTTAAAGGTTTTTTGGTTAAGCTGAGTGGTCGTTTAACTAAGCGAACAAAGCCTGTCACTCAGCCATCACTGCTTGCGGGACTATGGGTTTTCCT
ACAAATGTGTGAAGTACTAGATACCTACACCATGGATGAGATAAACGCGATGAGGCAAATAGCCAGTTCGTTTTTTGCTCAATCTGTGTAGATACCTATG
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1287 bp | 428 aa | 102 | 1388 | + | No |
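The ORF figures in the table above are related by simple arithmetic, which can be checked as follows. This is a sketch under stated assumptions: `orf_span` is a hypothetical helper, and the coordinates are assumed to be 1-based and inclusive. The span works out to one codon more than the 428 residues listed; the record does not state how that extra codon is accounted for, so the sketch only reports it.

```python
def orf_span(begin, end):
    """Return (nucleotides, codons) for a 1-based, inclusive coordinate pair."""
    nucleotides = end - begin + 1
    if nucleotides % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    return nucleotides, nucleotides // 3


# Values taken from the ORF 1 row of this record.
BEGIN, END, LENGTH_AA = 102, 1388, 428

nucleotides, codons = orf_span(BEGIN, END)
print(nucleotides)         # prints 1287
print(codons)              # prints 429
print(codons - LENGTH_AA)  # prints 1
```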
Chemistry : DDE
ORF sequence :
MTMTLFEQHQLPCILESRLSKRYQTLIMEHMTVNSSNAPGVESLRHHTQSWASTQATWRFYHNEDVTFPMLSGPMLGLARSGVKESQSRYVLMAHDWCHI
NFAKHHSKLDKTKMSHALDVGYELQASLLVDANTGAPIAPAGLNLLTSNGIYQCRSQELQPKQSHLDSLFDSIHWQEQLDLDKPLVHVVDREADSAKDLR
RLGSVHWLTRTKKGSTFRHEGQFKTAEIISRTISPDLKGVISLRGKEGYLFVGETTVELHRKSEKLASAAPTCRFVMSLVTDDEGKELARWYLLSNVLDV
DATEIATWYCHRWNIESWFKLLKSDGHQLEKWQQTTAESILKRLITASVATTLIFKLYSDSLDEANEFKGFLVKLSGRLTKRTKPVTQPSLLAGLWVFLQ
MCEVLDTYTMDEINAMRQIASSFFAQSV
Blast result :
Comments
ISPpr4 shares 36% amino acid similarity with ISAzo5.
ISPpr4 was found by screening completely sequenced genomes for sequences homologous to the IS50R transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-93-D(N3)-125-E(C1). The copy number in Photobacterium profundum SS9 is 48, 20 copies lying on chromosome 1 and 28 on chromosome 2.
References
[1] Vezzi A, Campanaro S, D'Angelo M, Simonato F, Vitulo N, Lauro F, Cestaro A, Malacrida G, Simionati B, Cannata N, Bartlett D, Valle G (2004) Unpublished
[2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol, 8(1):18