ISLpn6
- Family IS4
- Group IS50
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_002942 | ND | Legionella pneumophila | Legionella pneumophila subsp. pneumophila Philadelphia 1 |
DNA section
IS Length : 1490 bp
Ends
IR Length : 19
IRL : CATACTCTTATACATAAGTATTAATCCATGTAAAATTCATATTTTTTATA
IRR : CATACTCTTATACATAAGTTATTCTAACTCAATGTATGAAAGGTTTCCCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTTACCCTCT | CCCCAAAGGGG | CGAGGGAATG | 11 |
CATATCCGCT | CCCTGATAGT | TGCGGCTCGG | 10 |
CCCAGCCGCG | ACCGTGAGGA | AGCGGACTTT | 10 |
DNA sequence
CATACTCTTATACATAAGTATTAATCCATGTAAAATTCATATTTTTTATATGGAACTTAGATGGCTAGTTGTGAATGGATTATTAATGAATCCAAGGAAG
CGAACTTCGGTGATAAGCGGTTAGATAAGCGTTATGGAGATATCTTAAGCGGGTTACTTAGCTCACCAAATGCAAGCATTCCAACTACATTTCAAAGTTG
GAATGAAACACTGGCAGCCTACCGGTTTTTTAACCATGTGAATGTAAATCCGGAATCAATTCTTTTACCCCATTCTGAAGCGACACTAGAACGTATAAAA
GCAGAAAAGATAGTTCTAATTCCCCAAGACACAACAGAGGTAGATTTTACAGGTAGAAAATCATTGTCAGGCATGGGTTATCTCTCCAACGAGAGAAGCC
GGGGATTATATTTACATCCAAGCATTGCATTTACGCCAGAACGAGTTTGTTTGGGCGTAGTAGAAATGCAGCATTGGATAAGAAAAGAGATTGGTACGAG
GAATAGTAGAAAGGGAAAGTCAATTGAGGAAAAGGAAACCTATTGTTGGCTTAAGGGCTATAATGCAGCGAATAAAATTGCACTGGCAGTACCCGATACA
ATGGTTGTCAGTATTTCCGATAGAGAGGGAGATATTTATGAGGTTCTTGAAAAACTTCCCTCCGAAGAAAATAAAGCCTATTGGCTTATACGTTGTCAGC
ATGATAGAGCTGTTTTGAATGAAGAAACAAATCAATTCGAGTTACTTCTTAAAAAAGAAGTTAGCAAAGCTTGTGTTCTTGGCACCATAGAGTTTGAAAT
TCCCGCAGGCACTTCCTATAGAAATTGCAAAAAAAGACATACTCGAAAAGCCCGTAGCGTTAGACAGGAGATTCGTATATGTAGTGTTTCTTTAAGGCCT
CCTCGTCGAAAAAGCAAAAAATTGAACGTCATTGAAATTCAGGTTGTACACTGCAAAGAAATAAATACACCAGAAGGGGAGCAACCCGTTGAATGGTTTC
TAATCACAAGTGTCCCTATAAAAACACTGGATCGGGCAGTTGAAATTGTTAATTGGTATTTATGTAGGTGGTTAATAGAAATGTACATAAAAATTTTAAA
GAGTGGATGCAAGATAGAGGAATTACGATTTGAAACTTACGAGGCTACTCTTAATTGCATCGCTTTTTATATGATAGTGGCATGGCGGGTATTTTATTTA
ACAATGTTGGGACGAACTTGCCCTGATATAGACTGTACAACAGTCTTTGAAGATAACGAATGGCAAGCCACTTATGCCATGGCAACAAAGAAAAAACCAC
CAAAAAAACCACCTAAACTTTATGAAATTATTTTAATGATCGCAAAATTTGGAGGACATTTAGGAAGAGGCTCTGATGGCCTCCCTGGAACAAAAGTCAT
GTGGATTGGTCTACAACGCATGAAAGATTTTACTTTGGCTTGGGAAACCTTTCATACATTGAGTTAGAATAACTTATGTATAAGAGTATG
CGAACTTCGGTGATAAGCGGTTAGATAAGCGTTATGGAGATATCTTAAGCGGGTTACTTAGCTCACCAAATGCAAGCATTCCAACTACATTTCAAAGTTG
GAATGAAACACTGGCAGCCTACCGGTTTTTTAACCATGTGAATGTAAATCCGGAATCAATTCTTTTACCCCATTCTGAAGCGACACTAGAACGTATAAAA
GCAGAAAAGATAGTTCTAATTCCCCAAGACACAACAGAGGTAGATTTTACAGGTAGAAAATCATTGTCAGGCATGGGTTATCTCTCCAACGAGAGAAGCC
GGGGATTATATTTACATCCAAGCATTGCATTTACGCCAGAACGAGTTTGTTTGGGCGTAGTAGAAATGCAGCATTGGATAAGAAAAGAGATTGGTACGAG
GAATAGTAGAAAGGGAAAGTCAATTGAGGAAAAGGAAACCTATTGTTGGCTTAAGGGCTATAATGCAGCGAATAAAATTGCACTGGCAGTACCCGATACA
ATGGTTGTCAGTATTTCCGATAGAGAGGGAGATATTTATGAGGTTCTTGAAAAACTTCCCTCCGAAGAAAATAAAGCCTATTGGCTTATACGTTGTCAGC
ATGATAGAGCTGTTTTGAATGAAGAAACAAATCAATTCGAGTTACTTCTTAAAAAAGAAGTTAGCAAAGCTTGTGTTCTTGGCACCATAGAGTTTGAAAT
TCCCGCAGGCACTTCCTATAGAAATTGCAAAAAAAGACATACTCGAAAAGCCCGTAGCGTTAGACAGGAGATTCGTATATGTAGTGTTTCTTTAAGGCCT
CCTCGTCGAAAAAGCAAAAAATTGAACGTCATTGAAATTCAGGTTGTACACTGCAAAGAAATAAATACACCAGAAGGGGAGCAACCCGTTGAATGGTTTC
TAATCACAAGTGTCCCTATAAAAACACTGGATCGGGCAGTTGAAATTGTTAATTGGTATTTATGTAGGTGGTTAATAGAAATGTACATAAAAATTTTAAA
GAGTGGATGCAAGATAGAGGAATTACGATTTGAAACTTACGAGGCTACTCTTAATTGCATCGCTTTTTATATGATAGTGGCATGGCGGGTATTTTATTTA
ACAATGTTGGGACGAACTTGCCCTGATATAGACTGTACAACAGTCTTTGAAGATAACGAATGGCAAGCCACTTATGCCATGGCAACAAAGAAAAAACCAC
CAAAAAAACCACCTAAACTTTATGAAATTATTTTAATGATCGCAAAATTTGGAGGACATTTAGGAAGAGGCTCTGATGGCCTCCCTGGAACAAAAGTCAT
GTGGATTGGTCTACAACGCATGAAAGATTTTACTTTGGCTTGGGAAACCTTTCATACATTGAGTTAGAATAACTTATGTATAAGAGTATG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1404 bp | 468 aa | 61 | 1464 | + | No |
Chemistry : DDE
ORF sequence :
MASCEWIINESKEANFGDKRLDKRYGDILSGLLSSPNASIPTTFQSWNETLAAYRFFNHVNVNPESILLPHSEATLERIKAEKIVLIPQDTTEVDFTGRK
SLSGMGYLSNERSRGLYLHPSIAFTPERVCLGVVEMQHWIRKEIGTRNSRKGKSIEEKETYCWLKGYNAANKIALAVPDTMVVSISDREGDIYEVLEKLP
SEENKAYWLIRCQHDRAVLNEETNQFELLLKKEVSKACVLGTIEFEIPAGTSYRNCKKRHTRKARSVRQEIRICSVSLRPPRRKSKKLNVIEIQVVHCKE
INTPEGEQPVEWFLITSVPIKTLDRAVEIVNWYLCRWLIEMYIKILKSGCKIEELRFETYEATLNCIAFYMIVAWRVFYLTMLGRTCPDIDCTTVFEDNE
WQATYAMATKKKPPKKPPKLYEIILMIAKFGGHLGRGSDGLPGTKVMWIGLQRMKDFTLAWETFHTLS
SLSGMGYLSNERSRGLYLHPSIAFTPERVCLGVVEMQHWIRKEIGTRNSRKGKSIEEKETYCWLKGYNAANKIALAVPDTMVVSISDREGDIYEVLEKLP
SEENKAYWLIRCQHDRAVLNEETNQFELLLKKEVSKACVLGTIEFEIPAGTSYRNCKKRHTRKARSVRQEIRICSVSLRPPRRKSKKLNVIEIQVVHCKE
INTPEGEQPVEWFLITSVPIKTLDRAVEIVNWYLCRWLIEMYIKILKSGCKIEELRFETYEATLNCIAFYMIVAWRVFYLTMLGRTCPDIDCTTVFEDNE
WQATYAMATKKKPPKKPPKLYEIILMIAKFGGHLGRGSDGLPGTKVMWIGLQRMKDFTLAWETFHTLS
Blast result :
Comments
ISLpn6 is 56% aa similar to ISAzo5.
ISLpn6 was found by screening completely sequenced genomes for sequences homologous to the IS50R transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-96-D(N3)-152-E(C1). The copy number in 'Legionella pneumophila subsp. pneumophila str. Philadelphia 1' is 3.
ISLpn6 was found by screening completely sequenced genomes for sequences homologous to the IS50R transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-96-D(N3)-152-E(C1). The copy number in 'Legionella pneumophila subsp. pneumophila str. Philadelphia 1' is 3.
References
1] Chien,M., Morozova,I., Shi,S., Sheng,H., Chen,J., Gomez,S.M., Asamani,G., Hill,K., Nuara,J., Feder,M., Rineer,J., Greenberg,J.J., Steshenko,V., Park,S.H., Zhao,B., Teplitskaya,E., Edwards,J.R., Pampou,S., Georghiou,A., Chou,I.-C., Iannuccilli,W., Ulz,M.E., Kim,D.H., Geringer-Sameth,A., Goldsberry,C., Morozov,P., Fischer,S.G., Segal,G., Qu,X., Rzhetsky,A., Zhang,P., Cayanis,E., De Jong,P.J., Ju,J., Kalachikov,S., Shuman,H.A. and Russo,J.J.(2004)Science 305(5692), 1966-1968.
2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18
2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18