ISLpn4
- Family IS4
- Group IS50
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_002942 | ND | Legionella pneumophila | Legionella pneumophila subsp.pneumophila Philadelphia 1 |
DNA section
IS Length : 1498 bp
Ends
IR Length : 14/19
IRL : CTGAATCTTCTACACATCTTGTCTATATCAAAGAAACATGATCAAATCGC
IRR : CTGTCTCTTGATCACATCTCCATCAACTCGTTAGCTATCCTAAGCCCTTC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GACATCTTT | GGTAGAGCCA | 0 |
DNA sequence
CTGAATCTTCTACACATCTTGTCTATATCAAAGAAACATGATCAAATCGCCTCATTTTGAGGTAATGTATGGATTTAGCAATTGAGGATTCTGCGGCATG
GTCGGAAGCTATTTTTGGTTCAGTTGATTTAGGTGATAAACGCCTGACTCATCGGTTAGCACAGATAGGTAAACAACTAACATCCTTGCCTGGTGGTTCT
CTTCCAGAAAGTTGTGAAGGACAGGATGCACTCATAGAAGGGAGTTATCGGTTTTTACGCAATAAACGAGTTACTGCGAGTCAAATTGCAGAGGGTGGTT
ACCAAGTAACAAGCTGGTTATCTCAGCGAATCCCTACTCTTTTGGCGATTGAAGACACAACCACACTATCCTATACCCATCAAATAAAAGAATCATTAGG
TGATTTAGGCGGTCCTAAAGAAAAAAGCAATCGAGGATTTCATGTTCACACCACCATGCTCATGGATGCAGAACAAGAGAAAACAATAGGATTAATAGCG
CAAGAACGTTGGTGTCGAGATATCAAAGAAAGAGGCAAAAAAAATCATCGACGAGTAAGGTTGTATACAGAGAAAGAAAGTTATAAATGGGAAAAGAATA
CTCGAGAGTTAGAGCGTCGTCTGGGTTCTAAAATGTCTGATGTGATTTCTGTTTGCGATAGGGAAGCCGATATTTTTGAATACATTCAGTATAAGTTAGA
TAATCATCAACGTTTTATTGTTCGAGCGAGCCATAATCGAAAACTGGAAGGAAGTTATGGTTATTTATTTCAAATGTTACCTTCTGCAACAATTTTGGGT
ACTTATACAATTGCAATAGCTCAAAAAGCAAATAGAAAGAAACGCCAAGCGGCTCTTGAATTAAAAACAGCATCCGTTACTTTCTCACTACCAGAGCGAA
GAGCCAAAGCACGGGAATTAAAGCCAATCACTTTAAATGTGGTTATCGCAAAAGAGAAAAATCCATCTGAAAGCGATTGCCTTGAATGGGTTCTATTAAC
AACAGAAGCAACCACAACATTAGAGTGTGCTCGAAAAATAACACGATATTATGAGATGAGATGGCGCATTGAGGATTTTCATAAAGCATGGAAATCGGGG
GTTGGTGCTGAAGAGCAGCGTATGCAATCAATTGAAAATTTAGAGAAAATGATTGTCATTCTCTCATTTGTAGCAATTCGATTACTCCAATTAAAAGAGT
ATTTTGAATACCCGACTACTCTAGTTATCAATGATAGCAGTACTTCTTGCGATGAATTGCTCACTGATGCTAAATGGAAAGTCTTATGGAATAGTGTTGA
AAGAAAATCATTACCTGAAAAAATACCTACTGCTGCTTGGGCTTATAAAGCAATAGCCAAGTTGGGCGGTTGGACTGATTCCAAAAGAACTGGGAAAGCA
GCTTGGTCTACTATCTGGAAAGGATGGTTCCGATTACAGGAACGAGTAGAAGGGCTTAGGATAGCTAACGAGTTGATGGAGATGTGATCAAGAGACAG
GTCGGAAGCTATTTTTGGTTCAGTTGATTTAGGTGATAAACGCCTGACTCATCGGTTAGCACAGATAGGTAAACAACTAACATCCTTGCCTGGTGGTTCT
CTTCCAGAAAGTTGTGAAGGACAGGATGCACTCATAGAAGGGAGTTATCGGTTTTTACGCAATAAACGAGTTACTGCGAGTCAAATTGCAGAGGGTGGTT
ACCAAGTAACAAGCTGGTTATCTCAGCGAATCCCTACTCTTTTGGCGATTGAAGACACAACCACACTATCCTATACCCATCAAATAAAAGAATCATTAGG
TGATTTAGGCGGTCCTAAAGAAAAAAGCAATCGAGGATTTCATGTTCACACCACCATGCTCATGGATGCAGAACAAGAGAAAACAATAGGATTAATAGCG
CAAGAACGTTGGTGTCGAGATATCAAAGAAAGAGGCAAAAAAAATCATCGACGAGTAAGGTTGTATACAGAGAAAGAAAGTTATAAATGGGAAAAGAATA
CTCGAGAGTTAGAGCGTCGTCTGGGTTCTAAAATGTCTGATGTGATTTCTGTTTGCGATAGGGAAGCCGATATTTTTGAATACATTCAGTATAAGTTAGA
TAATCATCAACGTTTTATTGTTCGAGCGAGCCATAATCGAAAACTGGAAGGAAGTTATGGTTATTTATTTCAAATGTTACCTTCTGCAACAATTTTGGGT
ACTTATACAATTGCAATAGCTCAAAAAGCAAATAGAAAGAAACGCCAAGCGGCTCTTGAATTAAAAACAGCATCCGTTACTTTCTCACTACCAGAGCGAA
GAGCCAAAGCACGGGAATTAAAGCCAATCACTTTAAATGTGGTTATCGCAAAAGAGAAAAATCCATCTGAAAGCGATTGCCTTGAATGGGTTCTATTAAC
AACAGAAGCAACCACAACATTAGAGTGTGCTCGAAAAATAACACGATATTATGAGATGAGATGGCGCATTGAGGATTTTCATAAAGCATGGAAATCGGGG
GTTGGTGCTGAAGAGCAGCGTATGCAATCAATTGAAAATTTAGAGAAAATGATTGTCATTCTCTCATTTGTAGCAATTCGATTACTCCAATTAAAAGAGT
ATTTTGAATACCCGACTACTCTAGTTATCAATGATAGCAGTACTTCTTGCGATGAATTGCTCACTGATGCTAAATGGAAAGTCTTATGGAATAGTGTTGA
AAGAAAATCATTACCTGAAAAAATACCTACTGCTGCTTGGGCTTATAAAGCAATAGCCAAGTTGGGCGGTTGGACTGATTCCAAAAGAACTGGGAAAGCA
GCTTGGTCTACTATCTGGAAAGGATGGTTCCGATTACAGGAACGAGTAGAAGGGCTTAGGATAGCTAACGAGTTGATGGAGATGTGATCAAGAGACAG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1416 bp | 472 aa | 69 | 1484 | + | No |
Chemistry : DDE
ORF sequence :
MDLAIEDSAAWSEAIFGSVDLGDKRLTHRLAQIGKQLTSLPGGSLPESCEGQDALIEGSYRFLRNKRVTASQIAEGGYQVTSWLSQRIPTLLAIEDTTTL
SYTHQIKESLGDLGGPKEKSNRGFHVHTTMLMDAEQEKTIGLIAQERWCRDIKERGKKNHRRVRLYTEKESYKWEKNTRELERRLGSKMSDVISVCDREA
DIFEYIQYKLDNHQRFIVRASHNRKLEGSYGYLFQMLPSATILGTYTIAIAQKANRKKRQAALELKTASVTFSLPERRAKARELKPITLNVVIAKEKNPS
ESDCLEWVLLTTEATTTLECARKITRYYEMRWRIEDFHKAWKSGVGAEEQRMQSIENLEKMIVILSFVAIRLLQLKEYFEYPTTLVINDSSTSCDELLTD
AKWKVLWNSVERKSLPEKIPTAAWAYKAIAKLGGWTDSKRTGKAAWSTIWKGWFRLQERVEGLRIANELMEM
SYTHQIKESLGDLGGPKEKSNRGFHVHTTMLMDAEQEKTIGLIAQERWCRDIKERGKKNHRRVRLYTEKESYKWEKNTRELERRLGSKMSDVISVCDREA
DIFEYIQYKLDNHQRFIVRASHNRKLEGSYGYLFQMLPSATILGTYTIAIAQKANRKKRQAALELKTASVTFSLPERRAKARELKPITLNVVIAKEKNPS
ESDCLEWVLLTTEATTTLECARKITRYYEMRWRIEDFHKAWKSGVGAEEQRMQSIENLEKMIVILSFVAIRLLQLKEYFEYPTTLVINDSSTSCDELLTD
AKWKVLWNSVERKSLPEKIPTAAWAYKAIAKLGGWTDSKRTGKAAWSTIWKGWFRLQERVEGLRIANELMEM
Blast result :
Comments
ISLpn4 is 59% aa similar to ISPlu9.
ISLpn4 was found by screening completely sequenced genomes for sequences homologous to the IS50R transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-100-D(N3)-137-E(C1). The copy number in 'Legionella pneumophila subsp.pneumophila str. Philadelphia 1' is 1.
ISLpn4 was found by screening completely sequenced genomes for sequences homologous to the IS50R transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-100-D(N3)-137-E(C1). The copy number in 'Legionella pneumophila subsp.pneumophila str. Philadelphia 1' is 1.
References
1] Chien,M., Morozova,I., Shi,S., Sheng,H., Chen,J., Gomez,S.M., Asamani,G., Hill,K., Nuara,J., Feder,M., Rineer,J., Greenberg,J.J., Steshenko,V., Park,S.H., Zhao,B., Teplitskaya,E., Edwards,J.R., Pampou,S., Georghiou,A., Chou,I.-C., Iannuccilli,W., Ulz,M.E., Kim,D.H., Geringer-Sameth,A., Goldsberry,C., Morozov,P., Fischer,S.G., Segal,G., Qu,X., Rzhetsky,A., Zhang,P., Cayanis,E., De Jong,P.J., Ju,J., Kalachikov,S., Shuman,H.A. and Russo,J.J.(2004)Science 305(5692), 1966-1968.
2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18
2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18