ISLpn3
- Family IS4
- Group IS50
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_002942 | ND | Legionella pneumophila | Legionella pneumophila subsp. pneumophila Philadelphia 1 |
DNA section
IS Length : 1497 bp
Ends
IR Length : 14/19
IRL : CTGAATCTTCTACACATCTTGTTTATTCCAAACAAACATGATCAAATCAC
IRR : CTGTCTCTTGATCACATCTCCATCAACTCGTTAGCTATCCTAAGCCCTTC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCGGGTCTTT | CCCAAATAC | GGGGGCATTT | 9 |
TTGGGTCTTT | NNNNNNNNN | GTAGAGCCTA | 9 |
DNA sequence
CTGAATCTTCTACACATCTTGTTTATTCCAAACAAACATGATCAAATCACCTCTTTTTGAGGTATGTATGGACTTAGCAGTTGAAGATACTACAGCATGG
TCGGAAGCTATTTTTGGTTCAGTTGCTTTAGGGGATAAACGACTTACTCGTCGGTTAATTCAAATAGGCAAACAATTATCATCGACGCCTGGTGGTTCTC
TTTCAGGAAGTTGTGGAGGGCAGGATGCGCTTATAGAAGGTAGTTATCGTTTTTTACGAAACAAACGAGTCACAGCGAATCAAATTGCAGAGGGTGGTTA
TCAAGTAACAAGCTGGTTATCTCAGTCAATCCCCACGCTTTTAGCCATTGAAGACACAACCACGCTCTCCTATACCCATCAAGTAAAAGAATCATTAGGT
GATTTAGGCGGTCCTAAAGAAAAAAGCAATCGAGGTTTTCATGTTCACACCACCATGCTCATGGATGCAGAACAAGAGAAAACAATAGGATTAATCGCGC
AAGAACGTTGGTGTCGAGATATCAAAGAAAGAGGCAAAAAAAATCATCGACGAGTAAGGTTATATACAGAGAAAGAAAGCTATAAATGGGAAAAGAATAC
TCGGGAATTAGAGAATCGATTGGGTTCTAAAATGTCTGATGTGATTTCTGTTTGCGATAGAGAAGCTGATATTTTTGAATACATTCAGTACAAGTTAGAC
CATGCTCAACGTTTTATTGTTCGAGCGAGCCATAATCGAAAACTAGAAGGAAGTAACTGTTATTTATTTCAAATGTTACCTTCAGCAGTAAACTTGGGTA
TTTATACAATTGAAGTGGCTCAAAAAGCAAATAGAAAGAAACGCCAAGTGACTCTTGAGTTAAAAACAACATCTGTTACTTTCTCACCTTCTGAACGAAG
AGCTAAAGCACGTGAATTAAAACCAATCACTTTAAACGTAGTAATCGCTAAAGAGAAAAATCCATCTGAAAGCGATTGCCTTGAATGGATTCTATTAACA
ACAGAAGCCACAACAACATTAGAGTGTACCCGAAAAATAACGCGATATTATGAAATGAGATGGCGCATTGAGGATTTTCATAAAGCATGGAAATCGGGGG
TTGGTGCTGAAGAGCAGCGTATGCAATCAATTGAAAATTTAGAGAAAATGATTGTCATTCTCTCATTTGTAGCAATTCGATTACTCCAATTAAAAGAGTA
TTTTGAATACCCGACTACTCTAGTTATCAATGATAGCAGTACTTCTTGCGATGAATTGCTCACTGATGCTGAATGGAAAGTCTTATGGAATAGTGTTGAA
AGAAAATCATTACCTGAAAAAATACCTACTGCTGCTTGGGCTTATAAAGCAATAGCCAAGTTGGGCGGTTGGACTGATTCCAAAAGAACTGGGAAAGCAG
CTTGGTCTACTATCTGGAAAGGATGGTTCCGATTACAGGAACGAGTAGAAGGGCTTAGGATAGCTAACGAGTTGATGGAGATGTGATCAAGAGACAG
TCGGAAGCTATTTTTGGTTCAGTTGCTTTAGGGGATAAACGACTTACTCGTCGGTTAATTCAAATAGGCAAACAATTATCATCGACGCCTGGTGGTTCTC
TTTCAGGAAGTTGTGGAGGGCAGGATGCGCTTATAGAAGGTAGTTATCGTTTTTTACGAAACAAACGAGTCACAGCGAATCAAATTGCAGAGGGTGGTTA
TCAAGTAACAAGCTGGTTATCTCAGTCAATCCCCACGCTTTTAGCCATTGAAGACACAACCACGCTCTCCTATACCCATCAAGTAAAAGAATCATTAGGT
GATTTAGGCGGTCCTAAAGAAAAAAGCAATCGAGGTTTTCATGTTCACACCACCATGCTCATGGATGCAGAACAAGAGAAAACAATAGGATTAATCGCGC
AAGAACGTTGGTGTCGAGATATCAAAGAAAGAGGCAAAAAAAATCATCGACGAGTAAGGTTATATACAGAGAAAGAAAGCTATAAATGGGAAAAGAATAC
TCGGGAATTAGAGAATCGATTGGGTTCTAAAATGTCTGATGTGATTTCTGTTTGCGATAGAGAAGCTGATATTTTTGAATACATTCAGTACAAGTTAGAC
CATGCTCAACGTTTTATTGTTCGAGCGAGCCATAATCGAAAACTAGAAGGAAGTAACTGTTATTTATTTCAAATGTTACCTTCAGCAGTAAACTTGGGTA
TTTATACAATTGAAGTGGCTCAAAAAGCAAATAGAAAGAAACGCCAAGTGACTCTTGAGTTAAAAACAACATCTGTTACTTTCTCACCTTCTGAACGAAG
AGCTAAAGCACGTGAATTAAAACCAATCACTTTAAACGTAGTAATCGCTAAAGAGAAAAATCCATCTGAAAGCGATTGCCTTGAATGGATTCTATTAACA
ACAGAAGCCACAACAACATTAGAGTGTACCCGAAAAATAACGCGATATTATGAAATGAGATGGCGCATTGAGGATTTTCATAAAGCATGGAAATCGGGGG
TTGGTGCTGAAGAGCAGCGTATGCAATCAATTGAAAATTTAGAGAAAATGATTGTCATTCTCTCATTTGTAGCAATTCGATTACTCCAATTAAAAGAGTA
TTTTGAATACCCGACTACTCTAGTTATCAATGATAGCAGTACTTCTTGCGATGAATTGCTCACTGATGCTGAATGGAAAGTCTTATGGAATAGTGTTGAA
AGAAAATCATTACCTGAAAAAATACCTACTGCTGCTTGGGCTTATAAAGCAATAGCCAAGTTGGGCGGTTGGACTGATTCCAAAAGAACTGGGAAAGCAG
CTTGGTCTACTATCTGGAAAGGATGGTTCCGATTACAGGAACGAGTAGAAGGGCTTAGGATAGCTAACGAGTTGATGGAGATGTGATCAAGAGACAG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1446 bp | 482 aa | 38 | 1483 | + | No |
Chemistry : DDE
ORF sequence :
MIKSPLFEVCMDLAVEDTTAWSEAIFGSVALGDKRLTRRLIQIGKQLSSTPGGSLSGSCGGQDALIEGSYRFLRNKRVTANQIAEGGYQVTSWLSQSIPT
LLAIEDTTTLSYTHQVKESLGDLGGPKEKSNRGFHVHTTMLMDAEQEKTIGLIAQERWCRDIKERGKKNHRRVRLYTEKESYKWEKNTRELENRLGSKMS
DVISVCDREADIFEYIQYKLDHAQRFIVRASHNRKLEGSNCYLFQMLPSAVNLGIYTIEVAQKANRKKRQVTLELKTTSVTFSPSERRAKARELKPITLN
VVIAKEKNPSESDCLEWILLTTEATTTLECTRKITRYYEMRWRIEDFHKAWKSGVGAEEQRMQSIENLEKMIVILSFVAIRLLQLKEYFEYPTTLVINDS
STSCDELLTDAEWKVLWNSVERKSLPEKIPTAAWAYKAIAKLGGWTDSKRTGKAAWSTIWKGWFRLQERVEGLRIANELMEM
LLAIEDTTTLSYTHQVKESLGDLGGPKEKSNRGFHVHTTMLMDAEQEKTIGLIAQERWCRDIKERGKKNHRRVRLYTEKESYKWEKNTRELENRLGSKMS
DVISVCDREADIFEYIQYKLDHAQRFIVRASHNRKLEGSNCYLFQMLPSAVNLGIYTIEVAQKANRKKRQVTLELKTTSVTFSPSERRAKARELKPITLN
VVIAKEKNPSESDCLEWILLTTEATTTLECTRKITRYYEMRWRIEDFHKAWKSGVGAEEQRMQSIENLEKMIVILSFVAIRLLQLKEYFEYPTTLVINDS
STSCDELLTDAEWKVLWNSVERKSLPEKIPTAAWAYKAIAKLGGWTDSKRTGKAAWSTIWKGWFRLQERVEGLRIANELMEM
Blast result :
Comments
ISLpn3 is 58% aa similar to IS50R.
ISLpn3 was found by screening completely sequenced genomes for sequences homologous to the IS50R transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-100-D(N3)-137-E(C1). The copy number in 'Legionella pneumophila subsp. pneumophila str. Philadelphia 1' is 2.
ISLpn3 was found by screening completely sequenced genomes for sequences homologous to the IS50R transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-100-D(N3)-137-E(C1). The copy number in 'Legionella pneumophila subsp. pneumophila str. Philadelphia 1' is 2.
References
1] Chien,M., Morozova,I., Shi,S., Sheng,H., Chen,J., Gomez,S.M., Asamani,G., Hill,K., Nuara,J., Feder,M., Rineer,J., Greenberg,J.J., Steshenko,V., Park,S.H., Zhao,B., Teplitskaya,E., Edwards,J.R., Pampou,S., Georghiou,A., Chou,I.-C., Iannuccilli,W., Ulz,M.E., Kim,D.H., Geringer-Sameth,A., Goldsberry,C., Morozov,P., Fischer,S.G., Segal,G., Qu,X., Rzhetsky,A., Zhang,P., Cayanis,E., De Jong,P.J., Ju,J., Kalachikov,S., Shuman,H.A. and Russo,J.J.(2004)Science 305(5692), 1966-1968.
2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18
2] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18