ISPen1
- Family IS4
- Group IS4
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_008027 | ND | Pseudomonas entomophila | Pseudomonas entomophila L48 |
DNA section
IS Length : 1514 bp
Ends
IR Length : 16
IRL : GAATGGCACTTACTTAGCAGCCAACCCTTTGATCTGTAGGAAAAAAATGA
IRR : GAATGGCACTTACTTAACATGCTTCGGATGCCCATGTTTCCGGATATCTG
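The IR length reported above can be checked against the two end sequences. The following is a minimal sketch, assuming that IRR is listed as the reverse complement of the element's right end (so that both ends read outward-in) and that the IR length counts the uninterrupted run of identical positions from the termini. The function names are illustrative and not part of the record.

```python
# Complement table for unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def shared_terminal_length(left_end: str, right_end: str) -> int:
    """Count leading positions at which the two end sequences are identical."""
    n = 0
    for a, b in zip(left_end.upper(), right_end.upper()):
        if a != b:
            break
        n += 1
    return n


IRL = "GAATGGCACTTACTTAGCAGCCAACCCTTTGATCTGTAGGAAAAAAATGA"
IRR = "GAATGGCACTTACTTAACATGCTTCGGATGCCCATGTTTCCGGATATCTG"

print(shared_terminal_length(IRL, IRR))  # 16
# The reverse complement of IRR should reproduce the last bases of the element.
print(reverse_complement(IRR)[-14:])     # AGTAAGTGCCATTC
```

The first value agrees with the stated IR length, and the second matches the final line of the DNA sequence.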
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGACAGATGT | | AAATCTGTCC | 0 |
DNA sequence
GAATGGCACTTACTTAGCAGCCAACCCTTTGATCTGTAGGAAAAAAATGAAGAAGGGCTGGCCTTGGTCTACGGTGATAGTTGCGAACACACACCATTAA
GCCAAGGACGCAGCCCAGAATGGATCGTAGACGTCAAATCCTGCAGCATCAACAGCAGCGCTTCCGCCACCATGCTTCGACCAGTGACGCTCAGGCATTT
TTCGATCTCCTGATGGCCCCCGAACTGCTGAAGCGTGTCGAGTCATTGTTGCCAGCACATCGAGAGCGCCTATTCCCACCGACTGAAACGCTCTCGATGT
TCCTGGCTCAAGCAATGAATGCTGATCGCTCCTGCCAGCAGGCGGTCAATGACTTTTCCATCAAGCGATCCTGCAACGGTATGAAGCCCAACAGCACTCG
TACAGGCGCCTTCTGCAGAGCCAGGCAGCGCCTACCGGTGGAGATGGTCTCAACCCTGGTTCGTCACACGGCTTCATCGATCAGTGATCAAGCCCCCTCG
TCGTGGCGTTGGAGGGGACGGCCGGTGCGTCTTGTTGATGGAACAACCGTTTCGATGCCCGACACGGCTGCCAATCAGGTTGCATATCCACAATCACGGG
GGCAGAAGGTCGGCCTGGGTTTCCCACTCTGCCGAATGGTTGGCATTGTCTGTCTATGTAGCGGTGCCGTTCTGGACGCTGCTTTGGGCCGCTTCAGAGG
TAAAGGTGGTGATGAACAGACCCTGCTCAGGTCAATGCTCAATGTGCTGAAAACAGGCGATATTCTGTTGGGTGATGCGTATTACGCTACCTACTTTCTG
TTGTGTGAGCTGCAGCGAAGGGGCGTCGATGGGGTGTTTGAGCAGTACGGAGCGCGACGGCGCAGTACGGATTTTGCTTTAGGGCAACGTCTCGGACCGG
AAGACCATTTGATCGAGCTGAAAAAGCCTGGATGCAGGCCGCCTTGGATGAGCATGGCGCAGTATGAACAGGCGCCGGAGAGGCTGACGGTGAGAGAGCT
GAAGGCCGGTGGCAAGATCCTGGTGACAACGTTGCAATGCCCAAAGCAAACACCCAAATCAGCCCTCAGGTCGCTCTACAAAGGGCGCTGGCATGTCGAA
CTGGATTTACGAAACCTGAAAGCAACGTTGGGACTGGGAAAACTGAGTTGCAAAACACCAGGGATGGCCGTCAAGGAGCTGTGGGTCTATCTGCTGGCAC
ACAACCTGATCCGAATGCTCATGTCACAATCGGCGTTAATGGCCGATTGCTTGCCCCGCGAGCTGAGTTTCAAGCACAGCCTCCAGCTATGGCTGGCGCT
ACGACAGCGCAGCTACGGCGAGGGGGAGGATCGGTTGGCAAGTTTGCTGATGCTGATTGCTCAAAGGCGTGTAGGCAACCGGCCTGGTCGTGTCGAGCCT
CGGGCGATAAAGCGAAGGCCTCAGGCCTACCCCTTGCTGACCAAACCACGTCGATCAGCAAGGGCAGATATCCGGAAACATGGGCATCCGAAGCATGTTA
AGTAAGTGCCATTC
Protein section
ORF number : 1
ORF 1
ORF length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1383 bp | 461 aa | 120 | 1502 | + | No |
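The ORF coordinates and lengths in the table are mutually consistent, which can be verified with simple arithmetic. This is a small sketch, assuming the Begin and End values are 1-based and inclusive; the helper name is hypothetical.

```python
def orf_lengths(begin: int, end: int) -> tuple[int, int]:
    """Return (nucleotides, codons) for 1-based inclusive ORF coordinates."""
    nt = end - begin + 1
    if nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return nt, nt // 3


nt, codons = orf_lengths(120, 1502)
print(nt, codons)  # 1383 461
```

Since 1383 / 3 equals the 461 aa listed, the interval appears to cover the coding codons only, without the stop codon.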
Chemistry : DDE
ORF sequence :
MDRRRQILQHQQQRFRHHASTSDAQAFFDLLMAPELLKRVESLLPAHRERLFPPTETLSMFLAQAMNADRSCQQAVNDFSIKRSCNGMKPNSTRTGAFCR
ARQRLPVEMVSTLVRHTASSISDQAPSSWRWRGRPVRLVDGTTVSMPDTAANQVAYPQSRGQKVGLGFPLCRMVGIVCLCSGAVLDAALGRFRGKGGDEQ
TLLRSMLNVLKTGDILLGDAYYATYFLLCELQRRGVDGVFEQYGARRRSTDFALGQRLGPEDHLIELKKPGCRPPWMSMAQYEQAPERLTVRELKAGGKI
LVTTLQCPKQTPKSALRSLYKGRWHVELDLRNLKATLGLGKLSCKTPGMAVKELWVYLLAHNLIRMLMSQSALMADCLPRELSFKHSLQLWLALRQRSYG
EGEDRLASLLMLIAQRRVGNRPGRVEPRAIKRRPQAYPLLTKPRRSARADIRKHGHPKHVK
Blast result :
Comments
ISPen1 shares 67% amino acid similarity with ISPosp1. It was found by screening completely sequenced genomes with BLASTP for sequences homologous to the ISRso13 transposase. Multiple sequence alignments revealed a conserved DDE motif: D(N2)-78-D(N3)-107-E(C1). A single copy is present on the Pseudomonas entomophila L48 chromosome.
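The spacing in the DDE motif can be turned into a simple positional search. The sketch below is illustrative only: it assumes that the numbers 78 and 107 are counts of residues lying between the catalytic residues, which the record does not state explicitly, and it ignores the N2, N3 and C1 region labels. The function name and the toy sequence are made up for the example. A scan like this only proposes candidates; the motif in the record was identified from multiple sequence alignments.

```python
def find_dde_candidates(protein: str, gap1: int = 78, gap2: int = 107):
    """Return 0-based (D, D, E) index triples separated by gap1 and gap2 residues.

    gap1 and gap2 are treated as the number of intervening residues
    (an assumption about the motif notation).
    """
    hits = []
    step1 = gap1 + 1  # offset from the first D to the second D
    step2 = gap2 + 1  # offset from the second D to the E
    for i, residue in enumerate(protein):
        j = i + step1
        k = j + step2
        if k >= len(protein):
            break  # no room left for a full triad
        if residue == "D" and protein[j] == "D" and protein[k] == "E":
            hits.append((i, j, k))
    return hits


# Toy sequence with exactly one triad at the assumed spacing.
toy = "M" + "D" + "A" * 78 + "D" + "A" * 107 + "E" + "K"
print(find_dde_candidates(toy))  # [(1, 80, 188)]
```

Applied to the ORF sequence above, the same function would list every D/D/E triple at that spacing, which would then need to be checked against an alignment.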
References
[1] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol, 8(1):18
[2] Vodovar N, Vallenet D, Cruveiller S, Rouy Z, Barbe V, Acosta C, Cattolico L, Jubin C, Lajus A, Segurens B, Vacherie B, Wincker P, Weissenbach J, Lemaitre B, Medigue C, Boccard F (2006) Unpublished