ISPsy20
- Family IS21
- Group
- Isoform
- Synonym(s) IS53
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_005773 | ND | Pseudomonas syringae | Pseudomonas syringae pv. phaseolicola 1448A; Pseudomonas syringae subsp. savastanoi PB213 plasmid pIAA2 |
DNA section
IS Length : 2572 bp
Ends
IR Length : 23/27
IRL : TGTTGCCGCTGACCGAAAACTGACCCAGTAGAGGCTGTTCTGCCGACTGA
IRR : TGTTGGCGCGGATTGAAAACTGACCCACCCTGCCGATTGAAAATTGACCC
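The 23/27 figure can be checked from the two end sequences listed above. The following is a minimal Python sketch, assuming that "23/27" means 23 identical positions within the first 27 bp of each end. `RIGHT_END` is the last 50 bp of the DNA sequence in this entry; the function names are illustrative and are not part of any database tooling.

```python
# Terminal sequences as listed in the "Ends" block of this entry.
IRL = "TGTTGCCGCTGACCGAAAACTGACCCAGTAGAGGCTGTTCTGCCGACTGA"
IRR = "TGTTGGCGCGGATTGAAAACTGACCCACCCTGCCGATTGAAAATTGACCC"

# Last 50 bp of the 2572-bp DNA sequence, top strand.
RIGHT_END = "GGGTCAATTTTCAATCGGCAGGGTGGGTCAGTTTTCAATCCGCGCCAACA"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def count_identities(left, right, window):
    """Count identical positions in the first `window` bases of two strings."""
    return sum(a == b for a, b in zip(left[:window], right[:window]))


# IRR is written as the reverse complement of the right end,
# so both repeats read from the outer tip inward.
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR

matches = count_identities(IRL, IRR, 27)
print(f"IRR equals revcomp of right end: {irr_matches_right_end}")
print(f"IR identity: {matches}/27")
```

Under these assumptions the sketch reports 23 identical positions out of 27, in agreement with the listed IR length.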
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCGCCGACCTGATTCATGTG | | ATACAGCTTGTCTTTGGCTTTTG | 0 |
DNA sequence
TGTTGCCGCTGACCGAAAACTGACCCAGTAGAGGCTGTTCTGCCGACTGAAAACTGACCCAGGTGTTCAACTGCTTCTGCTCACTTTTTGAGCAGGAGAA
CACAGGGTGATCAGCATGGAAATGTTGGGAAAAATCCGGCGGATGTACTTCCGCGACAAGCTCTCGCTGCATCAGATAGCCAAGCGTACCGGGCTGTCGA
GAAACACCATCCGAAAATGGGTCAGAGCGCCCGAAGCCAATCAACCAGCGTACCAACGGTGCGCATCCTTCAATAAGCTCAACCCTTTTCACGAGACCTT
AGAGCAGGCGCTCAAGGCTGATTCGTTCCGGCCAAAACACAACCGTAGGAGCGCCAAGGCACTTTTTGAGCAGATCAAGGCCGAGGGTTATGACGGCGGT
TACAGCCAGCTCACGGCGTTTGTCCGCTCTTGGCGGTGCGAGCAAGGCAAGTCTGTACGCGCTTTTGTACCGCTGACTTTTGCCCTCGGTGAGGCCTTTC
AGTTTGACTGGAGCGAAGAAGGCCTGCTGATCGGTGGCCTGTTTCGACGTATCCAGGTCTCCCATATGAAGCTGTGCGCAAGTCGCGCGTTCTGGTTGGT
TGCGTACCCCAGTCAAGGCCACGAGATGCTGTTTGATGCCCATACACGTTCGTTTGGAGCGTTGGGTGGCGTGCCACGTCGCGGCATCTACGACAACATG
AAAACGGCTGTCGACAAGGTTAACAAAGGCAAAGGCCGCACGGTCAATGCCCGGTTTTCCGTGATGTGCGCGCACTATCTGTTCGATCCGGACTTCTGCA
ACGTGGCCTCTGGCTGGGAGAAAGGCATCGTCGAAAAGAACGTGCAAGACAGCCGGAGGCGAATCTGGCTCGATGCTCAAAATTGCATGTTCCATACCTT
CGAGGAGCTGAACGTCTGGCTCGGCCAACGCTGTCGTACGCTCTGGGCTGAACTTGTACATCCGCAGTACAACGGTTTGACGGTGGCAGAAGTCCTGGAG
CTGGAGCAGGCTGAAATGATGCCGATGCCGACGGCCTTTGATGGCTACGTGGAACGCACCGTCCGTGTCTCTAGCACCTGCCTGATCAGCGTGGCGCGCA
ATCGTTATTCGGTGCCTTGCGAGCGTGTGGGTCAATGGGTCAGCAGCCGTTTGTACCCTTCGCGAATCGTGGTCATTGCCGACGAAACGGTGATCGCCAG
TCACGAGCGCCTCTTTGATCGAGATCAGGTCGGTTTTGACTGGCAGCACTACATCCCACTCATCGAACGCAAGCCTGGTGCACTGCGCAATGGCGCTCCA
TTTGCTGATCTGCCAAAACCGTTACAGCTTCTTAAACGCGGACTGAGGCGTCACACCAACGGTGATCGAATCATGATGCAGGTACTGGCTGCTGTGCCCA
TCGCCGGTCTCGAACCAGTGCTGGTGGCGGTGGAGCTGGTACTTGAATCGGGAAGTCTGAGCGCCGATCACATCCTCAATGTCGTTGCCCGCCTGACCTC
CACCGCACCACCTCCCTGTGTAGAAACCAGCCTGCAACTCAAAGTGGCGCCGGTTGCCAATACAGCACGCTACGACCGGCTCCGTACGACCGATGAGGAG
AATCGCAATGCGTGACCTAATGACTGAACTCAAAGAACTGCGCCTGCACGGCATGGCCAGTGCTTGGGAAGAACTGGTCTCTCAAGGAACAGCGTCGACG
GCTTCATCGAAATGGCTGCTGGAACATCTGCTTCAGCAGGAACATGCGGATCGCGCGGTGCGCTCGGTGAATCATCAGATGAACATGGCAAAACTCCCCA
TGCACCGTGACTTAGCTGGCTTTGACTTCAGCGCTTCCAGCGCTGATGCCCGGCTGGTCAGCGATCTATCCAGCTTGGCGTTCACTGAAACCGCGCAGAA
TGTCGTGTTCATCGGCGGCCCTGGCACCGGGAAAACACACCTGGCCAGTGCCTTGGCTGTATCCGGAATCACGGCCTACAACAAACGCGTGCGCTTCTTT
TCCACGGTGGATCTGGTGAATCTGCTGGAGCGTGAGAAGTACGATGGCAAGGCAGGGCGAATCGCCCAGGCACTGCTGCGCACAGACCTGGTGATACTGA
ATGAACTCGGATATTTGCCCTTCAGCCAAAGCGGTGGCGCCCTGTTATTTCACCTGCTGTCCAAACTGTACGAACACACCAGCGTGGTCATCACCACCAA
CCTGAGCTTCTCGGAATGGTCGAGTGTGTTTGGCGACGCCAAGATGACCACCGCGTTGCTGGATCGGCTGACACACCACTGCCACATCGTCGAAACGGGC
AACGAGTCCTACCGCCTACAACACAGCACATTGGCTGCCCAGACAAAGATCAAAACACGCGAACGAAAGCGAAAGGACGGAAATGATATCGAGGACGATG
AGCCATTTTGATCCGCACCATAAATGCCCTGCTGCATGCGGTGGGGCTTGATGCGTCTTGAATCGGCACACTGCTGCTAAGCTTATTCACAATTGCTGGG
GCAAAAATCAGCAATCCATCCTGGGTCAATTTTCAATCGGCAGGGTGGGTCAGTTTTCAATCCGCGCCAACA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1491 | 496 | 125 | 1615 | + | No |
Chemistry : DDE
ORF sequence :
MGKIRRMYFRDKLSLHQIAKRTGLSRNTIRKWVRAPEANQPAYQRCASFNKLNPFHETLEQALKADSFRPKHNRRSAKALFEQIKAEGYDGGYSQLTAFV
RSWRCEQGKSVRAFVPLTFALGEAFQFDWSEEGLLIGGLFRRIQVSHMKLCASRAFWLVAYPSQGHEMLFDAHTRSFGALGGVPRRGIYDNMKTAVDKVN
KGKGRTVNARFSVMCAHYLFDPDFCNVASGWEKGIVEKNVQDSRRRIWLDAQNCMFHTFEELNVWLGQRCRTLWAELVHPQYNGLTVAEVLELEQAEMMP
MPTAFDGYVERTVRVSSTCLISVARNRYSVPCERVGQWVSSRLYPSRIVVIADETVIASHERLFDRDQVGFDWQHYIPLIERKPGALRNGAPFADLPKPL
QLLKRGLRRHTNGDRIMMQVLAAVPIAGLEPVLVAVELVLESGSLSADHILNVVARLTSTAPPPCVETSLQLKVAPVANTARYDRLRTTDEENRNA
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
804 | 267 | 1608 | 2411 | + | No |
AG : IS21 helper
ORF sequence :
MRDLMTELKELRLHGMASAWEELVSQGTASTASSKWLLEHLLQQEHADRAVRSVNHQMNMAKLPMHRDLAGFDFSASSADARLVSDLSSLAFTETAQNVV
FIGGPGTGKTHLASALAVSGITAYNKRVRFFSTVDLVNLLEREKYDGKAGRIAQALLRTDLVILNELGYLPFSQSGGALLFHLLSKLYEHTSVVITTNLS
FSEWSSVFGDAKMTTALLDRLTHHCHIVETGNESYRLQHSTLAAQTKIKTRERKRKDGNDIEDDEPF
Blast result :
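The ORF coordinates and lengths given above can be cross-checked with simple arithmetic. The sketch below assumes that Begin and End are 1-based inclusive positions and that the End coordinate includes the stop codon; neither convention is stated in the entry, but both are consistent with the listed lengths. All names are illustrative.

```python
# (begin, end, listed protein length in aa), taken from the ORF tables above.
ORFS = {
    "ORF 1": (125, 1615, 496),
    "ORF 2": (1608, 2411, 267),
}


def orf_length_bp(begin, end):
    """Length of a 1-based inclusive interval."""
    return end - begin + 1


for name, (begin, end, listed_aa) in ORFS.items():
    bp = orf_length_bp(begin, end)
    codons, remainder = divmod(bp, 3)
    # One codon is assumed to be the stop codon.
    print(f"{name}: {bp} bp, {codons} codons, remainder {remainder}, "
          f"{codons - 1} aa (listed: {listed_aa})")

# Overlap between the end of ORF 1 and the start of ORF 2.
overlap = ORFS["ORF 1"][1] - ORFS["ORF 2"][0] + 1

# Reading frame of ORF 2 relative to ORF 1 (0 would mean the same frame).
frame_offset = (ORFS["ORF 2"][0] - ORFS["ORF 1"][0]) % 3

print(f"Overlap: {overlap} bp, relative frame offset: {frame_offset}")
```

With these conventions both ORFs are whole numbers of codons, and the two ORFs overlap by 8 bp in different reading frames.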
Comments
ISPsy20 shows 90% amino acid similarity to IS53 (ORFA) and 73% to ISAav1 (ORFB).
September 27, 2012: the file for the mutated IS53 (accession number: M83932) was deleted and its information was added to this file.
Previous comments of IS53 file :
IS53 was found inserted into IS51, itself residing in the iaa-containing plasmid pIAA2. IS53 shares 87% homology with a 316-bp segment of pTOL (Kok et al., 1989) from Pseudomonas oleovorans. The IS53 IstA ORF could be extended up to 496 residues by introducing frameshifts (124-723, 725-1120, 1122-1376, 1375-1614). The "IstB" ORF would, however, require even more frameshifts. The origin and significance of these differences with regard to the other members of the IS21 family remain unknown. At least four 21-bp terminal repeats are present at the ends of IS53:
L1: GCTGACCGAAAACTGACCCAG (9-28)
L2: GCCGACTGAAAACTGACCCAG (42-62)
R1: GCTGATTGAAAACTGACCCAC (2528-2548) CS
R2: GCCGATTGAAAATTGACCCAG (2504-2524) CS
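To see how closely the four terminal repeats resemble one another, they can be compared position by position. This is a minimal Python sketch that uses the four 21-bp sequences exactly as listed; it assumes that "CS" marks repeats read from the complementary strand, so no further reverse-complementing is applied. The helper name `identity` is illustrative.

```python
from itertools import combinations

# The four 21-bp terminal repeats as listed in the comments above.
REPEATS = {
    "L1": "GCTGACCGAAAACTGACCCAG",
    "L2": "GCCGACTGAAAACTGACCCAG",
    "R1": "GCTGATTGAAAACTGACCCAC",
    "R2": "GCCGATTGAAAATTGACCCAG",
}


def identity(a, b):
    """Number of identical positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x == y for x, y in zip(a, b))


# Print the identity for every pair of repeats.
for (name_a, seq_a), (name_b, seq_b) in combinations(REPEATS.items(), 2):
    print(f"{name_a} vs {name_b}: {identity(seq_a, seq_b)}/{len(seq_a)}")
```

A plain identity count is used here rather than an alignment because the repeats are the same length and are already listed in register.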
References
[1] Joardar, V., Lindeberg, M., Jackson, R.W., Selengut, J., Dodson, R., Brinkac, L.M., Daugherty, S.C., Deboy, R., Durkin, A.S., Giglio, M.G., Madupu, R., Nelson, W.C., Rosovitz, M.J., Sullivan, S., Crabtree, J., Creasy, T., Davidsen, T., Haft, D.H., Zafar, N., Zhou, L., Halpin, R., Holley, T., Khouri, H., Feldblyum, T., White, O., Fraser, C.M., Chatterjee, A.K., Cartinhour, S., Schneider, D.J., Mansfield, J., Collmer, A., and Buell, C.R. (2005) J. Bacteriol. 187 (18), 6488-6498.
[2] Kok, M., Oldenhuis, R., van der Linden, M.P.G., Raatjes, P., Kingma, J., van Lelyveld, P.H., and Witholt, B. (1989) J. Biol. Chem. 264, 5435-5441.
[3] Soby, S., Kirkpatrick, B., and Kosuge, T. (1993) Plasmid 29, 135-141.