ISPpa5
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AY225410 | Y | Paracoccus pantotrophus | Paracoccus pantotrophus DSM 11072 Paracoccus bengalensis DSM 17099 |
DNA section
IS Length : 2829 bp
Ends
IR Length : 20/22
IRL : GTAACGGCCCGATTGTTACCGCCTGCCTGAGTTTGGGCTGATGGATTAAG
IRR : GTAACCGGCCGATTGTTACCGCGGCGACTGAGCCTTGATGCGGGCGACGA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AAGCTCAGGT | GAGAACATCC | CTGCCTGAAC | 10 |
GGCATCAATG | CATTAAAT | GCTTATAACG | 8 |
DNA sequence
GTAACGGCCCGATTGTTACCGCCTGCCTGAGTTTGGGCTGATGGATTAAGGCACGTGCATAGTGACGTCGTTAATGACGTGTCTTATGCGGGTGTGGGGA
TGCGGGGCGATATTCTTGGACTGGAGCGGCGGCGGCGGTGGAGCGACGAGGAGAAGCTCGAGATCGTTTTGTCGGTCGGCGTGGACGGGGCGACGGTGAC
GCAGGTTGCACAGCGGCACGATGTGACGCGGCAGCAAATCTATGCCTGGCGGCACCAACTGAAGAAGAAGGGCGTGGTTTCGGCGTCTCCGGAGACGCTG
TTTCTGCCGGTTGGATTGGACCGGCCCACGGAATTGGTGATGCAGACAGCGGCCATCTGCACAGGAGCGCCTCGGCCTGCCCAGATCGAGTTGCAGTTGG
CGAATGGTCGCTGCCTGCGCTTCGATCCGGCTCTGGATACCGTGACGCTGACACATGTGATCCGTGCGGTGGAAGGGGCATGATCGGGCCGGGAACCGGC
GTTCGTGTCTATCTGGCCTGCGGCGTGACCGACATGAGGAAGGGAATCTCCGGGCTGGCTGCACTTGCCCAGGATGTTCTGCGCCAGAAGCCGGCCGGCG
GTGCGGTTTTTGCATTCCGCGGGCGTCGCGGCGACAGGTTGAAATTGCTTCATTGGGATGGTCAGGGCTTCTGCCTGTATTACAAGGTGCTGGAGAAGGG
GCGCTTTCCCTGGCCGATGCGCGGGGATGGTGCGGTTCGCCTGACCTCGGCGCAGCTGGCCATGCTGTGGGAAGGCATTGACTGGCGCAGGCCGGATTGG
GGCGCGCCACCGGCCCGGGTGGGCTGATTTACCCTGCTAAAACGCCTTGTTTTTATGGCAATTTTGCCTGCCTTGTGGTAGGTGGGCGCATGTCGAATGC
GGCCCAGACCCTTCCTGATGATCCGGCGCTTCTGAAGTCGCTGATCGCGGCGTTGCAGGCGGAAAACGCGAAGATCTCGGCCACGTTGCGCGCCCATGAC
CAGTTGATCCAGTCCCTGCGGCTGCGCATTGCCAGGCTGAAGAAGCAGGTCTTCGGCCAGTCCTCGGAAAAGATCGAGCGCGAGATCGAACAGCTGGAGC
TGGCGCTCGAGGATCTCATGATCGCAGCGGCCGAAGGCCAGCCTGACGTCGTGAACGACGGCCAAAACGATGATGGGTTGGAAACGGCACCTGCCGATGA
AGCTGCCGAGCGCCCATCCCGCCGACGTCCCCGGGTCTCGGACAGCACGCCGCGCGAGCGGCGGGAGCTCGACCCCGGCAGTTGCTGCCCCGATTGCGGC
GGCGATTTGCGCCTGGTGGGCGAAGATGTCAGCGAGATGCTGGATCTGATCGCGGCACAATTGAAGGTTGTCCAGATCGCCCGCCTGAAAAAGTCCTGCC
GCCGCTGTGAACGCATGGTGCAGATGCCGGCCCCCAGCCGGCCGATCCCCGGCAGCATGGCCGGTGCGAACCTGCTCGCCCACATCCTCGTCTCCAAGTT
TGACGACCACCTTCCGCTTTATCGCCAGCATGAGATCTTCACCCGGATGGGCGCCGACATCCCGGACAGTACCTTGGTCGACTGGTGCGGTCGCGCCATG
AAGGTGCTGGCACCGCTGATCGAGCGGATCGAGGTGGATGTGATGGCCAGCGACCTCCTTCATGCCGATGACACACCGATCCGGGTGCTGGATCGCGCGG
GCCGCGACAAGGGGGTGGGCAAGGGTGTGAAGAAGGGCCGGATCTGGGCCTATGTCCGCGATCAGCGCCCTTGGGCAGGCGCCTCACCGCCCGGCGCGGT
CTATGCCTTCGCGCCCGACTGGAAGGAAGAGCACGTCCACGGCCATCTCGCCAACACCCGCGGCATTCTCCAGGCCGATGGCTACAAGGGTTATGCCAAG
CTCTATGAACCGGAACCCGACGGGAAGCCCCGCCTTCGGGAGGCCGCGTGTTGGGCTCACCTGAGGCGTGACTTCCATGATGAATGGACCAAGACCAAAT
CAACGATCGCCCGCGAGGCACTCGACCGTATCGGCGCGCTCTATGACATCGAGCGGGAGATCACCGGCCATCCCGCCGATATCCGCCTTGCCGCGCGCCG
GAAACACAGCGTTCCGAAGGTCGAGGTCTTCTTCGCCTGGTCAGAGCAGCAGCTCTCGCGGATCCCCGGCAAGGGCGATCTGGCCAAGGCCTTTCGTTAC
GGGCTGAGCCGCCGGGACGCTTTCAGCCTGTTCCTTGAGGACGGCCGGGTGGCGATCGACAACAATCCTGCCGAGCGCGCGCTGCGCCCGATCGGTGTCG
GGCGCCGCAACTGGCTCTTCGCCGGTGCGGATACCGGAGCCGAAACGCTGGCCCGCGCCATGACCATCGTTGAAACCGCCAAGATGAACGGCCTGGACCC
CCAAGCCTATCTGGCCGACATCCTGGCCCGCATCCATGATCACAAGATCAATCGGCTCGACGATCTGTTGCCCTGGAACTGGTCGCCGCTGCCCTCCGCC
CTGCACGAGGCCGCCTGATGGCAACCGTCACCCACGTCTGCACCCTCGACTACATCGCAAAGATGCTGGGCGAAGACCCCGAGCTTCTCAAAGCCATCGT
CTACAACGACGACAACCTGACCTATGGCTCGATCATCAGCGTCTATACCGGCCCGGATGACACCGTCACCGCCCTGACCGATGACGGTATCGATGAACTG
AAAGACATGCTCAGGGACGCGCGCATCACCACCGAAACCTGGCACGCGTTCCTCGACGACTTTGTCGATGACGCGGAACTCGTCGCCCGCATCAAGGCTC
AGTCGCCGCGGTAACAATCGGCCGGTTAC
TGCGGGGCGATATTCTTGGACTGGAGCGGCGGCGGCGGTGGAGCGACGAGGAGAAGCTCGAGATCGTTTTGTCGGTCGGCGTGGACGGGGCGACGGTGAC
GCAGGTTGCACAGCGGCACGATGTGACGCGGCAGCAAATCTATGCCTGGCGGCACCAACTGAAGAAGAAGGGCGTGGTTTCGGCGTCTCCGGAGACGCTG
TTTCTGCCGGTTGGATTGGACCGGCCCACGGAATTGGTGATGCAGACAGCGGCCATCTGCACAGGAGCGCCTCGGCCTGCCCAGATCGAGTTGCAGTTGG
CGAATGGTCGCTGCCTGCGCTTCGATCCGGCTCTGGATACCGTGACGCTGACACATGTGATCCGTGCGGTGGAAGGGGCATGATCGGGCCGGGAACCGGC
GTTCGTGTCTATCTGGCCTGCGGCGTGACCGACATGAGGAAGGGAATCTCCGGGCTGGCTGCACTTGCCCAGGATGTTCTGCGCCAGAAGCCGGCCGGCG
GTGCGGTTTTTGCATTCCGCGGGCGTCGCGGCGACAGGTTGAAATTGCTTCATTGGGATGGTCAGGGCTTCTGCCTGTATTACAAGGTGCTGGAGAAGGG
GCGCTTTCCCTGGCCGATGCGCGGGGATGGTGCGGTTCGCCTGACCTCGGCGCAGCTGGCCATGCTGTGGGAAGGCATTGACTGGCGCAGGCCGGATTGG
GGCGCGCCACCGGCCCGGGTGGGCTGATTTACCCTGCTAAAACGCCTTGTTTTTATGGCAATTTTGCCTGCCTTGTGGTAGGTGGGCGCATGTCGAATGC
GGCCCAGACCCTTCCTGATGATCCGGCGCTTCTGAAGTCGCTGATCGCGGCGTTGCAGGCGGAAAACGCGAAGATCTCGGCCACGTTGCGCGCCCATGAC
CAGTTGATCCAGTCCCTGCGGCTGCGCATTGCCAGGCTGAAGAAGCAGGTCTTCGGCCAGTCCTCGGAAAAGATCGAGCGCGAGATCGAACAGCTGGAGC
TGGCGCTCGAGGATCTCATGATCGCAGCGGCCGAAGGCCAGCCTGACGTCGTGAACGACGGCCAAAACGATGATGGGTTGGAAACGGCACCTGCCGATGA
AGCTGCCGAGCGCCCATCCCGCCGACGTCCCCGGGTCTCGGACAGCACGCCGCGCGAGCGGCGGGAGCTCGACCCCGGCAGTTGCTGCCCCGATTGCGGC
GGCGATTTGCGCCTGGTGGGCGAAGATGTCAGCGAGATGCTGGATCTGATCGCGGCACAATTGAAGGTTGTCCAGATCGCCCGCCTGAAAAAGTCCTGCC
GCCGCTGTGAACGCATGGTGCAGATGCCGGCCCCCAGCCGGCCGATCCCCGGCAGCATGGCCGGTGCGAACCTGCTCGCCCACATCCTCGTCTCCAAGTT
TGACGACCACCTTCCGCTTTATCGCCAGCATGAGATCTTCACCCGGATGGGCGCCGACATCCCGGACAGTACCTTGGTCGACTGGTGCGGTCGCGCCATG
AAGGTGCTGGCACCGCTGATCGAGCGGATCGAGGTGGATGTGATGGCCAGCGACCTCCTTCATGCCGATGACACACCGATCCGGGTGCTGGATCGCGCGG
GCCGCGACAAGGGGGTGGGCAAGGGTGTGAAGAAGGGCCGGATCTGGGCCTATGTCCGCGATCAGCGCCCTTGGGCAGGCGCCTCACCGCCCGGCGCGGT
CTATGCCTTCGCGCCCGACTGGAAGGAAGAGCACGTCCACGGCCATCTCGCCAACACCCGCGGCATTCTCCAGGCCGATGGCTACAAGGGTTATGCCAAG
CTCTATGAACCGGAACCCGACGGGAAGCCCCGCCTTCGGGAGGCCGCGTGTTGGGCTCACCTGAGGCGTGACTTCCATGATGAATGGACCAAGACCAAAT
CAACGATCGCCCGCGAGGCACTCGACCGTATCGGCGCGCTCTATGACATCGAGCGGGAGATCACCGGCCATCCCGCCGATATCCGCCTTGCCGCGCGCCG
GAAACACAGCGTTCCGAAGGTCGAGGTCTTCTTCGCCTGGTCAGAGCAGCAGCTCTCGCGGATCCCCGGCAAGGGCGATCTGGCCAAGGCCTTTCGTTAC
GGGCTGAGCCGCCGGGACGCTTTCAGCCTGTTCCTTGAGGACGGCCGGGTGGCGATCGACAACAATCCTGCCGAGCGCGCGCTGCGCCCGATCGGTGTCG
GGCGCCGCAACTGGCTCTTCGCCGGTGCGGATACCGGAGCCGAAACGCTGGCCCGCGCCATGACCATCGTTGAAACCGCCAAGATGAACGGCCTGGACCC
CCAAGCCTATCTGGCCGACATCCTGGCCCGCATCCATGATCACAAGATCAATCGGCTCGACGATCTGTTGCCCTGGAACTGGTCGCCGCTGCCCTCCGCC
CTGCACGAGGCCGCCTGATGGCAACCGTCACCCACGTCTGCACCCTCGACTACATCGCAAAGATGCTGGGCGAAGACCCCGAGCTTCTCAAAGCCATCGT
CTACAACGACGACAACCTGACCTATGGCTCGATCATCAGCGTCTATACCGGCCCGGATGACACCGTCACCGCCCTGACCGATGACGGTATCGATGAACTG
AAAGACATGCTCAGGGACGCGCGCATCACCACCGAAACCTGGCACGCGTTCCTCGACGACTTTGTCGATGACGCGGAACTCGTCGCCCGCATCAAGGCTC
AGTCGCCGCGGTAACAATCGGCCGGTTAC
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
384 bp | 127 aa | 100 | 483 | + | No |
AG : IS66 TnpA
ORF sequence :
MRGDILGLERRRRWSDEEKLEIVLSVGVDGATVTQVAQRHDVTRQQIYAWRHQLKKKGVVSASPETLFLPVGLDRPTELVMQTAAICTGAPRPAQIELQL
ANGRCLRFDPALDTVTLTHVIRAVEGA
ANGRCLRFDPALDTVTLTHVIRAVEGA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
348 bp | 115 aa | 480 | 827 | + | No |
AG : IS66 TnpB
ORF sequence :
MIGPGTGVRVYLACGVTDMRKGISGLAALAQDVLRQKPAGGAVFAFRGRRGDRLKLLHWDGQGFCLYYKVLEKGRFPWPMRGDGAVRLTSAQLAMLWEGI
DWRRPDWGAPPARVG
DWRRPDWGAPPARVG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1629 bp | 542 aa | 890 | 2518 | + | No |
Chemistry : DDE
ORF sequence :
MSNAAQTLPDDPALLKSLIAALQAENAKISATLRAHDQLIQSLRLRIARLKKQVFGQSSEKIEREIEQLELALEDLMIAAAEGQPDVVNDGQNDDGLETA
PADEAAERPSRRRPRVSDSTPRERRELDPGSCCPDCGGDLRLVGEDVSEMLDLIAAQLKVVQIARLKKSCRRCERMVQMPAPSRPIPGSMAGANLLAHIL
VSKFDDHLPLYRQHEIFTRMGADIPDSTLVDWCGRAMKVLAPLIERIEVDVMASDLLHADDTPIRVLDRAGRDKGVGKGVKKGRIWAYVRDQRPWAGASP
PGAVYAFAPDWKEEHVHGHLANTRGILQADGYKGYAKLYEPEPDGKPRLREAACWAHLRRDFHDEWTKTKSTIAREALDRIGALYDIEREITGHPADIRL
AARRKHSVPKVEVFFAWSEQQLSRIPGKGDLAKAFRYGLSRRDAFSLFLEDGRVAIDNNPAERALRPIGVGRRNWLFAGADTGAETLARAMTIVETAKMN
GLDPQAYLADILARIHDHKINRLDDLLPWNWSPLPSALHEAA
PADEAAERPSRRRPRVSDSTPRERRELDPGSCCPDCGGDLRLVGEDVSEMLDLIAAQLKVVQIARLKKSCRRCERMVQMPAPSRPIPGSMAGANLLAHIL
VSKFDDHLPLYRQHEIFTRMGADIPDSTLVDWCGRAMKVLAPLIERIEVDVMASDLLHADDTPIRVLDRAGRDKGVGKGVKKGRIWAYVRDQRPWAGASP
PGAVYAFAPDWKEEHVHGHLANTRGILQADGYKGYAKLYEPEPDGKPRLREAACWAHLRRDFHDEWTKTKSTIAREALDRIGALYDIEREITGHPADIRL
AARRKHSVPKVEVFFAWSEQQLSRIPGKGDLAKAFRYGLSRRDAFSLFLEDGRVAIDNNPAERALRPIGVGRRNWLFAGADTGAETLARAMTIVETAKMN
GLDPQAYLADILARIHDHKINRLDDLLPWNWSPLPSALHEAA
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
297 bp | 98 aa | 2518 | 2814 | + | No |
Annotation : Description :
ORF sequence :
MATVTHVCTLDYIAKMLGEDPELLKAIVYNDDNLTYGSIISVYTGPDDTVTALTDDGIDELKDMLRDARITTETWHAFLDDFVDDAELVARIKAQSPR
Blast result :
Comments
ISPpa5 has been identified by its integration to the entrapment vector pMEC1. As judged from hybridization analysis ISPpa5 is present in three strains of P. pantotrophus (2 copies in the host strain DSM 11072; 3 copies in DSM 11073 and 2 copies in LMD 82.5) as well as in 1 copy in P. denitrificans DSM 413.
ISPpa5 is 59% (ORF1) aa similar to IS1087, 63% (ORF2) to IS66, 54% (ORF3) to ISRm14 and 47%(ORF4) to ISXc5.
ISPpa5 is 59% (ORF1) aa similar to IS1087, 63% (ORF2) to IS66, 54% (ORF3) to ISRm14 and 47%(ORF4) to ISXc5.
References
1] Bartosik,D., Sochacka,M. and Baj,J. (2003) J. Bacteriol. 185 (13), 3753-3763
2] Bartosik,D.(2003) Direct submission
2] Bartosik,D.(2003) Direct submission