ISPye9
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP020444 | ND | Paracoccus yeei | Paracoccus yeei FDAARGOS_252 Paracoccus yeei CCUG 32053 Paracoccus yeei TT13 |
DNA section
IS Length : 2488 bp
Ends
IR Length : 17/22
IRL : GTAAGCGGCAAGGAGACCCCGTGGGTTAGCGGCGTTGCCGGTCCGGGATA
IRR : GTAAGCGGGAAGCTGATGCCGTTCAGGGCGTGTAGTTCCAAGGCATGAGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
gaaaccacga | GGGTTTTG | agacagcgag | 8 |
gccgaagcgg | TGAGGTGT | ttggagatcc | 8 |
tgcgacaggc | GAGGCGCC | cggcatccgc | 8 |
DNA sequence
GTAAGCGGCAAGGAGACCCCGTGGGTTAGCGGCGTTGCCGGTCCGGGATAGGCTGCTGGCGTTGTTCTGACGCCGGGGTTTCATGTCTACGCCTAGTTCT
ACGCCTATGCCTGCGACTGGTAGCTTACAGTTGGTCGAGGCTGTGATCGAGCGTTTCGACGGAGCTCCGGTTGGGGAGCGCCGCCGGTGGTCGGATGAGT
TCAAGGCCCAAGCCGTCGCGGCGGCGCTGGAGCCAGGCGTGAATGTTTCGGCGTTGGCGCGCCGCCTGGGGATTTCGCCGCCGCAGCTGTTCGGCTGGCG
CAAGGCCTTCCTGAAGAAGCAGAACGCAGACGCCCCGGCGGGCACGCCAACACCCATGGTGGAGATCGTGGTGGGCGAGGTGATGATCCGCGTCGGCCCG
GACCTGAGCGAGGCTGTGTTGCGCCGGATCCTGCGTGCGGTGCGTTCGGCATGATGCCGTCGGGTGTGAAGGTCTATCTGGCCAGCCGGTAGACTTCCGC
AAGGGGCCTGACAGCCTGCTGTCGCTGGTGCGCGATGCCGGCAGTGACCCGTTCAGCGGGGCGCTTTACGTGTTCCGGGCTAAGCGGGCGGACCGGGTCA
AGATTGTCTGGTGGGATGGCAGCGGGATCTGCCTCTTTGCCAAGCGGCTCGAGAAATCCTCGTTCTGCTGGCCGCGGATCGGGCCCGCTCGGGTGCAGCT
CAACCATGCCCAGCTCATGGCGTTGCTCGACGGCCTGGATTGGAAACGGGTTCGTCCCGTAGTGGTAAAAGCGCCTGTATTTGCTGGATAACCAGACTGC
GGCAAAGTGAATCACCCCGGCTCCGGACGTGGATCATCCGAGGCATGGCAAGGCGGGATGTGCTATCCTCAGCGGCATGCGCACCGACGATCTTTCCCTG
CCCGATGACATGGACCTGCTCAAGGCCATGGTCCGCGCCATGGCGGAGAAGACGGCCGTGCTGGAGGACGAGAATGCGGCCCTGAAGGCCCGCAGCCTCG
ATGCCGACGCGCGGATCAAGCGGCTGATGCAGATCCTGAAGGCCTATGACCGGGCCCGGTTCGGGCGGCGCTCCGAGAAGCTTGGCACTGCTGGGGCTGG
TGCTGACGAAGAGGCGCAGCAGGCCTTCGTCTTCGAGGAGATCGAGACCGGCATCTCGGCGCTGAGGTCGCAGACAGGTCAAGGCCGGACCTCGGGCGAG
AAGCAGGCTCCGCGGCCGCGCAAGGGCTTTGCGCCCCATCTCGAGCGGGTCGAGGTGGTGATCGAACCCGAGAACCTGCCGGAACACGCGGGCCGAAAGA
AAGTGCTGATCGGCGAAGACATCTCGGAGCGGCTGGATGTCATTCCGGCCAGGTTCCGTGTCATCGTGACCCGCCGCCCGAAATATGCGTTCAAGGACGC
GGATGGCGTCGTCCAGGCGCTCGCCCCGGCCCACATCATCGAAAGCGGCCTGCCGACCGAGGCATTGCTGGCCCAGATCGCGGTCTCCAAATATGCCGAT
GGCCTGCCGCTGTTCCGGCAGGAAGGGATCCATGCCCGCGACCGGGTGGAGATCGACCGGCGTCTGATGGCGCAATGGATGGGCCGGGTCGGCTTCGAGC
TGGAGATCCTCGCTGCCCATGTGCTGGGCGAAATCCTGAAGGGCCCGCGCTTCTTCGCCGATGAAACCAGCCTGCCGACCCTCGCGCCCGGCACCGGCGC
GGTGAAAAAGGCCTGGCTCTGGGCCTATGCCCGCGACGACAGCACCTTCGGCGGCAGCGGGCCACCCATGGTGGCCTATCGCTTCGAGGACAGCCGATCC
GGAGAATGTGTGCGCCGACATCTCGGCGGCTACGGCGGCATCCTCCAAGTGGACGGCTATGCCGCCTACAACCAGCTTGTCCGCAAGGATGGGGGCAATG
ATGGCCCGCGCCTCGCTGGATGTTGGGCCCATAGCCGGCGGCGGTTCTTTGAGCTCCACACCGCGGGGGACAGCCAGGTCGCCACCACCACGGTCGAGCG
CATGGCCGATCTGTGGAAGCTCGAGGCCGAGGTGCGCGGCCAGAGCCCCGAGGCCCGCGCCGCCGCGCGGCAGGCCATCTCCGCGCCCATCGTCGCCGAG
TTGTTCGCCCTCTGGCAGCAGACACTGCCCCGCATCTCGGGCAAATCGAAGCTGGCCGAGGCCATCCGCTATGCCATCGCCCGGCGCCACATCTTCGAGC
GCTTCCTGACCGACGGCCTGATTGAACTCGACTCCAACATCGTCGAGCGGGCTATCAGGCCCCAGACCATCACCCGCAAGAACAGCCTCTTCGCTGGCTC
GGACGGCGGTGGCCGCACATGGGCCACCATCGCCACCCTCCTGCAGACCTGCAAGATGAACAACGTCGATCCTACGGCCTGGCTGACGCAGACCCTCGAA
CGCATCGCCAACCAATGGCCGAGCGCCAAAATCGACGTCCTCATGCCTTGGAACTACACGCCCTGAACGGCATCAGCTTCCCGCTTAC
ACGCCTATGCCTGCGACTGGTAGCTTACAGTTGGTCGAGGCTGTGATCGAGCGTTTCGACGGAGCTCCGGTTGGGGAGCGCCGCCGGTGGTCGGATGAGT
TCAAGGCCCAAGCCGTCGCGGCGGCGCTGGAGCCAGGCGTGAATGTTTCGGCGTTGGCGCGCCGCCTGGGGATTTCGCCGCCGCAGCTGTTCGGCTGGCG
CAAGGCCTTCCTGAAGAAGCAGAACGCAGACGCCCCGGCGGGCACGCCAACACCCATGGTGGAGATCGTGGTGGGCGAGGTGATGATCCGCGTCGGCCCG
GACCTGAGCGAGGCTGTGTTGCGCCGGATCCTGCGTGCGGTGCGTTCGGCATGATGCCGTCGGGTGTGAAGGTCTATCTGGCCAGCCGGTAGACTTCCGC
AAGGGGCCTGACAGCCTGCTGTCGCTGGTGCGCGATGCCGGCAGTGACCCGTTCAGCGGGGCGCTTTACGTGTTCCGGGCTAAGCGGGCGGACCGGGTCA
AGATTGTCTGGTGGGATGGCAGCGGGATCTGCCTCTTTGCCAAGCGGCTCGAGAAATCCTCGTTCTGCTGGCCGCGGATCGGGCCCGCTCGGGTGCAGCT
CAACCATGCCCAGCTCATGGCGTTGCTCGACGGCCTGGATTGGAAACGGGTTCGTCCCGTAGTGGTAAAAGCGCCTGTATTTGCTGGATAACCAGACTGC
GGCAAAGTGAATCACCCCGGCTCCGGACGTGGATCATCCGAGGCATGGCAAGGCGGGATGTGCTATCCTCAGCGGCATGCGCACCGACGATCTTTCCCTG
CCCGATGACATGGACCTGCTCAAGGCCATGGTCCGCGCCATGGCGGAGAAGACGGCCGTGCTGGAGGACGAGAATGCGGCCCTGAAGGCCCGCAGCCTCG
ATGCCGACGCGCGGATCAAGCGGCTGATGCAGATCCTGAAGGCCTATGACCGGGCCCGGTTCGGGCGGCGCTCCGAGAAGCTTGGCACTGCTGGGGCTGG
TGCTGACGAAGAGGCGCAGCAGGCCTTCGTCTTCGAGGAGATCGAGACCGGCATCTCGGCGCTGAGGTCGCAGACAGGTCAAGGCCGGACCTCGGGCGAG
AAGCAGGCTCCGCGGCCGCGCAAGGGCTTTGCGCCCCATCTCGAGCGGGTCGAGGTGGTGATCGAACCCGAGAACCTGCCGGAACACGCGGGCCGAAAGA
AAGTGCTGATCGGCGAAGACATCTCGGAGCGGCTGGATGTCATTCCGGCCAGGTTCCGTGTCATCGTGACCCGCCGCCCGAAATATGCGTTCAAGGACGC
GGATGGCGTCGTCCAGGCGCTCGCCCCGGCCCACATCATCGAAAGCGGCCTGCCGACCGAGGCATTGCTGGCCCAGATCGCGGTCTCCAAATATGCCGAT
GGCCTGCCGCTGTTCCGGCAGGAAGGGATCCATGCCCGCGACCGGGTGGAGATCGACCGGCGTCTGATGGCGCAATGGATGGGCCGGGTCGGCTTCGAGC
TGGAGATCCTCGCTGCCCATGTGCTGGGCGAAATCCTGAAGGGCCCGCGCTTCTTCGCCGATGAAACCAGCCTGCCGACCCTCGCGCCCGGCACCGGCGC
GGTGAAAAAGGCCTGGCTCTGGGCCTATGCCCGCGACGACAGCACCTTCGGCGGCAGCGGGCCACCCATGGTGGCCTATCGCTTCGAGGACAGCCGATCC
GGAGAATGTGTGCGCCGACATCTCGGCGGCTACGGCGGCATCCTCCAAGTGGACGGCTATGCCGCCTACAACCAGCTTGTCCGCAAGGATGGGGGCAATG
ATGGCCCGCGCCTCGCTGGATGTTGGGCCCATAGCCGGCGGCGGTTCTTTGAGCTCCACACCGCGGGGGACAGCCAGGTCGCCACCACCACGGTCGAGCG
CATGGCCGATCTGTGGAAGCTCGAGGCCGAGGTGCGCGGCCAGAGCCCCGAGGCCCGCGCCGCCGCGCGGCAGGCCATCTCCGCGCCCATCGTCGCCGAG
TTGTTCGCCCTCTGGCAGCAGACACTGCCCCGCATCTCGGGCAAATCGAAGCTGGCCGAGGCCATCCGCTATGCCATCGCCCGGCGCCACATCTTCGAGC
GCTTCCTGACCGACGGCCTGATTGAACTCGACTCCAACATCGTCGAGCGGGCTATCAGGCCCCAGACCATCACCCGCAAGAACAGCCTCTTCGCTGGCTC
GGACGGCGGTGGCCGCACATGGGCCACCATCGCCACCCTCCTGCAGACCTGCAAGATGAACAACGTCGATCCTACGGCCTGGCTGACGCAGACCCTCGAA
CGCATCGCCAACCAATGGCCGAGCGCCAAAATCGACGTCCTCATGCCTTGGAACTACACGCCCTGAACGGCATCAGCTTCCCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
348 bp | 115 aa | 107 | 454 | + | No |
AG : IS66 TnpA
ORF sequence :
MPATGSLQLVEAVIERFDGAPVGERRRWSDEFKAQAVAAALEPGVNVSALARRLGISPPQLFGWRKAFLKKQNADAPAGTPTPMVEIVVGEVMIRVGPDL
SEAVLRRILRAVRSA
SEAVLRRILRAVRSA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
357 bp | 118 aa | 435 | 791 | + | No |
AG : IS66 TnpB
ORF sequence :
VRCVRHDAVGCEGLSGQPVDFRKGPDSLLSLVRDAGSDPFSGALYVFRAKRADRVKIVWWDGSGICLFAKRLEKSSFCWPRIGPARVQLNHAQLMALLDG
LDWKRVRPVVVKAPVFAG
LDWKRVRPVVVKAPVFAG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1590 bp | 529 aa | 877 | 2466 | + | No |
Chemistry : DDE
ORF sequence :
MRTDDLSLPDDMDLLKAMVRAMAEKTAVLEDENAALKARSLDADARIKRLMQILKAYDRARFGRRSEKLGTAGAGADEEAQQAFVFEEIETGISALRSQT
GQGRTSGEKQAPRPRKGFAPHLERVEVVIEPENLPEHAGRKKVLIGEDISERLDVIPARFRVIVTRRPKYAFKDADGVVQALAPAHIIESGLPTEALLAQ
IAVSKYADGLPLFRQEGIHARDRVEIDRRLMAQWMGRVGFELEILAAHVLGEILKGPRFFADETSLPTLAPGTGAVKKAWLWAYARDDSTFGGSGPPMVA
YRFEDSRSGECVRRHLGGYGGILQVDGYAAYNQLVRKDGGNDGPRLAGCWAHSRRRFFELHTAGDSQVATTTVERMADLWKLEAEVRGQSPEARAAARQA
ISAPIVAELFALWQQTLPRISGKSKLAEAIRYAIARRHIFERFLTDGLIELDSNIVERAIRPQTITRKNSLFAGSDGGGRTWATIATLLQTCKMNNVDPT
AWLTQTLERIANQWPSAKIDVLMPWNYTP
GQGRTSGEKQAPRPRKGFAPHLERVEVVIEPENLPEHAGRKKVLIGEDISERLDVIPARFRVIVTRRPKYAFKDADGVVQALAPAHIIESGLPTEALLAQ
IAVSKYADGLPLFRQEGIHARDRVEIDRRLMAQWMGRVGFELEILAAHVLGEILKGPRFFADETSLPTLAPGTGAVKKAWLWAYARDDSTFGGSGPPMVA
YRFEDSRSGECVRRHLGGYGGILQVDGYAAYNQLVRKDGGNDGPRLAGCWAHSRRRFFELHTAGDSQVATTTVERMADLWKLEAEVRGQSPEARAAARQA
ISAPIVAELFALWQQTLPRISGKSKLAEAIRYAIARRHIFERFLTDGLIELDSNIVERAIRPQTITRKNSLFAGSDGGGRTWATIATLLQTCKMNNVDPT
AWLTQTLERIANQWPSAKIDVLMPWNYTP
Blast result :
Comments
ISPye9 is 86% aa similar to ISRel15.
ISPye9 was identified by in silico nucleotide sequence analysis of Paracoccus yeei strains: FDAARGOS_252, TT13 and CCUG 32053.
ISPye9 was identified by in silico nucleotide sequence analysis of Paracoccus yeei strains: FDAARGOS_252, TT13 and CCUG 32053.
References
1] Chmielowska C., Szuplewska M., Bartosik D. (2018) Direct submission.
2] Goldberg,B., Campos,J., Tallon,L., Sadzewicz,L., Sengamalay,N., Ott,S., Godinez,A., Nagaraj,S., Vavikolanu,K., Aluvathingal,J., Nadendla,S. and Sichtig,H. (2017) Direct GenBank submission.
2] Goldberg,B., Campos,J., Tallon,L., Sadzewicz,L., Sengamalay,N., Ott,S., Godinez,A., Nagaraj,S., Vavikolanu,K., Aluvathingal,J., Nadendla,S. and Sichtig,H. (2017) Direct GenBank submission.