ISPye33
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP020442 | ND | Paracoccus yeei | Paracoccus yeei FDAARGOS_252 |
DNA section
IS Length : 2366 bp
Ends
IR Length : 19/22
IRL : GTAACCGTCCGGTCTAACCGCTCTGCCGCGGTGTGAGAGTGCGGGATAGT
IRR : GTAACCGTCCGGCGTGACCGCTATCCGTTCCAGCGCCACGGCATCAGGTC
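The "IR Length : 19/22" value can be reproduced from the two end sequences above. The following is a minimal sketch, assuming that "19/22" means 19 identical positions within the first 22 bp of each inverted repeat, and that IRR is already listed as the reverse complement of the right end (so both repeats read outward-in). The function name `count_ir_matches` is hypothetical, not part of any database tooling.

```python
# IR sequences copied verbatim from the "Ends" block of this record.
IRL = "GTAACCGTCCGGTCTAACCGCTCTGCCGCGGTGTGAGAGTGCGGGATAGT"
IRR = "GTAACCGTCCGGCGTGACCGCTATCCGTTCCAGCGCCACGGCATCAGGTC"


def count_ir_matches(left: str, right: str, window: int) -> int:
    """Count identical bases in the first `window` positions of two IRs.

    Assumption: both sequences are given in the same orientation, so a
    plain position-by-position comparison (no gaps) is sufficient.
    """
    return sum(a == b for a, b in zip(left[:window], right[:window]))


if __name__ == "__main__":
    window = 22  # assumed IR span, taken from "19/22"
    print(f"{count_ir_matches(IRL, IRR, window)}/{window}")
```

Under this reading the comparison gives 19 matching positions out of 22, in agreement with the listed value.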
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
cgtcctcgat | GATGAGGG | ggatgagatc | 8 |
DNA sequence
GTAACCGTCCGGTCTAACCGCTCTGCCGCGGTGTGAGAGTGCGGGATAGTGTCCACCATTGATAGTGGACACTGTGGTGATGACGTGGCGCGTCGGACGA
AGCGGCTTTGGACGGATGAGGAGAAGCGGTCGATCTGCTTCCAGACGGCGGCGCCGGGGGTATCGGTCGCCCAGGTGGCGCGGCGCTACGCGGTGAATGC
GAACTTGATCTTCAAGTGGCTGCGCGATCGCCGTTACGCGCCGGACCCCGCCTCGGCTCTTCCCCCGTCAGAGGAGGCGCGCTTTCTGCCTGTGGAGATT
GTCGCGGAGACCAGGTCTGCCCGGGCAACACCTGCCGCCCACAACCACATCGAGATCGAATTGGCGGGCGGGCACCGGATGCGCATCAGCGGGAGCTATG
ATCCCGAGGCTCTGGCGCGGCTGATCCGGGGCGTGACGGCGTGATCCCGGTTCCGGCGAACACGCGGGTCTGGCTGGCGGCGGGCGCCACCGACATGCGC
AAGGGGTTCACGGCACTGGCGGCGCAGGCCGAGGCGGTGCTGAAGCAGGACCCGTTCGCCGGGCATCTGTTCGTCTTCCGTGGCCGTCGGGGCGATCTGG
TGAAGGTGATCTGGTGGGATGGCCAAGGCGCGTGCATGTTCATGAAGCGGCTGGAGAAGGGTCGGTTCGTCTGGCCCTCGGCCAAGGAGGGCAAGGTGGC
TTTGTCGCCTGCGCAACTGTCGATGCTGCTGGAAGGCATCGACTGGCGGGCACCGGAACGGACGTGGCGGCCTCTGGCGGCGGGATAGTCCACGGCGCCT
GCAATCAGTGATTCCCACAAGTAATACAGTGGGATAAACTTTCCGCATGCTCACCCAGGCCCTGACATTGCCGGAAGACCCCGAGGAGCTGCGCAGCTTC
ACCGCGCGGCTTCTGGCCGAGGTGAAGGCCCAGGCGATCCTGATCGAGAAGCTCCGGCATCAGCTGGCCGGGCACCGGGCGCACCGGTTTGGTGCGTCGT
CCGAGACGGCGGAACAGTTGCAGCTGGCCCTTGAGACCAGCGAGATCGCCGCCGCCGCGATGACCGCGCGGATGAAGCTGCCGGACATCGAGGAGAAGGA
CAAACCCAAGCGTCGTCCGATCCCGGACCACATCCCGCGAATGGAGGTGGAGCTGACACCGAGCACAGATGCCTGTGCCGATTGCGGCGGGCGCCTGCGC
CGGATCGGGGAAGATGTGACGGAAGAGCTGGAATACGTTCCCGGGCGGTTCATCGTGAACCGGCTTGTCCGCCCTCGGCTGACCTGCTCGTGCTGTGAAC
GCTTCGTGCAGTCCCCACTGCCCTCGCGCCCGATCGAGCGCGGTCGCCCGGGGCCGGGCCTCCTCGCCCATGTCTTGGTGAGCAAGTATGCCGACCACCT
TCCGCTCTACCGCCAGAGCCAGATCTTCGAGCGCGAAGGTCTCGACCTGGACCGGTCCACCCTGGCCGACTGGGTGGGCAAGAGCACGGCCCTGTTGGAG
CCGCTGGCCGACGCCATCGGGCGCCATGTCTTCTCAGCCGAGGCGATCTTCGCCGACGACACGCCGATCAGCATGCTGGCGCCCGGCACCGGCAAGACCC
AGACCGCCCGGCTCTGGACCTATGCGCGCGACGAACGCCCTTGGGGCGGGCAGGCCCCACCGGCGGCATGGTATCGCTTCTCCGGTGACCGCAAGGGCCA
GCATCCCAAGGACCACCTCGCCCGCTTCCGCGGCTGGATGCACGCCGACGGTTATGCCGGGTTCGAGGATCTCTACCGCTCCGGCACCATCCGCGAGGTC
GCCTGCATGGCCCATGTCCGGCGAAAGTTCGTCGATATCCATCGGTCGCAGGGCTCCCCGATCGCCGGAGAGGCCATCGGCCGGATCGCACAGCTCTACG
CCGTCGAGAAAGAAGCCCGAGGGTCGCCGCCAGACCGCCGCACGGAACTCCGCAAAGCTCACGCCGCCCCGGTCTTCGACGATCTGCAGCGATGGCTGGC
CATGAGACTGACGGAAATCTCGGGCAAATCCCCGCTCGCGGCCGCCATCCGCTATGCCCTGACCCGAATGGACCGCCTGCGCCCCTACCTCGACCACGGC
ATCCTGGAGTTGGACAACAACACCGCCGAACGCGGCATGCGCGCCATCGCCCTCGGGCGGAAGAACTACCTCTTCGTCGGCTCCGAGGCGGGCGGCAACG
CCGCTGCCATCGCCTACACCCTGATCGAAACGGCCAAGCTCAACGCCGTCGATCCCCACGCCTGGCTCGCTGACACTCTCGCCCGCATTCCCGACTACAA
GATCACCAAGGTCGATGACCTGATGCCGTGGCGCTGGAACGGATAGCGGTCACGCCGGACGGTTAC
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
396 bp | 131 aa | 49 | 444 | + | No |
AG : IS66 TnpA
ORF sequence :
VSTIDSGHCGDDVARRTKRLWTDEEKRSICFQTAAPGVSVAQVARRYAVNANLIFKWLRDRRYAPDPASALPPSEEARFLPVEIVAETRSARATPAAHNH
IEIELAGGHRMRISGSYDPEALARLIRGVTA
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
348 bp | 115 aa | 441 | 788 | + | No |
AG : IS66 TnpB
ORF sequence :
VIPVPANTRVWLAAGATDMRKGFTALAAQAEAVLKQDPFAGHLFVFRGRRGDLVKVIWWDGQGACMFMKRLEKGRFVWPSAKEGKVALSPAQLSMLLEGI
DWRAPERTWRPLAAG
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1500 bp | 498 aa | 847 | 2346 | + | No |
Chemistry : DDE
ORF sequence :
MLTQALTLPEDPEELRSFTARLLAEVKAQAILIEKLRHQLAGHRAHRFGASSETAEQLQLALETSEIAAAAMTARMKLPDIEEKDKPKRRPIPDHIPRME
VELTPSTDACADCGGRLRRIGEDVTEELEYVPGRFIVNRLVRPRLTCSCCERFVQSPLPSRPIERGRPGPGLLAHVLVSKYADHLPLYRQSQIFEREGLD
LDRSTLADWVGKSTALLEPLADAIGRHVFSAEAIFADDTPISMLAPGTGKTQTARLWTYARDERPWGGQAPPAAWYRFSGDRKGQHPKDHLARFRGWMHA
DGYAGFEDLYRSGTIREVACMAHVRRKFVDIHRSQGSPIAGEAIGRIAQLYAVEKEARGSPPDRRTELRKAHAAPVFDDLQRWLAMRLTEISGKSPLAAA
IRYALTRMDRLRPYLDHGILELNNTAERGMRAIALGRKNYLFVGSEAGGNAAAIAYTLIETAKLNAVDPHAWLADTLARIPDYKITKVDDLMPWRWNG
Blast result :
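The ORF coordinates in the tables above can be cross-checked with a short sketch. It assumes the Begin and End values are 1-based, inclusive positions on the 2366 bp element, which is consistent with the listed lengths in bp. The helper names `orf_length_bp` and `spacing_bp` are hypothetical.

```python
# (name, begin, end) copied from the ORF tables of this record.
# Assumption: coordinates are 1-based and inclusive.
ORFS = [
    ("ORF 1", 49, 444),
    ("ORF 2", 441, 788),
    ("ORF 3", 847, 2346),
]


def orf_length_bp(begin: int, end: int) -> int:
    """Length in bp of an ORF given inclusive 1-based coordinates."""
    return end - begin + 1


def spacing_bp(prev_end: int, next_begin: int) -> int:
    """Bases between two consecutive ORFs on the same strand.

    A positive value is an intergenic gap; a negative value is an overlap
    of that many bases.
    """
    return next_begin - prev_end - 1


if __name__ == "__main__":
    for name, begin, end in ORFS:
        print(name, orf_length_bp(begin, end), "bp")
    for (n1, _, e1), (n2, b2, _) in zip(ORFS, ORFS[1:]):
        print(f"{n1} -> {n2}: {spacing_bp(e1, b2)} bp")
```

With these coordinates the lengths come out as 396, 348, and 1500 bp, matching the tables. ORF 1 and ORF 2 overlap by 4 bp (positions 441 to 444), and ORF 2 and ORF 3 are separated by 58 bp.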
Comments
ISPye33 shares 67% amino acid similarity with ISPre3.
ISPye33 was identified by in silico sequence analysis of Paracoccus yeei strain FDAARGOS_252. The sequence is reconstructed: the element is disrupted by insertions of ISPye32 and ISPye31.
References
[1] Chmielowska C., Szuplewska M., Bartosik D. (2018) Direct submission.
[2] Goldberg B., Campos J., Tallon L., Sadzewicz L., Sengamalay N., Ott S., Godinez A., Nagaraj S., Vavikolanu K., Aluvathingal J., Nadendla S. and Sichtig H. (2017) Direct GenBank submission.