ISPye2
- Family IS5
- Group IS5
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP020442 | ND | Paracoccus yeei | Paracoccus yeei Paracoccus yeei FDAARGOS_252 |
DNA section
IS Length : 1484 bp
Ends
IR Length : 13/16
IRL : GTCCGGCCTGAACAACCCTTAGCGCGTTGATTTTGTGTGCTGAAGATGCG
IRR : GTCGGTTCTGAACAACTCCTCAGGCGGCCACTGCATGGCGGTGAAGGGTC
Comments : For some of the copies we identified empty insertion sites in other Paracoccus strains, which enabled precisely defining ends and DR's.
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
cccctcctcc | CAATTA | gggcgctcgg | 6 |
gttgcgcctt | TACTTA | ttatggtctg | 6 |
cggcttcacg | TACTGA | aagtcctcgg | 6 |
ccggtgacga | TAATGA | tgatcaccgc | 6 |
catccttgcc | TCGATA | tggtccgaac | 6 |
ccgcgtctct | TCATTA | ggctcttcga | 6 |
atcaaattgt | TAACGA | agagttccgc | 6 |
ttagattggg | TAAATA | accaagactc | 6 |
agcccccttc | TAGGTA | gagaccttat | 6 |
ccagattgca | TAGCGTA | ttgacaatca | 7 |
ggaggccgca | TAATGG | CCAGAACCTA | 6 |
gttcagcccc | CAATTA | cacccaccgc | 6 |
gccgacacct | TAACCA | gcgccgatgt | 6 |
atttgatttt | TCATTA | accctgtggt | 6 |
tagctggacc | TACTTG | gtccggcctg | 6 |
cagaaccgac | TACTTG | tcttctagcg | 6 |
aagctgtctt | TCCATA | ggccggaact | 6 |
DNA sequence
GTCCGGCCTGAACAACCCTTAGCGCGTTGATTTTGTGTGCTGAAGATGCGGTTTTGTTGGCTTTGAATTGCAGGGTTTCGTATGCTTTTGCCAGAAACCC
TGCAATTTCCGGTCGCCGATGAAGCCCCATTCCCGCGCCCCTGAACAGGATGACCTGCTCCGCCCGAGACTGGTCGACATGATCGACGCCCGCCATGAGT
TGGTGAAGCTTGCGGCACTGATCGACTGGGATTTCTTTGAACGGGAGTGGGCGAGCTTCTTTCCGTCTCGCCAGGGACGGCCAGCGACATCTCCACGCCT
GGTGGCGGGGCTCATGTATCTCCAGCACGCCTTCAAGCTCTCTGACGAGGCAGTCGTCGCCCGGTGGGTCGAGAACCCGTATTACCAGCACTTCACCGGC
GAGACGTTCTTCCAGCACCGTCCCCCGATCGATCCCTCGTCGCTGGTGCGCTGGCGCAAGCGGATCGGGGAGGAAGGAGTGGAGTGGCTGCTGACCAAGA
CGATCGAGGCAGGCCGAGCTTCTGGCGCGGTCACCGACAAGAGCCTGAAGCGGGTGGCTGTGGATACGACCGTGATGGAAAAGACCATCGCCCATCCCAC
GGATGCGCGGCTTTACGAGCGCGCCCGCGCGCTGTTGGTCGGCTTGGCGAAGGAAGCGGGGGTCGATCTGCGCCAGAACTACGCGCGCCTTGCCCCGCGG
CTGGCCGCCCAGGTGGGGCGGTATGCCCATGCCCGGCAGTTCAAGCGCATGCGCAAGGCCCTGCGCCAACTCAAGGGCTATGTTGGCCGCGTGCGCCGGG
ACCTGCGCCGCCACCTGCAGGACATCCCCGAAGGCGCGCTGCGCGGACGGGTGCTGGAGGCGCTCTGGCTGGTCGGTCGCCTGCTCGAACAGACACCGAA
GAGCAAGAACAAGATCTACGCCCTGCACGAGCCCGAGGTCGACTGCATCTCCAAGGGCAAGGCGCGCATCCGCTATGAGTTCGGCACCAAGGTCAGCCTT
GCTACTACCCTCGACGGAGGCTTTGTCGTCGGCGCCCGCAGCTTCCCTGACAACCCCTACGACGGCCATACATTGGCGCCTGCACTGGAGCAGGTTGCCA
TCCTGACCGAGCAGGTGCCGGATCTCGCCGTCGTCGATCGCGGCTATCGCGGCCATGGCGTGGAGACCACCAAGGTCCTGATCAGCGGCACAAGACGCGG
CATCACCCCGCTCCTGGCAAAGCTCCTCAGGCGACGAAGTGCCATCGAGCCTGAGATCGGGCACATGAAGAGCGATGGTCGCCTGGCCAGATGCCCGCTG
AAAGGCCGCATCGGCGACGCGGTCTTCGCCGTCCTCTGCGCCTGCGGGCACAATATCCGCAAGATCCTCGCCCATCTCAGGGCTCTTTGGACTCTATTGC
TTCACCTGATCACGGCGATCATCCGGGGCGAAAGGACCCTTCACCGCCATGCAGTGGCCGCCTGAGGAGTTGTTCAGAACCGAC
TGCAATTTCCGGTCGCCGATGAAGCCCCATTCCCGCGCCCCTGAACAGGATGACCTGCTCCGCCCGAGACTGGTCGACATGATCGACGCCCGCCATGAGT
TGGTGAAGCTTGCGGCACTGATCGACTGGGATTTCTTTGAACGGGAGTGGGCGAGCTTCTTTCCGTCTCGCCAGGGACGGCCAGCGACATCTCCACGCCT
GGTGGCGGGGCTCATGTATCTCCAGCACGCCTTCAAGCTCTCTGACGAGGCAGTCGTCGCCCGGTGGGTCGAGAACCCGTATTACCAGCACTTCACCGGC
GAGACGTTCTTCCAGCACCGTCCCCCGATCGATCCCTCGTCGCTGGTGCGCTGGCGCAAGCGGATCGGGGAGGAAGGAGTGGAGTGGCTGCTGACCAAGA
CGATCGAGGCAGGCCGAGCTTCTGGCGCGGTCACCGACAAGAGCCTGAAGCGGGTGGCTGTGGATACGACCGTGATGGAAAAGACCATCGCCCATCCCAC
GGATGCGCGGCTTTACGAGCGCGCCCGCGCGCTGTTGGTCGGCTTGGCGAAGGAAGCGGGGGTCGATCTGCGCCAGAACTACGCGCGCCTTGCCCCGCGG
CTGGCCGCCCAGGTGGGGCGGTATGCCCATGCCCGGCAGTTCAAGCGCATGCGCAAGGCCCTGCGCCAACTCAAGGGCTATGTTGGCCGCGTGCGCCGGG
ACCTGCGCCGCCACCTGCAGGACATCCCCGAAGGCGCGCTGCGCGGACGGGTGCTGGAGGCGCTCTGGCTGGTCGGTCGCCTGCTCGAACAGACACCGAA
GAGCAAGAACAAGATCTACGCCCTGCACGAGCCCGAGGTCGACTGCATCTCCAAGGGCAAGGCGCGCATCCGCTATGAGTTCGGCACCAAGGTCAGCCTT
GCTACTACCCTCGACGGAGGCTTTGTCGTCGGCGCCCGCAGCTTCCCTGACAACCCCTACGACGGCCATACATTGGCGCCTGCACTGGAGCAGGTTGCCA
TCCTGACCGAGCAGGTGCCGGATCTCGCCGTCGTCGATCGCGGCTATCGCGGCCATGGCGTGGAGACCACCAAGGTCCTGATCAGCGGCACAAGACGCGG
CATCACCCCGCTCCTGGCAAAGCTCCTCAGGCGACGAAGTGCCATCGAGCCTGAGATCGGGCACATGAAGAGCGATGGTCGCCTGGCCAGATGCCCGCTG
AAAGGCCGCATCGGCGACGCGGTCTTCGCCGTCCTCTGCGCCTGCGGGCACAATATCCGCAAGATCCTCGCCCATCTCAGGGCTCTTTGGACTCTATTGC
TTCACCTGATCACGGCGATCATCCGGGGCGAAAGGACCCTTCACCGCCATGCAGTGGCCGCCTGAGGAGTTGTTCAGAACCGAC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1347 bp | 448 aa | 119 | 1465 | + | No |
Chemistry : DDE
ORF sequence :
MKPHSRAPEQDDLLRPRLVDMIDARHELVKLAALIDWDFFEREWASFFPSRQGRPATSPRLVAGLMYLQHAFKLSDEAVVARWVENPYYQHFTGETFFQH
RPPIDPSSLVRWRKRIGEEGVEWLLTKTIEAGRASGAVTDKSLKRVAVDTTVMEKTIAHPTDARLYERARALLVGLAKEAGVDLRQNYARLAPRLAAQVG
RYAHARQFKRMRKALRQLKGYVGRVRRDLRRHLQDIPEGALRGRVLEALWLVGRLLEQTPKSKNKIYALHEPEVDCISKGKARIRYEFGTKVSLATTLDG
GFVVGARSFPDNPYDGHTLAPALEQVAILTEQVPDLAVVDRGYRGHGVETTKVLISGTRRGITPLLAKLLRRRSAIEPEIGHMKSDGRLARCPLKGRIGD
AVFAVLCACGHNIRKILAHLRALWTLLLHLITAIIRGERTLHRHAVAA
RPPIDPSSLVRWRKRIGEEGVEWLLTKTIEAGRASGAVTDKSLKRVAVDTTVMEKTIAHPTDARLYERARALLVGLAKEAGVDLRQNYARLAPRLAAQVG
RYAHARQFKRMRKALRQLKGYVGRVRRDLRRHLQDIPEGALRGRVLEALWLVGRLLEQTPKSKNKIYALHEPEVDCISKGKARIRYEFGTKVSLATTLDG
GFVVGARSFPDNPYDGHTLAPALEQVAILTEQVPDLAVVDRGYRGHGVETTKVLISGTRRGITPLLAKLLRRRSAIEPEIGHMKSDGRLARCPLKGRIGD
AVFAVLCACGHNIRKILAHLRALWTLLLHLITAIIRGERTLHRHAVAA
Blast result :
Comments
References
1] Chmielowska C., Szuplewska M., Bartosik D. (2018) Direct submission.
2] Goldberg,B., Campos,J., Tallon,L., Sadzewicz,L., Sengamalay,N., Ott,S., Godinez,A., Nagaraj,S., Vavikolanu,K., Aluvathingal,J., Nadendla,S. and Sichtig,H. (2017) Direct GenBank submission.
2] Goldberg,B., Campos,J., Tallon,L., Sadzewicz,L., Sengamalay,N., Ott,S., Godinez,A., Nagaraj,S., Vavikolanu,K., Aluvathingal,J., Nadendla,S. and Sichtig,H. (2017) Direct GenBank submission.