ISPye32
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP020444 | ND | Paracoccus yeei | Paracoccus yeei FDAARGOS_252 |
DNA section
IS Length : 1468 bp
Ends
IR Length : 16/22
IRL : ATCGGGCATGCTGTTATCGGCGCCCTTCGAATGGCACACGATTCTGACCT
IRR : CGGATGTTGTTGAACAGCCCCTTTCTCAGGTCTTGTTCGGTTTTGATGGT
Comments : For some of the copies, we identified empty insertion sites in other strains, which allowed the ends and DRs to be defined precisely. The IRs are not located at the termini of the IS.
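The relationship between the IRL/IRR lines and the element termini can be checked with a short script. The sketch below is illustrative only: it assumes that IRL is the first 50 bp of the IS as listed under "DNA sequence" and that IRR is the last 50 bp written as its reverse complement. The names `LEFT_END`, `RIGHT_END` and `reverse_complement` are introduced here for illustration and are not part of the record.

```python
# Terminal 50 bp copied from the "DNA sequence" field (top strand, 5'->3').
LEFT_END = "ATCGGGCATGCTGTTATCGGCGCCCTTCGAATGGCACACGATTCTGACCT"
RIGHT_END = "ACCATCAAAACCGAACAAGACCTGAGAAAGGGGCTGTTCAACAACATCCG"

# IRL and IRR exactly as given in the "Ends" field.
IRL = "ATCGGGCATGCTGTTATCGGCGCCCTTCGAATGGCACACGATTCTGACCT"
IRR = "CGGATGTTGTTGAACAGCCCCTTTCTCAGGTCTTGTTCGGTTTTGATGGT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# IRL should equal the left end as written; IRR should equal the
# reverse complement of the right end (assumption stated above).
irl_matches_left_end = IRL == LEFT_END
irr_matches_right_end = IRR == reverse_complement(RIGHT_END)

print("IRL matches left end:", irl_matches_left_end)
print("IRR matches reverse-complemented right end:", irr_matches_right_end)
```

Writing both ends in the same outward-to-inward orientation makes them directly comparable position by position, which is how an IR length such as 16/22 would normally be assessed.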
Insertion site
Left flank | Direct repeat | Right flank | DR length (bp) |
---|---|---|---|
gcggtgttgtT | | Tgtccaactcc | 0 |
gcgccgttgtT | | Tgtcgacatag | 0 |
gcggtgttgtT | | Ttgtccagctc | 0 |
gcaggattgtT | | Tgtcgatcccc | 0 |
acggtgttgtT | | Tgtcgatctcg | 0 |
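The two flanking sequences listed in each row above can be compared column by column to see how conserved the insertion site is. The following is a minimal sketch, not an established method for this record: it takes the most common base at each position as a naive consensus and reports which positions are identical in all rows. The helper names `consensus` and `invariant_positions` are hypothetical.

```python
from collections import Counter

# Flanking sequences copied from the insertion site table (one pair per row).
LEFT_FLANKS = [
    "gcggtgttgtT",
    "gcgccgttgtT",
    "gcggtgttgtT",
    "gcaggattgtT",
    "acggtgttgtT",
]
RIGHT_FLANKS = [
    "Tgtccaactcc",
    "Tgtcgacatag",
    "Ttgtccagctc",
    "Tgtcgatcccc",
    "Tgtcgatctcg",
]


def _columns(seqs):
    """Uppercase the sequences and return their aligned columns."""
    upper = [s.upper() for s in seqs]
    if len({len(s) for s in upper}) != 1:
        raise ValueError("all sequences must have the same length")
    return list(zip(*upper))


def consensus(seqs):
    """Most frequent base at each position (ties resolved by first seen)."""
    return "".join(Counter(col).most_common(1)[0][0] for col in _columns(seqs))


def invariant_positions(seqs):
    """1-based positions at which every sequence carries the same base."""
    return [i for i, col in enumerate(_columns(seqs), start=1) if len(set(col)) == 1]


print("left consensus :", consensus(LEFT_FLANKS))
print("left invariant :", invariant_positions(LEFT_FLANKS))
print("right consensus:", consensus(RIGHT_FLANKS))
print("right invariant:", invariant_positions(RIGHT_FLANKS))
```

For the five rows listed, the last five positions of the left flank are identical in every row, while the right flank is more variable. No gapped alignment is attempted, so a flank that is offset by one base relative to the others will lower the apparent conservation.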
DNA sequence
ATCGGGCATGCTGTTATCGGCGCCCTTCGAATGGCACACGATTCTGACCTGGTCATGGTAGAGTTGCCGTGTCGGGACAGGGGCGTCGGAGTGCGAAGAA
GCGGGCAGGCCGAATTGACTATGAGCTGCGAGCTCGCGTTGTTATCTGGCCGCCCCTGCGACTATCCCGTCTGCGCTGCGAGCGCGTATCCGTAGATGTC
GCACAGCCCGCGCACTCCCCATCGGCAACAGGGAGGATTTTGCATGCGATCCATTGGAATGGATGTCCACCGAAGCTTCGCGCAGGTCGCGATCCTTGAG
GGAGGAAAGACGACAGAGATCAGGATCGATCTTGATCATGAAGCGGTGGTGGCCTTCGGGCAGGCCCTACGTTCAGATGACGAGGTCGTTCTGGAAGCGA
CCGGCAATACCGCGGCCATCGTCAGACTTCTGACACCTTTCGTGGCGCGGGTCGTCATCGCGAACCCGCTGCAGGTGAAGGCCATCGCCCATGCACGGGT
GAAGACCGACAAGGTGGACGCCAAAATTCTGGCGCAACTTCATGCCGCAGGGTTCCTGCCAGAAGTCTGGGCCGCAGATGATCAGACGCTGAACCTGCGG
CGCCTCGTTTCCGAACGCGCCGCAGCGGTGAGATCAATCCGCCGGGTCAAGAGCCGGGTTCAGGCTGTTCTGCATGCGAACCTTGTCGCCAAATATACCG
GGCACCTGTTTGGCAAGGGCGGCAGGAGATGGCTGGCCTCGGCACCGCTGCCGAAGGCAGAACGTGATCTGCTGATCCGGCATCTGGATGAGTTGGACTG
GCTGGGGGGTAAGCTGGCAAAACTGGACCAAACATTGGCGGGAATTGCACTCGACGACAGCCGAATGCGCAAGTTGATGAGCATCGCCGGGATCAATGTC
GCGGTCGCCACTGCGGTTATCGCCGCGATCGGCGACATCTCCCGCTTTTCCGCACCGGATCGTCTCGCAAGCTATTTCGGGCTGACACCCCGGATCCGAC
AGTCCGGCGATCGCGGTGCCATCCATGGTCGGATCTCAAAGCAGGGGAATACGATCGGGCGCACCATGCTGATAGAGGCCGCCTGGTCGGCGGCTTCCGT
GCCAGGCCCGTTGCGGGCCTTCTTTCTTCGGATCAAGGATCGCAAGGGGCACAACGTCGCCGCCGTCGCGACCGCGCGCAAGATTGCAGCGCTGATCTGG
CAACTGCTGACAAAGGAGGCCCCCTATCGATGGGCGCGCCCGGCCTTTGTCGCGATGAAGATGCGGAAGCTTGAGTTGCGCGCCGGGGCCCCGCGGGCTC
ATGGGCCAGCAGGTCCGGGCCACGACTATTGGATCAAGGAAATCCGTCACCGGGAGATGGAACTCGTCGCACAAGCAGAAGCGGCCTATGCCCGCATGGT
TGAAGCGTGGAGGGACAAACCATCAAAACCGAACAAGACCTGAGAAAGGGGCTGTTCAACAACATCCG
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1200 bp | 399 aa | 244 | 1443 | + | No |
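The ORF coordinates can be cross-checked against the stated lengths and against the listed protein sequence. The sketch below assumes 1-based inclusive coordinates on the + strand, the standard genetic code, and that the 1200 bp figure includes the stop codon. The two short DNA snippets are copied from the "DNA sequence" field at positions 244-270 and 1435-1443; the names `ORF_START_SNIPPET`, `ORF_END_SNIPPET` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Coordinates from the ORF table (assumed 1-based, inclusive).
ORF_BEGIN, ORF_END = 244, 1443

# DNA copied from the record: first 27 bp and last 9 bp of the ORF.
ORF_START_SNIPPET = "ATGCGATCCATTGGAATGGATGTCCAC"
ORF_END_SNIPPET = "AAGACCTGA"


def translate(dna: str) -> str:
    """Translate an in-frame uppercase DNA string; '*' marks a stop codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


orf_len_bp = ORF_END - ORF_BEGIN + 1   # inclusive span in bp
protein_len_aa = orf_len_bp // 3 - 1   # codons minus the stop codon

print("ORF length (bp):", orf_len_bp)
print("Protein length (aa):", protein_len_aa)
print("N-terminus:", translate(ORF_START_SNIPPET))
print("C-terminus and stop:", translate(ORF_END_SNIPPET))
```

The translated N-terminus can be compared with the start of the ORF sequence listed for ORF 1, and the trailing `*` confirms that position 1443 closes the reading frame under the assumptions above.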
Chemistry : DEDD
ORF sequence :
MRSIGMDVHRSFAQVAILEGGKTTEIRIDLDHEAVVAFGQALRSDDEVVLEATGNTAAIVRLLTPFVARVVIANPLQVKAIAHARVKTDKVDAKILAQLH
AAGFLPEVWAADDQTLNLRRLVSERAAAVRSIRRVKSRVQAVLHANLVAKYTGHLFGKGGRRWLASAPLPKAERDLLIRHLDELDWLGGKLAKLDQTLAG
IALDDSRMRKLMSIAGINVAVATAVIAAIGDISRFSAPDRLASYFGLTPRIRQSGDRGAIHGRISKQGNTIGRTMLIEAAWSAASVPGPLRAFFLRIKDR
KGHNVAAVATARKIAALIWQLLTKEAPYRWARPAFVAMKMRKLELRAGAPRAHGPAGPGHDYWIKEIRHREMELVAQAEAAYARMVEAWRDKPSKPNKT
Blast result :
Comments
ISPye32 shows 92% amino acid (aa) similarity to ISAtu7.
ISPye32 was identified by in silico sequence analysis of Paracoccus yeei strain FDAARGOS_252 (4 copies, 1 isoform). ISPye32 seems to have a preferred insertion site sequence.
References
[1] Chmielowska C., Szuplewska M., Bartosik D. (2018) Direct submission.
[2] Goldberg B., Campos J., Tallon L., Sadzewicz L., Sengamalay N., Ott S., Godinez A., Nagaraj S., Vavikolanu K., Aluvathingal J., Nadendla S., Sichtig H. (2017) Direct GenBank submission.