ISPye42
- Family IS3
- Group IS51
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Paracoccus yeei | Paracoccus yeei Paracoccus yeei CCUG 32053 |
DNA section
IS Length : 1242 bp
Ends
IR Length : 23/25
IRL : TGAGCTGCTGCCGGTTTGATGGACACGAAGATAAGATCGTGACCAAGGAA
IRR : TGAGCTGCTGCCGGTTTCGTGGACAGCGGGTTTAAGCTAGCATGGCGCGG
Comments : Ends based on alignment with other related IS.
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
tccggccctt | gggagacgcg | 0 | |
ccagtcccac | gtctgtatcc | 0 | |
gtgagagaac | aaccgaacct | 0 | |
gctagcttcg | caggcacttg | 0 | |
cacggctgcc | ggaaccgttc | 0 | |
gagaccaaac | ggcggatggc | 0 | |
tggtcatttg | tatgcccatg | 0 | |
cgtctctcgg | ggaaccgttc | 0 | |
ctgcgggtgc | ggccatgccg | 0 |
DNA sequence
TGAGCTGCTGCCGGTTTGATGGACACGAAGATAAGATCGTGACCAAGGAACTGGAGTGCAGATATGACGAGACGGAAGTTCAGCCGCGAGTTCACAGTCG
AGGCCGTGAGGCTGGTGACGGACCGGGGCGTTGCAGTAGCGCAGGCGGCCCGAGACCTCGACGTTGCGGAGAGCGTGCTGCGCCGCTGGATGCGGGAATT
GACTGCAACACCGGCCACGGCGTTTCCCGGGAACGGGCAGATGCGGGCCGATCTGGCCGAGATTGCAGCTTTAAAGAAAGAGGTCGCGCGGCTGCGTGCG
GAGCGTGACATCCTGAGGAAAGCGGCCGCTTTTTTCGCGCGCGAGGCGATATGAGGTTCGCCTTCATCGCCAAACACCGCCACATCTGGCCGATCACCTG
GCTCTGCGAGGTCCTGAATGTATCGCGGTCCGGCTACCATGCTTGGCTGACCCGCCCGATCAGCACCCGCGAGAGCTATGACGCCAAGCTCGTCGCGGCG
ATCGAGACAAGCTTCAAGGCCAGTGACCGGACCTATGGCGCCCGTCGTGTCTGGCGGGATGTTCTTGAGGACGGGCTTGCCTGTGGCCTTCACCGGATCG
AACGGTTGATGCGGATCAATGCCTTGCGGGCACGACCCAGGCGCCGCGGCAAGCCGAAGGATGACGGCGAGCGGTCGGTGATCGCCGACAACCTCCTGGA
CCGGGATTTCGAGACGGACCGGCCGAACCACAAGTGGCTGGCCGATTTCACCGACATCTGGACCGCGGAAGGCTGGCTCTACGTCGCAGTCGTGCTGGAC
CTCTTTTCCCGGCGGGCCGTGGGCTGGTCCATGAAGGCTGACAGGGATGCTTCACTGGTCATGGATGCACTGATGATGGCGGTCTGGCAGCGCGGAAAGA
TCGACGCTCTGCTGCATCACTCGGACCAGGGATCGCAATACACAAGCGAGCAGTTCCAGCGGCTTCTGGCCGACAATGGGATCACCTGCTCGATGAGCCG
CGCCGGTAACGTCTGGGATAACTCGGCAATGGAGAGCTTCTTCTCAACGCTGAAGACCGAACGGACAGCCAGCAAGGTCTATCGAACCCGAAACGAAGCG
CGTGCCGATGTCTTCGATTACATCGAACGCTTCTACAACCCGCGTCGGCGGCACTCGAAACTGGGCTACATCAGCCCAATGGAGTTCGAAGCCCGCGCCA
TGCTAGCTTAAACCCGCTGTCCACGAAACCGGCAGCAGCTCA
AGGCCGTGAGGCTGGTGACGGACCGGGGCGTTGCAGTAGCGCAGGCGGCCCGAGACCTCGACGTTGCGGAGAGCGTGCTGCGCCGCTGGATGCGGGAATT
GACTGCAACACCGGCCACGGCGTTTCCCGGGAACGGGCAGATGCGGGCCGATCTGGCCGAGATTGCAGCTTTAAAGAAAGAGGTCGCGCGGCTGCGTGCG
GAGCGTGACATCCTGAGGAAAGCGGCCGCTTTTTTCGCGCGCGAGGCGATATGAGGTTCGCCTTCATCGCCAAACACCGCCACATCTGGCCGATCACCTG
GCTCTGCGAGGTCCTGAATGTATCGCGGTCCGGCTACCATGCTTGGCTGACCCGCCCGATCAGCACCCGCGAGAGCTATGACGCCAAGCTCGTCGCGGCG
ATCGAGACAAGCTTCAAGGCCAGTGACCGGACCTATGGCGCCCGTCGTGTCTGGCGGGATGTTCTTGAGGACGGGCTTGCCTGTGGCCTTCACCGGATCG
AACGGTTGATGCGGATCAATGCCTTGCGGGCACGACCCAGGCGCCGCGGCAAGCCGAAGGATGACGGCGAGCGGTCGGTGATCGCCGACAACCTCCTGGA
CCGGGATTTCGAGACGGACCGGCCGAACCACAAGTGGCTGGCCGATTTCACCGACATCTGGACCGCGGAAGGCTGGCTCTACGTCGCAGTCGTGCTGGAC
CTCTTTTCCCGGCGGGCCGTGGGCTGGTCCATGAAGGCTGACAGGGATGCTTCACTGGTCATGGATGCACTGATGATGGCGGTCTGGCAGCGCGGAAAGA
TCGACGCTCTGCTGCATCACTCGGACCAGGGATCGCAATACACAAGCGAGCAGTTCCAGCGGCTTCTGGCCGACAATGGGATCACCTGCTCGATGAGCCG
CGCCGGTAACGTCTGGGATAACTCGGCAATGGAGAGCTTCTTCTCAACGCTGAAGACCGAACGGACAGCCAGCAAGGTCTATCGAACCCGAAACGAAGCG
CGTGCCGATGTCTTCGATTACATCGAACGCTTCTACAACCCGCGTCGGCGGCACTCGAAACTGGGCTACATCAGCCCAATGGAGTTCGAAGCCCGCGCCA
TGCTAGCTTAAACCCGCTGTCCACGAAACCGGCAGCAGCTCA
Recoding section
- Recoding by frameshift
- Frame -1
- Type translational
- Experimentally demonstrated No
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
291 bp | 96 aa | 64 | 354 | + | No |
Description : First part of the transposase
ORF sequence :
MTRRKFSREFTVEAVRLVTDRGVAVAQAARDLDVAESVLRRWMRELTATPATAFPGNGQMRADLAEIAALKKEVARLRAERDILRKAAAFFAREAI
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
903 bp | 300 aa | 309 | 1211 | + | No |
Description : Second part of the transposase
ORF sequence :
HPEESGRFFRARGDMRFAFIAKHRHIWPITWLCEVLNVSRSGYHAWLTRPISTRESYDAKLVAAIETSFKASDRTYGARRVWRDVLEDGLACGLHRIERL
MRINALRARPRRRGKPKDDGERSVIADNLLDRDFETDRPNHKWLADFTDIWTAEGWLYVAVVLDLFSRRAVGWSMKADRDASLVMDALMMAVWQRGKIDA
LLHHSDQGSQYTSEQFQRLLADNGITCSMSRAGNVWDNSAMESFFSTLKTERTASKVYRTRNEARADVFDYIERFYNPRRRHSKLGYISPMEFEARAMLA
MRINALRARPRRRGKPKDDGERSVIADNLLDRDFETDRPNHKWLADFTDIWTAEGWLYVAVVLDLFSRRAVGWSMKADRDASLVMDALMMAVWQRGKIDA
LLHHSDQGSQYTSEQFQRLLADNGITCSMSRAGNVWDNSAMESFFSTLKTERTASKVYRTRNEARADVFDYIERFYNPRRRHSKLGYISPMEFEARAMLA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1148 bp | 382 aa | 64 | 1211 | + | Yes |
Chemistry : DDE
ORF sequence :
MTRRKFSREFTVEAVRLVTDRGVAVAQAARDLDVAESVLRRWMRELTATPATAFPGNGQMRADLAEIAALKKEVARLRAERDILRKAAAFFRARGDMRFA
FIAKHRHIWPITWLCEVLNVSRSGYHAWLTRPISTRESYDAKLVAAIETSFKASDRTYGARRVWRDVLEDGLACGLHRIERLMRINALRARPRRRGKPKD
DGERSVIADNLLDRDFETDRPNHKWLADFTDIWTAEGWLYVAVVLDLFSRRAVGWSMKADRDASLVMDALMMAVWQRGKIDALLHHSDQGSQYTSEQFQR
LLADNGITCSMSRAGNVWDNSAMESFFSTLKTERTASKVYRTRNEARADVFDYIERFYNPRRRHSKLGYISPMEFEARAMLA
FIAKHRHIWPITWLCEVLNVSRSGYHAWLTRPISTRESYDAKLVAAIETSFKASDRTYGARRVWRDVLEDGLACGLHRIERLMRINALRARPRRRGKPKD
DGERSVIADNLLDRDFETDRPNHKWLADFTDIWTAEGWLYVAVVLDLFSRRAVGWSMKADRDASLVMDALMMAVWQRGKIDALLHHSDQGSQYTSEQFQR
LLADNGITCSMSRAGNVWDNSAMESFFSTLKTERTASKVYRTRNEARADVFDYIERFYNPRRRHSKLGYISPMEFEARAMLA
Blast result :
Comments
ISPye42 is 86% aa similar to ISMlo4.
ISPye42 was identified by in silico sequence analysis of Paracoccus yeei CCUG 32053 (9 copies of ISPye42 and isoforms). The third ORF is the putative ORFAB transposase reconstructed in silico by possible -1 frameshift.
ISPye42 was identified by in silico sequence analysis of Paracoccus yeei CCUG 32053 (9 copies of ISPye42 and isoforms). The third ORF is the putative ORFAB transposase reconstructed in silico by possible -1 frameshift.
References
1] Chmielowska C., Szuplewska M., Bartosik D. (2018) Direct submission.