ISPye14
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP020442 | ND | Paracoccus yeei | Paracoccus yeei Paracoccus yeei FDAARGOS_252 |
DNA section
IS Length : 2526 bp
Ends
IR Length : 17/22
IRL : GTAAGCGCGACGCCGATCCCCTCCTGCCATTTTTGAGTTGATGCCTTCAT
IRR : GTAAGCGCGACGTTGCGGCCCTATGCTGCGAGGCTTGACGGCTGGTTGAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCAATCGATG | GCCCGAAATA | 0 |
DNA sequence
GTAAGCGCGACGCCGATCCCCTCCTGCCATTTTTGAGTTGATGCCTTCATTCTCCGCTGGACATGTGCAGCGGAGGTTTGGGATTTGGAGAGAGACGGCG
AAATGGGCGTCCATTTGGGCGGCTCGAGGGGTGTTGGAGTTTCTCGGCTTGAGGTGATCGAGGGGCCGAGTGGTCAGCGGCGGAGCACGAAGGCTGAGCG
GGCGCGGATCGCGGTGGAGAGCATGATGCCGGGGGTGAAGGTCGCCGATACTGCGCGCAAGCACGGCACGACCCGCTGGCAGATTTACGATTGGCGCAAG
CAGATACGCAAAGGCAACCTGGTGGTGCCCGAGAGCGTGGCAGCCTTGCCGATCTTCGCGGAGCTGGTGGTTGATGACAGTTCGGCCGAGGCGTCGGCGG
CTTTCACACGGTCCGATCTCGAGATCGTTGTCGGCGACATCGTGATCCGTGCTTGCGCTGGTGTTGATGAGGGCCAGTTGACGCGGGCGATCCGGGCGGC
ACGGTCTGCAGTATCGTGATGTTCGGTCAGGGCGGCCCGGTGAAGTTCTTTGTGGCGACCCGGCCTGTCGACTTTCGCAAGGGGATCGACGGCCTGGCAC
TGGCGGTGCAGGAGATGTTCGGGATGGACCCGTTCTGCGGAGCCGCCTTTGTGTTCCGAGCGAAGCGGGCGGATCGGATCAAGCTCTTGATCTGGGATCA
GACCGGGATGGTGCTGGTTCACAAGCGGCTCGAGGGCGGCAAATTCGTCTGGCCACAGGTGCGCGACGGGGTCATGCGCATGTCCGGAGCGCAGTTTGCG
GCGCTGTTTGAGGGCGTGGATTGGCGGCTGGTCCGACCCGAACGGGCGCGGCGCCCGCTGGCGGCGGGGTGACTGGCAGGCTGCGTCGAAACGCCTGTTT
TTGCTGGCCTGCAAGGGCATCCTGTGATTCACTTCCCGCCATGGAAACAGCTGATCTTGCGCGCGAAAACGCCCTGCTGAAGGCCCGCCTCGCCGAGGTG
GAGGCGACGCTGGCCGAGACGCAAGAGGCCAATCGGCGGCTGCAGGATATCCTGCACGCGGCGCAGCGCGAGAAATTCGGCAAGCGCTCCGAAAAGCTCT
CGCCCGACCAGTTCAACCTGCCCCTGGAAGATGCCGAATTTGCTCAAGGTGTGCTCGAGGCAGCACAAGAAAAGGCCGAAGCGGCGATGCAGCGGGCGCG
CGGGGAAACACCCCGCAAGCCCAAGCGCAACCGCGGGCATCTGCCGTCCCATCTGCCTCGGGTTGAGCGCGTGATCGAGCCTGCCAGCACGCTCTGTCCC
TGCGGTTGCGGCGAGATGTCCAGGATCGGGGAAGACGTGTCCGAACGTCTCGACGTGATCCCCGCCCAGTTCCGGGTGCTGGTCACGCGGCGCCCGAAAT
ACGCCTGCCGCCGTTGCTCGCAAGCTGTCGCGCAGGCCCATGCCCCCGAGCATGTCGTGCCCGGCGGGCTGCCCACGGAACTCTTCATCGCCTGGATTAT
CGTCTCGAAGTTCGGTGACCACCTGCCGTTTTATCGGCAGGCCGAGATCTTCAAGCGGCAAGGGATCGATCTGGACCGCGGCACACTCGGCAACTGGGTC
GGGCGCGCCTGTTTCCACCTCATGCCCGTCATCAACCACATGCGCGCCCATCTGCGCAGTGCCGACCGCATCTTCGTGGATGAAACCCGCGCGCCGGTGC
TGGAACCGGGCCTGAAACGCACCAAGAGCGGATTCTTCTGGGCCGTCGTATCCGATGATCGCGGCCATGGCGGGGCTGGCCCACCCATCGTGCTCTTCCA
CTATGCCCCTGACCGGGGTAAGGCGCATCCTTTGAAGTTCCTTGGCGGATACCGGGGCCGCTTCCTGCAATGCGACGCCTACCAGTCCTACAACGCGATG
ACCGAGATCGCGCGCGACAATGGTCCGTGGCAACTGGTCTATTGCTGGACCCATGTCCGCCGCCGCTTCGTAAAGCGCTTCGAGAGCGATGGCTCACCCA
TTGCCGAGGAAATGCTGCGCCACATCGCGCTGCTTTATCAGATCGAGAAGTCCGTGCGGAGCAAGGAGGCCGCTATGCGCCTTGCTGCCCGGCGTGAGCA
TTCGGCCCCGATCATCGCGGCGCTGAAACCCTGGCTGGAAGCCCAACTCTCGCGCATCCCGCAGAAATCCCAGCTTGCCGAAGACATCCGCTACTCCCTC
GCGCACTGGCCCGGACTGATCCGCTTCCTCGAGCACGGCATGCTGGAGCTCGACACCAACCCGGTCGAGAACCAGATCAGGCCGATTGCCCTGACAAGAA
AAAATGCACTCTTCGCCGGGAATGAAGTCGGCGCCGAAAACTGGGCTATGCTCGCCTCGCTGGTCGCTACCTGCAAGATGTCCGACGTGAACCCGGTCGA
CTATCTCGCCGCCACACTGCGCGCCATCCTCGACGATCACCCGCAAAGCGGTATCGAAGACCTCATGCCATGGCGCTTCAACCAGCCGTCAAGCCTCGCA
GCATAGGGCCGCAACGTCGCGCTTAC
AAATGGGCGTCCATTTGGGCGGCTCGAGGGGTGTTGGAGTTTCTCGGCTTGAGGTGATCGAGGGGCCGAGTGGTCAGCGGCGGAGCACGAAGGCTGAGCG
GGCGCGGATCGCGGTGGAGAGCATGATGCCGGGGGTGAAGGTCGCCGATACTGCGCGCAAGCACGGCACGACCCGCTGGCAGATTTACGATTGGCGCAAG
CAGATACGCAAAGGCAACCTGGTGGTGCCCGAGAGCGTGGCAGCCTTGCCGATCTTCGCGGAGCTGGTGGTTGATGACAGTTCGGCCGAGGCGTCGGCGG
CTTTCACACGGTCCGATCTCGAGATCGTTGTCGGCGACATCGTGATCCGTGCTTGCGCTGGTGTTGATGAGGGCCAGTTGACGCGGGCGATCCGGGCGGC
ACGGTCTGCAGTATCGTGATGTTCGGTCAGGGCGGCCCGGTGAAGTTCTTTGTGGCGACCCGGCCTGTCGACTTTCGCAAGGGGATCGACGGCCTGGCAC
TGGCGGTGCAGGAGATGTTCGGGATGGACCCGTTCTGCGGAGCCGCCTTTGTGTTCCGAGCGAAGCGGGCGGATCGGATCAAGCTCTTGATCTGGGATCA
GACCGGGATGGTGCTGGTTCACAAGCGGCTCGAGGGCGGCAAATTCGTCTGGCCACAGGTGCGCGACGGGGTCATGCGCATGTCCGGAGCGCAGTTTGCG
GCGCTGTTTGAGGGCGTGGATTGGCGGCTGGTCCGACCCGAACGGGCGCGGCGCCCGCTGGCGGCGGGGTGACTGGCAGGCTGCGTCGAAACGCCTGTTT
TTGCTGGCCTGCAAGGGCATCCTGTGATTCACTTCCCGCCATGGAAACAGCTGATCTTGCGCGCGAAAACGCCCTGCTGAAGGCCCGCCTCGCCGAGGTG
GAGGCGACGCTGGCCGAGACGCAAGAGGCCAATCGGCGGCTGCAGGATATCCTGCACGCGGCGCAGCGCGAGAAATTCGGCAAGCGCTCCGAAAAGCTCT
CGCCCGACCAGTTCAACCTGCCCCTGGAAGATGCCGAATTTGCTCAAGGTGTGCTCGAGGCAGCACAAGAAAAGGCCGAAGCGGCGATGCAGCGGGCGCG
CGGGGAAACACCCCGCAAGCCCAAGCGCAACCGCGGGCATCTGCCGTCCCATCTGCCTCGGGTTGAGCGCGTGATCGAGCCTGCCAGCACGCTCTGTCCC
TGCGGTTGCGGCGAGATGTCCAGGATCGGGGAAGACGTGTCCGAACGTCTCGACGTGATCCCCGCCCAGTTCCGGGTGCTGGTCACGCGGCGCCCGAAAT
ACGCCTGCCGCCGTTGCTCGCAAGCTGTCGCGCAGGCCCATGCCCCCGAGCATGTCGTGCCCGGCGGGCTGCCCACGGAACTCTTCATCGCCTGGATTAT
CGTCTCGAAGTTCGGTGACCACCTGCCGTTTTATCGGCAGGCCGAGATCTTCAAGCGGCAAGGGATCGATCTGGACCGCGGCACACTCGGCAACTGGGTC
GGGCGCGCCTGTTTCCACCTCATGCCCGTCATCAACCACATGCGCGCCCATCTGCGCAGTGCCGACCGCATCTTCGTGGATGAAACCCGCGCGCCGGTGC
TGGAACCGGGCCTGAAACGCACCAAGAGCGGATTCTTCTGGGCCGTCGTATCCGATGATCGCGGCCATGGCGGGGCTGGCCCACCCATCGTGCTCTTCCA
CTATGCCCCTGACCGGGGTAAGGCGCATCCTTTGAAGTTCCTTGGCGGATACCGGGGCCGCTTCCTGCAATGCGACGCCTACCAGTCCTACAACGCGATG
ACCGAGATCGCGCGCGACAATGGTCCGTGGCAACTGGTCTATTGCTGGACCCATGTCCGCCGCCGCTTCGTAAAGCGCTTCGAGAGCGATGGCTCACCCA
TTGCCGAGGAAATGCTGCGCCACATCGCGCTGCTTTATCAGATCGAGAAGTCCGTGCGGAGCAAGGAGGCCGCTATGCGCCTTGCTGCCCGGCGTGAGCA
TTCGGCCCCGATCATCGCGGCGCTGAAACCCTGGCTGGAAGCCCAACTCTCGCGCATCCCGCAGAAATCCCAGCTTGCCGAAGACATCCGCTACTCCCTC
GCGCACTGGCCCGGACTGATCCGCTTCCTCGAGCACGGCATGCTGGAGCTCGACACCAACCCGGTCGAGAACCAGATCAGGCCGATTGCCCTGACAAGAA
AAAATGCACTCTTCGCCGGGAATGAAGTCGGCGCCGAAAACTGGGCTATGCTCGCCTCGCTGGTCGCTACCTGCAAGATGTCCGACGTGAACCCGGTCGA
CTATCTCGCCGCCACACTGCGCGCCATCCTCGACGATCACCCGCAAAGCGGTATCGAAGACCTCATGCCATGGCGCTTCAACCAGCCGTCAAGCCTCGCA
GCATAGGGCCGCAACGTCGCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
435 bp | 144 aa | 85 | 519 | + | No |
AG : IS66 TnpA
ORF sequence :
LERDGEMGVHLGGSRGVGVSRLEVIEGPSGQRRSTKAERARIAVESMMPGVKVADTARKHGTTRWQIYDWRKQIRKGNLVVPESVAALPIFAELVVDDSS
AEASAAFTRSDLEIVVGDIVIRACAGVDEGQLTRAIRAARSAVS
AEASAAFTRSDLEIVVGDIVIRACAGVDEGQLTRAIRAARSAVS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
354 bp | 117 aa | 519 | 872 | + | No |
AG : IS66 TnpB
ORF sequence :
MFGQGGPVKFFVATRPVDFRKGIDGLALAVQEMFGMDPFCGAAFVFRAKRADRIKLLIWDQTGMVLVHKRLEGGKFVWPQVRDGVMRMSGAQFAALFEGV
DWRLVRPERARRPLAAG
DWRLVRPERARRPLAAG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1566 bp | 521 aa | 941 | 2506 | + | No |
Chemistry : DDE
ORF sequence :
METADLARENALLKARLAEVEATLAETQEANRRLQDILHAAQREKFGKRSEKLSPDQFNLPLEDAEFAQGVLEAAQEKAEAAMQRARGETPRKPKRNRGH
LPSHLPRVERVIEPASTLCPCGCGEMSRIGEDVSERLDVIPAQFRVLVTRRPKYACRRCSQAVAQAHAPEHVVPGGLPTELFIAWIIVSKFGDHLPFYRQ
AEIFKRQGIDLDRGTLGNWVGRACFHLMPVINHMRAHLRSADRIFVDETRAPVLEPGLKRTKSGFFWAVVSDDRGHGGAGPPIVLFHYAPDRGKAHPLKF
LGGYRGRFLQCDAYQSYNAMTEIARDNGPWQLVYCWTHVRRRFVKRFESDGSPIAEEMLRHIALLYQIEKSVRSKEAAMRLAARREHSAPIIAALKPWLE
AQLSRIPQKSQLAEDIRYSLAHWPGLIRFLEHGMLELDTNPVENQIRPIALTRKNALFAGNEVGAENWAMLASLVATCKMSDVNPVDYLAATLRAILDDH
PQSGIEDLMPWRFNQPSSLAA
LPSHLPRVERVIEPASTLCPCGCGEMSRIGEDVSERLDVIPAQFRVLVTRRPKYACRRCSQAVAQAHAPEHVVPGGLPTELFIAWIIVSKFGDHLPFYRQ
AEIFKRQGIDLDRGTLGNWVGRACFHLMPVINHMRAHLRSADRIFVDETRAPVLEPGLKRTKSGFFWAVVSDDRGHGGAGPPIVLFHYAPDRGKAHPLKF
LGGYRGRFLQCDAYQSYNAMTEIARDNGPWQLVYCWTHVRRRFVKRFESDGSPIAEEMLRHIALLYQIEKSVRSKEAAMRLAARREHSAPIIAALKPWLE
AQLSRIPQKSQLAEDIRYSLAHWPGLIRFLEHGMLELDTNPVENQIRPIALTRKNALFAGNEVGAENWAMLASLVATCKMSDVNPVDYLAATLRAILDDH
PQSGIEDLMPWRFNQPSSLAA
Blast result :
Comments
ISPye14 is 91% aa similar to ISPye8.
ISPye14 was identified by in silico nucleotide sequence analysis of Paracoccus yeei strain FDAARGOS_252. Possibly part of a larger mosaic transposable elements(including ISPye14 and a partial IS from the IS66 family) which may have been inserted in a single event, as suggested by the surrounding 8 bp DRs - CTTCGAAGTC(AATCGATG)GGCCATATGG.
ISPye14 was identified by in silico nucleotide sequence analysis of Paracoccus yeei strain FDAARGOS_252. Possibly part of a larger mosaic transposable elements(including ISPye14 and a partial IS from the IS66 family) which may have been inserted in a single event, as suggested by the surrounding 8 bp DRs - CTTCGAAGTC(AATCGATG)GGCCATATGG.
References
1] Chmielowska C., Szuplewska M., Bartosik D. (2018) Direct submission.
2] Goldberg,B., Campos,J., Tallon,L., Sadzewicz,L., Sengamalay,N., Ott,S., Godinez,A., Nagaraj,S., Vavikolanu,K., Aluvathingal,J., Nadendla,S. and Sichtig,H. (2017) Direct GenBank submission.
2] Goldberg,B., Campos,J., Tallon,L., Sadzewicz,L., Sengamalay,N., Ott,S., Godinez,A., Nagaraj,S., Vavikolanu,K., Aluvathingal,J., Nadendla,S. and Sichtig,H. (2017) Direct GenBank submission.