ISPye37
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Paracoccus yeei | Paracoccus yeei Paracoccus yeei CCUG 32053 |
DNA section
IS Length : 2524 bp
Ends
IR Length : 14/22
IRL : GTAAGCGCGACGCCGACCCCCTCCTGCCATTGCTAAGCTGAACCGGCCAT
IRR : GTAAGCGCTACGGCTCTGCCCTATGCGGCGAGGCTTGACGTCTGGCTGAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
cagggcagcc | TGACGGCC | ggcaacagtt | 8 |
taggaccata | GTTGTTTT | gattttagag | 8 |
cgcctcgttc | GATGAGAA | tgatgtgtca | 8 |
caaagccaaa | GATTTGCA | ggcaaagatc | 8 |
DNA sequence
GTAAGCGCGACGCCGACCCCCTCCTGCCATTGCTAAGCTGAACCGGCCATTGTCGGCGTGTGTGTTGGTTTTAGGGGCTTGGGATTTGGGAAGAGGCAAA
GAAGTGGGCGTCCATTTGGACGGCTCCTGGAATGGTTCGGTTTCCCGGCTGGAGATGATCGAGGGACCGAGCGGTCGGCGGCAGCGCACAGAAGCGGAGC
GGGCGCGGATCGCGGCAGAAAGCCTGATTCCAGGGATGCGGGTGGCCGACATCGCCCGCAAACACGGCACGACACGTTGGCAGATTTACAATTGGCGCAA
CAAGTTGCGGAGTGGGCATCTGATGGTGCCGGAGAGCCTCGCATCCCTGCCGATGTTCGCGGAGCTGATGGTTGAGAGCGATACTGCGACTGTGCCGCCA
GCCCGCGCTCTGTCGGAGGTGGAGATCGTGGTCGGCGATGTGGTGATCCGCGCCAGTGCTGGTGCAGATGAGGGCCAGCTGACGCGGGCGATCCGCGCGG
CTCGAGCGGCAATGTCATGATGCTGGGTCAAGGCGGGCCAGTGAAGGTCTTCGTGGCAACCCGACCGGTCGACTTCCGCAAGGGTATCGACGGCCTGGCC
CTGGCAGTCCAGGAGATGTTCGGGCTGGATCCGTTCTGTGGGGCGGTCTTTGTCTTCCGTTCCAAACGCGCTGACCGCATCAAGCTGCTGGCTTGGGATC
AGACTGGGATGGTGCTGGTTCACAAGCGACTGGAAGGCGGCAAGTTTGTCTGGCCGCAGGTGCGGGACGGGGTGATGCGGATGTCGTCCGCACAGATGGC
CGCGTTGTTCGAGGGCTTGGACTGGCGCCTTGTCCGACCTGAGCGAGTGCGGCGCCCTTTAACAGCAGGCTGATCGGTCGATCACTTGGGAATGCCTGTT
TTTGCTGGCTGAAAGGGATAATCCATGATTCACTTCGAGCCATGGATGCTGATGCTCTTGCCTCGGAAAACACCCTCCTGAAGGCTCGCCTGGCCGAGCT
TGAGGCGGCGCTATCCGAAGTGCAGGAGGCCAACCGTCGGCTGGAGGACATTTTGCGCACGGCGCAACGCGCGCAGTTTGGCAAAAGCTCCGAGAAGCTG
TCACCCGACCAGTTCAACCTGCCCTTGGAGGATGCCGAATTTGCTCAAGGGGTGCTCGAGGCGGCACAGGAAAAGGCCGAGGCGGCGCTACAGGGGTTGG
GGGCAAAGACACCCCGCAAACCTGCGCGCAACCGCGGGCATCTGCCGCTGCATCTGCCCCGGATCGAGCGCGTGATCGAACCCGCCAGCACCCTGTGTCC
CTGCGGTTGCGAGATGGCGCGGATCGGCGAAGACGTTTCCAAACGTCTCGATGTGATCCCCGCCCAGTTCCGGGTTCTGGTAACGCGTCGCCCCAAATAT
GCATGCCGCCGCTGCTCTCAGACTGTGGCGCAAGCTCATGCTCCCGAGCATGTGGTGCCTGGCGGAATGCCCACGGAGCGGTTTATCGCCTGGATTATTG
TCTCGAAGTTCGGCGACCACCTGCCGTTCTATCGCCAAGCCGAGATCTTCAAGCGTCAGGGGATCGATCTGGATCGCGGCACGCTTGGCAACTGGGTCGG
GCGCGCCTGCTTCCATCTGACGCCGGTCATCGACCACATGCGCGCTCATCTGCGCGGTGTGGACCGGATCTTCGTCGACGAGACCCGCGCGCCAGTGTTG
GATCCCGGCCGAAAGGCCACAAAGAGCGGATACTTCTGGGCCGTCGTGTCCGATGATCGCGGACATGGCGGTGCCGGACCGCCGATCGTGCTGTTCCACT
ATGCCCCCGGCCGGGGGAAAGAGCATCCGCTGAAGTTCCTCGGCGGATACCGGGGCCGGTTCCTGCAATGCGACGCCTATCAGTCCTACAATGCGATGAC
CGGGATGGAGCGTGACAACGGCCCCTGGCAACTGGTCTATTGCTGGACCCATGTCCGCCGCCGCTTCGTGAAGCGCTTCGAGAATGAGGGCTCACCCATT
GCGGAGGAGATGCTGCGCCAGATTGCACTGCTTTATCAGGTCGAGAAGACGGTCCGAGGAAGGGATCCAGCCGTGCGCCTTGCTGCTCGGCGCGACTGTT
CGGCACCGGTCATCGCCGCGCTCAAGCCGTGGCTGGAGGCCAAGCTCTCCCGCATTCCGCAGAAATCCCAACTGGCCGAGGACATCCGCTACACCCTTGC
ACATTGGCCCGGCCTGATCCGCTTCCTTGAGGACGGCACGCTTGAGCTGGACACCAACCCGGTCGAGAACCAGATCCGCCCAATAGCTCTTACCAGAAAA
AATGCGCTTTTTGCCGGCAATGAAACCGGCGCTGAGAACTGGGCAATGCTGGCCTCGCTGGTCGCCACCTGCAAAATGTCCAGCGTGAACCCGGTCGACT
ATATCGCCAACACTCTTGGGGCCATTCTCGATGGACATCCGAAATCCCGCATCGAGGACCTCATGCCCTGGCGCTTCAGCCAGACGTCAAGCCTCGCCGC
ATAGGGCAGAGCCGTAGCGCTTAC
GAAGTGGGCGTCCATTTGGACGGCTCCTGGAATGGTTCGGTTTCCCGGCTGGAGATGATCGAGGGACCGAGCGGTCGGCGGCAGCGCACAGAAGCGGAGC
GGGCGCGGATCGCGGCAGAAAGCCTGATTCCAGGGATGCGGGTGGCCGACATCGCCCGCAAACACGGCACGACACGTTGGCAGATTTACAATTGGCGCAA
CAAGTTGCGGAGTGGGCATCTGATGGTGCCGGAGAGCCTCGCATCCCTGCCGATGTTCGCGGAGCTGATGGTTGAGAGCGATACTGCGACTGTGCCGCCA
GCCCGCGCTCTGTCGGAGGTGGAGATCGTGGTCGGCGATGTGGTGATCCGCGCCAGTGCTGGTGCAGATGAGGGCCAGCTGACGCGGGCGATCCGCGCGG
CTCGAGCGGCAATGTCATGATGCTGGGTCAAGGCGGGCCAGTGAAGGTCTTCGTGGCAACCCGACCGGTCGACTTCCGCAAGGGTATCGACGGCCTGGCC
CTGGCAGTCCAGGAGATGTTCGGGCTGGATCCGTTCTGTGGGGCGGTCTTTGTCTTCCGTTCCAAACGCGCTGACCGCATCAAGCTGCTGGCTTGGGATC
AGACTGGGATGGTGCTGGTTCACAAGCGACTGGAAGGCGGCAAGTTTGTCTGGCCGCAGGTGCGGGACGGGGTGATGCGGATGTCGTCCGCACAGATGGC
CGCGTTGTTCGAGGGCTTGGACTGGCGCCTTGTCCGACCTGAGCGAGTGCGGCGCCCTTTAACAGCAGGCTGATCGGTCGATCACTTGGGAATGCCTGTT
TTTGCTGGCTGAAAGGGATAATCCATGATTCACTTCGAGCCATGGATGCTGATGCTCTTGCCTCGGAAAACACCCTCCTGAAGGCTCGCCTGGCCGAGCT
TGAGGCGGCGCTATCCGAAGTGCAGGAGGCCAACCGTCGGCTGGAGGACATTTTGCGCACGGCGCAACGCGCGCAGTTTGGCAAAAGCTCCGAGAAGCTG
TCACCCGACCAGTTCAACCTGCCCTTGGAGGATGCCGAATTTGCTCAAGGGGTGCTCGAGGCGGCACAGGAAAAGGCCGAGGCGGCGCTACAGGGGTTGG
GGGCAAAGACACCCCGCAAACCTGCGCGCAACCGCGGGCATCTGCCGCTGCATCTGCCCCGGATCGAGCGCGTGATCGAACCCGCCAGCACCCTGTGTCC
CTGCGGTTGCGAGATGGCGCGGATCGGCGAAGACGTTTCCAAACGTCTCGATGTGATCCCCGCCCAGTTCCGGGTTCTGGTAACGCGTCGCCCCAAATAT
GCATGCCGCCGCTGCTCTCAGACTGTGGCGCAAGCTCATGCTCCCGAGCATGTGGTGCCTGGCGGAATGCCCACGGAGCGGTTTATCGCCTGGATTATTG
TCTCGAAGTTCGGCGACCACCTGCCGTTCTATCGCCAAGCCGAGATCTTCAAGCGTCAGGGGATCGATCTGGATCGCGGCACGCTTGGCAACTGGGTCGG
GCGCGCCTGCTTCCATCTGACGCCGGTCATCGACCACATGCGCGCTCATCTGCGCGGTGTGGACCGGATCTTCGTCGACGAGACCCGCGCGCCAGTGTTG
GATCCCGGCCGAAAGGCCACAAAGAGCGGATACTTCTGGGCCGTCGTGTCCGATGATCGCGGACATGGCGGTGCCGGACCGCCGATCGTGCTGTTCCACT
ATGCCCCCGGCCGGGGGAAAGAGCATCCGCTGAAGTTCCTCGGCGGATACCGGGGCCGGTTCCTGCAATGCGACGCCTATCAGTCCTACAATGCGATGAC
CGGGATGGAGCGTGACAACGGCCCCTGGCAACTGGTCTATTGCTGGACCCATGTCCGCCGCCGCTTCGTGAAGCGCTTCGAGAATGAGGGCTCACCCATT
GCGGAGGAGATGCTGCGCCAGATTGCACTGCTTTATCAGGTCGAGAAGACGGTCCGAGGAAGGGATCCAGCCGTGCGCCTTGCTGCTCGGCGCGACTGTT
CGGCACCGGTCATCGCCGCGCTCAAGCCGTGGCTGGAGGCCAAGCTCTCCCGCATTCCGCAGAAATCCCAACTGGCCGAGGACATCCGCTACACCCTTGC
ACATTGGCCCGGCCTGATCCGCTTCCTTGAGGACGGCACGCTTGAGCTGGACACCAACCCGGTCGAGAACCAGATCCGCCCAATAGCTCTTACCAGAAAA
AATGCGCTTTTTGCCGGCAATGAAACCGGCGCTGAGAACTGGGCAATGCTGGCCTCGCTGGTCGCCACCTGCAAAATGTCCAGCGTGAACCCGGTCGACT
ATATCGCCAACACTCTTGGGGCCATTCTCGATGGACATCCGAAATCCCGCATCGAGGACCTCATGCCCTGGCGCTTCAGCCAGACGTCAAGCCTCGCCGC
ATAGGGCAGAGCCGTAGCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
417 bp | 138 aa | 104 | 520 | + | No |
AG : IS66 TnpA
ORF sequence :
VGVHLDGSWNGSVSRLEMIEGPSGRRQRTEAERARIAAESLIPGMRVADIARKHGTTRWQIYNWRNKLRSGHLMVPESLASLPMFAELMVESDTATVPPA
RALSEVEIVVGDVVIRASAGADEGQLTRAIRAARAAMS
RALSEVEIVVGDVVIRASAGADEGQLTRAIRAARAAMS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
357 bp | 118 aa | 517 | 873 | + | No |
AG : IS66 TnpB
ORF sequence :
MMLGQGGPVKVFVATRPVDFRKGIDGLALAVQEMFGLDPFCGAVFVFRSKRADRIKLLAWDQTGMVLVHKRLEGGKFVWPQVRDGVMRMSSAQMAALFEG
LDWRLVRPERVRRPLTAG
LDWRLVRPERVRRPLTAG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1563 bp | 520 aa | 942 | 2504 | + | No |
Chemistry : DDE
ORF sequence :
MDADALASENTLLKARLAELEAALSEVQEANRRLEDILRTAQRAQFGKSSEKLSPDQFNLPLEDAEFAQGVLEAAQEKAEAALQGLGAKTPRKPARNRGH
LPLHLPRIERVIEPASTLCPCGCEMARIGEDVSKRLDVIPAQFRVLVTRRPKYACRRCSQTVAQAHAPEHVVPGGMPTERFIAWIIVSKFGDHLPFYRQA
EIFKRQGIDLDRGTLGNWVGRACFHLTPVIDHMRAHLRGVDRIFVDETRAPVLDPGRKATKSGYFWAVVSDDRGHGGAGPPIVLFHYAPGRGKEHPLKFL
GGYRGRFLQCDAYQSYNAMTGMERDNGPWQLVYCWTHVRRRFVKRFENEGSPIAEEMLRQIALLYQVEKTVRGRDPAVRLAARRDCSAPVIAALKPWLEA
KLSRIPQKSQLAEDIRYTLAHWPGLIRFLEDGTLELDTNPVENQIRPIALTRKNALFAGNETGAENWAMLASLVATCKMSSVNPVDYIANTLGAILDGHP
KSRIEDLMPWRFSQTSSLAA
LPLHLPRIERVIEPASTLCPCGCEMARIGEDVSKRLDVIPAQFRVLVTRRPKYACRRCSQTVAQAHAPEHVVPGGMPTERFIAWIIVSKFGDHLPFYRQA
EIFKRQGIDLDRGTLGNWVGRACFHLTPVIDHMRAHLRGVDRIFVDETRAPVLDPGRKATKSGYFWAVVSDDRGHGGAGPPIVLFHYAPGRGKEHPLKFL
GGYRGRFLQCDAYQSYNAMTGMERDNGPWQLVYCWTHVRRRFVKRFENEGSPIAEEMLRQIALLYQVEKTVRGRDPAVRLAARRDCSAPVIAALKPWLEA
KLSRIPQKSQLAEDIRYTLAHWPGLIRFLEDGTLELDTNPVENQIRPIALTRKNALFAGNETGAENWAMLASLVATCKMSSVNPVDYIANTLGAILDGHP
KSRIEDLMPWRFSQTSSLAA
Blast result :
Comments
ISPye37 is 92% aa similar to ISPye14.
ISPye37 was identified by in silico sequence analysis of Paracoccus yeei CCUG 32053 (4 copies of ISPye37 and isoforms of ISPye37 present in the genome).
ISPye37 was identified by in silico sequence analysis of Paracoccus yeei CCUG 32053 (4 copies of ISPye37 and isoforms of ISPye37 present in the genome).
References
1] Chmielowska C., Szuplewska M., Bartosik D. (2018) Direct submission.