ISPye37

  • Family IS66
  • Group
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) :
Accession number :
Transposition : ND
Origin : Paracoccus yeei
Host : Paracoccus yeei, Paracoccus yeei CCUG 32053
DNA section
IS Length : 2524 bp

Ends


IR Length : 14/22

IRL : GTAAGCGCGACGCCGACCCCCTCCTGCCATTGCTAAGCTGAACCGGCCAT
IRR : GTAAGCGCTACGGCTCTGCCCTATGCGGCGAGGCTTGACGTCTGGCTGAA
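IRR is listed on the strand opposite to the DNA sequence, so it should equal the reverse complement of the element's right end. The following is a minimal Python sketch of that check, using the last 24 nt of the DNA sequence and the first 24 nt of IRR as printed in this record. The helper name `reverse_complement` is illustrative and is not part of any database tooling.

```python
# Complement table for unambiguous DNA bases (upper case only).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


# Last 24 nt of the ISPye37 DNA sequence as printed in this record.
right_end = "ATAGGGCAGAGCCGTAGCGCTTAC"
# First 24 nt of IRR as printed in this record.
irr_start = "GTAAGCGCTACGGCTCTGCCCTAT"

matches = reverse_complement(right_end) == irr_start
print(matches)
```

The same function can be applied to the full 50-nt right end to compare it against the complete IRR line.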

Insertion site


Left flank    Direct repeat    Right flank    DR Length
cagggcagcc    TGACGGCC         ggcaacagtt     8
taggaccata    GTTGTTTT         gattttagag     8
cgcctcgttc    GATGAGAA         tgatgtgtca     8
caaagccaaa    GATTTGCA         ggcaaagatc     8
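Each insertion-site row gives a lowercase left flank, an uppercase direct repeat, a lowercase right flank, and the direct repeat length. The sketch below splits one row into those fields by letter case, assuming that case convention holds for every row. The function name `parse_insertion_site` and the dictionary keys are hypothetical, chosen only for illustration.

```python
import re

# A row is: lowercase left flank, uppercase direct repeat (DR),
# lowercase right flank, then the DR length as digits.
_ROW = re.compile(r"^([acgtn]+)([ACGTN]+)([acgtn]+)(\d+)$")


def parse_insertion_site(row: str) -> dict:
    """Split one insertion-site row into its four fields."""
    compact = re.sub(r"\s+", "", row)  # accept spaced or fused columns
    m = _ROW.match(compact)
    if m is None:
        raise ValueError(f"unrecognised insertion-site row: {row!r}")
    left, dr, right, length = m.groups()
    if len(dr) != int(length):
        raise ValueError("direct repeat does not match the stated DR length")
    return {
        "left_flank": left,
        "direct_repeat": dr,
        "right_flank": right,
        "dr_length": int(length),
    }


site = parse_insertion_site("cagggcagccTGACGGCCggcaacagtt8")
print(site["direct_repeat"], site["dr_length"])
```

Whitespace is removed before matching, so the parser accepts rows whether the columns are separated by spaces or run together.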

DNA sequence

GTAAGCGCGACGCCGACCCCCTCCTGCCATTGCTAAGCTGAACCGGCCATTGTCGGCGTGTGTGTTGGTTTTAGGGGCTTGGGATTTGGGAAGAGGCAAA
GAAGTGGGCGTCCATTTGGACGGCTCCTGGAATGGTTCGGTTTCCCGGCTGGAGATGATCGAGGGACCGAGCGGTCGGCGGCAGCGCACAGAAGCGGAGC
GGGCGCGGATCGCGGCAGAAAGCCTGATTCCAGGGATGCGGGTGGCCGACATCGCCCGCAAACACGGCACGACACGTTGGCAGATTTACAATTGGCGCAA
CAAGTTGCGGAGTGGGCATCTGATGGTGCCGGAGAGCCTCGCATCCCTGCCGATGTTCGCGGAGCTGATGGTTGAGAGCGATACTGCGACTGTGCCGCCA
GCCCGCGCTCTGTCGGAGGTGGAGATCGTGGTCGGCGATGTGGTGATCCGCGCCAGTGCTGGTGCAGATGAGGGCCAGCTGACGCGGGCGATCCGCGCGG
CTCGAGCGGCAATGTCATGATGCTGGGTCAAGGCGGGCCAGTGAAGGTCTTCGTGGCAACCCGACCGGTCGACTTCCGCAAGGGTATCGACGGCCTGGCC
CTGGCAGTCCAGGAGATGTTCGGGCTGGATCCGTTCTGTGGGGCGGTCTTTGTCTTCCGTTCCAAACGCGCTGACCGCATCAAGCTGCTGGCTTGGGATC
AGACTGGGATGGTGCTGGTTCACAAGCGACTGGAAGGCGGCAAGTTTGTCTGGCCGCAGGTGCGGGACGGGGTGATGCGGATGTCGTCCGCACAGATGGC
CGCGTTGTTCGAGGGCTTGGACTGGCGCCTTGTCCGACCTGAGCGAGTGCGGCGCCCTTTAACAGCAGGCTGATCGGTCGATCACTTGGGAATGCCTGTT
TTTGCTGGCTGAAAGGGATAATCCATGATTCACTTCGAGCCATGGATGCTGATGCTCTTGCCTCGGAAAACACCCTCCTGAAGGCTCGCCTGGCCGAGCT
TGAGGCGGCGCTATCCGAAGTGCAGGAGGCCAACCGTCGGCTGGAGGACATTTTGCGCACGGCGCAACGCGCGCAGTTTGGCAAAAGCTCCGAGAAGCTG
TCACCCGACCAGTTCAACCTGCCCTTGGAGGATGCCGAATTTGCTCAAGGGGTGCTCGAGGCGGCACAGGAAAAGGCCGAGGCGGCGCTACAGGGGTTGG
GGGCAAAGACACCCCGCAAACCTGCGCGCAACCGCGGGCATCTGCCGCTGCATCTGCCCCGGATCGAGCGCGTGATCGAACCCGCCAGCACCCTGTGTCC
CTGCGGTTGCGAGATGGCGCGGATCGGCGAAGACGTTTCCAAACGTCTCGATGTGATCCCCGCCCAGTTCCGGGTTCTGGTAACGCGTCGCCCCAAATAT
GCATGCCGCCGCTGCTCTCAGACTGTGGCGCAAGCTCATGCTCCCGAGCATGTGGTGCCTGGCGGAATGCCCACGGAGCGGTTTATCGCCTGGATTATTG
TCTCGAAGTTCGGCGACCACCTGCCGTTCTATCGCCAAGCCGAGATCTTCAAGCGTCAGGGGATCGATCTGGATCGCGGCACGCTTGGCAACTGGGTCGG
GCGCGCCTGCTTCCATCTGACGCCGGTCATCGACCACATGCGCGCTCATCTGCGCGGTGTGGACCGGATCTTCGTCGACGAGACCCGCGCGCCAGTGTTG
GATCCCGGCCGAAAGGCCACAAAGAGCGGATACTTCTGGGCCGTCGTGTCCGATGATCGCGGACATGGCGGTGCCGGACCGCCGATCGTGCTGTTCCACT
ATGCCCCCGGCCGGGGGAAAGAGCATCCGCTGAAGTTCCTCGGCGGATACCGGGGCCGGTTCCTGCAATGCGACGCCTATCAGTCCTACAATGCGATGAC
CGGGATGGAGCGTGACAACGGCCCCTGGCAACTGGTCTATTGCTGGACCCATGTCCGCCGCCGCTTCGTGAAGCGCTTCGAGAATGAGGGCTCACCCATT
GCGGAGGAGATGCTGCGCCAGATTGCACTGCTTTATCAGGTCGAGAAGACGGTCCGAGGAAGGGATCCAGCCGTGCGCCTTGCTGCTCGGCGCGACTGTT
CGGCACCGGTCATCGCCGCGCTCAAGCCGTGGCTGGAGGCCAAGCTCTCCCGCATTCCGCAGAAATCCCAACTGGCCGAGGACATCCGCTACACCCTTGC
ACATTGGCCCGGCCTGATCCGCTTCCTTGAGGACGGCACGCTTGAGCTGGACACCAACCCGGTCGAGAACCAGATCCGCCCAATAGCTCTTACCAGAAAA
AATGCGCTTTTTGCCGGCAATGAAACCGGCGCTGAGAACTGGGCAATGCTGGCCTCGCTGGTCGCCACCTGCAAAATGTCCAGCGTGAACCCGGTCGACT
ATATCGCCAACACTCTTGGGGCCATTCTCGATGGACATCCGAAATCCCGCATCGAGGACCTCATGCCCTGGCGCTTCAGCCAGACGTCAAGCCTCGCCGC
ATAGGGCAGAGCCGTAGCGCTTAC
Protein section
ORF number : 3

 

ORF 1
Length : 417 bp (138 aa)
Begin : 104
End : 520
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpA

ORF sequence :

VGVHLDGSWNGSVSRLEMIEGPSGRRQRTEAERARIAAESLIPGMRVADIARKHGTTRWQIYNWRNKLRSGHLMVPESLASLPMFAELMVESDTATVPPA
RALSEVEIVVGDVVIRASAGADEGQLTRAIRAARAAMS
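The ORF sequence can be tied back to the DNA sequence through the Begin coordinate. As a hedged illustration, the sketch below translates positions 104 to 124 of the DNA sequence (1-based, inclusive) with the standard genetic code. Using the standard code for this element is an assumption, and `translate` is a plain codon lookup that does not model start-codon handling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Positions 104-124 of the ISPye37 DNA sequence, i.e. the start of ORF 1.
orf1_start = "GTGGGCGTCCATTTGGACGGC"
print(translate(orf1_start))  # VGVHLDG
```

The result matches the first seven residues of the ORF 1 sequence, including the leading V from the GTG codon.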

 

ORF 2
Length : 357 bp (118 aa)
Begin : 517
End : 873
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpB

ORF sequence :

MMLGQGGPVKVFVATRPVDFRKGIDGLALAVQEMFGLDPFCGAVFVFRSKRADRIKLLAWDQTGMVLVHKRLEGGKFVWPQVRDGVMRMSSAQMAALFEG
LDWRLVRPERVRRPLTAG

 

ORF 3
Length : 1563 bp (520 aa)
Begin : 942
End : 2504
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MDADALASENTLLKARLAELEAALSEVQEANRRLEDILRTAQRAQFGKSSEKLSPDQFNLPLEDAEFAQGVLEAAQEKAEAALQGLGAKTPRKPARNRGH
LPLHLPRIERVIEPASTLCPCGCEMARIGEDVSKRLDVIPAQFRVLVTRRPKYACRRCSQTVAQAHAPEHVVPGGMPTERFIAWIIVSKFGDHLPFYRQA
EIFKRQGIDLDRGTLGNWVGRACFHLTPVIDHMRAHLRGVDRIFVDETRAPVLDPGRKATKSGYFWAVVSDDRGHGGAGPPIVLFHYAPGRGKEHPLKFL
GGYRGRFLQCDAYQSYNAMTGMERDNGPWQLVYCWTHVRRRFVKRFENEGSPIAEEMLRQIALLYQVEKTVRGRDPAVRLAARRDCSAPVIAALKPWLEA
KLSRIPQKSQLAEDIRYTLAHWPGLIRFLEDGTLELDTNPVENQIRPIALTRKNALFAGNETGAENWAMLASLVATCKMSSVNPVDYIANTLGAILDGHP
KSRIEDLMPWRFSQTSSLAA
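The Length values of the three ORFs follow from their Begin and End coordinates. The sketch below recomputes them, assuming coordinates are 1-based and inclusive and that End covers the stop codon. The `Orf` type and helper names are illustrative only.

```python
from typing import NamedTuple


class Orf(NamedTuple):
    name: str
    begin: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def length_bp(orf: Orf) -> int:
    """Nucleotide length of the ORF, both end positions included."""
    return orf.end - orf.begin + 1


def length_aa(orf: Orf) -> int:
    """Number of codons minus one (assumes End includes the stop codon)."""
    return length_bp(orf) // 3 - 1


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two ORFs (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


orfs = [Orf("ORF 1", 104, 520), Orf("ORF 2", 517, 873), Orf("ORF 3", 942, 2504)]
for orf in orfs:
    print(orf.name, length_bp(orf), length_aa(orf))
print(overlap_bp(orfs[0], orfs[1]))
```

With the coordinates in this record, the computed lengths agree with the listed values (417, 357 and 1563 bp), and ORF 1 and ORF 2 share 4 positions.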

 

Comments
ISPye37 shares 92% amino acid similarity with ISPye14.
ISPye37 was identified by in silico sequence analysis of the Paracoccus yeei CCUG 32053 genome, which contains 4 copies of ISPye37 and its isoforms.
References
[1] Chmielowska C., Szuplewska M., Bartosik D. (2018) Direct submission.