ISPye45
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
 | ND | Paracoccus yeei | Paracoccus yeei Paracoccus yeei CCUG 32053 |
DNA section
IS Length : 2462 bp
Ends
IR Length : 14/20
IRL : GTAAGCGTTTCCTGGCCTCCACCTTCCCGTCTTTGACCTGATCATCGCAT
IRR : GTAAGCGGTCGCTGGAAACCGCGGTCGGCTTAGCTTGACGGCTTGAAGTT
Comments : Empty insertion sites in other P. yeei strains allowed the ends and direct repeats (DRs) to be defined precisely.
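The relationship between the listed ends and the DNA sequence can be checked with a short script. The sketch below assumes that IRR is written as the reverse complement of the right end of the element, and that "14/20" means 14 identical positions over the first 20 nt of IRL and IRR. Both readings are assumptions about the record's conventions, although they are consistent with the sequences given here. The function and variable names are illustrative only.

```python
# Terminal inverted repeats exactly as listed in this record.
IRL = "GTAAGCGTTTCCTGGCCTCCACCTTCCCGTCTTTGACCTGATCATCGCAT"
IRR = "GTAAGCGGTCGCTGGAAACCGCGGTCGGCTTAGCTTGACGGCTTGAAGTT"

# Last 50 nt of the DNA sequence in this record (top strand, 5'->3').
RIGHT_END = "AACTTCAAGCCGTCAAGCTAAGCCGACCGCGGTTTCCAGCGACCGCTTAC"

# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def count_matches(a: str, b: str, window: int) -> int:
    """Count identical positions in the first `window` nt of a and b."""
    return sum(x == y for x, y in zip(a[:window], b[:window]))


if __name__ == "__main__":
    # Assumption: IRR is the right end read inward on the bottom strand.
    print(reverse_complement(RIGHT_END) == IRR)
    # Assumption: "14/20" is matches / compared length.
    print(count_matches(IRL, IRR, 20))
```

With the sequences from this record, the first check holds and the second returns 14, which agrees with the stated IR length of 14/20.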
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ttccggccga | AGACAGCC | tgcctgaagg | 8 |
cacggcagga | GTATGTAT | gaagaagata | 8 |
ctgagctact | CCCCAATC | cttggaactt | 8 |
gccctcttgt |  | gtcttggcgg | 0 |
DNA sequence
GTAAGCGTTTCCTGGCCTCCACCTTCCCGTCTTTGACCTGATCATCGCATTGGATGTTTGTGATGATCAGGACGGAGTTTCTCCATGGAGGCTGCATTGG
AGGTTCTCCCGGTTGGTGGCCGAAGCCGCCGGAACCGGAAGTGGCCAGATGAGGTGAAGGCGCGGGTTGTGGCGGAAACGCTGCTGCCGGGCGCAACGGT
GAACGAAGTGGCGCGCCGGCACGGGGTTTGCGCGAACCATGTTTCGTCGTGGCGGACGCTGGCGCGGAAAGGGCGGTTGGTCCTGCCCGCGCCGGAGGAT
CCGGTGGAATTTGTGGCGCTGATGGTCGGCTCCACTGAGGATGCGCCGCGCGCTGATGTTGGCGGCGAGGCCCGGCCGGAGATCATCGCCGGCACCGTTG
TGATCCGCCTGGAAACCGGGGCCACGGCGGAGCGGATTGCCTCGGTCGCGCATGCGTTGGCAGCGCGGCCATGATGTTTCCGTCGAACCGGGTGCGGATC
ATGGTCGCGACAAAGCCCGTCGACTTCAGGAAGGGACATGACGGCTTGTGTGCGCTGGTTGCCTCGATGTTGCGCAAGGATCCGTTCACCGGCACGGTGT
TCGTGTTCCGCTCGCGGCGGCTGGACAGGCTGAAGCTGCTTTACTGGGACGGCACCGGTCTGGTGATGGCCTATAAGCGGCTGGAGGCCACGAGTTTCAC
CTGGCCCGCGATCAAGGACGGGGTGATTGCGCTGAACCATGCGCAGTTCGAGGCGCTGTTTGCCGGGCTGGACTGGCGCAAGGTCAAGGCATTGGATGTG
CGCCCGCCTGCTGCGACAGAATGAATCAACCGGTTGTTTTTTCGTGTTTTTGCTGAGGTTTGATGATAGTGTCAGGCCATGCAGGCCGTCTCCTCTCGAT
CTGTCCGCCATCCCGACTGCGCAACGCGCGGCGGTTCAGGCATTGATGGAGCAGGTGGCGGCACTCACGGAAATCACCCGGCGGCAAGAGCACCTGATCG
CTGAGTTGAATCATGCCCTGCATGGCAAGCGGTCAGAAAAGCTGACCGAGGACGAGCGGCAGCTGGCCTTCGAGGACTTATCCATCGCGCTGGCCGAGGT
CGAGGCCGAGAAAGAAACCCGCTCTGCCAAAGGGGGCGGTGGGGCAGCCAGGCCCGCGCCAAAGCGCACCATCGGCAACCTCCCGGCCGCGCTGCCCCGC
ATCGAGGAAGTCATCGAGCCTGACAGCCTGATCTGCCCCTGTGGCTGCGGCATCATGCACAAGATCGGCGAAGATCGCTCGGAGCGGCTGGACATCGTGC
CGGCTCAGTTGCGGGTCATCGTCACCGTGCGGCCGAAGTATGCTTGCCGGACTTGCACCGACGGTGTGACCCAGGCGCCCGCGCCGTCCCACCTGATCAC
GGGCGGCCTGCCGACCGAGGCAACCATCGCCCATGTGCTGGTCAGCAAATATGCGGACCATCTGCCTCTGTATCGCCAGAGCCAGATCCTGGCGCGGGCA
GGCCTTGATCTGCACCGCGCCGTGCTGGCTGATTGGGTCGGCAAGGCTGCATTCCATCTGAAGCCCGTCGTCGACCGGCTGGCCGAGCATCTGAAACGAT
CCGGCAAGCTGTTCATGGACGAAACTACCGCCCCGGTGCTGGATCCGGGGCGCGGCACGACCAAGACTGGATATCTCTGGGCCTTTGGCCGCGATGACCG
ACCATGGGGCGGGGACGATCCGCCCGGCGTGGTCTACTTCTACGCGCCCGGTCGCGCCGGCGAGAATGCCGAAACCTTCCTGACCGGCTTCGACGGTATC
CTGCAAGTCGATGGGTATCCCGGCTACAACCGGCTGACTAAAGCCTCGCGCAAGGGCGGCGATCCCATTCGGGTGGCCCATTGCTGGGCGCATGCACGGC
GCAAGCTGAAAGAGGTCTTCGACCGCGACGGCTCCGAGATCGCCGCCGAGGGCCTGCGCCGCATCGCCGAATTCTATGCCGTCGAGGCCGATATCCGCGG
CGTCTCGCCCGGCCAACGCCTGTCTGCCCGCCAGGCCCGCACCGCGCCGCTGGTGGCAGCCTTCGGCGGCTGGCTTCAGGCGCAGCGCCGCAAGATATCC
GCCAAATCCCGCCTGGGCGAAAAGCTGACCTACATCCATAACCAATGGGACGGGCTGCAAACCTTCCTGACCGACGGCCGCGTCGAGATCGACTCCAACA
GGGTCGAGAATCTCGTCCGCCCGATCGCCCTCAATCGCAAGAATGCGCTCTTCGCCGGTCACGACGAAGGCGGTATCGCCTGGGGGCGCATCGCCTCGCT
GATCGAAACCTGCAAGATCAATGGCGTCGAGCCCTTCGCCTACCTCAACGCCACGCTCACCGCCATCGCCAGCGGTCATCCGCAAAGCCGCATCGACGAC
CTGCTGCCATGGAACTTCAAGCCGTCAAGCTAAGCCGACCGCGGTTTCCAGCGACCGCTTAC
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
390 | 129 | 85 | 474 | + | No |
AG : IS66 TnpA
ORF sequence :
MEAALEVLPVGGRSRRNRKWPDEVKARVVAETLLPGATVNEVARRHGVCANHVSSWRTLARKGRLVLPAPEDPVEFVALMVGSTEDAPRADVGGEARPEI
IAGTVVIRLETGATAERIASVAHALAARP
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
354 | 117 | 471 | 824 | + | No |
AG : IS66 TnpB
ORF sequence :
MMFPSNRVRIMVATKPVDFRKGHDGLCALVASMLRKDPFTGTVFVFRSRRLDRLKLLYWDGTGLVMAYKRLEATSFTWPAIKDGVIALNHAQFEALFAGL
DWRKVKALDVRPPAATE
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1488 | 495 | 946 | 2433 | + | No |
Chemistry : DDE
ORF sequence :
MEQVAALTEITRRQEHLIAELNHALHGKRSEKLTEDERQLAFEDLSIALAEVEAEKETRSAKGGGGAARPAPKRTIGNLPAALPRIEEVIEPDSLICPCG
CGIMHKIGEDRSERLDIVPAQLRVIVTVRPKYACRTCTDGVTQAPAPSHLITGGLPTEATIAHVLVSKYADHLPLYRQSQILARAGLDLHRAVLADWVGK
AAFHLKPVVDRLAEHLKRSGKLFMDETTAPVLDPGRGTTKTGYLWAFGRDDRPWGGDDPPGVVYFYAPGRAGENAETFLTGFDGILQVDGYPGYNRLTKA
SRKGGDPIRVAHCWAHARRKLKEVFDRDGSEIAAEGLRRIAEFYAVEADIRGVSPGQRLSARQARTAPLVAAFGGWLQAQRRKISAKSRLGEKLTYIHNQ
WDGLQTFLTDGRVEIDSNRVENLVRPIALNRKNALFAGHDEGGIAWGRIASLIETCKINGVEPFAYLNATLTAIASGHPQSRIDDLLPWNFKPSS
Blast result :
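The ORF coordinates above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that each ORF span includes its stop codon, so the protein length is one less than the number of codons. These are assumptions about the record's conventions, not something it states. The dictionary layout and function names are illustrative only.

```python
# ORF coordinates and lengths as listed in this record.
ORFS = {
    "ORF 1": {"begin": 85, "end": 474, "bp": 390, "aa": 129},
    "ORF 2": {"begin": 471, "end": 824, "bp": 354, "aa": 117},
    "ORF 3": {"begin": 946, "end": 2433, "bp": 1488, "aa": 495},
}


def span_bp(begin: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - begin + 1


def expected_aa(bp: int) -> int:
    """Residue count, assuming the span ends with a stop codon."""
    return bp // 3 - 1


def overlap_bp(a: dict, b: dict) -> int:
    """Number of positions shared by two inclusive intervals (0 if none)."""
    shared = min(a["end"], b["end"]) - max(a["begin"], b["begin"]) + 1
    return max(0, shared)


if __name__ == "__main__":
    for name, orf in ORFS.items():
        consistent = (
            span_bp(orf["begin"], orf["end"]) == orf["bp"]
            and orf["bp"] % 3 == 0
            and expected_aa(orf["bp"]) == orf["aa"]
        )
        print(name, "consistent:", consistent)
    print("ORF 1 / ORF 2 overlap (bp):", overlap_bp(ORFS["ORF 1"], ORFS["ORF 2"]))
```

Under these assumptions all three ORFs are internally consistent, and ORF 1 and ORF 2 share 4 nt (positions 471 to 474), while ORF 2 and ORF 3 do not overlap.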
Comments
ISPye45 shares 72% amino acid similarity with ISRel8.
ISPye45 was identified by in silico sequence analysis of Paracoccus yeei strain CCUG 32053 (4 copies of ISPye45 and isoforms present in the genome).
References
[1] Chmielowska C., Szuplewska M., Bartosik D. (2018) Direct submission.