ISCph14
- Family ISNCY
- Group ISDol1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP001101 | ND | Chlorobium phaeobacteroides | Chlorobium phaeobacteroides BS1 |
DNA section
IS Length : 1878 bp
Ends
IR Length : 17/20
IRL : GTGTCTGACAGAAAAGCCATTATAATATTATAAAATATAGTCTTATATTC
IRR : GTGCCTGACAGAAAAGTCTTGCTTTTATAAAAAACAACTCATTTTATAAG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGATGTTCCT | TACGTA | GAAGCTCACA | 6 |
CCGAGCAGTA | TAAGTA | CATAGAAAAA | 6 |
GTGAACAAGA | TAATTA | TATACAGTAA | 6 |
CTGAAAAAGA | TAAGTA | GTTGTCCATC | 6 |
DNA sequence
GTGTCTGACAGAAAAGCCATTATAATATTATAAAATATAGTCTTATATTCAATTATAAGTCACAAAGATAACGATGCCATACGGGAAAGAGACAGAAAAT
TCAGTCATTACCGGCTTTTTTGCTCTCAAAACCATCATATCGGGCATTTATCCTGTATCCAGAATAAAAACTTCGATTTTTCAGCTTCACCATGCGCACG
GTCATTAACCCGCAAATGATGTTTGGTCAGATTGATATATCGGCCATTACGTTTGACCCGAAATCCCGAGATGATATCCCAAAATTGCTCCGTGGTCTAC
AGGGTATCTATACCAACATCGAGTTGCGCCAGCGTGTGTTTGCTATCCTTGAGGAAGTTCGTCCTGACCGAAAAAACGGTACAGGCAAAGCTGATATGCA
TAACGGTCGTCCTGGCATGGAACAATGGACGATTCTGGTGCTTGGTGTTCTCCGGCTTGGCCTCAATATTGATTACGATCGTCTGCAGGAATTGGTCAAT
CAGCATAAGAATATTCGCCAGATGATTGGTCTGGACGGTTGGTACGATAAGACAACCTATGAATTACAGACGATCAAAGATAATGTGCAATTGATTACTC
CCGAGTTGTTGGACAGAATCAATCAGGAGGTGGTGCGAGCAGGCCATGCTTTGGTAAAAAAAAAGATCAAACCAACGACCTGCCCTTAATGGCCAGAGGT
GACTCATTCGTCGTTGAAACCAATGTTCATTTTCCTGCCGATACGTCCCAGTTGTTCGATGCCATACGCAAAGTTATAGAATTAACCGCAAAATTGAGCA
TCCTGCAGGGCCTCAGTCAATGGCGTCAATACCGGCATGTGATTCTGGGCCTGAAAAGAGATCTTCGGGTTGTGCAAAAGCTGAAGCACTCAACAAGCGG
GGATGCTGAAAAGCAAGAAAAAGCCAATAAGGCAATAAAACAGGCCTATCTGAACTATTGTGGCAACGTTGAATATCAACTGCTCAGGGCTAAGATAACG
GTCGATGAATCGAAAGAGAACGATTCTCTATTGTTATCTAAAATAAGCAGTTACATTACCCATGCCGAAATCCAGCTCGATCAGATACATCGTCGGATAT
TCATGAATGAAAAGATTCCACATGGAGAAAAAGTATTTTCAGTATTCGAACCACATACCGAGTGGATCAGTAAAGGAAAAATCGGTGTGCCCGTTGAATT
GGGATTGAACGTCTGTATAATACAAGATCAGTACCAGTTTATCCTGCATCATCACGTGATGGAAAAAGTTACTGATAGCGAGATTGCCGTCTCGATAGTC
AAAGAGACGAAAAGCCGATTTACGAATTTGCGAGCAATCAGCTTCGATAAAGGATTTCATAGTCCCGATAACCAGAAAGCACTCAAAGAACTTGTTGCTG
TCGTGGTGCTCCCCAAAAAAGGAAACCGGTCAGCGAGTGACAAAGCAAGAGAGACGGCACCGGAATTCAAACGATTGAGAAAGAAGCATTCGGCAGTTGA
GTCCGGTATCCATGCACTTGAAGTGCATGGGCTGGATATCTGCCCGGATCACGGAATAGACGGATTCAAACGATATGTTTCGCTCAGTGTCCTGGCATAC
AATATACATCGACTCGGAGCGCTGCTGCAGAAACAAGACATGAGGCGATACCGGCGACGGCTCAGACAAGCCGCCTAAAGGTCTTTTGAAGCAGCAATTT
TAATCACTTCGATGGGTGAAATACGTGCAATATGAAGCTTTTGATGGTAATATTCGGTAATAGTGAATGCTATGAACATCTTATGCGCATAAAAACGGCA
ATTGCACTAAATAACAGGTTTTTTATGGCTTATAAAATGAGTTGTTTTTTATAAAAGCAAGACTTTTCTGTCAGGCAC
TCAGTCATTACCGGCTTTTTTGCTCTCAAAACCATCATATCGGGCATTTATCCTGTATCCAGAATAAAAACTTCGATTTTTCAGCTTCACCATGCGCACG
GTCATTAACCCGCAAATGATGTTTGGTCAGATTGATATATCGGCCATTACGTTTGACCCGAAATCCCGAGATGATATCCCAAAATTGCTCCGTGGTCTAC
AGGGTATCTATACCAACATCGAGTTGCGCCAGCGTGTGTTTGCTATCCTTGAGGAAGTTCGTCCTGACCGAAAAAACGGTACAGGCAAAGCTGATATGCA
TAACGGTCGTCCTGGCATGGAACAATGGACGATTCTGGTGCTTGGTGTTCTCCGGCTTGGCCTCAATATTGATTACGATCGTCTGCAGGAATTGGTCAAT
CAGCATAAGAATATTCGCCAGATGATTGGTCTGGACGGTTGGTACGATAAGACAACCTATGAATTACAGACGATCAAAGATAATGTGCAATTGATTACTC
CCGAGTTGTTGGACAGAATCAATCAGGAGGTGGTGCGAGCAGGCCATGCTTTGGTAAAAAAAAAGATCAAACCAACGACCTGCCCTTAATGGCCAGAGGT
GACTCATTCGTCGTTGAAACCAATGTTCATTTTCCTGCCGATACGTCCCAGTTGTTCGATGCCATACGCAAAGTTATAGAATTAACCGCAAAATTGAGCA
TCCTGCAGGGCCTCAGTCAATGGCGTCAATACCGGCATGTGATTCTGGGCCTGAAAAGAGATCTTCGGGTTGTGCAAAAGCTGAAGCACTCAACAAGCGG
GGATGCTGAAAAGCAAGAAAAAGCCAATAAGGCAATAAAACAGGCCTATCTGAACTATTGTGGCAACGTTGAATATCAACTGCTCAGGGCTAAGATAACG
GTCGATGAATCGAAAGAGAACGATTCTCTATTGTTATCTAAAATAAGCAGTTACATTACCCATGCCGAAATCCAGCTCGATCAGATACATCGTCGGATAT
TCATGAATGAAAAGATTCCACATGGAGAAAAAGTATTTTCAGTATTCGAACCACATACCGAGTGGATCAGTAAAGGAAAAATCGGTGTGCCCGTTGAATT
GGGATTGAACGTCTGTATAATACAAGATCAGTACCAGTTTATCCTGCATCATCACGTGATGGAAAAAGTTACTGATAGCGAGATTGCCGTCTCGATAGTC
AAAGAGACGAAAAGCCGATTTACGAATTTGCGAGCAATCAGCTTCGATAAAGGATTTCATAGTCCCGATAACCAGAAAGCACTCAAAGAACTTGTTGCTG
TCGTGGTGCTCCCCAAAAAAGGAAACCGGTCAGCGAGTGACAAAGCAAGAGAGACGGCACCGGAATTCAAACGATTGAGAAAGAAGCATTCGGCAGTTGA
GTCCGGTATCCATGCACTTGAAGTGCATGGGCTGGATATCTGCCCGGATCACGGAATAGACGGATTCAAACGATATGTTTCGCTCAGTGTCCTGGCATAC
AATATACATCGACTCGGAGCGCTGCTGCAGAAACAAGACATGAGGCGATACCGGCGACGGCTCAGACAAGCCGCCTAAAGGTCTTTTGAAGCAGCAATTT
TAATCACTTCGATGGGTGAAATACGTGCAATATGAAGCTTTTGATGGTAATATTCGGTAATAGTGAATGCTATGAACATCTTATGCGCATAAAAACGGCA
ATTGCACTAAATAACAGGTTTTTTATGGCTTATAAAATGAGTTGTTTTTTATAAAAGCAAGACTTTTCTGTCAGGCAC
Recoding section
- Recoding by frameshift
- Frame -1
- Type transcriptional
- Experimentally demonstrated No
Stimulators :
- Shine-Dalgarno sequence : No
- Secondary structure : stem-loop
Recoding motif :
AAAAAAAAGATCAAACCAACGACCTGCCCTTAATGGCCAGAGGTGACTCA
................(((((((..(((......)))..(((....))).
TTCGTCGTTGAAACCAATG
...))))))).........
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
498 bp | 165 aa | 192 | 689 | + | No |
Description : First part of the transposase
ORF sequence :
MRTVINPQMMFGQIDISAITFDPKSRDDIPKLLRGLQGIYTNIELRQRVFAILEEVRPDRKNGTGKADMHNGRPGMEQWTILVLGVLRLGLNIDYDRLQE
LVNQHKNIRQMIGLDGWYDKTTYELQTIKDNVQLITPELLDRINQEVVRAGHALVKKKIKPTTCP
LVNQHKNIRQMIGLDGWYDKTTYELQTIKDNVQLITPELLDRINQEVVRAGHALVKKKIKPTTCP
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1095 bp | 364 aa | 584 | 1678 | + | No |
Description : Second part of the transposase
ORF sequence :
CAIDYSRVVGQNQSGGGASRPCFGKKKDQTNDLPLMARGDSFVVETNVHFPADTSQLFDAIRKVIELTAKLSILQGLSQWRQYRHVILGLKRDLRVVQKL
KHSTSGDAEKQEKANKAIKQAYLNYCGNVEYQLLRAKITVDESKENDSLLLSKISSYITHAEIQLDQIHRRIFMNEKIPHGEKVFSVFEPHTEWISKGKI
GVPVELGLNVCIIQDQYQFILHHHVMEKVTDSEIAVSIVKETKSRFTNLRAISFDKGFHSPDNQKALKELVAVVVLPKKGNRSASDKARETAPEFKRLRK
KHSAVESGIHALEVHGLDICPDHGIDGFKRYVSLSVLAYNIHRLGALLQKQDMRRYRRRLRQAA
KHSTSGDAEKQEKANKAIKQAYLNYCGNVEYQLLRAKITVDESKENDSLLLSKISSYITHAEIQLDQIHRRIFMNEKIPHGEKVFSVFEPHTEWISKGKI
GVPVELGLNVCIIQDQYQFILHHHVMEKVTDSEIAVSIVKETKSRFTNLRAISFDKGFHSPDNQKALKELVAVVVLPKKGNRSASDKARETAPEFKRLRK
KHSAVESGIHALEVHGLDICPDHGIDGFKRYVSLSVLAYNIHRLGALLQKQDMRRYRRRLRQAA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1487 bp | 495 aa | 192 | 1678 | + | Yes |
Chemistry : DDE
ORF sequence :
MRTVINPQMMFGQIDISAITFDPKSRDDIPKLLRGLQGIYTNIELRQRVFAILEEVRPDRKNGTGKADMHNGRPGMEQWTILVLGVLRLGLNIDYDRLQE
LVNQHKNIRQMIGLDGWYDKTTYELQTIKDNVQLITPELLDRINQEVVRAGHALVKKKDQTNDLPLMARGDSFVVETNVHFPADTSQLFDAIRKVIELTA
KLSILQGLSQWRQYRHVILGLKRDLRVVQKLKHSTSGDAEKQEKANKAIKQAYLNYCGNVEYQLLRAKITVDESKENDSLLLSKISSYITHAEIQLDQIH
RRIFMNEKIPHGEKVFSVFEPHTEWISKGKIGVPVELGLNVCIIQDQYQFILHHHVMEKVTDSEIAVSIVKETKSRFTNLRAISFDKGFHSPDNQKALKE
LVAVVVLPKKGNRSASDKARETAPEFKRLRKKHSAVESGIHALEVHGLDICPDHGIDGFKRYVSLSVLAYNIHRLGALLQKQDMRRYRRRLRQAA
LVNQHKNIRQMIGLDGWYDKTTYELQTIKDNVQLITPELLDRINQEVVRAGHALVKKKDQTNDLPLMARGDSFVVETNVHFPADTSQLFDAIRKVIELTA
KLSILQGLSQWRQYRHVILGLKRDLRVVQKLKHSTSGDAEKQEKANKAIKQAYLNYCGNVEYQLLRAKITVDESKENDSLLLSKISSYITHAEIQLDQIH
RRIFMNEKIPHGEKVFSVFEPHTEWISKGKIGVPVELGLNVCIIQDQYQFILHHHVMEKVTDSEIAVSIVKETKSRFTNLRAISFDKGFHSPDNQKALKE
LVAVVVLPKKGNRSASDKARETAPEFKRLRKKHSAVESGIHALEVHGLDICPDHGIDGFKRYVSLSVLAYNIHRLGALLQKQDMRRYRRRLRQAA
Blast result :Comments : This ORF is the putative ORFAB transposase reconstructed in silico by possible -1 frameshift.
Comments
ISCph14 is 82% aa similar to ISPph5.
References
1] ISfinder annotation (2016)
2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ovchinnikova,G., Li,T., Liu,Z., Zhao,F., Overmann,J., Bryant,D.A. and Richardson,P. (2008) Direct submission GenBank.
2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ovchinnikova,G., Li,T., Liu,Z., Zhao,F., Overmann,J., Bryant,D.A. and Richardson,P. (2008) Direct submission GenBank.