ISPre4
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004444 | ND | Pseudomonas resinovorans | Pseudomonas resinovorans plasmid pCAR1 |
DNA section
IS Length : 2575 bp
Ends
IR Length : 44/49
IRL : TGCGGATTCCACGGCCATCCGGCCACCTATTCCATAAACATCCGGCCACT
IRR : TGCGCATTCCACGCCATCCGGCCACCCAGTCCACCAACATCCGGCCACCC
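The IRR above is written in the same 5'-to-3' orientation as the IRL, so it should correspond to the reverse complement of the element's right end. A minimal Python sketch of that check follows. The helper name `reverse_complement` is illustrative only, and `right_end` is simply the last 50 bases of the DNA sequence listed in this record.

```python
# Complement table for unambiguous upper-case DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Terminal inverted repeats as given in this record.
IRL = "TGCGGATTCCACGGCCATCCGGCCACCTATTCCATAAACATCCGGCCACT"
IRR = "TGCGCATTCCACGCCATCCGGCCACCCAGTCCACCAACATCCGGCCACCC"

# Last 50 bases of the ISPre4 DNA sequence as listed in this record.
right_end = "GGGTGGCCGGATGTTGGTGGACTGGGTGGCCGGATGGCGTGGAATGCGCA"

# True when the IRR is the reverse complement of the right end.
irr_matches_right_end = reverse_complement(right_end) == IRR
```

For this record the comparison holds. The 44/49 figure reported under IR Length presumably comes from a gapped alignment of IRL against IRR, which this sketch does not attempt to reproduce.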
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATA | CCCAAGA | TGCAGAAAGCC | 7 |
DNA sequence
TGCGGATTCCACGGCCATCCGGCCACCTATTCCATAAACATCCGGCCACTGATTCCACGATGATCCGGCCACCCATTCCACGCTCATCCGGCCACCCTTA
AGGCAGGCAGCAACGCAGGATTATTCACTACCATCGACCTCTTTTATCGAAGCAGAGAGGTCGTCGTGGAGCGTTTATCCATGCGTAAAATTCGAGAAGT
GCTACGTCTCAAATTCGAGGTCGGCCTATCTGCCCGCCAGGTCGCTGGCAGCTTGCAAGTAGGCCGAGCCAGCGTCGGTGAGTACCTCAATCGCTTTGCT
GCCAGCGGCCTGACCTGGCCCTCTGCGCTGACTGACGCCGAGTTACAACGGCATCTTTTTCCGCCGCCGCCCACCGTTCCCAGCGATCAAAGGCCGGTGC
CAGACTGGGCTCATGTACACGGGGAGTTACGCCGCCCCGGCGTAACTCTGGCCCTGCTTTGGCAGGAATACCGACTCGCCCACCCGCAAGGCTTCCAATA
CAGCTGGTTCTGCGAGCACTACCGCCTCTGGGCCGCCAAGGTCGACGTGGTCATGCGCCAGGAGCACCGTGCCGGCGAAAAGCTGTTCGTCGATTACGCC
GGCCAGACCGTACCGGTCATTGATCGGCGAACCGGCGAGATCCGCCAAACGCAGATCTTCGTCGCCGTTCTCGGCGCGTCCAGCTACACCTTTGCCGAGG
CCACCTGGTCGCAGAAGCTGCCTGACTGGCTGGGCTCGCACGCCCGCTGTTTTGCTTTTTTCGGTGGCACATCGCAGATCCTGGTGCCCGATAATCTGCG
CAGCGGCGTCACCAAGGCGCATCGCTACGAGCCCGACATCAACCCCAGTTACCGCGATCTCGCCGAGCATTACGGCATCGCCGTATTACCTGCTCGCTCG
CGCAAACCCAAGGACAAAGCCAAGGTCGAAGTCGGCGTGCAAGTCGTCGAGCGCTGGATCCTCGCCGCGCTGCGCAACCGCCAGTTCTTCTCGCTGGGCG
AACTCAACACGGCAATCAGCCTGCTGCTCGAACGCCTCAATCAAAAGCCCTTCAAGAAGCTGCCCGGCTCACGCCGCTCGGCCTTTGAGACCATCGACCA
ACCCGCACTGCAACCGCTACCAGAACATCCGTACATCTACGCCGAATGGAAAAAGGTGCGGGTGCACATCGACTACCACGTCGAGGTCGAGGGCCATTAC
TACTCGGTGCCGTACCAGTTGGTGAAGCACCAACTCGAAGTGCGGCTAACCGCTAAGACCGTCGAGTGCTTCCACGCCAACCAGCGGGTGGCCAGTCACC
TGCGCTCGCTGCACAAAGGCCGCCACACCACCCAAACCGAGCACATGCCCAAGAGCCACCGGGAGCACGCCGAATGGACACCGCAACGGCTGATCCGCTG
GGCCGAGCAGACCGGGCCTAACACCGCTGGCGTTATCGCCTACATCCTTGAGCGCCGAATCCATCCTCAGCACGGCTTCCGCGCCTGCCTGGGCATCCTG
CGCCTGAGCAAGCAGCACGGCGAAGAGCGGCTGGAAGCCGCTTGCCAGCGAGCGCTGGCACTGGGCGCGTGCAGCTACAAAAGCCTTGAATCGATCCTGC
GCCAAGGGCTGGAAAAGCTACCGCTGTCCCAGCAAAACCTGCCGCTACTGCCCGACGAACACATCAACCTGCGCGGCCCTGGCTACTACCACTGACCTGA
AAAAGGAGCACCAAAATGCTGCCCAATCCGACACTCGACAAGCTGCAAACCCTGCGCCTGCACGGCATGATCAAGGCGCTGAGCGAGCAACACGCCACGC
CCGATATCAATGATCTGAGCTTCGACGAACGCCTCGGCCTGATGATCGACCGCGAGCTGACCGAACGTGAAAACACGCGTCTGAGCAGCCGGCTGAAACT
GGCACGACTGCGCCACAACGCTTGCCTGGAAGACATCGATTACCGCAGCCCACGCGGCCTGGACAAGTCCCTGATCCTGCAACTGGGCAGCGGCCAATGG
CTGCGCGATGGCTTGAACCTGATCATCGGCGGCCCGACCGGTGTAGGCAAAACCTGGCTCGCCTGCGCGCTGGCCCACAAGGCCTGCCGTGACGGCTACA
GCGTGCGCTACTTGCGTCTGCCGCGCCTGATGGAGGAACTGGGCCTGGCCCACGGAGACGGTCGATTCGCGAAACTGATGGCGGGCTATGCCAAGACCGA
CTTGCTGATCCTGGACGACTGGGGCCTGGCGCCGTTTACCGCACCGCAACGGCGCGACATGCTTGAGCTATTGGACGACCGCTACGGCAACCGCTCGACG
CTGGTCACCAGCCAGATGCCCGTGGACAAATGGCATGCACTGATCGGCGATCCGACCTTGGGCGACGCGATCCTCGACCGGCTGGTGCACAACGCTTATC
GGATCGAACTGAAGGGCGAATCGATGCGCAGACGCGCAACGAAATTGACGACGACAGGGACTTCAGACTAACAATGCAAACCTGCGTCGCTGCGCTCCGA
CTGCCTGGCCGGATGAGCGTGGAACGGGTGGCCGGATGTTGGTGGACTGGGTGGCCGGATGGCGTGGAATGCGCA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1530 bp | 509 aa | 166 | 1695 | + | No |
Chemistry : DDE
ORF sequence :
MERLSMRKIREVLRLKFEVGLSARQVAGSLQVGRASVGEYLNRFAASGLTWPSALTDAELQRHLFPPPPTVPSDQRPVPDWAHVHGELRRPGVTLALLWQ
EYRLAHPQGFQYSWFCEHYRLWAAKVDVVMRQEHRAGEKLFVDYAGQTVPVIDRRTGEIRQTQIFVAVLGASSYTFAEATWSQKLPDWLGSHARCFAFFG
GTSQILVPDNLRSGVTKAHRYEPDINPSYRDLAEHYGIAVLPARSRKPKDKAKVEVGVQVVERWILAALRNRQFFSLGELNTAISLLLERLNQKPFKKLP
GSRRSAFETIDQPALQPLPEHPYIYAEWKKVRVHIDYHVEVEGHYYSVPYQLVKHQLEVRLTAKTVECFHANQRVASHLRSLHKGRHTTQTEHMPKSHRE
HAEWTPQRLIRWAEQTGPNTAGVIAYILERRIHPQHGFRACLGILRLSKQHGEERLEAACQRALALGACSYKSLESILRQGLEKLPLSQQNLPLLPDEHI
NLRGPGYYH
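Reading the DNA sequence in this record from position 166 (assuming 1-based, inclusive coordinates) gives a first codon of GTG rather than ATG. The sketch below translates the first 33 bases of ORF 1 with the standard codon table and treats the first codon as an initiator that is read as methionine. The function name `translate_orf` and the choice to force the first residue to `M` are assumptions made for illustration, not part of the record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_orf(dna: str) -> str:
    """Translate an in-frame ORF, stopping at the first stop codon.

    Assumption: the first codon is the initiation codon, so it is
    emitted as 'M' even when it is an alternative start such as GTG.
    """
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[dna[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# Bases 166-198 of the ISPre4 DNA sequence as listed in this record.
orf1_start = "GTGGAGCGTTTATCCATGCGTAAAATTCGAGAA"
orf1_prefix = translate_orf(orf1_start)  # "MERLSMRKIRE"
```

The translated prefix agrees with the first eleven residues of the ORF 1 sequence above.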
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
756 bp | 251 aa | 1716 | 2471 | + | No |
AG : IS21 helper
ORF sequence :
MLPNPTLDKLQTLRLHGMIKALSEQHATPDINDLSFDERLGLMIDRELTERENTRLSSRLKLARLRHNACLEDIDYRSPRGLDKSLILQLGSGQWLRDGL
NLIIGGPTGVGKTWLACALAHKACRDGYSVRYLRLPRLMEELGLAHGDGRFAKLMAGYAKTDLLILDDWGLAPFTAPQRRDMLELLDDRYGNRSTLVTSQ
MPVDKWHALIGDPTLGDAILDRLVHNAYRIELKGESMRRRATKLTTTGTSD
Blast result :
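The ORF tables above relate coordinates to lengths in a simple way: the nucleotide length is `End - Begin + 1`, and the protein length is one less than the codon count because the stop codon is not translated. A small sketch, assuming 1-based inclusive coordinates and that each ORF includes its stop codon (the function name `orf_lengths` is hypothetical):

```python
def orf_lengths(begin: int, end: int) -> tuple[int, int]:
    """Return (length in bp, length in aa) for an ORF.

    Assumes 1-based inclusive coordinates and that the ORF span
    includes the terminal stop codon.
    """
    length_bp = end - begin + 1
    if length_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    length_aa = length_bp // 3 - 1  # the stop codon adds no residue
    return length_bp, length_aa


orf1 = orf_lengths(166, 1695)   # (1530, 509)
orf2 = orf_lengths(1716, 2471)  # (756, 251)
```

Both results match the lengths listed for ORF 1 and ORF 2 in this record.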
Comments
ISPre4 shows 94% (ORF1) and 97% (ORF2) amino acid similarity to ISPsy14.
References
[1] Urata, M., Miyakoshi, M., Kai, S., Maeda, K., Habe, H., Omori, T., Yamane, H. and Nojiri, H. (2004) J. Bacteriol. 186 (20), 6815-6823
[2] Maeda, K., Nojiri, H., Shintani, M., Yoshida, T., Habe, H. and Omori, T. (2003) J. Mol. Biol. 326 (1), 21-33