ISPre3
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004444 | ND | Pseudomonas resinovorans | Pseudomonas resinovorans plasmid pCAR1 |
DNA section
IS Length : 2957 bp
Ends
IR Length : 17/24
IRL : GTAAGCGTCCGCCCTGCACCCACTGCCTGCCCTAGCCTTGAATAGCCACA
IRR : GTAAGCGTACGCTCGCTACCGTCTTGACCCTCGGGTCTTAATCCGCTTCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCCGCCTCCA | GAATAATT | GGCCTTTTGA | 8 |
CTTTCAGCGG | GGTACATA | TCTGCTGAGG | 8 |
DNA sequence
GTAAGCGTCCGCCCTGCACCCACTGCCTGCCCTAGCCTTGAATAGCCACAATGGTGCCCACCATTTTTTAGTGGGCACCATTGTGGATACGCTCCCTTCT
ACTCCTCGTCGCAGTCGACGCCCCAACTTTTCGGTGGAGTTCAAGCGTCGTGTCGTCGAGGCCACGCTATTGCCCGGCGCATCAGTCGCGTTGATCGCCC
GAGAGCATGAGATCAATGCCAATCTCGTCTTCAAATGGCGCCGTCACTATCGGGAGGGGCAACTGGGGTCAGTCGCCCAGCAGGCCACTTTGTTACCGGT
CAATCTGAGCAAGGCGCCGACGTCTTCAGCGGAAGAGGCCGTGCGACTGCCGCTGAGTCCGGGCGGTCTGGTCGTGGAGTGCGGTCGGGTCACCTTGCGC
ATTGAGGGCGTGCCCGACCCGCAAACCTTGCAACTGGTCTTGCAGCAGGTGTTGCGATGATTGCTCCGCCCATCGGGACTCGCATTTGGATCGCTGCCGG
GGTCACGGACATGCGGCGGGGGTTCGATGGCCTGGCGGCCCTGGTGCAAACTCAGCTTGAAGCGGATCCATTCTCTGGCCAGATCTTTGTGTTTCGAGGC
CGACGGGGTGACCGGATTAAATTGCTGTGGTGGGATGGCGATGGCTTGTGTTTGTTCTGCAAGCGGCTCGAGCAGGGGCGCTTCGTCTGGCCGCAGGCCG
CCAGCGGCAGCGTATCCTTGACGACGGCACAACTCGCGATGCTGTTGGAGGGCATCGATTGGCGCCGACCGATACGCACCGCACCGGTACGAACGGTCTG
ATTTTCAACGCAGGAGAACACCAGTAACATCGCGCCATGACCTGCGCGACTGACTGCCTCCCTGACGACATCCAGGCCTTGAAAGCCTTGGTAACCGCCC
AGCGCGGGGAAATCGAGCACCTGAAACTGATGATCGCCAAGCTGCGGCGCACGCAGTTTGGTCGGCGTTCTGAGCAACTCGATGGCATGCTCGACCAACT
GCAACTGACGCTCGAGGAGTTGCAGGTCAGCGAGGCCGCACTGGCCTCGCCACCACCAGCCGAATCGCATCCACGTCCCCCTGTACGACGCAAGCCATTG
CCCGCGCACTTGCCGCGTGAAATCCACGTCCATCGACCCGATACGCAGTGTCCTGGCTGTGGCGGCGAACTTCGCCATTTGGGTGATGACGTGGCTGAAG
TTCTGGAGTACGTGCCGGCACGGTTCAAGGTGGTTCGCCATGTGCGGCCCAAGTTGGCTTGCCGCTGCTGTGACGGGATTGTGCAGGTTCCTGCCCCTAG
CCGCCCGATTGCCCGGGGGCTTGCCGGGCCAGGATTGTTGGCCCATGTGCTGGTGTCCAAATACGTGGATCATCTGCCGCTGTATCGGCAGTCCGAAATC
TATGCGCGCGAGGGCGTGTCCCTGGAGCGCTCGACCATGGCCGACTGGGTCGGCGAAGCGAGCCGGCTCTTGCAGCCATTGGTCGGCCGGTTGCGGCAGC
ACGTCATGACCAGCAACAAAGTCCACGCCGACGATACGCCGATCGCCGTGCTCGCCCCCGGCCAGGGTAAAACCAAGACCGGACGCTTGTGGACGTATGT
GCGTGACGAGCGCCCCACAGGTAGTAGCACACCTGCGGCGGTCTGGTTTGCCTACTCGCCGGATCGCAAAGGCGAGCACCCCCGAGCCCATCTCAAAGGC
TTCGCCGGGACATTACAAGCCGATGGTTATGCCGGTTTCGCTCAACTCTATGCCGCGGGCACGATTCACGAGGCGGCTTGTTGGGCGCATGCCCGGCGCA
AGTTTTTCGACCTGCACAAGGCATTGGCCTCACCGATCGCCGCTGAAGCCTTGCAGCGGATTGGTGCGCTATACGCCATTGAGGCGGAAATTCGTGGACA
ACCGCCCGATCAACGACGCGCAGTGCGGCAGCGGCGAGCCAGCCCGCTGCTAGCGCAATTTCACGCCTGGCTTAACCACACGCTGACGCAACTGCCGTCC
AAATCGGCGCTGAGCGGTGCGATCTACTATGCCCTCGCACGTTGGCAGGCGCTGACGCGCTACTGCACGGATGGCCGCATTGAGATCGACAACAACGCCG
CCGAACGCGCCCTGCGTACAGTCGCGCTGGGTCGCAAGAATTATCTGTTCGTCGGTTCCGACGCCGGCGGAGAAAGAGCGGCGGCGATCTATAGCCTGGT
CGGTTCAGCCAAGCTCAACGCATTGAACCCACAGGCTTATTTGACGCACGTGCTGGAGCGCATCGCCGACTATCCAATCAATCAGCTCGACGATTTGCTG
CCCTGGAATATTGCTCTGCCTACCTTAGAACATGAGGCCGCTTGACCCATGCCCAACGTCACCCGTTTGCGCCAGCCGGCCAGCGACCGTCTCCTGCAAC
TGCACATCGAGCTGACGTGGATAAAACCCGCCATCTGGCGCCGGGTCGCCGTGCCCGAGCGCATCACCCTGAGCAAGCTGCACCAGGTCATTCAGGTGGT
GATGGGCTGGAGCGACACCCATCTTCACGAGTTTGAAATCGCTGGCGAAAGCTACGGTATTCCCGATCCCGACTGGGGCCCTTCGGTCGTTTCGGAACAG
CGCAAGACACTGACAAAGGTGCTGTATGGGTCGAAGACGTTTCGTTACGTCTATGACTTTGGCGACAACTGGGAGCACCGGATCAAGACCGAGCGGCTGT
TGCCGGCCATTGCGTGCCCGCAAGTGCCGTATTGCATCGACGGCGCCAATGCCAGTCCGCCGGAGGATGTTGGCGGCGCACCGGGCTACGCTGATTTTCT
GGATGCGCTTGCCGACCCCGAGCATCCCGAATACTTGAACATGCTCGACTGGTACGGCGATACCTTCGATCCCACGGCTTTTGACCGCGACGCCATCAAC
CAGCGCCTGAAGCGGATTAAGACCCGAGGGTCAAGACGGTAGCGAGCGTACGCTTAC
ACTCCTCGTCGCAGTCGACGCCCCAACTTTTCGGTGGAGTTCAAGCGTCGTGTCGTCGAGGCCACGCTATTGCCCGGCGCATCAGTCGCGTTGATCGCCC
GAGAGCATGAGATCAATGCCAATCTCGTCTTCAAATGGCGCCGTCACTATCGGGAGGGGCAACTGGGGTCAGTCGCCCAGCAGGCCACTTTGTTACCGGT
CAATCTGAGCAAGGCGCCGACGTCTTCAGCGGAAGAGGCCGTGCGACTGCCGCTGAGTCCGGGCGGTCTGGTCGTGGAGTGCGGTCGGGTCACCTTGCGC
ATTGAGGGCGTGCCCGACCCGCAAACCTTGCAACTGGTCTTGCAGCAGGTGTTGCGATGATTGCTCCGCCCATCGGGACTCGCATTTGGATCGCTGCCGG
GGTCACGGACATGCGGCGGGGGTTCGATGGCCTGGCGGCCCTGGTGCAAACTCAGCTTGAAGCGGATCCATTCTCTGGCCAGATCTTTGTGTTTCGAGGC
CGACGGGGTGACCGGATTAAATTGCTGTGGTGGGATGGCGATGGCTTGTGTTTGTTCTGCAAGCGGCTCGAGCAGGGGCGCTTCGTCTGGCCGCAGGCCG
CCAGCGGCAGCGTATCCTTGACGACGGCACAACTCGCGATGCTGTTGGAGGGCATCGATTGGCGCCGACCGATACGCACCGCACCGGTACGAACGGTCTG
ATTTTCAACGCAGGAGAACACCAGTAACATCGCGCCATGACCTGCGCGACTGACTGCCTCCCTGACGACATCCAGGCCTTGAAAGCCTTGGTAACCGCCC
AGCGCGGGGAAATCGAGCACCTGAAACTGATGATCGCCAAGCTGCGGCGCACGCAGTTTGGTCGGCGTTCTGAGCAACTCGATGGCATGCTCGACCAACT
GCAACTGACGCTCGAGGAGTTGCAGGTCAGCGAGGCCGCACTGGCCTCGCCACCACCAGCCGAATCGCATCCACGTCCCCCTGTACGACGCAAGCCATTG
CCCGCGCACTTGCCGCGTGAAATCCACGTCCATCGACCCGATACGCAGTGTCCTGGCTGTGGCGGCGAACTTCGCCATTTGGGTGATGACGTGGCTGAAG
TTCTGGAGTACGTGCCGGCACGGTTCAAGGTGGTTCGCCATGTGCGGCCCAAGTTGGCTTGCCGCTGCTGTGACGGGATTGTGCAGGTTCCTGCCCCTAG
CCGCCCGATTGCCCGGGGGCTTGCCGGGCCAGGATTGTTGGCCCATGTGCTGGTGTCCAAATACGTGGATCATCTGCCGCTGTATCGGCAGTCCGAAATC
TATGCGCGCGAGGGCGTGTCCCTGGAGCGCTCGACCATGGCCGACTGGGTCGGCGAAGCGAGCCGGCTCTTGCAGCCATTGGTCGGCCGGTTGCGGCAGC
ACGTCATGACCAGCAACAAAGTCCACGCCGACGATACGCCGATCGCCGTGCTCGCCCCCGGCCAGGGTAAAACCAAGACCGGACGCTTGTGGACGTATGT
GCGTGACGAGCGCCCCACAGGTAGTAGCACACCTGCGGCGGTCTGGTTTGCCTACTCGCCGGATCGCAAAGGCGAGCACCCCCGAGCCCATCTCAAAGGC
TTCGCCGGGACATTACAAGCCGATGGTTATGCCGGTTTCGCTCAACTCTATGCCGCGGGCACGATTCACGAGGCGGCTTGTTGGGCGCATGCCCGGCGCA
AGTTTTTCGACCTGCACAAGGCATTGGCCTCACCGATCGCCGCTGAAGCCTTGCAGCGGATTGGTGCGCTATACGCCATTGAGGCGGAAATTCGTGGACA
ACCGCCCGATCAACGACGCGCAGTGCGGCAGCGGCGAGCCAGCCCGCTGCTAGCGCAATTTCACGCCTGGCTTAACCACACGCTGACGCAACTGCCGTCC
AAATCGGCGCTGAGCGGTGCGATCTACTATGCCCTCGCACGTTGGCAGGCGCTGACGCGCTACTGCACGGATGGCCGCATTGAGATCGACAACAACGCCG
CCGAACGCGCCCTGCGTACAGTCGCGCTGGGTCGCAAGAATTATCTGTTCGTCGGTTCCGACGCCGGCGGAGAAAGAGCGGCGGCGATCTATAGCCTGGT
CGGTTCAGCCAAGCTCAACGCATTGAACCCACAGGCTTATTTGACGCACGTGCTGGAGCGCATCGCCGACTATCCAATCAATCAGCTCGACGATTTGCTG
CCCTGGAATATTGCTCTGCCTACCTTAGAACATGAGGCCGCTTGACCCATGCCCAACGTCACCCGTTTGCGCCAGCCGGCCAGCGACCGTCTCCTGCAAC
TGCACATCGAGCTGACGTGGATAAAACCCGCCATCTGGCGCCGGGTCGCCGTGCCCGAGCGCATCACCCTGAGCAAGCTGCACCAGGTCATTCAGGTGGT
GATGGGCTGGAGCGACACCCATCTTCACGAGTTTGAAATCGCTGGCGAAAGCTACGGTATTCCCGATCCCGACTGGGGCCCTTCGGTCGTTTCGGAACAG
CGCAAGACACTGACAAAGGTGCTGTATGGGTCGAAGACGTTTCGTTACGTCTATGACTTTGGCGACAACTGGGAGCACCGGATCAAGACCGAGCGGCTGT
TGCCGGCCATTGCGTGCCCGCAAGTGCCGTATTGCATCGACGGCGCCAATGCCAGTCCGCCGGAGGATGTTGGCGGCGCACCGGGCTACGCTGATTTTCT
GGATGCGCTTGCCGACCCCGAGCATCCCGAATACTTGAACATGCTCGACTGGTACGGCGATACCTTCGATCCCACGGCTTTTGACCGCGACGCCATCAAC
CAGCGCCTGAAGCGGATTAAGACCCGAGGGTCAAGACGGTAGCGAGCGTACGCTTAC
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
390 bp | 129 aa | 71 | 460 | + | No |
AG : IS66 TnpA
ORF sequence :
MGTIVDTLPSTPRRSRRPNFSVEFKRRVVEATLLPGASVALIAREHEINANLVFKWRRHYREGQLGSVAQQATLLPVNLSKAPTSSAEEAVRLPLSPGGL
VVECGRVTLRIEGVPDPQTLQLVLQQVLR
VVECGRVTLRIEGVPDPQTLQLVLQQVLR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
345 bp | 114 aa | 457 | 801 | + | No |
AG : IS66 TnpB
ORF sequence :
MIAPPIGTRIWIAAGVTDMRRGFDGLAALVQTQLEADPFSGQIFVFRGRRGDRIKLLWWDGDGLCLFCKRLEQGRFVWPQAASGSVSLTTAQLAMLLEGI
DWRRPIRTAPVRTV
DWRRPIRTAPVRTV
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1509 bp | 502 aa | 837 | 2345 | + | No |
Chemistry : DDE
ORF sequence :
MTCATDCLPDDIQALKALVTAQRGEIEHLKLMIAKLRRTQFGRRSEQLDGMLDQLQLTLEELQVSEAALASPPPAESHPRPPVRRKPLPAHLPREIHVHR
PDTQCPGCGGELRHLGDDVAEVLEYVPARFKVVRHVRPKLACRCCDGIVQVPAPSRPIARGLAGPGLLAHVLVSKYVDHLPLYRQSEIYAREGVSLERST
MADWVGEASRLLQPLVGRLRQHVMTSNKVHADDTPIAVLAPGQGKTKTGRLWTYVRDERPTGSSTPAAVWFAYSPDRKGEHPRAHLKGFAGTLQADGYAG
FAQLYAAGTIHEAACWAHARRKFFDLHKALASPIAAEALQRIGALYAIEAEIRGQPPDQRRAVRQRRASPLLAQFHAWLNHTLTQLPSKSALSGAIYYAL
ARWQALTRYCTDGRIEIDNNAAERALRTVALGRKNYLFVGSDAGGERAAAIYSLVGSAKLNALNPQAYLTHVLERIADYPINQLDDLLPWNIALPTLEHE
AA
PDTQCPGCGGELRHLGDDVAEVLEYVPARFKVVRHVRPKLACRCCDGIVQVPAPSRPIARGLAGPGLLAHVLVSKYVDHLPLYRQSEIYAREGVSLERST
MADWVGEASRLLQPLVGRLRQHVMTSNKVHADDTPIAVLAPGQGKTKTGRLWTYVRDERPTGSSTPAAVWFAYSPDRKGEHPRAHLKGFAGTLQADGYAG
FAQLYAAGTIHEAACWAHARRKFFDLHKALASPIAAEALQRIGALYAIEAEIRGQPPDQRRAVRQRRASPLLAQFHAWLNHTLTQLPSKSALSGAIYYAL
ARWQALTRYCTDGRIEIDNNAAERALRTVALGRKNYLFVGSDAGGERAAAIYSLVGSAKLNALNPQAYLTHVLERIADYPINQLDDLLPWNIALPTLEHE
AA
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
594 bp | 197 aa | 2349 | 2942 | + | No |
Annotation : Description :
ORF sequence :
MPNVTRLRQPASDRLLQLHIELTWIKPAIWRRVAVPERITLSKLHQVIQVVMGWSDTHLHEFEIAGESYGIPDPDWGPSVVSEQRKTLTKVLYGSKTFRY
VYDFGDNWEHRIKTERLLPAIACPQVPYCIDGANASPPEDVGGAPGYADFLDALADPEHPEYLNMLDWYGDTFDPTAFDRDAINQRLKRIKTRGSRR
VYDFGDNWEHRIKTERLLPAIACPQVPYCIDGANASPPEDVGGAPGYADFLDALADPEHPEYLNMLDWYGDTFDPTAFDRDAINQRLKRIKTRGSRR
Blast result :
Comments
ISPre3 is 56%(ORF1) aa similar to ISBcen14, 79% (ORF2) to IS883, 71% (ORF3) to ISBcen14. The ORF4 is a passenger gene annotated as hypothetical protein.
References
1] Urata,M., Miyakoshi,M., Kai,S., Maeda,K., Habe,H., Omori,T., Yamane,H. and Nojiri,H. (2004) J. Bacteriol. 186 (20), 6815-6823
2] Maeda,K., Nojiri,H., Shintani,M., Yoshida,T., Habe,H. and Omori,T.
(2003) J. Mol. Biol. 326 (1), 21-33
2] Maeda,K., Nojiri,H., Shintani,M., Yoshida,T., Habe,H. and Omori,T.
(2003) J. Mol. Biol. 326 (1), 21-33