ISPfu4
- Family IS607
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_003413 | ND | Pyrococcus furiosus | Pyrococcus furiosus DSM 3638 |
DNA section
IS Length : 1961 bp
Ends
IR Length : 0
IRL : GCCTTTTTTCACTGTTCCTCTTAACCGAAAGCTTTAAATATTTTCGTGAA
IRR : GGGGTCTGTCCGTCGTGAGTTTTTATGAGTGAATGTGAGTAAATCTTCGT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCTACTAGAGCTGAAAA | ATGATATCCAATAAATCC | 0 |
DNA sequence
GCCTTTTTTCACTGTTCCTCTTAACCGAAAGCTTTAAATATTTTCGTGAATAATTGTGAGTGTATGGTAGTAAAAGAGAAACTCTACACGGTAAAGCAGG
CAAGTGAGATACTCGGCGTCCACCCAAAGACAATCCAAAAATGGGACAGAGAAGGGAAAATCAAAACCGTTAGAACACCCGGCGGGAGAAGAAGAATACC
AGAAAGCGAAATAAAAAGACTCCTCGGAATAAGCGAGGAAAAAGGCCTAATCATCGGCTACGCAAGGGTATCAAGCCACACACAAAAAGACTACTTAGAA
AGACAAGTCAAAGCAATAGAGCAATACGCAAAAGAACGTGGGTGGCAAGTCCAAATACTCACGGACATCGGCTCAGGATTGAACGAGAACAGGAAAAACT
ACCGCAAACTCCTCGAACTCGTGGCAAAGAGAGAAGTCTCAAAAGTCATCATCACCCATCCCGACAGGCTTACACGCTTTGGCTTCAAAACTCTCGAATT
CTTCTTCAAGGAGAACGGTGCAGAGATAATCATTATCACCGACAAAGAAAAATCCCCACGAGAAGAACTCATTGAAGACTTAACAACCATAATCTCACAC
TTTGCTGGGAAACTCTACGGAATGCGCTCCCACAAATACAAAAAGCTCAAAGAAGGCGTAAAAAAACTAATCGAGGAGGTCGAGAATGGGTAAAATTGTC
TTAACATACAGAATGCCCCACAACTGGAACATTAATCCCTTCCTCAAGGAATACCAGAGACTCCTCCAGAGGGCAATTGACGAAATATGGGATAACACGA
GTTGGAAGGAAAAGAAGGTTAAGCATAGATATTCTCTCGGAAACGAGGAATACCGGTACTACGAGACAATCCGCCTAATCCCATATTTTCCACAATCAAA
CGATTTTAAGCGAGGGCTGAGGAATAAACTCCTCCGAGAATGGCCTTTTGCTAAGCACTACGTTGATTCTGCAATAAAGACTGCTTACTCAATCCTCAAA
AGCTGGAGGCAGAACTACCTCAAGGGGAGAAGAAAAAGAGTGAAACCAGTCGTTAAGAGGAAGTTCGTGAGGGTTAAAACAACACTAATGAAGGTTGAAG
GCTCGAAAATCAGAATAACAATTAAACCGAGGGAAGAATACCTTGAACTGGACTTCTCAAGGGAGTGGTTTTATGAGAGGATTAAGGATTGGAAAGTTGG
CGAGTTAATAATCAGGGAGAGGGATATCTTATTAACCTTCTCAAAGGAGGTTGAGTTCTCTGGAAGAATCAAAATCGGCATTGACAGTAACTTAACGAGC
CTTGACGTTTATCACCCTGAAAAGGGCTGGATTAGAGTGGACTTGAGCGAACTGCATAGAATTTCCGAGACTTATGATAGAATTATTGATATGCTAAAAA
GCATTCAGCGGAAAGCTCCAAAGAGGATTGGTTTATTGCTTGAGAAATACTGGACTAGGAGGAGGAACAGGATTGAGGATTACTTGAACAAACTCGCAGT
CCAGCTTTCGAGGGAGTTCCCCGATGCGATTTTCATCTTCGAGGGTTTGAACAAGTTTAAAATGCTCCAGAATGGTTCGAGAAAGTTTAACAGGAAGCTT
TCCCGTGCCACTTGGAAGAAGATTGTTGGAAAGCTTTCTTATCGTGTTCCTATCGAGTTTGTTAATCCTGCTTATACTTCCTCCACCTGCCCGATATGTG
GGAGTAAGTTAGAGTCCCGAAACGGGCTGGTGGAGTGTTTTAACTGTGGGTTTAAGGCGGATAGGCAGTTTGTTGGTGCTTTTAATATTTTGATGCGGGG
ACTTGGGGTCGCCCTGAGCGGGGTTGAGCGTGATGATTTGCCCCCCAATGAACCCAGAGGGGAGCTGAACGCGATGAGGCCCAAGTCCGTCGTGAGGGTT
GACTTGAATGGACGAAGATTTACTCACATTCACTCATAAAAACTCACGACGGACAGACCCC
CAAGTGAGATACTCGGCGTCCACCCAAAGACAATCCAAAAATGGGACAGAGAAGGGAAAATCAAAACCGTTAGAACACCCGGCGGGAGAAGAAGAATACC
AGAAAGCGAAATAAAAAGACTCCTCGGAATAAGCGAGGAAAAAGGCCTAATCATCGGCTACGCAAGGGTATCAAGCCACACACAAAAAGACTACTTAGAA
AGACAAGTCAAAGCAATAGAGCAATACGCAAAAGAACGTGGGTGGCAAGTCCAAATACTCACGGACATCGGCTCAGGATTGAACGAGAACAGGAAAAACT
ACCGCAAACTCCTCGAACTCGTGGCAAAGAGAGAAGTCTCAAAAGTCATCATCACCCATCCCGACAGGCTTACACGCTTTGGCTTCAAAACTCTCGAATT
CTTCTTCAAGGAGAACGGTGCAGAGATAATCATTATCACCGACAAAGAAAAATCCCCACGAGAAGAACTCATTGAAGACTTAACAACCATAATCTCACAC
TTTGCTGGGAAACTCTACGGAATGCGCTCCCACAAATACAAAAAGCTCAAAGAAGGCGTAAAAAAACTAATCGAGGAGGTCGAGAATGGGTAAAATTGTC
TTAACATACAGAATGCCCCACAACTGGAACATTAATCCCTTCCTCAAGGAATACCAGAGACTCCTCCAGAGGGCAATTGACGAAATATGGGATAACACGA
GTTGGAAGGAAAAGAAGGTTAAGCATAGATATTCTCTCGGAAACGAGGAATACCGGTACTACGAGACAATCCGCCTAATCCCATATTTTCCACAATCAAA
CGATTTTAAGCGAGGGCTGAGGAATAAACTCCTCCGAGAATGGCCTTTTGCTAAGCACTACGTTGATTCTGCAATAAAGACTGCTTACTCAATCCTCAAA
AGCTGGAGGCAGAACTACCTCAAGGGGAGAAGAAAAAGAGTGAAACCAGTCGTTAAGAGGAAGTTCGTGAGGGTTAAAACAACACTAATGAAGGTTGAAG
GCTCGAAAATCAGAATAACAATTAAACCGAGGGAAGAATACCTTGAACTGGACTTCTCAAGGGAGTGGTTTTATGAGAGGATTAAGGATTGGAAAGTTGG
CGAGTTAATAATCAGGGAGAGGGATATCTTATTAACCTTCTCAAAGGAGGTTGAGTTCTCTGGAAGAATCAAAATCGGCATTGACAGTAACTTAACGAGC
CTTGACGTTTATCACCCTGAAAAGGGCTGGATTAGAGTGGACTTGAGCGAACTGCATAGAATTTCCGAGACTTATGATAGAATTATTGATATGCTAAAAA
GCATTCAGCGGAAAGCTCCAAAGAGGATTGGTTTATTGCTTGAGAAATACTGGACTAGGAGGAGGAACAGGATTGAGGATTACTTGAACAAACTCGCAGT
CCAGCTTTCGAGGGAGTTCCCCGATGCGATTTTCATCTTCGAGGGTTTGAACAAGTTTAAAATGCTCCAGAATGGTTCGAGAAAGTTTAACAGGAAGCTT
TCCCGTGCCACTTGGAAGAAGATTGTTGGAAAGCTTTCTTATCGTGTTCCTATCGAGTTTGTTAATCCTGCTTATACTTCCTCCACCTGCCCGATATGTG
GGAGTAAGTTAGAGTCCCGAAACGGGCTGGTGGAGTGTTTTAACTGTGGGTTTAAGGCGGATAGGCAGTTTGTTGGTGCTTTTAATATTTTGATGCGGGG
ACTTGGGGTCGCCCTGAGCGGGGTTGAGCGTGATGATTTGCCCCCCAATGAACCCAGAGGGGAGCTGAACGCGATGAGGCCCAAGTCCGTCGTGAGGGTT
GACTTGAATGGACGAAGATTTACTCACATTCACTCATAAAAACTCACGACGGACAGACCCC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
648 bp | 215 aa | 46 | 693 | + | No |
Chemistry : Serine
ORF sequence :
MNNCECMVVKEKLYTVKQASEILGVHPKTIQKWDREGKIKTVRTPGGRRRIPESEIKRLLGISEEKGLIIGYARVSSHTQKDYLERQVKAIEQYAKERGW
QVQILTDIGSGLNENRKNYRKLLELVAKREVSKVIITHPDRLTRFGFKTLEFFFKENGAEIIIITDKEKSPREELIEDLTTIISHFAGKLYGMRSHKYKK
LKEGVKKLIEEVENG
QVQILTDIGSGLNENRKNYRKLLELVAKREVSKVIITHPDRLTRFGFKTLEFFFKENGAEIIIITDKEKSPREELIEDLTTIISHFAGKLYGMRSHKYKK
LKEGVKKLIEEVENG
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1254 bp | 417 aa | 686 | 1939 | + | No |
AG : TnpB
ORF sequence :
MGKIVLTYRMPHNWNINPFLKEYQRLLQRAIDEIWDNTSWKEKKVKHRYSLGNEEYRYYETIRLIPYFPQSNDFKRGLRNKLLREWPFAKHYVDSAIKTA
YSILKSWRQNYLKGRRKRVKPVVKRKFVRVKTTLMKVEGSKIRITIKPREEYLELDFSREWFYERIKDWKVGELIIRERDILLTFSKEVEFSGRIKIGID
SNLTSLDVYHPEKGWIRVDLSELHRISETYDRIIDMLKSIQRKAPKRIGLLLEKYWTRRRNRIEDYLNKLAVQLSREFPDAIFIFEGLNKFKMLQNGSRK
FNRKLSRATWKKIVGKLSYRVPIEFVNPAYTSSTCPICGSKLESRNGLVECFNCGFKADRQFVGAFNILMRGLGVALSGVERDDLPPNEPRGELNAMRPK
SVVRVDLNGRRFTHIHS
YSILKSWRQNYLKGRRKRVKPVVKRKFVRVKTTLMKVEGSKIRITIKPREEYLELDFSREWFYERIKDWKVGELIIRERDILLTFSKEVEFSGRIKIGID
SNLTSLDVYHPEKGWIRVDLSELHRISETYDRIIDMLKSIQRKAPKRIGLLLEKYWTRRRNRIEDYLNKLAVQLSREFPDAIFIFEGLNKFKMLQNGSRK
FNRKLSRATWKKIVGKLSYRVPIEFVNPAYTSSTCPICGSKLESRNGLVECFNCGFKADRQFVGAFNILMRGLGVALSGVERDDLPPNEPRGELNAMRPK
SVVRVDLNGRRFTHIHS
Blast result :
Comments
ISPfu4 is 69% (ORF A) and 52% (ORF B) aa similar to IS1921.
References
1] Robb,F.T., Maeder,D.L., Brown,J.R., DiRuggiero,J., Stump,M.D., Yeh,R.K., Weiss,R.B. and Dunn,D.M. (2001) Meth. Enzymol. 330, 134-157