ISChy4
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_007503 | ND | Carboxydothermus hydrogenoformans | Carboxydothermus hydrogenoformans Z-2901 |
DNA section
IS Length : 2172 bp
Ends
IR Length : 19/27
IRL : TGTCACCGCCGGTATATAATTGACCCAGGAACAACGAAAAGGAAATGACC
IRR : TGTCAATAGCGGATTTAAATTGACCCATTTTCAGCGGTTTTAAATTGACC
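The "19/27" value can be checked against the two terminal sequences listed above. The sketch below assumes that the notation means 19 identical positions over the first 27 nt of IRL and IRR, and that IRR is written as the reverse complement of the right end so that both repeats read inward; `count_matches` is a hypothetical helper, not part of the record.

```python
# Terminal sequences copied verbatim from the record.
# Assumption: both are written 5'->3' reading inward from the IS tips.
IRL = "TGTCACCGCCGGTATATAATTGACCCAGGAACAACGAAAAGGAAATGACC"
IRR = "TGTCAATAGCGGATTTAAATTGACCCATTTTCAGCGGTTTTAAATTGACC"


def count_matches(a: str, b: str, span: int) -> int:
    """Count identical bases over the first `span` positions of a and b."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


matches = count_matches(IRL, IRR, 27)
print(f"{matches}/27")  # prints 19/27
```

Under that reading, the ungapped comparison reproduces the figure given in the record.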
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGCAAAAGTTAAGTTGATA | TTTAATC | CGATATGTTTTATTA | 7 |
DNA sequence
TGTCACCGCCGGTATATAATTGACCCAGGAACAACGAAAAGGAAATGACCCACCCCCTGGCGAAATATCCCTCTGATACCACTGCAAAGGATCGGAGGGA
AATAAATGCTAGGGAGTGGATCTATTATCATGTTACATGAATTACAAGCAAGAGGCAAGAGTATTCGTGCAATCGCACGAGAAACAGGGCATTCCAGAAA
TACGGTAAGAAAATACCTCAGAGCAGAGGGCATTCCTGAAAGGAAGCCCCGTCCCAAAAGAGGTTCAAAGCTTGACCCCTACAAAGATACTATTCAAGAG
CTCATGAATCTGGGGATATTCAATTGCGAAGTCATTTATGAAAGAATCAAGGAAGAAGGCTACACCGGAGGTCGCACTATTTTAAGGGACTATGTAAGAC
AATTTAGACCTCCAAAACAGGTCCCTGCCGTATGCCGCTATGAAACCAAGCCTGGCCAGCAAGCCCAGGTCGACTGGGGCGAATACACCTACATTGACGA
GGAAACCGGTGAAATACGTAAGCTTTACGTCTTCGTCATGGTGCTGGGCTACTCCAGAGCCATATACGTGGAATTCACAAACCGCTGCGACGTCCGTACC
TTCATCCGCTGCTTGATCCACGGGTTTGAATACTTCGGCGGAGTAACCGATATAGTTCTTACCGATAGAATGAAAACCGTAATACTGGGCACCGGTGAAA
ACAAAAAGCCCATATGGAACCCTACCTTTGAAGACCTTGCGGCGACCCTTGGATTTATTTCTAAAGTGTGCAGGGCACGGCGCCCCCAAACAAAAGGCAA
GGTAGAAAGCGGCATAGATTTTGTCAAAAACAACTTCCTGCCGGGTAGGAAGTTCGTAGACTACGGTGATTTAAACCGCCAGGCAATAGTGTGGTGCGAA
AAGAAAAACAGGAGAATACACGGTACTACCGGGGAAAGGCCTATTGACCGTCTGAAGAAGGAAAACCTCAAACCCCTGCCTGCCCTTGATAAATACCAAA
AATTCCTGGAAGAAGCAAGGAAAGTTCACAAAGACGGCTTTTTGAGTTTCGACGGCGTAAGATATGGCGTTCCCTGGCAGTATAGCGGAAAAGAGGTGGT
TGTAAGGGACAAAAACGGTAAAATCGAAATCCTCTATGATGGGAAGGTGATAGCTGTCCACGAAAAGCATTACCGTTCAAGGAGCACCGTTTTTCTCAAA
GACCAGTATAAGGGTCTAAAAGAAGCGGAGGGCATGTTTTATCCCAGGCCGAAGGCGATCAAGTTATCTTCCCTGGAAGTTGAGAAGCGTCCTCTGGGGG
TTTACGAAAGCCTCCTGGAGGTAGGCACAGTATGATAGACCTTGAAAAAGCTCGGTCCCACCTTGAAGAACTTGGGCTTTTAAGCGCAGCAGCTTTTCTT
GACGCCCTCCTGGAAAGGGCCCAGCGGGAAAACCGCACATATCTTGATTTCTTAAATGATCTGCTTGAAACTGAACTTGCCGAAAGGCAAAGGCGAAATG
TCGAGGTAAGGTCAAAACTTGCCCGGCTTCCGTATAAAAAGACCCTGAAGGAATTTGACTTTACCTTCCAGCCCAGTATCGATGAAAAACTGATAAGAGA
GCTTGCCACAATGGCCTTTGTCCACCGGGCAGAAAACGTAATATTCCTTGGGCCGCCTGGAGTAGGGAAGACGCACCTTGCCGTAGCCCTTGCTATAGAA
GCCCTATCCCAGGGCATATCAGTTTACTTTACGAGCCTTTCCAGGCTCATTGAAGACCTAAAAACAGCCCATAAAGAAAGCCGGTTGGAAAGGCGAATGA
GGATCTACCTTAGGCCCAAGCTCCTTATTATCGACGAAGTGGGCTATCTCCCTTTAGATGGCCTTGGCTCAAACCTCTTTTTCCAGCTAATTAGTGCCCG
GTATGAAAAGGGGAGCATCATCCTCACCAGCAACAAAAGCTTTGGGGAATGGGGGGAGCTCATGGGAGACCCGGTGCTTGCCACTGCAGTGTTGGATAGG
CTATTACACCATGCCCATATAATCAACATAAGGGGCAACAGCTACCGCCTAAAAGACAGGTTAAAAACCGGTCTCTACGGTAATCCACATGTCAAAGCTT
AATTTTAAAAAAAGCCAGGGTGGGTCAATTTAAAACCGCTGAAAATGGGTCAATTTAAATCCGCTATTGACA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1230 bp | 409 aa | 106 | 1335 | + | No |
Chemistry : DDE
ORF sequence :
MLGSGSIIMLHELQARGKSIRAIARETGHSRNTVRKYLRAEGIPERKPRPKRGSKLDPYKDTIQELMNLGIFNCEVIYERIKEEGYTGGRTILRDYVRQF
RPPKQVPAVCRYETKPGQQAQVDWGEYTYIDEETGEIRKLYVFVMVLGYSRAIYVEFTNRCDVRTFIRCLIHGFEYFGGVTDIVLTDRMKTVILGTGENK
KPIWNPTFEDLAATLGFISKVCRARRPQTKGKVESGIDFVKNNFLPGRKFVDYGDLNRQAIVWCEKKNRRIHGTTGERPIDRLKKENLKPLPALDKYQKF
LEEARKVHKDGFLSFDGVRYGVPWQYSGKEVVVRDKNGKIEILYDGKVIAVHEKHYRSRSTVFLKDQYKGLKEAEGMFYPRPKAIKLSSLEVEKRPLGVY
ESLLEVGTV
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
771 bp | 256 aa | 1332 | 2102 | + | No |
AG : IS21 helper
ORF sequence :
MIDLEKARSHLEELGLLSAAAFLDALLERAQRENRTYLDFLNDLLETELAERQRRNVEVRSKLARLPYKKTLKEFDFTFQPSIDEKLIRELATMAFVHRA
ENVIFLGPPGVGKTHLAVALAIEALSQGISVYFTSLSRLIEDLKTAHKESRLERRMRIYLRPKLLIIDEVGYLPLDGLGSNLFFQLISARYEKGSIILTS
NKSFGEWGELMGDPVLATAVLDRLLHHAHIINIRGNSYRLKDRLKTGLYGNPHVKA
Blast result :
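The ORF coordinates in the two tables above can be cross-checked with simple arithmetic. This is a minimal sketch assuming 1-based inclusive coordinates and a terminal stop codon that is not counted as a residue; `orf_lengths` is a hypothetical helper, and `window` is positions 1321-1340 copied from the DNA sequence of this record.

```python
from typing import Tuple


def orf_lengths(begin: int, end: int) -> Tuple[int, int]:
    """Return (length in bp, length in aa) for 1-based inclusive coordinates.

    Assumption: the last codon is a stop codon and is not counted as a residue.
    """
    bp = end - begin + 1
    if bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return bp, bp // 3 - 1


orf1 = orf_lengths(106, 1335)   # ORF 1 coordinates from the record
orf2 = orf_lengths(1332, 2102)  # ORF 2 coordinates from the record

# ORF 2 begins before ORF 1 ends, so the two frames share a few bases.
overlap_bp = 1335 - 1332 + 1

# Positions 1321-1340 of the IS, copied from the DNA sequence section.
window = "GTAGGCACAGTATGATAGAC"
shared = window[1332 - 1321: 1335 - 1321 + 1]

print(orf1, orf2, overlap_bp, shared)
# prints (1230, 409) (771, 256) 4 ATGA
```

The shared bases read ATGA: the ATG that opens ORF 2 overlaps the TGA that closes ORF 1.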
Comments
The ORF1 and ORF2 products of ISChy4 show 60% and 72% amino acid similarity, respectively, to those of ISMac3.
L1: TGACCCACCC
R1: TGACCCA
R2: TGACCCACCC
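The record does not define the L1, R1 and R2 labels. Assuming they mark repeated motifs near the left and right ends, the sketch below locates the shared core TGACCCA in both termini. `LEFT_END` and `RIGHT_END` are the first 60 nt and last 72 nt of the DNA sequence above; the helper names are hypothetical.

```python
# First 60 nt and last 72 nt of the IS, copied from the DNA sequence section.
LEFT_END = "TGTCACCGCCGGTATATAATTGACCCAGGAACAACGAAAAGGAAATGACCCACCCCCTGG"
RIGHT_END = ("AATTTTAAAAAAAGCCAGGGTGGGTCAATTTAAAACCGCTGAAAATGGGTCAATTTAAAT"
             "CCGCTATTGACA")


def reverse_complement(seq: str) -> str:
    """Reverse-complement an uppercase DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_all(seq: str, motif: str) -> list:
    """Return 1-based start positions of every occurrence, overlaps included."""
    hits = []
    i = seq.find(motif)
    while i != -1:
        hits.append(i + 1)
        i = seq.find(motif, i + 1)
    return hits


# The right end is reverse-complemented so positions count inward from the tip.
right_inward = reverse_complement(RIGHT_END)

print(find_all(LEFT_END, "TGACCCA"))         # prints [21, 46]
print(find_all(right_inward, "TGACCCA"))     # prints [21, 46]
print(find_all(LEFT_END, "TGACCCACCC"))      # prints [46]
print(find_all(right_inward, "TGACCCACCC"))  # prints [46]
```

Counted inward from each tip, the core motif sits at the same two offsets in both ends, and the longer TGACCCACCC form occurs at the inner position.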
References
[1] Wu, M., Ren, Q., Durkin, A.S., Daugherty, S.C., Brinkac, L.M., Dodson, R.J., Madupu, R., Sullivan, S.A., Kolonay, J.F., Nelson, W.C., Tallon, L.J., Jones, K.M., Ulrich, L.E., Gonzalez, J.M., Zhulin, I.B., Robb, F.T. and Eisen, J.A. (2005) PLoS Genet. 1 (5), e65.