ISH34
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AM774415 | ND | Halobacterium salinarum | Halobacterium salinarum R1 (DSM 671) |
DNA section
IS Length : 1853 bp
Ends
Left end : ACAACCTGCGAGGGCGAGTTCCCCGACCCACCGGGGTCGGGGGTGAAGCCCGACACTCGACCTAATGTTCTGTCTGCTCGATATACCGTTCAACCACTTC II struct. : Yes
Right end : GCCGCCCAACCTCATCCCGTTCCCTCGCGGAACAGGGAGTGTTAGCGCACGGCTGAGGGAACTACAAAAGCCTCGGGCCATCGTGCCCGAGGCTGTTTAC II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GCATCACGCACTT | CTAA | TCCGTTCTCGCG | TTAC |
DNA sequence
ACAACCTGCGAGGGCGAGTTCCCCGACCCACCGGGGTCGGGGGTGAAGCCCGACACTCGACCTAATGTTCTGTCTGCTCGATATACCGTTCAACCACTTC
TTCTGACACCTCACCCGTCGTCCCCACGTAGTACCCAACCTTCCAGAACCCACCACCCCAGAAATACGACTCCCGAATCTCAGGATACCTTTCCAGCAAG
TGCTTGCCCGAATACGACTTGAACTGTCGAGCGATGTCGGCAGGACTGTGCTTCGGGTCGCACTGGACGAACAGGTGAACGTGGTCGTCTGCTATCTCCA
ACGCCAGAATCTCGTGCCCGAAGTGGTCGGCAGTCTCCGAAAACAACTCCCGCACATCGTCTTCAACCACGTTGAGAACCGGGTGGCGGTACTTGGGACA
CCACACAAAGTGATACTTGCAGGAACTAACCGAATGCGCATGACTGCGGTACTCCTCCATCCCTAATCCCTACATTTATGACAGTATGGAACGATTATCA
AGGTATGGGTGAAGAAGCCACGAAAACCATTCAGACGCGCCTTCAAATAGCGTCTGGTGAACGGTCGTGGCTTCACGACGCCCGCCTCGCCTCACGCGAG
ATATTCAACCAAACCATCCGCCTCAAACAGCAAGGGTACAATCGCACCGAGATACAGCGGGAAGTTGACCGCGACGACTTCTTGCGGAACAACAAGTGCG
CGGTCGTCGGGAAAGCCCTCCAAACGTGGAACTCCTACCAGTCACTCAAAGACTGGTGGGAGAACCAAGACGACCCTGACGGCGGGAAGCCGACCCCGCC
GAGTACCGACAAATCTGGTGTGTACCCGCTCGTGATGGCGCACGCGGAAGGCTACCGCCTCACCGTAGATGACGACACGAACCGCGTCCGGTTCCGCATC
AGCCCGAAACCCTACAAGAAGGTGAAGGGCCACCTTCGCGGAGAGCCGGACGCGATGGATGAACTTCGAGACGCCATCACGTCGGATAAGATCGATGTGG
GGCAGGCAGAACTCCTGTACCGCGATGGCGTGTACTACCTACACGTCACGGTCACACGCGAGTTCGACGTGCCCGAACCAGACACCAGCGACACGCTCGT
CGGCGTGGACATCAACGAGCGTAACGTCGCACTCACCGCCCTCGACCGCGAGACGATGCGGACAAAGGGCACGCTCGTCCTCGACTATGGACGGGTGAAG
GAGGAACGTCAACGCTACCATACAATCACCACTCGCTGTCAGGAACACGGCAAGACGAGCATCCACCGGAAACTCGGTGACGACGAAGAGCGATTCACCG
AGTGGGTGTTGCACCGTCTCTCCCGTGCGGTCGTGGAGTTCTCGGAGCAGTTCTCGAATCCGGTTATCGTGTTCGAGGATATGACCGGCATCCGCGACGA
AATCAAGTACGGGACGTATATGAACCGGCGGTTGCACAAATTGCCGTTCCACAAGTTCGAGACGTTCGTCTCGTATAAGGCGACGTGGCGAGAGATTCCT
ATGGATACGGTGGATGCGTACTACAATTCGAAGACGTGTTCGTGCTGTGGTGAGCGTGGGAGTCGTCAAGGACGGCGTTTCCGGTGTACGAACGACGAGT
GTGTTGTGGTGCAAGATCACGCCGACCGGAATGCGTCGGTGAACATCGCGTGGCGCGAGACGCTGAAACTCGACGGTAACGAATCGAATTACCGGACTCA
CAAAACCCAACCACAAGTTCGGTTGGTGCGTCTGTCCGGGTCGGGGCGCGTAAGCCGCCCAACCTCATCCCGTTCCCTCGCGGAACAGGGAGTGTTAGCG
CACGGCTGAGGGAACTACAAAAGCCTCGGGCCATCGTGCCCGAGGCTGTTTAC
TTCTGACACCTCACCCGTCGTCCCCACGTAGTACCCAACCTTCCAGAACCCACCACCCCAGAAATACGACTCCCGAATCTCAGGATACCTTTCCAGCAAG
TGCTTGCCCGAATACGACTTGAACTGTCGAGCGATGTCGGCAGGACTGTGCTTCGGGTCGCACTGGACGAACAGGTGAACGTGGTCGTCTGCTATCTCCA
ACGCCAGAATCTCGTGCCCGAAGTGGTCGGCAGTCTCCGAAAACAACTCCCGCACATCGTCTTCAACCACGTTGAGAACCGGGTGGCGGTACTTGGGACA
CCACACAAAGTGATACTTGCAGGAACTAACCGAATGCGCATGACTGCGGTACTCCTCCATCCCTAATCCCTACATTTATGACAGTATGGAACGATTATCA
AGGTATGGGTGAAGAAGCCACGAAAACCATTCAGACGCGCCTTCAAATAGCGTCTGGTGAACGGTCGTGGCTTCACGACGCCCGCCTCGCCTCACGCGAG
ATATTCAACCAAACCATCCGCCTCAAACAGCAAGGGTACAATCGCACCGAGATACAGCGGGAAGTTGACCGCGACGACTTCTTGCGGAACAACAAGTGCG
CGGTCGTCGGGAAAGCCCTCCAAACGTGGAACTCCTACCAGTCACTCAAAGACTGGTGGGAGAACCAAGACGACCCTGACGGCGGGAAGCCGACCCCGCC
GAGTACCGACAAATCTGGTGTGTACCCGCTCGTGATGGCGCACGCGGAAGGCTACCGCCTCACCGTAGATGACGACACGAACCGCGTCCGGTTCCGCATC
AGCCCGAAACCCTACAAGAAGGTGAAGGGCCACCTTCGCGGAGAGCCGGACGCGATGGATGAACTTCGAGACGCCATCACGTCGGATAAGATCGATGTGG
GGCAGGCAGAACTCCTGTACCGCGATGGCGTGTACTACCTACACGTCACGGTCACACGCGAGTTCGACGTGCCCGAACCAGACACCAGCGACACGCTCGT
CGGCGTGGACATCAACGAGCGTAACGTCGCACTCACCGCCCTCGACCGCGAGACGATGCGGACAAAGGGCACGCTCGTCCTCGACTATGGACGGGTGAAG
GAGGAACGTCAACGCTACCATACAATCACCACTCGCTGTCAGGAACACGGCAAGACGAGCATCCACCGGAAACTCGGTGACGACGAAGAGCGATTCACCG
AGTGGGTGTTGCACCGTCTCTCCCGTGCGGTCGTGGAGTTCTCGGAGCAGTTCTCGAATCCGGTTATCGTGTTCGAGGATATGACCGGCATCCGCGACGA
AATCAAGTACGGGACGTATATGAACCGGCGGTTGCACAAATTGCCGTTCCACAAGTTCGAGACGTTCGTCTCGTATAAGGCGACGTGGCGAGAGATTCCT
ATGGATACGGTGGATGCGTACTACAATTCGAAGACGTGTTCGTGCTGTGGTGAGCGTGGGAGTCGTCAAGGACGGCGTTTCCGGTGTACGAACGACGAGT
GTGTTGTGGTGCAAGATCACGCCGACCGGAATGCGTCGGTGAACATCGCGTGGCGCGAGACGCTGAAACTCGACGGTAACGAATCGAATTACCGGACTCA
CAAAACCCAACCACAAGTTCGGTTGGTGCGTCTGTCCGGGTCGGGGCGCGTAAGCCGCCCAACCTCATCCCGTTCCCTCGCGGAACAGGGAGTGTTAGCG
CACGGCTGAGGGAACTACAAAAGCCTCGGGCCATCGTGCCCGAGGCTGTTTAC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
399 bp | 132 aa | 460 | 62 | - | No |
Chemistry : Y1
ORF sequence :
MEEYRSHAHSVSSCKYHFVWCPKYRHPVLNVVEDDVRELFSETADHFGHEILALEIADDHVHLFVQCDPKHSPADIARQFKSYSGKHLLERYPEIRESYF
WGGGFWKVGYYVGTTGEVSEEVVERYIEQTEH
WGGGFWKVGYYVGTTGEVSEEVVERYIEQTEH
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1305 bp | 434 aa | 505 | 1809 | + | No |
AG : TnpB
ORF sequence :
MGEEATKTIQTRLQIASGERSWLHDARLASREIFNQTIRLKQQGYNRTEIQREVDRDDFLRNNKCAVVGKALQTWNSYQSLKDWWENQDDPDGGKPTPPS
TDKSGVYPLVMAHAEGYRLTVDDDTNRVRFRISPKPYKKVKGHLRGEPDAMDELRDAITSDKIDVGQAELLYRDGVYYLHVTVTREFDVPEPDTSDTLVG
VDINERNVALTALDRETMRTKGTLVLDYGRVKEERQRYHTITTRCQEHGKTSIHRKLGDDEERFTEWVLHRLSRAVVEFSEQFSNPVIVFEDMTGIRDEI
KYGTYMNRRLHKLPFHKFETFVSYKATWREIPMDTVDAYYNSKTCSCCGERGSRQGRRFRCTNDECVVVQDHADRNASVNIAWRETLKLDGNESNYRTHK
TQPQVRLVRLSGSGRVSRPTSSRSLAEQGVLAHG
TDKSGVYPLVMAHAEGYRLTVDDDTNRVRFRISPKPYKKVKGHLRGEPDAMDELRDAITSDKIDVGQAELLYRDGVYYLHVTVTREFDVPEPDTSDTLVG
VDINERNVALTALDRETMRTKGTLVLDYGRVKEERQRYHTITTRCQEHGKTSIHRKLGDDEERFTEWVLHRLSRAVVEFSEQFSNPVIVFEDMTGIRDEI
KYGTYMNRRLHKLPFHKFETFVSYKATWREIPMDTVDAYYNSKTCSCCGERGSRQGRRFRCTNDECVVVQDHADRNASVNIAWRETLKLDGNESNYRTHK
TQPQVRLVRLSGSGRVSRPTSSRSLAEQGVLAHG
Blast result :
Comments
ISH34 is 94% (TnpA : the transposase) and 96% (TnpB) aa similar to ISNph6.
References
1] Pfeiffer F., Schuster S.C., Broicher A., Falb M., Palm P., Rodewald K., Ruepp.A., Soppa J., Tittor J., Oesterhelt D. (2008) Genomics 91: 335-346