ISHwa21
- Family IS200/IS605
- Group IS1341
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AM180088 | ND | Haloquadratum walsbyi | Haloquadratum walsbyi DSM 16790 Haloquadratum walsbyi C23 |
DNA section
IS Length : 1590 bp
Ends
Left end : AGAATTTCGAGGAGAATACCCGCGTCTTCAGGCCGGGATGAATCCGACAACTCCTCTACAACCACCGCCGATGGCTGGACAGGATATTCCACCGTACTCA II struct. : Yes
Right end : CCAACCTCGGATAGAGGTGGTGGACTGCAAACCGTAATATCCCAACCCAGCGGTGCGGTGCCGTGGGATTCCTCCGCGTTCACGCGGAGGAGGATGTCAA II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
ATCGAGAGTA | TTGT | TTCTGTAGAT | TCAA |
CACACCACAA | TTAT | TTGATAATGT | TCAA |
CTCAGAGTTA | TTAT | TTTGATCGAG | TCAA |
TGTGAGGGAG | TTAT | TTGATGAGAC | TCAA |
DNA sequence
AGAATTTCGAGGAGAATACCCGCGTCTTCAGGCCGGGATGAATCCGACAACTCCTCTACAACCACCGCCGATGGCTGGACAGGATATTCCACCGTACTCA
AACACCAAACGTTAATACTGAAGCGAGTATAAATTATTATATAGACGTTCAGAATGTTAGAAGTCCACCGAACCCATCGAGTGAAAATCCTCAATCACGC
ACAGGTAGAGGGATCGCTTGACAGGCATGGGTGGAGTGCATCGAAACTCTGGAATGTCGCTAATTACCACTCTCGACAACAGTGGGAAGCCACGGGCGAG
ATACCTGACCACGGCGACCTCAAAAACGAGTTGAAGACTCACAACAACTACAAGGGATTGCACAGTCAATCCAGTCAGCGCGTTCTGGAGGAACTCGCTG
AAGGCTTCAACTCGTGGTACGGCAAGAGGAAGTCTGACACTCGAGCGAATCCGCCCGGCTACCGCAAACGAAACTACTACGACCAACAGGGTCGCCGCGT
CCACGAAGAACATCCTCGTTCAACAGTCACGTGGAAGCAGAACGGCATCAAACACGACGTCAAGAACAACCGTGTTCGACTCTCAAAAGGCGCGAATCAC
AAACAACACCCGAAAGCGTGGGAATACATCCTTGTTGAATACGAAACCCGGTCCAGTGTTACCGTCGAAAACGTACAACAGGTCAGAGCAGTCTACGACA
AGGCTAAACAGCGGTGGGAACTTCACCTCGTCTGCAAGCACGAAATCGAGACACCCACCGCGCCCGGCACCGAGACTGCTGGTATCGACCTCGGTATCTG
TAACTTCGCCGCTGTTGCGTATAGCACCGAGGAAGCCGACCTCTACCCCGGCAATCGCCTCAAACAAGATGGCTACTACTTCCCGAAAGAAATCGCCAAG
TGTGATGACTCTGGTGGTGAAGAAGCCACTCGACTCCATGCGAAGTGGTCGGAGCGCCGAACCCACTTCTTCCACTCCTTGGCGACACACATCGTTGAGC
GATGTATCGAGAACAGTGTTGGGCGCATTAACATCGGAAAACTCGCTGGTGTCCGCGAAGATAATAATGGTAACGGTAACTCGAAGAACTGGGGGAAGCA
CGGAAATCTCGACTTGCACGGGTGGGCGTTCGACAGGTTCTCGAACATCCTCGAATACAAGGCGAAGGTCGAGGGAATCGAAGTCGGAGAAGTGTCAGAG
CATGACACGAGCAACACATGTTGTGTCTGTGGGAGAAAAGACGAGAGTCAGCGTGTTGAGCGTGGCTTGTACGTGTGTGAAGAACACGACGACGGCGACG
GCGACGGCGACGGCGACGGCGACGGCGACGGGGATGCGTTCAACGCTGATGTGAATGGGGCGGAGAACATCCGTCTCGACATCAATCACACAGAAAGTAA
CTCTGAGTCTGCACCCGATTTGGGTGGGGATAGGAGTACCGGCTGGTTGGCACAGCCTGGAGTCTACTTTCATGACCTCTCCCGAGGATTCCAACCTCGG
ATAGAGGTGGTGGACTGCAAACCGTAATATCCCAACCCAGCGGTGCGGTGCCGTGGGATTCCTCCGCGTTCACGCGGAGGAGGATGTCAA
AACACCAAACGTTAATACTGAAGCGAGTATAAATTATTATATAGACGTTCAGAATGTTAGAAGTCCACCGAACCCATCGAGTGAAAATCCTCAATCACGC
ACAGGTAGAGGGATCGCTTGACAGGCATGGGTGGAGTGCATCGAAACTCTGGAATGTCGCTAATTACCACTCTCGACAACAGTGGGAAGCCACGGGCGAG
ATACCTGACCACGGCGACCTCAAAAACGAGTTGAAGACTCACAACAACTACAAGGGATTGCACAGTCAATCCAGTCAGCGCGTTCTGGAGGAACTCGCTG
AAGGCTTCAACTCGTGGTACGGCAAGAGGAAGTCTGACACTCGAGCGAATCCGCCCGGCTACCGCAAACGAAACTACTACGACCAACAGGGTCGCCGCGT
CCACGAAGAACATCCTCGTTCAACAGTCACGTGGAAGCAGAACGGCATCAAACACGACGTCAAGAACAACCGTGTTCGACTCTCAAAAGGCGCGAATCAC
AAACAACACCCGAAAGCGTGGGAATACATCCTTGTTGAATACGAAACCCGGTCCAGTGTTACCGTCGAAAACGTACAACAGGTCAGAGCAGTCTACGACA
AGGCTAAACAGCGGTGGGAACTTCACCTCGTCTGCAAGCACGAAATCGAGACACCCACCGCGCCCGGCACCGAGACTGCTGGTATCGACCTCGGTATCTG
TAACTTCGCCGCTGTTGCGTATAGCACCGAGGAAGCCGACCTCTACCCCGGCAATCGCCTCAAACAAGATGGCTACTACTTCCCGAAAGAAATCGCCAAG
TGTGATGACTCTGGTGGTGAAGAAGCCACTCGACTCCATGCGAAGTGGTCGGAGCGCCGAACCCACTTCTTCCACTCCTTGGCGACACACATCGTTGAGC
GATGTATCGAGAACAGTGTTGGGCGCATTAACATCGGAAAACTCGCTGGTGTCCGCGAAGATAATAATGGTAACGGTAACTCGAAGAACTGGGGGAAGCA
CGGAAATCTCGACTTGCACGGGTGGGCGTTCGACAGGTTCTCGAACATCCTCGAATACAAGGCGAAGGTCGAGGGAATCGAAGTCGGAGAAGTGTCAGAG
CATGACACGAGCAACACATGTTGTGTCTGTGGGAGAAAAGACGAGAGTCAGCGTGTTGAGCGTGGCTTGTACGTGTGTGAAGAACACGACGACGGCGACG
GCGACGGCGACGGCGACGGCGACGGCGACGGGGATGCGTTCAACGCTGATGTGAATGGGGCGGAGAACATCCGTCTCGACATCAATCACACAGAAAGTAA
CTCTGAGTCTGCACCCGATTTGGGTGGGGATAGGAGTACCGGCTGGTTGGCACAGCCTGGAGTCTACTTTCATGACCTCTCCCGAGGATTCCAACCTCGG
ATAGAGGTGGTGGACTGCAAACCGTAATATCCCAACCCAGCGGTGCGGTGCCGTGGGATTCCTCCGCGTTCACGCGGAGGAGGATGTCAA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1374 bp | 457 aa | 154 | 1527 | + | No |
AG : TnpB
ORF sequence :
MLEVHRTHRVKILNHAQVEGSLDRHGWSASKLWNVANYHSRQQWEATGEIPDHGDLKNELKTHNNYKGLHSQSSQRVLEELAEGFNSWYGKRKSDTRANP
PGYRKRNYYDQQGRRVHEEHPRSTVTWKQNGIKHDVKNNRVRLSKGANHKQHPKAWEYILVEYETRSSVTVENVQQVRAVYDKAKQRWELHLVCKHEIET
PTAPGTETAGIDLGICNFAAVAYSTEEADLYPGNRLKQDGYYFPKEIAKCDDSGGEEATRLHAKWSERRTHFFHSLATHIVERCIENSVGRINIGKLAGV
REDNNGNGNSKNWGKHGNLDLHGWAFDRFSNILEYKAKVEGIEVGEVSEHDTSNTCCVCGRKDESQRVERGLYVCEEHDDGDGDGDGDGDGDGDAFNADV
NGAENIRLDINHTESNSESAPDLGGDRSTGWLAQPGVYFHDLSRGFQPRIEVVDCKP
PGYRKRNYYDQQGRRVHEEHPRSTVTWKQNGIKHDVKNNRVRLSKGANHKQHPKAWEYILVEYETRSSVTVENVQQVRAVYDKAKQRWELHLVCKHEIET
PTAPGTETAGIDLGICNFAAVAYSTEEADLYPGNRLKQDGYYFPKEIAKCDDSGGEEATRLHAKWSERRTHFFHSLATHIVERCIENSVGRINIGKLAGV
REDNNGNGNSKNWGKHGNLDLHGWAFDRFSNILEYKAKVEGIEVGEVSEHDTSNTCCVCGRKDESQRVERGLYVCEEHDDGDGDGDGDGDGDGDAFNADV
NGAENIRLDINHTESNSESAPDLGGDRSTGWLAQPGVYFHDLSRGFQPRIEVVDCKP
Blast result :
Comments
ISHwa21 is 89% aa similar to ISHarch11.
In the genome, this transposon is targetted by mobiles elements :
in the Haloquadratum walsbyi DSM 16790:
copy A : 450666-452523 : MITE of ISHwa2 (IS4 family IS50 group), with a direct repeat of 10 pb;
copy B : complement (1293551-1291706): MITE of ISHwa2(IS4 family IS50 group), with no direct repeat;
in Haloquadratum walsbyi C23 :
Copy C : complement (2212797-2209292) : complete ISHwa2 (IS4 family IS50 group), with a direct repeat of 10 pb;
Copy D : 2890644-2892485 :MITE of ISHwa2 (IS4 family IS50 group), with a direct repeat of 10 pb.
In this file, the inserted MITE or IS were in silico deleted with one of the direct repeat.
In the genome, this transposon is targetted by mobiles elements :
in the Haloquadratum walsbyi DSM 16790:
copy A : 450666-452523 : MITE of ISHwa2 (IS4 family IS50 group), with a direct repeat of 10 pb;
copy B : complement (1293551-1291706): MITE of ISHwa2(IS4 family IS50 group), with no direct repeat;
in Haloquadratum walsbyi C23 :
Copy C : complement (2212797-2209292) : complete ISHwa2 (IS4 family IS50 group), with a direct repeat of 10 pb;
Copy D : 2890644-2892485 :MITE of ISHwa2 (IS4 family IS50 group), with a direct repeat of 10 pb.
In this file, the inserted MITE or IS were in silico deleted with one of the direct repeat.
References
1] Dyall-Smith,M.L., Pfeiffer,F., Klee,K., Palm,P., Gross,K., Schuster,S.C., Rampp, M., and Oesterhelt,D. (2011) PLoS One 6: e20968
2] Bolhuis, H., Palm, P., Wende, A., Falb, M., Rampp, M.,Rodriguez-Valera, F., Pfeiffer, F., and Oesterhelt, D. (2006) BMC Genomics 7, 169
2] Bolhuis, H., Palm, P., Wende, A., Falb, M., Rampp, M.,Rodriguez-Valera, F., Pfeiffer, F., and Oesterhelt, D. (2006) BMC Genomics 7, 169