ISHwa18
- Family IS200/IS605
- Group IS1341
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AM180088 | ND | Haloquadratum walsbyi | Haloquadratum walsbyi DSM 16790 |
DNA section
IS Length : 1449 bp
Ends
Left end : CAGGCGGTCGAGGCGACTGCCCCGGGGTTGAAACCGACAACACACAATCGTTATGACGATTGAGTTTGATAAAGTGTTACCCGAAACTCAGAGAACGCGG II struct. : Yes
Right end : AGACTGCGCTCCCTACGTTCACACCTCACCGAGAGGCTGTGGATGCAAAGCGCGTCATTGAACCAGGAAGCCCCGGGGCTTGACCTCGGGGTGAGTTCAC II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
AATTTTTCTC | TGTA | GCGGCTGCGC | tcac |
DNA sequence
CAGGCGGTCGAGGCGACTGCCCCGGGGTTGAAACCGACAACACACAATCGTTATGACGATTGAGTTTGATAAAGTGTTACCCGAAACTCAGAGAACGCGG
GCAAGTTACAATCCCAGAAGAACTCCGAGATGGGCTTTATTTGGAGGAAGGTGACCAATTGAAACTCACCGTCGAAAGGCTCGACTAACAATAATGAAAC
ACACGCTTCGCTTCCGCGTATACCTGTCCGATGACGTGGCTAGCGAAGCGTGGCGGCACATCGACATTCTCCGGCAGATTCGCAATCACGCCGTTCGGGA
CTACTACCGGAGTGACTACAACAACCGTCCATCTGACTACGACCAACACAATACATTCACAGACTGGACAGAACGGTGGTCAATCTTCGCTGAACCGTCA
CAACACGCCGCACAGCAAGCCATTAGTCAGATTCACAGCGATCTTGTGACCCTCCAAGAACGGCGAAATAACGGCTCCGATGTCGGGCGATTGCAGTGGC
AAGCGAAAGGCGAATTTCGGTCGGTGTCGTACAATCAGTCCAGTCGCTTCAACGTGGATCACAACACGGGCGACAACCGATTCGTGCGACTTCGAGTTGA
GAAAATCGGCTGGTTCAAGATTCGCGCAGACCGCGACATCCCACCAGTAGATGAGATTGACAAGGTTATTCTGAAAAAAGAGACAACCGGCGAGTGGTAC
GTCTCACTCGTCACAACTGTCGAAGACACACCAGACAAACCGCCACTCAGTCAGATTGAACCTGAAAATTGTGTTGGTGTCGACCTCGGCATCACAAGCT
ACATTCACACCTCGGAAAACCTGTCTGTTGATATGCTTGAGCTGTCGAATGAGTACGACCGCTACGCACGCGAACAGCGGAAACTCGACAGGAAAGAACA
CGGCTCTGCCAACTGGGAGAAACAACGCCGAAAGGTGGCACAAGCAAAACGAACAATCAAACGGAAGGTGTGTGACTACCAGCACAAACTCACGACGTGG
CTTGTCACAGAGTACGACGTTGTCGCTGTCGAGGACTTGGACGTAAAGCCGATACTGGAAACCAGCCAGAACGCCAAGAACAAGCAAGACGCCGCGTGGT
CACGCTTTCTTGAGCTGCTGGAGTACAAGGCTGACCTCCACGGCACACACGTCGAAAAGGTGAAACCAGAAGGCACGACGAAAGAATGCGCTGTGTGTGG
TGTCGAAACAGCCAAGCCAATCTGGGTACGCGAGCATTCGTGTCCGGCGTGCGGTCATACCGAAGACCGTGACTTGAACGCAGCGAAGAATATTCTCAAT
CGTGGGTTGAAGCAATTAGGGGCGGGACGCTCCGAATCAACGTCTGTGCAGACTGCGCTCCCTACGTTCACACCTCACCGAGAGGCTGTGGATGCAAAGC
GCGTCATTGAACCAGGAAGCCCCGGGGCTTGACCTCGGGGTGAGTTCAC
Protein section
ORF number : 1
ORF 1
ORF length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1239 bp | 412 aa | 194 | 1432 | + | No |
AG : TnpB
ORF sequence :
MKHTLRFRVYLSDDVASEAWRHIDILRQIRNHAVRDYYRSDYNNRPSDYDQHNTFTDWTERWSIFAEPSQHAAQQAISQIHSDLVTLQERRNNGSDVGRL
QWQAKGEFRSVSYNQSSRFNVDHNTGDNRFVRLRVEKIGWFKIRADRDIPPVDEIDKVILKKETTGEWYVSLVTTVEDTPDKPPLSQIEPENCVGVDLGI
TSYIHTSENLSVDMLELSNEYDRYAREQRKLDRKEHGSANWEKQRRKVAQAKRTIKRKVCDYQHKLTTWLVTEYDVVAVEDLDVKPILETSQNAKNKQDA
AWSRFLELLEYKADLHGTHVEKVKPEGTTKECAVCGVETAKPIWVREHSCPACGHTEDRDLNAAKNILNRGLKQLGAGRSESTSVQTALPTFTPHREAVD
AKRVIEPGSPGA
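The ORF figures above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the Begin and End coordinates are 1-based and inclusive, and that the End coordinate includes the stop codon. The function name `orf_summary` and its field names are hypothetical.

```python
def orf_summary(begin, end, is_length):
    """Derive ORF length figures from coordinates.

    Assumption: coordinates are 1-based and inclusive, and the
    interval includes the stop codon.
    """
    if not (1 <= begin <= end <= is_length):
        raise ValueError("ORF coordinates fall outside the element")
    length_bp = end - begin + 1
    if length_bp % 3 != 0:
        raise ValueError("ORF length is not a whole number of codons")
    codons = length_bp // 3
    return {
        "length_bp": length_bp,          # nucleotides spanned by the ORF
        "codons": codons,                # codons, including the stop codon
        "length_aa": codons - 1,         # residues, stop codon excluded
        "downstream_bp": is_length - end # bases between ORF end and IS end
    }


# Values taken from this record: ORF 194..1432 in a 1449 bp element.
summary = orf_summary(194, 1432, 1449)
print(summary)
```

Under these assumptions the result agrees with the table: 1239 bp and 412 aa, with the ORF ending 17 bp before the right end of the element.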
Blast result :
Comments
ISHwa18 is 93% aa similar to ISNph15. The LE secondary structure is not canonical, but there is an empty site in the Haloquadratum walsbyi C23 complete genome.
References
[1] Bolhuis, H., Palm, P., Wende, A., Falb, M., Rampp, M., Rodriguez-Valera, F., Pfeiffer, F., and Oesterhelt, D. (2006) BMC Genomics 7, 169
[2] Dyall-Smith, M.L., Pfeiffer, F., Klee, K., Palm, P., Gross, K., Schuster, S.C., Rampp, M., and Oesterhelt, D. (2011) PLoS One 6, e20968