ISArch9
- Family IS4
- Group IS4
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AY714815 | ND | uncultured archaeon | uncultured archaeon GZfos11A10 |
DNA section
IS Length : 1520 bp
Ends
IR Length : 18
IRL : CAATGGCACTACCTTAAGAAAATAATCCAAATAGAAGCAATAAAGTAAAT
IRR : CAATGGCACTACCTTAAGCGGATATCCCACCTCCACGTAATTCGTTGCGA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCTCGACAAA | GCCCATATAG | TCCGTCTTTC | 10 |
DNA sequence
CAATGGCACTACCTTAAGAAAATAATCCAAATAGAAGCAATAAAGTAAATGTGATTGGTATCGTATTATGCCAACATAAAGATTCGTAAGTTATTATGAA
TCCGCAAATGAGGTACTAACCATGTTTGATTCCATCCACAAACGGATTTTCCGCCAGATAAAGGCAATAAGGAGGCGCTTTGCACAAACTGCGGGACTAC
CCTTCAACGAGATCCTTTCGACTGAAACCATCCTCGATATTATGGATGAAGAAGTAGACGCCTATCGAGATCGTATATTTTCTCCTCTAATCACTCTATC
TGCCTTCTTGTCTCAAGTACTTAGTTCAGATCAGTCCTGCAATAATGCGGTAGCGAAGGTGATTGCGGAAAGAGCGGCGCAAGGCGAGGCGCCCTGCTCA
TCAAACAACAAATCCTACTGCAACGCCCGCAGGCGCCTGCATGAAGGCTTTGTCAAGAGATTGATGCGTGAGACGGGGAGATTACTGCATCTACAGTCTG
AAACTGATTGGAAATGGAAAGGTCGTTCAGTTAAGCTTGTGGATGGAACGACTGTATCAATGCCAGATACCCCTGAAAACCAGAAAGAGTATCCGCAGCC
TGAAGGGCAGAAAGAAGGCGTAGGCTTTCCAATCGCCCGTCTCGTCGCTATAATCTCATTATCGTGTGGAACAGTCCTTGACATAGCGATTGGGCCTTAC
AAAGGCAAGGAAACTGGCGAACATGCTCTGTTGCGCCAGATACTGGGCAGCATATCCGCAGGCGATATTATATTGGGAGATCGCTACTATTGTTCGTATT
TCTTGATTGCCATGCTACAGCGGTTGGGCGCGGACGCGGTCTTCCGGATCCATGGCAGCCGCAAAAGCGATTTCCGCCGGGGTGAGAAGCTTGGTAAAAA
AGATCACATTGTCACGTGGGAAAAGCCAAAACAACGACCGGATTGGATGGATGCGGCTACATACCGTCAAATGCCAGATACGTTGACAATACGTGAAATC
AAAATCAATGGGAAAGTCATTACCACAACCCTCCTTGACCCAAAAGAAGTTACAAGAAAGGAGATCGGTGAACTTTACACGAAACGGTGGTTGATTGAAG
TGGACTTCGATTCCATCAAAACAGTTCTCCAGATGGATGTTTTGAGGTGTAAAACACCGGATTTAGTGCGCAAGGAAATATATGTTCATCTACTGGCATA
TAATCTGATCCGAACGGTTATGGCACAGACTGCGCATCGTTATGATGTTTCACCGCGGACATTGAGCTTCAAAGGCGCGTTGCAGCAGTTAAATGCATTT
AAAGACACATTTCTGTGCGCTGACAAAAAGTCACTGCCCGGCTTGTATGAGCATCTTCTGAAAGCCATTTCTTCCCATCATGTGGGGAATATGCCCGGAC
GTAGTGAACCGCGTGTCGTCAAACGCCGACGTAAACCATATCCATTACTCACTAAACCACGGGACGAAGCTCGCAACGAATTACGTGGAGGTGGGATATC
CGCTTAAGGTAGTGCCATTG
TCCGCAAATGAGGTACTAACCATGTTTGATTCCATCCACAAACGGATTTTCCGCCAGATAAAGGCAATAAGGAGGCGCTTTGCACAAACTGCGGGACTAC
CCTTCAACGAGATCCTTTCGACTGAAACCATCCTCGATATTATGGATGAAGAAGTAGACGCCTATCGAGATCGTATATTTTCTCCTCTAATCACTCTATC
TGCCTTCTTGTCTCAAGTACTTAGTTCAGATCAGTCCTGCAATAATGCGGTAGCGAAGGTGATTGCGGAAAGAGCGGCGCAAGGCGAGGCGCCCTGCTCA
TCAAACAACAAATCCTACTGCAACGCCCGCAGGCGCCTGCATGAAGGCTTTGTCAAGAGATTGATGCGTGAGACGGGGAGATTACTGCATCTACAGTCTG
AAACTGATTGGAAATGGAAAGGTCGTTCAGTTAAGCTTGTGGATGGAACGACTGTATCAATGCCAGATACCCCTGAAAACCAGAAAGAGTATCCGCAGCC
TGAAGGGCAGAAAGAAGGCGTAGGCTTTCCAATCGCCCGTCTCGTCGCTATAATCTCATTATCGTGTGGAACAGTCCTTGACATAGCGATTGGGCCTTAC
AAAGGCAAGGAAACTGGCGAACATGCTCTGTTGCGCCAGATACTGGGCAGCATATCCGCAGGCGATATTATATTGGGAGATCGCTACTATTGTTCGTATT
TCTTGATTGCCATGCTACAGCGGTTGGGCGCGGACGCGGTCTTCCGGATCCATGGCAGCCGCAAAAGCGATTTCCGCCGGGGTGAGAAGCTTGGTAAAAA
AGATCACATTGTCACGTGGGAAAAGCCAAAACAACGACCGGATTGGATGGATGCGGCTACATACCGTCAAATGCCAGATACGTTGACAATACGTGAAATC
AAAATCAATGGGAAAGTCATTACCACAACCCTCCTTGACCCAAAAGAAGTTACAAGAAAGGAGATCGGTGAACTTTACACGAAACGGTGGTTGATTGAAG
TGGACTTCGATTCCATCAAAACAGTTCTCCAGATGGATGTTTTGAGGTGTAAAACACCGGATTTAGTGCGCAAGGAAATATATGTTCATCTACTGGCATA
TAATCTGATCCGAACGGTTATGGCACAGACTGCGCATCGTTATGATGTTTCACCGCGGACATTGAGCTTCAAAGGCGCGTTGCAGCAGTTAAATGCATTT
AAAGACACATTTCTGTGCGCTGACAAAAAGTCACTGCCCGGCTTGTATGAGCATCTTCTGAAAGCCATTTCTTCCCATCATGTGGGGAATATGCCCGGAC
GTAGTGAACCGCGTGTCGTCAAACGCCGACGTAAACCATATCCATTACTCACTAAACCACGGGACGAAGCTCGCAACGAATTACGTGGAGGTGGGATATC
CGCTTAAGGTAGTGCCATTG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1383 bp | 461 aa | 122 | 1504 | + | No |
Chemistry : DDE
ORF sequence :
MFDSIHKRIFRQIKAIRRRFAQTAGLPFNEILSTETILDIMDEEVDAYRDRIFSPLITLSAFLSQVLSSDQSCNNAVAKVIAERAAQGEAPCSSNNKSYC
NARRRLHEGFVKRLMRETGRLLHLQSETDWKWKGRSVKLVDGTTVSMPDTPENQKEYPQPEGQKEGVGFPIARLVAIISLSCGTVLDIAIGPYKGKETGE
HALLRQILGSISAGDIILGDRYYCSYFLIAMLQRLGADAVFRIHGSRKSDFRRGEKLGKKDHIVTWEKPKQRPDWMDAATYRQMPDTLTIREIKINGKVI
TTTLLDPKEVTRKEIGELYTKRWLIEVDFDSIKTVLQMDVLRCKTPDLVRKEIYVHLLAYNLIRTVMAQTAHRYDVSPRTLSFKGALQQLNAFKDTFLCA
DKKSLPGLYEHLLKAISSHHVGNMPGRSEPRVVKRRRKPYPLLTKPRDEARNELRGGGISA
NARRRLHEGFVKRLMRETGRLLHLQSETDWKWKGRSVKLVDGTTVSMPDTPENQKEYPQPEGQKEGVGFPIARLVAIISLSCGTVLDIAIGPYKGKETGE
HALLRQILGSISAGDIILGDRYYCSYFLIAMLQRLGADAVFRIHGSRKSDFRRGEKLGKKDHIVTWEKPKQRPDWMDAATYRQMPDTLTIREIKINGKVI
TTTLLDPKEVTRKEIGELYTKRWLIEVDFDSIKTVLQMDVLRCKTPDLVRKEIYVHLLAYNLIRTVMAQTAHRYDVSPRTLSFKGALQQLNAFKDTFLCA
DKKSLPGLYEHLLKAISSHHVGNMPGRSEPRVVKRRRKPYPLLTKPRDEARNELRGGGISA
Blast result :
Comments
ISArch9 is 87% aa similar to ISArch10. ISArch9 was found by screening completely sequenced genomes for sequences homologous to the ISRso13 transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-78-D(N3)-105-E(C1). The sequencing of the Uncultured archaeon GZfos11A10 is currently in progress. Accession: AY714815.
References
1] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18
2] Hallam,S.J., Putnam,N., Preston,C.M., Detter,J.C., Rokhsar,D., Richardson,P.M. and DeLong,E.F.(2004) Science 305 (5689), 1457-1462
2] Hallam,S.J., Putnam,N., Preston,C.M., Detter,J.C., Rokhsar,D., Richardson,P.M. and DeLong,E.F.(2004) Science 305 (5689), 1457-1462