ISArch10
- Family IS4
- Group IS4
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AY714815 | ND | uncultured archaeon | uncultured archaeon GZfos11A10 |
DNA section
IS Length : 1519 bp
Ends
IR Length : 17/18
IRL : CAATGGCACTACCTTAAGAAAATAATCCGATAGAATCAATAAAGTATATG
IRR : TAATGGCACTACCTTAAGCAGACACTCCACCTCTACGTAATTCGTTGCGA
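The 17/18 figure can be checked directly from the two end sequences listed above. The following is a minimal sketch, assuming that "17/18" means 17 identical positions over the first 18 bases of each terminus and that IRR is written as the reverse complement of the right end of the element; the helper names are hypothetical and not part of the record.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def count_ir_matches(irl: str, irr: str, length: int) -> int:
    """Count identical positions over the first `length` bases of both ends."""
    return sum(a == b for a, b in zip(irl[:length], irr[:length]))


# Terminal sequences copied from the Ends section of the record.
IRL = "CAATGGCACTACCTTAAGAAAATAATCCGATAGAATCAATAAAGTATATG"
IRR = "TAATGGCACTACCTTAAGCAGACACTCCACCTCTACGTAATTCGTTGCGA"

# Last 19 bases of the DNA sequence (the final, short sequence line).
RIGHT_END = "GCTTAAGGTAGTGCCATTA"

# Assumption: the inverted repeat spans the first 18 positions.
matches = count_ir_matches(IRL, IRR, 18)
print(f"IR identity: {matches}/18")

# Assumption: IRR is stored as the reverse complement of the right terminus.
irr_is_revcomp = IRR.startswith(reverse_complement(RIGHT_END))
print("IRR matches reverse complement of right end:", irr_is_revcomp)
```

Under these assumptions the only mismatch within the 18 bp repeat is the first base (C in IRL, T in IRR).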
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTGAGTAGCT | TGTTATACAT | GCCTCTGTAA | 10 |
DNA sequence
CAATGGCACTACCTTAAGAAAATAATCCGATAGAATCAATAAAGTATATGTGCTTGGCATCTTATTATATTATCATAAAGTTTCGTAAATTATTACGAAT
CCACAAACGAGGTGCCAACAATGTTCAATTCCATCCGCAAACGGATTTCCTACCAGATAATCGCACTGAGGCTATGCTTTGCCCAAATCGATGGTTTGCC
CTTCAGTGATGTCCTTTCGGCTGAAACCATCCGGAATATTATGGATGAAGAAGTAGGCAGTTATCGAGACCGTATATACTCCCCCCTAATCACCCTATCT
GCCTTCTTGTCTCAAGTACTCAGTTCGGATCATTCCTGCAAAAATGCGGTAGCGAAGGTGCTTGCGGAACGAGTGGCGCAAGGCAAGTTGCCCTGCTCAT
CAAACACCAAATCTTATTGCGAAGCCAGATTACGCCTACCTATAAACCTTGTCAGGAGGTTGGTGCGTGAAACCGGGAAATTACTGCACCTGAAGTCAGA
AGAGGCTTGGAAATGGAAAGGTCGTTCAGTTAAGCTTGTAGATGGAACGACGGTCTCGATGCCAGATACCCCTGAAAACCAGAAGATGTATCCACAGCCA
GAAGGGCAGAAAGAGGGGGTAGGGTTTCCAATTGCCCGTCTCGTAGCTATAATCTCATTATCTTGCGGAGCAGTCCTTGACATTGCAATTGGACCTTATA
AAGGCAAGGAAACTGGTGAACATGCTCTGTTGCGCCAGATATTGGGCAGCATATCCACAGGCGATATCCTATTGGGAGATCGCTACTATTGTTCATACTT
CTTGATTGTCATGCTGCAGCAGTTGGGCGCGGACTCAGTCTTCCGGATCCATGGCAGTCGCAAAAAAGATTTCCGCCGAGGTAAGCATCTTGGCAAAAAA
GACCACATCGTTATATGGAAAAAGCCAAAACAACGACCGAATTGGATGACTGAGTCCATGTACCTCCAAATGCCAGACACGTTGACAATACGCGAGATCA
AAATCAATAGAAAAGTTATTACTACCACTCTCCTTGACCCAAAAGAATTTACAAGAGAGGAAATCGATGAACTTTACGCGAAACGGTGGTTGATCGAAGT
GGACTTCAGGTTCATCAAAACAGTTCTCCAGATGGATATTTTGAGGTGCAAAACACCTGACATGGTTTGTAAGGAAATCTGGGTGCATTTACTGGCGTAT
AATCTGATCCGAACGGTCATGGCGCAAGCGGCGCATCGTTACAATCTTCCACCACGAACATTGAGTTTCAAAGGCACGTTACAGCAGTTAAATGCGTTTA
AGGAGAGATTTCTGCGCACTGCCAAAAAACGTTTGTCTACTATATGCGGACATCTTCTGAAAGCTATTGTCAGTCATCGCGTGGGGAACAGACCCGGACG
TAGTGAACCGCGTGCCGTCAAACGACGGCGTAAACCTTATCCGTTACTCACTAAACCACGGGAAGAAGCTCGCAACGAATTACGTAGAGGTGGAGTGTCT
GCTTAAGGTAGTGCCATTA
Protein section
ORF number : 1
ORF 1
Length (DNA) | Length (protein) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1383 bp | 461 aa | 121 | 1503 | + | No |
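The ORF lengths follow from the coordinates in the table. A minimal sketch of the arithmetic, assuming Begin and End are 1-based, inclusive positions on the IS sequence (the function name is hypothetical):

```python
def orf_length(begin: int, end: int) -> tuple[int, int]:
    """Return (nucleotides, codons) for 1-based inclusive ORF coordinates."""
    nucleotides = end - begin + 1
    if nucleotides % 3 != 0:
        raise ValueError("ORF span is not a whole number of codons")
    return nucleotides, nucleotides // 3


# Coordinates taken from the ORF 1 table.
nt_len, codons = orf_length(121, 1503)
print(nt_len, "bp;", codons, "codons")
```

With these coordinates the span is 1383 bp, or 461 codons, which agrees with the 461 aa listed for ORF 1.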
Chemistry : DDE
ORF sequence :
MFNSIRKRISYQIIALRLCFAQIDGLPFSDVLSAETIRNIMDEEVGSYRDRIYSPLITLSAFLSQVLSSDHSCKNAVAKVLAERVAQGKLPCSSNTKSYC
EARLRLPINLVRRLVRETGKLLHLKSEEAWKWKGRSVKLVDGTTVSMPDTPENQKMYPQPEGQKEGVGFPIARLVAIISLSCGAVLDIAIGPYKGKETGE
HALLRQILGSISTGDILLGDRYYCSYFLIVMLQQLGADSVFRIHGSRKKDFRRGKHLGKKDHIVIWKKPKQRPNWMTESMYLQMPDTLTIREIKINRKVI
TTTLLDPKEFTREEIDELYAKRWLIEVDFRFIKTVLQMDILRCKTPDMVCKEIWVHLLAYNLIRTVMAQAAHRYNLPPRTLSFKGTLQQLNAFKERFLRT
AKKRLSTICGHLLKAIVSHRVGNRPGRSEPRAVKRRRKPYPLLTKPREEARNELRRGGVSA
Blast result :
Comments
ISArch10 shares 87% amino acid similarity with ISArch9. It was found by screening completely sequenced genomes for sequences homologous to the ISRso13 transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-78-D(N3)-105-E(C1). Sequencing of the uncultured archaeon GZfos11A10 chromosome is currently in progress. Accession: AY714815.
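The spacing in the DDE motif can be located on the ORF 1 sequence with a simple pattern search. This is a rough sketch, assuming that "D-78-D-105-E" means exactly 78 residues between the two aspartates and 105 residues between the second aspartate and the glutamate; the region labels N2, N3 and C1 are ignored, and the function name is hypothetical. It scans the sequence for residues with that spacing and does not reproduce the multiple sequence alignment mentioned in the comments.

```python
import re

# ORF 1 protein sequence copied from the record (461 aa).
ORF1 = (
    "MFNSIRKRISYQIIALRLCFAQIDGLPFSDVLSAETIRNIMDEEVGSYRDRIYSPLITLSAFLSQVLSSDHSCKNAVAKVLAERVAQGKLPCSSNTKSYC"
    "EARLRLPINLVRRLVRETGKLLHLKSEEAWKWKGRSVKLVDGTTVSMPDTPENQKMYPQPEGQKEGVGFPIARLVAIISLSCGAVLDIAIGPYKGKETGE"
    "HALLRQILGSISTGDILLGDRYYCSYFLIVMLQQLGADSVFRIHGSRKKDFRRGKHLGKKDHIVIWKKPKQRPNWMTESMYLQMPDTLTIREIKINRKVI"
    "TTTLLDPKEFTREEIDELYAKRWLIEVDFRFIKTVLQMDILRCKTPDMVCKEIWVHLLAYNLIRTVMAQAAHRYNLPPRTLSFKGTLQQLNAFKERFLRT"
    "AKKRLSTICGHLLKAIVSHRVGNRPGRSEPRAVKRRRKPYPLLTKPREEARNELRRGGVSA"
)


def find_dde(protein: str, gap1: int = 78, gap2: int = 105) -> list[tuple[int, int, int]]:
    """Return 1-based (D, D, E) positions separated by gap1 and gap2 residues."""
    # A lookahead lets overlapping candidates be reported as well.
    pattern = re.compile(rf"(?=D.{{{gap1}}}D.{{{gap2}}}E)")
    hits = []
    for match in pattern.finditer(protein):
        d1 = match.start() + 1   # first aspartate, 1-based
        d2 = d1 + gap1 + 1       # second aspartate
        e = d2 + gap2 + 1        # glutamate
        hits.append((d1, d2, e))
    return hits


candidates = find_dde(ORF1)
print(candidates)
```

Under this reading of the spacing, the search returns a single candidate triad at D141, D220 and E326.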
References
[1] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol, 8(1):18
[2] Hallam SJ, Putnam N, Preston CM, Detter JC, Rokhsar D, Richardson PM, DeLong EF (2004) Science, 305(5689):1457-1462