ISArch8
- Family IS1634
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AY714833 | ND | Uncultured archaeon | Uncultured archaeon GZfos1D1 clone GZfos1D1 |
DNA section
IS Length : 1923 bp
Ends
IR Length : 20/24
IRL : CTTGACTTTCAAAGATAACTGAAACTGCCCCTATTTCCCGAAGTTTTTCG
IRR : CTTGAGTTTCAAACATATGTGAAAAATCGATAGCCAGAACCATCATCCAC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
CTTGACTTTCAAAGATAACTGAAACTGCCCCTATTTCCCGAAGTTTTTCGATAGCGAGCGCCTGCGTCGATCATTTCAGATGTTAGTCCCCTACTAATTA
ATAATTAATTTTACCTCTATTCCTATCGATAATTGGCCAAAATATTTATATACTATAGTCCCCTACTACTAATACATGGTCTTCCTGGAAAAGAAGAAAA
AGAAGGGACACATCTACTGGTACGCAACAGAAAGAAAAATGGTCAACGGGGTGGTGAAACGAACATGGCAGGAATATCTCGGAACGGCAGAAAAGATACG
GGAGTGCGCTAGAAGATCAAAAGATCTGCCGCACATCAAACTAAAATCGTTCCAGTACGGAAAGACCGCCGCGCTCCTCGCCGTATCTGACGAACTGAAC
TTTGTAGAGACCGTAAACAAGCACACCAACAAAAAGAAGATCGAAGGTCTGACAGTTGGTGAGTATCTACTTTTAAACATCATTGGACGGGGCGATGGGG
CATTATCAGAGAATGCCCTGCAGAAATGGTTTAACAAGTCCACACTCAGCACACTCTGGAAGTTTCCGCACCAACTGAGCTGTCAGAACTTCTTAAATCA
CTATAAATATATAGATCAGGAGACCAGCAGGAAGATAGAGGATGACCTATGCGCGGCGCTGATCGAGATGGGTCTAACGCCCCAGCTACTCTTCCTGGAC
GAGTCAAACTGGTTTACATATATTGAAAAAGGAGAGGAAATACCTCAAAAGGGAAAAAGCAAGCAGTTCAGGTACGATAAGAACCTGATCTCAGTGGGGC
TCGCAGTATCCGAAGATAATGTGCCATTTATGCATGAAACTTATGAAGGAAACACGCACGACTCAAAGATATTCCCAAGACTGCTGGACACGCTCACAGA
ACGGCTGCGTAACCTGGAAATAACCACTAAAAACCTGGTACTGGTCTTTGACAAGGGTAATAACTCTGAAGTGAATATCAATGACGTATTGTCTGATATG
CACATCGTAGCCTCTGCAAAGCCTGATCAGGCAAGGGATTTGCTGAGAATACCGCTCGATAAGTACAAATACCTGTACACAAACTCCAAAGGTCACAAAA
TATATGGTTACAGGACACAATATGAGTTTTTTGGGCGGGAATTCACTACAGTCGTTGCGTATAGCGATGCCTCGCACAAAAAACAGATGGGAAGTTATGA
GAAGCGAAAATCAAAGATTCTGGATAAATTTGCGGATCTGAAGAGAAGGCTTGGGAGCAACAGAGGTAAGGAGCGGGATGCGAGTAGCGTGGAGAGGGAA
GTAAATGAGATCATTCATAAGGATTTCAGAGCGATAATCGGATATAAGATTGGCGAGGTACCAGAGGGTAAGAAGAAACCCGCGCTTGTGTACTGGATCA
AAGACGCGGAGAAAGAACGTTACGACGGGTTCGGAAAGATGGTAATTTTCACTGACAAGGACCGATGGCACTCTGAAAAGATAGTAAAGACTTATAACAA
GAAGTCTCTTGTAGAAGACGACTTCAAACTGCTGAACGATGTATTACTCGTTCCTATAGGCCCTGTCAACCATCATAAGGACGATAACATCAGAGTGCAT
ACATTCCTTTGCGTTACAGGGCTGATTTTTTACAGATACTTAGCTTACCGGTGCAAACATTTTCATATAAGTCTGAAACGACTTGTGGAGGAGCTTTCAG
GGATACGTATTGCACTTGCGGAGAACAAGGCGAAGAGTGGCAAGATCGAGTTGGTGGTGGAGGAGATGGATTCGACACAAGCCAGGTTATTCTCCCATCT
GAATCTCGGGAAATTTATTACTGCGGGATAGTGAAATCGGTATTAAGGGGCTTAATGTTTTGGTGGGGACTATGTGGATGATGGTTCTGGCTATCGATTT
TTCACATATGTTTGAAACTCAAG
ATAATTAATTTTACCTCTATTCCTATCGATAATTGGCCAAAATATTTATATACTATAGTCCCCTACTACTAATACATGGTCTTCCTGGAAAAGAAGAAAA
AGAAGGGACACATCTACTGGTACGCAACAGAAAGAAAAATGGTCAACGGGGTGGTGAAACGAACATGGCAGGAATATCTCGGAACGGCAGAAAAGATACG
GGAGTGCGCTAGAAGATCAAAAGATCTGCCGCACATCAAACTAAAATCGTTCCAGTACGGAAAGACCGCCGCGCTCCTCGCCGTATCTGACGAACTGAAC
TTTGTAGAGACCGTAAACAAGCACACCAACAAAAAGAAGATCGAAGGTCTGACAGTTGGTGAGTATCTACTTTTAAACATCATTGGACGGGGCGATGGGG
CATTATCAGAGAATGCCCTGCAGAAATGGTTTAACAAGTCCACACTCAGCACACTCTGGAAGTTTCCGCACCAACTGAGCTGTCAGAACTTCTTAAATCA
CTATAAATATATAGATCAGGAGACCAGCAGGAAGATAGAGGATGACCTATGCGCGGCGCTGATCGAGATGGGTCTAACGCCCCAGCTACTCTTCCTGGAC
GAGTCAAACTGGTTTACATATATTGAAAAAGGAGAGGAAATACCTCAAAAGGGAAAAAGCAAGCAGTTCAGGTACGATAAGAACCTGATCTCAGTGGGGC
TCGCAGTATCCGAAGATAATGTGCCATTTATGCATGAAACTTATGAAGGAAACACGCACGACTCAAAGATATTCCCAAGACTGCTGGACACGCTCACAGA
ACGGCTGCGTAACCTGGAAATAACCACTAAAAACCTGGTACTGGTCTTTGACAAGGGTAATAACTCTGAAGTGAATATCAATGACGTATTGTCTGATATG
CACATCGTAGCCTCTGCAAAGCCTGATCAGGCAAGGGATTTGCTGAGAATACCGCTCGATAAGTACAAATACCTGTACACAAACTCCAAAGGTCACAAAA
TATATGGTTACAGGACACAATATGAGTTTTTTGGGCGGGAATTCACTACAGTCGTTGCGTATAGCGATGCCTCGCACAAAAAACAGATGGGAAGTTATGA
GAAGCGAAAATCAAAGATTCTGGATAAATTTGCGGATCTGAAGAGAAGGCTTGGGAGCAACAGAGGTAAGGAGCGGGATGCGAGTAGCGTGGAGAGGGAA
GTAAATGAGATCATTCATAAGGATTTCAGAGCGATAATCGGATATAAGATTGGCGAGGTACCAGAGGGTAAGAAGAAACCCGCGCTTGTGTACTGGATCA
AAGACGCGGAGAAAGAACGTTACGACGGGTTCGGAAAGATGGTAATTTTCACTGACAAGGACCGATGGCACTCTGAAAAGATAGTAAAGACTTATAACAA
GAAGTCTCTTGTAGAAGACGACTTCAAACTGCTGAACGATGTATTACTCGTTCCTATAGGCCCTGTCAACCATCATAAGGACGATAACATCAGAGTGCAT
ACATTCCTTTGCGTTACAGGGCTGATTTTTTACAGATACTTAGCTTACCGGTGCAAACATTTTCATATAAGTCTGAAACGACTTGTGGAGGAGCTTTCAG
GGATACGTATTGCACTTGCGGAGAACAAGGCGAAGAGTGGCAAGATCGAGTTGGTGGTGGAGGAGATGGATTCGACACAAGCCAGGTTATTCTCCCATCT
GAATCTCGGGAAATTTATTACTGCGGGATAGTGAAATCGGTATTAAGGGGCTTAATGTTTTGGTGGGGACTATGTGGATGATGGTTCTGGCTATCGATTT
TTCACATATGTTTGAAACTCAAG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1656 bp | 551 aa | 176 | 1831 | + | No |
Chemistry : DDE
ORF sequence :
MVFLEKKKKKGHIYWYATERKMVNGVVKRTWQEYLGTAEKIRECARRSKDLPHIKLKSFQYGKTAALLAVSDELNFVETVNKHTNKKKIEGLTVGEYLLL
NIIGRGDGALSENALQKWFNKSTLSTLWKFPHQLSCQNFLNHYKYIDQETSRKIEDDLCAALIEMGLTPQLLFLDESNWFTYIEKGEEIPQKGKSKQFRY
DKNLISVGLAVSEDNVPFMHETYEGNTHDSKIFPRLLDTLTERLRNLEITTKNLVLVFDKGNNSEVNINDVLSDMHIVASAKPDQARDLLRIPLDKYKYL
YTNSKGHKIYGYRTQYEFFGREFTTVVAYSDASHKKQMGSYEKRKSKILDKFADLKRRLGSNRGKERDASSVEREVNEIIHKDFRAIIGYKIGEVPEGKK
KPALVYWIKDAEKERYDGFGKMVIFTDKDRWHSEKIVKTYNKKSLVEDDFKLLNDVLLVPIGPVNHHKDDNIRVHTFLCVTGLIFYRYLAYRCKHFHISL
KRLVEELSGIRIALAENKAKSGKIELVVEEMDSTQARLFSHLNLGKFITAG
NIIGRGDGALSENALQKWFNKSTLSTLWKFPHQLSCQNFLNHYKYIDQETSRKIEDDLCAALIEMGLTPQLLFLDESNWFTYIEKGEEIPQKGKSKQFRY
DKNLISVGLAVSEDNVPFMHETYEGNTHDSKIFPRLLDTLTERLRNLEITTKNLVLVFDKGNNSEVNINDVLSDMHIVASAKPDQARDLLRIPLDKYKYL
YTNSKGHKIYGYRTQYEFFGREFTTVVAYSDASHKKQMGSYEKRKSKILDKFADLKRRLGSNRGKERDASSVEREVNEIIHKDFRAIIGYKIGEVPEGKK
KPALVYWIKDAEKERYDGFGKMVIFTDKDRWHSEKIVKTYNKKSLVEDDFKLLNDVLLVPIGPVNHHKDDNIRVHTFLCVTGLIFYRYLAYRCKHFHISL
KRLVEELSGIRIALAENKAKSGKIELVVEEMDSTQARLFSHLNLGKFITAG
Blast result :
Comments
ISArch9 is 74% aa similar to ISMac10.
References
1] Hallam,S.J., Putnam,N., Preston,C.M., Detter,J.C., Rokhsar,D., Richardson,P.M. and DeLong,E.F. (2004) Science 305 (5689), 1457-1462