ISArch2
- Family IS1182
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AY714861 | ND | Uncultured archaeon | Uncultured archaeon GZfos34H10 clone GZfos34H10 |
DNA section
IS Length : 1924 bp
Ends
IR Length : 14
IRL : CAGGCTGTCCCACATTTACCCAACAACCAATAGAAATTATACAATAGTAC
IRR : CAGGCTGTCCCACAAACTAATTATAGAATTTTATTATACATAAAGGCAAT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTGTCTTTACCT | TAAA | GGTATAAGCACCAATA | 4 |
DNA sequence
CAGGCTGTCCCACATTTACCCAACAACCAATAGAAATTATACAATAGTACAAAAATCTAAAAAAACGCTTTCTTTCTAACGGAATGCACATTTATAGCCA
GACTTTTCTCAAATTGCAAAAATACCACTTAGAAGAATACATATCTTAGTTCCAACCGAAATTAAACCGAAAGGTTTTTATTATGGCATGCACAACTATT
CTTTAGTTATAAAAACGAGGTGATTGTAACGGATGGCGATACGAAGCGATAAAATAGGGCAATCTTGGCTATTGCCCCCTGCTGTTAGTGAACTCATACC
AGAAGACCATATTTGCAACTTGGTTGAGGTAGTAGTGGATAACATGGACGTTGGTGAAATTGAGCAGAAGTACAGGTCCGGACCGGGCAATCCTGCATAT
TCACGGCGGATGTTGCTAAGAATAGTGATAATGGCGTCTGCTGATGCCATTTGGTCTTCGAGAAAGATAGCAAAGCTAGCTCATGAGAATGTAGTGTATA
TGTATCTAACAGGGCATGAGAAGCCGGATTTCCGAACGATATGCAATTTTAAAAAGGAGTGTGAGGGACTAATTGAAGCGGCGTTCAAGGAAACCGTCAC
GATTGCTAAAGCATTAGGAATACTTAACTTAGAGCACATATCAACCGATGGTACCAAGATGAAAGCAAATGCTTCAAACAAGCACACGCTGAGTAAGGAA
GAGATAGAGTGGATAAGAAAAATAATAGCGAGAGGGATTGCGATAGATAAAGAGGAGGACAAGCTCTACGGAGATAAGAGAGGTGACGAGCTGCCACCCG
AACTCAATACGCAGGAGAAGATTCGTGAGAAGATAAAGGAGATTGAAGAAGCTTCCGGCCAGAAAATGAAGGGCGCGGCGAAGAAAATTATCGTGCAGCA
TGTGCTTGGCGATGAAAAAGATAAGGCGGCAATAATGAAAAAGCTCGATAAGGCAGAGGAAGAGCTTACAAAGAGTGGACAAGGTGCCGTCAGCATAACC
GACCCAGAATCCCGATTTATGGAGAACAAGAAGAAACGTAAAGAGTTATCATACAATCCTCAGATAACGGTAGACCACGGTTCTGGGATAGTTCTTGCTA
ATGGCGTTACTCAGGACTGTACAGACCATTACCAGTTGCAACCGCAGCTAGAAATGACGGTTGAGAACATAGATAGCTTGCCTGAATGGACAAAGGTGAG
TATGGATAATGGCTACTTCAACGGTCCCAACCTCCGGTATTTGGAAGAGGCAGGGGTGGATGGCTATATTCCGGACAGCAAGCAGGCCCAGAAGATGAAT
GGTAAGAAGGTAAAAGACAGTCCTTATTCGAAGGACAAGTTTGTGTATGATGAAGAGAATGACCAATTTATATGCCCTAATGGAGATATACTCACCAGAA
AGGGGGAGTATGTGCGTAAAGGTAAGCTACAGTACTCTTATTATGGTGCAAACTGTGGAGAATGCCCTTTCAGGGAGGAATGTGCCGGTAAAAGCAAAAA
AAGAAAGATCACGAGCGATGACTACGAGGCAGAACGTAGGCGGATGGCAGGTAAAATGTGTTCCGAGAAAGGGAAAGAGGAATACAAGAAGCGTAAGGAG
ACTGTTGAATGGCCCTTTGGCAACATTAAGCAAAATATGAAGTTTCGGGAGTTTCATACAAGAGGGCTAGAAAATGTGCAGATCGAGCATAATCTTGTAT
GCACTGCTCATAATCTGGGGGTGATGTGGGGTAAATTAGGTGGCAGTGTGGCTGCTTTATCTAATATTAAGGGTTTGGTAGCTAATTTCGCATTTAGGGT
ATCAAGTATTTAGGTTTTTATTCATATTTCCTCTCGGAGTGTCATGAACTTTCGATTTTTGTTGATTTACTGTCATTGCCTTTATGTATAATAAAATTCT
ATAATTAGTTTGTGGGACAGCCTG
GACTTTTCTCAAATTGCAAAAATACCACTTAGAAGAATACATATCTTAGTTCCAACCGAAATTAAACCGAAAGGTTTTTATTATGGCATGCACAACTATT
CTTTAGTTATAAAAACGAGGTGATTGTAACGGATGGCGATACGAAGCGATAAAATAGGGCAATCTTGGCTATTGCCCCCTGCTGTTAGTGAACTCATACC
AGAAGACCATATTTGCAACTTGGTTGAGGTAGTAGTGGATAACATGGACGTTGGTGAAATTGAGCAGAAGTACAGGTCCGGACCGGGCAATCCTGCATAT
TCACGGCGGATGTTGCTAAGAATAGTGATAATGGCGTCTGCTGATGCCATTTGGTCTTCGAGAAAGATAGCAAAGCTAGCTCATGAGAATGTAGTGTATA
TGTATCTAACAGGGCATGAGAAGCCGGATTTCCGAACGATATGCAATTTTAAAAAGGAGTGTGAGGGACTAATTGAAGCGGCGTTCAAGGAAACCGTCAC
GATTGCTAAAGCATTAGGAATACTTAACTTAGAGCACATATCAACCGATGGTACCAAGATGAAAGCAAATGCTTCAAACAAGCACACGCTGAGTAAGGAA
GAGATAGAGTGGATAAGAAAAATAATAGCGAGAGGGATTGCGATAGATAAAGAGGAGGACAAGCTCTACGGAGATAAGAGAGGTGACGAGCTGCCACCCG
AACTCAATACGCAGGAGAAGATTCGTGAGAAGATAAAGGAGATTGAAGAAGCTTCCGGCCAGAAAATGAAGGGCGCGGCGAAGAAAATTATCGTGCAGCA
TGTGCTTGGCGATGAAAAAGATAAGGCGGCAATAATGAAAAAGCTCGATAAGGCAGAGGAAGAGCTTACAAAGAGTGGACAAGGTGCCGTCAGCATAACC
GACCCAGAATCCCGATTTATGGAGAACAAGAAGAAACGTAAAGAGTTATCATACAATCCTCAGATAACGGTAGACCACGGTTCTGGGATAGTTCTTGCTA
ATGGCGTTACTCAGGACTGTACAGACCATTACCAGTTGCAACCGCAGCTAGAAATGACGGTTGAGAACATAGATAGCTTGCCTGAATGGACAAAGGTGAG
TATGGATAATGGCTACTTCAACGGTCCCAACCTCCGGTATTTGGAAGAGGCAGGGGTGGATGGCTATATTCCGGACAGCAAGCAGGCCCAGAAGATGAAT
GGTAAGAAGGTAAAAGACAGTCCTTATTCGAAGGACAAGTTTGTGTATGATGAAGAGAATGACCAATTTATATGCCCTAATGGAGATATACTCACCAGAA
AGGGGGAGTATGTGCGTAAAGGTAAGCTACAGTACTCTTATTATGGTGCAAACTGTGGAGAATGCCCTTTCAGGGAGGAATGTGCCGGTAAAAGCAAAAA
AAGAAAGATCACGAGCGATGACTACGAGGCAGAACGTAGGCGGATGGCAGGTAAAATGTGTTCCGAGAAAGGGAAAGAGGAATACAAGAAGCGTAAGGAG
ACTGTTGAATGGCCCTTTGGCAACATTAAGCAAAATATGAAGTTTCGGGAGTTTCATACAAGAGGGCTAGAAAATGTGCAGATCGAGCATAATCTTGTAT
GCACTGCTCATAATCTGGGGGTGATGTGGGGTAAATTAGGTGGCAGTGTGGCTGCTTTATCTAATATTAAGGGTTTGGTAGCTAATTTCGCATTTAGGGT
ATCAAGTATTTAGGTTTTTATTCATATTTCCTCTCGGAGTGTCATGAACTTTCGATTTTTGTTGATTTACTGTCATTGCCTTTATGTATAATAAAATTCT
ATAATTAGTTTGTGGGACAGCCTG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1581 bp | 526 aa | 233 | 1813 | + | No |
Chemistry : DDE
ORF sequence :
MAIRSDKIGQSWLLPPAVSELIPEDHICNLVEVVVDNMDVGEIEQKYRSGPGNPAYSRRMLLRIVIMASADAIWSSRKIAKLAHENVVYMYLTGHEKPDF
RTICNFKKECEGLIEAAFKETVTIAKALGILNLEHISTDGTKMKANASNKHTLSKEEIEWIRKIIARGIAIDKEEDKLYGDKRGDELPPELNTQEKIREK
IKEIEEASGQKMKGAAKKIIVQHVLGDEKDKAAIMKKLDKAEEELTKSGQGAVSITDPESRFMENKKKRKELSYNPQITVDHGSGIVLANGVTQDCTDHY
QLQPQLEMTVENIDSLPEWTKVSMDNGYFNGPNLRYLEEAGVDGYIPDSKQAQKMNGKKVKDSPYSKDKFVYDEENDQFICPNGDILTRKGEYVRKGKLQ
YSYYGANCGECPFREECAGKSKKRKITSDDYEAERRRMAGKMCSEKGKEEYKKRKETVEWPFGNIKQNMKFREFHTRGLENVQIEHNLVCTAHNLGVMWG
KLGGSVAALSNIKGLVANFAFRVSSI
RTICNFKKECEGLIEAAFKETVTIAKALGILNLEHISTDGTKMKANASNKHTLSKEEIEWIRKIIARGIAIDKEEDKLYGDKRGDELPPELNTQEKIREK
IKEIEEASGQKMKGAAKKIIVQHVLGDEKDKAAIMKKLDKAEEELTKSGQGAVSITDPESRFMENKKKRKELSYNPQITVDHGSGIVLANGVTQDCTDHY
QLQPQLEMTVENIDSLPEWTKVSMDNGYFNGPNLRYLEEAGVDGYIPDSKQAQKMNGKKVKDSPYSKDKFVYDEENDQFICPNGDILTRKGEYVRKGKLQ
YSYYGANCGECPFREECAGKSKKRKITSDDYEAERRRMAGKMCSEKGKEEYKKRKETVEWPFGNIKQNMKFREFHTRGLENVQIEHNLVCTAHNLGVMWG
KLGGSVAALSNIKGLVANFAFRVSSI
Blast result :
Comments
ISArch2 is 62% aa similar to ISArch1.
References
1] ISfinder annotation (2005)
2] Hallam,S.J., Putnam,N., Preston,C.M., Detter,J.C., Rokhsar,D., Richardson,P.M. and DeLong,E.F. (2004) Science 305 (5689), 1457-1462
2] Hallam,S.J., Putnam,N., Preston,C.M., Detter,J.C., Rokhsar,D., Richardson,P.M. and DeLong,E.F. (2004) Science 305 (5689), 1457-1462