ISArch7
- Family IS66
- Group ISBst12
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CR937010 | ND | Uncultured archeon | Uncultured archeon |
DNA section
IS Length : 1839 bp
Ends
IR Length : 18/22
IRL : GTAACTACTCACGTTTTAGAAGTCAGTAGGAACCAACCCCCTTTGTAGAT
IRR : GTAACCACTCACCTTTAGGAAGCCCCCACCTCACAACCCGCCACTCCGGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AGAAAACGAA | GTTTTTGG | ATGCAAATTA | 8 |
DNA sequence
GTAACTACTCACGTTTTAGAAGTCAGTAGGAACCAACCCCCTTTGTAGATTGCACCTCCGAATCCGAACCGCCATTTCCCTTCAGATTTACAGCCTCCGA
ATGACCACAAGCTTTATATACTTGAACTATCTAATAGATAGCTAATGAAAAAAGTCCCCATAATACCGGAAACCGAGATCCGGATCAACAACATCCGTGG
ATTCTATATCCGAAAGGTCACACGTTTAGGAACGAGCGCAAAAGTCGATTGTCCAAAAGAACATTTAGGAAAAACCGTATATCTGGTGATCTTGAACCAT
GATGATGAATGAAGACCTGCCAGAGGATCTGAGGGAATATCTCCACAATTTAGAGCGAAAAATCGCTCAATTAGAGAAAGATCTCAAAAAGGAGAGCGCT
GGTAAGGAGGCATTCCGGCAAAGCAACAAAAAACTCAAAAAAGAAAACGAAAAGCTCAAAAAGGAACTTGCGCGGATGCAAGGTTCCGCACCGGTTCTTG
CAGGGTCTGATAAGACCGCCGAAGCAGGCGGCGTTCCCACTTCAAAGACTTTCTACAGACGAAACAGACAGGAAGGAAAGAAAAAGCCCACTGGTGGTCA
ACCCGGGCACCCCGGCCACGCGAGGAAAAAACCAACTCCAAACTCTCCCCCGATACCCATCACACTGGATAAATGCCCAGAATGCGGTACACCACTGGGC
GAACCCCGGAAAGGCGCCGAACAGAAGCGAACGGTGACAGATATTCCGCTTCCTGACCATATCATCTATGAAATCGTTTATCCAAGGTACTGGTGCGGGG
AGTGTAAAAAATTGGTCCGTGGCGAAGCCCCGTGGCTGCCGCCAAAGCAACATTTCGGACCTGCCGTAGCCTGCTGGATTGCGTACCAGAGAATGCTTGG
TTTGACCATTGGTAAGATACAATCGAGTTTATTAGAGACTTATGGTATTACTATGGGGGAATCAACGATTCTTAAGCTGGAAAAGTGGGTGGCAGATACA
CTCCATGAAGATTACGAGAAGATCCGCGAGGAGATCGTAAAATCCAGCGCAGTGAATGCCGACGAGACCAGTTTCAGGATCGGCGGAACGAATGGTTGGT
TGTGGGTCTTTACTTCAACGGTGGGTTCATATTACATGGTTGCTCCAACCAGAGGTCACAAAGTTCCGGAAGAAACGCTCGAAGGGTTCGAAGGCGTTCT
GGGCAGAGATGCCTGGAAGCCGTACGATGTAGTAAAATGCGAAGGGCATCAGCTCGATCTTCTTCATGTAAATCGGTGGCTGGAGAGGGCAGAGATCAAA
CACAGGATAGAACCGCGCACACTCTTATCATCTAGGTCTGCGAAATTAACGAAACCGGGAAGACCTCCCGGGCAGTTTATAGATTTTGCTGACGGGATTC
GGTCGATTCTGAAAAGGGCAGTCGAGTATACGGAGAACGACCCTCCGCCATCCATGGAAGAACGCAAAAACGCTTGCAGAGAGTTTCAGAAAGAGATGAA
AGCATTTCTTGATAGGAAATGGACCGATGATGACGCCATACGGATCTCCAAGGAACTCCGAAAGCGACTGGATATGTTATTCACGTTCATGGACCACGAG
GGCGTACCCTGGCACAACAACGACGCAGAACGAGCTATCCGCCAGGGCGTACTCCACCGCAAGATTAGTGGTGGCAGGAGGACGTGGACCGGCGCTGAAG
TATTTGAAGTGATCTTAAGCACCTACGAAACAGCAAAGAAGAGAGGAGAGAGATTTATCGAAATGGTCAGGGCAAAGTTAGATCCACCCGCCGGAGTGGC
GGGTTGTGAGGTGGGGGCTTCCTAAAGGTGAGTGGTTAC
ATGACCACAAGCTTTATATACTTGAACTATCTAATAGATAGCTAATGAAAAAAGTCCCCATAATACCGGAAACCGAGATCCGGATCAACAACATCCGTGG
ATTCTATATCCGAAAGGTCACACGTTTAGGAACGAGCGCAAAAGTCGATTGTCCAAAAGAACATTTAGGAAAAACCGTATATCTGGTGATCTTGAACCAT
GATGATGAATGAAGACCTGCCAGAGGATCTGAGGGAATATCTCCACAATTTAGAGCGAAAAATCGCTCAATTAGAGAAAGATCTCAAAAAGGAGAGCGCT
GGTAAGGAGGCATTCCGGCAAAGCAACAAAAAACTCAAAAAAGAAAACGAAAAGCTCAAAAAGGAACTTGCGCGGATGCAAGGTTCCGCACCGGTTCTTG
CAGGGTCTGATAAGACCGCCGAAGCAGGCGGCGTTCCCACTTCAAAGACTTTCTACAGACGAAACAGACAGGAAGGAAAGAAAAAGCCCACTGGTGGTCA
ACCCGGGCACCCCGGCCACGCGAGGAAAAAACCAACTCCAAACTCTCCCCCGATACCCATCACACTGGATAAATGCCCAGAATGCGGTACACCACTGGGC
GAACCCCGGAAAGGCGCCGAACAGAAGCGAACGGTGACAGATATTCCGCTTCCTGACCATATCATCTATGAAATCGTTTATCCAAGGTACTGGTGCGGGG
AGTGTAAAAAATTGGTCCGTGGCGAAGCCCCGTGGCTGCCGCCAAAGCAACATTTCGGACCTGCCGTAGCCTGCTGGATTGCGTACCAGAGAATGCTTGG
TTTGACCATTGGTAAGATACAATCGAGTTTATTAGAGACTTATGGTATTACTATGGGGGAATCAACGATTCTTAAGCTGGAAAAGTGGGTGGCAGATACA
CTCCATGAAGATTACGAGAAGATCCGCGAGGAGATCGTAAAATCCAGCGCAGTGAATGCCGACGAGACCAGTTTCAGGATCGGCGGAACGAATGGTTGGT
TGTGGGTCTTTACTTCAACGGTGGGTTCATATTACATGGTTGCTCCAACCAGAGGTCACAAAGTTCCGGAAGAAACGCTCGAAGGGTTCGAAGGCGTTCT
GGGCAGAGATGCCTGGAAGCCGTACGATGTAGTAAAATGCGAAGGGCATCAGCTCGATCTTCTTCATGTAAATCGGTGGCTGGAGAGGGCAGAGATCAAA
CACAGGATAGAACCGCGCACACTCTTATCATCTAGGTCTGCGAAATTAACGAAACCGGGAAGACCTCCCGGGCAGTTTATAGATTTTGCTGACGGGATTC
GGTCGATTCTGAAAAGGGCAGTCGAGTATACGGAGAACGACCCTCCGCCATCCATGGAAGAACGCAAAAACGCTTGCAGAGAGTTTCAGAAAGAGATGAA
AGCATTTCTTGATAGGAAATGGACCGATGATGACGCCATACGGATCTCCAAGGAACTCCGAAAGCGACTGGATATGTTATTCACGTTCATGGACCACGAG
GGCGTACCCTGGCACAACAACGACGCAGAACGAGCTATCCGCCAGGGCGTACTCCACCGCAAGATTAGTGGTGGCAGGAGGACGTGGACCGGCGCTGAAG
TATTTGAAGTGATCTTAAGCACCTACGAAACAGCAAAGAAGAGAGGAGAGAGATTTATCGAAATGGTCAGGGCAAAGTTAGATCCACCCGCCGGAGTGGC
GGGTTGTGAGGTGGGGGCTTCCTAAAGGTGAGTGGTTAC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
168 bp | 55 aa | 145 | 312 | + | No |
Annotation : Description :
ORF sequence :
MKKVPIIPETEIRINNIRGFYIRKVTRLGTSAKVDCPKEHLGKTVYLVILNHDDE
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1527 bp | 508 aa | 299 | 1825 | + | No |
Chemistry : DDE
ORF sequence :
MMMNEDLPEDLREYLHNLERKIAQLEKDLKKESAGKEAFRQSNKKLKKENEKLKKELARMQGSAPVLAGSDKTAEAGGVPTSKTFYRRNRQEGKKKPTGG
QPGHPGHARKKPTPNSPPIPITLDKCPECGTPLGEPRKGAEQKRTVTDIPLPDHIIYEIVYPRYWCGECKKLVRGEAPWLPPKQHFGPAVACWIAYQRML
GLTIGKIQSSLLETYGITMGESTILKLEKWVADTLHEDYEKIREEIVKSSAVNADETSFRIGGTNGWLWVFTSTVGSYYMVAPTRGHKVPEETLEGFEGV
LGRDAWKPYDVVKCEGHQLDLLHVNRWLERAEIKHRIEPRTLLSSRSAKLTKPGRPPGQFIDFADGIRSILKRAVEYTENDPPPSMEERKNACREFQKEM
KAFLDRKWTDDDAIRISKELRKRLDMLFTFMDHEGVPWHNNDAERAIRQGVLHRKISGGRRTWTGAEVFEVILSTYETAKKRGERFIEMVRAKLDPPAGV
AGCEVGAS
QPGHPGHARKKPTPNSPPIPITLDKCPECGTPLGEPRKGAEQKRTVTDIPLPDHIIYEIVYPRYWCGECKKLVRGEAPWLPPKQHFGPAVACWIAYQRML
GLTIGKIQSSLLETYGITMGESTILKLEKWVADTLHEDYEKIREEIVKSSAVNADETSFRIGGTNGWLWVFTSTVGSYYMVAPTRGHKVPEETLEGFEGV
LGRDAWKPYDVVKCEGHQLDLLHVNRWLERAEIKHRIEPRTLLSSRSAKLTKPGRPPGQFIDFADGIRSILKRAVEYTENDPPPSMEERKNACREFQKEM
KAFLDRKWTDDDAIRISKELRKRLDMLFTFMDHEGVPWHNNDAERAIRQGVLHRKISGGRRTWTGAEVFEVILSTYETAKKRGERFIEMVRAKLDPPAGV
AGCEVGAS
Blast result :
Comments
ISArch7 is 62% (ORF1) aa similar to ISA1214-1(ORF1) and 79% (ORF2) aa similar to ISMhu3.
References
1] Meyerdierks,A., Kube,M., Lombardot,T., Knittel,K., Bauer,M., Glockner,F.O., Reinhardt,R. and Amann,R. (2005) Environ. Microbiol. 7 (12), 1937-1951