ISC1926
- Family IS607
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AY671948 | ND | Sulfolobus sp. | Sulfolobus sp. L00 11 |
DNA section
IS Length : 1925 bp
Ends
IR Length : 0
IRL : AAGGGCTGAATCCTTTCTCACGTTAATAGAAATCTTTTTATATTCATTTA
IRR : GTTTGTCCCTACACTTTCATTTCAATCATTTTATGTTCATTCATGATCGA
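The IRR entry appears to be written as the reverse complement of the right end of the DNA sequence below, so that it can be compared directly with IRL. The following is a minimal sketch for checking that convention against this record. The helper names `reverse_complement` and `matching_positions` are illustrative only, and `RIGHT_END` is simply the last 50 nt of the DNA sequence copied by hand.

```python
# Sketch: check that IRR is the reverse complement of the element's right end.
# Sequences are copied from this record; the helper names are hypothetical.

IRL = "AAGGGCTGAATCCTTTCTCACGTTAATAGAAATCTTTTTATATTCATTTA"
IRR = "GTTTGTCCCTACACTTTCATTTCAATCATTTTATGTTCATTCATGATCGA"
# Last 50 nt of the DNA sequence section (top strand, 5'->3').
RIGHT_END = "TCGATCATGAATGAACATAAAATGATTGAAATGAAAGTGTAGGGACAAAC"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string (A/C/G/T)."""
    table = str.maketrans("ACGT", "TGCA")
    return seq.upper().translate(table)[::-1]


def matching_positions(a: str, b: str) -> int:
    """Count positions at which two equal-length strings carry the same base."""
    return sum(x == y for x, y in zip(a, b))


# True if the record follows the reverse-complement convention for IRR.
print(reverse_complement(IRR) == RIGHT_END)

# Ungapped comparison of the two ends; a low count would be in line with
# the "IR Length : 0" entry above.
print(matching_positions(IRL, IRR), "of", len(IRL), "positions match")
```

An ungapped position-by-position count is the simplest possible comparison; a real terminal inverted repeat search would normally allow gaps.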
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
DNA sequence
AAGGGCTGAATCCTTTCTCACGTTAATAGAAATCTTTTTATATTCATTTATTATCAACTAATTTTGTGGAGAGACTACTGAGGCCTAAGGAGGCTTGCCA
ACTACTCAGCATTTCATACTCAACTCTCCTACGGTGGATTAGAGAAGGGAAAATAAGGGTGGTAACGACTGAAGGAGGGAAGTACAGAATACCTTACAGC
GAAATTAAGAAGTACTTAGAGAAGAGGGAGGAAATAAGGGCAGTAATTTACGCAAGAGTTTCATCATCAGATCAAAAAGAAGATTTGGAGAGACAAATAA
ACTACCTAACAAATTACGCAACAGCAAAGGGTTACAAGGTAGTTGAGGTGTTGAAAGATATAGCTAGCGGGTTAAACACGCAAAGGAAAGGATTGCTGAA
GCTCTTCAAACTTGTTGAGGGGAGGAGTGTTGACGTCGTATTAATAACATACAAAGACAGACTAACGCGTTTTGGATTTGAGTACATTGAAGAGCTCTTC
TCAACCATGGGAGTTAAGATTGAAGTAGTTTTCGGAGAAGAACCTAAGGATGCCACACAAGAACTTGTGGAAGATTTGATTTCCATTATTACATCATTCG
CTGGTAAAATTTACGGTATGAGGAGTCATAAGAAGACAGTCCTAGTTCAAGGTGTAAAAAAGTTGATAGGTGAGTTAAGTGGAGAGGACGATAAAGTTAA
GGGTTAGGGTTGACTATATTACATACTCAGCACTTAAGGAAGTTGAGGGGGAGTACAGAGAGGTTCTAGAGGACGCAATAAATTATGGGCTGTCAAACAA
AACTACCTCCTTCACTAGAATTAAAGCTGGAGTTTACAAGACTGAGAGGGAAAAGCACAAGGACTTACCATCCCATTACATCTACACCGCTTGTGAAGAT
GCAAGCGAGAGGTTGGACAGCTTTGAGAAGTTGAAGAAGAGAGGTAGGAGTTACACTGAGAAACCTTCAGTGAGGAAGGTCACTGTGCATCTAGATGATC
ATCTGTGGAAGTTCAGTCTCGATAAGATCTCAATTTCCACAATGCAAGGTAGGGTTTTCATTTCACCAACCTTCCCTAAGATCTTCTGGAGATATTATAA
CACGGAGTGGAGGATTGCGAGTGAAGCCAGGTTTAAGTTGTTGAAGGGAAATGTTGTAGAGTTCTTCATAGTTTTTAAGAGGGACGAGCCTAAACCTTAT
GAACCTAAGGGTTTCATCCCCGTCGACCTTAACGAGGATTCGGTCTCTGTATTAGTTGATGGAAAACCGATGCTTTTAGAGACTAACACTAAGAGGATTA
CTCTGGGCTATGAGTATAGGAGGAAGGCAATAACAACTCGTAGGTCAGCTGAGGATAGAGAAGTGAAGAGGAAGTTAAAGAGGCTGAGGGAGAGGGATAA
GAAAGTAGTCATTAGGAGGAAGTTGGCTAAGCTGATCGTTAAAGAGGCTTTTGAAAGTATGAGTGCAATTGTCTTAGAGGCCTTGCCAAGGAGACCTCCA
GAGCATATGATAAAGGACGTGAAAGACTCTCAGCTTAGGTTGAGGATTTATAGATCGGCATTTTCCTCAATGAAGAATGCTATTATAGAGAAGGCTAAGG
AGTTTAGAGTCCCCGTAGTCTTAGTTAATCCCTCATATACTTCTTCAACTTGTCCAATCCACGGGGCGAAGATCGTTTACCAACCCGATGGGGGCGATGC
CCCAAGGGTTGGTGTTTGTGAGAAGGGGAAGGAAAAGTGGCATAGGGATGTGGTTGCCCTCTATAATTTGAGGAAAAGGGCTGGAGATGTGAGCCCCGTG
CCGTTGGGCTCGAAGGAGTCCCATGACCCACCTACCGTTAAGTTAGGCAGGTGGTTGAGGGCTAAGTCCCTACACTCGATCATGAATGAACATAAAATGA
TTGAAATGAAAGTGTAGGGACAAAC
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
666 | 221 | 42 | 707 | + | No |
Chemistry : Serine
ORF sequence :
MHLLSTNFVERLLRPKEACQLLSISYSTLLRWIREGKIRVVTTEGGKYRIPYSEIKKYLEKREEIRAVIYARVSSSDQKEDLERQINYLTNYATAKGYKV
VEVLKDIASGLNTQRKGLLKLFKLVEGRSVDVVLITYKDRLTRFGFEYIEELFSTMGVKIEVVFGEEPKDATQELVEDLISIITSFAGKIYGMRSHKKTV
LVQGVKKLIGELSGEDDKVKG
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1239 | 412 | 679 | 1917 | + | No |
AG : TnpB
ORF sequence :
MERTIKLRVRVDYITYSALKEVEGEYREVLEDAINYGLSNKTTSFTRIKAGVYKTEREKHKDLPSHYIYTACEDASERLDSFEKLKKRGRSYTEKPSVRK
VTVHLDDHLWKFSLDKISISTMQGRVFISPTFPKIFWRYYNTEWRIASEARFKLLKGNVVEFFIVFKRDEPKPYEPKGFIPVDLNEDSVSVLVDGKPMLL
ETNTKRITLGYEYRRKAITTRRSAEDREVKRKLKRLRERDKKVVIRRKLAKLIVKEAFESMSAIVLEALPRRPPEHMIKDVKDSQLRLRIYRSAFSSMKN
AIIEKAKEFRVPVVLVNPSYTSSTCPIHGAKIVYQPDGGDAPRVGVCEKGKEKWHRDVVALYNLRKRAGDVSPVPLGSKESHDPPTVKLGRWLRAKSLHS
IMNEHKMIEMKV
Blast result :
Comments
ISC1926 is 95% (ORF1) and 90% (ORF2) similar to ISSto13 at the amino acid level.
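The ORF coordinates in the Protein section can be cross-checked with simple arithmetic. The sketch below assumes that Begin and End are 1-based inclusive positions on the DNA sequence and that each span includes one terminal stop codon; the `Orf` class and `overlap_bp` helper are hypothetical names used for illustration only.

```python
# Sketch: recompute ORF lengths, reading frames and overlap from Begin/End.
# Assumes 1-based inclusive coordinates and one stop codon inside each span.
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    @property
    def length_bp(self) -> int:
        return self.end - self.begin + 1

    @property
    def length_aa(self) -> int:
        # Number of codons minus the assumed terminal stop codon.
        return self.length_bp // 3 - 1

    @property
    def frame(self) -> int:
        # Forward reading frame (0, 1 or 2) relative to position 1.
        return (self.begin - 1) % 3


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two ORFs on the same strand."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


orf1 = Orf("ORF 1", 42, 707)
orf2 = Orf("ORF 2", 679, 1917)

for orf in (orf1, orf2):
    print(orf.name, orf.length_bp, "bp,", orf.length_aa, "aa, frame", orf.frame)
print("overlap:", overlap_bp(orf1, orf2), "bp")
```

Under these assumptions the computed lengths agree with the tabulated values (666 bp / 221 aa and 1239 bp / 412 aa), and the two ORFs share positions 679 to 707 in different reading frames.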
References
[1] Blount, Z.D. and Grogan, D.W. (2005) Mol. Microbiol. 55(1), 312-325.