ISSce1
- Family IS1
- Group ISMhu11
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_010162 | ND | Sorangium cellulosum | Sorangium cellulosum 'So ce 56' |
DNA section
IS Length : 4601 bp
Ends
IR Length : 17
IRL : GGTAATGGTCAAGGTGCAGCCCACTTTGACCAATGGTCGGACACAGGCTG
IRR : GGTAATGGTCAAGGTGCCAGTCGATTGGTCAAAGTGAGCACTTCGCGATA
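The IR length above can be cross-checked against the two listed end sequences. The sketch below is illustrative only and assumes, as the data in this record suggest, that IRL is the first 50 nt of the element and IRR is the reverse complement of its last 50 nt. `RIGHT_END` is copied from the end of the DNA sequence in this record; the function names are hypothetical helpers, not part of any database tooling.

```python
# Terminal sequences as listed in the record (50 nt each).
IRL = "GGTAATGGTCAAGGTGCAGCCCACTTTGACCAATGGTCGGACACAGGCTG"
IRR = "GGTAATGGTCAAGGTGCCAGTCGATTGGTCAAAGTGAGCACTTCGCGATA"

# Last 50 nt of the DNA sequence given in this record (top strand).
RIGHT_END = "TATCGCGAAGTGCTCACTTTGACCAATCGACTGGCACCTTGACCATTACC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def shared_prefix_length(a: str, b: str) -> int:
    """Count identical positions from the start until the first mismatch."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


# Number of perfectly matching terminal positions between the two ends.
perfect_ir = shared_prefix_length(IRL, IRR)

# Assumption check: IRR is written as the reverse complement of the right end.
irr_matches_right_end = reverse_complement(IRR) == RIGHT_END

print(perfect_ir, irr_matches_right_end)
```

With the sequences above, the uninterrupted match between IRL and IRR is 17 positions, which agrees with the listed IR length.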
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGAGGGTGTCGACAGCCTCCCGGGAGC | GTTGAAAC | TGAAACCGGAGGGTGTCGACAGCCTCC | 8 |
DNA sequence
GGTAATGGTCAAGGTGCAGCCCACTTTGACCAATGGTCGGACACAGGCTGTCCTAGCTGCCGTGCGGCTCTGACAGCGAGGAATCGCGAAATCAGCTAGA
ATTAGGGCCTACCAAGCAAAAGCGCCATGTCCGGCTGGACCCCGGTCATGGCGCCAGAGCACCGAGCGGTAGGCCCGCCCGTCGCCCGCAACGTCGAGCA
TGGGCCATGCCGCTGAGAAAGGCAAGGCCCGGCCATGTCTCTGCTGTCGGCGGAGGCGCGCGCTCGCGTCGTCTCTCAGCTCGTCGAAGGCTGCTCCATC
GCGTCCACGTGCAGGCTGACCGGGGTCAGCAAGCCGACCGTGCTCGCGCTGCTGCTCCGCATCGGAGCAGGGTGCGAACGGCTGCACAATCGCATCGTCC
GTGGCGTCACATGCCACGTTGCGCAGTGCGACGAGATCTGGAGCTACGTCCAGAAGAAGCAGTCTCGGGTGACAGCCTCCGATCCTGCTGAGTACGGCGA
CGCGTACACGTTCGTCGGGATGGCGAGCGCGTCGAAGCTCATCATCAGCTATCGCGTGGGGAAGCGCGATGAGGAGAACACGCGCGCGTTCGTCAAGGAT
CTGCGCGCTCGGCTCACCACCATCCCGCAGCTCTACACGGACGGCTGGCAGCCGTACATCGGGGCGGTCGGAGCGTCGTTTACGGGCGGCGTCGACTACT
GCCAGGTCGTCAAGAACTACAGTCGCAGACCGCGCCGAGACGATGAGGTGAGGTACGAGCCGCCGCGCGACCCGTTCATCACCAAGACACCGATCTTCGG
CATCCCTGATGTGGAGCACGCTTCTACAAGTCACGTGGAGCGCCAGAACTGGACCATCCGCATGCACATCCGCCGGTTCACCCGGCTCTGCAACGGCTTC
TCGCGGAAGCTGGCGAACCACCGCGCCGCTGTCGCGCTCCACGTCGCCTGGTACAACCTGGCTCGGATCCACGAGTCGATCCGCTGCACGCCCGCCATGG
AGGCGGGCATCGCGCGGCACGTCTGGTCGATCCCGGAGCTCGTCGAGGCCGCCCTGGCGGAGCCGGAGACCGAGCCGCCGACGCCGGAACCGCTCAAGCT
CCGGACGCCGCCCGCCGGGACGCCGACGACGCCCGCCAGGGCGCTGCCGAACGGTAGAGGGTTCCTGCGACTGGTCGGCGCCGCGCCGGCTGCGCCGGTG
GCCCCTACGCCGCCCGAGGGGCCGGAGCCACCGCCCGCGCCGGCGCCGCGCAGGATGGTGCAGCTAAATTTATTCAACGAATAACCATCGGGTTGCGCTC
TACCCTGTGCACTGCCTCGCGCAGAGGTCACACCCGCCGCAGCATGGAGCACACGCTGCGCTAGAGCGCGCTGAGTCCCCCTAGATGAACGACGAGAGCA
CCGTATCCTACAACAGCGCCTTGGGATACTTTTCTGGACCGATGACCGCCTCATGCCCGACAATCGAAGGCATGGGGCGCAGGTCGCGGGCAGATCGGGC
GGGGCAGGGGCATCTCTTCCGGCAGGGCACCCTCTATTACGGGGACAACCTCAAAGTGCTCCGGGAGCATGTGAGGGACGAGTCTGTAGACCTGATTTAT
CTGGACCCTCCGTTCAACTCGAAGCGAAATTACAACGTCATATACAAAGAGCCCGACTCTAGCGACTCAGTAGCTCAGAAGCGCGCGTTTGACGATTCGT
GGCATTGGGACTTCGCCGCGGACGCTGCGTACAGGCGACTTGTTGGCAGCGGCGCAGAGGAGCGCGGGGTGCCGACGAAGCTGGTTTCGCTTGTCGAGGC
ATTTAGAATATTCCTTGGCCAGACCGACATGTTGGCCTACGTGGTCATGATGGCGGAGCGAATAGTGGAGCTGCATCGGGTTCTGAAGCGAACCGGAAGC
CTGTATCTCCATTGCGATCCAACGGCGAGCCACTACCTGAAACTGGTGCTCGATGCGATATTTGGACCAGATAACTTTCGCAACGAGATCGTGTGGCAGC
GCTCGACGGCCAAGAACGATCCGAGCAGATATGGCCGCTGTCATGATATCATCTTCTTCTACACGAAGAGTCAAGAATTCTATTGGGACACGCAATATAG
TCCATTTCAGGACTATTCAGTAGAAAAGAACTACACGGCCGTGGAAGAGGGTACCGGCAGGAGATATAGGCTTAGCGATCTCACGGCAAACAAGCCCGGT
GGGGACACGGACTACGAGTGGCACGGCAAGCGCCCATACAGAGGAAGGTTCTGGGCCTTCTCAAAGGAGAAGATGGACCAGATGTACGCAGACGGGCGAA
TCGTCTTCCGGCGCACGGGGATGCCTGTGTACAAGCGGTATTTGGATGAGATGCCAGGCGTTCCTTTCCAGGATGTGTGGACGGACGTAAGGCTTGCGTC
TGCCTCTACGGAGCGGATTGGCTACCCAACACAGAAGCCGCTGGCGCTGCTGGAGCGCATCATCGCGTCTTCTTCGAAGAGCGGCGACCTGGTGCTCGAT
CCGTTTTGTGGATGCGGCACGACGATCGAAGCCGCTCATAAGCTGGGCCGAAAATGGGTTGGGATTGACATTACGTATTTGTCTATCGACATAATCAAAG
GGCGCATCGACGCTCTGTCGCCTGGTAGTGACGATATGTACACAGTCATTGGCGAGCCGGTTGATGTAGAATCCGCGCGCAGGCTTGCCGAGGAAGACCC
CGAGGAGTTCCAGCGGTGGGCTGTGCCGTTCATCGGTGCTCGACATGTCGGCGACGGGCCTGGGGCTGGTACCTTCAAACGAGGGCGCGACCGCGGGGTC
GACGGAACAATACGATTCCAAGATGACACGGATGCGCCGAGCAAACGAGCCATCGTCTCAGTGAAGGCTGGGCAGCGGCTCGGGCCCGCCATGGTGCGCG
AGCTTCGCGGCACCATGGAGCGAGAAGGGGCGCCCGTCGGCGTGCTGTTCACAATGTATGAACCAACCAAGGAGATGATGAGCGAGGCGGTTAGAGCCGG
AACATACCGTGAGTGCCCGCGCATACAGATCATAACGGTCGCTGATGCCTTTGGTGGGAAGCGGCCGCGTGTGCCCGGAGGGGGCATACTCAGAAGGTCG
TCTCGGCCGCCGGCTGCGGCGACCGAGAGTCCAACTGTACAGGATGGATTGAAGAAGCTGGCGAAGGCTGGCCAGGCTGCTCGTGAGATGCCCGCCAGGC
GAGGATCTGGCAGGCGCTAAATGAGTGCGCAGGCGCTGGCGTTCACCGCCGCCTCCCTTGCCGACGCGCGGGCCTAGTCCTCGGTGCGCCAGACCAGCCG
CGAACGGTGGCGCTCCGGTCCCCTGCCGGCAGCAATGTCAGCGAGCACTTCGCCCGGGGACCGGCGTTCGCGCATCGCGATCATGGCGATGATGCGGGTG
AGGGGAACGCTCCGGTCGGCCCTGGCTGGGAGCAAGCCGTCGTCCACCCAGCCGGTGGCTTCGAGGTAGCGGTCGATCTGCTCGGGCGTGACCGAGCGCG
TGACGGGCTTCCCTGGTACGGTGACGCGGACGCGTCGCTTCGGATCGCGGGGTTTCATCTCCGCCACCTCCTCAGCCGCAGCGCCAGCGCCTCGCGGGCG
ATGCGCAGGCGGCTCGCCGCCGTGTGCGGGTTCATCCTGCGCCGCTTGGCGTACGCGACGAGCGAGTCGGGGCTGTCAACGGAGAGCAGCGCCTCCAGCT
GCCAGTCCGGGAGCTCGGCCATGGCGTCGAGCACTTCGCGTGCGGCGACCTGCGCCTCCAGATCGGGCCCGACGGGCTCGCGCAGCATGCCGAGCGGGCT
CGGATGGATCACGGCGCGGCGCGACCACGCGCTGTTGACGTAGTGCGACGCGAGGCGCCACGCGATGCCGTGAAGCCACTTGCGGAGCGCGTCGCGCGGC
TTGTCCTTCGGGTCGGGCCGGTAGAGCCCCCGGCGGACGGCGTTCCAGGCGTTGAGCAGCACCTCGGCCTCCACGTCGCGGCGGTCGCGCTTCGGGATGC
CGGCGAGCGTCGCCAGGACGATCCCGCGCTCGGCCATGATCGCCTCCATCGTCGGCTTCTTCCTGGGGTGTCGAGCGGCAGGCGGGCGGTGAGCCCCCTT
CCGCGGCTTCGTCACCCGCTCCTCCCGCACGTCGGGCAGCGGTCCGCTGCGACGGGGCGGCCCAAGAGCCGCTCGATGGTGGTTCCAGCCGCACGCGCGG
CGCGCATGGCGGTGCCGGGCGACCCGGCGAACTGGCCCGAGGCGATCGCGCGGATCGAACGGGTCGGCATGCCCATCACCTCGGCAAGGCATGCCCACGT
GCCGTACGCGGTGCGAAGGTTCTTGAGCGCGGCACGGAGGCGCTTGCGCTCGGGTTCGGTGAGGGAACGGGTGACGAAGGGGCGGCGCCGTGTAGGTACA
CGTCCGCCTTGCCCACCGTCGGGTGGGATCAGGTATAGCTGCACTGTCGGACTCCGGTGCTATCGGGGTTCGGCCACGCAGCCCGGACGGTTGGCGCCGT
CGCGGGCTGCACTACTTCCGGCCGAACCTACGGGCCCGATTCGAGGTCCGCTATCGCGAAGTGCTCACTTTGACCAATCGACTGGCACCTTGACCATTAC
C
Protein section
ORF number : 4
ORF 4
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
333 bp | 110 aa | 4444 | 4112 | - | No |
Annotation : Putative transcriptional regulator
Description : Transcriptional Regulator factor
ORF sequence :
MQLYLIPPDGGQGGRVPTRRRPFVTRSLTEPERKRLRAALKNLRTAYGTWACLAEVMGMPTRSIRAIASGQFAGSPGTAMRAARAAGTTIERLLGRPVAA
DRCPTCGRSG
Blast result :
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1050 bp | 349 aa | 235 | 1284 | + | No |
Chemistry : DDE
ORF sequence :
MSLLSAEARARVVSQLVEGCSIASTCRLTGVSKPTVLALLLRIGAGCERLHNRIVRGVTCHVAQCDEIWSYVQKKQSRVTASDPAEYGDAYTFVGMASAS
KLIISYRVGKRDEENTRAFVKDLRARLTTIPQLYTDGWQPYIGAVGASFTGGVDYCQVVKNYSRRPRRDDEVRYEPPRDPFITKTPIFGIPDVEHASTSH
VERQNWTIRMHIRRFTRLCNGFSRKLANHRAAVALHVAWYNLARIHESIRCTPAMEAGIARHVWSIPELVEAALAEPETEPPTPEPLKLRTPPAGTPTTP
ARALPNGRGFLRLVGAAPAAPVAPTPPEGPEPPPAPAPRRMVQLNLFNE
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1836 bp | 611 aa | 1385 | 3220 | + | No |
Annotation : Hypothetical protein
Description :
ORF sequence :
MNDESTVSYNSALGYFSGPMTASCPTIEGMGRRSRADRAGQGHLFRQGTLYYGDNLKVLREHVRDESVDLIYLDPPFNSKRNYNVIYKEPDSSDSVAQKR
AFDDSWHWDFAADAAYRRLVGSGAEERGVPTKLVSLVEAFRIFLGQTDMLAYVVMMAERIVELHRVLKRTGSLYLHCDPTASHYLKLVLDAIFGPDNFRN
EIVWQRSTAKNDPSRYGRCHDIIFFYTKSQEFYWDTQYSPFQDYSVEKNYTAVEEGTGRRYRLSDLTANKPGGDTDYEWHGKRPYRGRFWAFSKEKMDQM
YADGRIVFRRTGMPVYKRYLDEMPGVPFQDVWTDVRLASASTERIGYPTQKPLALLERIIASSSKSGDLVLDPFCGCGTTIEAAHKLGRKWVGIDITYLS
IDIIKGRIDALSPGSDDMYTVIGEPVDVESARRLAEEDPEEFQRWAVPFIGARHVGDGPGAGTFKRGRDRGVDGTIRFQDDTDAPSKRAIVSVKAGQRLG
PAMVRELRGTMEREGAPVGVLFTMYEPTKEMMSEAVRAGTYRECPRIQIITVADAFGGKRPRVPGGGILRRSSRPPAAATESPTVQDGLKKLAKAGQAAR
EMPARRGSGRR
Blast result :
ORF 3
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
495 bp | 163 aa | 4049 | 3555 | - | No |
Annotation : Hypothetical protein
Description :
ORF sequence :
MEAIMAERGIVLATLAGIPKRDRRDVEAEVLLNAWNAVRRGLYRPDPKDKPRDALRKWLHGIAWRLASHYVNSAWSRRAVIHPSPLGMLREPVGPDLEAQ
VAAREVLDAMAELPDWQLEALLSVDSPDSLVAYAKRRRMNPHTAASRLRIAREALALRLRWRR
Blast result :
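The ORF coordinates in the tables above can be sanity-checked with simple arithmetic. The sketch below is a minimal illustration, assuming that Begin and End are 1-based inclusive positions on the 4601 bp element, that minus-strand ORFs are listed with Begin greater than End, and that each span includes the stop codon. The `Orf` type and helper names are hypothetical.

```python
from typing import NamedTuple


class Orf(NamedTuple):
    name: str
    begin: int       # 1-based, as listed in the record
    end: int         # 1-based, as listed in the record
    strand: str      # "+" or "-"
    length_bp: int   # DNA length listed in the record
    length_aa: int   # protein length listed in the record


# Values copied from the ORF tables in this record.
ORFS = [
    Orf("ORF 1", 235, 1284, "+", 1050, 349),
    Orf("ORF 2", 1385, 3220, "+", 1836, 611),
    Orf("ORF 3", 4049, 3555, "-", 495, 163),
    Orf("ORF 4", 4444, 4112, "-", 333, 110),
]


def span_bp(orf: Orf) -> int:
    """Inclusive span between Begin and End, regardless of strand."""
    return abs(orf.end - orf.begin) + 1


def expected_aa(orf: Orf) -> int:
    """Codons in the span minus one, assuming the span ends with a stop codon."""
    return span_bp(orf) // 3 - 1


for orf in ORFS:
    print(
        orf.name,
        "span:", span_bp(orf),
        "listed bp:", orf.length_bp,
        "expected aa:", expected_aa(orf),
        "listed aa:", orf.length_aa,
    )
```

Under these assumptions every span equals the listed DNA length and is a multiple of three. The computed protein lengths agree with the listed values for ORF 1, ORF 2 and ORF 4. For ORF 3 the 495 bp span would give 164 aa, whereas the record lists 163 aa and the ORF sequence shown is 163 residues long; the record does not explain the difference.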
Comments
ISSce1 shows 36% amino acid similarity to ISLysp1.
The first ORF (ORF 1) encodes the transposase; the others are passenger genes.
References
[1] Schneiker,S., Perlova,O., Kaiser,O., Gerth,K., Alici,A., Altmeyer,M.O., Bartels,D., Bekel,T., Beyer,S., Bode,E., Bode,H.B., Bolten,C., Choudhuri,J.V., Doss,S., Elnakady,Y.A., Frank,B., Gaigalat,L., Goesmann,A., Groeger,C., Gross,F., Jelsbak,L., Kalinowski,J., Kegler,C., Knauber,T., Konietzny,S., Kopp,M., Krause,L., Krug,D., Linke,B., Mahmud,T., Martinez-Arias,R., McHardy,A.C., Merai,M., Meyer,F., Mormann,S., Munoz-Dorado,J., Perez,J., Pradella,S., Rachid,S., Raddatz,G., Rosenau,F., Ruckert,C., Sasse,F., Scharfe,M., Schuster,S.C., Suen,G., Treuner-Lange,A., Velicer,G.J., Vorholter,F.J., Weissman,K.J., Welch,R.D., Wenzel,S.C., Whitworth,D., Wilhelm,S., Wittmann,C., Blocker,H., Puhler,A. and Muller,R. (2007) Nature Biotech. In press.