ISSce1

  • Family IS1
  • Group ISMhu11
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
NC_010162 ND Sorangium cellulosum
Sorangium cellulosum 'So ce 56'
DNA section
IS Length : 4601 bp

Ends


IR Length : 17

IRL : GGTAATGGTCAAGGTGCAGCCCACTTTGACCAATGGTCGGACACAGGCTG
IRR : GGTAATGGTCAAGGTGCCAGTCGATTGGTCAAAGTGAGCACTTCGCGATA

Insertion site


Left flankDirect repeatRight flankDR Length
GGAGGGTGTCGACAGCCTCCCGGGAGCGTTGAAACTGAAACCGGAGGGTGTCGACAGCCTCC8

DNA sequence

GGTAATGGTCAAGGTGCAGCCCACTTTGACCAATGGTCGGACACAGGCTGTCCTAGCTGCCGTGCGGCTCTGACAGCGAGGAATCGCGAAATCAGCTAGA
ATTAGGGCCTACCAAGCAAAAGCGCCATGTCCGGCTGGACCCCGGTCATGGCGCCAGAGCACCGAGCGGTAGGCCCGCCCGTCGCCCGCAACGTCGAGCA
TGGGCCATGCCGCTGAGAAAGGCAAGGCCCGGCCATGTCTCTGCTGTCGGCGGAGGCGCGCGCTCGCGTCGTCTCTCAGCTCGTCGAAGGCTGCTCCATC
GCGTCCACGTGCAGGCTGACCGGGGTCAGCAAGCCGACCGTGCTCGCGCTGCTGCTCCGCATCGGAGCAGGGTGCGAACGGCTGCACAATCGCATCGTCC
GTGGCGTCACATGCCACGTTGCGCAGTGCGACGAGATCTGGAGCTACGTCCAGAAGAAGCAGTCTCGGGTGACAGCCTCCGATCCTGCTGAGTACGGCGA
CGCGTACACGTTCGTCGGGATGGCGAGCGCGTCGAAGCTCATCATCAGCTATCGCGTGGGGAAGCGCGATGAGGAGAACACGCGCGCGTTCGTCAAGGAT
CTGCGCGCTCGGCTCACCACCATCCCGCAGCTCTACACGGACGGCTGGCAGCCGTACATCGGGGCGGTCGGAGCGTCGTTTACGGGCGGCGTCGACTACT
GCCAGGTCGTCAAGAACTACAGTCGCAGACCGCGCCGAGACGATGAGGTGAGGTACGAGCCGCCGCGCGACCCGTTCATCACCAAGACACCGATCTTCGG
CATCCCTGATGTGGAGCACGCTTCTACAAGTCACGTGGAGCGCCAGAACTGGACCATCCGCATGCACATCCGCCGGTTCACCCGGCTCTGCAACGGCTTC
TCGCGGAAGCTGGCGAACCACCGCGCCGCTGTCGCGCTCCACGTCGCCTGGTACAACCTGGCTCGGATCCACGAGTCGATCCGCTGCACGCCCGCCATGG
AGGCGGGCATCGCGCGGCACGTCTGGTCGATCCCGGAGCTCGTCGAGGCCGCCCTGGCGGAGCCGGAGACCGAGCCGCCGACGCCGGAACCGCTCAAGCT
CCGGACGCCGCCCGCCGGGACGCCGACGACGCCCGCCAGGGCGCTGCCGAACGGTAGAGGGTTCCTGCGACTGGTCGGCGCCGCGCCGGCTGCGCCGGTG
GCCCCTACGCCGCCCGAGGGGCCGGAGCCACCGCCCGCGCCGGCGCCGCGCAGGATGGTGCAGCTAAATTTATTCAACGAATAACCATCGGGTTGCGCTC
TACCCTGTGCACTGCCTCGCGCAGAGGTCACACCCGCCGCAGCATGGAGCACACGCTGCGCTAGAGCGCGCTGAGTCCCCCTAGATGAACGACGAGAGCA
CCGTATCCTACAACAGCGCCTTGGGATACTTTTCTGGACCGATGACCGCCTCATGCCCGACAATCGAAGGCATGGGGCGCAGGTCGCGGGCAGATCGGGC
GGGGCAGGGGCATCTCTTCCGGCAGGGCACCCTCTATTACGGGGACAACCTCAAAGTGCTCCGGGAGCATGTGAGGGACGAGTCTGTAGACCTGATTTAT
CTGGACCCTCCGTTCAACTCGAAGCGAAATTACAACGTCATATACAAAGAGCCCGACTCTAGCGACTCAGTAGCTCAGAAGCGCGCGTTTGACGATTCGT
GGCATTGGGACTTCGCCGCGGACGCTGCGTACAGGCGACTTGTTGGCAGCGGCGCAGAGGAGCGCGGGGTGCCGACGAAGCTGGTTTCGCTTGTCGAGGC
ATTTAGAATATTCCTTGGCCAGACCGACATGTTGGCCTACGTGGTCATGATGGCGGAGCGAATAGTGGAGCTGCATCGGGTTCTGAAGCGAACCGGAAGC
CTGTATCTCCATTGCGATCCAACGGCGAGCCACTACCTGAAACTGGTGCTCGATGCGATATTTGGACCAGATAACTTTCGCAACGAGATCGTGTGGCAGC
GCTCGACGGCCAAGAACGATCCGAGCAGATATGGCCGCTGTCATGATATCATCTTCTTCTACACGAAGAGTCAAGAATTCTATTGGGACACGCAATATAG
TCCATTTCAGGACTATTCAGTAGAAAAGAACTACACGGCCGTGGAAGAGGGTACCGGCAGGAGATATAGGCTTAGCGATCTCACGGCAAACAAGCCCGGT
GGGGACACGGACTACGAGTGGCACGGCAAGCGCCCATACAGAGGAAGGTTCTGGGCCTTCTCAAAGGAGAAGATGGACCAGATGTACGCAGACGGGCGAA
TCGTCTTCCGGCGCACGGGGATGCCTGTGTACAAGCGGTATTTGGATGAGATGCCAGGCGTTCCTTTCCAGGATGTGTGGACGGACGTAAGGCTTGCGTC
TGCCTCTACGGAGCGGATTGGCTACCCAACACAGAAGCCGCTGGCGCTGCTGGAGCGCATCATCGCGTCTTCTTCGAAGAGCGGCGACCTGGTGCTCGAT
CCGTTTTGTGGATGCGGCACGACGATCGAAGCCGCTCATAAGCTGGGCCGAAAATGGGTTGGGATTGACATTACGTATTTGTCTATCGACATAATCAAAG
GGCGCATCGACGCTCTGTCGCCTGGTAGTGACGATATGTACACAGTCATTGGCGAGCCGGTTGATGTAGAATCCGCGCGCAGGCTTGCCGAGGAAGACCC
CGAGGAGTTCCAGCGGTGGGCTGTGCCGTTCATCGGTGCTCGACATGTCGGCGACGGGCCTGGGGCTGGTACCTTCAAACGAGGGCGCGACCGCGGGGTC
GACGGAACAATACGATTCCAAGATGACACGGATGCGCCGAGCAAACGAGCCATCGTCTCAGTGAAGGCTGGGCAGCGGCTCGGGCCCGCCATGGTGCGCG
AGCTTCGCGGCACCATGGAGCGAGAAGGGGCGCCCGTCGGCGTGCTGTTCACAATGTATGAACCAACCAAGGAGATGATGAGCGAGGCGGTTAGAGCCGG
AACATACCGTGAGTGCCCGCGCATACAGATCATAACGGTCGCTGATGCCTTTGGTGGGAAGCGGCCGCGTGTGCCCGGAGGGGGCATACTCAGAAGGTCG
TCTCGGCCGCCGGCTGCGGCGACCGAGAGTCCAACTGTACAGGATGGATTGAAGAAGCTGGCGAAGGCTGGCCAGGCTGCTCGTGAGATGCCCGCCAGGC
GAGGATCTGGCAGGCGCTAAATGAGTGCGCAGGCGCTGGCGTTCACCGCCGCCTCCCTTGCCGACGCGCGGGCCTAGTCCTCGGTGCGCCAGACCAGCCG
CGAACGGTGGCGCTCCGGTCCCCTGCCGGCAGCAATGTCAGCGAGCACTTCGCCCGGGGACCGGCGTTCGCGCATCGCGATCATGGCGATGATGCGGGTG
AGGGGAACGCTCCGGTCGGCCCTGGCTGGGAGCAAGCCGTCGTCCACCCAGCCGGTGGCTTCGAGGTAGCGGTCGATCTGCTCGGGCGTGACCGAGCGCG
TGACGGGCTTCCCTGGTACGGTGACGCGGACGCGTCGCTTCGGATCGCGGGGTTTCATCTCCGCCACCTCCTCAGCCGCAGCGCCAGCGCCTCGCGGGCG
ATGCGCAGGCGGCTCGCCGCCGTGTGCGGGTTCATCCTGCGCCGCTTGGCGTACGCGACGAGCGAGTCGGGGCTGTCAACGGAGAGCAGCGCCTCCAGCT
GCCAGTCCGGGAGCTCGGCCATGGCGTCGAGCACTTCGCGTGCGGCGACCTGCGCCTCCAGATCGGGCCCGACGGGCTCGCGCAGCATGCCGAGCGGGCT
CGGATGGATCACGGCGCGGCGCGACCACGCGCTGTTGACGTAGTGCGACGCGAGGCGCCACGCGATGCCGTGAAGCCACTTGCGGAGCGCGTCGCGCGGC
TTGTCCTTCGGGTCGGGCCGGTAGAGCCCCCGGCGGACGGCGTTCCAGGCGTTGAGCAGCACCTCGGCCTCCACGTCGCGGCGGTCGCGCTTCGGGATGC
CGGCGAGCGTCGCCAGGACGATCCCGCGCTCGGCCATGATCGCCTCCATCGTCGGCTTCTTCCTGGGGTGTCGAGCGGCAGGCGGGCGGTGAGCCCCCTT
CCGCGGCTTCGTCACCCGCTCCTCCCGCACGTCGGGCAGCGGTCCGCTGCGACGGGGCGGCCCAAGAGCCGCTCGATGGTGGTTCCAGCCGCACGCGCGG
CGCGCATGGCGGTGCCGGGCGACCCGGCGAACTGGCCCGAGGCGATCGCGCGGATCGAACGGGTCGGCATGCCCATCACCTCGGCAAGGCATGCCCACGT
GCCGTACGCGGTGCGAAGGTTCTTGAGCGCGGCACGGAGGCGCTTGCGCTCGGGTTCGGTGAGGGAACGGGTGACGAAGGGGCGGCGCCGTGTAGGTACA
CGTCCGCCTTGCCCACCGTCGGGTGGGATCAGGTATAGCTGCACTGTCGGACTCCGGTGCTATCGGGGTTCGGCCACGCAGCCCGGACGGTTGGCGCCGT
CGCGGGCTGCACTACTTCCGGCCGAACCTACGGGCCCGATTCGAGGTCCGCTATCGCGAAGTGCTCACTTTGACCAATCGACTGGCACCTTGACCATTAC
C
Protein section
ORF number : 4

 

ORF 4
LengthBeginEndStrandFusion ORF
333 bp110 aa44444112-No
ORF function : Passenger Gene
Annotation : Putative transcriptional regulatorDescription : Transcriptional Regulator factor

ORF sequence :

MQLYLIPPDGGQGGRVPTRRRPFVTRSLTEPERKRLRAALKNLRTAYGTWACLAEVMGMPTRSIRAIASGQFAGSPGTAMRAARAAGTTIERLLGRPVAA
DRCPTCGRSG

 

Blast result :
ORF 1
LengthBeginEndStrandFusion ORF
1050 bp349 aa2351284+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MSLLSAEARARVVSQLVEGCSIASTCRLTGVSKPTVLALLLRIGAGCERLHNRIVRGVTCHVAQCDEIWSYVQKKQSRVTASDPAEYGDAYTFVGMASAS
KLIISYRVGKRDEENTRAFVKDLRARLTTIPQLYTDGWQPYIGAVGASFTGGVDYCQVVKNYSRRPRRDDEVRYEPPRDPFITKTPIFGIPDVEHASTSH
VERQNWTIRMHIRRFTRLCNGFSRKLANHRAAVALHVAWYNLARIHESIRCTPAMEAGIARHVWSIPELVEAALAEPETEPPTPEPLKLRTPPAGTPTTP
ARALPNGRGFLRLVGAAPAAPVAPTPPEGPEPPPAPAPRRMVQLNLFNE

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
1836 bp611 aa13853220+No
ORF function : Passenger Gene
Annotation : Hypothetical proteinDescription :

ORF sequence :

MNDESTVSYNSALGYFSGPMTASCPTIEGMGRRSRADRAGQGHLFRQGTLYYGDNLKVLREHVRDESVDLIYLDPPFNSKRNYNVIYKEPDSSDSVAQKR
AFDDSWHWDFAADAAYRRLVGSGAEERGVPTKLVSLVEAFRIFLGQTDMLAYVVMMAERIVELHRVLKRTGSLYLHCDPTASHYLKLVLDAIFGPDNFRN
EIVWQRSTAKNDPSRYGRCHDIIFFYTKSQEFYWDTQYSPFQDYSVEKNYTAVEEGTGRRYRLSDLTANKPGGDTDYEWHGKRPYRGRFWAFSKEKMDQM
YADGRIVFRRTGMPVYKRYLDEMPGVPFQDVWTDVRLASASTERIGYPTQKPLALLERIIASSSKSGDLVLDPFCGCGTTIEAAHKLGRKWVGIDITYLS
IDIIKGRIDALSPGSDDMYTVIGEPVDVESARRLAEEDPEEFQRWAVPFIGARHVGDGPGAGTFKRGRDRGVDGTIRFQDDTDAPSKRAIVSVKAGQRLG
PAMVRELRGTMEREGAPVGVLFTMYEPTKEMMSEAVRAGTYRECPRIQIITVADAFGGKRPRVPGGGILRRSSRPPAAATESPTVQDGLKKLAKAGQAAR
EMPARRGSGRR

 

Blast result :
ORF 3
LengthBeginEndStrandFusion ORF
495 bp163 aa40493555-No
ORF function : Passenger Gene
Annotation : Hypothetical proteinDescription :

ORF sequence :

MEAIMAERGIVLATLAGIPKRDRRDVEAEVLLNAWNAVRRGLYRPDPKDKPRDALRKWLHGIAWRLASHYVNSAWSRRAVIHPSPLGMLREPVGPDLEAQ
VAAREVLDAMAELPDWQLEALLSVDSPDSLVAYAKRRRMNPHTAASRLRIAREALALRLRWRR

 

Blast result :
Comments
ISSce1 is 36% aa similar to ISLysp1.
The first ORF is the transposase, others are passengers genes.
References
1] Schneiker,S., Perlova,O., Kaiser,O., Gerth,K., Alici,A., Altmeyer,M.O., Bartels,D., Bekel,T., Beyer,S., Bode,E., Bode,H.B., Bolten,C., Choudhuri,J.V., Doss,S., Elnakady,Y.A., Frank,B., Gaigalat,L., Goesmann,A., Groeger,C., Gross,F., Jelsbak,L.,Jelsbak,L., Kalinowski,J., Kegler,C., Knauber,T., Konietzny,S.,Kopp,M., Krause,L., Krug,D., Linke,B., Mahmud,T.,Martinez-Arias,R., McHardy,A.C., Merai,M., Meyer,F., Mormann,S.,Munoz-Dorado,J., Perez,J., Pradella,S., Rachid,S., Raddatz,G., Rosenau,F., Ruckert,C., Sasse,F., Scharfe,M., Schuster,S.C., Suen,G., Treuner-Lange,A., Velicer,G.J., Vorholter,F.J., Weissman,K.J., Welch,R.D., Wenzel,S.C., Whitworth,D., Wilhelm,S., Wittmann,C., Blocker,H., Puhler,A. and Muler,R. (2007) Nature Biotech In press