ISUnCu8
- Family IS4
- Group IS50
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | uncultured bacterium | uncultured bacterium |
DNA section
IS Length : 1509 bp
Ends
IR Length : 15/20
IRL : CTGTCTCTCATACACAAATCCCTGAACAATGCTCAGGGATTTTTTTTGAA
IRR : CTGTCTCTTGATCACAAGTCTTCGTGATTGAGAGACATGGCCAATTCGTA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
CTGTCTCTCATACACAAATCCCTGAACAATGCTCAGGGATTTTTTTTGAACTCATTCCCCGCATTGTGATCAGATCGTCTAGGGCTAAATATTCGAGGCG
ATTATGATGAAGATGATGGATCCAGAGCAGTGGGCGCAATGTCAATTTGGGCACGCAAACCTCAACGACCCAAGACGTACACAAAGACTTGTTTCACTGG
CTACCTCAATTACCCAACAACCCGGAGTTGCCGTGTCCAAACTTCCCTTATCCCCCGCTGAGATGGAAGGGGCCTACCGATTTATCCGCAATGAGAATAT
TCAAGCCAGTGATATTGCTGAAGCGGGTTTTCAAGCCACCGCGCAGCAAGCTCGAGAGCATGACATCTTGTTAGCACTAGAGGATACCACCTCGCTGACT
TACAAGCATGACAGTGTCAGGGAGGAGTTAGGTCATACCAATCAAGGGGATAAAAACCGTGCCATTCTGGCGCATTCTATCTTGCTGTTTTCACCGCAAA
GCCAGCAAGTCGTTGGGCTGATTGAGCAACAGCGTTGGACGCGGGATATCACTAAGCGCGGGCAACGCCGCCAACATGCTACTCGCCCTTACGAGGAAAA
AGAGAGCTACAAATGGGAACAAGCCTCCGCCAATATGTCTGCCCGTTTAGGCGAGCATATCAACAAGGTGATTTCCGTCTGTGACCGTGAGGCTGATTTG
TTTGAATATCTGACCTATAAAACCCAGCACCAACAGCGCTTTGTCGTCCGCTCGATGCAGAGCCGATGCATTCAAGAGCATGCACACAAACTCTATGACT
ACTCGAACACATTACCGCTTGCAGCGACGAAATCTCTCCTCATTCCTCAGAAAGGCGGAAGGAAAGCCCGGGCGGTGACGCTGGAAATGAGATATGCCCG
AGTGACCCTCAAAGCTCCAGCCAACAAACGCACTCAGGCAGACATCCCTCTGTATTACGTCAGTGTCGTAGAGCAAAGTCATCGTGAAGAAAAACTGGCA
TGGCACTTACTCACCTCAGAACCTATCACATGCTCGAAGGAGGCACTGGAGGTTGTCGGCTATTACGAGCGCCGTTGGTTGATTGAAGATTACCACAAGG
TATGGAAAAGCAGTGGCACCGCGGTGGAAGAGCTGAGAATGCAGTGCCGTGAGAATCTGGAGCGTATGAGTGTGATACTGGCTTTTATCGCCACACGCTT
ACTCCAGCTTCGCTTTATGAAGGTATCCAAAGCGGAAGTGGCAGCACAATGTTGTGAAACGCTATTGGGTCAGAAAGCGTGGAAATTACTTTGGCTGAAA
ATGGAGGGACGTCCGTTACCTCAACAGGCACCCGATATGCAGTGGGCGTACGAGCGTCTGGCCAGATTAGGGGGTTGGAAAGATACCAAGCGGACAGGTC
GAGCTTGTGTAGAGGTATTGTGGGAAGGATGGTTTAGGTTACAAACCATCCTTGAGGGTTACGAATTGGCCATGTCTCTCAATCACGAAGACTTGTGATC
AAGAGACAG
ATTATGATGAAGATGATGGATCCAGAGCAGTGGGCGCAATGTCAATTTGGGCACGCAAACCTCAACGACCCAAGACGTACACAAAGACTTGTTTCACTGG
CTACCTCAATTACCCAACAACCCGGAGTTGCCGTGTCCAAACTTCCCTTATCCCCCGCTGAGATGGAAGGGGCCTACCGATTTATCCGCAATGAGAATAT
TCAAGCCAGTGATATTGCTGAAGCGGGTTTTCAAGCCACCGCGCAGCAAGCTCGAGAGCATGACATCTTGTTAGCACTAGAGGATACCACCTCGCTGACT
TACAAGCATGACAGTGTCAGGGAGGAGTTAGGTCATACCAATCAAGGGGATAAAAACCGTGCCATTCTGGCGCATTCTATCTTGCTGTTTTCACCGCAAA
GCCAGCAAGTCGTTGGGCTGATTGAGCAACAGCGTTGGACGCGGGATATCACTAAGCGCGGGCAACGCCGCCAACATGCTACTCGCCCTTACGAGGAAAA
AGAGAGCTACAAATGGGAACAAGCCTCCGCCAATATGTCTGCCCGTTTAGGCGAGCATATCAACAAGGTGATTTCCGTCTGTGACCGTGAGGCTGATTTG
TTTGAATATCTGACCTATAAAACCCAGCACCAACAGCGCTTTGTCGTCCGCTCGATGCAGAGCCGATGCATTCAAGAGCATGCACACAAACTCTATGACT
ACTCGAACACATTACCGCTTGCAGCGACGAAATCTCTCCTCATTCCTCAGAAAGGCGGAAGGAAAGCCCGGGCGGTGACGCTGGAAATGAGATATGCCCG
AGTGACCCTCAAAGCTCCAGCCAACAAACGCACTCAGGCAGACATCCCTCTGTATTACGTCAGTGTCGTAGAGCAAAGTCATCGTGAAGAAAAACTGGCA
TGGCACTTACTCACCTCAGAACCTATCACATGCTCGAAGGAGGCACTGGAGGTTGTCGGCTATTACGAGCGCCGTTGGTTGATTGAAGATTACCACAAGG
TATGGAAAAGCAGTGGCACCGCGGTGGAAGAGCTGAGAATGCAGTGCCGTGAGAATCTGGAGCGTATGAGTGTGATACTGGCTTTTATCGCCACACGCTT
ACTCCAGCTTCGCTTTATGAAGGTATCCAAAGCGGAAGTGGCAGCACAATGTTGTGAAACGCTATTGGGTCAGAAAGCGTGGAAATTACTTTGGCTGAAA
ATGGAGGGACGTCCGTTACCTCAACAGGCACCCGATATGCAGTGGGCGTACGAGCGTCTGGCCAGATTAGGGGGTTGGAAAGATACCAAGCGGACAGGTC
GAGCTTGTGTAGAGGTATTGTGGGAAGGATGGTTTAGGTTACAAACCATCCTTGAGGGTTACGAATTGGCCATGTCTCTCAATCACGAAGACTTGTGATC
AAGAGACAG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1380 bp | 460 aa | 116 | 1495 | + | No |
Chemistry : DDE
ORF sequence :
MDPEQWAQCQFGHANLNDPRRTQRLVSLATSITQQPGVAVSKLPLSPAEMEGAYRFIRNENIQASDIAEAGFQATAQQAREHDILLALEDTTSLTYKHDS
VREELGHTNQGDKNRAILAHSILLFSPQSQQVVGLIEQQRWTRDITKRGQRRQHATRPYEEKESYKWEQASANMSARLGEHINKVISVCDREADLFEYLT
YKTQHQQRFVVRSMQSRCIQEHAHKLYDYSNTLPLAATKSLLIPQKGGRKARAVTLEMRYARVTLKAPANKRTQADIPLYYVSVVEQSHREEKLAWHLLT
SEPITCSKEALEVVGYYERRWLIEDYHKVWKSSGTAVEELRMQCRENLERMSVILAFIATRLLQLRFMKVSKAEVAAQCCETLLGQKAWKLLWLKMEGRP
LPQQAPDMQWAYERLARLGGWKDTKRTGRACVEVLWEGWFRLQTILEGYELAMSLNHEDL
VREELGHTNQGDKNRAILAHSILLFSPQSQQVVGLIEQQRWTRDITKRGQRRQHATRPYEEKESYKWEQASANMSARLGEHINKVISVCDREADLFEYLT
YKTQHQQRFVVRSMQSRCIQEHAHKLYDYSNTLPLAATKSLLIPQKGGRKARAVTLEMRYARVTLKAPANKRTQADIPLYYVSVVEQSHREEKLAWHLLT
SEPITCSKEALEVVGYYERRWLIEDYHKVWKSSGTAVEELRMQCRENLERMSVILAFIATRLLQLRFMKVSKAEVAAQCCETLLGQKAWKLLWLKMEGRP
LPQQAPDMQWAYERLARLGGWKDTKRTGRACVEVLWEGWFRLQTILEGYELAMSLNHEDL
Blast result :
Comments
ISUnCu8 is 61% aa similar to ISPpr3. ISUnCu8 has been found by screening completely sequenced bacterial genomes for sequences homologous to the IS50 transposase using BLASTP. Multiple alignments revealed a conserved DDE motif : D(N2)-99-D(N3)-133-E(C1).
Accession number : AACY01017159.
ISUnCu8 is from uncultured bacterium (environmental sequence from Sargasso Sea).
Accession number : AACY01017159.
ISUnCu8 is from uncultured bacterium (environmental sequence from Sargasso Sea).
References
1] Venter,J.C., Remington,K., Heidelberg,J.F., Halpern,A.L., Rusch,D., Eisen,J.A., Wu,D., Paulsen,I., Nelson,K.E., Nelson,W., Fouts,D.E., Levy,S., Knap,A.H., Lomas,M.W., Nealson,K., White,O., Peterson,J., Hoffman,J., Parsons,R., Baden-Tillson,H., Pfannkoch,C., Rogers,Y.H. and Smith,H.O. (2004) Science 304 (5667), 66-74
2] Reznikoff, W.S., Bordenstein, S.R. and Apodaca, J. (2004) J. Bact. 186(24), 8240-8247
3] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18
2] Reznikoff, W.S., Bordenstein, S.R. and Apodaca, J. (2004) J. Bact. 186(24), 8240-8247
3] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18