ISUnCu9
- Family IS4
- Group IS50
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
| | ND | uncultured bacterium | uncultured bacterium |
DNA section
IS Length : 1486 bp
Ends
IR Length : 16/20
IRL : CTGTCTCTTATACACAAATCCCTACTAAGTAATTAGTGGGGATTTTTTTG
IRR : CTGTCTCTTGATCACAAGTCGATATCAAGAGACAGGGCTAGCTCATAGCC
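The "16/20" figure can be reproduced from the two repeat lines. The sketch below assumes it means 16 identical positions across the first 20 aligned bases, and that IRR is listed as the reverse complement of the element's right end; neither convention is stated in the record. The helper names are illustrative only.

```python
# First 20 bp of each terminal inverted repeat, copied from the IRL and IRR lines.
IRL_20 = "CTGTCTCTTATACACAAATC"
IRR_20 = "CTGTCTCTTGATCACAAGTC"

# Last 20 bp of the DNA sequence given in this record (right end, top strand).
RIGHT_END_20 = "GACTTGTGATCAAGAGACAG"

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def count_matches(a: str, b: str) -> int:
    """Count identical positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x == y for x, y in zip(a, b))


matches = count_matches(IRL_20, IRR_20)
print(f"{matches}/{len(IRL_20)}")  # 16/20

# Assumption check: IRR as listed equals the reverse complement of the right end.
print(reverse_complement(RIGHT_END_20) == IRR_20)  # True
```

The four mismatches fall at positions 10, 11, 12 and 18 of the alignment.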
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AACAAAGCCG | CCATATATTGG | CGGCTTTTTT | 11 |
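As a rough illustration of how the insertion-site row reads, the sketch below assumes the usual target-site duplication layout: the empty site carries one copy of the direct repeat, and the occupied site carries one copy on each side of the element. The function names and the placeholder element are hypothetical and not part of the record.

```python
# Values copied from the insertion-site table row.
LEFT_FLANK = "AACAAAGCCG"
DIRECT_REPEAT = "CCATATATTGG"
RIGHT_FLANK = "CGGCTTTTTT"
DR_LENGTH = 11


def empty_site(left: str, dr: str, right: str) -> str:
    """Target before insertion: the repeat sequence is present once."""
    return left + dr + right


def occupied_site(left: str, dr: str, element: str, right: str) -> str:
    """Target after insertion: the repeat flanks the element on both sides."""
    return left + dr + element + dr + right


# Placeholder standing in for the 1486 bp element (hypothetical, for length only).
element = "N" * 1486

before = empty_site(LEFT_FLANK, DIRECT_REPEAT, RIGHT_FLANK)
after = occupied_site(LEFT_FLANK, DIRECT_REPEAT, element, RIGHT_FLANK)

print(len(DIRECT_REPEAT) == DR_LENGTH)  # True
print(len(after) - len(before))         # 1497
```

The length difference equals the element length plus one extra copy of the repeat (1486 + 11).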
DNA sequence
CTGTCTCTTATACACAAATCCCTACTAAGTAATTAGTGGGGATTTTTTTGAACTCTTCCTTCCAACTCTGATCAGATCTTTCACTTAACTCATTGGGAAA
GATTATGATTGATAATGTAGAGTGGGCTGAAAAAACGTTTGGAGAAGCAAATCTAGGAGATCCTCGTCGTACATCTCGTCTGGTAAAATTAGCTGCAACA
CTTGCCAATACACCAGGAAAGCCTCTTGTGAATATCACTGAATCCCCCGCCGATATGGAAGGAGCATACCGCTTCATTAGAAATGAAAGTGTAGATGCGA
CTTCAATCGCTGAAACTGGATTTAACGTTACAGCTGTACAAGCTGCGCAACACAAAACATTGCTTGCCCTAGAAGATACTACGACCATTTCATACAGCCA
TAAAAGTATTCGAAACGAACTTGGACACGTGAATCAGGGTAACCGTTGCCGCGGTATGCTTGCCCATAGTGTCTTACTTTTTGCGCCGGAATCACAAGAA
ATAGTTGGACTGATTGAACAGTCGCGTTGGACACGAGACATAACAACGCGTGGTAAAAGAGCGAAACATGCATCGACACCATACACTGAGAAAGAAGGCT
ACAAGTGGGAGTCAGCATCAATCAATATGTCAGCAAGACTGGGCGCAAAGATAGATGATGTGATATCCGTCTGTGACCGAGAAGCCGATATCTACGAGTA
CCTACTTTATAAGCTAACAACTCAGCAGCGGTTTATTGTTCGTTCAATGCAAAGTCGGCACATTGAGGAAGGCGAAAACAAGCTATATCACTATGCCAGT
CAGTTAAAAAGTGCAGGGCACAAGCAAATACATATTCCGCAAAAAGGAGGGCGCAAAGCACGAAACGTGACGTTAGATATTGTATTTGCTCCCGTTACGC
TCAAGGTTCCTAGCAATAAACGTGGTGAATCACTTCCTTTATACTACGTCGGATGTGTAGAGCGCGGTAATTCAAAAGAGGCGTTATGCTGGCATTTACT
GACAAATGAACCCATCACAAATCAAGCGCAAGCACGAAAAATAATTGGCTATTACGAGCATCGTTGGCTAGTAGAAGAATATCATAAGGCGTGGAAAAGT
GACGGTACTAATATCGAAGCTTCGCGCTTACAAAGTAAGGAAAACGTTGAGCGCCTAGTCACAATAAACGCGTTTATTGCCGTAAGGATCCTTCAACTCA
AATTCGCAAAAGACCGACCGGACGATAGTAGCTGTGAAGAAGTCCTATCACCTAAAGCATGGAAGTTATTATGGTTAAAACGAGTGAGTAAGACACCACC
TGACTCAGCCCCCTCAATGAAATGGGCCTATACGGAGCTTGCAAAGCTAGGAGGGTGGAAAGATACAAAAAGAACAGGAAGAGCGTCTGTGAAAGTCATA
TGGCAAGGATGGTTCAAACTGCAAACCATCCTTGAAGGCTATGAGCTAGCCCTGTCTCTTGATATCGACTTGTGATCAAGAGACAG
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1368 bp | 456 aa | 105 | 1472 | + | No |
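The ORF coordinates and lengths in the row above are mutually consistent, which the following sketch checks. It assumes 1-based, inclusive coordinates, a convention the record does not state explicitly; variable names are illustrative.

```python
# Values copied from the record.
IS_LENGTH = 1486          # bp, from "IS Length"
ORF_BEGIN, ORF_END = 105, 1472
ORF_LENGTH_BP = 1368
ORF_LENGTH_AA = 456


def orf_span(begin: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - begin + 1


span_bp = orf_span(ORF_BEGIN, ORF_END)
codons, remainder = divmod(span_bp, 3)

print(span_bp, codons, remainder)           # 1368 456 0
print(1 <= ORF_BEGIN < ORF_END <= IS_LENGTH)  # True
```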
Chemistry : DDE
ORF sequence :
MIDNVEWAEKTFGEANLGDPRRTSRLVKLAATLANTPGKPLVNITESPADMEGAYRFIRNESVDATSIAETGFNVTAVQAAQHKTLLALEDTTTISYSHK
SIRNELGHVNQGNRCRGMLAHSVLLFAPESQEIVGLIEQSRWTRDITTRGKRAKHASTPYTEKEGYKWESASINMSARLGAKIDDVISVCDREADIYEYL
LYKLTTQQRFIVRSMQSRHIEEGENKLYHYASQLKSAGHKQIHIPQKGGRKARNVTLDIVFAPVTLKVPSNKRGESLPLYYVGCVERGNSKEALCWHLLT
NEPITNQAQARKIIGYYEHRWLVEEYHKAWKSDGTNIEASRLQSKENVERLVTINAFIAVRILQLKFAKDRPDDSSCEEVLSPKAWKLLWLKRVSKTPPD
SAPSMKWAYTELAKLGGWKDTKRTGRASVKVIWQGWFKLQTILEGYELALSLDIDL
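A quick way to see that the ORF sequence corresponds to the DNA sequence is to translate the first codons starting at position 105. The sketch below uses a compact encoding of the standard genetic code and only the first 30 nucleotides of the ORF (positions 105 to 134 of the DNA sequence above); it is an illustration, not the procedure used to annotate the record.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Nucleotides 105-134 of the DNA sequence in this record.
ORF_START_DNA = "ATGATTGATAATGTAGAGTGGGCTGAAAAA"

peptide = translate(ORF_START_DNA)
print(peptide)  # MIDNVEWAEK
```

The result matches the first ten residues of the ORF sequence listed above.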
Blast result :
Comments
ISUnCu9 shares 63% amino acid similarity with ISPpr3. It was found by screening completely sequenced bacterial genomes for sequences homologous to the IS50 transposase using BLASTP. Multiple alignments revealed a conserved DDE motif: D(N2)-99-D(N3)-132-E(C1). Accession: AACY01103432.
ISUnCu9 is from uncultured bacterium (environmental sequence from Sargasso Sea).
References
[1] Venter,J.C., Remington,K., Heidelberg,J.F., Halpern,A.L., Rusch,D., Eisen,J.A., Wu,D., Paulsen,I., Nelson,K.E., Nelson,W., Fouts,D.E., Levy,S., Knap,A.H., Lomas,M.W., Nealson,K., White,O., Peterson,J., Hoffman,J., Parsons,R., Baden-Tillson,H., Pfannkoch,C., Rogers,Y.H. and Smith,H.O. (2004) Science 304 (5667), 66-74
[2] Reznikoff, W.S., Bordenstein, S.R. and Apodaca, J. (2004) J. Bact. 186(24), 8240-8247
[3] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol 8(1):18