ISBlo12
- Family IS607
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004307 | ND | Bifidobacterium longum | Bifidobacterium longum NCC2705 |
DNA section
IS Length : 2003 bp
Ends
IR Length : 19/46
IRL : GCAACCAGTTAACTTAATTGCTTGCTTTCGATTGTTTTCGTATGATATAC
IRR : CGGCTGGTTCGTCCGTTGTAGATGACTTTGTTTTAGCTTGTCGCTAGAGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
GCAACCAGTTAACTTAATTGCTTGCTTTCGATTGTTTTCGTATGATATACTTGAGTCATGCGTGTCAGGGAATGGGCGAGGCGTGAGGGCTTCAACGAGC
AGACGGTATGGCAGTGGTGCCGTGAGAACCGCATGCCCGTCCCATTTGAGCGCATGAGCACGGGTACGATAATCATCCACGACCCGAAATACGAGAGCCA
GCCCGTCGCCCCCACGGCGAACGGCAGGACGGTGTGTTACGCGCGTGTCAGCTCGTCTGACCAGAAGGACGATTTGACACGTCAGGCCGACCGGTTGAGG
GCGTTCGCCGTCAACATGGGCGTCGAGAAGCCCGAGGTGGTCACGGAGACGGGTTCCGGCATGAACGACAAACGACGCAAGCTCAACCGGCTGTTGGCCG
ACCCGACCGTCGGCACGCTGATCGTGGAGCATCGCGACCGGCTCGCCCGTATGAACGCGGGCTTGGTGGAGAGCGCGTTGAAGGCGCAGGGCAGACGCGT
CGTCGTCGTGGACGACACGGGACTCGACGACGATCTGGTGCGCGACATGACCGAGGTGCTGACCTCGTTCTGCGCGAGACTGTACGGACGCCGCGGGGCC
AGGCACCGTGCGGAGAAGGCGTTGGAGGCGATGCGCGATGAGCGCGTATGAGGCCGTCAGAATCCGGCTCGACCCGACCCCACGGCAGACACGGCTGTTG
GAGTCCCATGCGGGTGGCGCGCGTTTCGCGTACAATCTGATGCTCGCGCACGTCCGGCGCCAAATCTCCTTGGGTGAGAAACCGGACTGGACGTTGTACG
CGATGCGCCGCTGGTGGAACGAGTGGAAGGACGAAATCGCCCCGTGGTGGCGGGAGAACAGCAAGGAGGCGTACGGCAGCGCGTTCGAATGGCTGTCCCA
GGCGTTGAGGAACTGGTCGGACAGCAGGAAGGGCAGGCGCGCGGGCCGTAGGGTGGGCTGGCCGAAATACAAGTCGAAACGCTCCAGTGTCCCGCGTTTC
GCATACACGACCGGCAGCTTCGGCCTTATCGAGGACGACCCGAAGGCGTTGAGACTGCCACGCATCGGACGCGTGCACTGCATGGAGAACGCCACCGAAC
GCGTCCACGGCAGACGAATCGTGCGCATGACCGTCAGTCGCCATGCGGGCTTCTGGTATGCGGCCCTCACCGTCGAACGTCCCACCGAAAGCGTTCCAGC
GAAAAACAGAAAACGGAAGAACCATGATCGTCAGGTCGGCGTGGATTTGGGCGTCAGGACCCTCGCCACCCTCTCGGATGGCACCACGTTCCCCAATCCA
CGCAACTACGTCCGCACGCAACGGAAACTCCGCCACGCCCAACAGTCGTTGAGCCGCCGCGACAGGGGCATGAGCCATGGATGCGGGTCGAAACGGTACA
ACAGGGCGTTGGAGCGTGTGCGCCGAATCCACGCCCGCATAGCCGCCCAACGAGCCGACAACATCGGCAAGCTCACCACGTGGCTCGCCGACAATTATTC
CGACATCAGCATCGAGGACCTCAACGTGCAGGGCATGAGCCATAACAGGAGGCTTGCCAAACACATACTGGACGCGGACTTCCACGAGTTCCGCCGCCAA
CTGGAATACAAGACCGCACGCGCCGGCACGAGGCTGCACGTCATCGACCGCTGGTATCCAAGCTCGAAGACCTGTTCGAACTGCGGGACGGTGAAAGCCA
AGCTGTCCCTGTCCGAGCGCGTCTACCATTGCGAGGAGTGCGGGCTTGTCATCGACCGTGATGTGAACGCGGCCATCAACATCCAAGTCGCCGGGAGTGC
CCCGGAGACGTTAAACGCGCGTGGAGGAAGCGTAGGACAGACCCGCCTTGAGTGCGGGACAATGCGGCATCCGGCGAAACGCGAACCAAGCGGCGGCGAC
AGTCGCGTGAGACTTGGAGCTGGTCTCGGCAACGAGGCCATGCAGATGACTTCGCTCTAGCGACAAGCTAAAACAAAGTCATCTACAACGGACGAACCAG
CCG
AGACGGTATGGCAGTGGTGCCGTGAGAACCGCATGCCCGTCCCATTTGAGCGCATGAGCACGGGTACGATAATCATCCACGACCCGAAATACGAGAGCCA
GCCCGTCGCCCCCACGGCGAACGGCAGGACGGTGTGTTACGCGCGTGTCAGCTCGTCTGACCAGAAGGACGATTTGACACGTCAGGCCGACCGGTTGAGG
GCGTTCGCCGTCAACATGGGCGTCGAGAAGCCCGAGGTGGTCACGGAGACGGGTTCCGGCATGAACGACAAACGACGCAAGCTCAACCGGCTGTTGGCCG
ACCCGACCGTCGGCACGCTGATCGTGGAGCATCGCGACCGGCTCGCCCGTATGAACGCGGGCTTGGTGGAGAGCGCGTTGAAGGCGCAGGGCAGACGCGT
CGTCGTCGTGGACGACACGGGACTCGACGACGATCTGGTGCGCGACATGACCGAGGTGCTGACCTCGTTCTGCGCGAGACTGTACGGACGCCGCGGGGCC
AGGCACCGTGCGGAGAAGGCGTTGGAGGCGATGCGCGATGAGCGCGTATGAGGCCGTCAGAATCCGGCTCGACCCGACCCCACGGCAGACACGGCTGTTG
GAGTCCCATGCGGGTGGCGCGCGTTTCGCGTACAATCTGATGCTCGCGCACGTCCGGCGCCAAATCTCCTTGGGTGAGAAACCGGACTGGACGTTGTACG
CGATGCGCCGCTGGTGGAACGAGTGGAAGGACGAAATCGCCCCGTGGTGGCGGGAGAACAGCAAGGAGGCGTACGGCAGCGCGTTCGAATGGCTGTCCCA
GGCGTTGAGGAACTGGTCGGACAGCAGGAAGGGCAGGCGCGCGGGCCGTAGGGTGGGCTGGCCGAAATACAAGTCGAAACGCTCCAGTGTCCCGCGTTTC
GCATACACGACCGGCAGCTTCGGCCTTATCGAGGACGACCCGAAGGCGTTGAGACTGCCACGCATCGGACGCGTGCACTGCATGGAGAACGCCACCGAAC
GCGTCCACGGCAGACGAATCGTGCGCATGACCGTCAGTCGCCATGCGGGCTTCTGGTATGCGGCCCTCACCGTCGAACGTCCCACCGAAAGCGTTCCAGC
GAAAAACAGAAAACGGAAGAACCATGATCGTCAGGTCGGCGTGGATTTGGGCGTCAGGACCCTCGCCACCCTCTCGGATGGCACCACGTTCCCCAATCCA
CGCAACTACGTCCGCACGCAACGGAAACTCCGCCACGCCCAACAGTCGTTGAGCCGCCGCGACAGGGGCATGAGCCATGGATGCGGGTCGAAACGGTACA
ACAGGGCGTTGGAGCGTGTGCGCCGAATCCACGCCCGCATAGCCGCCCAACGAGCCGACAACATCGGCAAGCTCACCACGTGGCTCGCCGACAATTATTC
CGACATCAGCATCGAGGACCTCAACGTGCAGGGCATGAGCCATAACAGGAGGCTTGCCAAACACATACTGGACGCGGACTTCCACGAGTTCCGCCGCCAA
CTGGAATACAAGACCGCACGCGCCGGCACGAGGCTGCACGTCATCGACCGCTGGTATCCAAGCTCGAAGACCTGTTCGAACTGCGGGACGGTGAAAGCCA
AGCTGTCCCTGTCCGAGCGCGTCTACCATTGCGAGGAGTGCGGGCTTGTCATCGACCGTGATGTGAACGCGGCCATCAACATCCAAGTCGCCGGGAGTGC
CCCGGAGACGTTAAACGCGCGTGGAGGAAGCGTAGGACAGACCCGCCTTGAGTGCGGGACAATGCGGCATCCGGCGAAACGCGAACCAAGCGGCGGCGAC
AGTCGCGTGAGACTTGGAGCTGGTCTCGGCAACGAGGCCATGCAGATGACTTCGCTCTAGCGACAAGCTAAAACAAAGTCATCTACAACGGACGAACCAG
CCG
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
594 bp | 197 aa | 58 | 651 | + | No |
Chemistry : Serine
ORF sequence :
MRVREWARREGFNEQTVWQWCRENRMPVPFERMSTGTIIIHDPKYESQPVAPTANGRTVCYARVSSSDQKDDLTRQADRLRAFAVNMGVEKPEVVTETGS
GMNDKRRKLNRLLADPTVGTLIVEHRDRLARMNAGLVESALKAQGRRVVVVDDTGLDDDLVRDMTEVLTSFCARLYGRRGARHRAEKALEAMRDERV
GMNDKRRKLNRLLADPTVGTLIVEHRDRLARMNAGLVESALKAQGRRVVVVDDTGLDDDLVRDMTEVLTSFCARLYGRRGARHRAEKALEAMRDERV
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1323 bp | 440 aa | 638 | 1960 | + | No |
AG : TnpB
ORF sequence :
MSAYEAVRIRLDPTPRQTRLLESHAGGARFAYNLMLAHVRRQISLGEKPDWTLYAMRRWWNEWKDEIAPWWRENSKEAYGSAFEWLSQALRNWSDSRKGR
RAGRRVGWPKYKSKRSSVPRFAYTTGSFGLIEDDPKALRLPRIGRVHCMENATERVHGRRIVRMTVSRHAGFWYAALTVERPTESVPAKNRKRKNHDRQV
GVDLGVRTLATLSDGTTFPNPRNYVRTQRKLRHAQQSLSRRDRGMSHGCGSKRYNRALERVRRIHARIAAQRADNIGKLTTWLADNYSDISIEDLNVQGM
SHNRRLAKHILDADFHEFRRQLEYKTARAGTRLHVIDRWYPSSKTCSNCGTVKAKLSLSERVYHCEECGLVIDRDVNAAINIQVAGSAPETLNARGGSVG
QTRLECGTMRHPAKREPSGGDSRVRLGAGLGNEAMQMTSL
RAGRRVGWPKYKSKRSSVPRFAYTTGSFGLIEDDPKALRLPRIGRVHCMENATERVHGRRIVRMTVSRHAGFWYAALTVERPTESVPAKNRKRKNHDRQV
GVDLGVRTLATLSDGTTFPNPRNYVRTQRKLRHAQQSLSRRDRGMSHGCGSKRYNRALERVRRIHARIAAQRADNIGKLTTWLADNYSDISIEDLNVQGM
SHNRRLAKHILDADFHEFRRQLEYKTARAGTRLHVIDRWYPSSKTCSNCGTVKAKLSLSERVYHCEECGLVIDRDVNAAINIQVAGSAPETLNARGGSVG
QTRLECGTMRHPAKREPSGGDSRVRLGAGLGNEAMQMTSL
Blast result :
Comments
ISBlo12 contains two overlapping ORFs. Both ORFs have a good homology with IS1535. The limits of the element have been predicted by comparison with the structure of IS607 (imperfect IRs and terminal GC/CG dinucleotides) but they also show some similarities of sequence with the IRs predicted for IS1535.
Coordinates:
ISBlo12 1183262..1185264
Prediction of the IRs:
L-GCAACCAGTTAACTTAATTGCTTGCTTTCGATTGTTTTCGTATGATAT
||| | ||| | | ||||||| | ||
R-CGGCTG-GTT-CGTCCGTTG--TAGATGACTTTGTTTTAGCTTGTCGC
Coordinates:
ISBlo12 1183262..1185264
Prediction of the IRs:
L-GCAACCAGTTAACTTAATTGCTTGCTTTCGATTGTTTTCGTATGATAT
||| | ||| | | ||||||| | ||
R-CGGCTG-GTT-CGTCCGTTG--TAGATGACTTTGTTTTAGCTTGTCGC
References
1] Schell,M.A., Karmirantzou,M., Snel,B., Vilanova,D., Berger,B., Pessi,G., Zwahlen,M.-C., Desiere,F., Bork,P., Delley,M., Pridmore,D. and Arigoni,F. (2002)Proc. Natl. Acad. Sci. U.S.A. 99 (22), 14422-14427