ISBlo1
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004307 | ND | Bifidobacterium longum | Bifidobacterium longum NCC2705 |
DNA section
IS Length : 2590 bp
Ends
IR Length : 47/68
IRL : TGTTTTGCTTCAATGTTTGTTCGACAGGTTCGGCAGTGTTTTTCCGCAGT
IRR : TGTGTTTTAGCAAGTTGGCGTCCGCTTTGTGAGCGGTGTTTTTCCGCAGT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGTCGGGGCT | GGCCGGGA | ATGGTCCCGG | 8 |
cctgcggctt | CCGTGa/gC | atggactact | 9 |
TGCGTCGGGT | TATGAG | ATCAGGGCGC | 6 |
GACGAACAGG | AGCATC | GCCGGCCACG | 6 |
DNA sequence
TGTTTTGCTTCAATGTTTGTTCGACAGGTTCGGCAGTGTTTTTCCGCAGTGTGGGCTGATGGTTTTCCCCGGTGCCGTCATCGTTTCCCGGTGATGTGAG
AGGCATCGGTGCCGCCGCCTGTATATTCCCGAGTTGCCAATCGGTTTTGATTCGGCCGGTTGCCGCCGGCCGGCGCTCGGGAAGGAAAGGAACATGGCGA
TACCGATGCCCATTGTGCAAGATATCAGGAGACTCGACCGGCAGGGAATGTCGCGCGCGCAGATAGCGCGTCGTCTGCATGTGGATCGCGGGACGGTCGC
GAAGTACGCGGATATGGAGGATTGCTCGCCCAAGCCGAAGGCGGATCGCAGGTACGGGTCGAAGATCGACCCGTACGCGCATCTGGTGGACGGGTGGCTG
GAGGCCGATCGTCTGCTGCCCAGGAAGCAGCGGCACACGATCAGGCGCGTGCACGACCGTCTGCTGGCGGAGACGGACTACGACGGCGAGTATTCGACCA
CGATGCGTTACGTGCACCGGTGGCGCGAGGCGAACCGCGGCGTGCCGGATCGCGAGGGGTACGTGCGGCTCGAGTGGGCGGCGGGCAGCATGCAGGTCGA
TTTCGGCGTGGCCCGGGCCCGGATCGCTGGCGAGATGGCGGACGTGCATTGCCTGGTGGTCTCGTTGCCGTATTCGAACATGCGGTTGTGCGTGGCGTTG
CCGGGCGAGAACGCGGAGTGTCTGTGCCATGGCCTGATGCTCGTGTTCGAGCATATCGGGGGTGTTCCTCCCGTGATCGTGATGGACAACGCGACCGGTG
CCGGCCGGCGCAACGCGAAGGGCGAGGTCACGTTGACCGGGGTGTTCTCCGCGTTCGTGGCGCATTACCGGCTCGAGGTCCGGTTCTGCAACCCGTACTC
GGGCAACGAGAAGGGCAGCGTGGAGAACGCGGTCGGGTTTTTGCGGCGCAATCTCATGGTGCCGCCCATGCACGCGGAATCCTACGGGCAGCTCAGTCGT
TTCATGCTTGAACGGTGCGACGGGCTGGCGGCCTCTTCGTATTGCCCGAGGTTGCCGGGCGTGCCCGTGGCCGAGGTGTTCGACGAGGAGAGGGCCGCGT
TGATGCCGTTGCCGTCCACGGCGTTCGACCCGGTTCGCTGGGAAAGCCGGACGGCCGACAAGTACGGGCTGGTCGACATCGACTCGAACCGGTACCTCGC
CGGCCCCGATTCGGCGCGTTCCAAGGTGCTGGCCGCGATCCGGTGGGACACGGTCACGCTCGCATCGCCCGCCACCGGCGAGCTCCTCGCGGAATATCCC
AGACAGTACGGACGGTCGCGCAATGTGGAGGATCCCGCGCTCGTGCTTCCCCGGCTCGCGGTCAAACCCCGCGCGTGGCGGGAAAGCTCGATCCGCCCGG
ACGTGCCCGACGATATACGCGCGTGGCTTGATTCCATGGACGAAAAGACGTTGAGGGAGAGCCTCAAAGCGATCGGGGACGCGTGCCGGGCGGCCGGGTT
CGATCCCGCGATGCAGGCGTGCGGCGAGATCCTGCGCTCGAACAGGGACATGGGCCTGCACGCGGACTCGCTCACCCCTATCGCGTTGCGCATGCGCGAC
GGCGAGTGGGAATACCCCGGCGGGATCGAGGAGCCCGACCTGAGCGGCTACGACCGGTTCATCACCGGCACGGACGACGGAGGGGAAGAACGGTGAGCGT
CAGACCCGATCCGGTGATCCCGGAGACCCGACGCAGGCGCGCGTCCACGACCGAGAAAAGCGAACGCATCCTGAAGATGAGCCGCAGCCTGACCCTGACA
CGCAGCGTGCTCGCCGGCACGCTCGCCGAGGCCACGCCCAACCAGCTCGACTTCATCGAACGATGGTTCACGGCCGAACTCGACTCGCGCGAGCGATCCA
AACGCCTGCGCCTGCTCAAACAGGCCGGCTTCCCCGCCGACAAGACCCTCGACGGCTATGACTGGACCAACCTGAAGATGCCCGCCGACTGGGGGCGCGC
GCAGCTCGAGAACCTCGACTTCGTCGCCGGATGCGAGGACCTCGTGCTCTACGGGCCCGTCGGCACCGGCAAGAGCCACCTGGCCATCGCGATCGGACGG
CTCGCCTGCGAGCGGGGCGTCCCGGTGCGCTTCTTCACCGCGACCGGACTGCTCATGCGTCTGCGCCGCGCCCAGCAGGAGAACCGGCTCGACCGGGAAC
TCGCGAGCATCGGCAAGGCCCGACTTCTCATCATCGACGAGTTCGGCTACCTGCCCATAGACGAGGAGGGCAGCAGGCTCCTCTTCCAGATCATTTCCGA
TTCCTACGAGACAAGGAGCATCATCTACACCACCAACATCGAATTCAGCGGATGGGGACGCGTGCTCGGCGACAAGAACATGGCCGCCGCCCTCATCGAC
CGCACCGTCCACCACGGACGGCTCATCAGATTCGAGGGCCGCTCCTACCGAAGCGAACACGCCCTCATGACCAAATAACCAACACAACGCAGACAGGCAG
CGGCCGATACCGCACACCCTGCGGAAAACCCGCTGCCTACACTGCGGAAAAACACCGCTCACAAAGCGGACGCCAACTTGCTAAAACACA
AGGCATCGGTGCCGCCGCCTGTATATTCCCGAGTTGCCAATCGGTTTTGATTCGGCCGGTTGCCGCCGGCCGGCGCTCGGGAAGGAAAGGAACATGGCGA
TACCGATGCCCATTGTGCAAGATATCAGGAGACTCGACCGGCAGGGAATGTCGCGCGCGCAGATAGCGCGTCGTCTGCATGTGGATCGCGGGACGGTCGC
GAAGTACGCGGATATGGAGGATTGCTCGCCCAAGCCGAAGGCGGATCGCAGGTACGGGTCGAAGATCGACCCGTACGCGCATCTGGTGGACGGGTGGCTG
GAGGCCGATCGTCTGCTGCCCAGGAAGCAGCGGCACACGATCAGGCGCGTGCACGACCGTCTGCTGGCGGAGACGGACTACGACGGCGAGTATTCGACCA
CGATGCGTTACGTGCACCGGTGGCGCGAGGCGAACCGCGGCGTGCCGGATCGCGAGGGGTACGTGCGGCTCGAGTGGGCGGCGGGCAGCATGCAGGTCGA
TTTCGGCGTGGCCCGGGCCCGGATCGCTGGCGAGATGGCGGACGTGCATTGCCTGGTGGTCTCGTTGCCGTATTCGAACATGCGGTTGTGCGTGGCGTTG
CCGGGCGAGAACGCGGAGTGTCTGTGCCATGGCCTGATGCTCGTGTTCGAGCATATCGGGGGTGTTCCTCCCGTGATCGTGATGGACAACGCGACCGGTG
CCGGCCGGCGCAACGCGAAGGGCGAGGTCACGTTGACCGGGGTGTTCTCCGCGTTCGTGGCGCATTACCGGCTCGAGGTCCGGTTCTGCAACCCGTACTC
GGGCAACGAGAAGGGCAGCGTGGAGAACGCGGTCGGGTTTTTGCGGCGCAATCTCATGGTGCCGCCCATGCACGCGGAATCCTACGGGCAGCTCAGTCGT
TTCATGCTTGAACGGTGCGACGGGCTGGCGGCCTCTTCGTATTGCCCGAGGTTGCCGGGCGTGCCCGTGGCCGAGGTGTTCGACGAGGAGAGGGCCGCGT
TGATGCCGTTGCCGTCCACGGCGTTCGACCCGGTTCGCTGGGAAAGCCGGACGGCCGACAAGTACGGGCTGGTCGACATCGACTCGAACCGGTACCTCGC
CGGCCCCGATTCGGCGCGTTCCAAGGTGCTGGCCGCGATCCGGTGGGACACGGTCACGCTCGCATCGCCCGCCACCGGCGAGCTCCTCGCGGAATATCCC
AGACAGTACGGACGGTCGCGCAATGTGGAGGATCCCGCGCTCGTGCTTCCCCGGCTCGCGGTCAAACCCCGCGCGTGGCGGGAAAGCTCGATCCGCCCGG
ACGTGCCCGACGATATACGCGCGTGGCTTGATTCCATGGACGAAAAGACGTTGAGGGAGAGCCTCAAAGCGATCGGGGACGCGTGCCGGGCGGCCGGGTT
CGATCCCGCGATGCAGGCGTGCGGCGAGATCCTGCGCTCGAACAGGGACATGGGCCTGCACGCGGACTCGCTCACCCCTATCGCGTTGCGCATGCGCGAC
GGCGAGTGGGAATACCCCGGCGGGATCGAGGAGCCCGACCTGAGCGGCTACGACCGGTTCATCACCGGCACGGACGACGGAGGGGAAGAACGGTGAGCGT
CAGACCCGATCCGGTGATCCCGGAGACCCGACGCAGGCGCGCGTCCACGACCGAGAAAAGCGAACGCATCCTGAAGATGAGCCGCAGCCTGACCCTGACA
CGCAGCGTGCTCGCCGGCACGCTCGCCGAGGCCACGCCCAACCAGCTCGACTTCATCGAACGATGGTTCACGGCCGAACTCGACTCGCGCGAGCGATCCA
AACGCCTGCGCCTGCTCAAACAGGCCGGCTTCCCCGCCGACAAGACCCTCGACGGCTATGACTGGACCAACCTGAAGATGCCCGCCGACTGGGGGCGCGC
GCAGCTCGAGAACCTCGACTTCGTCGCCGGATGCGAGGACCTCGTGCTCTACGGGCCCGTCGGCACCGGCAAGAGCCACCTGGCCATCGCGATCGGACGG
CTCGCCTGCGAGCGGGGCGTCCCGGTGCGCTTCTTCACCGCGACCGGACTGCTCATGCGTCTGCGCCGCGCCCAGCAGGAGAACCGGCTCGACCGGGAAC
TCGCGAGCATCGGCAAGGCCCGACTTCTCATCATCGACGAGTTCGGCTACCTGCCCATAGACGAGGAGGGCAGCAGGCTCCTCTTCCAGATCATTTCCGA
TTCCTACGAGACAAGGAGCATCATCTACACCACCAACATCGAATTCAGCGGATGGGGACGCGTGCTCGGCGACAAGAACATGGCCGCCGCCCTCATCGAC
CGCACCGTCCACCACGGACGGCTCATCAGATTCGAGGGCCGCTCCTACCGAAGCGAACACGCCCTCATGACCAAATAACCAACACAACGCAGACAGGCAG
CGGCCGATACCGCACACCCTGCGGAAAACCCGCTGCCTACACTGCGGAAAAACACCGCTCACAAAGCGGACGCCAACTTGCTAAAACACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1503 bp | 500 aa | 194 | 1696 | + | No |
Chemistry : DDE
ORF sequence :
MAIPMPIVQDIRRLDRQGMSRAQIARRLHVDRGTVAKYADMEDCSPKPKADRRYGSKIDPYAHLVDGWLEADRLLPRKQRHTIRRVHDRLLAETDYDGEY
STTMRYVHRWREANRGVPDREGYVRLEWAAGSMQVDFGVARARIAGEMADVHCLVVSLPYSNMRLCVALPGENAECLCHGLMLVFEHIGGVPPVIVMDNA
TGAGRRNAKGEVTLTGVFSAFVAHYRLEVRFCNPYSGNEKGSVENAVGFLRRNLMVPPMHAESYGQLSRFMLERCDGLAASSYCPRLPGVPVAEVFDEER
AALMPLPSTAFDPVRWESRTADKYGLVDIDSNRYLAGPDSARSKVLAAIRWDTVTLASPATGELLAEYPRQYGRSRNVEDPALVLPRLAVKPRAWRESSI
RPDVPDDIRAWLDSMDEKTLRESLKAIGDACRAAGFDPAMQACGEILRSNRDMGLHADSLTPIALRMRDGEWEYPGGIEEPDLSGYDRFITGTDDGGEER
STTMRYVHRWREANRGVPDREGYVRLEWAAGSMQVDFGVARARIAGEMADVHCLVVSLPYSNMRLCVALPGENAECLCHGLMLVFEHIGGVPPVIVMDNA
TGAGRRNAKGEVTLTGVFSAFVAHYRLEVRFCNPYSGNEKGSVENAVGFLRRNLMVPPMHAESYGQLSRFMLERCDGLAASSYCPRLPGVPVAEVFDEER
AALMPLPSTAFDPVRWESRTADKYGLVDIDSNRYLAGPDSARSKVLAAIRWDTVTLASPATGELLAEYPRQYGRSRNVEDPALVLPRLAVKPRAWRESSI
RPDVPDDIRAWLDSMDEKTLRESLKAIGDACRAAGFDPAMQACGEILRSNRDMGLHADSLTPIALRMRDGEWEYPGGIEEPDLSGYDRFITGTDDGGEER
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
786 bp | 261 aa | 1693 | 2478 | + | No |
AG : IS21 helper
ORF sequence :
MSVRPDPVIPETRRRRASTTEKSERILKMSRSLTLTRSVLAGTLAEATPNQLDFIERWFTAELDSRERSKRLRLLKQAGFPADKTLDGYDWTNLKMPADW
GRAQLENLDFVAGCEDLVLYGPVGTGKSHLAIAIGRLACERGVPVRFFTATGLLMRLRRAQQENRLDRELASIGKARLLIIDEFGYLPIDEEGSRLLFQI
ISDSYETRSIIYTTNIEFSGWGRVLGDKNMAAALIDRTVHHGRLIRFEGRSYRSEHALMTK
GRAQLENLDFVAGCEDLVLYGPVGTGKSHLAIAIGRLACERGVPVRFFTATGLLMRLRRAQQENRLDRELASIGKARLLIIDEFGYLPIDEEGSRLLFQI
ISDSYETRSIIYTTNIEFSGWGRVLGDKNMAAALIDRTVHHGRLIRFEGRSYRSEHALMTK
Blast result :
Comments
Four copies of this element are found in the genome of NCC2705. Amongst them, one is split in two pieces by the insertion of two ISBlo2 elements. This last copy is largely altered (in addition of a deletion of approx. 60 bp).
Coordinates:
ISBlo1a complement(170873..173462)
ISBlo1b 192151..194740
ISBlo1c 609803..612392
ISBlo1d (complement(298059..299931))join(complement(1522427..1522935))
DR :
ISBlo1a 8 bp
ISBlo1b 7 bp, one mismatch but the sequence of the protein truncated by the insertion is consistent with a simple insertion
ISBlo1c 6 bp
ISBlo1d 6 bp
Identity between IS copies:
ISBlo1a 100%
ISBlo1b 98.5%
ISBlo1c 99.7%
ISBlo1d 63.3% (it is debatable if this copy is really ISBlo1, but as it is truncated this is not fundamental)
Coordinates:
ISBlo1a complement(170873..173462)
ISBlo1b 192151..194740
ISBlo1c 609803..612392
ISBlo1d (complement(298059..299931))join(complement(1522427..1522935))
DR :
ISBlo1a 8 bp
ISBlo1b 7 bp, one mismatch but the sequence of the protein truncated by the insertion is consistent with a simple insertion
ISBlo1c 6 bp
ISBlo1d 6 bp
Identity between IS copies:
ISBlo1a 100%
ISBlo1b 98.5%
ISBlo1c 99.7%
ISBlo1d 63.3% (it is debatable if this copy is really ISBlo1, but as it is truncated this is not fundamental)
References
1] Schell,M.A., Karmirantzou,M., Snel,B., Vilanova,D., Berger,B., Pessi,G., Zwahlen,M.-C., Desiere,F., Bork,P., Delley,M., Pridmore,D. and Arigoni,F. (2002)Proc. Natl. Acad. Sci. U.S.A. 99 (22), 14422-14427