ISEbi1
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Eubacterium biforme | Eubacterium biforme DSM 3989 |
DNA section
IS Length : 2558 bp
Ends
IR Length : 24/30
IRL : GTAAGCGCCAAGTAATATGCTGGATTGAAAATCATTTCAAAAAAATCGAC
IRR : GTAAGCGCCAAATAAAATGATGTAGTTAAATAAATGAAAAAGAATCAGTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
GTAAGCGCCAAGTAATATGCTGGATTGAAAATCATTTCAAAAAAATCGACAATTATATAGAAAATTAGAGTTTCTAGAGTACAGTCTAAAAACTCTATAA
AATGATTGGAGATGATTAGATGAAATTAGAGGATTTAACGCCCAAGCAGAGACTGAAGGCGGAAAAGTGGCTGGCAATCATTCGTGAATGTATAAATAGT
GATCTATCAAACGAAGAATGGTGCGAACAGAATAACGTTAGTATTAAAAGTTATTACTATTACCTGGCAAAACTAAGAAAGATGGCAATTGAACAGATTC
CATACAAAAAAAGAAATAATGTACTTCCTCTGGTTCCTGAATCAAACGACATTGTAGAAATACAGGCTCCTCAATCTACTCCTGTATGCATATCTTCTTC
AGTAATGACGATTTGTCAAGGAAACACAAAAATAGAAATCAATTCTAATTGTCCTGAATGGATGATCCGGGAGCTGTTGAATAAATATGTTAGGTGATAT
TTCTAAAGCAGAACATATCTATATTGCCTGTGGATATACAGATATGAGAAAATCAATTGATGGACTTGCAGCTATTGTACAACAGAATTTCAAACTGGAT
CCTTTTTCGAATTCACTGTTTCTCTTTTGTGGAAGGAGCAGTACAAAAATGAAAGTCTTGTACTGGGAAGGTGACGGATTTGTTCTTTTGTATAAGCGTC
TTGAAAATGGAAAATTCATATGGCCAAGAAATAAAGATGAAGTAATGAAAATAACAGATCAGCAGTTCAGATGGCTTTTAGAAGGACTTAAAATAGAGCA
GTCACGAACAATAAAAAAGAGTAATAAAAGATGTGCTATTTAGCTTTATATTAAGGATATTTTGCATTAAATATGGTATAATATAGACAGTTAAGGAGTG
CTTCGAATGATTGAAAATGCAGGTATCGACTACAAAGAATTATATCTACAGATGCAGTCAGCAGTTGTAGCACTATCAAAGACATTGGAAGAAATCCAAA
AAGAAAACAAGAATCTAAAGGAAGAAAATGAATATTTAAAGCGCAAGCTGTTTGGAACAAAAAGTGAAACATCAAAGTCACTTGGTTTTGAGCAGCTTTC
TCTTTTTGATGAGGCTGAAGTAGAAGCAGATCCAGACGAGGAACAATTTATTCTTGAAGAAGTCAAATTCAATAAAAAGAAAAAGTATAAAGGACAGTTG
GATGATAAACTATCTAAATTGCCACATATCGAAGTTATTATGACACTTCCTGAATCTGAACTTGTATGTCCAGTCTGTAACAGTAAACTTGTCTCTATCG
GAAAGAAATTTGTTCGACACGAAATTGAATTTGTACCGGCCAAGTTAAAAGTAAGAGATATATATACAACGACATATGAGTGCAGAAAATGTCGTGCAAA
TGGAAAGAGTGTAATGAAAAGTCCTGGAATTCCAGAACCTGTCATTCCACATTCTTATGCCTCTGCTGAAAGTGTAGCTTTCGTTATGAAACAAAAGTTC
GTAAATGGCGTTCCACTCTATAGACAGGAATCCGAATGGAAACAAATGGGTCTAGATCTTTCAAGAACGACAATGGCTAACTGGATCATATACTCAAGTG
AGCACTGGTTAAAACCACTGACAGACAGAATGCATGAAATCCTGCTTGAATCAAAACATGCCCATGCCGATGAAACACCTGTTCAAGTTCTAAATGAACC
TGGTAAAAAGGCAACAACTAAATCCTATATGTGGGTATATTCAAGCATCAAAGAGAGTGCATATCCAATCCGACTATTTGTATACGCACCAAACAGATGC
GGATATAATCCACAGATATTCTTCAACGGTTTTCACGGAACTGTTATATCAGATGCATATTCCGGATATAACAATATAGAAGGCTGTGTCAATGCATACT
GCTGGGCACATGCAAGAAGAAAGTTCAGAGACAGCCTTCCAAATGATCTCGAAAATGCAGAAAATACGCTTCCGATTATTGCGATGAATAAGATCAAGAA
ACTCTTCGCAATAGAAAAAGAAATAGAAGCGTTGTCACCTGATAAAAAGGTCAAAATACGTCAGCTAAAGTCAAAGCCATTAATAGATGACTTCTTTTCA
TGGTGCAAATCCAATCAAAATGCCACAGCTAGTGCAAAGCTGAGCAGAGCATTTAAATATGCTCTAAATCATGAAGAAGGATTGAGACAATACTTATATG
ATGGATATATTCCAATGACAAACTCATTAGATGAAAGAGTAATTCGTCCTTTTACGACAGGAAGAAAGAACTGGTTATTCTCGGCAAGCGTATCAGGAGC
AGAATCAAGTGCTAATGCATACAGTATAATTGAAACAGCAAAAGCAAATGGATTGGATCCATATAAATATTTGACAACAATATTTACATATCTTCCAAGC
CAAGATCTAATCAAAAATCCAGAAATAATAGACGAGTTTCTACCATGGAGTGAGTTCATACAAAAGAATTGTAAATAAAAGAATCAGCATACCATTTTAG
CAATGATAAACTGATTCTTTTTCATTTATTTAACTACATCATTTTATTTGGCGCTTAC
AATGATTGGAGATGATTAGATGAAATTAGAGGATTTAACGCCCAAGCAGAGACTGAAGGCGGAAAAGTGGCTGGCAATCATTCGTGAATGTATAAATAGT
GATCTATCAAACGAAGAATGGTGCGAACAGAATAACGTTAGTATTAAAAGTTATTACTATTACCTGGCAAAACTAAGAAAGATGGCAATTGAACAGATTC
CATACAAAAAAAGAAATAATGTACTTCCTCTGGTTCCTGAATCAAACGACATTGTAGAAATACAGGCTCCTCAATCTACTCCTGTATGCATATCTTCTTC
AGTAATGACGATTTGTCAAGGAAACACAAAAATAGAAATCAATTCTAATTGTCCTGAATGGATGATCCGGGAGCTGTTGAATAAATATGTTAGGTGATAT
TTCTAAAGCAGAACATATCTATATTGCCTGTGGATATACAGATATGAGAAAATCAATTGATGGACTTGCAGCTATTGTACAACAGAATTTCAAACTGGAT
CCTTTTTCGAATTCACTGTTTCTCTTTTGTGGAAGGAGCAGTACAAAAATGAAAGTCTTGTACTGGGAAGGTGACGGATTTGTTCTTTTGTATAAGCGTC
TTGAAAATGGAAAATTCATATGGCCAAGAAATAAAGATGAAGTAATGAAAATAACAGATCAGCAGTTCAGATGGCTTTTAGAAGGACTTAAAATAGAGCA
GTCACGAACAATAAAAAAGAGTAATAAAAGATGTGCTATTTAGCTTTATATTAAGGATATTTTGCATTAAATATGGTATAATATAGACAGTTAAGGAGTG
CTTCGAATGATTGAAAATGCAGGTATCGACTACAAAGAATTATATCTACAGATGCAGTCAGCAGTTGTAGCACTATCAAAGACATTGGAAGAAATCCAAA
AAGAAAACAAGAATCTAAAGGAAGAAAATGAATATTTAAAGCGCAAGCTGTTTGGAACAAAAAGTGAAACATCAAAGTCACTTGGTTTTGAGCAGCTTTC
TCTTTTTGATGAGGCTGAAGTAGAAGCAGATCCAGACGAGGAACAATTTATTCTTGAAGAAGTCAAATTCAATAAAAAGAAAAAGTATAAAGGACAGTTG
GATGATAAACTATCTAAATTGCCACATATCGAAGTTATTATGACACTTCCTGAATCTGAACTTGTATGTCCAGTCTGTAACAGTAAACTTGTCTCTATCG
GAAAGAAATTTGTTCGACACGAAATTGAATTTGTACCGGCCAAGTTAAAAGTAAGAGATATATATACAACGACATATGAGTGCAGAAAATGTCGTGCAAA
TGGAAAGAGTGTAATGAAAAGTCCTGGAATTCCAGAACCTGTCATTCCACATTCTTATGCCTCTGCTGAAAGTGTAGCTTTCGTTATGAAACAAAAGTTC
GTAAATGGCGTTCCACTCTATAGACAGGAATCCGAATGGAAACAAATGGGTCTAGATCTTTCAAGAACGACAATGGCTAACTGGATCATATACTCAAGTG
AGCACTGGTTAAAACCACTGACAGACAGAATGCATGAAATCCTGCTTGAATCAAAACATGCCCATGCCGATGAAACACCTGTTCAAGTTCTAAATGAACC
TGGTAAAAAGGCAACAACTAAATCCTATATGTGGGTATATTCAAGCATCAAAGAGAGTGCATATCCAATCCGACTATTTGTATACGCACCAAACAGATGC
GGATATAATCCACAGATATTCTTCAACGGTTTTCACGGAACTGTTATATCAGATGCATATTCCGGATATAACAATATAGAAGGCTGTGTCAATGCATACT
GCTGGGCACATGCAAGAAGAAAGTTCAGAGACAGCCTTCCAAATGATCTCGAAAATGCAGAAAATACGCTTCCGATTATTGCGATGAATAAGATCAAGAA
ACTCTTCGCAATAGAAAAAGAAATAGAAGCGTTGTCACCTGATAAAAAGGTCAAAATACGTCAGCTAAAGTCAAAGCCATTAATAGATGACTTCTTTTCA
TGGTGCAAATCCAATCAAAATGCCACAGCTAGTGCAAAGCTGAGCAGAGCATTTAAATATGCTCTAAATCATGAAGAAGGATTGAGACAATACTTATATG
ATGGATATATTCCAATGACAAACTCATTAGATGAAAGAGTAATTCGTCCTTTTACGACAGGAAGAAAGAACTGGTTATTCTCGGCAAGCGTATCAGGAGC
AGAATCAAGTGCTAATGCATACAGTATAATTGAAACAGCAAAAGCAAATGGATTGGATCCATATAAATATTTGACAACAATATTTACATATCTTCCAAGC
CAAGATCTAATCAAAAATCCAGAAATAATAGACGAGTTTCTACCATGGAGTGAGTTCATACAAAAGAATTGTAAATAAAAGAATCAGCATACCATTTTAG
CAATGATAAACTGATTCTTTTTCATTTATTTAACTACATCATTTTATTTGGCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
378 bp | 125 aa | 120 | 497 | + | No |
AG : IS66 TnpA
ORF sequence :
MKLEDLTPKQRLKAEKWLAIIRECINSDLSNEEWCEQNNVSIKSYYYYLAKLRKMAIEQIPYKKRNNVLPLVPESNDIVEIQAPQSTPVCISSSVMTICQ
GNTKIEINSNCPEWMIRELLNKYVR
GNTKIEINSNCPEWMIRELLNKYVR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
357 bp | 118 aa | 487 | 843 | + | No |
AG : IS66 TnpB
ORF sequence :
MLGDISKAEHIYIACGYTDMRKSIDGLAAIVQQNFKLDPFSNSLFLFCGRSSTKMKVLYWEGDGFVLLYKRLENGKFIWPRNKDEVMKITDQQFRWLLEG
LKIEQSRTIKKSNKRCAI
LKIEQSRTIKKSNKRCAI
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1572 bp | 523 aa | 907 | 2478 | + | No |
Chemistry : DDE
ORF sequence :
MIENAGIDYKELYLQMQSAVVALSKTLEEIQKENKNLKEENEYLKRKLFGTKSETSKSLGFEQLSLFDEAEVEADPDEEQFILEEVKFNKKKKYKGQLDD
KLSKLPHIEVIMTLPESELVCPVCNSKLVSIGKKFVRHEIEFVPAKLKVRDIYTTTYECRKCRANGKSVMKSPGIPEPVIPHSYASAESVAFVMKQKFVN
GVPLYRQESEWKQMGLDLSRTTMANWIIYSSEHWLKPLTDRMHEILLESKHAHADETPVQVLNEPGKKATTKSYMWVYSSIKESAYPIRLFVYAPNRCGY
NPQIFFNGFHGTVISDAYSGYNNIEGCVNAYCWAHARRKFRDSLPNDLENAENTLPIIAMNKIKKLFAIEKEIEALSPDKKVKIRQLKSKPLIDDFFSWC
KSNQNATASAKLSRAFKYALNHEEGLRQYLYDGYIPMTNSLDERVIRPFTTGRKNWLFSASVSGAESSANAYSIIETAKANGLDPYKYLTTIFTYLPSQD
LIKNPEIIDEFLPWSEFIQKNCK
KLSKLPHIEVIMTLPESELVCPVCNSKLVSIGKKFVRHEIEFVPAKLKVRDIYTTTYECRKCRANGKSVMKSPGIPEPVIPHSYASAESVAFVMKQKFVN
GVPLYRQESEWKQMGLDLSRTTMANWIIYSSEHWLKPLTDRMHEILLESKHAHADETPVQVLNEPGKKATTKSYMWVYSSIKESAYPIRLFVYAPNRCGY
NPQIFFNGFHGTVISDAYSGYNNIEGCVNAYCWAHARRKFRDSLPNDLENAENTLPIIAMNKIKKLFAIEKEIEALSPDKKVKIRQLKSKPLIDDFFSWC
KSNQNATASAKLSRAFKYALNHEEGLRQYLYDGYIPMTNSLDERVIRPFTTGRKNWLFSASVSGAESSANAYSIIETAKANGLDPYKYLTTIFTYLPSQD
LIKNPEIIDEFLPWSEFIQKNCK
Blast result :
Comments
Genome in progress. Accession number: NZ_ABYT01000129.
ISEbi1 is 52%(orfA), 85%(orfB), and 64%(orfC) aa similar to ISCth11.
ISEbi1 is 52%(orfA), 85%(orfB), and 64%(orfC) aa similar to ISCth11.
References
1] ISfinder annotation (2009).
2] Fulton,L., Clifton,S., Fulton,B., Xu,J., Minx,P., Pepin,K.H., Johnson,M., Bhonagiri,V., Nash,W.E., Mardis,E.R. and Wilson,R.K.(2008) Direct submission GenBank.
2] Fulton,L., Clifton,S., Fulton,B., Xu,J., Minx,P., Pepin,K.H., Johnson,M., Bhonagiri,V., Nash,W.E., Mardis,E.R. and Wilson,R.K.(2008) Direct submission GenBank.