ISRel24
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_007762 | ND | Rhizobium etli | Rhizobium etli CFN 42 plasmid p42a |
DNA section
IS Length : 2837 bp
Ends
IR Length : 19/21
IRL : GTATCCATCCGGCGAAGGCCGTCTGTCTTGATTTGCTGATTGGCTTTAAG
IRR : GTAACCATCCGGCCAAGGCCGCGGGGCTTATGATTACGGCTCGTCTGCCA
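The "19/21" figure can be checked directly against the two terminal sequences listed above. The sketch below assumes that "19/21" means 19 identical positions over the first 21 nucleotides, and that IRR is shown as the reverse complement of the right end of the element. The helper names and the `LAST_LINE` constant (the final 37 nt of the DNA sequence in this entry) are illustrative only.

```python
# Terminal sequences copied from this entry.
IRL = "GTATCCATCCGGCGAAGGCCGTCTGTCTTGATTTGCTGATTGGCTTTAAG"
IRR = "GTAACCATCCGGCCAAGGCCGCGGGGCTTATGATTACGGCTCGTCTGCCA"
# Final 37 nt of the DNA sequence given in this entry.
LAST_LINE = "GTAATCATAAGCCCCGCGGCCTTGGCCGGATGGTTAC"


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def count_matches(a, b, n):
    """Count identical positions over the first n nucleotides of a and b."""
    return sum(x == y for x, y in zip(a[:n], b[:n]))


ir_span = 21  # assumed meaning of the denominator in "19/21"
matches = count_matches(IRL, IRR, ir_span)
print(f"{matches}/{ir_span}")  # 19/21

# IRR, as listed, starts with the reverse complement of the element's 3' end.
print(reverse_complement(LAST_LINE) == IRR[:len(LAST_LINE)])  # True
```

Under these assumptions the two mismatches fall at positions 4 and 14 of the repeats.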
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AGGCCTCATC | | GAACTATCTC | 0 |
DNA sequence
GTATCCATCCGGCGAAGGCCGTCTGTCTTGATTTGCTGATTGGCTTTAAGTCCATACCTTATGTCATGCGCTACAATCATTTCCCCCATTGAGGTTCTGT
CCGTCGACGATCTTGGTCGCCGTCGAGATTGGTCGGATGAAGAGAAGGTGCGGATTGTCGAAGAAAGTCTGCACGGATACCGGCAGGGTTCGGCGACAGC
ACGGCGTTATGGATTGTCGCGGTCATTGTTGACGACCTGGCGGCGGGAGTGCCGAAGCGGGCTTCTGAGCGTTTCCGCATCAACGAGCTTCGTGCCGCTT
TCGATTTTGCCGCCGCCTGCAGCATCTTCCGAGATGATGGCTCCACTCCAGGCGGACGGCGATAAGTTGATCGAGATCGGCCTGCCGAACGGCCGACGAC
TGATGATCCCAGCCTCGCTTGATCCGACCATTCTTGCCCGCCTGTTGCCCGTCGTGGATGGGTCATGATCGCGTTTCCCGCTGGTGTGAAGGTCTGGATC
GCGGGTGGCGTGACGGACATGCGTTGCGGCATGAACAGCCTGGCGCTGAAGGTCCAGCAAGGCCTTGGCCGTGGGACCCTGATGGCGGTGAGGTCTTCTG
CTTCCGGGGTCGCAAGGGTGACCTGATCAAGGTCCTCTGGCATGACGGCGTCGGCATGTCGCTTTACCTGAAGCGGCTGGAAGCTGGAAAGTTCATCTGG
CCGGTCAGCCAGAATGGCTCCGCCGTGCCTGTATCGTCGGCGCAGCTCGGCTATCTCCTGGAAGGGATCGACTGGCGCAACCCGCGCTGGACGCAGCGGC
CTTCGAAGGCAGGCTGGCCGCCTGCATTCCTCTGTTTTTGTTGGGCTTTCGGCATGCGGCATGGTAGCTTTCGGCCATGGATGATGCTGCTTCGGAGATA
GCCAGACTGCGCGCCGCGCTTGCGGCATCGGAAGCGCGTGCCGCCTCTGCCGAGGCCGACCTCGCACAGGTGCGCGCGGTCGTGACGACGTCTGAGGCGA
TGATCCGGCATCTCAAGCTCGAGATCGCCAAGATACGTCGCGAGCAGTACGGCCAGAGCTCGGAGCGCCGCGCCCGGCTGATCGAGCAAATGAAATTGCA
GCTCCAAGAACTTGAAGCCGACGCCACCGAAGACGAGATCGCTGCGGAACGCGTGGCGACGAGAATCACAAATGTCTCCGCTTTCGAACGCCGCCGGCCG
GCCCGCAAGCCGTTTCCCGAGCACCTGCCGCGCGAGCGCCTGGTCATCGATGCCCCGTCGACCTGCACCTGCTGCGGCTCGCCCCGCATCGTGAAGATGG
GCGAAGACATCACCGAGACGCTGGAGATCATTCCGCGCCAGTGGAAGGTGATCCAGACGGTTCGCGAGAAGTTCACCTGTAGGGACTGCGAGAAGATCAG
CCAGCCACCGGCCCCTTTCCATGCGACACCGCGGGGATGGGCAGGACCGCACCTGCTGGCGACGATCCTGTTCGAGAAGTTCGGCCAACATCAGCCATTG
AACCGCCAGGCTGAGCGCTACGCCAGGGAAGGCGTCGCTCTCAGTCTCTCCACACTGGCCGATCAGGTCGGAGCCTGCACGACGGCCCTGCAGCCGATCC
ATGACCTGATCCGTGCCCATGTTCTGGCCGCCGAGCGGCTGCATGGTGACGACACCACCGTGCCGCTTCTGGCCAGGGGAGCAACAAAGCAGGCGAGGCT
CTGGACTTACGTCCGCGATGACCGCCCTTTCGCGGGCGGCGCGCCTCCCGCCGCACTCTTCCACTTCTCTCCCGATCGCGAGAAGACCCACCCCAACACG
CATCTCGCCGGATGGCACGGCACCCTGCAAGCCGATGCCTATGGCGGCTACAACGACCTCTATCGTGTCGACCGCCGCCCCGCGCTGGTGATCAGCGCAC
TTTGCTGGAGCCACGCGCGGCGCAAATTCTTCGAACTCGCTGACATCGCCGGCAACGTGCGCAAGGGCAAACCTGCCCACGAGATATCGCCCGTCGCGCT
TGAGGCCGTTGCCCGCATCGACGCGCTCTTCGACATCGAGCGCGGCATAAACGGAATGCCTGCCGAGGATAGGCTCGCAGCGAGGCTGCAACATGCTCGC
CCGCTCGTCGAAGAACTGCACGATTGGCTCATGGCCCAGCGCGGGCAAATGTCGAAGCACAACCCCGTCGCCAAGGCGATCAACTACATGTTCGAGAAGG
AGGGTCGCTGGGAAGCCTTCGCCCGGTTCCTCGACGACGGCAGACTGTGTCTGACGAACAATGCCGCCGAACGAGCCCTGCGCGGCGTTGCTCTCGGAAG
AACGGCATGGCTATTCGCCGGTTCCCAGCGCGGAGGCGAGCGTGCTGCCTTCATGTATTCCCTAATCGTCACGGCAAAGATGAACGATATCGATCCGCAG
GCCTGGTTGGCGGACGTGCTCGCCCGCATGCCTGGCATTCCCGTATCACGGCTGCCGGAACTGTTGCCGTGGAACTTGCCCGCCGGAAGCGCCCGGCAGG
TGGCGGCCTGATGGCGCGCCCGACGCATGTCTACACCATCGAATATGTCGCCACGCTGATCGGTGAGAACCTCGAGTTGCTTGAAGAAATCGCCAGCAAT
TCCGACAACATCGACTACGGCGAGATGATCCATGTCTACGACGGCACCGACGAGGGCATCACAACCTTTACCGACCGCGGCATCGAGAGCCTGCAGGAGT
TCCTTGCCGACGTGCGCTCCTGGGAAGGCGGCGTTCGCCAATTTCTTCTCGACGAGCAACGTGATCCGGAAAAGATCGAACGCATCATGGCAGACGAGCC
GTAATCATAAGCCCCGCGGCCTTGGCCGGATGGTTAC
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
408 bp | 135 aa | 61 | 468 | + | No |
AG : IS66 TnpA
ORF sequence :
MSCATIISPIEVLSVDDLGRRRDWSDEEKVRIVEESLHGYRQGSATARRYGLSRSLLTTWRRECRSGLLSVSASTSFVPLSILPPPAASSEMMAPLQADG
DKLIEIGLPNGRRLMIPASLDPTILARLLPVVDGS
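As a quick consistency check, the start of ORF 1 can be translated from the DNA sequence using the Begin coordinate (61) from the table above. This is a minimal sketch, assuming 1-based coordinates and the standard genetic code (whose amino acid assignments are shared by the bacterial table). `FIRST_LINE` is the first 100 nt of the DNA sequence in this entry; the function and variable names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI layout).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# First 100 nt of the ISRel24 DNA sequence, copied from this entry.
FIRST_LINE = (
    "GTATCCATCCGGCGAAGGCCGTCTGTCTTGATTTGCTGATTGGCTTTAAG"
    "TCCATACCTTATGTCATGCGCTACAATCATTTCCCCCATTGAGGTTCTGT"
)


def translate(dna):
    """Translate complete codons only; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


orf1_begin = 61  # 1-based Begin coordinate from the ORF 1 table
peptide = translate(FIRST_LINE[orf1_begin - 1:])
print(peptide)  # MSCATIISPIEVL
```

The 13 residues obtained match the beginning of the ORF 1 sequence listed above.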
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
118 bp | 145 aa | 458 | 575 | + | No |
AG : IS66 TnpB
ORF sequence :
MIAFPAGVKVWIAGGVTDMRCGMNSLALKVQQGLGRGDPDGGEVFCFRGRKGDLIKVLWHDGVGMSLYLKRLEAGKFIWPVSQNGSAVPVSSAQLGYLLE
GIDWRNPRWTQRPSKAGWPPAFLCFCWAFGMRHGSFRPWMMLLRR
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1635 bp | 544 aa | 877 | 2511 | + | No |
Chemistry : DDE
ORF sequence :
MDDAASEIARLRAALAASEARAASAEADLAQVRAVVTTSEAMIRHLKLEIAKIRREQYGQSSERRARLIEQMKLQLQELEADATEDEIAAERVATRITNV
SAFERRRPARKPFPEHLPRERLVIDAPSTCTCCGSPRIVKMGEDITETLEIIPRQWKVIQTVREKFTCRDCEKISQPPAPFHATPRGWAGPHLLATILFE
KFGQHQPLNRQAERYAREGVALSLSTLADQVGACTTALQPIHDLIRAHVLAAERLHGDDTTVPLLARGATKQARLWTYVRDDRPFAGGAPPAALFHFSPD
REKTHPNTHLAGWHGTLQADAYGGYNDLYRVDRRPALVISALCWSHARRKFFELADIAGNVRKGKPAHEISPVALEAVARIDALFDIERGINGMPAEDRL
AARLQHARPLVEELHDWLMAQRGQMSKHNPVAKAINYMFEKEGRWEAFARFLDDGRLCLTNNAAERALRGVALGRTAWLFAGSQRGGERAAFMYSLIVTA
KMNDIDPQAWLADVLARMPGIPVSRLPELLPWNLPAGSARQVAA
Blast result :
Comments
At the amino acid level, ISRel24 is 57% similar to ISRsp1 (ORFA), 72% similar to IS66 (ORFB) and 86% similar to IS66-1 (ORFC, the transposase).
The second ORF was reconstructed in silico: there is a non-programmed frameshift at position 575, which may be a sequencing error.
The IS was reconstructed in silico by deleting ISRel19 (an IS66 family member) and one of the direct repeats generated by its insertion into ISRel24.
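The effect of the frameshift can be seen in the ORF coordinates listed in the Protein section. The sketch below assumes that Begin and End are 1-based and inclusive, and that the stated amino acid length excludes the stop codon. The `ORFS` list simply restates the table values; the function names are illustrative.

```python
# (name, begin, end, protein length in aa), copied from the ORF tables of this entry.
ORFS = [
    ("ORF 1", 61, 468, 135),
    ("ORF 2", 458, 575, 145),
    ("ORF 3", 877, 2511, 544),
]


def span_nt(begin, end):
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - begin + 1


def expected_nt(aa_length):
    """Nucleotides needed to encode aa_length residues plus one stop codon."""
    return 3 * (aa_length + 1)


report = {}
for name, begin, end, aa in ORFS:
    report[name] = (span_nt(begin, end), expected_nt(aa))
    observed, expected = report[name]
    status = "consistent" if observed == expected else "inconsistent"
    print(f"{name}: span {observed} nt, expected {expected} nt ({status})")
```

With these assumptions, ORF 1 (408 nt) and ORF 3 (1635 nt) agree with their protein lengths. The ORF 2 coordinates span only 118 nt, which is not a multiple of three and far short of the 438 nt needed for 145 residues. This is consistent with the listed ORF 2 protein having been reconstructed across the frameshift at position 575 rather than read from a single open frame.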
References