ISRsp4
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AE000090 | ND | Rhizobium sp. | Rhizobium sp. NGR234 plasmid pNGR234a |
DNA section
IS Length : 1398 bp
Ends
IR Length : 13
IRL : TGTTGAGGTGGACGGCTCCCAACGGCATCGAATGTGCCAGAATGAGTTCG
IRR : ----TATGTGGACGGCTCCCGCTTGCAAGTGTTTTCTGCAGATATTTTTG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
TGTTGAGGTGGACGGCTCCCAACGGCATCGAATGTGCCAGAATGAGTTCGTTGAAGTTCTCAATCGAGGGAAACCGTCCGTGGAACAGATTATCCGAATT
GGTATGGATACGTCAAAGAGCGTCTTCCAATTGCATGGTGTGAATGCGGTTGAGCAGCCAATCCTTCGTAAGAAGCTGTCGCGCCGGGAAATGGTGAAGT
TCTTCGAGAAAACACCGCCGACCATTATCGCACTTGAAGCGTGCGGTGGGTCACATCACTGGGCACGGCTGTTAAGTTCGTTCGGCCATGAAGTAAAGTT
GATAGCACCGCAACTGGCCAAGCCGTATGTGAAACGGGGCAAGAACGATGCCGCGGATGCGGAAGCATTGTGTGAGGCAATGAGCCGCCCGACCATGCGC
TTCGTGCCAATGAAAACGGCTGATCAGCAGGCGGCGTTAATGCTGGTCGGTATGCGAGAGAGACTCATCCGCAATCGGACACAGCTCGCCAATGCCATTC
GCGGTTTTGCCATGGAATTTGGCATCGTTGCGGCCAAAGGCATGTGCCGCATCGAAGCGCTATTGGAGCGCATTGCCGCTGACCCGTCCCTACCGGAACT
GGCGCAGGATCTCTTCGCGCTGCATGGACAGGAATACCGCGAGCTAGCTTGCCAGATCAAAACGCTCGATGAAAAGTTGATGAAGTTGCATCGTGCTGAT
GAATGCAGCAAACGCCTGGCCGAGATCCCGGGTGTCGGCCCGATAGGCGCCTCCCTTCTCTTGATGAAGACACCGGATCCTCGAATGTTTAAATCGGGCC
GCGACTTTGCAGCATGGATCGGTCTGACGCCGAAGGACCATTCAACTGCCGGAAAGGTCAGGCTTGGTGTCATTACCCGGGCTGGCGACGAAATGTTGCG
GAGCATCTTGGTGGTTGGCGCGACATCTCTTCTACAGCAAGTCAGAACAGGCCGCAGCAGGCACGCCTCTGCATGGCTTATGGGGCTGCTGCAGCGGAAA
AGGCCAAAGTTGGTTGCCGTAGCGCTCGCCAACAAGCTGGCGCGGATTGCGTGGAAGCTTATGACGAGCGGCGAGTCATATCGCCAGGCAGAAGGCCAAG
CGCAGACATCATAAAAGTTCAGCCACCACGGTGAGCAAAGACAAAATGCTACCGTGTGAGCTGATGCTGTGAACTTGCAAGCGAAGAGCAGATGATGCGA
TCGGACGATCCAAGACGCGTGAAACTCCGTTAGGCCCACTGACCGCTAAAGGTCGAGACCATGTTAGGAATACGCGTTGCGGAAATCATCTTGGCCAGTG
GTCATGTGCGACCACACCAAAAGGCCGCACATATGGAAGCAAGCGATCCGGTCAAAAATATCTGCAGAAAACACTTGCAAGCGGGAGCCGTCCACATA
GGTATGGATACGTCAAAGAGCGTCTTCCAATTGCATGGTGTGAATGCGGTTGAGCAGCCAATCCTTCGTAAGAAGCTGTCGCGCCGGGAAATGGTGAAGT
TCTTCGAGAAAACACCGCCGACCATTATCGCACTTGAAGCGTGCGGTGGGTCACATCACTGGGCACGGCTGTTAAGTTCGTTCGGCCATGAAGTAAAGTT
GATAGCACCGCAACTGGCCAAGCCGTATGTGAAACGGGGCAAGAACGATGCCGCGGATGCGGAAGCATTGTGTGAGGCAATGAGCCGCCCGACCATGCGC
TTCGTGCCAATGAAAACGGCTGATCAGCAGGCGGCGTTAATGCTGGTCGGTATGCGAGAGAGACTCATCCGCAATCGGACACAGCTCGCCAATGCCATTC
GCGGTTTTGCCATGGAATTTGGCATCGTTGCGGCCAAAGGCATGTGCCGCATCGAAGCGCTATTGGAGCGCATTGCCGCTGACCCGTCCCTACCGGAACT
GGCGCAGGATCTCTTCGCGCTGCATGGACAGGAATACCGCGAGCTAGCTTGCCAGATCAAAACGCTCGATGAAAAGTTGATGAAGTTGCATCGTGCTGAT
GAATGCAGCAAACGCCTGGCCGAGATCCCGGGTGTCGGCCCGATAGGCGCCTCCCTTCTCTTGATGAAGACACCGGATCCTCGAATGTTTAAATCGGGCC
GCGACTTTGCAGCATGGATCGGTCTGACGCCGAAGGACCATTCAACTGCCGGAAAGGTCAGGCTTGGTGTCATTACCCGGGCTGGCGACGAAATGTTGCG
GAGCATCTTGGTGGTTGGCGCGACATCTCTTCTACAGCAAGTCAGAACAGGCCGCAGCAGGCACGCCTCTGCATGGCTTATGGGGCTGCTGCAGCGGAAA
AGGCCAAAGTTGGTTGCCGTAGCGCTCGCCAACAAGCTGGCGCGGATTGCGTGGAAGCTTATGACGAGCGGCGAGTCATATCGCCAGGCAGAAGGCCAAG
CGCAGACATCATAAAAGTTCAGCCACCACGGTGAGCAAAGACAAAATGCTACCGTGTGAGCTGATGCTGTGAACTTGCAAGCGAAGAGCAGATGATGCGA
TCGGACGATCCAAGACGCGTGAAACTCCGTTAGGCCCACTGACCGCTAAAGGTCGAGACCATGTTAGGAATACGCGTTGCGGAAATCATCTTGGCCAGTG
GTCATGTGCGACCACACCAAAAGGCCGCACATATGGAAGCAAGCGATCCGGTCAAAAATATCTGCAGAAAACACTTGCAAGCGGGAGCCGTCCACATA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1035 bp | 344 aa | 80 | 1114 | + | No |
Chemistry : DEDD
ORF sequence :
MEQIIRIGMDTSKSVFQLHGVNAVEQPILRKKLSRREMVKFFEKTPPTIIALEACGGSHHWARLLSSFGHEVKLIAPQLAKPYVKRGKNDAADAEALCEA
MSRPTMRFVPMKTADQQAALMLVGMRERLIRNRTQLANAIRGFAMEFGIVAAKGMCRIEALLERIAADPSLPELAQDLFALHGQEYRELACQIKTLDEKL
MKLHRADECSKRLAEIPGVGPIGASLLLMKTPDPRMFKSGRDFAAWIGLTPKDHSTAGKVRLGVITRAGDEMLRSILVVGATSLLQQVRTGRSRHASAWL
MGLLQRKRPKLVAVALANKLARIAWKLMTSGESYRQAEGQAQTS
MSRPTMRFVPMKTADQQAALMLVGMRERLIRNRTQLANAIRGFAMEFGIVAAKGMCRIEALLERIAADPSLPELAQDLFALHGQEYRELACQIKTLDEKL
MKLHRADECSKRLAEIPGVGPIGASLLLMKTPDPRMFKSGRDFAAWIGLTPKDHSTAGKVRLGVITRAGDEMLRSILVVGATSLLQQVRTGRSRHASAWL
MGLLQRKRPKLVAVALANKLARIAWKLMTSGESYRQAEGQAQTS
Blast result :
Comments
The IR of this IS are not at its termini. In the IS sequence as given 7 nt separate IRl from the left-hand end of the element and 3 nt separate IRr from the right-hand end. The first residue of the sequence may in fact belong as the final residue, giving 6 nt on the left and 4 on the right.
There are 2 copies in pNGR234a, with the transposase gene corresponding to orfs y4pF and y4sB. The transposase protein is 74, 74, 70 and 37% identical to those of ISMlo2, ISShsp1, ISMlo1 and IS1111, respectively.
By analogy with IS4321, ISRsp4 may exist in a circular form in which a -10 region created by the abutted terminal sequences and a -35 region located just inside the right-hand end of the element are correctly spaced to form a promoter.
There are 2 copies in pNGR234a, with the transposase gene corresponding to orfs y4pF and y4sB. The transposase protein is 74, 74, 70 and 37% identical to those of ISMlo2, ISShsp1, ISMlo1 and IS1111, respectively.
By analogy with IS4321, ISRsp4 may exist in a circular form in which a -10 region created by the abutted terminal sequences and a -35 region located just inside the right-hand end of the element are correctly spaced to form a promoter.
References
1] Freiberg et al (1997) Nature 387, 394-401
2] Partridge and Hall (2003 J. Bacteriol. 185, 6371-6384
2] Partridge and Hall (2003 J. Bacteriol. 185, 6371-6384