ISRsp1
- Family IS66
- Group
Isoform Synonym(s) ISRm8
Accession number | Transposition | Origin | Host |
---|---|---|---|
U00090 | ND | Rhizobium sp. | Rhizobium sp. NGR234 Rhizobium meliloti 41 plasmid pSym-a Rhizobium sp. NGR234 plasmid pNGR234a |
DNA section
IS Length : 3481 bp
Ends
IR Length : 17/22
IRL : GTATCCGTCCGAGCGCGACCGCCTTTGCATATTGAGTAAAACTGTTCAGA
IRR : GTATCCATCCGAGGTGGGCCGCGGGACTTGGGGGTCTCGCGCGTGGTAGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AGGCGCGCCA | GATAAGGG | CACTCGTCTC | 8 |
GCCTTCTGCA | TTCGGATG | CACGAGGGCG | 8 |
DNA sequence
GTATCCGTCCGAGCGCGACCGCCTTTGCATATTGAGTAAAACTGTTCAGACCCTGTCCTTATTCCCATGCATAAGGACAGTTGCCAACATGGAGATTTTT
GAAGGAGACAGCGGGTCTCGGGTGAGCCGTCTTGAGGTGATCAACACCGGACGGCGGCGTCGATTTACGGAAGATGAGAAGCTGCGGATTGTCGCAGAAA
GCTTTGCCGGGAGAGGCCGTGCGTCGGCCACGGCCCGTCAGTACGGCATCAGTCGCTCCCTGTTGAACCGTTGGCGCAAATCGGTTCGCCAGGGTTTGCA
CGGCCAGAAACAAACCGATGGTTTTGTGCCGGCGTTCGTCATGCCGGAAACTTTTGTGCCGGTGAAGCAGGTCACTCCACCTGCTGCGATGGAGCAGCCG
GTGGCGTCTCCTTCCGGCCGCATGGAGATTGTTGCGGCGAACGGCCGTCGTGTGGTCGTGGACGGCAGCGTCGACGTTGAGGCGCTGCTGCGGATCATGC
GGGGGCTGGAGACGTTGCGGTGATCATGCTTCCTTCCGGTCAAAATGTGCGGGTGTGGATTGCAACGGGCCATACGGACATGCGGTGTGGGTTTCCATCG
CTTGCGTTGCGGGTGCAGGAGGTGCTGAAACTGAACCCTTTGGACGGCAATCTTTTTGTGTTTCGCGGTCGCAGCGGATCGCTGCTAAAAGTGATCTGGA
GTGACGGCCAGGGGAGCTGCCTTTTTACAAAAAGATTGGACCGTGGCCGGTTCGTCTGGCCTTCTGCCGAAGGCGGAGCGATAGCGATATCACCCGCGCA
GCTCAGTTATCTTCTGTCCGGAATCGACTGGAGGCATCCTCAGGAAACCTGGCGGCCGACGAAGGTCGGCTAGCATTATTCTATTGAAAATACAGGGGAA
ATCTGATCGAATGGCTTCATGATCTCAAAGCCTGTCGATCTTCCTGTGGATGTTGTTGGCGCTTACCTGGCGCTGCGTGGCGAGCATGAAGCCTTGCAGG
CTAAACACGCTATCGCAGTAGCGGAAGCCGCCAATGCGCAGGCGATGCTCTCTGACAACGAGGCGCTGATCGTTGCTCTGGAATTGAAGATCGAGAAGCT
CAGGCGCGAGTTGCGGGGCCAGCGCTCTGAGCGCACGGCGCGCCTGCTCGACCAGTTGGAACTGCAGCTCGAGGAACTCGTGGCGGCGGCGACGGAGGAT
GAGGTCGCGGCACAAGCAGCAAGCGCCAGAACCTCGAGCGTACGTTCGTTCACGCGCAAACGGCCGGTGCGCAAACCATGGCCGGACGATATCGAGCGCG
AACGTGTTGTCATCGAGCCGCCGACGACTTGCACCTGCTGTGGTGGTTCGCGCCTGTCGAAGCTGGGCGAGGACGTCACCGAGACGCTGGAAGAGATCCC
ACGCCGGTTCAAAGTGATCGAGACGGTGCGGGAAAAGTTCACCTGCCGCGACTGTGAGGCGATCAGTCAGACGCCCGCACCCTTCCATGCCACGCCGCGC
GGCTTTATCGGGCCGAACCTGCTGGCGACGATCCTGTTCGACAAGTTCGGCATGCACAGCCCGCTCAACCGGCAGAGTGCCCGGTTCAAATGCGAGGGGA
TCGATCTTTCGACCTCGACGCTGGCCGACCAGGTCGGGTACGCAACAGCCGCTCTCATGCCTGTCTTCGATCTGATCGAGGCGCATGTCTTCGCGGCCGA
GCGTCTTCACGGTGATGACACCACCATTCCCATTCAGGCCAGGGACAAATGCACGACCGGACGCATATGGACTTACGTATGCGATGACCGGCCATTCGGG
GGAACGGCGCCGCCAGCCGCAATCTATTATGCGTCGAGTGACCGGCGCGGCGAACATCCGCAGAAACACCTGGCCGGATACGGCGGCATTCTGCAGAGCG
ATTGCTACAATGGCTTCGAGCCGATCGCTGTTGCCGCAACGAAAGCGGTCCCGATCACATTCGCCTTTTGTCACGCGCATGCGCGGCGGAAATTCTTTGA
GCTGGCCGATATCCAGAAGAATGCGCGGGATCGCAAACGGAGGGGCAAGCCGATCTCGCCGATCGCATTGGAAGCCGTCAAGCGCTACGACGAATTGTTC
GAGATCGAGCGCCAGATCAACGGATTGAGCGCCGAAGAACGACTGGCTGTGCGGCAGGAGAAGAGCAAGCCACTGTTCGATGACATGCACGAGTGGTTGA
CGAAGGAACGCGCCATGCTCAGCAGATCGTCCGAGGTCATCGAGCCGATCGATTACATGCTCAAGCGCTGGGAGGGCTTTGCTCTCTTCCTCAAAGACGG
GAGAGTTTGTTTAACGAACAACGCAGCCGAGCGGGCGCTGAGAAGTGTCGCATTGGGAAGACGGAACTGGACCTTCGCCGGTTCTCAGCGCGGGGCCGAT
CGTGCCGCTGTCATGCTGACCGTCATCACCACCTGCCGCCTCAACGATATCGACCCGAAGGCATGGCTTGCAGACGTGCTGGCTCGCATCGCCGACCATC
CTGTCACGCGCCTGTACGAACTGCTGCCCTGGGAGTGGAAACGTGCATCGGCGGCAACCGTCATGCTGGCGGCCTGACGATGGTGGCTTCCGTCGTCCGC
CTGAAGGTCACCCTCGATCATGTCGAGCCGATAGTGATGCGGCGCGTTGTCGTGCCGTTCACCATCAGGCTCAGTCGGCTGCACGAGGTGTTGCAGGCGG
CGATGGGCTGGACCAACAGCCACCTTTACGAATTCCGCATGCGTGACGTCGGCTTTGGTTTGCCTGACGAGGAATGGGGGGATGGTCCGATCGATGCCCG
CAGGGTCTCGCTGCTGTCGGCAGTCCAGGACACCGGTGCGAAATCGTTCAAATACCTTTACGACTTTGGCGACGGCTGGGAACACAGCATCAAGATCGAG
CGCACCTTTCCTGCGGTCGGCACGGAAGGACCGATGCTTCTCGAAGCAACGGGGCACTGCCCGCCCGAAGACGTTGGTGGTCCATGGGGTTATCAGGAGT
TCTGCGAGGCGCTTGCAGACCCGGCGCACGAGCGACATGCCGAAACCCTCGAATGGTGCGGCAGCAGCGATTATGATTCCGCCGCCGCCAACTTCTCGCA
GCTCAACAAAGCCGTCGATGACCTCGCTGCAAAATGGGCCCGAAAAGCGCGCCGCAAAACCTGAACGCCGGGCTGTGGCCGAAGGCCGAAAGTTGATTTT
ACCGTGTAAACAACCGGCGGAACAGTTCTTCCGAGCGCCGTAGACCTTCGTCCGTCAGCACCAGCGACTTCGACTTGTTCACCGGATCACAGATCAGTCC
CCTCTCATGCAACCGGTCCGTCGTCGCCCAGTCAAATCCCTTCCAGGCGCAACGCTCGTTGTGGAGCGTCAGCCATAACAGTGCCAGCACCGCGTCATCG
ATTTTGTCCTCGTCGATCTCCATTCTGCTATCCTACCACGCGCGAGACCCCCAAGTCCCGCGGCCCACCTCGGATGGATAC
GAAGGAGACAGCGGGTCTCGGGTGAGCCGTCTTGAGGTGATCAACACCGGACGGCGGCGTCGATTTACGGAAGATGAGAAGCTGCGGATTGTCGCAGAAA
GCTTTGCCGGGAGAGGCCGTGCGTCGGCCACGGCCCGTCAGTACGGCATCAGTCGCTCCCTGTTGAACCGTTGGCGCAAATCGGTTCGCCAGGGTTTGCA
CGGCCAGAAACAAACCGATGGTTTTGTGCCGGCGTTCGTCATGCCGGAAACTTTTGTGCCGGTGAAGCAGGTCACTCCACCTGCTGCGATGGAGCAGCCG
GTGGCGTCTCCTTCCGGCCGCATGGAGATTGTTGCGGCGAACGGCCGTCGTGTGGTCGTGGACGGCAGCGTCGACGTTGAGGCGCTGCTGCGGATCATGC
GGGGGCTGGAGACGTTGCGGTGATCATGCTTCCTTCCGGTCAAAATGTGCGGGTGTGGATTGCAACGGGCCATACGGACATGCGGTGTGGGTTTCCATCG
CTTGCGTTGCGGGTGCAGGAGGTGCTGAAACTGAACCCTTTGGACGGCAATCTTTTTGTGTTTCGCGGTCGCAGCGGATCGCTGCTAAAAGTGATCTGGA
GTGACGGCCAGGGGAGCTGCCTTTTTACAAAAAGATTGGACCGTGGCCGGTTCGTCTGGCCTTCTGCCGAAGGCGGAGCGATAGCGATATCACCCGCGCA
GCTCAGTTATCTTCTGTCCGGAATCGACTGGAGGCATCCTCAGGAAACCTGGCGGCCGACGAAGGTCGGCTAGCATTATTCTATTGAAAATACAGGGGAA
ATCTGATCGAATGGCTTCATGATCTCAAAGCCTGTCGATCTTCCTGTGGATGTTGTTGGCGCTTACCTGGCGCTGCGTGGCGAGCATGAAGCCTTGCAGG
CTAAACACGCTATCGCAGTAGCGGAAGCCGCCAATGCGCAGGCGATGCTCTCTGACAACGAGGCGCTGATCGTTGCTCTGGAATTGAAGATCGAGAAGCT
CAGGCGCGAGTTGCGGGGCCAGCGCTCTGAGCGCACGGCGCGCCTGCTCGACCAGTTGGAACTGCAGCTCGAGGAACTCGTGGCGGCGGCGACGGAGGAT
GAGGTCGCGGCACAAGCAGCAAGCGCCAGAACCTCGAGCGTACGTTCGTTCACGCGCAAACGGCCGGTGCGCAAACCATGGCCGGACGATATCGAGCGCG
AACGTGTTGTCATCGAGCCGCCGACGACTTGCACCTGCTGTGGTGGTTCGCGCCTGTCGAAGCTGGGCGAGGACGTCACCGAGACGCTGGAAGAGATCCC
ACGCCGGTTCAAAGTGATCGAGACGGTGCGGGAAAAGTTCACCTGCCGCGACTGTGAGGCGATCAGTCAGACGCCCGCACCCTTCCATGCCACGCCGCGC
GGCTTTATCGGGCCGAACCTGCTGGCGACGATCCTGTTCGACAAGTTCGGCATGCACAGCCCGCTCAACCGGCAGAGTGCCCGGTTCAAATGCGAGGGGA
TCGATCTTTCGACCTCGACGCTGGCCGACCAGGTCGGGTACGCAACAGCCGCTCTCATGCCTGTCTTCGATCTGATCGAGGCGCATGTCTTCGCGGCCGA
GCGTCTTCACGGTGATGACACCACCATTCCCATTCAGGCCAGGGACAAATGCACGACCGGACGCATATGGACTTACGTATGCGATGACCGGCCATTCGGG
GGAACGGCGCCGCCAGCCGCAATCTATTATGCGTCGAGTGACCGGCGCGGCGAACATCCGCAGAAACACCTGGCCGGATACGGCGGCATTCTGCAGAGCG
ATTGCTACAATGGCTTCGAGCCGATCGCTGTTGCCGCAACGAAAGCGGTCCCGATCACATTCGCCTTTTGTCACGCGCATGCGCGGCGGAAATTCTTTGA
GCTGGCCGATATCCAGAAGAATGCGCGGGATCGCAAACGGAGGGGCAAGCCGATCTCGCCGATCGCATTGGAAGCCGTCAAGCGCTACGACGAATTGTTC
GAGATCGAGCGCCAGATCAACGGATTGAGCGCCGAAGAACGACTGGCTGTGCGGCAGGAGAAGAGCAAGCCACTGTTCGATGACATGCACGAGTGGTTGA
CGAAGGAACGCGCCATGCTCAGCAGATCGTCCGAGGTCATCGAGCCGATCGATTACATGCTCAAGCGCTGGGAGGGCTTTGCTCTCTTCCTCAAAGACGG
GAGAGTTTGTTTAACGAACAACGCAGCCGAGCGGGCGCTGAGAAGTGTCGCATTGGGAAGACGGAACTGGACCTTCGCCGGTTCTCAGCGCGGGGCCGAT
CGTGCCGCTGTCATGCTGACCGTCATCACCACCTGCCGCCTCAACGATATCGACCCGAAGGCATGGCTTGCAGACGTGCTGGCTCGCATCGCCGACCATC
CTGTCACGCGCCTGTACGAACTGCTGCCCTGGGAGTGGAAACGTGCATCGGCGGCAACCGTCATGCTGGCGGCCTGACGATGGTGGCTTCCGTCGTCCGC
CTGAAGGTCACCCTCGATCATGTCGAGCCGATAGTGATGCGGCGCGTTGTCGTGCCGTTCACCATCAGGCTCAGTCGGCTGCACGAGGTGTTGCAGGCGG
CGATGGGCTGGACCAACAGCCACCTTTACGAATTCCGCATGCGTGACGTCGGCTTTGGTTTGCCTGACGAGGAATGGGGGGATGGTCCGATCGATGCCCG
CAGGGTCTCGCTGCTGTCGGCAGTCCAGGACACCGGTGCGAAATCGTTCAAATACCTTTACGACTTTGGCGACGGCTGGGAACACAGCATCAAGATCGAG
CGCACCTTTCCTGCGGTCGGCACGGAAGGACCGATGCTTCTCGAAGCAACGGGGCACTGCCCGCCCGAAGACGTTGGTGGTCCATGGGGTTATCAGGAGT
TCTGCGAGGCGCTTGCAGACCCGGCGCACGAGCGACATGCCGAAACCCTCGAATGGTGCGGCAGCAGCGATTATGATTCCGCCGCCGCCAACTTCTCGCA
GCTCAACAAAGCCGTCGATGACCTCGCTGCAAAATGGGCCCGAAAAGCGCGCCGCAAAACCTGAACGCCGGGCTGTGGCCGAAGGCCGAAAGTTGATTTT
ACCGTGTAAACAACCGGCGGAACAGTTCTTCCGAGCGCCGTAGACCTTCGTCCGTCAGCACCAGCGACTTCGACTTGTTCACCGGATCACAGATCAGTCC
CCTCTCATGCAACCGGTCCGTCGTCGCCCAGTCAAATCCCTTCCAGGCGCAACGCTCGTTGTGGAGCGTCAGCCATAACAGTGCCAGCACCGCGTCATCG
ATTTTGTCCTCGTCGATCTCCATTCTGCTATCCTACCACGCGCGAGACCCCCAAGTCCCGCGGCCCACCTCGGATGGATAC
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
435 bp | 144 aa | 89 | 523 | + | No |
AG : IS66 TnpA
ORF sequence :
MEIFEGDSGSRVSRLEVINTGRRRRFTEDEKLRIVAESFAGRGRASATARQYGISRSLLNRWRKSVRQGLHGQKQTDGFVPAFVMPETFVPVKQVTPPAA
MEQPVASPSGRMEIVAANGRRVVVDGSVDVEALLRIMRGLETLR
MEQPVASPSGRMEIVAANGRRVVVDGSVDVEALLRIMRGLETLR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
348 bp | 115 aa | 526 | 873 | + | No |
AG : IS66 TnpB
ORF sequence :
MLPSGQNVRVWIATGHTDMRCGFPSLALRVQEVLKLNPLDGNLFVFRGRSGSLLKVIWSDGQGSCLFTKRLDRGRFVWPSAEGGAIAISPAQLSYLLSGI
DWRHPQETWRPTKVG
DWRHPQETWRPTKVG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1659 bp | 552 aa | 919 | 2577 | + | No |
Chemistry : DDE
ORF sequence :
MISKPVDLPVDVVGAYLALRGEHEALQAKHAIAVAEAANAQAMLSDNEALIVALELKIEKLRRELRGQRSERTARLLDQLELQLEELVAAATEDEVAAQA
ASARTSSVRSFTRKRPVRKPWPDDIERERVVIEPPTTCTCCGGSRLSKLGEDVTETLEEIPRRFKVIETVREKFTCRDCEAISQTPAPFHATPRGFIGPN
LLATILFDKFGMHSPLNRQSARFKCEGIDLSTSTLADQVGYATAALMPVFDLIEAHVFAAERLHGDDTTIPIQARDKCTTGRIWTYVCDDRPFGGTAPPA
AIYYASSDRRGEHPQKHLAGYGGILQSDCYNGFEPIAVAATKAVPITFAFCHAHARRKFFELADIQKNARDRKRRGKPISPIALEAVKRYDELFEIERQI
NGLSAEERLAVRQEKSKPLFDDMHEWLTKERAMLSRSSEVIEPIDYMLKRWEGFALFLKDGRVCLTNNAAERALRSVALGRRNWTFAGSQRGADRAAVML
TVITTCRLNDIDPKAWLADVLARIADHPVTRLYELLPWEWKRASAATVMLAA
ASARTSSVRSFTRKRPVRKPWPDDIERERVVIEPPTTCTCCGGSRLSKLGEDVTETLEEIPRRFKVIETVREKFTCRDCEAISQTPAPFHATPRGFIGPN
LLATILFDKFGMHSPLNRQSARFKCEGIDLSTSTLADQVGYATAALMPVFDLIEAHVFAAERLHGDDTTIPIQARDKCTTGRIWTYVCDDRPFGGTAPPA
AIYYASSDRRGEHPQKHLAGYGGILQSDCYNGFEPIAVAATKAVPITFAFCHAHARRKFFELADIQKNARDRKRRGKPISPIALEAVKRYDELFEIERQI
NGLSAEERLAVRQEKSKPLFDDMHEWLTKERAMLSRSSEVIEPIDYMLKRWEGFALFLKDGRVCLTNNAAERALRSVALGRRNWTFAGSQRGADRAAVML
TVITTCRLNDIDPKAWLADVLARIADHPVTRLYELLPWEWKRASAATVMLAA
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
585 bp | 204 aa | 2580 | 3164 | + | No |
Annotation : pRiA4b ORF-3 family protein Description :
ORF sequence :
MVASVVRLKVTLDHVEPIVMRRVVVPFTIRLSRLHEVLQAAMGWTNSHLYEFRMRDVGFGLPDEEWGDGPIDARRVSLLSAVQDTGAKSFKYLYDFGDGW
EHSIKIERTFPAVGTEGPMLLEATGHCPPEDVGGPWGYQEFCEALADPAHERHAETLEWCGSSDYDSAAANFSQLNKAVDDLAAKWARKARRKT
EHSIKIERTFPAVGTEGPMLLEATGHCPPEDVGGPWGYQEFCEALADPAHERHAETLEWCGSSDYDSAAANFSQLNKAVDDLAAKWARKARRKT
Blast result :
Comments
ISRsp1 is 51%(orfA) aa similar to ISAli10, 80%(orfB) and 77%(orfC, Transposase) to IS66, and 62%(orfD) to ISBrsp6.
ISRm8, partial IS, is 81 % identical to ISRsp1 (1002-1785), including a gap of 42 bp around position 380. The same region is also similar to IS66 (66 %, 855-1619). ISRm8 is located downstream of the nodQ1 (APS kinase) gene.
ISRm8, partial IS, is 81 % identical to ISRsp1 (1002-1785), including a gap of 42 bp around position 380. The same region is also similar to IS66 (66 %, 855-1619). ISRm8 is located downstream of the nodQ1 (APS kinase) gene.
References
1] Freiberg,C., Fellay,R., Bairoch,A., Broughton,W.J., Rosenthal,A. and Perret,X. (1997) Nature 387 (6631), 394-401.
2] Freiberg,C. (1996) Direct submission GenBank.
3] Schwedock, J., and Long, S.R. (1994) Mol. Plant-Microbe Interact. 7, 151-153.
2] Freiberg,C. (1996) Direct submission GenBank.
3] Schwedock, J., and Long, S.R. (1994) Mol. Plant-Microbe Interact. 7, 151-153.