ISRsp2
- Family IS21
- Group
- Isoform
- Synonym(s) NGRIS-3
Accession number | Transposition | Origin | Host |
---|---|---|---|
AE000081 | ND | Rhizobium sp. | Rhizobium sp. NGR234 plasmid pNGR234a |
DNA section
IS Length : 2623 bp
Ends
IR Length : 12/16
IRL : TGCGGGTTTCGGCAAAGTCCGGACAGGATTTCCGGTAATCTCCGGACACA
IRR : TGCGCGCTTCGCAAAAACTGGACATTGATTCCGCTAAATCCCGGACAGCA
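One way to read the Ends entry is sketched below in Python. It rests on two assumptions: that IRR is listed as the reverse complement of the right terminus, so that both repeats read inward from their ends, and that "12/16" means 12 identical positions among the first 16. The helper names `reverse_complement` and `count_matches` are illustrative only and do not come from any database tooling.

```python
# Terminal inverted repeats as listed in this record.
IRL = "TGCGGGTTTCGGCAAAGTCCGGACAGGATTTCCGGTAATCTCCGGACACA"
IRR = "TGCGCGCTTCGCAAAAACTGGACATTGATTCCGCTAAATCCCGGACAGCA"

# Last 23 bases of the DNA sequence in this record (the right terminus).
RIGHT_END = "GTCCAGTTTTTGCGAAGCGCGCA"

# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def count_matches(a: str, b: str, n: int) -> int:
    """Count identical positions among the first n bases of a and b."""
    return sum(x == y for x, y in zip(a[:n], b[:n]))


# Assumption check: IRR reads inward from the right end of the element.
print(reverse_complement(RIGHT_END) == IRR[: len(RIGHT_END)])  # True

# Assumed meaning of "12/16": matches over the first 16 positions.
print(count_matches(IRL, IRR, 16))  # 12
```

Under these assumptions the listed IRR agrees with the right terminus of the DNA sequence, and the match count over 16 positions agrees with the IR Length field.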
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
DNA sequence
TGCGGGTTTCGGCAAAGTCCGGACAGGATTTCCGGTAATCTCCGGACACAGTTTTCAGTAATTCCCGGACAGCGATTCCGCTAAATCCCGGACAGTTTCC
GGCGGCGGTTTGACGGGTTGGTTCGCGGCTGCGCCGTCTTGTTGGTTAGGCCTCTCACGACAACGTTGAGAGGGGCCGATGCCGAGACGGAAGCAAGCAA
GACGAACGACAGTGAGGGATATCCGGACTATTCTGCGCCTGACCCATGAAGAGGGTCTTTCGGTGCGCGAGATTGCCGAGCGGCTGAAGATCGGCAAGAG
CTCGGTATCGACCTATTTGCTGCGATCTCGGGAAGCCGGGCTTTCGTGGCCGTTGCCAATCGGCGCGGACGAGGATGCAAAGCTGGAGCGGCGGCTGTTC
GGCCGAGCCGGTCGACCGCCGCGCGATCTCAGCGAGCCGGACTGGGCGCTGGTGGTTCGGGAGCTGAAGCGCAAAGGCGTGACGCTGACGCTTCTATGGC
AGGAATACCGCGCCAGCCATCCTGATGGTTACGGCTTCACCTGGTTCTGCGAGCAGGTTGCCGCCTTTCGGCAGCGCACCAGTGTAGCGTTCCGCAATCG
GCACGCGGCGGGCGCCGTGATGCAGACCGACTATGCCGGGCCGACGGTGCCGGTGATCGATCCGGCAACCGGTGTCATCCATCCGGCCCAAATCTTTGTC
GCGGTGCTGGGCGCCTCCAACCTGACCTTCGCCCATGCCAGCTTTAGCCAGCAGTTGCCGGATTGGATCGACGGTCAGGTGCGTGCTCTGACCTTCTATG
GTGGGGTCACCAAGGCAATCGTGTGCGACAACCTCAAATCGGGGGTGGCCAAGGCCCTTTGGTTCGAGCCGACATTGACTGCGACGTTCGCCGCCATGGC
GGAGCATTACGACACCACGATCCTGCCGACCCGCAGCAGGAAACCGCGCGACAAAGGCCGGGTCGAAGGCGCGGTATTGATCGTGGAACGCTGGATTCTG
GCCCGGCTCAGGAACCGTACCTTCTTCTCGCTTGCCGCCCTCAACACGGCGATTGCCGAATTGCTCGAGGACCTGAACAACCGGACGATGCGCCATGTCG
GCAAAAGCCGCCGCGAGCTGTTCGAGGAGATCGAGCGGCCAGCCTTGAAGCCCTTGCCGGCGATACCGTTCGAATATGCGGAATGGAAGTCGGCGAAGGT
CCATCCGGACTATCATGTCGAGGTCGACAAGACCTTCTACTCGGTGCCGCATCGGCTGATCGGATGCACCCTCCAGGTGCGGCTCACCCACCGGGTGGTC
GAGATCTTCCACGACCACCAGCGTGTCGCCAGCCATGTTCGCCGCTCCCAGCGTTCCGGCCACGTCACCGTCAACGACCATATGCCCAAGGCGCATCAGC
GCTATGCCAACACCACGCCGGCCAATCTGATCGGCCGTGCGACCCAGATCGGCCCCAATGCCGCCATCCTGGTCGAACGCATGATGCGCGACAGGCCGCA
TCCGGAACAGGGATACCGCTCGGCCATGGGCATTCTGTCGCTGGCGCCGCGCTATGGATCCCAGCGCCTCGAGGCGGCCTGTGAGCGGGCGCTCACCATT
AATGCCATCACCTATTCCTCCGTCGCCTCCATCCTCAAATCCGGCCTCGACCGGGAAAGACCGCAGGCTGAACACGCGGCCCCCACGCCTGCGCATACCA
ATATCCGTGGCCGATCCTACTACCAGTGAAGGAAGCAAGAATGCTGACGAACCCCACCCTCGACCAGATGCAGGCCCTCGGGCTGACGGGCATGGCCGCC
GCCTGGCGCGAATTGACCGAGCAGTCCGGCACCAATGAGCTCAGCCGCGATGAGTGGCTCGGACTGATGCTCGACCGCGAAGTCACCCTGCGCGCCGACA
AGCGCATCCGTAACCGGCTCGCCTCCGCCAAGCTACGCTTTGCCCAGGCCTGTATCGAAGATATCGACTTCGCCGCCGCCCGCGGTCTCGACCGGCGCAA
CACCATGGCGCTCGCCCAGGGGCAATGGCTCACCGCCCACGAGGGCCTGATCATCACCGGCCACACCGGCACCGGCAAGTCATGGCTGGCCTGTGCCTTC
GGCAGGCAAGCGGCCAGGCTCGGTCACTCCGTGCTCTATGTGCGCGTGCCCCGAATGTTCGAGGAGCTCGCGCTTGCCCGCCTCGACGGCTCCTTCCCCC
GCCTCATCGACAGGCTCACCCGCGTCCAGCTCCTCATCCTCGATGACTTTGGAACCCATACGCTCTCCGATCAGCAGCGCTTTCACCTCTTCGAAATCGT
CGAGGAGCGTTATCAGCGAAAGTCCACCCTGATCACAGCTCAGGTTCCCGTGGCAAGCTGGCACGACCTTATTGCCGACAGCACGGTCGCCGACGCCATA
CTCGACCGTATCGTCCACAATGCTCACCGCATCACCCTCCGGGGCGAGAGCATGCGAAAGCAAAAAAGCGCACCCCTCTTGACTGGGGCAGAAAACGGCG
AAATCAATCAGCCCTGATGCTAACCAGGCTGCCGAGGACCAACCGCATCAAACTGTCCGGACTTTAATGAAATTGCTGTCCGGGATTTAGCGGAATCAAT
GTCCAGTTTTTGCGAAGCGCGCA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1551 bp | 516 aa | 179 | 1729 | + | No |
Chemistry : DDE
ORF sequence :
MPRRKQARRTTVRDIRTILRLTHEEGLSVREIAERLKIGKSSVSTYLLRSREAGLSWPLPIGADEDAKLERRLFGRAGRPPRDLSEPDWALVVRELKRKG
VTLTLLWQEYRASHPDGYGFTWFCEQVAAFRQRTSVAFRNRHAAGAVMQTDYAGPTVPVIDPATGVIHPAQIFVAVLGASNLTFAHASFSQQLPDWIDGQ
VRALTFYGGVTKAIVCDNLKSGVAKALWFEPTLTATFAAMAEHYDTTILPTRSRKPRDKGRVEGAVLIVERWILARLRNRTFFSLAALNTAIAELLEDLN
NRTMRHVGKSRRELFEEIERPALKPLPAIPFEYAEWKSAKVHPDYHVEVDKTFYSVPHRLIGCTLQVRLTHRVVEIFHDHQRVASHVRRSQRSGHVTVND
HMPKAHQRYANTTPANLIGRATQIGPNAAILVERMMRDRPHPEQGYRSAMGILSLAPRYGSQRLEAACERALTINAITYSSVASILKSGLDRERPQAEHA
APTPAHTNIRGRSYYQ
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
792 bp | 263 aa | 1726 | 2517 | + | No |
AG : IS21 helper
ORF sequence :
VKEARMLTNPTLDQMQALGLTGMAAAWRELTEQSGTNELSRDEWLGLMLDREVTLRADKRIRNRLASAKLRFAQACIEDIDFAAARGLDRRNTMALAQGQ
WLTAHEGLIITGHTGTGKSWLACAFGRQAARLGHSVLYVRVPRMFEELALARLDGSFPRLIDRLTRVQLLILDDFGTHTLSDQQRFHLFEIVEERYQRKS
TLITAQVPVASWHDLIADSTVADAILDRIVHNAHRITLRGESMRKQKSAPLLTGAENGEINQP
Blast result :
Comments
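ISRsp2 is located on plasmid pNGR234a of Rhizobium sp. NGR234 (section 18 of 46, bases 208688 to 222367). The IstA (ORF 1) and IstB (ORF 2) ORFs of ISRsp2 overlap in a start-stop configuration, "GTGA": the GTG start codon of IstB shares its last two bases with the TGA stop codon of IstA.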
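The ORF coordinates can be checked against this comment with a short Python sketch. It assumes 1-based inclusive coordinates and that each ORF span includes its stop codon. The names `orf_length_bp`, `residues` and `slice_1based` are illustrative, and `WINDOW` is simply bases 1701 to 1740 copied from the DNA sequence in this record.

```python
# ORF coordinates (begin, end) as given in the Protein section.
ORF1 = (179, 1729)   # IstA
ORF2 = (1726, 2517)  # IstB

# Bases 1701-1740 of the DNA sequence in this record.
WINDOW_START = 1701
WINDOW = "ATATCCGTGGCCGATCCTACTACCAGTGAAGGAAGCAAGA"


def orf_length_bp(begin: int, end: int) -> int:
    """Length in bp of a 1-based inclusive interval."""
    return end - begin + 1


def residues(begin: int, end: int) -> int:
    """Number of amino acids, assuming the span includes the stop codon."""
    return orf_length_bp(begin, end) // 3 - 1


def slice_1based(begin: int, end: int) -> str:
    """Extract a 1-based inclusive interval that lies inside WINDOW."""
    return WINDOW[begin - WINDOW_START : end - WINDOW_START + 1]


# The overlap runs from the start of ORF 2 to the end of ORF 1.
overlap = slice_1based(ORF2[0], ORF1[1])

print(orf_length_bp(*ORF1), residues(*ORF1))  # 1551 516
print(orf_length_bp(*ORF2), residues(*ORF2))  # 792 263
print(overlap)                                # GTGA
```

The four-base overlap contains both the GTG start (bases 1726 to 1728) and the TGA stop (bases 1727 to 1729), which is the start-stop arrangement described in the comment.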
References
[1] Freiberg, C., Fellay, R., Bairoch, A., Broughton, W.J., and Rosenthal, A. (1997) Nature 387, 394-401.