ISRm14
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AF134706 | Y | Sinorhizobium meliloti | Sinorhizobium meliloti |
DNA section
IS Length : 2695 bp
Ends
IR Length : 18/22
IRL : GTAAGCGCCGTCTCCGCCCCATTGGATAGGCTGTATTTTGAGGCTGGCTG
IRR : GTATGCGGCGTCCACGCCCCATCGATTATTCAGTTGCGGCGATCTGAGGT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
GTAAGCGCCGTCTCCGCCCCATTGGATAGGCTGTATTTTGAGGCTGGCTGGGCGTGCGAGAAGGTTTCCAGGACTATCTGGAGATTTGCATGGCAGATGA
TGGATTTGTTGGGCGCTACGAAGTTGTCGAGCCGCGCCGCGGAAACCGGCGTTGGCCCGATGATGTGAAGGCGCGGATTGTCGCGGAGAGCCTTGAGCCG
GGTGTTCGAGTTGTTGATGTCGCGCGCCGTCATGACGTTGTTCCGCACCAGCTTTCCTTCTGGCGTCGACAAGCGCGCGAGGGCATTCTGGCGCTGCCGT
TTGAGGCTATGCCGGGCCTGTCGGAGAGCGGCGATGCCGAGCCTGCATTTGTGCCTTTGGCGATTGCGGCAGAGCCGAGCGAGGCTGTGAATGTTTTGGC
GCCGCCGCTGTCGGAGGCGGTTTCGTCGGTCTTGACGTTGGAGATCGGCCCGGACGTTGTGTTGCGGGTTCCCGGCGATGTGCCGGTTGAACGTGTGGCG
GCTCTGGTGCGAGCCATGCGAGCGCCGGTATGATCGTCGCGGGCCAACGACTGCCGATCCTGATTGCAACGCGGCCGGTGGACTTCCGCTGTGGGCATCA
GGCGCTGGCTCTGATGGTGCAGACCGAGTTGAAGCTCGACCCGCATTCCGGGGTGACGGTGATCTTCCGGTCGAAGCGCGGTGATCGCCTGAAAATCCTG
GTGTGGGATGGCACCGGAATGGTGCTAACCTACAAAATTCTTGAACATGGAAGCTTTGCCTGGCCCAAGGTGCAGGATGGGACGATGCGTCTTTCCAGGG
GTCAATATGAGGCTTTGTTCGAAGGTCTTGACTGGCGACGGGTGATGGCGCAACGGGTGACGGCGCCGTCGGCGGCAGGGTGAGTATCCGGCCGTCTTTG
CATTGTTTTATTTGGCTTTTTTGTGCTGCCTTGCTATAAGGCCGCATGTCGTCTCCCCTTGATCTCAGCCTGTTTCCGAACCTTCCGCCAGAGGTGGTGA
AGGCGTTTGCGGCGATGCAGTTCGAGCTGTCGGTCGAGCGTGCTGCGCGTCAGCATGAGCAGGCTGTGGTGGCCGAAAAGGACGCGTTCATCGCCGAGTT
GAAGGAACTGATCGAGAAGCTTGAGGGGCAGGTTCACGACTATCGGCGCACGAAGTTCGGGCCGAAATCGGAAAAGCTCGATCCGGCGCAGATGGAACTG
GCGCTGGAAGACCTTGAAACGGCAATTGCCGAAACACAGGCGCGGATTGCCGCCGTCGAGAAAAAGATCGAAGCCAGCGCATCTGATCCGGAAAAAGTCG
CTCCTCGCAAGGAGCGTAAGGCCCGTGCACTGCCCGAACATTTGCCGCGGGTCGAGCGGGTGATCGAGCCTGAGAGCATCGTTTGTCCCTGCGGTTGCGG
CAACATGGTCCGGATCGGCGAAGACCGGACGGAACGGCTCGACCGGATTCCGGCGCGCTACGAGGTGATCGTCACGATCCGCCCGAAATACGCCTGCCCC
AAGGGTCGAACGGGCGTCGTCCAGGCCAGAGCGCCGGCGCATCTCTTGGAAGGGAGCTGGCCGACCGAAGCCCTTCTGGCTGAGATTGCCGTCTCCAAGC
ATTCCGAACATATGCCGCTCAACCGGCAGGCCGAGGTCATGGCGCGACACGGGGTGCCGATAGACCGCACCGTCCTGGCCGATTGGATGGGCAGGACGGG
TGCTGCGATCGCGCCGGTGGTCGACCATATGGCCAAACGGCTGCTGTGGGAAAGCACGCGGCTTTATGTCGACGAGACAACGGCTCCGGTGCTTGATCCG
GGGCGAGGCAAAACGAAGACCGGTTATCTATGGGCCGTGTTGCGTGACGATCGCGGCTGGAATGGCTCTGCGCCGCCAGGCGTGGTGTTCCATTACCGGC
CCGGGCGTAAAGGCGAATATGCCGCTGAAATCCTCGACGGGTTCAACGGGACAATCCAGGTGGATGCCTACGGTGGTTACTCTCACCTCGCCACGTTGGA
CCGGGTGGGTGGCGATCCCTTGAAGCTGGCTTTCTGTTGGGCGCACGGGCGCAGAAAGCTGATCAAAGCCACGCCAAAGAGTGGATCGCCCATCGTCGAC
GAGGCGCTGGTGCGGATCGCCGCGCTCTACAAGATCGAAGACAGTATCCGTGGCTCAGATCCCGAACATCGCCGGGCAGTTCGACAGGACCTCTCCCTCC
CGCTGGTGGACGCGTTCTTCGCCTGGCTGGCAGCGCAAGCCAAGCGCGTCTCACGCAAGTCTGACCTCGGAAAAGCCCTGGCCTATATGCTAACGCGGCA
GGACGGGTTCCGGCTGTTCCTGGACGACGGCCACGTCGATATCGACTCCAACCTGGTGGAAAACGCGATCCGCCGACCGGCCATGAACCGCCGCAATGCG
CTCTTTGCGGGGCACGATGAAGGGGGCCGCAATTGGGCCCGGTTTGCCAGCCTGATCGGCACTTGTAAAATGAACGGCGTTGAGCCCTACGCCTATCTGT
GCAACCTCTTCACCCGCCTCGCAAACGGCCACCTCGCCAAAGACATCGATGCCCTGATGCCATGGGCCTATGCCGCTCGCATCCAGGCCTCACAATGAGC
TCGTCAGATACTCTTCGGTGAGCTCATTTGACGGCCCGTCGATCAACCTCAGATCGCCGCAACTGAATAATCGATGGGGCGTGGACGCCGCATAC
TGGATTTGTTGGGCGCTACGAAGTTGTCGAGCCGCGCCGCGGAAACCGGCGTTGGCCCGATGATGTGAAGGCGCGGATTGTCGCGGAGAGCCTTGAGCCG
GGTGTTCGAGTTGTTGATGTCGCGCGCCGTCATGACGTTGTTCCGCACCAGCTTTCCTTCTGGCGTCGACAAGCGCGCGAGGGCATTCTGGCGCTGCCGT
TTGAGGCTATGCCGGGCCTGTCGGAGAGCGGCGATGCCGAGCCTGCATTTGTGCCTTTGGCGATTGCGGCAGAGCCGAGCGAGGCTGTGAATGTTTTGGC
GCCGCCGCTGTCGGAGGCGGTTTCGTCGGTCTTGACGTTGGAGATCGGCCCGGACGTTGTGTTGCGGGTTCCCGGCGATGTGCCGGTTGAACGTGTGGCG
GCTCTGGTGCGAGCCATGCGAGCGCCGGTATGATCGTCGCGGGCCAACGACTGCCGATCCTGATTGCAACGCGGCCGGTGGACTTCCGCTGTGGGCATCA
GGCGCTGGCTCTGATGGTGCAGACCGAGTTGAAGCTCGACCCGCATTCCGGGGTGACGGTGATCTTCCGGTCGAAGCGCGGTGATCGCCTGAAAATCCTG
GTGTGGGATGGCACCGGAATGGTGCTAACCTACAAAATTCTTGAACATGGAAGCTTTGCCTGGCCCAAGGTGCAGGATGGGACGATGCGTCTTTCCAGGG
GTCAATATGAGGCTTTGTTCGAAGGTCTTGACTGGCGACGGGTGATGGCGCAACGGGTGACGGCGCCGTCGGCGGCAGGGTGAGTATCCGGCCGTCTTTG
CATTGTTTTATTTGGCTTTTTTGTGCTGCCTTGCTATAAGGCCGCATGTCGTCTCCCCTTGATCTCAGCCTGTTTCCGAACCTTCCGCCAGAGGTGGTGA
AGGCGTTTGCGGCGATGCAGTTCGAGCTGTCGGTCGAGCGTGCTGCGCGTCAGCATGAGCAGGCTGTGGTGGCCGAAAAGGACGCGTTCATCGCCGAGTT
GAAGGAACTGATCGAGAAGCTTGAGGGGCAGGTTCACGACTATCGGCGCACGAAGTTCGGGCCGAAATCGGAAAAGCTCGATCCGGCGCAGATGGAACTG
GCGCTGGAAGACCTTGAAACGGCAATTGCCGAAACACAGGCGCGGATTGCCGCCGTCGAGAAAAAGATCGAAGCCAGCGCATCTGATCCGGAAAAAGTCG
CTCCTCGCAAGGAGCGTAAGGCCCGTGCACTGCCCGAACATTTGCCGCGGGTCGAGCGGGTGATCGAGCCTGAGAGCATCGTTTGTCCCTGCGGTTGCGG
CAACATGGTCCGGATCGGCGAAGACCGGACGGAACGGCTCGACCGGATTCCGGCGCGCTACGAGGTGATCGTCACGATCCGCCCGAAATACGCCTGCCCC
AAGGGTCGAACGGGCGTCGTCCAGGCCAGAGCGCCGGCGCATCTCTTGGAAGGGAGCTGGCCGACCGAAGCCCTTCTGGCTGAGATTGCCGTCTCCAAGC
ATTCCGAACATATGCCGCTCAACCGGCAGGCCGAGGTCATGGCGCGACACGGGGTGCCGATAGACCGCACCGTCCTGGCCGATTGGATGGGCAGGACGGG
TGCTGCGATCGCGCCGGTGGTCGACCATATGGCCAAACGGCTGCTGTGGGAAAGCACGCGGCTTTATGTCGACGAGACAACGGCTCCGGTGCTTGATCCG
GGGCGAGGCAAAACGAAGACCGGTTATCTATGGGCCGTGTTGCGTGACGATCGCGGCTGGAATGGCTCTGCGCCGCCAGGCGTGGTGTTCCATTACCGGC
CCGGGCGTAAAGGCGAATATGCCGCTGAAATCCTCGACGGGTTCAACGGGACAATCCAGGTGGATGCCTACGGTGGTTACTCTCACCTCGCCACGTTGGA
CCGGGTGGGTGGCGATCCCTTGAAGCTGGCTTTCTGTTGGGCGCACGGGCGCAGAAAGCTGATCAAAGCCACGCCAAAGAGTGGATCGCCCATCGTCGAC
GAGGCGCTGGTGCGGATCGCCGCGCTCTACAAGATCGAAGACAGTATCCGTGGCTCAGATCCCGAACATCGCCGGGCAGTTCGACAGGACCTCTCCCTCC
CGCTGGTGGACGCGTTCTTCGCCTGGCTGGCAGCGCAAGCCAAGCGCGTCTCACGCAAGTCTGACCTCGGAAAAGCCCTGGCCTATATGCTAACGCGGCA
GGACGGGTTCCGGCTGTTCCTGGACGACGGCCACGTCGATATCGACTCCAACCTGGTGGAAAACGCGATCCGCCGACCGGCCATGAACCGCCGCAATGCG
CTCTTTGCGGGGCACGATGAAGGGGGCCGCAATTGGGCCCGGTTTGCCAGCCTGATCGGCACTTGTAAAATGAACGGCGTTGAGCCCTACGCCTATCTGT
GCAACCTCTTCACCCGCCTCGCAAACGGCCACCTCGCCAAAGACATCGATGCCCTGATGCCATGGGCCTATGCCGCTCGCATCCAGGCCTCACAATGAGC
TCGTCAGATACTCTTCGGTGAGCTCATTTGACGGCCCGTCGATCAACCTCAGATCGCCGCAACTGAATAATCGATGGGGCGTGGACGCCGCATAC
Protein section
ORF number : 5
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
444 bp | 147 aa | 90 | 533 | + | No |
AG : IS66 TnpA
ORF sequence :
MADDGFVGRYEVVEPRRGNRRWPDDVKARIVAESLEPGVRVVDVARRHDVVPHQLSFWRRQAREGILALPFEAMPGLSESGDAEPAFVPLAIAAEPSEAV
NVLAPPLSEAVSSVLTLEIGPDVVLRVPGDVPVERVAALVRAMRAPV
NVLAPPLSEAVSSVLTLEIGPDVVLRVPGDVPVERVAALVRAMRAPV
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
354 bp | 117 aa | 530 | 883 | + | No |
AG : IS66 TnpB
ORF sequence :
MIVAGQRLPILIATRPVDFRCGHQALALMVQTELKLDPHSGVTVIFRSKRGDRLKILVWDGTGMVLTYKILEHGSFAWPKVQDGTMRLSRGQYEALFEGL
DWRRVMAQRVTAPSAAG
DWRRVMAQRVTAPSAAG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1653 bp | 550 aa | 946 | 2598 | + | No |
Chemistry : DDE
ORF sequence :
MSSPLDLSLFPNLPPEVVKAFAAMQFELSVERAARQHEQAVVAEKDAFIAELKELIEKLEGQVHDYRRTKFGPKSEKLDPAQMELALEDLETAIAETQAR
IAAVEKKIEASASDPEKVAPRKERKARALPEHLPRVERVIEPESIVCPCGCGNMVRIGEDRTERLDRIPARYEVIVTIRPKYACPKGRTGVVQARAPAHL
LEGSWPTEALLAEIAVSKHSEHMPLNRQAEVMARHGVPIDRTVLADWMGRTGAAIAPVVDHMAKRLLWESTRLYVDETTAPVLDPGRGKTKTGYLWAVLR
DDRGWNGSAPPGVVFHYRPGRKGEYAAEILDGFNGTIQVDAYGGYSHLATLDRVGGDPLKLAFCWAHGRRKLIKATPKSGSPIVDEALVRIAALYKIEDS
IRGSDPEHRRAVRQDLSLPLVDAFFAWLAAQAKRVSRKSDLGKALAYMLTRQDGFRLFLDDGHVDIDSNLVENAIRRPAMNRRNALFAGHDEGGRNWARF
ASLIGTCKMNGVEPYAYLCNLFTRLANGHLAKDIDALMPWAYAARIQASQ
IAAVEKKIEASASDPEKVAPRKERKARALPEHLPRVERVIEPESIVCPCGCGNMVRIGEDRTERLDRIPARYEVIVTIRPKYACPKGRTGVVQARAPAHL
LEGSWPTEALLAEIAVSKHSEHMPLNRQAEVMARHGVPIDRTVLADWMGRTGAAIAPVVDHMAKRLLWESTRLYVDETTAPVLDPGRGKTKTGYLWAVLR
DDRGWNGSAPPGVVFHYRPGRKGEYAAEILDGFNGTIQVDAYGGYSHLATLDRVGGDPLKLAFCWAHGRRKLIKATPKSGSPIVDEALVRIAALYKIEDS
IRGSDPEHRRAVRQDLSLPLVDAFFAWLAAQAKRVSRKSDLGKALAYMLTRQDGFRLFLDDGHVDIDSNLVENAIRRPAMNRRNALFAGHDEGGRNWARF
ASLIGTCKMNGVEPYAYLCNLFTRLANGHLAKDIDALMPWAYAARIQASQ
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
417 bp | 138 aa | 2049 | 2465 | - | No |
Annotation : Description :
ORF sequence :
MRVELQLGLHHQSQRLMPTAEVHRPRCNQDRQSLARDDHTGARMARTRAATRSTGTSPGTRNTTSGPISNVKTDETASDSGGAKTFTASLGSAAIAKGTN
AGSASPLSDRPGIASNGSARMPSRACRRQKESWCGTTS
AGSASPLSDRPGIASNGSARMPSRACRRQKESWCGTTS
Blast result :ORF 5
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
795 bp | 264 aa | 967 | 1761 | - | No |
Annotation : Description :
ORF sequence :
MVDHRRDRSTRPAHPIGQDGAVYRHPVSRHDLGLPVERHMFGMLGDGNLSQKGFGRPAPFQEMRRRSGLDDARSTLGAGVFRADRDDHLVARRNPVEPFR
PVFADPDHVAATAGTNDALRLDHPLDPRQMFGQCTGLTLLARSDFFRIRCAGFDLFLDGGNPRLCFGNCRFKVFQRQFHLRRIELFRFRPELRAPIVVNL
PLKLLDQFLQLGDERVLFGHHSLLMLTRSTLDRQLELHRRKRLHHLWRKVRKQAEIKGRRHAAL
PVFADPDHVAATAGTNDALRLDHPLDPRQMFGQCTGLTLLARSDFFRIRCAGFDLFLDGGNPRLCFGNCRFKVFQRQFHLRRIELFRFRPELRAPIVVNL
PLKLLDQFLQLGDERVLFGHHSLLMLTRSTLDRQLELHRRKRLHHLWRKVRKQAEIKGRRHAAL
Blast result :
Comments
ORFs 4 and 5 are on the complementary strand. ISRm14 has been chosen as the family type. It is present with a copy number from 1 to 6 in 66% of S. meliloti strains tested. (Schneiker et al., 1999) and is closely related to ISRLdTAL1145-1.
References
1] Simon,R., Hotte,B., Klauke,B. and Kosier,B.(1991) J. Bacteriol. 173 (4), 1502-1508.Schneiker,S., Kosier,B., Puehler,A. and Selbitschka,W.(1999) Curr.Microbiol. 39, 274-281.