ISRel19
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_007762 | ND | Rhizobium etli | Rhizobium etli CFN 42 plasmid p42a Rhizobium etli CFN 42 |
DNA section
IS Length : 2798 bp
Ends
IR Length : 17/24
IRL : GTGAGCGTACGGTGAGCTGTTTTTGCTGATCGGCCAAATCAGGCGATGTT
IRR : GTAAGCGTCCGGCGAACGCATTTTCGGGTGTCGAGAGATGGAATCTATAG
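The "IR Length : 17/24" value can be reproduced from the two end sequences listed above. The sketch below assumes the figure means "identical positions / compared positions" over the first 24 nt of IRL and IRR as displayed (IRR is shown here already reverse-complemented relative to the right end of the DNA sequence). The variable names are illustrative only.

```python
# Terminal inverted repeats exactly as listed in this entry.
irl = "GTGAGCGTACGGTGAGCTGTTTTTGCTGATCGGCCAAATCAGGCGATGTT"
irr = "GTAAGCGTCCGGCGAACGCATTTTCGGGTGTCGAGAGATGGAATCTATAG"

# Assumption: "17/24" is read as matches over a 24 nt window from the tips.
window = 24

# Count positions where the two ends carry the same base.
matches = sum(a == b for a, b in zip(irl[:window], irr[:window]))

print(f"{matches}/{window}")
```

Under that reading, the count agrees with the value given in the entry.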
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGAGCATTCT | GATTGAAG | CGATTTGCGC | 8 |
AGCAATTCCG | ACAACATC | GACTACGGCG | 8 |
GGAGAAAGCG | CATCCAAA | GGCGTTTTTG | 8 |
GCGGCTCACG | TAAGCATC | ACGGCGCTCA | 8 |
DNA sequence
GTGAGCGTACGGTGAGCTGTTTTTGCTGATCGGCCAAATCAGGCGATGTTCTGGCATTAGAACCAGCATTAGTGCTGACATTAATACCAGAACGAAGAGG
TTTCATGGCTCGCATAGAGATCATGTCCGGTACCGAGCGCAGACGGCGTTGGTCGGACGAGGCGAAGCTGAGGATACTTGCGGAAGCTGATGAGCCCGGT
GCTCGCATTGGCGATGTGGCGCGCCGGCACGACATTCATCCGGGCCAGATCCGCTTGTGGCGGCAATCATTCAACTATGCCGATCGGCCGACGGTGTTCC
TCCCGGTGGAAATCACGGAGGAGGTTGGCGTAAGCCAGCCGTCTACCGCATCAACAAGGCCGACGATCGTTGAGATCTTGCTTCGAAACGGTCGGAGCTT
GAAGGTTGCGGTTGACGTTGAGTTGAAGCTGCTTGGCCCGCTCGTCGCTTGCGTGGAGGCGGTATGATCGGCCCATCGGGGAATGTGAGGGTCTATCTGG
CCTGCGGAGTGACCGATATGCGGCGTGGCATTGATGGTCTGTCCGCGCTGGTCGAGACGGTCGTGAGGGAGGCACCGGGCTCGGGCGCAATCTTCGGCTT
TCGCGGAAAACGCGCGGACCGGATCAAGCTGCTTTGGTGGGATGGCCAGGGGTTCTGTCTGTTCTACAAGATTTTGGAGCGCGGATACTTTCCCTGGCCG
ACGGCGAAAGAGGGTGTAGCGCACCTGACGCAGGCGCAGCTTTCGATGCTCGTTGAGGGGATCGATTGGCGACGCCCGGCGTGGACTTCCGCTCCCGGCC
GAACGGGATAAAAGCTATATTTTGCAGGGGCATGGGAAGCTTAACGCTGGTGCGTCAGAGGCAAATCGGCTAGTTTCGCGGATATGGAAACAGCGCCGCT
GGACAGTCAGGACGAGCTCAATACTTTGCGCGCACTCGTCGCGGAACAGGCGGCAAAACTTGAGAGCCAGGAAGCCGAGGTCATCAAGCGAGACTCCATA
ATCGGGCTTCTTCGCGCGCAACTGGAGCTTCTCCGACATCGGCAACATGGCGCGTCTTCGGAAAAGATCGACAGGAAGATCGAGCAATTCGAGCTGATGT
TGGAGGAGATCGAGGCCTCCCGGGCCGAGGCTGAACAGCGCTCCGGGAAACCGCCTCTGCCGGAGTTGGACGACGCATCCGAGAAGCCGAAGCGCAAGCC
TCTACCCGATGATCTCACCACCGAAGAACTGGTCTATGCAGCTCCCTGCAATTGCCCGACCTGCGGTGGCACTTCGTTCCTGAAGGCGGCCGACAGAGTG
GTCCAGGTGCTGGAGCACGTGCCGGCGTCGGTCAAGATTGTCCGGCATGTCGAGAAGCGCATGATCTGCAGGGAATGCGATACGACAGTGGCTGGCGAGA
TGCCGACCTTGCCGATCGAGCGCGGCAAACCCGGGCCGGGGCTGCTCGCCCACATCATGATCGCCAAATTCGACGATCACATTCCCCTTTACCGCCTGTC
AGAGATGTACGACCGGTTGGGGATAGACATCTCCCGATCCGTAATGGCCGACTGGGTCGGCCGGGTATCTGCATTGCTGACACCACTTGTCCTGTTAATC
AGGGCCCATATCGCCGCACTCGACCGAATACATACGGACGATACCCCGGTCGATGTTCTCGACCCCGGGCGGGGCAAGACAAAAACCGGCAGGGTCTGGG
TCTACATCTTTGACGGCAGTGGCTATCAATCCGCCACTCCCGCAGCCATTGCCTATTACTACAGCCCCGACCGCAGGGGCGCACATCCGGCTGATCACCT
GGCAAGCTTCAGCGGCGTCATGCATGCCGACGGCTACGGGGGTTACAAGAAGCTCTATGGCAACCAGATCATTGAAGCCGCCTGCATGGCGCATGTGCGC
CGCAAGTTCCATGATGTGATAAAGCTCAAGCCGTCTCCGATCGCTGAGGAAGCGCTGTCACGCATCGGCGCTCTCTACGATATCGAGAACCGTATCCGAG
GCATGTCGGCTGACGAGCGGCGCACCCTGCGCCAACACCATGCCCGGCCTGTTCTGGACGAACTCAAGGCCTGGATCGAGGCGACACTCTCGACTTTGCC
TCAGAAGCAGAGGCTGGCCGAGGCGATGCGATATGCCCTGTCGCGATGGGCAGCCTTAAGCGTCTACATCGACGATGGCCGTGTCGAAATCGACAACAAC
ATCGCTGAACGAGCCATGCGTCCGCTTGGAATTGGAAGGAAGAACTGGCTGTTTGCCGGCTCGGACAAGGGCGGCGAGCGCATCGCCAACATCCTCACCA
TCATCGAGACGGTCAAACTGCAAGGCCATAATCCAGAGGTCTATCTGACGGATGTCCTGACCCGGATCCAGGATCACCCCGAAGACCGAATTGAAGACCT
CCTGCCTTGGAACTGGACGCCGGCAAAAGCTCGATGCGAGGCCGCCTGATGGCGCGCTCGAGGTTCATCTACACACTCAGCCAAGTCGCCGGCATGATCG
GCGAAAACCTCGAACTGATCGAAGAAGTGACCGCAAACTCGGACAACATCTCCGAAGGCGAACTGGTTTACGTCGGCGATGGCAGTGAAGACGGCACGAA
GGGTCTGACCGAGAATGGCATCGAAGAACTTCAAAGCCTACTCGCCGACATCAGGACGTGGGACGGCGGTATCCGCGAGTTCCTCATCGACACACAGTGC
GATCCCGAAATAATCGACCGCATCATGGCCGATGAGATGAAACGCGGCCTATAGATTCCATCTCTCGACACCCGAAAATGCGTTCGCCGGACGCTTAC
Protein section
ORF number : 4
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
363 bp | 120 aa | 105 | 467 | + | No |
AG : IS66 TnpA
ORF sequence :
MARIEIMSGTERRRRWSDEAKLRILAEADEPGARIGDVARRHDIHPGQIRLWRQSFNYADRPTVFLPVEITEEVGVSQPSTASTRPTIVEILLRNGRSLK
VAVDVELKLLGPLVACVEAV
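As a sanity check on the ORF 1 coordinates, the first 30 nt starting at position 105 of the DNA sequence can be translated and compared with the start of the listed protein. This is a minimal sketch that assumes 1-based coordinates and the standard genetic code; `CODON_TABLE` and `translate` are hypothetical helper names, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# Positions 105-134 of the DNA sequence in this entry (start of ORF 1).
orf1_start = "ATGGCTCGCATAGAGATCATGTCCGGTACC"

print(translate(orf1_start))
```

The translated residues match the first ten residues of the ORF 1 sequence above.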
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
348 bp | 115 aa | 464 | 811 | + | No |
AG : IS66 TnpB
ORF sequence :
MIGPSGNVRVYLACGVTDMRRGIDGLSALVETVVREAPGSGAIFGFRGKRADRIKLLWWDGQGFCLFYKILERGYFPWPTAKEGVAHLTQAQLSMLVEGI
DWRRPAWTSAPGRTG
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1566 bp | 521 aa | 884 | 2449 | + | No |
Chemistry : DDE
ORF sequence :
METAPLDSQDELNTLRALVAEQAAKLESQEAEVIKRDSIIGLLRAQLELLRHRQHGASSEKIDRKIEQFELMLEEIEASRAEAEQRSGKPPLPELDDASE
KPKRKPLPDDLTTEELVYAAPCNCPTCGGTSFLKAADRVVQVLEHVPASVKIVRHVEKRMICRECDTTVAGEMPTLPIERGKPGPGLLAHIMIAKFDDHI
PLYRLSEMYDRLGIDISRSVMADWVGRVSALLTPLVLLIRAHIAALDRIHTDDTPVDVLDPGRGKTKTGRVWVYIFDGSGYQSATPAAIAYYYSPDRRGA
HPADHLASFSGVMHADGYGGYKKLYGNQIIEAACMAHVRRKFHDVIKLKPSPIAEEALSRIGALYDIENRIRGMSADERRTLRQHHARPVLDELKAWIEA
TLSTLPQKQRLAEAMRYALSRWAALSVYIDDGRVEIDNNIAERAMRPLGIGRKNWLFAGSDKGGERIANILTIIETVKLQGHNPEVYLTDVLTRIQDHPE
DRIEDLLPWNWTPAKARCEAA
Blast result :
ORF 4
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
306 bp | 101 aa | 2449 | 2754 | + | No |
Annotation : Description :
ORF sequence :
MARSRFIYTLSQVAGMIGENLELIEEVTANSDNISEGELVYVGDGSEDGTKGLTENGIEELQSLLADIRTWDGGIREFLIDTQCDPEIIDRIMADEMKRG
L
Blast result :
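
The coordinates in the four ORF tables can be cross-checked against the stated lengths. The sketch below assumes 1-based inclusive coordinates and that each nucleotide length includes the stop codon (so the protein length is one codon shorter); both are assumptions about the table conventions, and the names `orfs`, `is_consistent`, and `overlaps` are illustrative.

```python
# (name, length_bp, length_aa, begin, end) copied from the ORF tables.
orfs = [
    ("ORF 1", 363, 120, 105, 467),
    ("ORF 2", 348, 115, 464, 811),
    ("ORF 3", 1566, 521, 884, 2449),
    ("ORF 4", 306, 101, 2449, 2754),
]

def is_consistent(length_bp, length_aa, begin, end):
    """Check span vs. stated bp length, and bp length vs. aa length plus stop."""
    span = end - begin + 1
    return span == length_bp and length_bp % 3 == 0 and length_bp // 3 - 1 == length_aa

report = {name: is_consistent(bp, aa, b, e) for name, bp, aa, b, e in orfs}

# Overlap (in bp) between each ORF and the next one on the same strand.
overlaps = {}
for (n1, _, _, _, end1), (n2, _, _, begin2, _) in zip(orfs, orfs[1:]):
    overlaps[f"{n1}/{n2}"] = max(0, end1 - begin2 + 1)

print(report)
print(overlaps)
```

Under these assumptions all four entries are internally consistent. ORF 1 and ORF 2 overlap by 4 bp and ORF 3 and ORF 4 by 1 bp, while ORF 2 and ORF 3 do not overlap.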
Comments
ISRel19 is 91% (orfA), 100% (orfB), 97% (orfC) and 97% (orfD) similar to ISAtu6 at the amino acid level.