ISRel8
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004041 | ND | Rhizobium etli | Rhizobium etli CFN 42 symbiotic plasmid p42d |
DNA section
IS Length : 3226 bp
Ends
IR Length : 17/22
IRL : GTAACCGGTGCTCCGCTCCCACGTTGTCTTTCCTGCCATGATCTGTGATC
IRR : GTAAGCGATGTGCCGCTCACACCTCGACAAAGAAGCAGTTTTTTGATGGC
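The `17/22` value can be read as 17 identical positions over the first 22 bp of the two terminal inverted repeats. The following is a minimal sketch of that count, assuming (as the sequences in this record suggest) that IRR is listed as the reverse complement of the element's right end so that it aligns directly with IRL. The function names are illustrative and not part of any database tooling.

```python
IRL = "GTAACCGGTGCTCCGCTCCCACGTTGTCTTTCCTGCCATGATCTGTGATC"
IRR = "GTAAGCGATGTGCCGCTCACACCTCGACAAAGAAGCAGTTTTTTGATGGC"
# Last 26 bp of the element, copied from the final line of the DNA sequence.
RIGHT_END = "CGAGGTGTGAGCGGCACATCGCTTAC"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase ACGT string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def count_identities(a: str, b: str, window: int) -> int:
    """Count positions with identical bases in the first `window` bases."""
    return sum(x == y for x, y in zip(a[:window], b[:window]))


matches = count_identities(IRL, IRR, 22)
print(f"{matches}/22")  # 17/22

# IRR as listed begins with the reverse complement of the right end.
print(reverse_complement(RIGHT_END) == IRR[:len(RIGHT_END)])  # True
```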
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCTGTTCCGAC | GGATCGCA | TTTAATCGCG | 8 |
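The table above describes a target-site duplication: the 8-bp direct repeat appears on both sides of the inserted element. The sketch below rebuilds the junctions before and after insertion under that standard model. The helper names are hypothetical, and `IS_PLACEHOLDER` stands in for the 3226-bp element rather than reproducing it.

```python
LEFT_FLANK = "GCTGTTCCGAC"
DIRECT_REPEAT = "GGATCGCA"
RIGHT_FLANK = "TTTAATCGCG"
IS_PLACEHOLDER = "[ISRel8]"  # placeholder, not real sequence


def empty_site(left: str, dr: str, right: str) -> str:
    """Target before insertion: a single copy of the repeat (assumed model)."""
    return left + dr + right


def occupied_site(left: str, dr: str, right: str, element: str) -> str:
    """Target after insertion: the repeat is duplicated around the element."""
    return left + dr + element + dr + right


before = empty_site(LEFT_FLANK, DIRECT_REPEAT, RIGHT_FLANK)
after = occupied_site(LEFT_FLANK, DIRECT_REPEAT, RIGHT_FLANK, IS_PLACEHOLDER)

print(len(DIRECT_REPEAT))  # 8
print(before)
print(after)
```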
DNA sequence
GTAACCGGTGCTCCGCTCCCACGTTGTCTTTCCTGCCATGATCTGTGATCGCCTTTCCATGGACGAGCAACGGGAGTTTCTCCATGGAGACTACATTGGA
GGTTCTCACAACCAGGAAGTCTGGACGTGAGGTTCACCGGCATTGGCCGGATGACGTGAAGGCGCAGATCGTTTCGGAGAGCCTTCGGCCCGGCGCGATG
GTAAGTGAGGTCGCAGAGCGGCATGGGTTGAAGCCGAACCACCTCTCCACTTGGAGAACGATGGCGCGGCAGGGTAAGTTGCTTCTGCCCGCACCCGAAG
ACCCGATGGAGTTTGCAGCGGTAATCGTTGATCCGCCCGTTTCGGAGCCGCCGATCAAGAAAGCCAGTCGCGCCGAGATCATCGTCGGTTCCGTCACCAT
TCGTCTGGAAGAAGGTGCGTCTGCCGCTCGGATCGCCGCTGTCGCGCGTGCCTGTGCGGCTCCCGCATGATCTTTCCATCCAACCGCGTGCGGATCATGG
TGGCGACCAAGCCAGTCGACTTTCGCAAAGGCCATGATGGCTTGGCGGCACTGGTCAAGAACGAGCTACACAAAGACCCATTCACTGGAACTGTCTTCGT
ATTCCGGTCGCGCAAAGCAGATCGGTTAAAGCTGATCTACTGGGACGGCAGCGGTCTGGTGATGGCATACAAGCGGCTGGAGGAACACACGTTCACTTGG
CCGGGCATCAAGGATGGCCTGATGACACTGGGCCATGCTCAGTTCGAGGCGCTGTTTGCAGGGCTCGACTGGCGTCGGGTCCGTGCGGTAGAAGCGAGAG
CACCGGACGCCATCGAATAGATGCGGCACGATGACTCGGTCGGCAAATTCCGACGGCGAAGCGCCGCTGGTTTTGATAGGCTTCCACCATGTTGGACACC
GCCGATTTCCCTGACGATATCGATGCCCTGAAGACGATGCTGATCGCGGCGCAAGCGCGTGAAGTTCGTAAGGATGAGCGGATCGAGCGACTGGAGAAGC
TTGTCGCCGCATTCAAGCAGGCTGCCTTCGGTCGCAAATCCGAGAAGGCTGATCCTGAGCAGTTCGATCTCGCGCTGGAAGACCTGGAAACGACCATCGC
TGCCATCCATGCCGAAGACGAGGCCGATACATCCTCGGGCAAGCAGCGATCCAAATCCCGCGCCGTCAATCGCGGCTCTCTTCTCGCCCATCTTCCTCGT
ATCGAAGAGATAATTGAACCGGAAAGCCTGATCTGCGCTTGCGGCGGTTGCCTGCATCGCATCGGAGAGGACGTGTCTGAGCGGCTGGATGTTATCCCGG
CGCAGTTCCGCGTCATCGTCACTCGCCGACCCAAATATGCCTGCCGTGCCTGTACTGATGGCGTTGCCCAGGCTCCAGCGCCCGCACGACTGATCCAAGC
CGGATTGCCGACAGAGGCAACCATCGCGCATGTGCTGGTGTCCAAATACGCCGACCATCTGCCGCTTTATCGACAGGCCCAGATTATGAGCCGCCAAGGC
ATCGATCTCGACCGTTCCACTCTGGCCGATTGGGTTGGTCGGGCGGCATTCGAGTTGCGCCCGGTCTTCGACGCTTTGATCGCCGATCTAAAGCGCTCGA
CAAAACTGTTCATGGACGAAACCCGTGCGCCGGTCCTCGATCCAGGCTCTCGCAAAACCAAGACCGGATACTTCTGGGCGCTGGCCCGCGACGACCGACC
ATGGAGCGGCGGGGCTCCACCCGGAGTGGCTTTCACCTACGCTCCCGGTCGTGGCGGCCAACATGCCGAACGGATATTGCGGGGGTTCACGGGCGTTCTC
CAAGTGGACGGCTACGCCGGATATAATCGGCTGATCGCACCGGACCGTGTCGGCCCGGACATCCGGCTTGCCTATTGTTGGGCACATGCACGGCGCAAGC
TGGTCGAGATCACTCGTACCGGCACAGCGCCGATTGCCGAGGAAGGCGTGGAGCGGATTGGTGAACTCTATCGGGTCGAGTCCGAGCTACGCGGGCTTCC
CGCAGAAGCTCGCCTCGCCGGGCGACAGGAACGATCAGCGCCATTGATTGCGGACATGCGAACTTGGCTCACGCAGCATCGTGCCCGCGTCGCTGGCAAG
TCGTCGCTTGGCGAGGCGCTCGCCTACATCGCCAAATATTGGGATGGCCTCTGCGTCTTCTTGACCGACGGTCGGATCGAGATCGACAACAATAGTATCG
AGAGAACTATCCGGCCGATAGCGCTCAATCGGAAGAACGCTCTCTTCGCCGGGCACGACATGGGAGCACGGAATTGGGCAACCATCGCCTCGCTCATCGA
GACGTGCAAGCTCAATGCCGTTGATCCACAAGCCTACCTCACCAGCACGCTCACCGCTATCGTAAACGGCCATAAGCAAAACCGGATCGATGAACTCCTG
CCGTGGAATCATCCGGTTTAGTATGGTCAATCGCAAAATGACTGCAGCAATCAGACGCGAAGAGACATAGGAACCTTATCGTTCCGTCGGCTACGATTAA
CGCTATCTACGGGTTGATCTGATGTAAGCGGGAAGCCTAAATGCAAGTACGAATGCGGCTAGGTAAAATCCAAAGATGTGGATCATACTCTTCCAGATGT
TCACCTCTATCGGCGACTCGAAGCTCATTCGATAACGCGAATAATGCTCAGTTGGACCGATATACGCCGACGCCAAGCTGGCAGCTAATATAATGATGAA
CGCTGTCATCATTCCCTGGACGTGGTAGCTGTCGGATGCAAACAGTTGCTTTACAAGAAACACGACGACGGCAGGAAGCATGACCACGCTGGCAAGGGAT
CGCATTGCGATCAACAGGCATTGATCAGACTGCGAATAACGACTTGTTTTTAAAGCAGATAGTATCGCGGTATTTGGGGGCCAAAACCGCAGCATAACCT
GGTTATTGACAAACTCGCAGGATATTGACGGGTGGGCTCGAAAAATGAGAAACGCAAAGAATATCACGGCACCATAGGCTGCGATGGTGGCGCAAATGAT
GCCTAACTGTCGGCTTGCAATATTTTGATCTGCCTTAATCTTCTTGGCTACGTGATCGAAAATATCTTCGTTGACCTCGTCCTTCATCCAATAACCACTT
CCGAAGTAACACTCGAGGTTGAGTAACACCGCAAATGTTGGAAGACAATGACATGGCTGTTGCGACCCCGCTCCAAGCCATCAAAAAACTGCTTCTTTGT
CGAGGTGTGAGCGGCACATCGCTTAC
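As a quick consistency check between the DNA and protein sections, the first line of the sequence can be translated from position 84, the listed start of ORF 1. The sketch below builds the standard genetic code in plain Python and translates the first five codons. Use of the standard code is an assumption, since the record does not state which translation table was applied, and the `translate` helper is illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Positions 1 to 100 of the element (first line of the DNA sequence).
FIRST_LINE = (
    "GTAACCGGTGCTCCGCTCCCACGTTGTCTTTCCTGCCATGATCTGTGATC"
    "GCCTTTCCATGGACGAGCAACGGGAGTTTCTCCATGGAGACTACATTGGA"
)


def translate(dna: str, begin: int, n_codons: int) -> str:
    """Translate `n_codons` codons starting at 1-based position `begin`."""
    start = begin - 1  # convert to 0-based index
    codons = (dna[i:i + 3] for i in range(start, start + 3 * n_codons, 3))
    return "".join(CODON_TABLE[c] for c in codons)


print(translate(FIRST_LINE, 84, 5))  # METTL
```

The result matches the first five residues of the ORF 1 protein sequence.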
Protein section
ORF number : 4
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
387 bp | 128 aa | 84 | 470 | + | No |
AG : IS66 TnpA
ORF sequence :
METTLEVLTTRKSGREVHRHWPDDVKAQIVSESLRPGAMVSEVAERHGLKPNHLSTWRTMARQGKLLLPAPEDPMEFAAVIVDPPVSEPPIKKASRAEII
VGSVTIRLEEGASAARIAAVARACAAPA
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
354 bp | 117 aa | 467 | 820 | + | No |
AG : IS66 TnpB
ORF sequence :
MIFPSNRVRIMVATKPVDFRKGHDGLAALVKNELHKDPFTGTVFVFRSRKADRLKLIYWDGSGLVMAYKRLEEHTFTWPGIKDGLMTLGHAQFEALFAGL
DWRRVRAVEARAPDAIE
Blast result :
ORF 3
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1533 bp | 510 aa | 889 | 2421 | + | No |
Chemistry : DDE
ORF sequence :
MLDTADFPDDIDALKTMLIAAQAREVRKDERIERLEKLVAAFKQAAFGRKSEKADPEQFDLALEDLETTIAAIHAEDEADTSSGKQRSKSRAVNRGSLLA
HLPRIEEIIEPESLICACGGCLHRIGEDVSERLDVIPAQFRVIVTRRPKYACRACTDGVAQAPAPARLIQAGLPTEATIAHVLVSKYADHLPLYRQAQIM
SRQGIDLDRSTLADWVGRAAFELRPVFDALIADLKRSTKLFMDETRAPVLDPGSRKTKTGYFWALARDDRPWSGGAPPGVAFTYAPGRGGQHAERILRGF
TGVLQVDGYAGYNRLIAPDRVGPDIRLAYCWAHARRKLVEITRTGTAPIAEEGVERIGELYRVESELRGLPAEARLAGRQERSAPLIADMRTWLTQHRAR
VAGKSSLGEALAYIAKYWDGLCVFLTDGRIEIDNNSIERTIRPIALNRKNALFAGHDMGARNWATIASLIETCKLNAVDPQAYLTSTLTAIVNGHKQNRI
DELLPWNHPV
Blast result :
ORF 4
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
627 bp | 208 aa | 3129 | 2503 | - | No |
Annotation :
Description :
ORF sequence :
Blast result :
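The lengths in the ORF tables follow from the begin and end coordinates: the nucleotide span is |end - begin| + 1, and the protein is one codon shorter than span / 3 because the stop codon is included in the span. The sketch below recomputes those values from the coordinates listed in this section. The dictionary layout and function name are illustrative only.

```python
# (begin, end, strand) as listed in the ORF tables of this record.
ORFS = {
    "ORF 1": (84, 470, "+"),
    "ORF 2": (467, 820, "+"),
    "ORF 3": (889, 2421, "+"),
    "ORF 4": (3129, 2503, "-"),
}


def orf_lengths(begin: int, end: int) -> tuple[int, int]:
    """Return (bp, aa) for an ORF whose span includes its stop codon."""
    bp = abs(end - begin) + 1
    if bp % 3 != 0:
        raise ValueError("ORF span is not a multiple of three")
    return bp, bp // 3 - 1  # subtract the stop codon


for name, (begin, end, strand) in ORFS.items():
    bp, aa = orf_lengths(begin, end)
    print(f"{name} ({strand}): {bp} bp, {aa} aa")

# Overlap between the end of ORF 1 and the start of ORF 2, in bp.
overlap = ORFS["ORF 1"][1] - ORFS["ORF 2"][0] + 1
print(overlap)  # 4
```

ORF 1 and ORF 2 overlap by 4 bp (positions 467 to 470), where the DNA sequence reads ATGA: the ATG start codon of ORF 2 overlaps the TGA stop codon of ORF 1.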
Comments
ISRel8 shares 76% amino acid similarity with IS71 (ORFA), 73% with ISBj7 (ORFB), and 70% with ISApr6 (ORFC, the transposase). The fourth ORF is a passenger gene annotated as a hypothetical protein.
References