ISBdi7
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Bradyrhizobium diazoefficiens | Bradyrhizobium diazoefficiens SEMIA 5080 Bradyrhizobium diazoefficiens Bradyrhizobium diazoefficiens USDA 122 |
DNA section
IS Length : 2479 bp
Ends
IR Length : 23/26
IRL : TGTCATTCGGCTTGCAATATTGACCCCCTAAGCCGGGGGATCGGCGTCCA
IRR : TGTCAATCGGCTTGCAAATTTGACCCTTCATCGGCGTCCAATTTTGACCC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCTTCGATGG | GCTATGGCTC | 0 |
DNA sequence
TGTCATTCGGCTTGCAATATTGACCCCCTAAGCCGGGGGATCGGCGTCCAAAATTGCCCCCCTACAGGTTGGTCTGTTGCGCTGCCTGCTTCGTAACAAA
GCAGGTGGTGGAGGATGCTGGTCGTGGAGACGATTGCGCGGATTCGGCGCGAGCACTTCATCAAGGGCAAGACGATCAAGGAGATCGCCCGTGACCTGAA
GGTGTCACGGAACACGGTTCGGAAGGTGCTGAGGTCGGGAGAGACCTCCTTCGAGTATGAGCGGCAAGTGCAGCCGCGGCCAAAGCTCGGACGATGGGCA
GTCGAACTTGACGGGCTGCTTGCGGCGAACGCGGCTAAATCGGCTCGTGAGCAGCTGACATTGATCCGGATCTTCGAAGAGCTGCGCGGTCGCGGCTATG
ACGGCGGCTACGATGCGGTGCGACGTTACGCCAGGCGGTGGAGCAAGGAACGCGGGCAATCGACCGCGGCGGCCTATGTCCCGCTGAGTTTTGCGCCAGG
CGAAGCCTACCAGTTCGACTGGAGCCATGAGGTGGTGCTGTTGAGCGGAACCACCGTGGTCGTGAAGGTTGCTCACGTCCGGCTCTGCCACAGCCGTATG
CTATTTGCCCGGGCGTATCCGCGAGAGACGCAGGAGATGGTGTTCGACGCCCACGACCGAGCGTTCGCGCTGTTCAAGGGCACCTGCACCCGCGGCATCT
ACGACAACATGAAGACCGCCGTGGAGACGATCTTCGTCGGTAAAGGGCGTCTTTACAATCGCCGGTTTTTGCAGATGTGCAGCCACTATCTGGTCGATCC
AGTCGCCTGCACGCCGGCGTCTGGCTGGGAGAAGGGGCAGGTTGAGAACCAGGTCGGGCTGGTCAGGGAACGCTTCTTCACGCCGCGTCTGCGGTTCAAA
AACCTCGACGAGTTAAACGCCTGGCTGCTCGACAAATGCATCACCTACGCCAAGGCTCATCGCCATCCGGAACTGGTCGATCAGACGATCTGGGACGTGT
TCGAAGTCGAACGCCCTAGGCTCGTTCCCTATGCAGGGCGATTTGACGGCTTCCATGCGGTGACGGCGTCGGTCTCGAAGACCTGCCTGGTGCGCTTCGA
CAACAACAAGTACTCGGTCGCAGCCAGTGCAGTCGGACGACCGGTCGAGGTTCAAGCCTATGCCGATCGCATCGTGATCCGCCAGGATGGACGTATCGTT
GCCGAGCATCAGCGATCCTTTGGCCGCGGCGATACCGTCTACGACCCTTGGCATTATGTGCCGGTGCTCGCCCGCAACCCCGGCGCCTTACGCAACGGCG
CTCCCTTCAAGGACTGGGTGCTGCCGGCCGCGATCGAGCGGATCCGGCGCAAGCTTGCCAGCATCGACGATGGCAATCGGCAGATGGTCGACATCCTCAA
CGCGGTGCTGACTGACGGTCTGCCTGCGGTGGAAGCGGCCTGTGCCGAAGCGCTCAGTCACAGCGTTCATTCCGCCGATGTGGTGCTCAACATCCTGGCC
CGTCAACGTGAGCCAGCCCCGCCGGCCAACATCATGACGCCAGCCGCCCTGACGCTCCGTCATGCACCAATCGCCGATTGTGCCCGCTACGACAACCTTC
GGAGGACCAACTGATGGAACGAACTCAAATCTTCGACCTCATGGGCGAGCTCAAGCTCTATGGCATGAAGGCTGCCTTCGACGAGATCATGGCAACTGCC
GTCAAACGTCAGCATGAACCCCAGCGCATCGTTGGCGATCTACTCTCCGCCGAGATCAACGAGAAGCAAGCCAGATCTATCAAATACCAGCTCACCATTG
CCAAGCTGCCGCTTGCCAAGGACATTGCCGACTTCCAGTTCGACGGCACGCCGATCAATCAGACGCTCGTCAATGATCTTGCTGGCGGCGGCTTCGTCGC
CCAACAGCGCAACGTCGTGCTGGTCGGCGGCACCGGCACAGGCAAAACCCACCTGGCCATTGCCATCGCAAGAAGCTGCATCCGATCTGGTGCCCGCGGC
CGCTTCTTCAACGTGGTCGACCTCGTCAACCGCCTCGAGACCGAGACCCGCAATGGACGGCAGGGACGGCTTGCCGAGCATCTGACCCGGATGGACTTCA
TCGTGCTGGACGAACTCGGCTATTTGCCCTTCGCCCAGTCCGGTGGCCAGCTTCTCTTCCACCTCGTCAGCCGGCTCTATGAGCGCGCCTCCGTCATCGT
GACCACCAATCTCGCATTCGGCGAATGGCCCAGCGTGTTCGGCGACGCCAAAATGACCACAGCGTTGCTCGACCGATTGACCCATCACTGCGACATCGTC
GAGACCGGCAACGATAGCTGGCGGTTCAAAAGCCGAGACGACGATCACGCCACCCGCGCTCGTCTCGCCTCCGCTATCCCGGCCAGCTCCGACGAGACGA
GCGCTACCAGCAAACCCCGCCGCGGAAAGGGGTCAAAATTGGACGCCGATGAAGGGTCAAATTTGCAAGCCGATTGACA
GCAGGTGGTGGAGGATGCTGGTCGTGGAGACGATTGCGCGGATTCGGCGCGAGCACTTCATCAAGGGCAAGACGATCAAGGAGATCGCCCGTGACCTGAA
GGTGTCACGGAACACGGTTCGGAAGGTGCTGAGGTCGGGAGAGACCTCCTTCGAGTATGAGCGGCAAGTGCAGCCGCGGCCAAAGCTCGGACGATGGGCA
GTCGAACTTGACGGGCTGCTTGCGGCGAACGCGGCTAAATCGGCTCGTGAGCAGCTGACATTGATCCGGATCTTCGAAGAGCTGCGCGGTCGCGGCTATG
ACGGCGGCTACGATGCGGTGCGACGTTACGCCAGGCGGTGGAGCAAGGAACGCGGGCAATCGACCGCGGCGGCCTATGTCCCGCTGAGTTTTGCGCCAGG
CGAAGCCTACCAGTTCGACTGGAGCCATGAGGTGGTGCTGTTGAGCGGAACCACCGTGGTCGTGAAGGTTGCTCACGTCCGGCTCTGCCACAGCCGTATG
CTATTTGCCCGGGCGTATCCGCGAGAGACGCAGGAGATGGTGTTCGACGCCCACGACCGAGCGTTCGCGCTGTTCAAGGGCACCTGCACCCGCGGCATCT
ACGACAACATGAAGACCGCCGTGGAGACGATCTTCGTCGGTAAAGGGCGTCTTTACAATCGCCGGTTTTTGCAGATGTGCAGCCACTATCTGGTCGATCC
AGTCGCCTGCACGCCGGCGTCTGGCTGGGAGAAGGGGCAGGTTGAGAACCAGGTCGGGCTGGTCAGGGAACGCTTCTTCACGCCGCGTCTGCGGTTCAAA
AACCTCGACGAGTTAAACGCCTGGCTGCTCGACAAATGCATCACCTACGCCAAGGCTCATCGCCATCCGGAACTGGTCGATCAGACGATCTGGGACGTGT
TCGAAGTCGAACGCCCTAGGCTCGTTCCCTATGCAGGGCGATTTGACGGCTTCCATGCGGTGACGGCGTCGGTCTCGAAGACCTGCCTGGTGCGCTTCGA
CAACAACAAGTACTCGGTCGCAGCCAGTGCAGTCGGACGACCGGTCGAGGTTCAAGCCTATGCCGATCGCATCGTGATCCGCCAGGATGGACGTATCGTT
GCCGAGCATCAGCGATCCTTTGGCCGCGGCGATACCGTCTACGACCCTTGGCATTATGTGCCGGTGCTCGCCCGCAACCCCGGCGCCTTACGCAACGGCG
CTCCCTTCAAGGACTGGGTGCTGCCGGCCGCGATCGAGCGGATCCGGCGCAAGCTTGCCAGCATCGACGATGGCAATCGGCAGATGGTCGACATCCTCAA
CGCGGTGCTGACTGACGGTCTGCCTGCGGTGGAAGCGGCCTGTGCCGAAGCGCTCAGTCACAGCGTTCATTCCGCCGATGTGGTGCTCAACATCCTGGCC
CGTCAACGTGAGCCAGCCCCGCCGGCCAACATCATGACGCCAGCCGCCCTGACGCTCCGTCATGCACCAATCGCCGATTGTGCCCGCTACGACAACCTTC
GGAGGACCAACTGATGGAACGAACTCAAATCTTCGACCTCATGGGCGAGCTCAAGCTCTATGGCATGAAGGCTGCCTTCGACGAGATCATGGCAACTGCC
GTCAAACGTCAGCATGAACCCCAGCGCATCGTTGGCGATCTACTCTCCGCCGAGATCAACGAGAAGCAAGCCAGATCTATCAAATACCAGCTCACCATTG
CCAAGCTGCCGCTTGCCAAGGACATTGCCGACTTCCAGTTCGACGGCACGCCGATCAATCAGACGCTCGTCAATGATCTTGCTGGCGGCGGCTTCGTCGC
CCAACAGCGCAACGTCGTGCTGGTCGGCGGCACCGGCACAGGCAAAACCCACCTGGCCATTGCCATCGCAAGAAGCTGCATCCGATCTGGTGCCCGCGGC
CGCTTCTTCAACGTGGTCGACCTCGTCAACCGCCTCGAGACCGAGACCCGCAATGGACGGCAGGGACGGCTTGCCGAGCATCTGACCCGGATGGACTTCA
TCGTGCTGGACGAACTCGGCTATTTGCCCTTCGCCCAGTCCGGTGGCCAGCTTCTCTTCCACCTCGTCAGCCGGCTCTATGAGCGCGCCTCCGTCATCGT
GACCACCAATCTCGCATTCGGCGAATGGCCCAGCGTGTTCGGCGACGCCAAAATGACCACAGCGTTGCTCGACCGATTGACCCATCACTGCGACATCGTC
GAGACCGGCAACGATAGCTGGCGGTTCAAAAGCCGAGACGACGATCACGCCACCCGCGCTCGTCTCGCCTCCGCTATCCCGGCCAGCTCCGACGAGACGA
GCGCTACCAGCAAACCCCGCCGCGGAAAGGGGTCAAAATTGGACGCCGATGAAGGGTCAAATTTGCAAGCCGATTGACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1500 bp | 499 aa | 115 | 1614 | + | No |
Chemistry : DDE
ORF sequence :
MLVVETIARIRREHFIKGKTIKEIARDLKVSRNTVRKVLRSGETSFEYERQVQPRPKLGRWAVELDGLLAANAAKSAREQLTLIRIFEELRGRGYDGGYD
AVRRYARRWSKERGQSTAAAYVPLSFAPGEAYQFDWSHEVVLLSGTTVVVKVAHVRLCHSRMLFARAYPRETQEMVFDAHDRAFALFKGTCTRGIYDNMK
TAVETIFVGKGRLYNRRFLQMCSHYLVDPVACTPASGWEKGQVENQVGLVRERFFTPRLRFKNLDELNAWLLDKCITYAKAHRHPELVDQTIWDVFEVER
PRLVPYAGRFDGFHAVTASVSKTCLVRFDNNKYSVAASAVGRPVEVQAYADRIVIRQDGRIVAEHQRSFGRGDTVYDPWHYVPVLARNPGALRNGAPFKD
WVLPAAIERIRRKLASIDDGNRQMVDILNAVLTDGLPAVEAACAEALSHSVHSADVVLNILARQREPAPPANIMTPAALTLRHAPIADCARYDNLRRTN
AVRRYARRWSKERGQSTAAAYVPLSFAPGEAYQFDWSHEVVLLSGTTVVVKVAHVRLCHSRMLFARAYPRETQEMVFDAHDRAFALFKGTCTRGIYDNMK
TAVETIFVGKGRLYNRRFLQMCSHYLVDPVACTPASGWEKGQVENQVGLVRERFFTPRLRFKNLDELNAWLLDKCITYAKAHRHPELVDQTIWDVFEVER
PRLVPYAGRFDGFHAVTASVSKTCLVRFDNNKYSVAASAVGRPVEVQAYADRIVIRQDGRIVAEHQRSFGRGDTVYDPWHYVPVLARNPGALRNGAPFKD
WVLPAAIERIRRKLASIDDGNRQMVDILNAVLTDGLPAVEAACAEALSHSVHSADVVLNILARQREPAPPANIMTPAALTLRHAPIADCARYDNLRRTN
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
864 bp | 287 aa | 1614 | 2477 | + | No |
AG : IS21 helper
ORF sequence :
MERTQIFDLMGELKLYGMKAAFDEIMATAVKRQHEPQRIVGDLLSAEINEKQARSIKYQLTIAKLPLAKDIADFQFDGTPINQTLVNDLAGGGFVAQQRN
VVLVGGTGTGKTHLAIAIARSCIRSGARGRFFNVVDLVNRLETETRNGRQGRLAEHLTRMDFIVLDELGYLPFAQSGGQLLFHLVSRLYERASVIVTTNL
AFGEWPSVFGDAKMTTALLDRLTHHCDIVETGNDSWRFKSRDDDHATRARLASAIPASSDETSATSKPRRGKGSKLDADEGSNLQAD
VVLVGGTGTGKTHLAIAIARSCIRSGARGRFFNVVDLVNRLETETRNGRQGRLAEHLTRMDFIVLDELGYLPFAQSGGQLLFHLVSRLYERASVIVTTNL
AFGEWPSVFGDAKMTTALLDRLTHHCDIVETGNDSWRFKSRDDDHATRARLASAIPASSDETSATSKPRRGKGSKLDADEGSNLQAD
Blast result :
Comments
ISBdi7 is 86% aa (transposase) and 90% aa (helper of transposition) similar to ISMex39.
References
1] Gesiele Carvalho (2018) Direct submission.