ISGdi4
- Family IS630
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Gluconacetobacter diazotrophicus | Gluconacetobacter diazotrophicus |
DNA section
IS Length : 1176 bp
Ends
IR Length : 12/13
IRL : TACCAACTCCGGTCCGCCCTGACTCATATGTGATTGCCTGACGGTATCGT
IRR : TACCAACTCCTGTTTGTTAAAACCCATGTGCCCATTGTCGTAGCCCGAGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CAACCCGACAGCCTGC | TAA | GAGCCTGTTTGGAAAG | 3 |
DNA sequence
TACCAACTCCGGTCCGCCCTGACTCATATGTGATTGCCTGACGGTATCGTTCTGGAACGAGCTGTGATTCACAGGATGCCGAACCGGGCGGGAGGCATCC
GAATGGGCGGGGCGTTAGCGTTGCGTGAGGATTATGATGCGGCGGGACTGCGTGCTCTGGCGCGGACAACGAGGCATGCGGGCCAGGCGCGTCGGCTTCT
GGCGCTGGCGGCGATCTACGATGGTGCGTCACGCGGAGACGCGGCACGACTGGCTGGGACGGATCGGCAGATTGTGCGGGACTGGGTGGTGCGTTTCAAC
GCCGAGGGCCCGGATGGCGTGCGGGATCATCATGGGGGCGGTGTCGTTCCCCGCCTGACACCAGCCATGTTGGAAGCGCTGATGCGCCGGATCGAGGACG
GCCCGATCGCTGCCGTGCATGGGGTGGTGCGCTGGCGGCAGGCTGATCTGGGGCAATGGCTTTATGAGGAATTCGGCGTCTCTCTTTCGCGCAGCCGGCT
GAGCGCCGTTATCCGGGGCCTCGACTTCCGCCTTCTGACGGGACGCCCCCGGCACCATGCCCAGGATCCCGAGGCCCAGGACGTTTTTAAAAAAGCTTCC
CCGACGTCATGGCCGGGATCCGGGCCCGGCATCCCGGCAAGGCCATCGAACTCTGGTGGGGCGACGAGGCGAGGGTCGGCCAGAAAACGAAGCTGACGCG
CCGCTGGGCCAGACGCGGCACCCGTCCACGCGCGCCTGCCGATCAGCGCACACGTTCGGCCTGGATCTTCGGAGCGATCTGTCCGGCGCTAGGCAAGGGA
GCGGCCCTCGTCCTGCCCTGGTGCAACCTCCACGCCATGAACCGGCATCTCGACGAGATCTCGCAGGCCGTAGCGCCGGGCGCTCACGCTATCCTCATCG
TCGATCAGGCAGCGTGGCATACCAGCCCGAAACTCGATATCCCCGCCAACATCACCATCCTGCCGCTCCCGCCACGCTCGCCCGAACTCAATCCGGTGGA
AAACGTCTGGCAGTTCATGCGCAATACCTGGCTGTCGAACCGGATCTTCCGCACCTACGACGACATCGTCGATATCTGCTGCCACGCCTGGAACCAGCTC
GTCGACCAGCCCTGGCGCATCATGTCCCTCGGGCTACGACAATGGGCACATGGGTTTTAACAAACAGGAGTTGGTA
GAATGGGCGGGGCGTTAGCGTTGCGTGAGGATTATGATGCGGCGGGACTGCGTGCTCTGGCGCGGACAACGAGGCATGCGGGCCAGGCGCGTCGGCTTCT
GGCGCTGGCGGCGATCTACGATGGTGCGTCACGCGGAGACGCGGCACGACTGGCTGGGACGGATCGGCAGATTGTGCGGGACTGGGTGGTGCGTTTCAAC
GCCGAGGGCCCGGATGGCGTGCGGGATCATCATGGGGGCGGTGTCGTTCCCCGCCTGACACCAGCCATGTTGGAAGCGCTGATGCGCCGGATCGAGGACG
GCCCGATCGCTGCCGTGCATGGGGTGGTGCGCTGGCGGCAGGCTGATCTGGGGCAATGGCTTTATGAGGAATTCGGCGTCTCTCTTTCGCGCAGCCGGCT
GAGCGCCGTTATCCGGGGCCTCGACTTCCGCCTTCTGACGGGACGCCCCCGGCACCATGCCCAGGATCCCGAGGCCCAGGACGTTTTTAAAAAAGCTTCC
CCGACGTCATGGCCGGGATCCGGGCCCGGCATCCCGGCAAGGCCATCGAACTCTGGTGGGGCGACGAGGCGAGGGTCGGCCAGAAAACGAAGCTGACGCG
CCGCTGGGCCAGACGCGGCACCCGTCCACGCGCGCCTGCCGATCAGCGCACACGTTCGGCCTGGATCTTCGGAGCGATCTGTCCGGCGCTAGGCAAGGGA
GCGGCCCTCGTCCTGCCCTGGTGCAACCTCCACGCCATGAACCGGCATCTCGACGAGATCTCGCAGGCCGTAGCGCCGGGCGCTCACGCTATCCTCATCG
TCGATCAGGCAGCGTGGCATACCAGCCCGAAACTCGATATCCCCGCCAACATCACCATCCTGCCGCTCCCGCCACGCTCGCCCGAACTCAATCCGGTGGA
AAACGTCTGGCAGTTCATGCGCAATACCTGGCTGTCGAACCGGATCTTCCGCACCTACGACGACATCGTCGATATCTGCTGCCACGCCTGGAACCAGCTC
GTCGACCAGCCCTGGCGCATCATGTCCCTCGGGCTACGACAATGGGCACATGGGTTTTAACAAACAGGAGTTGGTA
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
492 bp | 163 aa | 103 | 594 | + | No |
Description : First part of the transposase
ORF sequence :
MGGALALREDYDAAGLRALARTTRHAGQARRLLALAAIYDGASRGDAARLAGTDRQIVRDWVVRFNAEGPDGVRDHHGGGVVPRLTPAMLEALMRRIEDG
PIAAVHGVVRWRQADLGQWLYEEFGVSLSRSRLSAVIRGLDFRLLTGRPRHHAQDPEAQDVFK
PIAAVHGVVRWRQADLGQWLYEEFGVSLSRSRLSAVIRGLDFRLLTGRPRHHAQDPEAQDVFK
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
570 bp | 189 aa | 591 | 1160 | + | No |
Description : Second part of the transposase
ORF sequence :
KSFPDVMAGIRARHPGKAIELWWGDEARVGQKTKLTRRWARRGTRPRAPADQRTRSAWIFGAICPALGKGAALVLPWCNLHAMNRHLDEISQAVAPGAHA
ILIVDQAAWHTSPKLDIPANITILPLPPRSPELNPVENVWQFMRNTWLSNRIFRTYDDIVDICCHAWNQLVDQPWRIMSLGLRQWAHGF
ILIVDQAAWHTSPKLDIPANITILPLPPRSPELNPVENVWQFMRNTWLSNRIFRTYDDIVDICCHAWNQLVDQPWRIMSLGLRQWAHGF
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1058 bp | 0 aa | 103 | 1160 | + | Yes |
Chemistry : DDE
ORF sequence :
MGGALALREDYDAAGLRALARTTRHAGQARRLLALAAIYDGASRGDAARLAGTDRQIVRDWVVRFNAEGPDGVRDHHGGGVVPRLTPAMLEALMRRIEDG
PIAAVHGVVRWRQADLGQWLYEEFGVSLSRSRLSAVIRGLDFRLLTGRPRHHAQDPEAQDVFKKSFPDVMAGIRARHPGKAIELWWGDEARVGQKTKLTR
RWARRGTRPRAPADQRTRSAWIFGAICPALGKGAALVLPWCNLHAMNRHLDEISQAVAPGAHAILIVDQAAWHTSPKLDIPANITILPLPPRSPELNPVE
NVWQFMRNTWLSNRIFRTYDDIVDICCHAWNQLVDQPWRIMSLGLRQWAHGF
PIAAVHGVVRWRQADLGQWLYEEFGVSLSRSRLSAVIRGLDFRLLTGRPRHHAQDPEAQDVFKKSFPDVMAGIRARHPGKAIELWWGDEARVGQKTKLTR
RWARRGTRPRAPADQRTRSAWIFGAICPALGKGAALVLPWCNLHAMNRHLDEISQAVAPGAHAILIVDQAAWHTSPKLDIPANITILPLPPRSPELNPVE
NVWQFMRNTWLSNRIFRTYDDIVDICCHAWNQLVDQPWRIMSLGLRQWAHGF
Blast result :
Comments
ISGdi4 is 61% aa similar to ISAli3. The third ORF is a potential ORFAB transposase reconstructed in silico by possible -1 frameshift.
References
1] Miriam Land (2008) Direct submission