IS1111A
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
M80806 | ND | Coxiella burnetii | Coxiella burnetii NineMile7 |
DNA section
IS Length : 1374 bp
Ends
IR Length : 12
IRL : CAATGAAATGGACCCACCCCTTAAAGACGGCGTCATAATGCGCCAACATA
IRR : ----tatATGGACCCACCCTTGTTGTCAACAATCAGTTTTGATGAAAAGA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
CAATGAAATGGACCCACCCCTTAAAGACGGCGTCATAATGCGCCAACATAGAATTTCTATTTTCAAAAAAAGGAGAAGGTCCATGAAAGATATTAAAATA
CTGGGTGTTGATATTGCAAAAGATGTTTTTCAACTGTGTGGAATTGATGAGTGGGGTAAAGTGATCTACACGAGACGGGTTAAGCGTGCTCAGTATGTAT
CCACCGTAGCCAGTCTTAAGGTGGGCTGCGTGGTGATGGAAGCGTGTGGAGGAGCGAACCATTGGTATCGGACGTTTATGGGGATGGGTATCCCAACGCA
GTTGATCAGTCCGCAGCACGTCAAACCGTATGTCAAAAGTAACAAGAATGATCGTAACGATGCGCAGGCGATAGCTGAAGCGGCTTCCCGCGCCTCGATG
CGGTTTGTGCAGGGTAAAACGGTGGAACAACAAGACGTTCAAGCGCTGTTAAAGATACGCGATCGTTTAGTCAAAAGCCGCACGGCGCTGATCAATGAGA
TTCGGGGGTTGTTGCAAGAATACGGACTCACGATGGCGCGTGGTGCCAAGCGATTTTATGAAGAGCTCCCGTTGATTTTAGCGAGCGAAGCGGTGGGATT
AACACCGCGGATGAAACGGGTGTTGAATTGTTTGTATACCGAATTGTTGAACCGGGACGAAGCGATTGGTGATTACGAGGAGGAATTAAAAGCGGTGGCA
AAAGCCAATGAGGATTGTCAACGGGTACAGAGCATCCCGGGGGTGGGTTATTTAACGGCGCTCTCGGTTTATGCGAGCGTGGGTGACATTCATCAATTTC
ATCGTTCCCGGCAGTTGTCGGCGTTTATTGGGTTGGTCCCTCGACAACATTCGAGTGGGAATAAGGAGGTGTTGTTGGGGATTAGTAAACGCGGCAATGT
GATGTTAAGGACGTTATTGATTCATGGCGCCCGTGCGCTATTGCGTCATGTAAAAAATAAAACGGATAAAAAGAGTCTGTGGTTAAAAGCACTCATTGAG
CGCCGCGGAATGAATCGCGCTTGTGTGGCGTTAGCGAATAAAAATGCGCCGATCATTTGGGCGCTTTTAACACGCCAAGAAACGTATCGCTGTGGCGCCT
AAACACCGCCGTGGGTAAAAAAAAGAATTAACAAAAGGAGACACACCAACCGAGTTCGAAACAATGAGGGCTGATGAAAGTAAGGTAAAACCTGAGGTTG
ATTAAGCTGATTCATACGGTGGCTCTGTGAAGCCGATAGCCCGATAAGCATCAACCTTGCATAATTCATCAAGGCACCAATGGTGGCCAATTTAAATCGT
GATGCCGGATATACGAATGCAACCGCTTTCTTTTCATCAAAACTGATTGTTGACAACAAGGGTGGGTCCATATA
CTGGGTGTTGATATTGCAAAAGATGTTTTTCAACTGTGTGGAATTGATGAGTGGGGTAAAGTGATCTACACGAGACGGGTTAAGCGTGCTCAGTATGTAT
CCACCGTAGCCAGTCTTAAGGTGGGCTGCGTGGTGATGGAAGCGTGTGGAGGAGCGAACCATTGGTATCGGACGTTTATGGGGATGGGTATCCCAACGCA
GTTGATCAGTCCGCAGCACGTCAAACCGTATGTCAAAAGTAACAAGAATGATCGTAACGATGCGCAGGCGATAGCTGAAGCGGCTTCCCGCGCCTCGATG
CGGTTTGTGCAGGGTAAAACGGTGGAACAACAAGACGTTCAAGCGCTGTTAAAGATACGCGATCGTTTAGTCAAAAGCCGCACGGCGCTGATCAATGAGA
TTCGGGGGTTGTTGCAAGAATACGGACTCACGATGGCGCGTGGTGCCAAGCGATTTTATGAAGAGCTCCCGTTGATTTTAGCGAGCGAAGCGGTGGGATT
AACACCGCGGATGAAACGGGTGTTGAATTGTTTGTATACCGAATTGTTGAACCGGGACGAAGCGATTGGTGATTACGAGGAGGAATTAAAAGCGGTGGCA
AAAGCCAATGAGGATTGTCAACGGGTACAGAGCATCCCGGGGGTGGGTTATTTAACGGCGCTCTCGGTTTATGCGAGCGTGGGTGACATTCATCAATTTC
ATCGTTCCCGGCAGTTGTCGGCGTTTATTGGGTTGGTCCCTCGACAACATTCGAGTGGGAATAAGGAGGTGTTGTTGGGGATTAGTAAACGCGGCAATGT
GATGTTAAGGACGTTATTGATTCATGGCGCCCGTGCGCTATTGCGTCATGTAAAAAATAAAACGGATAAAAAGAGTCTGTGGTTAAAAGCACTCATTGAG
CGCCGCGGAATGAATCGCGCTTGTGTGGCGTTAGCGAATAAAAATGCGCCGATCATTTGGGCGCTTTTAACACGCCAAGAAACGTATCGCTGTGGCGCCT
AAACACCGCCGTGGGTAAAAAAAAGAATTAACAAAAGGAGACACACCAACCGAGTTCGAAACAATGAGGGCTGATGAAAGTAAGGTAAAACCTGAGGTTG
ATTAAGCTGATTCATACGGTGGCTCTGTGAAGCCGATAGCCCGATAAGCATCAACCTTGCATAATTCATCAAGGCACCAATGGTGGCCAATTTAAATCGT
GATGCCGGATATACGAATGCAACCGCTTTCTTTTCATCAAAACTGATTGTTGACAACAAGGGTGGGTCCATATA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1020 bp | 339 aa | 83 | 1102 | + | No |
Chemistry : DEDD
ORF sequence :
MKDIKILGVDIAKDVFQLCGIDEWGKVIYTRRVKRAQYVSTVASLKVGCVVMEACGGANHWYRTFMGMGIPTQLISPQHVKPYVKSNKNDRNDAQAIAEA
ASRASMRFVQGKTVEQQDVQALLKIRDRLVKSRTALINEIRGLLQEYGLTMARGAKRFYEELPLILASEAVGLTPRMKRVLNCLYTELLNRDEAIGDYEE
ELKAVAKANEDCQRVQSIPGVGYLTALSVYASVGDIHQFHRSRQLSAFIGLVPRQHSSGNKEVLLGISKRGNVMLRTLLIHGARALLRHVKNKTDKKSLW
LKALIERRGMNRACVALANKNAPIIWALLTRQETYRCGA
ASRASMRFVQGKTVEQQDVQALLKIRDRLVKSRTALINEIRGLLQEYGLTMARGAKRFYEELPLILASEAVGLTPRMKRVLNCLYTELLNRDEAIGDYEE
ELKAVAKANEDCQRVQSIPGVGYLTALSVYASVGDIHQFHRSRQLSAFIGLVPRQHSSGNKEVLLGISKRGNVMLRTLLIHGARALLRHVKNKTDKKSLW
LKALIERRGMNRACVALANKNAPIIWALLTRQETYRCGA
Blast result :
Comments
Possible problems concerning the limits of these IS1111 (Length, IR, and ORF). IS1111A differs from IS1111B and IS1111C by 25 and 16 bp, respectively (2 aa).
The IR of this IS are not at its termini. In the IS sequence as given 7 nt separate IRl from the left-hand end of the element and 3 nt separate IRr from the right-hand end. The first residue of the sequence may in fact belong as the final residue, giving 6 nt on the left and 4 on the right.
There are 20 copies of IS1111 in the RSA 493 genome.
IS1111 seems to target stem-loop inverted repeat structures.
By analogy with IS4321, IS1111 may exist in a circular form in which a -10 region created by the abutted terminal sequences and a -35 region located just inside the right-hand end of the element are correctly spaced to form a promoter.
The IR of this IS are not at its termini. In the IS sequence as given 7 nt separate IRl from the left-hand end of the element and 3 nt separate IRr from the right-hand end. The first residue of the sequence may in fact belong as the final residue, giving 6 nt on the left and 4 on the right.
There are 20 copies of IS1111 in the RSA 493 genome.
IS1111 seems to target stem-loop inverted repeat structures.
By analogy with IS4321, IS1111 may exist in a circular form in which a -10 region created by the abutted terminal sequences and a -35 region located just inside the right-hand end of the element are correctly spaced to form a promoter.
References
1] Hoover et al. (1992) J. Bacteriol. 174, 5540-5548
2] Seshadri et al. PNAS 100, 5455-5460
3] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384
2] Seshadri et al. PNAS 100, 5455-5460
3] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384