ISPa53
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
FJ917747 | ND | Pseudomonas aeruginosa | Pseudomonas aeruginosa PA601 |
DNA section
IS Length : 1381 bp
Ends
IR Length : 12/13
IRL : tgctgtaATGGACTTTCCCCGCGCCACAGTTCTATACCGAGAACGGTGGT
IRR : ---atatATGGACTCTCCCCACAAGTAGCGGGCAAAGCTTTTCTTTTGCA
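The 12/13 figure can be reproduced from the two aligned lines above. The Python sketch below assumes that the first uppercase base marks the start of the IR, that lowercase bases are flanking nt outside it, and that `-` is alignment padding. These display conventions and the helper names `ir_start` and `count_matches` are illustrative assumptions, not part of the record.

```python
# Aligned terminal sequences as displayed in the record (IRR is shown already
# reverse-complemented so that it reads in the same orientation as IRL).
IRL_LINE = "tgctgtaATGGACTTTCCCCGCGCCACAGTTCTATACCGAGAACGGTGGT"
IRR_LINE = "---atatATGGACTCTCCCCACAAGTAGCGGGCAAAGCTTTTCTTTTGCA"
IR_SPAN = 13  # length of the IR taken from "IR Length : 12/13"


def ir_start(line: str) -> int:
    """Return the index of the first uppercase base (assumed start of the IR)."""
    return next(i for i, ch in enumerate(line) if ch.isupper())


def count_matches(irl_line: str, irr_line: str, span: int):
    """Compare the two IRs column by column over `span` aligned positions."""
    start = ir_start(irl_line)
    left = irl_line[start:start + span]
    right = irr_line[start:start + span]
    matches = sum(a == b for a, b in zip(left, right))
    return matches, left, right


matches, left_ir, right_ir = count_matches(IRL_LINE, IRR_LINE, IR_SPAN)
print(left_ir)
print(right_ir)
print(f"{matches}/{IR_SPAN} positions match")
```

Under these assumptions the single mismatch falls at the 8th position of the IR (T in IRL, C in IRR).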
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCGGCTTGAA | | GAGTTGTTAG | 0 |
DNA sequence
TGCTGTAATGGACTTTCCCCGCGCCACAGTTCTATACCGAGAACGGTGGTGACTGAAATTGGGAGTGTCGTCATGAAGCGCATCGCAGTTGATCTGGCCA
AGTCCGTTTACCAAGTTGCCGAGAGTGTCCGTTCCGGCCTGGTGGTGCAGCGCAAGCGGCTGAACCGAGAGGCGTTCCGGCGATATATACAGGAGCAGAC
CGAGTCGGTCGAGTGGGTGATGGAAGCCTGTGGCACAGCGCACTATTGGGGGCGACTGGCGCAATCGCTCGGCCATCGGGTGAAGTTGCTGCATCCACGT
TACGTGCGGCCCTACCGCCGTCGCAACAAAACCGACCGCAATGACTGCGACGCCATGCTGGAAGCGGCACGCTGCACCGATATTCATCCCGTGCCGGTCA
AGAGCCACGGTCAGCAACAGCTTCAGCAATTGCACGGGCTACGTGAAACCTGGAAGAAGAGCCGCACCCAGCGAATCAATCTGCTGCGCGGCATGCTGCG
TGAAGCGGGTATCGAAGCGCCGGCTTCGACAGCCGACTTCATCCGGGCTGCCAGTGAGCCGGTTGATCTGCCTGAACTCGCGCCCCTGAGTTGTCTGCTG
CATATCGTTCTGGCCGAGATCAACCTGTATGAACAGTGCATGGCCGAATGCGAACAGCAGCTTCAGCGCTGGCATGCTGACGATGAAATAGTGCGCAAAC
TGGATGAGGTCAGTGGTATTGGTCTGTTGACCGCCAGTGCCCTGACCGCCGCTGTCGGTAAGCCCGAACGGTTCGCCAGTGGTCGCCAGCTCAGCGCTTG
GCTGGGCATGACGCCACGTGAGTTCAGCAGTGGCAATAGCCGCAAGCTCGGCCATATCAGCCGACAGGGCAATGTCTATGTGCGCACTTTGTTTATCCAT
GGATCCCGCGCGGCCTTGCTGGCGGCACAACGTTGCCAGGCTCGTACACCGGAAAAGCTGACCCAACTGCAGCGCTGGGCTGTGCAGACGGCAGCCCGTA
TCGGTCACAACAAGGCGGCGGTGGCGCTGGCCAACAAACTGGTGCGGATCTGCTGGGCGGTGTGGTGCCACGAGCGGCGGTTCAATGGCAATTGGCAGAG
CACAAAGCCCGCCTGACGGCGGGTGTAATCAGTAAGGCGTGGTGGTTTTCTTGTGGTTGTGGTGAGTAGCAGCACTGTCGAAACAGGCGAGTCCTGCAGC
GGAACAAGGCCGATAACAACGATGATCCCGGGGATCGATTGAACGCTTGGCCCCCGCTGCGCGAACTACAGAATGGCCAGGGCGGAAAGCCCAGAACAGG
CCGGATATACGAATGCAACCGCGCTGACGACAAGTGCAAAAGAAAAGCTTTGCCCGCTACTTGTGGGGAGAGTCCATATAT
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1044 bp | 347 aa | 73 | 1116 | + | No |
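The values in this table are mutually consistent, which a few lines of Python can confirm. The sketch assumes 1-based inclusive coordinates and that the stop codon is counted inside the Begin-End interval; the function name `orf_lengths` is hypothetical. The protein string is the ORF sequence given in this record.

```python
ORF_BEGIN, ORF_END = 73, 1116  # coordinates from the table above

# ORF 1 translation as listed under "ORF sequence" in this record.
ORF_PROTEIN = (
    "MKRIAVDLAKSVYQVAESVRSGLVVQRKRLNREAFRRYIQEQTESVEWVMEACGTAHYWGRLAQSLGHRVKLLHPRYVRPYRRRNKTDRNDCDAMLEAAR"
    "CTDIHPVPVKSHGQQQLQQLHGLRETWKKSRTQRINLLRGMLREAGIEAPASTADFIRAASEPVDLPELAPLSCLLHIVLAEINLYEQCMAECEQQLQRW"
    "HADDEIVRKLDEVSGIGLLTASALTAAVGKPERFASGRQLSAWLGMTPREFSSGNSRKLGHISRQGNVYVRTLFIHGSRAALLAAQRCQARTPEKLTQLQ"
    "RWAVQTAARIGHNKAAVALANKLVRICWAVWCHERRFNGNWQSTKPA"
)


def orf_lengths(begin: int, end: int):
    """Return (nucleotide length, amino acid length) for an inclusive interval.

    Assumes the interval includes the stop codon, so one codon is subtracted.
    """
    nt_len = end - begin + 1
    if nt_len % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return nt_len, nt_len // 3 - 1


nt_len, aa_len = orf_lengths(ORF_BEGIN, ORF_END)
print(f"{nt_len} bp, {aa_len} aa, listed protein has {len(ORF_PROTEIN)} residues")
```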
Chemistry : DEDD
ORF sequence :
MKRIAVDLAKSVYQVAESVRSGLVVQRKRLNREAFRRYIQEQTESVEWVMEACGTAHYWGRLAQSLGHRVKLLHPRYVRPYRRRNKTDRNDCDAMLEAAR
CTDIHPVPVKSHGQQQLQQLHGLRETWKKSRTQRINLLRGMLREAGIEAPASTADFIRAASEPVDLPELAPLSCLLHIVLAEINLYEQCMAECEQQLQRW
HADDEIVRKLDEVSGIGLLTASALTAAVGKPERFASGRQLSAWLGMTPREFSSGNSRKLGHISRQGNVYVRTLFIHGSRAALLAAQRCQARTPEKLTQLQ
RWAVQTAARIGHNKAAVALANKLVRICWAVWCHERRFNGNWQSTKPA
Blast result :
Comments
ISPa53 is 86% aa similar to ISKpn4.
The IRs of this IS (8-ATGGACTTTCCCC-20 and 1365-GGGGAGAGTCCAT-1377) are not at its termini.
As in most IS1111 group elements, the first nt of the sequence shown (T) matches the 4th nt to the right of IRR (same base).
ISPa53 is embedded within the L' motif of the attC recombination site of the aacA3 gene cassette, itself part of the In996 class 1 integron, thereby preventing IntI-mediated excision of this gene cassette.
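The positional statements about the IRs can be checked against the record's own coordinates. The Python sketch below uses only the terminal fragments of the DNA sequence (nt 1-20 and nt 1365-1381 of the 1381 bp element); the variable and function names are illustrative assumptions.

```python
IS_LENGTH = 1381                   # from "IS Length : 1381 bp"
LEFT_END = "TGCTGTAATGGACTTTCCCC"  # nt 1-20 of the DNA sequence
RIGHT_END = "GGGGAGAGTCCATATAT"    # nt 1365-1381 of the DNA sequence
IRL_COORDS = (8, 20)               # 1-based inclusive, from the comment
IRR_COORDS = (1365, 1377)


def terminal_offsets(irl, irr, length):
    """Number of nt lying outside each IR, between the IR and the IS terminus."""
    return irl[0] - 1, length - irr[1]


left_gap, right_gap = terminal_offsets(IRL_COORDS, IRR_COORDS, IS_LENGTH)

# First nt of the element versus the 4th nt to the right of IRR.
irr_span = IRR_COORDS[1] - IRR_COORDS[0] + 1
first_nt = LEFT_END[0]
fourth_after_irr = RIGHT_END[irr_span + 3]

print(f"nt outside the IRs: {left_gap} on the left, {right_gap} on the right")
print(f"first nt = {first_nt}, 4th nt right of IRR = {fourth_after_irr}")
```

Both IRs are therefore subterminal (7 nt and 4 nt from the respective ends), and the two compared bases are both T.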
References