ISPrre11
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
MT219827.1 | ND | Providencia rettgeri | Providencia rettgeri RF14-2 |
DNA section
IS Length : 2457 bp
Ends
IR Length : 19/20
IRL : GTAAGCGTCCGGCGAACACATCTTTTATTTGCTAACTGATCACGCATCAT
IRR : GTAAGCGTCCGGCGATCACAGGTAGGTGTTCTAATTTAAGTGTCCACTTA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GTTAAAAGAAGC | TCATGTTG | CACTTGGTGGTG | 8 |
DNA sequence
GTAAGCGTCCGGCGAACACATCTTTTATTTGCTAACTGATCACGCATCATAGTGCTCACTGTTGCCAACTGTGAGACTTATGAATGCCAGCTTTAAAACA
CCCCGATGTGCGCCGGGTCTTTTCCCCTGAGTTTAAATGGCAGATTGTCCGAGAATCCAAAGAGCGTATACTGTCCGTTTCTGAATTAGCCCGCAAATAC
GATGTGAACACCAACCAGGTATTTCGCTGGATGCGCGAAGCTGAGGCAGGTCAGGCTTTGTGGGTCAGACGCGCTAAAGGTGAGCCGGATACGCCTGTAG
CTGCTTCAGCTTTTCTACCTGTCAGCGTCCGCCCGGCTAATGCAGCAGCCCCCGTTTTGCCTAAACCTGCCATCACGGTGTCATTTCGAAGTGGCCATCA
GTTGATGTTACACGAGGCAACACCGGCCATGTTAGCGCAACTGGTGGCAGCGTTATCATGATGATGTTATCGCATGATGTGCGTATCTGGCTGTGTGCCG
GTTACACTGATATGCGCAAAGGGTTTGATGGCCTGGCTGCCATGGCGCAGCACGTCCTGCAAAAAGACCCTTTCAGTGGTCATGTGTTTGTCTTTCGCGG
ACGGCGCAGTGACCGGGTGAAATTACTCTGGTGGGATGGTCAGGGCTTGTGTTTATTCTACAAGCGTCTTGAGCAGGGACAGTTCGTCTGGCCTGTCGCA
AAATCCGGTGCTGTGCATTTAACCCAAGCGGAGTTAGCGCTACTGCTTGAAGGCTTAGACTGGCGTCCGCCCGCCAGAAAAAATACGCCAAAAAACACAG
CTTAAATAAAATTAATCAATGAGTTAGCCATCGCTTTGCGCTATAATACGCCGCATGAAAATGATGGCACCTACCTTACCCAATGACCCGAAACAGTTGC
AGGCGATTATCCTGCAGCTGCAGGAAACGCTTGCCCAGCAAGACACGGTTATTGCGTCATTACGCCATCAATTAGCTGTATTAAAGCGCGCCCGTTTTGG
TCGCTCCTCGGAGCATCTGGATAAGCAAATCCATCAACTGGAATTGCAGTTAGAAGAGCTGGAGATGCAGACGGCCGCCTTACCGAAACTTACCGATGGC
GCTGTCGAAGAGAAAGCCACTGCGCGTCGTCGTGTGACCTTACCGGCACAGTTACCACGTGAAGAAAAGCGTGTTGATGCACCTTGCCAGTGTCCGGCAT
GTCAGGGCGAATTAACCCATATCGGTGAGGATGTCTCTGAAATGTTGGATGTCGTACCCGCATCTTATCGGGTTATCCGCATTGTACGCCCTAAGTTCAG
TTGCCAGCACTGCGACACGCTGGTACAAGGTCAGGCTCCGGAGCGGGTGATTAAAAAAGGCCTGGCCAGCGCAGCATTGCTGACACAAGTGATTGTCGAT
AAATACCTCGACCATCAACCACTGTACCGTCAGGCTGAGCGGATGGCGCGGGAAGGCATTGATATCGAGCGCTCGACGCTGGCAGACTGGGTTGGTCAAT
CCGGTGCATTATTAACACCCCTTGCTGAGGCGATTGGACGCCATGTCAAAGCCGGTCAACATATCTTTGCGGATGACACGACCGCGCCAACCTTATCACC
GGGAAAAGGCCGCACACAAACCGGCCGCTACTGGACGTATGTGCGTGATGGTCGTTCCTGGGGCGATACCACACCGCCTGCCGTGTGGCTGCAATACAGC
GAAGATAGAAAAAGTCTGCACCCCACTGCCCATTTAACCGGGTATAAAGGTTCACTGCAGGCTGATGCTTATGCCGGTTACAATGCCTTATACCAAAACC
ACATCACCCAGCATGGTTGTTGGGCCCATGTGCGGCGTAAGTTCTATGACATTACCCAATCCGGCCCGTCCCCCTTGGCAGAAGAGGCCCTGACCATAAT
CCAATCACTATACGTGATTGAGAAACAGGCACGAGGCCAACCACCGGATGAACGACAGCACCTGCGAAGTCTGCAGGCAAAACCTATCCTGGTACAGTTT
AACAACTGGTTACAGACAACCCTGCGCACCTTATCAAAAGGCTCTGCGTTGTCTAAAGCTATCCAGTATGCACTAAAGCAATGGGACGCGTTGGTGGCGT
ACGTGGATAATGGTTATGCTGAGATAGATAACAACAGTGCGGAGCGGTCATTACGGCCCATCGCCCTGGGTCGGAAAAACTACCTGTTTGCCGGCTCTGT
TGGCGGAGGGGAACGTGCCGCTGTGCTGTATTCGATACTTGGCACCGCCAAGCTCAATGGCATCAATCCCAATGCGTACTTAACTGCCGTGTTAAAACGT
ATCGGTAATCATCCGATCAACCGTATTGATGAACTGTTACCCTGGAACATCGACTTATCTGCCCAGCCAGGTGATGCTATCTAAGACACTAAGTGTCCAC
TAAAAATTAAGTGGACACTTAAATTAGAACACCTACCTGTGATCGCCGGACGCTTAC
CCCCGATGTGCGCCGGGTCTTTTCCCCTGAGTTTAAATGGCAGATTGTCCGAGAATCCAAAGAGCGTATACTGTCCGTTTCTGAATTAGCCCGCAAATAC
GATGTGAACACCAACCAGGTATTTCGCTGGATGCGCGAAGCTGAGGCAGGTCAGGCTTTGTGGGTCAGACGCGCTAAAGGTGAGCCGGATACGCCTGTAG
CTGCTTCAGCTTTTCTACCTGTCAGCGTCCGCCCGGCTAATGCAGCAGCCCCCGTTTTGCCTAAACCTGCCATCACGGTGTCATTTCGAAGTGGCCATCA
GTTGATGTTACACGAGGCAACACCGGCCATGTTAGCGCAACTGGTGGCAGCGTTATCATGATGATGTTATCGCATGATGTGCGTATCTGGCTGTGTGCCG
GTTACACTGATATGCGCAAAGGGTTTGATGGCCTGGCTGCCATGGCGCAGCACGTCCTGCAAAAAGACCCTTTCAGTGGTCATGTGTTTGTCTTTCGCGG
ACGGCGCAGTGACCGGGTGAAATTACTCTGGTGGGATGGTCAGGGCTTGTGTTTATTCTACAAGCGTCTTGAGCAGGGACAGTTCGTCTGGCCTGTCGCA
AAATCCGGTGCTGTGCATTTAACCCAAGCGGAGTTAGCGCTACTGCTTGAAGGCTTAGACTGGCGTCCGCCCGCCAGAAAAAATACGCCAAAAAACACAG
CTTAAATAAAATTAATCAATGAGTTAGCCATCGCTTTGCGCTATAATACGCCGCATGAAAATGATGGCACCTACCTTACCCAATGACCCGAAACAGTTGC
AGGCGATTATCCTGCAGCTGCAGGAAACGCTTGCCCAGCAAGACACGGTTATTGCGTCATTACGCCATCAATTAGCTGTATTAAAGCGCGCCCGTTTTGG
TCGCTCCTCGGAGCATCTGGATAAGCAAATCCATCAACTGGAATTGCAGTTAGAAGAGCTGGAGATGCAGACGGCCGCCTTACCGAAACTTACCGATGGC
GCTGTCGAAGAGAAAGCCACTGCGCGTCGTCGTGTGACCTTACCGGCACAGTTACCACGTGAAGAAAAGCGTGTTGATGCACCTTGCCAGTGTCCGGCAT
GTCAGGGCGAATTAACCCATATCGGTGAGGATGTCTCTGAAATGTTGGATGTCGTACCCGCATCTTATCGGGTTATCCGCATTGTACGCCCTAAGTTCAG
TTGCCAGCACTGCGACACGCTGGTACAAGGTCAGGCTCCGGAGCGGGTGATTAAAAAAGGCCTGGCCAGCGCAGCATTGCTGACACAAGTGATTGTCGAT
AAATACCTCGACCATCAACCACTGTACCGTCAGGCTGAGCGGATGGCGCGGGAAGGCATTGATATCGAGCGCTCGACGCTGGCAGACTGGGTTGGTCAAT
CCGGTGCATTATTAACACCCCTTGCTGAGGCGATTGGACGCCATGTCAAAGCCGGTCAACATATCTTTGCGGATGACACGACCGCGCCAACCTTATCACC
GGGAAAAGGCCGCACACAAACCGGCCGCTACTGGACGTATGTGCGTGATGGTCGTTCCTGGGGCGATACCACACCGCCTGCCGTGTGGCTGCAATACAGC
GAAGATAGAAAAAGTCTGCACCCCACTGCCCATTTAACCGGGTATAAAGGTTCACTGCAGGCTGATGCTTATGCCGGTTACAATGCCTTATACCAAAACC
ACATCACCCAGCATGGTTGTTGGGCCCATGTGCGGCGTAAGTTCTATGACATTACCCAATCCGGCCCGTCCCCCTTGGCAGAAGAGGCCCTGACCATAAT
CCAATCACTATACGTGATTGAGAAACAGGCACGAGGCCAACCACCGGATGAACGACAGCACCTGCGAAGTCTGCAGGCAAAACCTATCCTGGTACAGTTT
AACAACTGGTTACAGACAACCCTGCGCACCTTATCAAAAGGCTCTGCGTTGTCTAAAGCTATCCAGTATGCACTAAAGCAATGGGACGCGTTGGTGGCGT
ACGTGGATAATGGTTATGCTGAGATAGATAACAACAGTGCGGAGCGGTCATTACGGCCCATCGCCCTGGGTCGGAAAAACTACCTGTTTGCCGGCTCTGT
TGGCGGAGGGGAACGTGCCGCTGTGCTGTATTCGATACTTGGCACCGCCAAGCTCAATGGCATCAATCCCAATGCGTACTTAACTGCCGTGTTAAAACGT
ATCGGTAATCATCCGATCAACCGTATTGATGAACTGTTACCCTGGAACATCGACTTATCTGCCCAGCCAGGTGATGCTATCTAAGACACTAAGTGTCCAC
TAAAAATTAAGTGGACACTTAAATTAGAACACCTACCTGTGATCGCCGGACGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
378 bp | 125 aa | 84 | 461 | + | No |
AG : IS66 TnpA
ORF sequence :
MPALKHPDVRRVFSPEFKWQIVRESKERILSVSELARKYDVNTNQVFRWMREAEAGQALWVRRAKGEPDTPVAASAFLPVSVRPANAAAPVLPKPAITVS
FRSGHQLMLHEATPAMLAQLVAALS
FRSGHQLMLHEATPAMLAQLVAALS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
348 bp | 115 aa | 458 | 805 | + | No |
AG : IS66 TnpB
ORF sequence :
MMMLSHDVRIWLCAGYTDMRKGFDGLAAMAQHVLQKDPFSGHVFVFRGRRSDRVKLLWWDGQGLCLFYKRLEQGQFVWPVAKSGAVHLTQAELALLLEGL
DWRPPARKNTPKNTA
DWRPPARKNTPKNTA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1521 bp | 509 aa | 864 | 2384 | + | No |
Chemistry : DDE
ORF sequence :
MAPTLPNDPKQLQAIILQLQETLAQQDTVIASLRHQLAVLKRARFGRSSEHLDKQIHQLELQLEELEMQTAALPKLTDGAVEEKATARRRVTLPAQLPRE
EKRVDAPCQCPACQGELTHIGEDVSEMLDVVPASYRVIRIVRPKFSCQHCDTLVQGQAPERVIKKGLASAALLTQVIVDKYLDHQPLYRQAERMAREGID
IERSTLADWVGQSGALLTPLAEAIGRHVKAGQHIFADDTTAPTLSPGKGRTQTGRYWTYVRDGRSWGDTTPPAVWLQYSEDRKSLHPTAHLTGYKGSLQA
DAYAGYNALYQNHITQHGCWAHVRRKFYDITQSGPSPLAEEALTIIQSLYVIEKQARGQPPDERQHLRSLQAKPILVQFNNWLQTTLRTLSKGSALSKAI
QYALKQWDALVAYVDNGYAEIDNNSAERSLRPIALGRKNYLFAGSVGGGERAAVLYSILGTAKLNGINPNAYLTAVLKRIGNHPINRIDELLPWNIDLSA
QPGDAI
EKRVDAPCQCPACQGELTHIGEDVSEMLDVVPASYRVIRIVRPKFSCQHCDTLVQGQAPERVIKKGLASAALLTQVIVDKYLDHQPLYRQAERMAREGID
IERSTLADWVGQSGALLTPLAEAIGRHVKAGQHIFADDTTAPTLSPGKGRTQTGRYWTYVRDGRSWGDTTPPAVWLQYSEDRKSLHPTAHLTGYKGSLQA
DAYAGYNALYQNHITQHGCWAHVRRKFYDITQSGPSPLAEEALTIIQSLYVIEKQARGQPPDERQHLRSLQAKPILVQFNNWLQTTLRTLSKGSALSKAI
QYALKQWDALVAYVDNGYAEIDNNSAERSLRPIALGRKNYLFAGSVGGGERAAVLYSILGTAKLNGINPNAYLTAVLKRIGNHPINRIDELLPWNIDLSA
QPGDAI
Blast result :
Comments
ISPrre11 is 79% aa similar (transposase) to ISSpu24.
References
1] Dongsheng Zhou (2020) Direct submission.