ISPlu20
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
BX470251 | ND | Photorhabdus luminescens | Photorhabdus luminescens subsp. laumondii TTO1 |
DNA section
IS Length : 2339 bp
Ends
IR Length : 20/25
IRL : GTAAACGTCCGGTCATCTCACCTTGTTCTTCCTCTCCGGCCCATGCACGC
IRR : GTAATGTTCCGGTGAACTCACCTTGCGTGGTCAGATAACCACAAAATGGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATAGCGTGCATA | GTTCATCT | AGGCATGTTTTC | 8 |
AGGGATATTCTG | TTTCGTTC | TCATTTGATGCG | 8 |
ACCAACCGTGAC | AGTTGATA | GCCCGAAAGGAT | 8 |
TGAAAACAGGTC | AGTACTGC | CCGATCGGCGGC | 8 |
GATGTTTTTATT | GATTAAAG | CATCAAAGTGAG | 8 |
DNA sequence
GTAAACGTCCGGTCATCTCACCTTGTTCTTCCTCTCCGGCCCATGCACGCTACCGCTTCCTTATCTTCACCTGGAGACTGACACCATGAGCCGTAGCCGA
TATACCCCGGAACAAAAACAACACCATGTGACCCAATGGCGCCACAGTGACCTGACCCGAAAACAGTATTGCGAACAGCATCAGCTGAGTTTTTCCGCTT
TCCGCGACTGGATTGCCGACAGTAACAACATCCCCCAACCTCTCAGCCAGACACTGCCCGCCCTCTTACCGGTTTCACTCCAGCCTGACGACGCGCGCAC
CGTCACCCTGCATACCCCAGACGGCTATGCCATCGCCTGTCCATTGACGCTGTTGCCTGACGTGATGCGGGTACTGACCCGATGCTGAAACCGCAACAGC
TCTTTCTGGTGCGAGAGCCCGTCGACATGCGCCGGGGCATTGATGCCCTGACCCAGCACCTGGAGGGGCTGAATTTGCGCTGGCAGGACGAAGCCGCTTT
TATTTTCTGCAACAAGGCCCGCTCCCGTCTCAAGGTGTTGCGCTGGGACCGACACGGCGTCTGGCTGTGTACCCGCCGTCTGCACCGGGCCCATTTTGTC
TGGCCGAAGCAGGGCGAGCGCAGCTGGGTGATGACGCCCGCCCAGTTTGACTGGCTCATCCGCGGCATTCACTGGCAACAGGTCGAGGGAGATGACCTGT
CCGGCTGGCGAAACTAATTCCGCTACCCCGGTCACACTTTTACCGGATAACGACGGGGAATTATCCGGTACACTGTCGGCATGACGCTCAATGACTTAAT
CGCGCTGGATGACCCCGGCCAGTTCCGCCTGCGGGCACTGGCGCTGCTTCAGCAACAAGGTGATTATATTCGCCAGCTGGAAGAGGCCCTGAAACAGGCG
CAACGCTGGCGTTTTGGCGCGCACAGCGAAACCCTGCCTGCCGGTCCCAAACGCAGTCAGTTTGAGGAAGATGCCGATACCGATATCGCGGTGCTGGAAA
CCCGGCTGTCACGCCTGCATATCAAGGAAAACGCGACCCCGGCCCAGCCTAAACGTCAGCCTTTGCCGGCGACTTTACCCCGTGAAGATATCCGGCTGGC
GCCGGAGACGGAGAGCTGCCCGGATTGCGGTCATGCCCTGCGCTTTTTGCGTGATGAAATCAGTGAGCGACTGGAATACCGTCCGGCCACGTTTATCGTC
CGGCGCTATATCAGCCCGCAGTACAGCTGCGCGCGCTGTCAGTGTGTGCATGCTAAAGCCCAGCCCGCCCATCTTATTGAAAAGGGCCTTCCAGAGCCGG
GGTTGCTGGCTCAGGTGGTGGTCGCTAAATATCGTGACCACCTGCCGCTCTACCGTCAGCAGCAAATCTATGCCCGCAGCGGTGTCACCCTGGCCCGCAG
CACCCTGTCGGACTGGGTGGGGCAGGTGGCGGTGGCGTTGCAACCGTTAGCCGATGCCCTGAAGCAGACGCTGCTGACGTCCCCGGTGCTGCATGCCGAT
GAGACCCCGTTGCCGATACTGGCCCCGGGAAAAGGCCAGACCCAGCGAGCTTACCTGTGGACGTCTGTCACCGGCCCCGATACCTCGCCGGCAGTGGTAT
ACTATGAGCTGCATCCCGGCCGCAGCGGCCGTTATGCCCAGAGCCTGCTGAAAGACTGGTCGGGCGGCACGTTGGTGACGGATGACTATGCCGGTTACAA
CGCACTGCATGCCCGGGCGGATATCACCGAAGCGGGTTGCTGGGCTCACGCCCGCCGCAAATTCTTTGACCAATACAAAGCGAGCCAAAGTCCGGTGGCG
AAACAGGCGCTGGATGGCATTCGTGACCTGTACAAGCTGGAGCGAAAAATCAAACACCGGCCGCCGGATAAACGGCGTCAGTGGCGGCAGCGTTATGCCC
GGCCGTGGCTGAATGAGTTCCGGTCCTGGTTGCAAACGATGCAGACCCAAACGGCCCCCAACTCGGGGTTACGCAAAGCGATAGATTACACGCTGAAGCG
CTGGTCGGCGCTGGTGTGTTATCTGGATGACGGGGGGGTGCCGATAGACAACAACCGGGCGGAGAATGCGGTCCGGGGTGTGGCGCTCGGCCGAAAGAAC
TGGCTTTTTGCGGGTTCACTGGCCGCGGGGCAACGGGCGGCCATGATAATGAGCCTGCTGGAAACGGCCAAAGCCAACGGACATGAGCCCTGGGTCTGGT
TACGTGATGTCCTCAGCCGGCTGCCTGTCTGGCCGAACAATCGGTTGAATGAGCTGTTGCCCTGGCCTGAGAATCCCTTCCGTTAACATCCCATTTTGTG
GTTATCTGACCACGCAAGGTGAGTTCACCGGAACATTAC
TATACCCCGGAACAAAAACAACACCATGTGACCCAATGGCGCCACAGTGACCTGACCCGAAAACAGTATTGCGAACAGCATCAGCTGAGTTTTTCCGCTT
TCCGCGACTGGATTGCCGACAGTAACAACATCCCCCAACCTCTCAGCCAGACACTGCCCGCCCTCTTACCGGTTTCACTCCAGCCTGACGACGCGCGCAC
CGTCACCCTGCATACCCCAGACGGCTATGCCATCGCCTGTCCATTGACGCTGTTGCCTGACGTGATGCGGGTACTGACCCGATGCTGAAACCGCAACAGC
TCTTTCTGGTGCGAGAGCCCGTCGACATGCGCCGGGGCATTGATGCCCTGACCCAGCACCTGGAGGGGCTGAATTTGCGCTGGCAGGACGAAGCCGCTTT
TATTTTCTGCAACAAGGCCCGCTCCCGTCTCAAGGTGTTGCGCTGGGACCGACACGGCGTCTGGCTGTGTACCCGCCGTCTGCACCGGGCCCATTTTGTC
TGGCCGAAGCAGGGCGAGCGCAGCTGGGTGATGACGCCCGCCCAGTTTGACTGGCTCATCCGCGGCATTCACTGGCAACAGGTCGAGGGAGATGACCTGT
CCGGCTGGCGAAACTAATTCCGCTACCCCGGTCACACTTTTACCGGATAACGACGGGGAATTATCCGGTACACTGTCGGCATGACGCTCAATGACTTAAT
CGCGCTGGATGACCCCGGCCAGTTCCGCCTGCGGGCACTGGCGCTGCTTCAGCAACAAGGTGATTATATTCGCCAGCTGGAAGAGGCCCTGAAACAGGCG
CAACGCTGGCGTTTTGGCGCGCACAGCGAAACCCTGCCTGCCGGTCCCAAACGCAGTCAGTTTGAGGAAGATGCCGATACCGATATCGCGGTGCTGGAAA
CCCGGCTGTCACGCCTGCATATCAAGGAAAACGCGACCCCGGCCCAGCCTAAACGTCAGCCTTTGCCGGCGACTTTACCCCGTGAAGATATCCGGCTGGC
GCCGGAGACGGAGAGCTGCCCGGATTGCGGTCATGCCCTGCGCTTTTTGCGTGATGAAATCAGTGAGCGACTGGAATACCGTCCGGCCACGTTTATCGTC
CGGCGCTATATCAGCCCGCAGTACAGCTGCGCGCGCTGTCAGTGTGTGCATGCTAAAGCCCAGCCCGCCCATCTTATTGAAAAGGGCCTTCCAGAGCCGG
GGTTGCTGGCTCAGGTGGTGGTCGCTAAATATCGTGACCACCTGCCGCTCTACCGTCAGCAGCAAATCTATGCCCGCAGCGGTGTCACCCTGGCCCGCAG
CACCCTGTCGGACTGGGTGGGGCAGGTGGCGGTGGCGTTGCAACCGTTAGCCGATGCCCTGAAGCAGACGCTGCTGACGTCCCCGGTGCTGCATGCCGAT
GAGACCCCGTTGCCGATACTGGCCCCGGGAAAAGGCCAGACCCAGCGAGCTTACCTGTGGACGTCTGTCACCGGCCCCGATACCTCGCCGGCAGTGGTAT
ACTATGAGCTGCATCCCGGCCGCAGCGGCCGTTATGCCCAGAGCCTGCTGAAAGACTGGTCGGGCGGCACGTTGGTGACGGATGACTATGCCGGTTACAA
CGCACTGCATGCCCGGGCGGATATCACCGAAGCGGGTTGCTGGGCTCACGCCCGCCGCAAATTCTTTGACCAATACAAAGCGAGCCAAAGTCCGGTGGCG
AAACAGGCGCTGGATGGCATTCGTGACCTGTACAAGCTGGAGCGAAAAATCAAACACCGGCCGCCGGATAAACGGCGTCAGTGGCGGCAGCGTTATGCCC
GGCCGTGGCTGAATGAGTTCCGGTCCTGGTTGCAAACGATGCAGACCCAAACGGCCCCCAACTCGGGGTTACGCAAAGCGATAGATTACACGCTGAAGCG
CTGGTCGGCGCTGGTGTGTTATCTGGATGACGGGGGGGTGCCGATAGACAACAACCGGGCGGAGAATGCGGTCCGGGGTGTGGCGCTCGGCCGAAAGAAC
TGGCTTTTTGCGGGTTCACTGGCCGCGGGGCAACGGGCGGCCATGATAATGAGCCTGCTGGAAACGGCCAAAGCCAACGGACATGAGCCCTGGGTCTGGT
TACGTGATGTCCTCAGCCGGCTGCCTGTCTGGCCGAACAATCGGTTGAATGAGCTGTTGCCCTGGCCTGAGAATCCCTTCCGTTAACATCCCATTTTGTG
GTTATCTGACCACGCAAGGTGAGTTCACCGGAACATTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
303 bp | 100 aa | 86 | 388 | + | No |
AG : IS66 TnpA
ORF sequence :
MSRSRYTPEQKQHHVTQWRHSDLTRKQYCEQHQLSFSAFRDWIADSNNIPQPLSQTLPALLPVSLQPDDARTVTLHTPDGYAIACPLTLLPDVMRVLTRC
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
336 bp | 111 aa | 382 | 717 | + | No |
AG : IS66 TnpB
ORF sequence :
MLKPQQLFLVREPVDMRRGIDALTQHLEGLNLRWQDEAAFIFCNKARSRLKVLRWDRHGVWLCTRRLHRAHFVWPKQGERSWVMTPAQFDWLIRGIHWQQ
VEGDDLSGWRN
VEGDDLSGWRN
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1503 bp | 501 aa | 781 | 2283 | + | No |
Chemistry : DDE
ORF sequence :
MTLNDLIALDDPGQFRLRALALLQQQGDYIRQLEEALKQAQRWRFGAHSETLPAGPKRSQFEEDADTDIAVLETRLSRLHIKENATPAQPKRQPLPATLP
REDIRLAPETESCPDCGHALRFLRDEISERLEYRPATFIVRRYISPQYSCARCQCVHAKAQPAHLIEKGLPEPGLLAQVVVAKYRDHLPLYRQQQIYARS
GVTLARSTLSDWVGQVAVALQPLADALKQTLLTSPVLHADETPLPILAPGKGQTQRAYLWTSVTGPDTSPAVVYYELHPGRSGRYAQSLLKDWSGGTLVT
DDYAGYNALHARADITEAGCWAHARRKFFDQYKASQSPVAKQALDGIRDLYKLERKIKHRPPDKRRQWRQRYARPWLNEFRSWLQTMQTQTAPNSGLRKA
IDYTLKRWSALVCYLDDGGVPIDNNRAENAVRGVALGRKNWLFAGSLAAGQRAAMIMSLLETAKANGHEPWVWLRDVLSRLPVWPNNRLNELLPWPENPF
R
REDIRLAPETESCPDCGHALRFLRDEISERLEYRPATFIVRRYISPQYSCARCQCVHAKAQPAHLIEKGLPEPGLLAQVVVAKYRDHLPLYRQQQIYARS
GVTLARSTLSDWVGQVAVALQPLADALKQTLLTSPVLHADETPLPILAPGKGQTQRAYLWTSVTGPDTSPAVVYYELHPGRSGRYAQSLLKDWSGGTLVT
DDYAGYNALHARADITEAGCWAHARRKFFDQYKASQSPVAKQALDGIRDLYKLERKIKHRPPDKRRQWRQRYARPWLNEFRSWLQTMQTQTAPNSGLRKA
IDYTLKRWSALVCYLDDGGVPIDNNRAENAVRGVALGRKNWLFAGSLAAGQRAAMIMSLLETAKANGHEPWVWLRDVLSRLPVWPNNRLNELLPWPENPF
R
Blast result :
Comments
ISPlu20 is 68% (ORFC) aa similar to ISShdy2.
References
1] Friedhelm Pfeiffer (2016) Direct submission.
2] Duchaud,E., Rusniok,C., Frangeul,L., Buchrieser,C., Taourit,S., Bocs,S., Boursaux-Eude,C., Chandler,M., Dassa,E., Derose,R., Derzelle,S., Freyssinet,G., Gaudriault,S., Givaudan,A., Glaser,P., Medigue,C., Lanois,A., Powell,K., Siguier,P., Wingate,V., Zouine,M., Boemare,N., Danchin,A. and Kunst,F. (2003) Nat. Biotechnol. 11: 1307-1313
2] Duchaud,E., Rusniok,C., Frangeul,L., Buchrieser,C., Taourit,S., Bocs,S., Boursaux-Eude,C., Chandler,M., Dassa,E., Derose,R., Derzelle,S., Freyssinet,G., Gaudriault,S., Givaudan,A., Glaser,P., Medigue,C., Lanois,A., Powell,K., Siguier,P., Wingate,V., Zouine,M., Boemare,N., Danchin,A. and Kunst,F. (2003) Nat. Biotechnol. 11: 1307-1313