ISPpu30
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP016215 | ND | Pseudomonas putida | Pseudomonas putida Pseudomonas putida SY153 Pseudomonas aeruginosa PA121617 plasmid pBM413 |
DNA section
IS Length : 3000 bp
Ends
IR Length : 20/23
IRL : GTAAGCGTCTGGCGAACACACCTACAGAATCCGCAGATTAAGGGCGATGG
IRR : GTAAGCGTACGGCAAACACACCTTGCAAGCAGAACAGGCAATCAACGAGT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGCCTCGAATGCC | TCAAGAGC | CGCGAAGTAGGAGC | 8 |
DNA sequence
GTAAGCGTCTGGCGAACACACCTACAGAATCCGCAGATTAAGGGCGATGGTGTCCACCTAAAAATACGTGGACACCATCGCCCTTAATTCAAGGAACACC
ATGCGCCAACGCACCTCTTATCCCAAATCCTTCAAGGCTCAGGTTGTTCAGGAATGCCTGAACCCTGACGTATCAATTGCCAGCGTAGCCCTTCGGCACG
GCATCAACGCCAACCTGGTTCGGAAGTGGATACCCCTCTACCGTGACCCAAAAGTTTCTGCCTTACCTGCATTTGTACCGGTGAAGCTTGAGGCTGCGGC
GATGCCGGTTCGACAGGCTGTAGCTCGCATCGATATTTCTTCTGGGCATCAAACACTCACGGTGAGTTGGCCTGCCTCTGATCCAGACGGCTGCGCCCGC
TTCATTCGCAGCCTCTCGCAATGATCCGTATCGACGCCATCTGGCTCGCGACCGAGCCCATGGACATGCGTGCCGGCACGGATACAGCATTGGCCCGCGT
GGTCGCCGTATTCGGTGCGGCGAAGCCGCACTGCGCTTATCTGTTCGCCAACCGACGCGCCAACCGGATGAAAGTGCTGGTGCACGATGGCGTGGGCCTC
TGGCTGGCCGCGCGCCGCCTGAACCAAGGTAAATTCCATTGGCCCGGCACTCATCGAGGCCATGAAGTTGAACTCGATGCCGAACAACTCCAAGCGCTGG
TGCTGGGCCTGCCCTGGCAGCGGGTTGGCGCTAACGGCGCAATTACGTTGCTCTGAATCGGGTCTTTTAACTGTGACAGGTGGGCTGCTTTGGTAAAATC
CACGGCATGACTTCCTCGCCCAATCTCGACCAAATGACTCCCGATCAACTGCGCGCACTCGCTAAGCAGTTGCTGTCGAAGGTCGACACCATGGGCCAAG
AGAGCCGTCGCGACAAAACCATCATCGAGCAGCTCTCTCACGAGATCGCCATCCTCAAACGCCACAAGTTTGCCAAGCGCAGCGAGCAGATCAGCCCGGC
GCAAGGCAGCTTGCTGGATGACCTGCTCAGCACCGACCTCGAAGCCATCGACGCCGAGTTGAAAGCACTTCGTCCCGCACCAGCACCGGACGAACCACGC
CAACAACCCAAACGTGCGCCATTGCCGCCGCAGTTTCCTCGTACTGTCATTCATCATGAGCCAGAGAGCACCGAATGTACCTGCGGCTGCCAACTTCAGC
GCATCGGCGAGGATGTCAGTGAGAAGCTGGATTACACCCCGGGTGTGTTCACCGTTGAGCAACATGTACGTGGCAAATGGGCCTGCCGACAGTGCGAAAC
GCTGATCCAAGCGTCGGTGCCGGCACTGGTGATCGACAAGGGCATCCCTACTGCCGGCTTGCTGGCCCACGTGGTGGTGGCCAAGTTCGCCGATCATCTC
CCACTGTACCGGCAGGAAAAGATCTTTGGCCGTGCTGGCCTCGCTATCCCGCGCTCGACGCTTGCTCAGTGGGTCGGCCAAACCGGCCTACAACTGCAAC
CGTTGGTGGATGCACTGCGCGAGACAGTGCTGGCCCATCGAGTCGTCCATGCCGATGAAACACCGGTACAGATGCTGGCACCGGGTAAAAAGAAAACGCA
CCGAGCGTATGTCTGGGCCTACTGCACTACGCCGTTTTCGGCGTTGAAAGCTGTGGTTTACGACTTCAGCCCCAGCCGTGCTGGCGAACATGCGCGCAAC
TTCCTGGGCACTTGGAATGGCAAGCTGGTCTGCGATGACTTCGCAGGCTACAAGGCTGGCTTCGAACAGGGCATCACCGAAATCGGCTGCATGGCCCATG
CGCGGCGCAAGTTTTTCGACCTACATGTCGCGAACAAAAGCCAATTGGCAGAACAAGCGCTGCACTCCATCGGCGGCTTGTACGAAGTCGAACGGCAGGC
CAAAGGCATGAGCGATGAAAAGCGCTGGCGGTTACGCCAGCAAATAGCAGTGCCCATCGCCGAGAAACTGCATGAGTGGATGATGGCTCAGCGCGCGCTT
GTGCCCGAGGGCTCGGCTACGGCCAAGGCACTGGGTTACAGCCTGAAACGCTGGGTAGTGCTGACGCGCTACCTGGATGATGGTGCAGTGCCCATTGACA
ATAATGCAGTCGAAAACACGATCAGGCCGTGGGCGCTTGGGCGTTCCAACTGGCTCTTCGCTGGGTCACTGCGCAGTGGTAAACGGGCGGCGGCGATCAT
GAGCCTGATCCAGTCGGCGCGCATGAATGGGCATGATCCGTATATTTATCTGAAAGACGTTTTAGCGAGGTTGCCGACGCAACGAGCATCTAGCGTTGCG
CAACTGCTACCGCATCAATGGATTGCTGTATGACTCTCGGTAGGTATTCTCTGCTGAGTAAACTTTTCATTTGAAGAAAAATCGGAGCGTTTATGCGCCA
GGTTCTGCTCGTAGTCGATATTCAGTCCACATTCAACCCACCGGAATGGTTGGTTGACGGCGTGCAGGCGCTGTCGATGAAGATCCCGACAATTGCGTCT
ATCGAGCTTCACGATGAACAAGCGACGCCCTTCCAGCGTCAGCTCGGTTGGAGCCCGGCAAGCACTGACAAATGCCTTATCAAGGCCGACAGGGTCTTCG
TAAAAAATGGATATGGACAGACCTTAGAAACCATCAAGTACATCAAGGCGCTGGGCGTTGATCGAGTCCTGGTCTGCGGAATACAGACCGAAACCTGTGT
CCTCGCAGCTGGATTTGCATTGTTCGACGCTGGCCTCACTCCAACGCTGATAACGGATTTAACGGTGGGTTCGTCTTTGGATCGGTCTGGTCAACTCGGC
ATTGACCTGTGGGCACATCACTTCCGCAACGTGACAACGGCAGACGAAGTGATTGCAGGGCTGTCAGCGCTGCGTTGAAGGCCTGAGTAGCTCGATCAAG
AATCAACATCACCGATGTGCTCGCACTCCAAATTGGCAATATCCGAGCTGACTCGTTGATTGCCTGTTCTGCTTGCAAGGTGTGTTTGCCGTACGCTTAC
ATGCGCCAACGCACCTCTTATCCCAAATCCTTCAAGGCTCAGGTTGTTCAGGAATGCCTGAACCCTGACGTATCAATTGCCAGCGTAGCCCTTCGGCACG
GCATCAACGCCAACCTGGTTCGGAAGTGGATACCCCTCTACCGTGACCCAAAAGTTTCTGCCTTACCTGCATTTGTACCGGTGAAGCTTGAGGCTGCGGC
GATGCCGGTTCGACAGGCTGTAGCTCGCATCGATATTTCTTCTGGGCATCAAACACTCACGGTGAGTTGGCCTGCCTCTGATCCAGACGGCTGCGCCCGC
TTCATTCGCAGCCTCTCGCAATGATCCGTATCGACGCCATCTGGCTCGCGACCGAGCCCATGGACATGCGTGCCGGCACGGATACAGCATTGGCCCGCGT
GGTCGCCGTATTCGGTGCGGCGAAGCCGCACTGCGCTTATCTGTTCGCCAACCGACGCGCCAACCGGATGAAAGTGCTGGTGCACGATGGCGTGGGCCTC
TGGCTGGCCGCGCGCCGCCTGAACCAAGGTAAATTCCATTGGCCCGGCACTCATCGAGGCCATGAAGTTGAACTCGATGCCGAACAACTCCAAGCGCTGG
TGCTGGGCCTGCCCTGGCAGCGGGTTGGCGCTAACGGCGCAATTACGTTGCTCTGAATCGGGTCTTTTAACTGTGACAGGTGGGCTGCTTTGGTAAAATC
CACGGCATGACTTCCTCGCCCAATCTCGACCAAATGACTCCCGATCAACTGCGCGCACTCGCTAAGCAGTTGCTGTCGAAGGTCGACACCATGGGCCAAG
AGAGCCGTCGCGACAAAACCATCATCGAGCAGCTCTCTCACGAGATCGCCATCCTCAAACGCCACAAGTTTGCCAAGCGCAGCGAGCAGATCAGCCCGGC
GCAAGGCAGCTTGCTGGATGACCTGCTCAGCACCGACCTCGAAGCCATCGACGCCGAGTTGAAAGCACTTCGTCCCGCACCAGCACCGGACGAACCACGC
CAACAACCCAAACGTGCGCCATTGCCGCCGCAGTTTCCTCGTACTGTCATTCATCATGAGCCAGAGAGCACCGAATGTACCTGCGGCTGCCAACTTCAGC
GCATCGGCGAGGATGTCAGTGAGAAGCTGGATTACACCCCGGGTGTGTTCACCGTTGAGCAACATGTACGTGGCAAATGGGCCTGCCGACAGTGCGAAAC
GCTGATCCAAGCGTCGGTGCCGGCACTGGTGATCGACAAGGGCATCCCTACTGCCGGCTTGCTGGCCCACGTGGTGGTGGCCAAGTTCGCCGATCATCTC
CCACTGTACCGGCAGGAAAAGATCTTTGGCCGTGCTGGCCTCGCTATCCCGCGCTCGACGCTTGCTCAGTGGGTCGGCCAAACCGGCCTACAACTGCAAC
CGTTGGTGGATGCACTGCGCGAGACAGTGCTGGCCCATCGAGTCGTCCATGCCGATGAAACACCGGTACAGATGCTGGCACCGGGTAAAAAGAAAACGCA
CCGAGCGTATGTCTGGGCCTACTGCACTACGCCGTTTTCGGCGTTGAAAGCTGTGGTTTACGACTTCAGCCCCAGCCGTGCTGGCGAACATGCGCGCAAC
TTCCTGGGCACTTGGAATGGCAAGCTGGTCTGCGATGACTTCGCAGGCTACAAGGCTGGCTTCGAACAGGGCATCACCGAAATCGGCTGCATGGCCCATG
CGCGGCGCAAGTTTTTCGACCTACATGTCGCGAACAAAAGCCAATTGGCAGAACAAGCGCTGCACTCCATCGGCGGCTTGTACGAAGTCGAACGGCAGGC
CAAAGGCATGAGCGATGAAAAGCGCTGGCGGTTACGCCAGCAAATAGCAGTGCCCATCGCCGAGAAACTGCATGAGTGGATGATGGCTCAGCGCGCGCTT
GTGCCCGAGGGCTCGGCTACGGCCAAGGCACTGGGTTACAGCCTGAAACGCTGGGTAGTGCTGACGCGCTACCTGGATGATGGTGCAGTGCCCATTGACA
ATAATGCAGTCGAAAACACGATCAGGCCGTGGGCGCTTGGGCGTTCCAACTGGCTCTTCGCTGGGTCACTGCGCAGTGGTAAACGGGCGGCGGCGATCAT
GAGCCTGATCCAGTCGGCGCGCATGAATGGGCATGATCCGTATATTTATCTGAAAGACGTTTTAGCGAGGTTGCCGACGCAACGAGCATCTAGCGTTGCG
CAACTGCTACCGCATCAATGGATTGCTGTATGACTCTCGGTAGGTATTCTCTGCTGAGTAAACTTTTCATTTGAAGAAAAATCGGAGCGTTTATGCGCCA
GGTTCTGCTCGTAGTCGATATTCAGTCCACATTCAACCCACCGGAATGGTTGGTTGACGGCGTGCAGGCGCTGTCGATGAAGATCCCGACAATTGCGTCT
ATCGAGCTTCACGATGAACAAGCGACGCCCTTCCAGCGTCAGCTCGGTTGGAGCCCGGCAAGCACTGACAAATGCCTTATCAAGGCCGACAGGGTCTTCG
TAAAAAATGGATATGGACAGACCTTAGAAACCATCAAGTACATCAAGGCGCTGGGCGTTGATCGAGTCCTGGTCTGCGGAATACAGACCGAAACCTGTGT
CCTCGCAGCTGGATTTGCATTGTTCGACGCTGGCCTCACTCCAACGCTGATAACGGATTTAACGGTGGGTTCGTCTTTGGATCGGTCTGGTCAACTCGGC
ATTGACCTGTGGGCACATCACTTCCGCAACGTGACAACGGCAGACGAAGTGATTGCAGGGCTGTCAGCGCTGCGTTGAAGGCCTGAGTAGCTCGATCAAG
AATCAACATCACCGATGTGCTCGCACTCCAAATTGGCAATATCCGAGCTGACTCGTTGATTGCCTGTTCTGCTTGCAAGGTGTGTTTGCCGTACGCTTAC
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
324 bp | 107 aa | 101 | 424 | + | No |
AG : IS66 TnpA
ORF sequence :
MRQRTSYPKSFKAQVVQECLNPDVSIASVALRHGINANLVRKWIPLYRDPKVSALPAFVPVKLEAAAMPVRQAVARIDISSGHQTLTVSWPASDPDGCAR
FIRSLSQ
FIRSLSQ
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
336 bp | 111 aa | 421 | 756 | + | No |
AG : IS66 TnpB
ORF sequence :
MIRIDAIWLATEPMDMRAGTDTALARVVAVFGAAKPHCAYLFANRRANRMKVLVHDGVGLWLAARRLNQGKFHWPGTHRGHEVELDAEQLQALVLGLPWQ
RVGANGAITLL
RVGANGAITLL
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1527 bp | 508 aa | 807 | 2333 | + | No |
Chemistry : DDE
ORF sequence :
MTSSPNLDQMTPDQLRALAKQLLSKVDTMGQESRRDKTIIEQLSHEIAILKRHKFAKRSEQISPAQGSLLDDLLSTDLEAIDAELKALRPAPAPDEPRQQ
PKRAPLPPQFPRTVIHHEPESTECTCGCQLQRIGEDVSEKLDYTPGVFTVEQHVRGKWACRQCETLIQASVPALVIDKGIPTAGLLAHVVVAKFADHLPL
YRQEKIFGRAGLAIPRSTLAQWVGQTGLQLQPLVDALRETVLAHRVVHADETPVQMLAPGKKKTHRAYVWAYCTTPFSALKAVVYDFSPSRAGEHARNFL
GTWNGKLVCDDFAGYKAGFEQGITEIGCMAHARRKFFDLHVANKSQLAEQALHSIGGLYEVERQAKGMSDEKRWRLRQQIAVPIAEKLHEWMMAQRALVP
EGSATAKALGYSLKRWVVLTRYLDDGAVPIDNNAVENTIRPWALGRSNWLFAGSLRSGKRAAAIMSLIQSARMNGHDPYIYLKDVLARLPTQRASSVAQL
LPHQWIAV
PKRAPLPPQFPRTVIHHEPESTECTCGCQLQRIGEDVSEKLDYTPGVFTVEQHVRGKWACRQCETLIQASVPALVIDKGIPTAGLLAHVVVAKFADHLPL
YRQEKIFGRAGLAIPRSTLAQWVGQTGLQLQPLVDALRETVLAHRVVHADETPVQMLAPGKKKTHRAYVWAYCTTPFSALKAVVYDFSPSRAGEHARNFL
GTWNGKLVCDDFAGYKAGFEQGITEIGCMAHARRKFFDLHVANKSQLAEQALHSIGGLYEVERQAKGMSDEKRWRLRQQIAVPIAEKLHEWMMAQRALVP
EGSATAKALGYSLKRWVVLTRYLDDGAVPIDNNAVENTIRPWALGRSNWLFAGSLRSGKRAAAIMSLIQSARMNGHDPYIYLKDVLARLPTQRASSVAQL
LPHQWIAV
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
486 bp | 161 aa | 2393 | 2878 | + | No |
Annotation : HydrolaseDescription :
ORF sequence :
MRQVLLVVDIQSTFNPPEWLVDGVQALSMKIPTIASIELHDEQATPFQRQLGWSPASTDKCLIKADRVFVKNGYGQTLETIKYIKALGVDRVLVCGIQTE
TCVLAAGFALFDAGLTPTLITDLTVGSSLDRSGQLGIDLWAHHFRNVTTADEVIAGLSALR
TCVLAAGFALFDAGLTPTLITDLTVGSSLDRSGQLGIDLWAHHFRNVTTADEVIAGLSALR
Blast result :
Comments
ISPpu30 is 78% (ORFA), 94% (ORFB) and 90% (ORFC : the transposase) aa similar to ISPpu14. ISPpu30 is disrupted by ISPpu29 (IS3 family member) and was reconstructed in silico by deletion of the ISPpu29 (IS3 family member)and of a copy of the direct repeat generated by ISPpu29 insertion (sequence GTA).
References
1] Dongsheng Zhou (2016) Direct submission.
2] Chen,D., Xu,Z. and Liu,J. (2016) Direct submission GenBank.
2] Chen,D., Xu,Z. and Liu,J. (2016) Direct submission GenBank.