ISSphsp11
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Sphingobium sp. | Sphingobium sp. Sphingobium sp. SA2 |
DNA section
IS Length : 2826 bp
Ends
IR Length : 17/22
IRL : GTAACCGACCGGTTGAGGCCGCCTGCCTTGCTGTGATGTGAGCGCGTAAG
IRR : GTAACCGTCCGATTGTTACCGCGGCTTATGGGCCTTGAAGCGCGCGGCGA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGTCCTGGCC | AGACGATC | CTGCCTCGGC | 8 |
GGCCGACCTT | GATGCCTG | GATGGAAAGC | 8 |
GCTGGCATCT | CAGGCGGC | ATCGCGAAAC | 8 |
TCCTGGAGTG | GATCGAAC | AGCGTCGAGT | 8 |
TGTGCCGCCC | GTAGCTCC | GGGTCGCGCG | 8 |
CTGTTCGATC | CATCAATCTT | 0 |
DNA sequence
GTAACCGACCGGTTGAGGCCGCCTGCCTTGCTGTGATGTGAGCGCGTAAGAGACGTCCATAAGGACGTCGTTATGGACGTCCTGAGCCAAAGCGAGGGAC
GGCGGTGGAGATATTGGGCCGGGAACGGCGTCGTCGGTGGAGCAACGCGGAGAAGCTCGAGATCGTCGCCGCTGTGGGAATGAACGGAGAGACGTTGGCG
CGCGTCGCGCGGCGCTACGATGTGTCGCGAAGCCAGATCTATCAATGGCGCCATCTGTTCAGGAAGCGAGGGTTGCTGCCTGCGGCGGACGGGCCGACCT
TTTTGCCGGTTGATATCGGGGCGCCCATGCTGAGCGCGGAGCCCGTGCTCGATGACCGGGTGGTGGTCTCGCCCTTGATCGTCGAGCTGTGTCTGGCGCA
AGGACGGCGGCTGCGCTTCGATGCCGGCATCGAGGCTACTGCCTTGACCCAGCTTATCCGGTCGGTGGAAGCTGCATGATCGGACCGGGCACCGGCGTAA
GGGTCTATCTCGCCTGCGGCGTCACTGACATGCGCAAAGGGATTTCGGGTCTTGCAATGCTGGCCCAGACCGTGCTGCGACAGAAGCCGGCCAGTGGTGC
CGTATTCGCCTTTCGGGGACGGCGCGGCGACAGGCTGAAGCTGCTTTACTGGGATGGTCAGGGATTTTGCCTCTATTACAAGATCCTGGAGCGCGGCCGT
TTTCCCTGGCCCTCTGCGGCTGACGGGGCGGCGCGACTGACATCGGCCCAACTCGCGATGCTGTGGGAAGGGATCGACTGGAGGCGTCCGAACTGGGGTG
CGCCGCCTACACGCGCGGGGTAACTTTTCTTCTGCAAAAGCGGCTGTTCTGCTAGGCTTTGCGGCCGATTTCCTGTATAAAAAGCCATGCTCAACGCGGC
ATCCGATCTTCCCGAAGACCCGGCCCTTCTCAAGGCGCTGATCGCCCAGTTGCAGGCCCAGAACAAGAAGCTGACGACGACGCTGCGCGCCCATGACCTG
GTCGTCCAGGCACTGCGTTTGCAGATCGCCAGGCTCAGGAAGCAGGCTTTCGGCAAATCCTCCGAGAAGATCGAACGCGAGATCGAGCAACTCGAGTTCG
CTCTCGAAGACGTGCTTGTTTCGGTCGCGCAACAGGGCCTCGTGACAAGTGGCCTGGACGATTCCGAGGCGCAAGCGGCGGTCGCGTCTGCCGAGACCAC
CCCTGGGCCGCGTCCATCTCGGCGCCCACGCGTCTCGGCGGATACGCCGCGCCAACGCCACGAACTTGACCCCGGCAGCACCTGTCCGGAGTGCGGCGGC
GAGCTGCGTCTCGTCGGCGAAGACGTGAGCGAGATCCTCGAGATGATCACGGCAAAGCTCCAGGTCATCGAAGTCGTCCGCCCGAAGAAGTCCTGCCGTT
GCTGCGAGAAGATGGTCCAGGTGCCAGCCCCGAGCCGACCGATCCCCGGTAGCATGGCCGGAGGAAGTCTCCTGGCTTACGTGCTGATCTCGAAGTTTGA
CGACCATTTGCCGCTGTACCGGCAGAACGAGATCATCGCGCGCATGGGTGCCGACATCCCACGCAGCACGCTGGCCGATTGGTGTGGTCGCTCGATGCGG
ATCCTGCAGCCGGTGATCGACCGGATCGAGGCCTCGGTCCTGGGCAGCGACATCCTGCACGCCGACGATACACCGATCCGCGTGCTGGCTCCGGAACGAC
GCGCCAAGGGTATCGGCAAGGGTGTGATGCAAGGCCGGATCTGGGGCTATGTCTGTGATCAGCGGCCTTGGGCCGGCACCGCGCCGCCTGGCGTCCTCTA
CCGCTATGCGCCCAACTGGAAAGCCGAGCACGTGCTGGCGCACTTGGGCAGCGCAAGCGGTATTCTTCAAGCCGACGCCTACAAAGGCTATGCTAAGCTC
TACGAACCCGTCGCGGATGGTGAGCCTCGCTTCCGGGAAGCGGCCTGCTTTGCGCATTGGCGACGCGATTTTCATGACATCTGGACGTCGCAAAAATCCG
AGATCGCACACGAGGCGCTCGAGCGTATCGGGCAACTCTACGACATTGAGCGCAAGATCGCGGGCAAGCCTGCCGATATCCGCCAGGCAATACGCCAGGA
GTTGAGCCGACCCAAGCTTGAGGCTCTGCATAGCTGGGCCGAGAAGCAACTCACCCGTATCTCGACCAAGGGAGACCTGGCGAAGGCCTTTCGCTACGCC
CTGGGGCGCTGGCACGCCTTCAGCCTCTTTATCGACGATGGACGCGTCGCCATCGACAACAACGCTGCCGAGCGGGCTGTCCGGCCCATATGCCTGGGCA
AGAAGAACTGGTTATTCGCCGGCTCCGAAACAGGCGCCGAAACCCTCGCCCGCGCCATGACGCTGATCGAAAGCGCCAAAATGAACGGTCTCGATCCACA
AGCATACATCACCGACTTGCTCAATCGCATCCACGATCACAAGATCAACAGGATCGACGAGCTGTTACCCTGGAACTGGGTGCCGCTCACCATCCCGCTA
GCACTGGCCGCCTGATGGCCGCCGTCACTTCCTTGCTCACCCTCGACTATGTCGCCAAGATACTCGACGAGAACGTGGAGTTGCTGGAAGCCATCGTTTC
AAACGACGACAACCTCACCTACGGCGCCATCGTCAGCGTCCACACCGGTCTGGACGAGTCCATCACCGCGCTGACGGATCACGGCGTGGAAGAACTGACG
GACATGCTTAGCCAAGCACGTCTCACGACCAAGGCCTGGCACCAATTCCTCGATGACTTTATCGACGATCCCGAGCTCGCCGCGCGCTTCAAGGCCCATA
AGCCGCGGTAACAATCGGACGGTTAC
GGCGGTGGAGATATTGGGCCGGGAACGGCGTCGTCGGTGGAGCAACGCGGAGAAGCTCGAGATCGTCGCCGCTGTGGGAATGAACGGAGAGACGTTGGCG
CGCGTCGCGCGGCGCTACGATGTGTCGCGAAGCCAGATCTATCAATGGCGCCATCTGTTCAGGAAGCGAGGGTTGCTGCCTGCGGCGGACGGGCCGACCT
TTTTGCCGGTTGATATCGGGGCGCCCATGCTGAGCGCGGAGCCCGTGCTCGATGACCGGGTGGTGGTCTCGCCCTTGATCGTCGAGCTGTGTCTGGCGCA
AGGACGGCGGCTGCGCTTCGATGCCGGCATCGAGGCTACTGCCTTGACCCAGCTTATCCGGTCGGTGGAAGCTGCATGATCGGACCGGGCACCGGCGTAA
GGGTCTATCTCGCCTGCGGCGTCACTGACATGCGCAAAGGGATTTCGGGTCTTGCAATGCTGGCCCAGACCGTGCTGCGACAGAAGCCGGCCAGTGGTGC
CGTATTCGCCTTTCGGGGACGGCGCGGCGACAGGCTGAAGCTGCTTTACTGGGATGGTCAGGGATTTTGCCTCTATTACAAGATCCTGGAGCGCGGCCGT
TTTCCCTGGCCCTCTGCGGCTGACGGGGCGGCGCGACTGACATCGGCCCAACTCGCGATGCTGTGGGAAGGGATCGACTGGAGGCGTCCGAACTGGGGTG
CGCCGCCTACACGCGCGGGGTAACTTTTCTTCTGCAAAAGCGGCTGTTCTGCTAGGCTTTGCGGCCGATTTCCTGTATAAAAAGCCATGCTCAACGCGGC
ATCCGATCTTCCCGAAGACCCGGCCCTTCTCAAGGCGCTGATCGCCCAGTTGCAGGCCCAGAACAAGAAGCTGACGACGACGCTGCGCGCCCATGACCTG
GTCGTCCAGGCACTGCGTTTGCAGATCGCCAGGCTCAGGAAGCAGGCTTTCGGCAAATCCTCCGAGAAGATCGAACGCGAGATCGAGCAACTCGAGTTCG
CTCTCGAAGACGTGCTTGTTTCGGTCGCGCAACAGGGCCTCGTGACAAGTGGCCTGGACGATTCCGAGGCGCAAGCGGCGGTCGCGTCTGCCGAGACCAC
CCCTGGGCCGCGTCCATCTCGGCGCCCACGCGTCTCGGCGGATACGCCGCGCCAACGCCACGAACTTGACCCCGGCAGCACCTGTCCGGAGTGCGGCGGC
GAGCTGCGTCTCGTCGGCGAAGACGTGAGCGAGATCCTCGAGATGATCACGGCAAAGCTCCAGGTCATCGAAGTCGTCCGCCCGAAGAAGTCCTGCCGTT
GCTGCGAGAAGATGGTCCAGGTGCCAGCCCCGAGCCGACCGATCCCCGGTAGCATGGCCGGAGGAAGTCTCCTGGCTTACGTGCTGATCTCGAAGTTTGA
CGACCATTTGCCGCTGTACCGGCAGAACGAGATCATCGCGCGCATGGGTGCCGACATCCCACGCAGCACGCTGGCCGATTGGTGTGGTCGCTCGATGCGG
ATCCTGCAGCCGGTGATCGACCGGATCGAGGCCTCGGTCCTGGGCAGCGACATCCTGCACGCCGACGATACACCGATCCGCGTGCTGGCTCCGGAACGAC
GCGCCAAGGGTATCGGCAAGGGTGTGATGCAAGGCCGGATCTGGGGCTATGTCTGTGATCAGCGGCCTTGGGCCGGCACCGCGCCGCCTGGCGTCCTCTA
CCGCTATGCGCCCAACTGGAAAGCCGAGCACGTGCTGGCGCACTTGGGCAGCGCAAGCGGTATTCTTCAAGCCGACGCCTACAAAGGCTATGCTAAGCTC
TACGAACCCGTCGCGGATGGTGAGCCTCGCTTCCGGGAAGCGGCCTGCTTTGCGCATTGGCGACGCGATTTTCATGACATCTGGACGTCGCAAAAATCCG
AGATCGCACACGAGGCGCTCGAGCGTATCGGGCAACTCTACGACATTGAGCGCAAGATCGCGGGCAAGCCTGCCGATATCCGCCAGGCAATACGCCAGGA
GTTGAGCCGACCCAAGCTTGAGGCTCTGCATAGCTGGGCCGAGAAGCAACTCACCCGTATCTCGACCAAGGGAGACCTGGCGAAGGCCTTTCGCTACGCC
CTGGGGCGCTGGCACGCCTTCAGCCTCTTTATCGACGATGGACGCGTCGCCATCGACAACAACGCTGCCGAGCGGGCTGTCCGGCCCATATGCCTGGGCA
AGAAGAACTGGTTATTCGCCGGCTCCGAAACAGGCGCCGAAACCCTCGCCCGCGCCATGACGCTGATCGAAAGCGCCAAAATGAACGGTCTCGATCCACA
AGCATACATCACCGACTTGCTCAATCGCATCCACGATCACAAGATCAACAGGATCGACGAGCTGTTACCCTGGAACTGGGTGCCGCTCACCATCCCGCTA
GCACTGGCCGCCTGATGGCCGCCGTCACTTCCTTGCTCACCCTCGACTATGTCGCCAAGATACTCGACGAGAACGTGGAGTTGCTGGAAGCCATCGTTTC
AAACGACGACAACCTCACCTACGGCGCCATCGTCAGCGTCCACACCGGTCTGGACGAGTCCATCACCGCGCTGACGGATCACGGCGTGGAAGAACTGACG
GACATGCTTAGCCAAGCACGTCTCACGACCAAGGCCTGGCACCAATTCCTCGATGACTTTATCGACGATCCCGAGCTCGCCGCGCGCTTCAAGGCCCATA
AGCCGCGGTAACAATCGGACGGTTAC
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
375 bp | 124 aa | 114 | 488 | + | No |
AG : IS66 TnpA
ORF sequence :
VEILGRERRRRWSNAEKLEIVAAVGMNGETLARVARRYDVSRSQIYQWRHLFRKRGLLPAADGPTFLPVDIGAPMLSAEPVLDDRVVVSPLIVELCLAQG
RRLRFDAGIEATALTQLIRSVEAA
RRLRFDAGIEATALTQLIRSVEAA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
348 bp | 115 aa | 485 | 832 | + | No |
AG : IS66 TnpB
ORF sequence :
MIGPGTGVRVYLACGVTDMRKGISGLAMLAQTVLRQKPASGAVFAFRGRRGDRLKLLYWDGQGFCLYYKILERGRFPWPSAADGAARLTSAQLAMLWEGI
DWRRPNWGAPPTRAG
DWRRPNWGAPPTRAG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1629 bp | 542 aa | 896 | 2524 | + | No |
Chemistry : DDE
ORF sequence :
MLNAASDLPEDPALLKALIAQLQAQNKKLTTTLRAHDLVVQALRLQIARLRKQAFGKSSEKIEREIEQLEFALEDVLVSVAQQGLVTSGLDDSEAQAAVA
SAETTPGPRPSRRPRVSADTPRQRHELDPGSTCPECGGELRLVGEDVSEILEMITAKLQVIEVVRPKKSCRCCEKMVQVPAPSRPIPGSMAGGSLLAYVL
ISKFDDHLPLYRQNEIIARMGADIPRSTLADWCGRSMRILQPVIDRIEASVLGSDILHADDTPIRVLAPERRAKGIGKGVMQGRIWGYVCDQRPWAGTAP
PGVLYRYAPNWKAEHVLAHLGSASGILQADAYKGYAKLYEPVADGEPRFREAACFAHWRRDFHDIWTSQKSEIAHEALERIGQLYDIERKIAGKPADIRQ
AIRQELSRPKLEALHSWAEKQLTRISTKGDLAKAFRYALGRWHAFSLFIDDGRVAIDNNAAERAVRPICLGKKNWLFAGSETGAETLARAMTLIESAKMN
GLDPQAYITDLLNRIHDHKINRIDELLPWNWVPLTIPLALAA
SAETTPGPRPSRRPRVSADTPRQRHELDPGSTCPECGGELRLVGEDVSEILEMITAKLQVIEVVRPKKSCRCCEKMVQVPAPSRPIPGSMAGGSLLAYVL
ISKFDDHLPLYRQNEIIARMGADIPRSTLADWCGRSMRILQPVIDRIEASVLGSDILHADDTPIRVLAPERRAKGIGKGVMQGRIWGYVCDQRPWAGTAP
PGVLYRYAPNWKAEHVLAHLGSASGILQADAYKGYAKLYEPVADGEPRFREAACFAHWRRDFHDIWTSQKSEIAHEALERIGQLYDIERKIAGKPADIRQ
AIRQELSRPKLEALHSWAEKQLTRISTKGDLAKAFRYALGRWHAFSLFIDDGRVAIDNNAAERAVRPICLGKKNWLFAGSETGAETLARAMTLIESAKMN
GLDPQAYITDLLNRIHDHKINRIDELLPWNWVPLTIPLALAA
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
297 bp | 98 aa | 2515 | 2811 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MAAVTSLLTLDYVAKILDENVELLEAIVSNDDNLTYGAIVSVHTGLDESITALTDHGVEELTDMLSQARLTTKAWHQFLDDFIDDPELAARFKAHKPR
Blast result :
Comments
ISSphsp11 is 81% aa (transposase) similar to ISPpa5.
References
1] Maurizio Labbate (2021) Direct submission.