ISSphsp9
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
 | ND | Sphingobium sp. | Sphingobium sp.; Sphingobium sp. SA2; Sphingobium sp. KCTC 72723 |
DNA section
IS Length : 2789 bp
Ends
IR Length : 34/49
IRL : TGCGCGGCGACAACTACGATGGCCGGTTGAGCGGCGACTATGATGGCCGG
IRR : TGTAGCGCGACGATTACGGAGTCCGGTCGGCGTCAATTTTGATGGCCGAT
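The relationship between the listed inverted repeats and the element's termini can be checked with a short script. This is a minimal sketch, assuming that IRL is read directly from the 5' end of the DNA sequence and that IRR is written as the reverse complement of the 3' end. The names `reverse_complement` and `ends_match` are illustrative helpers, not part of any database tooling, and only the first and last lines of the DNA sequence in this record are used.

```python
# Inverted repeats as listed in this record.
IRL = "TGCGCGGCGACAACTACGATGGCCGGTTGAGCGGCGACTATGATGGCCGG"
IRR = "TGTAGCGCGACGATTACGGAGTCCGGTCGGCGTCAATTTTGATGGCCGAT"

# First and last lines of the "DNA sequence" block of this record.
FIRST_LINE = (
    "TGCGCGGCGACAACTACGATGGCCGGTTGAGCGGCGACTATGATGGCCGG"
    "TAGGATCGGTTGTCTGGTCTGATCGTAGGGAGGGGCGCAGCCCCGACCGG"
)
LAST_LINE = (
    "CAAATGGTAGCCAGAATTCACCAACCGGCGCGGCCACCTATCGGCCATCAAAATTGACGCC"
    "GACCGGACTCCGTAATCGTCGCGCTACA"
)

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def ends_match(first_line: str, last_line: str, irl: str, irr: str) -> bool:
    """Check both termini against the listed repeats.

    Assumption: IRR is stored as the reverse complement of the 3' end.
    """
    left_ok = first_line.startswith(irl)
    right_ok = reverse_complement(last_line[-len(irr):]) == irr
    return left_ok and right_ok


if __name__ == "__main__":
    print("termini consistent with IRL/IRR:",
          ends_match(FIRST_LINE, LAST_LINE, IRL, IRR))
```

The check is exact and ungapped, so it only confirms that the listed repeats coincide with the two ends of the element. It does not reproduce the 34/49 figure given under IR Length, which presumably comes from an alignment of IRL against IRR.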
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGCGCGATGC | | GATGCGTCCA | 0 |
DNA sequence
TGCGCGGCGACAACTACGATGGCCGGTTGAGCGGCGACTATGATGGCCGGTAGGATCGGTTGTCTGGTCTGATCGTAGGGAGGGGCGCAGCCCCGACCGG
AGAGCGGACCAGACAACCGGTGGCGATCTTTTTCCCCTTCTTTCGGGGGGCAGATCGGGGCTGTGGCGGGCGGTGGTATCGGAACACTCATCGAACAACG
AGCGATGGGTTTGCGATGCCGGGCCACCACATTTCCGATCAGCAGGTATTTCTCTTCATGACCCATCGTCGCCAACACACCCAGGCCGTCGCGGCTGCCA
AGGCCGGTATCAGCGAACGCAGCGCACGCCGGATCGAGAACGATCCGCAGCTTCCGTCCCAGAAGAAGAAGGAGCGCCACTGGCGCACCCGCGCCGATCC
GCTCGAGCCATTCTGGCCACGTATAGAGGAGTTGCTCCAGATCGACGGTATCATTGCCGTCACGGTCTTCGAGACGCTCCAGGACGAGTTCGGCGAGGAT
GCTGTTCCCGATGCGATACGACGAACACTGGAACGCCGGATCGCCCGCTGGCGGGCACTGCACGGCGGCGAGAAGGAGATCTTCTTCCCGCAGCATCATG
AGCCCGGTCGGCAGGGCCTGTCGGATTTCACGGTATGCGACAGTCTCAAGGTCACCGTTGCCGGCGAGACCCTGGCCTATCGCCTCTACCACTTCCGCTT
GGCGGCGAGTGGCTGGGAGCATGCGGCTGTCGTGCTGGGCGGGGAGAGCTTTGCCGCCCTTTCGGAGCACCTGCAGGATGCGTTGTGGAAGCTGGGCGGT
GCGCCGGCCGAACACCGCAGCGATTCCCTGTCAGCCGCCTACAAAAACCTCGACGCCGATGCGCAGCGGGATTTCACCCGAAGCTATGACGAGCTGTGTC
GTCATTACGGCATGCTTGCTACCCGCAACAACCGCGGCGAGGCGCACGAGAACGGATCGATCGAAGGTCCCCATGCCCATCTCAAGCGACGGCTCGATCA
GGCCTTACGCCGGCGGGGCAGCCGCGATTTCGTCAGCATCGAGGCCTGGCGCGAGTTCGTTGAGGCGCAGGTCGCCAGACAGAACCGGCGGCATGCTGCG
CGCATCGATGCAGAACGCAGGGTACTCAAGGCGCTGCCCGCAAGGCGAACCACCGATTTCGCCATGGCCACCGTCGATGTCACCCGCAACGGCACCGTCG
CCATCGATCGGGTTACCTATTCGGTGCCTTCCCGCCTCGTCGGACGGCGCCTCAACGCGCATCTCTTTGACGATCGCATCGAGCTCTTCCTCGGTCCGGA
CAGGGTAATGTCCACGCCGCGTGTGCGGATCAGTCATCCCCACCGGGGGCATAGCATCGATTTCCGGCACATGATCGGTAACCTGCGCCGCAAGCGCGGT
GCACTGCGCAACCTCGTCTACCGCGAAGCCCTCTTCCCCGATCACGCCTACCGGCGGGCCTGGCAAGCCTTCGATGCCCAACTCGATGGACGGCAGGCCT
GCCGCGATGCCGTCGCGCTGCTCGATATCGCCGCCAGGGGCGACTGTGTCGACGTGCTGGCCCGGCGGATCGATGAGGCTCTCGACAGCGGGCGCTTGCC
CGATGTCGATGCGCTCAGGGACGAGTTCCTGCCAACCGCAAGATCGCAGCGCGATGTCGCTATCCCGCCACCCGATCTGCACAGCTACAACAGCCTGATC
GCCAGCGGGGAGGTGCACTGATGACCCGCACCAAGGATCAGGCCGCCGCCGTACTGCCTACCCTGCTGAAGGCCTTGCGCCTGCCGAGCATCAACCGCAA
CTGGAAGCGCCTCACCGACACCGCCGATCGCGATGGCTGGCCGGCCGCCAACCTGCTGGCCTCGCTTCTCGAGATCGAGATGGCTGATCGCTCCTCCCGG
CGCATCCAGCGCCATCGCGACCAGTCCGGCTTGCCCGCAGGCAAGACCTTCGCCACCTTCGATTTCGACGCCGCCCCCGGCATCCGCAAACCGCACCTCT
TGTCCCTCGCCGCCGGTGACGACTGGATCGAGAACGGCGGCAACCTGCTGCTGTTCGGCCAGAGCGGGACCGGCAAGACGCACGCAGTTGCCGCCATTGG
CCATGCCCTCATCGACACGGGGCGGCGCGTCCTGTTCTGCTCCACCACCGACATGGTCCAGAAGCTCCAGTCCGCGCGCCGCGACCTCAGCCTGCCCGCC
ATGCTCGACAAGCTCGACAAGTTCGATCTCATCGTGCTCGACGATCTGTCCTACGTCCGCAAGGACCAGGTCGAGACCAGCGCCTTGTTCGAGCTCATCG
CCCACCGCTACGAACGCCACTCGCTCGCCATTACCGCCAACTAGCCATTTTCGGCATGGGACAACGTCTTCCCTGATCCCGCCATGACTGTCGCCGCGAT
CGACCGCCTCGTGCACCACTCGACCATCATCGAGATGAACGGCGAAAGCTACCGCAAGCGTTCCGCCGTCGCCCGCATCAACGCCGGCGATTACGACCCG
CCCAATGGCGCCCCGGACCGGCCATCATAATTGTCGCTGGCTTCGCGCTCGGGAGCACCCTTGCTGAGGCAACGGGTAATTGTCGCCAGCGGCAATTACA
TGCCAACCATCCCATGCGCTTCAAACCCTCGCTTCCGGCCAGCGCCAGCGACAATTACCCAGCGTCGTAATTGTCGCTCGACGGTTGCTCCTCGCAACAG
CAAATGGTAGCCAGAATTCACCAACCGGCGCGGCCACCTATCGGCCATCAAAATTGACGCCGACCGGACTCCGTAATCGTCGCGCTACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1506 bp | 501 aa | 216 | 1721 | + | No |
Chemistry : DDE
ORF sequence :
MTHRRQHTQAVAAAKAGISERSARRIENDPQLPSQKKKERHWRTRADPLEPFWPRIEELLQIDGIIAVTVFETLQDEFGEDAVPDAIRRTLERRIARWRA
LHGGEKEIFFPQHHEPGRQGLSDFTVCDSLKVTVAGETLAYRLYHFRLAASGWEHAAVVLGGESFAALSEHLQDALWKLGGAPAEHRSDSLSAAYKNLDA
DAQRDFTRSYDELCRHYRMLATRNNRGEAHENGSIEGPHAHLKRRLDQALRRRGSRDFVSIEAWREFVEAQVARQNRRHAARIDAERRVLKALPARRTTD
FAMVTVDVTRNGTVAIDRVTYSVPSRLVGRRLNAHLFDDRIELFLGPDRVMSTPRVRISHPHRGHSIDFRHMIGNLRRKPGALRNLVYREALFPDHAYRR
AWQAFDAQLDGRQACRDAVALLDIAARGDCVDVLARRIDEALDSGRLPDVDALRDEFLPTARSQRDVAIPPPDLHSYNSLIASGEVH
Blast result :
ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
624 bp | 207 aa | 1721 | 2344 | + | No |
AG : IS21 helper
ORF sequence :
MTRTKDQAAAVLPTLLKALRLPSINRNWKRLTDTADRDGWPAANLLASLLEIEMADRSSRRIQRHRDQSGLPAGKTFATFDFDAAPGIRKPHLLSLAAGD
DWIENGGNLLLFGQSGTGKTHAVAAIGHALIDTGRRVLFCSTTDMVQKLQSARRDLSLPAMLDKLDKFDLIVLDDLSYVRKDQVETSALFELIAHRYERH
SLAITANQPFSAWDNVFPDPAMTVAAIDRLVHHSTIIEMNGESYRKRSAVARINAGDYDPPNGAPDRPS
Blast result :
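The lengths in the two ORF tables follow from the Begin and End coordinates by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that each span includes a stop codon that is not counted in the amino acid length. The `Orf` class and `overlap_bp` function are hypothetical names introduced for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int  # 1-based, inclusive (assumed convention)
    end: int    # 1-based, inclusive (assumed convention)

    @property
    def length_bp(self) -> int:
        """Span length in base pairs."""
        return self.end - self.begin + 1

    @property
    def length_aa(self) -> int:
        """Codon count minus one, assuming the span ends with a stop codon."""
        return self.length_bp // 3 - 1


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two spans (0 if they are disjoint)."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


# Coordinates taken from the ORF 1 and ORF 2 tables of this record.
orf1 = Orf("ORF 1", 216, 1721)
orf2 = Orf("ORF 2", 1721, 2344)

if __name__ == "__main__":
    for orf in (orf1, orf2):
        print(f"{orf.name}: {orf.length_bp} bp, {orf.length_aa} aa")
    print(f"overlap: {overlap_bp(orf1, orf2)} bp")
```

Under these assumptions the computed values agree with the tables (1506 bp and 501 aa for ORF 1, 624 bp and 207 aa for ORF 2), and the shared coordinate 1721 corresponds to a 1 bp overlap between the two ORFs.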
Comments
ISSphsp9 shares 74% amino acid similarity with ISAli13.
References
[1] Maurizio Labbate (2021) Direct submission.