ISSphsp7
- Family IS21
- Group
- Isoform
- Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Sphingobium sp. | Sphingobium sp.; Sphingobium sp. SA2; Sphingobium yanoikuyae SK-NIH.Env6_1116 |
DNA section
IS Length : 2519 bp
Ends
IR Length : 31/39
IRL : TGTGGATCGGCGTGCAAAAAGGACCCCGTTAGCGGGGTGATCGGCGTCTA
IRR : TGTCAATCGGCGTGCAAACGGGACCCCTTATCGGCGCGCAAAAGGGACCC
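The orientation of the two ends can be checked with a minimal Python sketch. It assumes, as appears to be the case in this record, that IRL is given as read from the left end of the DNA sequence and IRR as the reverse complement of the right end. The helper names `reverse_complement` and `irr_matches_right_end` are hypothetical, and `SEQ_START` and `SEQ_END` are copied from the first and last bases of the DNA sequence of this record.

```python
# Complement lookup for the four unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# Values copied from this record.
IRL = "TGTGGATCGGCGTGCAAAAAGGACCCCGTTAGCGGGGTGATCGGCGTCTA"
IRR = "TGTCAATCGGCGTGCAAACGGGACCCCTTATCGGCGCGCAAAAGGGACCC"
SEQ_START = "TGTGGATCGGCGTGCAAAAA"  # first 20 bases of the DNA sequence
SEQ_END = "GTTTGCACGCCGATTGACA"     # last line of the DNA sequence


def irr_matches_right_end(irr: str, seq_end: str) -> bool:
    """True if the reverse complement of IRR ends with the given 3' tail."""
    return reverse_complement(irr).endswith(seq_end)


if __name__ == "__main__":
    # Assumption: IRL is written in the same orientation as the sequence.
    print("IRL matches left end: ", IRL.startswith(SEQ_START))
    # Assumption: IRR is written as the reverse complement of the right end.
    print("IRR matches right end:", irr_matches_right_end(IRR, SEQ_END))
```

Writing both repeats in the outside-to-inside orientation is what allows them to be compared position by position. The sketch only checks orientation and does not attempt to reproduce the IR Length value given above.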
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGAAGTAGAA | | AAGTAGAAGG | 0 |
CTATCGTCAG | | ACCTTGATGG | 0 |
GTCGCAAAAC | | TCTCTGCCGA | 0 |
GACAACCATT | GGC | ACATGACCGG | 3 |
DNA sequence
TGTGGATCGGCGTGCAAAAAGGACCCCGTTAGCGGGGTGATCGGCGTCTAAAAGGGACCCCTCATTTCGATGGTTTAAGCAGCCGGCTGGATTTTCAGGC
GGCGAGATCGGGATGTTGGTTTTGGAGACAGTTTTACGGATCCGGCGCGAGTATGCCGGAGGCAAGGCGATCAAGGCGATCGCACGGGATCTGCATGTGT
CGCGGAAGGTCATCCGCAAAGCGGTCCGAGCGCCGGAGGGCGCCTTTGATTATCAGCGCAAGGTTCAGCCGCTGCCCAGGATCGGTCCGTTTCAGGAGCG
CCTGAACACGCTGCTGGAAGAGAACGAGCTGCGCGGCAGGCGTGACCGGCTGCGGATGACGCGGATCCATGACCTGCTGGTGCGCGAAGGCTTTGAGGGT
TCTTACGATGCGGTGCGGCGCTATACGACGCGCTGGAAGATCGAGCGGCGCAAAGATGCCGGCGATGGTGTCACGGCCTTCATCCCGCTGATGTTCAGGC
CAGGCGAGGCCTACCAGTTCGACTGGAGCCATGAAGATGTTGAGATCGCCGGGGCACCGATGCGGGTGAAGGTCGCGCATATGCGACTGTGTGCCTCACG
CGCGGTCTATGTCCGGGCTTATCCTCGTGAGAGCCAGGAGATGCTGTTCGACGCGCATGCGCGCGGCTTTGCTTTCTTCGGCGGCGTGCCAGGGCGCGGC
ATCTACGATAATATGAAGACGGCTGTGACGAGCGTGTTCACCGGCAAGGAACGCGTCTTCAACCGGCGGTTCCTGATCATGACCGACCATTATATGGTCG
AGCCCACGGCCTGCTCGCCTGCGGCGGGATGGGAGAAGGGTCAGGTCGAGAACCAGGTGCAGACGATCCGGGGTCGCTTCTTCCAGCCCCGGTTGCGGTT
CGCCAGTCTCGAAGAGCTCAATGGCTGGCTGGAGGCCGAGTGTCGGCGCTGGGCGGAACGGCAGCCCCATCCCGAACAGGGAGAGCTGACCGTGGCGCAG
ATGCTGGAGATCGAACGATCTGCGCTGCAGCCGATGCTGGGACCGTTCGACGGCTTCAATGAGAGCGAGCATGCTGTGACCGGCACCTGCCTGATCAGCT
TCGACCGCAATCGCTACTCGGTGCTATCGACGGTGGCACGACGCACGGTGCAGGTCCGCGCCTATGCCGATCGCATCGTCGTTCGCTGCGGCGAAGAGGT
TGTCGCTGAACATCCTCGCTACTTTGGGCGCAACCGCACGATCTATGACCCCTGGCATTATCTGCCAGTGCTGGCCCGCAAGCCTGGCGCGCTGCGGAAC
GGTGCCCCCTTCCAGGACTGGGATCTGCCACCGGCGCTTGCCCGCCTGCGCCGCAAGCTGGGTAATGGCGACGATGCCGATCGCCGGTTCGTCCGTGTAC
TGTCGGCCGTACTGACCGATGGCCTGGAGCCTGTCGAAGCCGCTGTGCATGAGGCACTGGCGACGGGCACGGCAAGCGACGACCTGATCCTCAACATCCT
GGCACGACGCCGGGAACCGCCGCGTCCGCTGACGATCATCACCTCCGAGGATAGCGCTCTGCGCCATCCCCCGATCGCCGACTGTGCCCGTTACGACCAG
CTGAGGACCTTCGATGCAGCGGCATGATATGATCGAGGCCATGCGCGGGCTTGGACTCAAGGGCATGGCGGGCGCGTTCGACGATGCCGTCATCACCGGC
CTTCAGCGCCAGCGCACCACCATGGAGATACTGACCGATCTCCTGCGAGCGGAGGCGACGCATCGTCATGCCGCGTCGATCCGATACCGGATGGCCGCTG
CCAGGCTGCCAGTCGTAAAGGACATCGATGCCTTCCGGTTCGAGGGTACCCCGATCAACGAGGGGCTTGTGCGTTCATTGCACAGCGGCGCGTTCCTGCC
CGCACGGCGCAATATCGTCCTGGTCGGCGGCACAGGCACAGGAAAGACCCATCTGGCCATCGCCATCACCGCCAATGTAGTGCGCTCCGGCGCCCGAGGC
CGCTACTTCAACACGGTCGATCTGGTGACCCGCCTCGAAGAGGAGGCCAGGATCGGCAAGAGTGGCGCTCTCGCCGCCCAGCTATCCCGCCTCGATCTGA
TCGTGCTCGATGAACTGGGCTATCTGCCGTTCGCCCGATCAGGCGGCCAGCTGCTGTTCCACCTGATCAGCAAACTCTATGAGCAGACCAGCGTCATCAT
CACCACCAACCTCGCCTTTGGCGAGTGGCCCACCGTGTTCGGCGATCCCAAGATGACCACCGCGCTCCTCGACCGCGTCACCCATCATTGCGACATCGTC
GAGACCGGCAATGACAGCTGGCGCTTCAAAAACCGCAGCTGATCCCCCCCAGCACCGATCAATTAGAAGATGTTTTGCGCTGCGCGCGCCTCCGGTCGGG
CTACGCCCTCCCTACGCCGCGCGCAGCGCAAGGAAGCCCGTCACATCAACGCTCCGCCATCCTGACAGGGGGTCCCTTTTGCGCGCCGATAAGGGGTCCC
GTTTGCACGCCGATTGACA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1515 bp | 504 aa | 113 | 1627 | + | No |
Chemistry : DDE
ORF sequence :
MLVLETVLRIRREYAGGKAIKAIARDLHVSRKVIRKAVRAPEGAFDYQRKVQPLPRIGPFQERLNTLLEENELRGRRDRLRMTRIHDLLVREGFEGSYDA
VRRYTTRWKIERRKDAGDGVTAFIPLMFRPGEAYQFDWSHEDVEIAGAPMRVKVAHMRLCASRAVYVRAYPRESQEMLFDAHARGFAFFGGVPGRGIYDN
MKTAVTSVFTGKERVFNRRFLIMTDHYMVEPTACSPAAGWEKGQVENQVQTIRGRFFQPRLRFASLEELNGWLEAECRRWAERQPHPEQGELTVAQMLEI
ERSALQPMLGPFDGFNESEHAVTGTCLISFDRNRYSVLSTVARRTVQVRAYADRIVVRCGEEVVAEHPRYFGRNRTIYDPWHYLPVLARKPGALRNGAPF
QDWDLPPALARLRRKLGNGDDADRRFVRVLSAVLTDGLEPVEAAVREALATGTASDDLILNILARRREPPRPLTIITSEDSALRHPPIADCARYDQLRTF
DAAA
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
729 bp | 242 aa | 1614 | 2342 | + | No |
AG : IS21 helper
ORF sequence :
MQRHDMIEAMRGLGLKGMAGAFDDAVTTGLQRQRTTMEILTDLLRAEATHRHAASIRYRMAAARLPVVKDIDAFRFEGTPINEGLVRSLHSGAFLPARRN
IVLVGGTGTGKTHLAIAITANVVRSGARGRYFNTVDLVTRLEEEARIGKSGALAAQLSRLDLIVLDELGYLPFARSGGQLLFHLISKLYEQTSVIITTNL
AFGEWPTVFCDPKMTTALLDRVTHHCDIVETGNDSWRFKNRS
Blast result :
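The ORF coordinates listed above can be cross-checked with a short Python sketch. It assumes that Begin and End are 1-based inclusive positions on the IS and that End includes the stop codon, which is consistent with the listed bp and aa lengths. The `Orf` type and the helper names are hypothetical and are not part of any database tooling.

```python
from typing import NamedTuple


class Orf(NamedTuple):
    name: str
    begin: int  # 1-based, inclusive
    end: int    # 1-based, inclusive (assumed to include the stop codon)


def length_bp(orf: Orf) -> int:
    """Length of the ORF in base pairs."""
    return orf.end - orf.begin + 1


def length_aa(orf: Orf) -> int:
    """Number of amino acids, excluding the stop codon."""
    bp = length_bp(orf)
    if bp % 3 != 0:
        raise ValueError(f"{orf.name}: length {bp} is not a multiple of 3")
    return bp // 3 - 1


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two ORFs (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


def frame_offset(a: Orf, b: Orf) -> int:
    """Reading frame of b relative to a (0 means same frame)."""
    return (b.begin - a.begin) % 3


# Coordinates copied from the ORF tables of this record.
ORF1 = Orf("ORF 1", 113, 1627)
ORF2 = Orf("ORF 2", 1614, 2342)

if __name__ == "__main__":
    for orf in (ORF1, ORF2):
        print(orf.name, length_bp(orf), "bp", length_aa(orf), "aa")
    print("overlap:", overlap_bp(ORF1, ORF2), "bp")
    print("frame offset of ORF 2 relative to ORF 1:", frame_offset(ORF1, ORF2))
```

Under these assumptions the two ORFs share a short stretch at the end of ORF 1, and ORF 2 is read in a different frame from ORF 1.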
Comments
The ISSphsp7 transposase is 83% similar at the amino acid level to that of ISSsp4.
References
[1] Maurizio Labbate (2021) Direct submission.