ISSphsp14
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Sphingobium sp. | Sphingobium sp. Sphingobium sp. SA2 |
DNA section
IS Length : 2460 bp
Ends
IR Length : 17/22
IRL : GTAAGCGGTGTCCTGCCCCCACCTTGCGCAGTTGAAGCGCAGCGTTGAGT
IRR : GTAAGCGGTGTTTTCAGCCCACGGCACTCCATGGCATGAGGTCGCCGACG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGGCATGTGG | AGTGATTC | TGCATCGAAT | 8 |
TCCTGGAGTG | GATCGAAC | AGCGTCGAGT | 8 |
DNA sequence
GTAAGCGGTGTCCTGCCCCCACCTTGCGCAGTTGAAGCGCAGCGTTGAGTTTCCTGGCCCTGAACGGGCTGGAGAGCGCTGCTACTATGATAGTCTCGGG
TGATGATCCTGTGAGCAAAGGCGCGCATCGGTTTGAGGTGTTCACGGGCGCCGGCAAGCGGCGGGATTGGCCGCCGGAAGTGAAGGCCTCGATTGTCGCG
GAGTGTTATACCGGCCGCGAAGGCGTCAGTGCCGTGGCGCGTCGGCATGGACTGGATCCCTCGCAAGTTTACGCGTGGCGGCGGGATTTGCGCAAGCAGC
TCGAGGCCGAGGGGGTAATCGTACCGCCGACGGAACCGGGAGCGCCGTCGTTTGTGCCCGCCGTGGTCGAACCGAGCGTGGCGGCGGATACTGTTGCCAA
GCGTCGCCCTCGTCGCCGACGCCGCGCGACCGAAGCCGCCGTAGAGTTGGAGATCGATGGTGTCGCTGTGAAGATCGGCCGCGGCGCTGACGCGGGCACG
ATTACCGCGGTGATCGAAGCGCTCAGGACGCCCCGGTGATCGGACCTGGCGCCGGGGCGCGCGTGATGGTGGCGACACGACCTGTGGATTTTCGCAAGGG
TGCGGATTCGCTCGCGGCGCTGGTCGGCGCCGAGTACGGCGGCGATCCCTATTCGGGGGTGATCTACGTGTTCCGGGCCAAGCGCGCAGACCGGATCAAG
CTGGTGTGGTGGGACGGCACCGGCCTGTGCCTGATGGCCAAGAAGCTGGAGAGCGGCGGGTTCAAGTGGCCGGGCATCCAGGACGGCGTGATGCGCGTGA
CGGCGGCGCAACTTGGCGCTTTGCTGGAAGGTCTGGACTGGCGCCGGGTGCATGGCGGACGTCGCCCCATTGCCCCGCAGATTGCTGGTTGACGGGGCAT
TCTTCTGCTGATTCACTACTGTCATGCTGATGGAAGCGGACCTTCCCGATGACGTCGAAGCGCTGCGTGCGCTTATCCGCGAACAGGCCCGCGAACTCGA
TGCGCTCAAGGTTTTCCAGGCTGAGGTCGAGCGCCTGAAGGCGATTATAGAGGCCCTCCAGCGTCACCGTTTCGGTCGTCGCTCGGAGCAGCTGGATCCC
GACCAGCTTCAGCTTGCCCTGGAGGAAGTCGAGACGGCCTTGGCCGAGGCGGAGCACGCGCACGACAAGGCAAGCCGGAGGCAAGCCGCTCGTCCGCGCA
AGACCAACCGCGGTTCACTGCCGGCTCATCTCGAACGGATCGAGCAAGTCGTCGACGTCGAGGACAAGGCCTGTCCGTGCTGCGGCGACGCGCTCCACCA
GATCGGCGAGGACGTGGCCGAGCGCCTCGACGTCGTGCCCACCACTTTCCGCGTCCTCGTCACCCGCCGGCCGCGCTACGGCTGTCGTTCGTGCGAGAGC
GCAGTCGTTCAGGCCCCGGCACCAGCACGGATCGTCGAGGGCGGTATTCCCACCGAGGCGCTGATCGCCCAGGTGCTCGTCGCCAAGTACGCCGATCACC
TGCCGCTCTACCGGCAGGCCCAGATCTACGCCCGGCAAGGCATCCAGCTTGATCGATCCACCCTGGCTGACTGGGTGGGTCGGGCAGCCTGGTATCTGCG
CCCCTTGCGTGATCACATCCTCGAACGGCTTCGACGATCCGAACGGCTGTTCGCGGACGAGACGACTGCGCCGGTGCTCGATCCGGGGCGTGGGCGGACC
AAGACCGGCCAGCTATGGGCCTATGCCCGCGACGACCGACCTTGGGGCGCCGATGATCCGCCGATGGTCGCCTATGTCTATGCGGCCGATCGCAGGGGCG
AACGGGCAGAAGCGCATCTCGGCGATTTTGCAGGTATCCTGCAGGTCGATGGCTATGGCGGCTATGCCGCGCTCGCCAGGCGTCGTCAGCAGATCAGCCT
TGCCTTTTGCTGGGCACACGTCCGGCGCAAGTTCTACGAGCTGGCCGACAGCTCTCCGGTGGCAACGGAAGTGCTGCGTCGCGTCGCCTTGCTCTATGCC
ATCGAAGATGAGGTGCGAGGATCATCGGCGGAGCAACGCCGGGCTGTACGCCACGACCGCAGCCGCATCATCGTCGATGACCTTCGCCAATATCTCGATG
CCCGCAATCGCCAGGTCAGCGCCAAGAGCAAGATCGGCGAAGCGATCCGCTATGCGCTCAACCGCTGGGATGGCCTGTCGCGCTTCCTGGACGATGGTCG
CATCGACCTCGACAGCAACACCGTCGAACGCTCTATCCGCCCCCTCGCGCTCAATCGGAAGAATGCGCTGTTCGCCGGCTCCGATGAAGGCGGCGACAAC
TGGGCGGTGATCGCCACGCTCATCGAGAACTGCAAACTCTCCGGCACCAACCCGAATATCTGGCTGACCGAAACCCTCACCAGCCTGGCCAATGGTCATC
CCGCAAACAGCGTCGGCGACCTCATGCCATGGAGTGCCGTGGGCTGAAAACACCGCTTAC
TGATGATCCTGTGAGCAAAGGCGCGCATCGGTTTGAGGTGTTCACGGGCGCCGGCAAGCGGCGGGATTGGCCGCCGGAAGTGAAGGCCTCGATTGTCGCG
GAGTGTTATACCGGCCGCGAAGGCGTCAGTGCCGTGGCGCGTCGGCATGGACTGGATCCCTCGCAAGTTTACGCGTGGCGGCGGGATTTGCGCAAGCAGC
TCGAGGCCGAGGGGGTAATCGTACCGCCGACGGAACCGGGAGCGCCGTCGTTTGTGCCCGCCGTGGTCGAACCGAGCGTGGCGGCGGATACTGTTGCCAA
GCGTCGCCCTCGTCGCCGACGCCGCGCGACCGAAGCCGCCGTAGAGTTGGAGATCGATGGTGTCGCTGTGAAGATCGGCCGCGGCGCTGACGCGGGCACG
ATTACCGCGGTGATCGAAGCGCTCAGGACGCCCCGGTGATCGGACCTGGCGCCGGGGCGCGCGTGATGGTGGCGACACGACCTGTGGATTTTCGCAAGGG
TGCGGATTCGCTCGCGGCGCTGGTCGGCGCCGAGTACGGCGGCGATCCCTATTCGGGGGTGATCTACGTGTTCCGGGCCAAGCGCGCAGACCGGATCAAG
CTGGTGTGGTGGGACGGCACCGGCCTGTGCCTGATGGCCAAGAAGCTGGAGAGCGGCGGGTTCAAGTGGCCGGGCATCCAGGACGGCGTGATGCGCGTGA
CGGCGGCGCAACTTGGCGCTTTGCTGGAAGGTCTGGACTGGCGCCGGGTGCATGGCGGACGTCGCCCCATTGCCCCGCAGATTGCTGGTTGACGGGGCAT
TCTTCTGCTGATTCACTACTGTCATGCTGATGGAAGCGGACCTTCCCGATGACGTCGAAGCGCTGCGTGCGCTTATCCGCGAACAGGCCCGCGAACTCGA
TGCGCTCAAGGTTTTCCAGGCTGAGGTCGAGCGCCTGAAGGCGATTATAGAGGCCCTCCAGCGTCACCGTTTCGGTCGTCGCTCGGAGCAGCTGGATCCC
GACCAGCTTCAGCTTGCCCTGGAGGAAGTCGAGACGGCCTTGGCCGAGGCGGAGCACGCGCACGACAAGGCAAGCCGGAGGCAAGCCGCTCGTCCGCGCA
AGACCAACCGCGGTTCACTGCCGGCTCATCTCGAACGGATCGAGCAAGTCGTCGACGTCGAGGACAAGGCCTGTCCGTGCTGCGGCGACGCGCTCCACCA
GATCGGCGAGGACGTGGCCGAGCGCCTCGACGTCGTGCCCACCACTTTCCGCGTCCTCGTCACCCGCCGGCCGCGCTACGGCTGTCGTTCGTGCGAGAGC
GCAGTCGTTCAGGCCCCGGCACCAGCACGGATCGTCGAGGGCGGTATTCCCACCGAGGCGCTGATCGCCCAGGTGCTCGTCGCCAAGTACGCCGATCACC
TGCCGCTCTACCGGCAGGCCCAGATCTACGCCCGGCAAGGCATCCAGCTTGATCGATCCACCCTGGCTGACTGGGTGGGTCGGGCAGCCTGGTATCTGCG
CCCCTTGCGTGATCACATCCTCGAACGGCTTCGACGATCCGAACGGCTGTTCGCGGACGAGACGACTGCGCCGGTGCTCGATCCGGGGCGTGGGCGGACC
AAGACCGGCCAGCTATGGGCCTATGCCCGCGACGACCGACCTTGGGGCGCCGATGATCCGCCGATGGTCGCCTATGTCTATGCGGCCGATCGCAGGGGCG
AACGGGCAGAAGCGCATCTCGGCGATTTTGCAGGTATCCTGCAGGTCGATGGCTATGGCGGCTATGCCGCGCTCGCCAGGCGTCGTCAGCAGATCAGCCT
TGCCTTTTGCTGGGCACACGTCCGGCGCAAGTTCTACGAGCTGGCCGACAGCTCTCCGGTGGCAACGGAAGTGCTGCGTCGCGTCGCCTTGCTCTATGCC
ATCGAAGATGAGGTGCGAGGATCATCGGCGGAGCAACGCCGGGCTGTACGCCACGACCGCAGCCGCATCATCGTCGATGACCTTCGCCAATATCTCGATG
CCCGCAATCGCCAGGTCAGCGCCAAGAGCAAGATCGGCGAAGCGATCCGCTATGCGCTCAACCGCTGGGATGGCCTGTCGCGCTTCCTGGACGATGGTCG
CATCGACCTCGACAGCAACACCGTCGAACGCTCTATCCGCCCCCTCGCGCTCAATCGGAAGAATGCGCTGTTCGCCGGCTCCGATGAAGGCGGCGACAAC
TGGGCGGTGATCGCCACGCTCATCGAGAACTGCAAACTCTCCGGCACCAACCCGAATATCTGGCTGACCGAAACCCTCACCAGCCTGGCCAATGGTCATC
CCGCAAACAGCGTCGGCGACCTCATGCCATGGAGTGCCGTGGGCTGAAAACACCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
453 bp | 150 aa | 87 | 539 | + | No |
AG : IS66 TnpA
ORF sequence :
MIVSGDDPVSKGAHRFEVFTGAGKRRDWPPEVKASIVAECYTGREGVSAVARRHGLDPSQVYAWRRDLRKQLEAEGVIVPPTEPGAPSFVPAVVEPSVAA
DTVAKRRPRRRRRATEAAVELEIDGVAVKIGRGADAGTITAVIEALRTPR
DTVAKRRPRRRRRATEAAVELEIDGVAVKIGRGADAGTITAVIEALRTPR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
357 bp | 118 aa | 536 | 892 | + | No |
AG : IS66 TnpB
ORF sequence :
VIGPGAGARVMVATRPVDFRKGADSLAALVGAEYGGDPYSGVIYVFRAKRADRIKLVWWDGTGLCLMAKKLESGGFKWPGIQDGVMRVTAAQLGALLEGL
DWRRVHGGRRPIAPQIAG
DWRRVHGGRRPIAPQIAG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1524 bp | 507 aa | 924 | 2447 | + | No |
Chemistry : DDE
ORF sequence :
MEADLPDDVEALRALIREQARELDALKVFQAEVERLKAIIEALQRHRFGRRSEQLDPDQLQLALEEVETALAEAEHAHDKASRRQAARPRKTNRGSLPAH
LERIEQVVDVEDKACPCCGDALHQIGEDVAERLDVVPTTFRVLVTRRPRYGCRSCESAVVQAPAPARIVEGGIPTEALIAQVLVAKYADHLPLYRQAQIY
ARQGIQLDRSTLADWVGRAAWYLRPLRDHILERLRRSERLFADETTAPVLDPGRGRTKTGQLWAYARDDRPWGADDPPMVAYVYAADRRGERAEAHLGDF
AGILQVDGYGGYAALARRRQQISLAFCWAHVRRKFYELADSSPVATEVLRRVALLYAIEDEVRGSSAEQRRAVRHDRSRIIVDDLRQYLDARNRQVSAKS
KIGEAIRYALNRWDGLSRFLDDGRIDLDSNTVERSIRPLALNRKNALFAGSDEGGDNWAVIATLIENCKLSGTNPNIWLTETLTSLANGHPANSVGDLMP
WSAVG
LERIEQVVDVEDKACPCCGDALHQIGEDVAERLDVVPTTFRVLVTRRPRYGCRSCESAVVQAPAPARIVEGGIPTEALIAQVLVAKYADHLPLYRQAQIY
ARQGIQLDRSTLADWVGRAAWYLRPLRDHILERLRRSERLFADETTAPVLDPGRGRTKTGQLWAYARDDRPWGADDPPMVAYVYAADRRGERAEAHLGDF
AGILQVDGYGGYAALARRRQQISLAFCWAHVRRKFYELADSSPVATEVLRRVALLYAIEDEVRGSSAEQRRAVRHDRSRIIVDDLRQYLDARNRQVSAKS
KIGEAIRYALNRWDGLSRFLDDGRIDLDSNTVERSIRPLALNRKNALFAGSDEGGDNWAVIATLIENCKLSGTNPNIWLTETLTSLANGHPANSVGDLMP
WSAVG
Blast result :
Comments
ISSphsp14 is 76% aa (transposase) similar to ISFpe1.
References
1] Maurizio Labbate (2021) Direct submission.