ISThli1

  • Family IS1595
  • Group ISNha5
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) :
Accession number : FNBW01000006
Transposition : ND
Origin : Thalassobaculum litoreum
Host : Thalassobaculum litoreum DSM 18839
DNA section
IS Length : 4402 bp

Ends


IR Length : 26

IRL : CGGTATTAGGTAGCAAATGCACCTAGGGGAAACGGAGCGATGAACACGGG
IRR : CGGTATTAGGTAGCAAATGCACCTAGAAGCCCCTCAAAATTTTTCCAGGC
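
The stated IR Length of 26 can be checked against the two end sequences listed above. The following is a minimal Python sketch, assuming both ends are written 5' to 3' reading inward from their respective termini (their shared start suggests this). The function name `leading_match_length` is introduced here for illustration only and is not part of ISfinder.

```python
# End sequences copied from the IRL and IRR fields of this record.
IRL = "CGGTATTAGGTAGCAAATGCACCTAGGGGAAACGGAGCGATGAACACGGG"
IRR = "CGGTATTAGGTAGCAAATGCACCTAGAAGCCCCTCAAAATTTTTCCAGGC"


def leading_match_length(a: str, b: str) -> int:
    """Count identical positions from the start until the first mismatch."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


if __name__ == "__main__":
    # Number of perfectly matching terminal positions shared by both ends.
    print(leading_match_length(IRL, IRR))  # 26
```

The count stops at the first mismatch, so it measures only the perfect terminal repeat; an imperfect repeat extending further inward would need a scoring scheme that tolerates mismatches.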

Insertion site


Left flank, Direct repeat, Right flank : TCCTGGACGAGCTGGCCGAATTCCCGCGTCAGGTGCTGGA
DR Length : 8

DNA sequence

CGGTATTAGGTAGCAAATGCACCTAGGGGAAACGGAGCGATGAACACGGGAATAAATCCCGCTATGGTGCGGGGATGGGGTGGATCAAGCGACACTGGAT
CGGCGTGCTCGGAGGACTCAACCTCATCGAGCCCATATTCCGCGGGATCAAGTGGCTTATTGGACTCGGCGGTGACGTCGACTTCCTTGTTGCTAGGGCG
CAGGACCCAGACTGGGTGGGCGTCATGATCGACGCGCTCCTAAATGTTCCCGGATGGCTCTCGTTGCTGCTCATCGTTCTGGGGTTCGCACTCATCCGAT
GGGACATCCGGCGAAACAATATGCGTGACCGCGAAAGCAGGGCAGTTGGGGCGGCGGGCGCGGTCGATAAAGCCGCGTCATCCGATGCCAAGACGCTTAT
GCACGGAGGCGCCGAGGCGAACCCGACGACGAGTTGGACGGTTCGCCTCGGCGACCGCTCCACTGCTGTAAGTGGGGTTTCCATCGTAGCGACATCGACA
GCCATCTACGCGCTATCCTCCGTCAAAAGCCTGAATGACGGCGCGACGAAATCTCGGATTAGAATCGACGCGCTTGAGTTAAATGGAGAGCGAAATAAAG
AATTTGGTGAGAATGGCGCCGGATACATTATTTTTTCTCTTCCTGGCTCATATCGGCCAGATGCACGACGGATAGTCCCGCTTAATGAAGGCGGTATGAT
TGTCGTTGGCACTGCATTCGAAGATGAGACAGCGCTATCGCATCCGTTCTTGGTAAGAATCGATAAAACAGGTTCGTTATTTTCTGATTTTGGCCGTGGC
GGATTTACACTGCTGCCAATGTCAGCTGGACTGAATTATGGATGGGATGCTGTTCAAGTCGGCGAAAGAATCTATACTATTTGTAGTTCTTTTGAGTCAT
CCATGTTGACCCTAGCCGTTACGCCAATAAATTTGCGGGCCGATTCATCTATAATCTTCCAGCCAATTTCGATTTTTGAGGGCGGGACTGTGTGGCCATC
GCGAATCGTTCGCGTTTTCCCAGAAACGGCATTCTACATAATAGGCAAGAGCGTCAGCGCCGCGCATCAGACCGACGGCTTTATTGCAAAGGTGGACGCA
ACAGGCAAACTAGATTCAGGATTTGGCGATTCAGGAAAAGCGCTATTGAAGACTGCGTTTAATCCTGCGTTCATAAATAACTGTTCAGGCGCAGCAGTCA
TAAGTGATGGTGTGTGCTTAATAGGCGGATTCGGTAATGACGCATTCATGGTTGCGGTGGGACCGCATGGTAACCGGGTTGCCTCATTCTCAGAGAACGG
CGTGTATTGCCTTCGTGGTGTTCATCGGACCTATGCTTCATCAGTAGCCGCATCGACACGCCTGAATGTCGTAGCGCTTAGCGGGATGGAAAGCGACGGG
AACAGCATCGAGCGGGCGTTCGTCGCGTTCACAGACCTCCGGGGAAACGACCTGAGAGTGGGCGCTGACAACATGGAGCGCGTATCCTCAGACGCCAAAA
CGAAGATGGTGGATCACGTCATCTCTGACGAAGGTGCAATCTTCGGATTGGTGCAGGAAGGTGGCACATCTCAGGGCTCACAACGAGTTGCGGTCGTGCG
CATCCCGGTGCCGCAGCCTTAGTCGTCTGTCAGGCGCACCGTAACTGCCCCCATCACACTTAGGCATTGCACAAAGAACGCCGCCGAGAAGCCGCCGCGC
GCCACCTTGTTCCGCAGGTTGACGGGCGTTTCGTTAACGCCATTCGCCGCCAATCGCTCTGATAGCTGGTCGTATGTGATGCCTCGTCGGGTCATCTCTG
CGCGGAGAATACCCTTAACCAGGCTAATCCACTTGTCGTCCGTCTTCATGGCGCCATCCATCACGTTACGACCGCCATCATATTTGTTACTAACGCCATT
GACAACGCTACAATATGCCTCCATATTCGTTACAACAGTAACGTATTTGATGGCGACAACAGATGGCCCAACACTTCCTCCTCAGCGCGAAAGCCCGGAC
ATTGTCCGTGGTTCAGGTCGCGCGGATGACGGAAGACGAGGCCCGCTCTCTGTTCCGTGCCATCCGTTGGGCCGACACTGACGGCGAGCCGGTTTGCCCG
CGCTGCGGCTGCTTTGCGGTCAACGAGTACAAGTCTCGCCCGATCTTCAAGTGCAAGGGCTGCGGCCACCAGTTCAGCATCACGTCCGGTACGATCTTCG
CCAGCCGCAAGCTGCCGGTGCGCGATATCCTGCTTGCCATCGCTCTGTTCGTGAACGGCGCCAAGGGCATGTCCGCTCTGCAGATGAGCCGGAACCTCGA
TGTTCAGTACAAGTCGGCATACGTCCTGCTCCACAAGATCCGCGAAGTCATGGCCGCTGAGACTGCCGACGCCACCCTGTCCGGTGAGGTCGAGGTTGAT
GGTGCCTATTTCGGCGGTCACGTCCGCCCGGAGAACCGCAAGGAAGACCGCAAGGATCGTCGCCTGAAAGAGAACCAGACCGGCAAGCGCCGTGTTGTCG
TTGTCATGCGTGAGCGTGGCGGGCGCACCCTGCCGTTCGTGTTCCGCTCTGAGGACCAGTCAGTCGCCACGATCCGCGCCCATGTCGCCAGCGGCACCAC
CGTCTACGCTGACGAGGCGTCCGGCTGGGACGAGCTGCACGCCACTTACGAGACCAAGCGTATCAATCACAGCCTCGCGTTCATGGATGACGGCGCCTGC
ACTAATCAGGCGGAAAGCTACTTCTCCCGTCTGCGCCGTGCCGAGTGGGGCCAGCACCATCACATCAGCGGGCGTCACCTGCACGCCTATGCCGCCGAAA
TGGCGTGGCGTGAGGATCACCGCCGGGTACCGAACGGTACGCTGTTCACCGCCGTTGTGGACGCTGCGCTCAATGCCCCGGTGTCGCGGCAGTGGAAGGG
CTATTGGCAGAAGTAAAAACCCCCGCCTGGGGCGGGGGTAAGAAGTTCAACTAATCGCCTCTAGCCTTGGCGTGGCTTGTTCTACGCGTTCCGCCACCGC
ACGGATTGTGTCTTCGTCGGTATTTTGGTGGTAAATACCCCTTACCACCCGCTCGACAAAATCATGAGCTGCCTTTTTACGGTTGCTTTCCTTGTCCATT
AGATCCATCTGATCCTCCATCCGACGTTGTTATTTCGCCCCACACTCTATCAATAATCAAATCAGCGAAATGACCAAACCAATACTTTTGTGTTAGCTTC
TTTCCGGACAAAAATTTCTGTAAATCCTCGTTAACGTTCAACACCTCCTTTAAAGAGCCTTGAATGTCGTCAGGTCGAATAGGAGACTTATCCGGAAAGT
TACCCTTAATATAATCGTACGAGCTCTGCGCAACGTCAAGCCAAACACCCTTGTCTTTCTCAAAAAATCTTACAAGACCTGCATTTTCAAGCTTTTGTTC
TACTTCAAGTGTAAGCGCCATTATTTCACCATACCGTTATGTAAGACCAAATTGCGGCGATTAACACCACAACAAACGATAGGCCTAAAAGAGCCACAGA
GGTGTAAACAATCCAATCATAGCTAAAGAAAAGATATGATATTGCAGTTGCACTAGCTAAGACTCCAAATATAAATCCGGCCAGAGAAAGTCCCGCACGA
CGCAAAAATTGGGCCCGAACGGCACCCTGATGTTCTGTTTCATAGCTGTTGTACCCGGCAATAGCGTTGCCCAAAAATACCAGCGTAAAGCCTGCAAATG
CGGAACCGCTGATGGCGACCCGGCTCGCAACATCAAGCAACATATTAGGCTCTGGACAGATACACATCGATTATCTTATCCCCCGAATCGTTATGAATTT
ACGATTCGAGTTCAATCAACGCAAGAATAATATCGATTTTTGGTGGTTTGGTAGTCATACCAGATAATGATTGTCACCCTATCTGGTGTCCGACCTCACT
CATCCCGTCACCCACTGCTTATACGTCCCTTCGCTCGGAACCTCCCGCGCCAAGCCCTTTGCGACTAGCTTGTTCATACATGACCCCACCCGCTTGCGGA
TCAGCGACACGGCCCGCGCGTCGTTCGGGTCCAGCCCGCGCCCTTTGACCACCTCTATGGCGATTTCCAGCGTGGTGACGGCGCTGTCCGCGTTCCTGAG
CGCTCCTAGGACGAACCGCTGCATTTCCCCACGATAGGCCCCGTGACGGGCCGGGAACTGCTTTGGCTGCACCAGGGCTAGATCCTGATCCGGGGCAAAC
ATGTGGATGGTGTGGTCTATGTGGTCCAGGTCAACGATCAGGTCGTTGAGGGCGCGTTGGTGGTGCTCGATCTGGCCGGATATCTCGCGGCGCTTCTCGA
TCAGTCCTGCGACTGTGTTGGGGCGTTCTGTCATGCGGCAAGCATATGAAAAGCCTGGAAAAATTTTGAGGGGCTTCTAGGTGCATTTGCTACCTAATAC
CG
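
The relationship between the Ends fields and the DNA sequence can be illustrated with a short Python sketch. It assumes, as the data suggest, that IRL is the first 50 nt of the element as written and that IRR is the reverse complement of the last 50 nt. `LEFT_END` and `RIGHT_END` are hypothetical names for the first and last 50 nt copied from the sequence above.

```python
# First and last 50 nt of the DNA sequence in this record.
LEFT_END = "CGGTATTAGGTAGCAAATGCACCTAGGGGAAACGGAGCGATGAACACGGG"
RIGHT_END = "GCCTGGAAAAATTTTGAGGGGCTTCTAGGTGCATTTGCTACCTAATAC" + "CG"

# End sequences as listed in the Ends section.
IRL = "CGGTATTAGGTAGCAAATGCACCTAGGGGAAACGGAGCGATGAACACGGG"
IRR = "CGGTATTAGGTAGCAAATGCACCTAGAAGCCCCTCAAAATTTTTCCAGGC"

# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(IRL == LEFT_END)                       # left end read as written
    print(reverse_complement(IRR) == RIGHT_END)  # right end read on the opposite strand
```

Reading the right end on the opposite strand is what makes the two termini directly comparable position by position.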
Protein section
ORF number : 5

 

ORF 1
Length : 1398 bp, 465 aa
Begin : 225
End : 1622
Strand : +
Fusion ORF : No
ORF function : Passenger Gene
Annotation : Hypothetical protein
Description :

ORF sequence :

MIDALLNVPGWLSLLLIVLGFALIRWDIRRNNMRDRESRAVGAAGAVDKAASSDAKTLMHGGAEANPTTSWTVRLGDRSTAVSGVSIVATSTAIYALSSV
KSLNDGATKSRIRIDALELNGERNKEFGENGAGYIIFSLPGSYRPDARRIVPLNEGGMIVVGTAFEDETALSHPFLVRIDKTGSLFSDFGRGGFTLLPMS
AGLNYGWDAVQVGERIYTICSSFESSMLTLAVTPINLRADSSIIFQPISIFEGGTVWPSRIVRVFPETAFYIIGKSVSAAHQTDGFIAKVDATGKLDSGF
GDSGKALLKTAFNPAFINNCSGAAVISDGVCLIGGFGNDAFMVAVGPHGNRVASFSENGVYCLRGVHRTYASSVAASTRLNVVALSGMESDGNSIERAFV
AFTDLRGNDLRVGADNMERVSSDAKTKMVDHVISDEGAIFGLVQEGGTSQGSQRVAVVRIPVPQP

 

Blast result :
ORF 2
Length : 231 bp, 76 aa
Begin : 1849
End : 1619
Strand : -
Fusion ORF : No
ORF function : Passenger Gene
Annotation : Hypothetical protein
Description :

ORF sequence :

MKTDDKWISLVKGILRAEMTRRGITYDQLSERLAANGVNETPVNLRNKVARGGFSAAFFVQCLSVMGAVTVRLTDD

 

Blast result :
ORF 3
Length : 954 bp, 317 aa
Begin : 1963
End : 2916
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MAQHFLLSAKARTLSVVQVARMTEDEARSLFRAIRWADTDGEPVCPRCGCFAVNEYKSRPIFKCKGCGHQFSITSGTIFASRKLPVRDILLAIALFVNGA
KGMSALQMSRNLDVQYKSAYVLLHKIREVMAAETADATLSGEVEVDGAYFGGHVRPENRKEDRKDRRLKENQTGKRRVVVVMRERGGRTLPFVFRSEDQS
VATIRAHVASGTTVYADEASGWDELHATYETKRINHSLAFMDDGACTNQAESYFSRLRRAEWGQHHHISGRHLHAYAAEMAWREDHRRVPNGTLFTAVVD
AALNAPVSRQWKGYWQK

 

Blast result :
ORF 4
Length : 162 bp, 53 aa
Begin : 3108
End : 2947
Strand : -
Fusion ORF : No
ORF function : Passenger Gene
Annotation : Hypothetical protein
Description :

ORF sequence :

MDLMDKESNRKKAAHDFVERVVRGIYHQNTDEDTIRAVAERVEQATPRLEAIS

 

Blast result :
ORF 5
Length : 435 bp, 144 aa
Begin : 4334
End : 3900
Strand : -
Fusion ORF : No
ORF function : Passenger Gene
Annotation : Hypothetical protein
Description :

ORF sequence :

MTERPNTVAGLIEKRREISGQIEHHQRALNDLIVDLDHIDHTIHMFAPDQDLALVQPKQFPARHGAYRGEMQRFVLGALRNADSAVTTLEIAIEVVKGRG
LDPNDARAVSLIRKRVGSCMNKLVAKGLAREVPSEGTYKQWVTG

 

Blast result :
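
The ORF coordinates in this section can be cross-checked against one another. The sketch below is a minimal Python illustration, not an ISfinder tool. It assumes that Begin and End are 1-based inclusive positions on the 4402 bp element, that minus-strand ORFs list Begin greater than End, and that the nucleotide length includes the stop codon. The names `ORFS` and `check_orf` are hypothetical.

```python
# (length_bp, length_aa, begin, end, strand) for each ORF in this record.
ORFS = {
    "ORF 1": (1398, 465, 225, 1622, "+"),
    "ORF 2": (231, 76, 1849, 1619, "-"),
    "ORF 3": (954, 317, 1963, 2916, "+"),
    "ORF 4": (162, 53, 3108, 2947, "-"),
    "ORF 5": (435, 144, 4334, 3900, "-"),
}
IS_LENGTH = 4402  # from the "IS Length" field


def check_orf(length_bp, length_aa, begin, end, strand):
    """Return True if the listed ORF fields are mutually consistent."""
    span_ok = abs(end - begin) + 1 == length_bp          # inclusive span
    strand_ok = (begin < end) == (strand == "+")         # orientation vs. order
    codon_ok = length_bp % 3 == 0 and length_bp // 3 - 1 == length_aa
    bounds_ok = 1 <= min(begin, end) and max(begin, end) <= IS_LENGTH
    return span_ok and strand_ok and codon_ok and bounds_ok


if __name__ == "__main__":
    for name, fields in ORFS.items():
        print(name, check_orf(*fields))
```

Under these assumptions all five ORFs pass: each span matches its stated length, and each protein length equals the codon count minus one for the stop codon.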
Comments
ISThli1 is 84% aa similar to ISBmo1.
References
[1] ISfinder annotation (2017)
[2] Varghese, N. and Submissions, S. (2017) Direct GenBank submission.