ISTha3
- Family IS91
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
EU327987 | ND | Thauera sp. | Thauera sp. E7 |
DNA section
IS Length : 2424 bp
Ends
oriIS : TTGAACTGAACCGCCGCTTAGGCGGTGATGCTGGGGATTCTACGCGCGCCTCATCCACATTGGATAGGGCGCGACTGGATCTTGGCATCAGCGAGTGGCC II struct. : No
terIS : GAATGCGGGGACATCCCCGGTTTCAAGGTAGGTCGGCGGCGCAGACGTCATGAAAAAACGCGGTGGCGCTTGGGTTCGCGGCCACCGCGTCAGCGGTTTA II struct. : No
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
AGGGATAAGC CTTG | GTTC AACTTTGTTT |
DNA sequence
TTGAACTGAACCGCCGCTTAGGCGGTGATGCTGGGGATTCTACGCGCGCCTCATCCACATTGGATAGGGCGCGACTGGATCTTGGCATCAGCGAGTGGCC
GGTCCCGGCTCAGCGCTCGCGACGATCGCCGTAATCGGCCGCAATCGAGTGCCGGGCGTGCGTCGGGCGTGCGGTGTCGGTCGCGTTCGCCAGTCTGGGA
ACGTGCCACGGCCTGCCGGTGCGAAAGCCGCATCAGCGAGGATTGAGACGGGGGCGCGTGAGTGTTCATCGCGGCCCATCCCGACAGCGCGGCGCCGGTG
CGGGCGGATCGTACCGGCGAGCCATGACGATCTCGACCGTGTCCCAACGCCCGACCCTACAGTGCGGGCAGCATCGCGGATCCTCCCCGCTGACGCGGGC
GAGGAAGTCCTCGGCTGAATAGATCACAGCCGCTTGTGGTGCCGGCGTCTGGAGTGCCGCACGGGCGGCGGCAAGCCGGGCCCGCTTGCGGCCGCAGGCG
AGCAGCCCGTAGTGGCGCAGGCGCTTGAAGCCGGCCGGCAGCACGTGGCGCAGGAAGCGTCCGATGAAGGTGTCGGTCGGCAGGCTGACCGTACGCTTGC
CGCCGGTCGCGTTGTCGCGCACCCGAAGGCGCACCTGCCCCACCTCGCAGCCGAGAAGGCGGTCGTTCGAGAGCGCCACGCGGTGCGTGTAACGGGCAAG
GTAGTCGAGCACTTGGGCGGGCCCGCCCGGCGGCGGCTTGGCGTAGACCACCCAGTCATGCACGAGCAGCGCACGGCGCCTGTCTTGCCATGCACCCGGC
GCACCTTGCGGATCGTCGGGCAGCTCGCCGCTGCGGTGGGCAGCGTCGAGCCGCGCGAGGAACTTGCCGCGAAACACCTTCGAGGCGGCCTGGATCGGAA
ACAGGAAGTGCGTGCCGCGTGCCGGTATGCGCCATCTGCCCTCGGCGTCGAGCCCGCCACAGGTGATCAGGGCGTGCACATGCAGGTGCACGCGCAGATC
CTGACTCCAGGTGTGGAGTACGAGGCTGAAGCCAGGCACGGCGCCCAGCCAGCGCGGGTTGGCGGCGAGCTCGAGCAAGGTGGCCGCCGCACTGTCGAAC
AACGCGCCGTAGAGCCAGCGCGGATGCCGGATCGCCAGTGGATTGAGCGCGTGGGGCAGCGTGAACACCCAGTGCGCGTAGGGCACCGGCAGCACTTCGC
GCAGGCGCGCCGCGCGCCAGGCCTCCTTGGCCCGCGTCTGGCACTGCGGGCAATGGCGGTTGCGACACGAACGCCAGACATGGCGCTCGGCGCCGCAGTT
TGCGCAACGCTCACGTAAGCCGCCGAGCGCCGCCGTACGGCAGTCGACGATCGCCCGCCAGGCCCGCGCCTGGGTGGGCGACAGAGCATGGCTCCGGCGG
TAGGTGCCGCCGTGCGTGCGCAGGATGCTGGCCAGCGTGGGCTGGTCCATGGCCGCTCGGCCTCAGAGCCGGCGCAGCAGATCGAGCGGGCTGTCGTGGG
CGACTGAACCCGGACGCGCCAGATGCAGGTAGCGCTGCGTGGTCGACAGGTGGCCGTGGCCCAGGAGCTTCGCGAGCGTGGCCAGATCGACGCCGGCCTC
GAGCAGATGGGTGGCGAAGGCGTGGCGCAGCGTGTGGATGCCGCCCGTCTTGGTGATACCAGCTCGGTCGCGGGCACGGTAGTACGCACGTTGCGCACTG
CTGATGTCGTACGGCTGGGCGGCGTCGGTGGCGCGAGGGAACAGCCAGACGCGGGGTTTGCAGATGCGCCAATAGACGCGCAGCGCTTCGAGCAGGCTCG
GGCTGAGCAGCGTGTAGCGATCCTTGCCACCCTTGCCCGCCACCACCCGGATGCACATGCGATCCGGTGCGCTGTCAATGTCGGCGACGCGTAGCGCGCA
CACCTCGGACACCCGCAGTCCGGCCGCGTAGGCCGTCATCAACAGTGTGCGCGCGCGCAGATTGGCGGCCGCCTCGAACAGTCGCGCCAGTTCCTCGCGC
GACAGGATCTCCGGCTGACGCTGCGGCGTGTGCGCGTAGGGAATCGTGATGGCCTCGGCCGTGCGGCCCAGCACGATATCGAAGAAGAACTTCAGCGCGC
ATACGGCCTGGTTCACGGTCGTGTAGGCCAGGTGCCGCTCGGTGATCCGGTGCAGCAGCCAGGCCTTGACATGCGCGCCATCGAGCCGGTCGGGGCTGCA
GTGGTAGTGCGCGGCCAACTGCGCGACCACCGACAGGTAGGCCTGCTGCGTGCGCCGTGCCAATCCGCGCAGCACCATCGCGTCGATCATCTGCTGACGT
AAGGGGTTCATGACACTTCTCCTTGAATGCGGGGACATCCCCGGTTTCAAGGTAGGTCGGCGGCGCAGACGTCATGAAAAAACGCGGTGGCGCTTGGGTT
CGCGGCCACCGCGTCAGCGGTTTA
GGTCCCGGCTCAGCGCTCGCGACGATCGCCGTAATCGGCCGCAATCGAGTGCCGGGCGTGCGTCGGGCGTGCGGTGTCGGTCGCGTTCGCCAGTCTGGGA
ACGTGCCACGGCCTGCCGGTGCGAAAGCCGCATCAGCGAGGATTGAGACGGGGGCGCGTGAGTGTTCATCGCGGCCCATCCCGACAGCGCGGCGCCGGTG
CGGGCGGATCGTACCGGCGAGCCATGACGATCTCGACCGTGTCCCAACGCCCGACCCTACAGTGCGGGCAGCATCGCGGATCCTCCCCGCTGACGCGGGC
GAGGAAGTCCTCGGCTGAATAGATCACAGCCGCTTGTGGTGCCGGCGTCTGGAGTGCCGCACGGGCGGCGGCAAGCCGGGCCCGCTTGCGGCCGCAGGCG
AGCAGCCCGTAGTGGCGCAGGCGCTTGAAGCCGGCCGGCAGCACGTGGCGCAGGAAGCGTCCGATGAAGGTGTCGGTCGGCAGGCTGACCGTACGCTTGC
CGCCGGTCGCGTTGTCGCGCACCCGAAGGCGCACCTGCCCCACCTCGCAGCCGAGAAGGCGGTCGTTCGAGAGCGCCACGCGGTGCGTGTAACGGGCAAG
GTAGTCGAGCACTTGGGCGGGCCCGCCCGGCGGCGGCTTGGCGTAGACCACCCAGTCATGCACGAGCAGCGCACGGCGCCTGTCTTGCCATGCACCCGGC
GCACCTTGCGGATCGTCGGGCAGCTCGCCGCTGCGGTGGGCAGCGTCGAGCCGCGCGAGGAACTTGCCGCGAAACACCTTCGAGGCGGCCTGGATCGGAA
ACAGGAAGTGCGTGCCGCGTGCCGGTATGCGCCATCTGCCCTCGGCGTCGAGCCCGCCACAGGTGATCAGGGCGTGCACATGCAGGTGCACGCGCAGATC
CTGACTCCAGGTGTGGAGTACGAGGCTGAAGCCAGGCACGGCGCCCAGCCAGCGCGGGTTGGCGGCGAGCTCGAGCAAGGTGGCCGCCGCACTGTCGAAC
AACGCGCCGTAGAGCCAGCGCGGATGCCGGATCGCCAGTGGATTGAGCGCGTGGGGCAGCGTGAACACCCAGTGCGCGTAGGGCACCGGCAGCACTTCGC
GCAGGCGCGCCGCGCGCCAGGCCTCCTTGGCCCGCGTCTGGCACTGCGGGCAATGGCGGTTGCGACACGAACGCCAGACATGGCGCTCGGCGCCGCAGTT
TGCGCAACGCTCACGTAAGCCGCCGAGCGCCGCCGTACGGCAGTCGACGATCGCCCGCCAGGCCCGCGCCTGGGTGGGCGACAGAGCATGGCTCCGGCGG
TAGGTGCCGCCGTGCGTGCGCAGGATGCTGGCCAGCGTGGGCTGGTCCATGGCCGCTCGGCCTCAGAGCCGGCGCAGCAGATCGAGCGGGCTGTCGTGGG
CGACTGAACCCGGACGCGCCAGATGCAGGTAGCGCTGCGTGGTCGACAGGTGGCCGTGGCCCAGGAGCTTCGCGAGCGTGGCCAGATCGACGCCGGCCTC
GAGCAGATGGGTGGCGAAGGCGTGGCGCAGCGTGTGGATGCCGCCCGTCTTGGTGATACCAGCTCGGTCGCGGGCACGGTAGTACGCACGTTGCGCACTG
CTGATGTCGTACGGCTGGGCGGCGTCGGTGGCGCGAGGGAACAGCCAGACGCGGGGTTTGCAGATGCGCCAATAGACGCGCAGCGCTTCGAGCAGGCTCG
GGCTGAGCAGCGTGTAGCGATCCTTGCCACCCTTGCCCGCCACCACCCGGATGCACATGCGATCCGGTGCGCTGTCAATGTCGGCGACGCGTAGCGCGCA
CACCTCGGACACCCGCAGTCCGGCCGCGTAGGCCGTCATCAACAGTGTGCGCGCGCGCAGATTGGCGGCCGCCTCGAACAGTCGCGCCAGTTCCTCGCGC
GACAGGATCTCCGGCTGACGCTGCGGCGTGTGCGCGTAGGGAATCGTGATGGCCTCGGCCGTGCGGCCCAGCACGATATCGAAGAAGAACTTCAGCGCGC
ATACGGCCTGGTTCACGGTCGTGTAGGCCAGGTGCCGCTCGGTGATCCGGTGCAGCAGCCAGGCCTTGACATGCGCGCCATCGAGCCGGTCGGGGCTGCA
GTGGTAGTGCGCGGCCAACTGCGCGACCACCGACAGGTAGGCCTGCTGCGTGCGCCGTGCCAATCCGCGCAGCACCATCGCGTCGATCATCTGCTGACGT
AAGGGGTTCATGACACTTCTCCTTGAATGCGGGGACATCCCCGGTTTCAAGGTAGGTCGGCGGCGCAGACGTCATGAAAAAACGCGGTGGCGCTTGGGTT
CGCGGCCACCGCGTCAGCGGTTTA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1185 bp | 394 aa | 1450 | 266 | - | No |
Chemistry : Y2
ORF sequence :
MDQPTLASILRTHGGTYRRSHALSPTQARAWRAIVDCRTAALGGLRERCANCGAERHVWRSCRNRHCPQCQTRAKEAWRAARLREVLPVPYAHWVFTLPH
ALNPLAIRHPRWLYGALFDSAAATLLELAANPRWLGAVPGFSLVLHTWSQDLRVHLHVHALITCGGLDAEGRWRIPARGTHFLFPIQAASKVFRGKFLAR
LDAAHRSGELPDDPQGAPGAWQDRRRALLVHDWVVYAKPPPGGPAQVLDYLARYTHRVALSNDRLLGCEVGQVRLRVRDNATGGKRTVSLPTDTFIGRFL
RHVLPAGFKRLRHYGLLACGRKRARLAAARAALQTPAPQAAVIYSAEDFLARVSGEDPRCCPHCRVGRWDTVEIVMARRYDPPAPAPRCRDGPR
ALNPLAIRHPRWLYGALFDSAAATLLELAANPRWLGAVPGFSLVLHTWSQDLRVHLHVHALITCGGLDAEGRWRIPARGTHFLFPIQAASKVFRGKFLAR
LDAAHRSGELPDDPQGAPGAWQDRRRALLVHDWVVYAKPPPGGPAQVLDYLARYTHRVALSNDRLLGCEVGQVRLRVRDNATGGKRTVSLPTDTFIGRFL
RHVLPAGFKRLRHYGLLACGRKRARLAAARAALQTPAPQAAVIYSAEDFLARVSGEDPRCCPHCRVGRWDTVEIVMARRYDPPAPAPRCRDGPR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
849 bp | 282 aa | 2311 | 1463 | - | No |
AG : IS91 integrase/resolvase
ORF sequence :
MNPLRQQMIDAMVLRGLARRTQQAYLSVVAQLAAHYHCSPDRLDGAHVKAWLLHRITERHLAYTTVNQAVCALKFFFDIVLGRTAEAITIPYAHTPQRQP
EILSREELARLFEAAANLRARTLLMTAYAAGLRVSEVCALRVADIDSAPDRMCIRVVAGKGGKDRYTLLSPSLLEALRVYWRICKPRVWLFPRATDAAQP
YDISSAQRAYYRARDRAGITKTGGIHTLRHAFATHLLEAGVDLATLAKLLGHGHLSTTQRYLHLARPGSVAHDSPLDLLRRL
EILSREELARLFEAAANLRARTLLMTAYAAGLRVSEVCALRVADIDSAPDRMCIRVVAGKGGKDRYTLLSPSLLEALRVYWRICKPRVWLFPRATDAAQP
YDISSAQRAYYRARDRAGITKTGGIHTLRHAFATHLLEAGVDLATLAKLLGHGHLSTTQRYLHLARPGSVAHDSPLDLLRRL
Blast result :
Comments
ISTha3 is embedded within a class 1 integron gene cassette, e.g. associated to an attC recombination site.
ISTha3 is 56% aa (ORFA : the tranposase) and 61% aa (ORFB : the recombinase) similar to ISAzo26.
ISTha3 is not bracketed by DR but displays palindromic sequences that could account for its orIS (71-141) and terIS(2382-2414).
ISTha3 is 56% aa (ORFA : the tranposase) and 61% aa (ORFB : the recombinase) similar to ISAzo26.
ISTha3 is not bracketed by DR but displays palindromic sequences that could account for its orIS (71-141) and terIS(2382-2414).
References