ISAar44

  • Family IS3
  • Group IS3
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
ND Arthrobacter arilaitensis
Arthrobacter arilaitensis RE117
DNA section
IS Length : 1599 bp

Ends


IR Length : 20/38

IRL : CCCTGTCTGTCAGGCTGGAGTGAGACCACTGATTCATAGCAAGTAAACCT
IRR : CGGAAATTTTCAAGTCAAGTGAGACCAATGAGTGTTAACCAGCTTGCATT

Insertion site


Left flankDirect repeatRight flankDR Length
CGTCCGCGCCAGCCGGCCTG0
CATATTCCTGTGGCGGTGACGTG3
GTCAGTACGCTGGGGCTACCGGT3
ACATCTGCCGTTTCGATGGAGCG3

DNA sequence

CCCTGTCTGTCAGGCTGGAGTGAGACCACTGATTCATAGCAAGTAAACCTCCCCTCAGACAGGGCGAGAACAGGTACAACCGCCATCATGAGCATCGATT
CCATCCACCCCACCCCGGGGAACATGCCGGAGAATGGAACTATGGACTCCATCGATTCGCGCCCAGAACAGCCCCGCCGCAGGAGCATCAGCCCAGCCCA
GAAACTGGCTTACCTCGACGGCTACGAACAAGCGATCCAAACCGGCGAGGGCCGAGGCTACCTGCGCGCCAACGCCTTGTATTCCTCGCAGATCGTTGAA
TGGCGCAGGCTACGCGACGCCGGAGTCCTCGGAGATTCAACCAGCAACAGCTTCCCGGCCCGGTCCAGTAGCCGGCTGAGCAAGGAACAAGCGGAAATCG
CCCGACTGAAAAAGCAGCTAGCTGCCAACGAGCAGAAGCTTGCAACCACCCAAACCGCGTTGGAGATCATGGGAAAAGCACACGCGCTCTTGGAACAAAT
CTCGAAGAGCGCGGATTCCAAATAGCAGCCCAGGACCTCCTGTCCCACGTTTATGAACAACTGGTCACCGGCGGGGTATCAACACGTCAGGCCAGCTCGC
TGGCCGGGGTCAGTAGGGCGACGATGAACCGCCGCCAAAACGCTGCCCGCGCAGGGCGAGCACTGCGGGCTACCGCTCCTCGCCCGGCACCAGCGAACAA
GCTCACCGCGCAGGAAGAAACCACGATTCTGGCAACGCTGAACAGCGAACGGTTCGTGGACCAAGCCCCAGAGCAGATCCACGCCACGCTGCTGTCTGAA
GGGACATATCTGTGCTCGGTATCGAGCATGTATCGGTTGCTGCGGAGGGCGAAGCAAGTGGCTGAACGCCGTCGACAGGCCCGTCATCCAGCACGAAAGG
TCCCCGAGCTGGTCGCGGATCAGCCTGGAGAGGTGTTCACTTGGGATATCACTAAGCTGGCCGGCCCGACGAAGGGCGTGTATTTCGACGCGTACGTGAT
GATTGATATCTATTCGCGCTATATCGTCGGTTGCCAGGTCCATACTCGTGAATCCGGGGAGCTGGCGCGTGATTTCATTGCCGGGGTGTTCGCCAAAGAC
CAGGTGCCGAAAGTTGTGCATGCCGACCGCGGCACCTCGATGACCTCGAAACCCGTGGCGGCTTTGTTGGCGGACTTGGACGTGCTCAAATCGCATTCGC
GGCCGAAGGTCAGCAATGACAACCCGTTCAGTGAGGCGTGGTTCAAGACCTTGAAGTATCTTCCGACATTCCCGCAGCGGTTTGGATCACTTGTTGATGC
CCGGGCTTTCATGGACCGGTTTGTTCAGTCGTATAACGGGCACCATCGCCATTCGGGGATTGGGTTCCATACGCCTGCTGATGTTCATTTCGGGATGACT
GGGCATGTTGATGACCAGAGGTTGGCTGCATTGCAGAGGGCTTGGGATGAGCATCCTGAGCGTTTTGGGCGGCGCAGGTTGCCGAAGAAGCTTCAGATGC
CCGAAGCGGCGTGGATCAATGAGCCGGTGAAGCGGTTGGAAGGACAAGAAATGCAAGCTGGTTAACACTCATTGGTCTCACTTGACTTGAAAATTTCCG
Recoding section
  • Recoding by frameshift
  • Frame
  • Type
  • Experimentally demonstrated

Stimulators :

  • Shine-Dalgarno sequence :
  • Secondary structure :

Recoding motif :

Protein section
ORF number : 3

 

ORF 1
LengthBeginEndStrandFusion ORF
438 bp145 aa88525+No
ORF function : Transposase
Description : First part of the transposase

ORF sequence :

MSIDSIHPTPGNMPENGTMDSIDSRPEQPRRRSISPAQKLAYLDGYEQAIQTGEGRGYLRANALYSSQIVEWRRLRDAGVLGDSTSNSFPARSSSRLSKE
QAEIARLKKQLAANEQKLATTQTALEIMGKAHALLEQISKSADSK

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
1194 bp397 aa3721565+No
ORF function : Transposase
Description : Second part of the transposase

ORF sequence :

PAEQGTSGNRPTEKAASCQRAEACNHPNRVGDHGKSTRALGTNLEERGFQIAAQDLLSHVYEQLVTGGVSTRQASSLAGVSRATMNRRQNAARAGRALRA
TAPRPAPANKLTAQEETTILATLNSERFVDQAPEQIHATLLSEGTYLCSVSSMYRLLRRAKQVAERRRQARHPARKVPELVADQPGEVFTWDITKLAGPT
KGVYFDAYVMIDIYSRYIVGCQVHTRESGELARDFIAGVFAKDQVPKVVHADRGTSMTSKPVAALLADLDVLKSHSRPKVSNDNPFSEAWFKTLKYLPTF
PQRFGSLVDARAFMDRFVQSYNGHHRHSGIGFHTPADVHFGMTGHVDDQRLAALQRAWDEHPERFGRRRLPKKLQMPEAAWINEPVKRLEGQEMQAG

 

Blast result :
ORF 3
LengthBeginEndStrandFusion ORF
1478 bp492 aa881565+Yes
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MSIDSIHPTPGNMPENGTMDSIDSRPEQPRRRSISPAQKLAYLDGYEQAIQTGEGRGYLRANALYSSQIVEWRRLRDAGVLGDSTSNSFPARSSSRLSKE
QAEIARLKKQLAANEQKLATTQTALEIMGKSTRALGTNLEERGFQIAAQDLLSHVYEQLVTGGVSTRQASSLAGVSRATMNRRQNAARAGRALRATAPRP
APANKLTAQEETTILATLNSERFVDQAPEQIHATLLSEGTYLCSVSSMYRLLRRAKQVAERRRQARHPARKVPELVADQPGEVFTWDITKLAGPTKGVYF
DAYVMIDIYSRYIVGCQVHTRESGELARDFIAGVFAKDQVPKVVHADRGTSMTSKPVAALLADLDVLKSHSRPKVSNDNPFSEAWFKTLKYLPTFPQRFG
SLVDARAFMDRFVQSYNGHHRHSGIGFHTPADVHFGMTGHVDDQRLAALQRAWDEHPERFGRRRLPKKLQMPEAAWINEPVKRLEGQEMQAG

 

Blast result :
Comments
ISAar44 is 56% aa similar to ISGur10.
There are 4 full length copies of ISAar44 encoded in the Arthrobacter arilaitensis genome.
The ISAar44 ends are not classic for this ISs family. An additional fragment is also present on each extremities of IRs (as for ISSde10).
The third ORF is a putative ORFAB transposase reconstructed in silico by a possible -1 frameshift.
References
1] ISfinder annotation (2009).