ISAau4

  • Family Tn3
  • Group
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) : TnAau4
Accession number : NC_008711
Transposition : ND
Origin : Arthrobacter aurescens
Host : Arthrobacter aurescens TC1
DNA section
IS Length : 3898 bp

Ends


IR Length : 18/21

IRL : GGGGTCTTGGTAGTAACGGCGGGAAATTGAACGCTAAGCCTCGTTGGGCT
IRR : GGAGTCTCGGTAGTAGCGGCGTGTGTGGATTTCGGCGCTGTTTGGTGGCG
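
The "18/21" figure can be reproduced from the two ends listed above. The sketch below is only an illustration and assumes that the notation means 18 identical positions over the first 21 aligned bases of IRL and IRR; the function name `ir_identity` is hypothetical and not part of any ISfinder tooling.

```python
# First 21 bases of each terminal inverted repeat, copied from the record.
IRL = "GGGGTCTTGGTAGTAACGGCG"
IRR = "GGAGTCTCGGTAGTAGCGGCG"


def ir_identity(left: str, right: str) -> tuple[int, int]:
    """Return (identical positions, compared length) for two aligned repeats."""
    span = min(len(left), len(right))
    matches = sum(a == b for a, b in zip(left[:span], right[:span]))
    return matches, span


matches, span = ir_identity(IRL, IRR)
print(f"{matches}/{span}")  # prints 18/21
```

Under that assumption, the three mismatches fall at positions 3, 8 and 16 of the repeat.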

Insertion site


Left flank + Right flank : AGAGCCTTGCAAAGTGGAGA
Direct repeat :
DR Length : 0

DNA sequence

GGGGTCTTGGTAGTAACGGCGGGAAATTGAACGCTAAGCCTCGTTGGGCTGGTCTTGGTGCTGCTCGGTAGCCTTGGTGTGTGTCGGTGGAGTTTCTGAG
CGAGGAGCAGGCGGGCGGGTTCGGGCGTTTCCTGGGTGAGCCGTCGCGGGCTGATCTGGAGCGGTTTTTCTATCTTGATGACGCTGATCTGGAGCTGATC
GCGAAGCGGCGTGGAGACCACAACCGGCTCGGCTTTGCCGTGCAGCTTGGGACGATCCGGTTCCTGGGTGTGCTGCTTGCGGATCCGCTTGATGTTCCGT
GGGGTGTTGTCGATTACCTTTCGGCCCGGCTGGGCACCGCGGACCCCTCGATCGTGAAGAAATACATGCGGCGGCGACCGACGGTTCACGAGCACGCCCG
CGAGATCCGCGCTGTTTACGGCTACCGTGACCTGGTCGGGCCTGTCCTGGAAGACCTGTCGGCGTACGTCTACTCGAGGGCGTGGACGCACGGGGAGGGT
CCGAGTGTTCTTTTCGAGCTGGCGACGGCGTGGCTTCGTCGGGAGCGTGTGCTTCTTCCCGGGGTGACGACGCTCGTGCGGGTGGTGCAGTCTGCGCGCG
AAGCGGCGCAGTCCGGGGTGTACGGCGTCGTTGCGACGGCGGCCAGCGCGGTCGATCCGCGGTTGCCGGTGGTGCTGCGCGGGCTGCTCGTGACTGACCG
GGGTGAGCGGGTCTCGCGGTTGGAGTTGCTGCGGGCGGGTCCGACGAGGGTCTCGGGCCCGGAGCTGGACAAGGCGTTGGGCAGGGTCGCTGCGTTGCGG
GCGCTCGGGGCCAGGGCGGTGGACCTGTCGGCGGTGCCGCCGGCGCGGGTGCGTGCCTTGGCCCGGTACGGGATCGGGGCCAAGGCGCAGTCGTTGCGGC
GTTTGGCCGAACCACGACGCACGGCGACGCTCGTGGCGACGGTCACGGCGTTGGAGGCCAACGCGGTCGATGATGCGCTGGACCTGTTTGATCTGCTGAT
GACCACGCGGGTGCTCGACCCCTCGCGCCGTGCGGCGGTCGCGGAGCGGTTGGCGAAGATGCCCGAACTGGAGAAGGCCTCGGGCGTTCTGGCCCGGGTC
GGAGCCCGGCTGCTTCGCGTGCTTGAGGAGTCCGGCGACCAGGTTGATGTCGCGGCAGCGTGGGCGGCTCTGGAACAGGTCGCCGCGCGGGACCGGATCG
CCGATGCGGTGGCGAAGGTGGGTGAGCTCGTCCCCGACGAGAGCGGCGCCGACGGGGCGATGCGTGGGCAGATGGCGCGTCGTTTCCGGACCGTGGCACC
GTTCCTGCGGCTGTTGGCCACCACGATCCCGTGGGGTGCGACCGCCGCCGGCCAGCCCCTGCTCGAGGCGCTTGCCCGCCTGGACGGGTTGCGGGGTCGG
CGCAAGGTGCGGCGCGAGGAAATCGACGAGGCGCTGGTGCCGCGGGCCTGGCACGCGGCGGTGTTCGGCCGCGCCGGCGGGGCCGGGGTGGACCGGGACG
CGTGGGTGGTGTGCGTGCTGGAACAGCTGCGTTCGGGCCTTCGCCGCCGCGACGTATTCGCGGTCGGCTCCACCAGGTGGGGCGACCCGCGCACCCGCCT
GCTCGACGGTCCCGCTTGGGAGGCGGTGCGCGAACAGGCGTTGACGAGCCTGAGCCTTCACGCCCCGGTGTCCGAGCACCTGCGCACTCGGACGGAGGTG
CTCGACGCCGCCTGGCGAGGGCTCGCCGCGGCGATCGGGCAGACCGGCCCGGACGGGTCCGTGCAGCTGACCGAGGGGCCGGACGGAAGGGTCAGGCTCA
CCGTCTCCCCGCTCGAGGCCTTGGAGATCCCCGACTCCCTCACGAAGCTGCGCAAGCAGGTGGCAGCGATGCTGCCGCGGGTGGACCTGCCCAAGATCCT
GCTGGAGGTCCACTCCTGGACCGGGTTCCTTCACGCCTACACCCACATCGGACAGTCCGGTTCCCGGATGAGGGATCTTCCGGTCTCGGTCGCCGCGGTC
CTGATCGCCCAGGCCTGCAACGTCGGCCTGACACCGGTCGTCGCCGAGGGGCACCCGGCGCTGACCCGGGACCGGCTGGGGCACGTGGACGCGAACTACG
TGCGCGCCGAGACCCACGCCGCCGCGAACGCCCTCCTGATCGATGCGCAGGCCGGGGTGCCGATCGCGAGCTCGTGGGGCGGCGGGCTGCTGGCCTCGGT
GGACGGGCTGCGGTTCGTCGTGCCGGTGCGCACCATCAACGCCGCGCCGAACCCGAAGTACTTCGGCCGCGGCCGGGGGCTGACCTGGTTCAACGCGGTC
AACGACCAGGCCGCCGGGATCGGCGGAGTCGTCGTGCCCGGCACGGTGCGCGACTCGCTGTACGTGCTGGACACCATGCTCAACCTCGACGGCGGCCCGA
AGCCCGAGATGGTCGCCTCCGACACCGCCTCCTACTCCGACCTGGTCTTCGGGATCTTCACGCTGCTCGGCTACCGCTTCGCACCGCGCATCGCGGACCT
GTCCGACCAACGCCTGTGGCGCACCGGGATGCCCGGCGGCGAGGCGGACTACGGGGCGCTGAACGCGGTGGCGCGCAACAAGGTCAACCTGGCGAAGATC
ACCGCCCACTGGGACGACATGACCCGCGTGGCCGCCTCGCTGGTGACCGGGACGGTCCGCGCCTACGACGTGCTGCGCATGCTCACCCGCGACAACGGGG
CACCGAACCCGCTCGGGGCGGCGATCGCGGAGTACGGGCGCATCGCCAAGACCCTGCACCTGCTGGCCCTCATCGACCCGACGGATGAGACCTACCGGCG
TTCGATCAACACCCAGCTGACCGTCCAGGAGTCACGCCACCGCCTGGCCAGGGCGATCTTCCACGGCCGGCGCGGGCAGATCCACCAGCGCTACCGAGAG
GGCCAGGAGGACCAGCTCGGCGCGCTCGGACTCGTGCTCAACGCCGTCGTGCTATGGAACACCCGCTACACCGCCGCCGCCGTCACGGCGCTCCGCGAGG
CCGGACAGGACATCCCCGAGACTGACCTCGCCCGGCCGTCACCGCTGGCCGATCAGCACATCAACATGCTCGGCCGCTACGCCTTCACCGCACCCACACC
CGACGGGCTACGACCGCTGCAAGACCCCGCAGCCGGGCAAATCGAGCGCTGACCGGCACCCGTTCGGCACATAGCATTGGCTGCAGAGCACGTTGCCGAG
AATTCGCCCCTCGCAATCGACGGTCGCGCGATCACCGGCACCACCGCCACATCGAAGGACGGCAAGATCATCGAATGCGAACAGGGGGAAGGCGCATGAC
TTCTCCGCACACAGGGCTTCTGCACCACGTGGAGCTCTGGGTTCCCGACATCGAACGCGCCGCGGCCCAGTGGGGATGGCTGCTCGAGGAGATCGGCTAT
GACCCGTTCCAGGTGTGGCCAGGCGGCCGCAGCTGGAGGCTCGCCCACACCTACATCGTGCTCGAGCAGTCACCCGACATGCGTGGCGGGAACCACGACC
GCAAGCGCCCGGGCCTCAACCATCTCGCCTTCTACGCCGGGAACCGGCAGCGTGTCGATGACCTCGCCACCGCCGCCCCGCACCACGGCTGGACACTCCT
CTTCCCGGATCGCCACCCACACGCCGGCGGACCGCAAACGTATGCCGCCTACCTGACCAACACCGACGGCTACGAGGCCGAACTGATCGCCCACGACTGA
ACCGCCCTCCACGGGCCGTGACGGTTTCCCGGCAGCACTCGACCACGTCCCCGAAAACAACCAGCCCGAGACCGTCACCGATGACGTCCTCGAAGCCGCC
GTGAGAATCCTTGCCTTCCACAACCAGCGACATCGAGCGATCACAAAACGCCACCAAACAGCGCCGAAATCCACACACGCCGCTACTACCGAGACTCC
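
IRR appears to be listed in the Ends section as the reverse complement of the right-hand terminus of this sequence, so that it reads in the same orientation as IRL. A minimal sketch of that check, using only the last 21 bases of the sequence above and the first 21 bases of IRR (the helper `reverse_complement` is written here for illustration):

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the strand."""
    return seq.translate(COMPLEMENT)[::-1]


RIGHT_END = "CGCCGCTACTACCGAGACTCC"  # last 21 bases of the DNA sequence
IRR_START = "GGAGTCTCGGTAGTAGCGGCG"  # first 21 bases of IRR as listed

print(reverse_complement(RIGHT_END) == IRR_START)  # prints True
```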
Protein section
ORF number : 3

 

ORF 1
Length : 3072 bp (1023 aa)
Begin : 81
End : 3152
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MSVEFLSEEQAGGFGRFLGEPSRADLERFFYLDDADLELIAKRRGDHNRLGFAVQLGTIRFLGVLLADPLDVPWGVVDYLSARLGTADPSIVKKYMRRRP
TVHEHAREIRAVYGYRDLVGPVLEDLSAYVYSRAWTHGEGPSVLFELATAWLRRERVLLPGVTTLVRVVQSAREAAQSGVYGVVATAASAVDPRLPVVLR
GLLVTDRGERVSRLELLRAGPTRVSGPELDKALGRVAALRALGARAVDLSAVPPARVRALARYGIGAKAQSLRRLAEPRRTATLVATVTALEANAVDDAL
DLFDLLMTTRVLDPSRRAAVAERLAKMPELEKASGVLARVGARLLRVLEESGDQVDVAAAWAALEQVAARDRIADAVAKVGELVPDESGADGAMRGQMAR
RFRTVAPFLRLLATTIPWGATAAGQPLLEALARLDGLRGRRKVRREEIDEALVPRAWHAAVFGRAGGAGVDRDAWVVCVLEQLRSGLRRRDVFAVGSTRW
GDPRTRLLDGPAWEAVREQALTSLSLHAPVSEHLRTRTEVLDAAWRGLAAAIGQTGPDGSVQLTEGPDGRVRLTVSPLEALEIPDSLTKLRKQVAAMLPR
VDLPKILLEVHSWTGFLHAYTHIGQSGSRMRDLPVSVAAVLIAQACNVGLTPVVAEGHPALTRDRLGHVDANYVRAETHAAANALLIDAQAGVPIASSWG
GGLLASVDGLRFVVPVRTINAAPNPKYFGRGRGLTWFNAVNDQAAGIGGVVVPGTVRDSLYVLDTMLNLDGGPKPEMVASDTASYSDLVFGIFTLLGYRF
APRIADLSDQRLWRTGMPGGEADYGALNAVARNKVNLAKITAHWDDMTRVAASLVTGTVRAYDVLRMLTRDNGAPNPLGAAIAEYGRIAKTLHLLALIDP
TDETYRRSINTQLTVQESRHRLARAIFHGRRGQIHQRYREGQEDQLGALGLVLNAVVLWNTRYTAAAVTALREAGQDIPETDLARPSPLADQHINMLGRY
AFTAPTPDGLRPLQDPAAGQIER

 

ORF 2
Length : 123 bp (40 aa)
Begin : 3177
End : 3299
Strand : +
Fusion ORF : No
ORF function : Passenger Gene
Annotation :
Description :

ORF sequence :

MAAEHVAENSPLAIDGRAITGTTATSKDGKIIECEQGEGA

 

ORF 3
Length : 405 bp (134 aa)
Begin : 3296
End : 3700
Strand : +
Fusion ORF : No
ORF function : Passenger Gene
Annotation : putative glyoxalase family protein
Description :

ORF sequence :

MTSPHTGLLHHVELWVPDIERAAAQWGWLLEEIGYDPFQVWPGGRSWRLAHTYIVLEQSPDMRGGNHDRKRPGLNHLAFYAGNRQRVDDLATAAPHHGWT
LLFPDRHPHAGGPQTYAAYLTNTDGYEAELIAHD
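
The ORF coordinates in this section can be cross-checked against the reported lengths. The sketch below assumes 1-based inclusive coordinates and that each nucleotide length includes the stop codon, so the protein length is one less than the codon count; the coordinate values are those read from the ORF 1 to ORF 3 tables, and the names `ORFS` and `orf_lengths` are illustrative only.

```python
IS_LENGTH = 3898  # bp, from the DNA section

# name: (begin, end, reported bp, reported aa), as read from the ORF tables
ORFS = {
    "ORF 1": (81, 3152, 3072, 1023),
    "ORF 2": (3177, 3299, 123, 40),
    "ORF 3": (3296, 3700, 405, 134),
}


def orf_lengths(begin: int, end: int) -> tuple[int, int]:
    """Return (nucleotide length, amino acid length) for inclusive coordinates."""
    bp = end - begin + 1
    aa = bp // 3 - 1  # assumption: the stop codon is counted in bp
    return bp, aa


for name, (begin, end, bp, aa) in ORFS.items():
    consistent = orf_lengths(begin, end) == (bp, aa) and end <= IS_LENGTH
    print(name, "consistent" if consistent else "inconsistent")
```

With these coordinates, ORF 3 begins at position 3296, four bases before ORF 2 ends at 3299, so the two passenger ORFs overlap by 4 bp.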

 

Comments
ORFA of ISAau4 (ORF 1, the transposase) shows 57% amino acid similarity to the transposase of ISThsp9.
ORFB (ORF 2) and ORFC (ORF 3) are passenger genes.
ORFC encodes a putative glyoxalase family protein.
References
[1] ISfinder annotation (2009)
[2] Mongodin, E.F., Shapir, N., Daugherty, S.C., DeBoy, R.T., Emerson, J.B., Shvartzbeyn, A., Radune, D., Vamathevan, J., Riggs, F., Grinberg, V., Khouri, H., Wackett, L.P., Nelson, K.E. and Sadowsky, M.J. (2006) PLoS Genet. 2 (12), e214