ISAzvi3

  • Family IS110
  • Group IS1111
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) :
Accession number : CP001157
Transposition : ND
Origin : Azotobacter vinelandii
Host : Azotobacter vinelandii
DNA section
IS Length : 1424 bp

Ends


IR Length : 13

IRL : agatggtATGGACGCCTCCCCCGGCAGCGGGCCGATTGCAACACCTCTGC
IRR : ----tatATGGACGCCTCCCGGAACTCAAGAGGTAGGCCTGCTGGCGTTC

Insertion site


Left flank    Direct repeat    Right flank    DR Length

DNA sequence

AGATGGTATGGACGCCTCCCCCGGCAGCGGGCCGATTGCAACACCTCTGCATCGATAGAGGATCCTCGCTGCTCGGCATCGGAGTGCCACAGTGAAGCAT
GTGAATGGCAAGCATCACTTGAGCAAGAGGATCCATCATGGAACATAGCGTGATCGGCATGGACATCGCCAAGAAGGTTTTCCAACTACATACCGTCGAC
CAGAACACCGGAAAAATCGAGCGCATCAAACTGCGGCGCGATGAAGTCCTGGCGTTTTTCGCCCGGCGCCAACCCTGCCTGGTGGCGATCGAGGCGTGCG
GCAGCGCCCACTGGTGGGCTCGTCAGCTTCGGCAGCAGGGGCATGAAGTGCGCTTGCTGGCCCCTCGCTCAGTGCGTCCCTTCGTACTGCGCAACAAGAC
CGATGCGGCCGACGCCCAGGCCATCTGGACAGCGGTACAGCAGCCCGGCGCCTGCCTGGTCGCCATCAAGCAAGCGGACCAGCAGGCCATTCTCTCGCTG
CACCGTATTCGTGCCCAACTGCTGAAGTTTCGCATCATGCAGAGCAATGGTTTGCGAGGGCTGTTTTACGAGTTCGGGATTGTGCTGCCGGAGGGGTATG
CCCCCTTGTCCAAGGCTATGCCGGAAGCCTTCGCCGAGGCGGAAAATCAGGTTCCTCCCGTACTTCTGGAGAGCCTGCGCGAGCAGTGGGCGCGGGTGCT
CAGACTGGAGGAGGAAATCCAGGTGATCGAACTCCGCCTGAAGCGCTGCCTGCGCGAGAACCCCGACTGCCAGAAAATTGCAGAGATCCCCGGCATCGGT
TTATTGACGGCCACCGCTGCGGTTGCCTCGCTGGGAGACGCGACGACCTTTCGTTCCGGCCGACAGTTCGCCGCCTGGCTGGGACTGGTTCCCCGGCAGA
CCGGTACCGGCGGTCGAGTCCGGCAACTGGGGCTGAGCAAGCGCGGTGACAGCTACCTGCGCATGCTGTTGATGCACGGGGCTCGCTCGATCCTCGCCAG
GAGCCAGAAATCCGGATGGCTTGAGCGCTTGCTGGCCAGGCGGCCTCACAACGTGGCGGTCGCCGCGGTGGCCAACAAGCTGGCCCGAACCCTGTGGGCC
GTGCTGGCCAAGGGCTCTCCGTACCGGGCCGAGCGCTTTACCGCATGCCCTGCGGGGCATTGAGAAACGGCAGCCAACAGGCTTCACAAGGAGCGCAGGC
GGTAAGCACGTGATGGCGGAACAGGTCAGACCGTGGCGAGGTAAACCTGATAAGGAAGATGAGCCAGGGCGCTGCCCACAGCTCGATTTTCAGATGCGGG
CCTCGCCAGCGGATTCCATCAGGGCCAGCAGGTTTTTCGGCCTGCACAAACAGGCCGGATATAAGACTGCTCCTACCTGAACGCCAGCAGGCCTACCTCT
TGAGTTCCGGGAGGCGTCCATATA
Protein section
ORF number : 1

 

ORF 1
Length : 1026 bp (341 aa)
Begin : 138
End : 1163
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DEDD

ORF sequence :

MEHSVIGMDIAKKVFQLHTVDQNTGKIERIKLRRDEVLAFFARRQPCLVAIEACGSAHWWARQLRQQGHEVRLLAPRSVRPFVLRNKTDAADAQAIWTAV
QQPGACLVAIKQADQQAILSLHRIRAQLLKFRIMQSNGLRGLFYEFGIVLPEGYAPLSKAMPEAFAEAENQVPPVLLESLREQWARVLRLEEEIQVIELR
LKRCLRENPDCQKIAEIPGIGLLTATAAVASLGDATTFRSGRQFAAWLGLVPRQTGTGGRVRQLGLSKRGDSYLRMLLMHGARSILARSQKSGWLERLLA
RRPHNVAVAAVANKLARTLWAVLAKGSPYRAERFTACPAGH
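
The ORF coordinates and the protein sequence can be cross-checked against the DNA sequence. The following is a minimal sketch, assuming the standard genetic code and 1-based inclusive coordinates; the two nucleotide strings are copied from the DNA sequence of this entry (the start and the end of the ORF), and all function and variable names are illustrative only.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# ORF coordinates as listed in this entry (assumed 1-based, inclusive).
ORF_BEGIN, ORF_END = 138, 1163

# Fragments copied from the DNA sequence of this entry.
ORF_START_NT = "ATGGAACATAGCGTGATCGGCATGGACATCGCCAAGAAGGTTTTCCAACTACATACCGTCGAC"  # nt 138-200
ORF_END_NT = "CCTGCGGGGCATTGA"  # nt 1149-1163, including the stop codon


def translate(dna: str) -> str:
    """Translate complete codons of `dna` in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


orf_length_bp = ORF_END - ORF_BEGIN + 1   # inclusive span of the ORF
protein_length_aa = orf_length_bp // 3 - 1  # one codon is the stop codon

print(orf_length_bp, protein_length_aa)
print(translate(ORF_START_NT))
print(translate(ORF_END_NT))
```

The translated fragments should agree with the first and last residues of the ORF sequence listed above, and the computed lengths with the Length field.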

 

Comments
No uninterrupted target sequence was found, so the ends of the IS were defined by analogy with other IS1111 family elements, assuming that 7 nt separate IRL from the left-hand end of the element and 3 nt separate IRR from the right-hand end. Alternatively, the first residue of the sequence shown may in fact belong at the right-hand end as the final residue of the element, which would leave 6 nt on the left and 4 nt on the right.
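
The two possible placements of the ends can be checked directly against the terminal sequences. The following is a minimal sketch, assuming that the 13-nt inverted repeat is read inward from each end and that the right-hand end is read as its reverse complement; the two strings are copied from the start and the end of the DNA sequence in this entry, and the helper names are illustrative only.

```python
# Terminal sequences copied from the DNA sequence of this entry.
LEFT_END = "AGATGGTATGGACGCCTCCCCCGGCAGCGG"  # first 30 nt of the element
RIGHT_END = "TGAGTTCCGGGAGGCGTCCATATA"       # last 24 nt of the element
IR_LENGTH = 13                               # IR Length field of this entry


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def inner_repeat(inward_seq: str, offset: int, length: int = IR_LENGTH) -> str:
    """Return `length` nt starting `offset` nt in from the element end."""
    return inward_seq[offset:offset + length]


# Ends as defined in this entry: 7 nt before IRL, 3 nt before IRR.
irl = inner_repeat(LEFT_END, 7)
irr = inner_repeat(reverse_complement(RIGHT_END), 3)

# Alternative: the first residue is moved to the right-hand end,
# which gives 6 nt before IRL and 4 nt before IRR.
alt_irl = inner_repeat(LEFT_END[1:], 6)
alt_irr = inner_repeat(reverse_complement(RIGHT_END + LEFT_END[0]), 4)

print(irl, irr, irl == irr)
print(alt_irl, alt_irr, alt_irl == alt_irr)
```

Both placements recover the same 13-nt repeat, so the sequence alone does not distinguish between them.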

The transposase protein is 87% identical to that of IS1383 and 42% identical to that of IS1111.
References
[1] US Department of Energy Joint Genome Institute, http://jgi.doe.gov/JGI_microbial/html/
[2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384