ISAzvi3
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP001157 | ND | Azotobacter vinelandii | Azotobacter vinelandii |
DNA section
IS Length : 1424 bp
Ends
IR Length : 13
IRL : agatggtATGGACGCCTCCCCCGGCAGCGGGCCGATTGCAACACCTCTGC
IRR : ----tatATGGACGCCTCCCGGAACTCAAGAGGTAGGCCTGCTGGCGTTC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|
DNA sequence
AGATGGTATGGACGCCTCCCCCGGCAGCGGGCCGATTGCAACACCTCTGCATCGATAGAGGATCCTCGCTGCTCGGCATCGGAGTGCCACAGTGAAGCAT
GTGAATGGCAAGCATCACTTGAGCAAGAGGATCCATCATGGAACATAGCGTGATCGGCATGGACATCGCCAAGAAGGTTTTCCAACTACATACCGTCGAC
CAGAACACCGGAAAAATCGAGCGCATCAAACTGCGGCGCGATGAAGTCCTGGCGTTTTTCGCCCGGCGCCAACCCTGCCTGGTGGCGATCGAGGCGTGCG
GCAGCGCCCACTGGTGGGCTCGTCAGCTTCGGCAGCAGGGGCATGAAGTGCGCTTGCTGGCCCCTCGCTCAGTGCGTCCCTTCGTACTGCGCAACAAGAC
CGATGCGGCCGACGCCCAGGCCATCTGGACAGCGGTACAGCAGCCCGGCGCCTGCCTGGTCGCCATCAAGCAAGCGGACCAGCAGGCCATTCTCTCGCTG
CACCGTATTCGTGCCCAACTGCTGAAGTTTCGCATCATGCAGAGCAATGGTTTGCGAGGGCTGTTTTACGAGTTCGGGATTGTGCTGCCGGAGGGGTATG
CCCCCTTGTCCAAGGCTATGCCGGAAGCCTTCGCCGAGGCGGAAAATCAGGTTCCTCCCGTACTTCTGGAGAGCCTGCGCGAGCAGTGGGCGCGGGTGCT
CAGACTGGAGGAGGAAATCCAGGTGATCGAACTCCGCCTGAAGCGCTGCCTGCGCGAGAACCCCGACTGCCAGAAAATTGCAGAGATCCCCGGCATCGGT
TTATTGACGGCCACCGCTGCGGTTGCCTCGCTGGGAGACGCGACGACCTTTCGTTCCGGCCGACAGTTCGCCGCCTGGCTGGGACTGGTTCCCCGGCAGA
CCGGTACCGGCGGTCGAGTCCGGCAACTGGGGCTGAGCAAGCGCGGTGACAGCTACCTGCGCATGCTGTTGATGCACGGGGCTCGCTCGATCCTCGCCAG
GAGCCAGAAATCCGGATGGCTTGAGCGCTTGCTGGCCAGGCGGCCTCACAACGTGGCGGTCGCCGCGGTGGCCAACAAGCTGGCCCGAACCCTGTGGGCC
GTGCTGGCCAAGGGCTCTCCGTACCGGGCCGAGCGCTTTACCGCATGCCCTGCGGGGCATTGAGAAACGGCAGCCAACAGGCTTCACAAGGAGCGCAGGC
GGTAAGCACGTGATGGCGGAACAGGTCAGACCGTGGCGAGGTAAACCTGATAAGGAAGATGAGCCAGGGCGCTGCCCACAGCTCGATTTTCAGATGCGGG
CCTCGCCAGCGGATTCCATCAGGGCCAGCAGGTTTTTCGGCCTGCACAAACAGGCCGGATATAAGACTGCTCCTACCTGAACGCCAGCAGGCCTACCTCT
TGAGTTCCGGGAGGCGTCCATATA
GTGAATGGCAAGCATCACTTGAGCAAGAGGATCCATCATGGAACATAGCGTGATCGGCATGGACATCGCCAAGAAGGTTTTCCAACTACATACCGTCGAC
CAGAACACCGGAAAAATCGAGCGCATCAAACTGCGGCGCGATGAAGTCCTGGCGTTTTTCGCCCGGCGCCAACCCTGCCTGGTGGCGATCGAGGCGTGCG
GCAGCGCCCACTGGTGGGCTCGTCAGCTTCGGCAGCAGGGGCATGAAGTGCGCTTGCTGGCCCCTCGCTCAGTGCGTCCCTTCGTACTGCGCAACAAGAC
CGATGCGGCCGACGCCCAGGCCATCTGGACAGCGGTACAGCAGCCCGGCGCCTGCCTGGTCGCCATCAAGCAAGCGGACCAGCAGGCCATTCTCTCGCTG
CACCGTATTCGTGCCCAACTGCTGAAGTTTCGCATCATGCAGAGCAATGGTTTGCGAGGGCTGTTTTACGAGTTCGGGATTGTGCTGCCGGAGGGGTATG
CCCCCTTGTCCAAGGCTATGCCGGAAGCCTTCGCCGAGGCGGAAAATCAGGTTCCTCCCGTACTTCTGGAGAGCCTGCGCGAGCAGTGGGCGCGGGTGCT
CAGACTGGAGGAGGAAATCCAGGTGATCGAACTCCGCCTGAAGCGCTGCCTGCGCGAGAACCCCGACTGCCAGAAAATTGCAGAGATCCCCGGCATCGGT
TTATTGACGGCCACCGCTGCGGTTGCCTCGCTGGGAGACGCGACGACCTTTCGTTCCGGCCGACAGTTCGCCGCCTGGCTGGGACTGGTTCCCCGGCAGA
CCGGTACCGGCGGTCGAGTCCGGCAACTGGGGCTGAGCAAGCGCGGTGACAGCTACCTGCGCATGCTGTTGATGCACGGGGCTCGCTCGATCCTCGCCAG
GAGCCAGAAATCCGGATGGCTTGAGCGCTTGCTGGCCAGGCGGCCTCACAACGTGGCGGTCGCCGCGGTGGCCAACAAGCTGGCCCGAACCCTGTGGGCC
GTGCTGGCCAAGGGCTCTCCGTACCGGGCCGAGCGCTTTACCGCATGCCCTGCGGGGCATTGAGAAACGGCAGCCAACAGGCTTCACAAGGAGCGCAGGC
GGTAAGCACGTGATGGCGGAACAGGTCAGACCGTGGCGAGGTAAACCTGATAAGGAAGATGAGCCAGGGCGCTGCCCACAGCTCGATTTTCAGATGCGGG
CCTCGCCAGCGGATTCCATCAGGGCCAGCAGGTTTTTCGGCCTGCACAAACAGGCCGGATATAAGACTGCTCCTACCTGAACGCCAGCAGGCCTACCTCT
TGAGTTCCGGGAGGCGTCCATATA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1026 bp | 341 aa | 138 | 1163 | + | No |
Chemistry : DEDD
ORF sequence :
MEHSVIGMDIAKKVFQLHTVDQNTGKIERIKLRRDEVLAFFARRQPCLVAIEACGSAHWWARQLRQQGHEVRLLAPRSVRPFVLRNKTDAADAQAIWTAV
QQPGACLVAIKQADQQAILSLHRIRAQLLKFRIMQSNGLRGLFYEFGIVLPEGYAPLSKAMPEAFAEAENQVPPVLLESLREQWARVLRLEEEIQVIELR
LKRCLRENPDCQKIAEIPGIGLLTATAAVASLGDATTFRSGRQFAAWLGLVPRQTGTGGRVRQLGLSKRGDSYLRMLLMHGARSILARSQKSGWLERLLA
RRPHNVAVAAVANKLARTLWAVLAKGSPYRAERFTACPAGH
QQPGACLVAIKQADQQAILSLHRIRAQLLKFRIMQSNGLRGLFYEFGIVLPEGYAPLSKAMPEAFAEAENQVPPVLLESLREQWARVLRLEEEIQVIELR
LKRCLRENPDCQKIAEIPGIGLLTATAAVASLGDATTFRSGRQFAAWLGLVPRQTGTGGRVRQLGLSKRGDSYLRMLLMHGARSILARSQKSGWLERLLA
RRPHNVAVAAVANKLARTLWAVLAKGSPYRAERFTACPAGH
Blast result :
Comments
No uninterrupted target sequence was found, so the ends of the IS have been defined by analogy with other IS1111 family elements, assuming that 7 nt separate IRl from the left-hand end of the element and 3 nt separate IRr from the right-hand end. The first residue of the sequence may in fact belong as the final residue, giving 6 nt on the left and 4 on the right.
The transposase protein is 87% identical to that of IS1383 and 42% identical to that of IS1111.
The transposase protein is 87% identical to that of IS1383 and 42% identical to that of IS1111.
References
1] US Department of Energy Joint Genome Institute http://jgi.doe.gov/JGI_microbial/html/
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384