ISAzvi4
- Family IS110
- Group IS1111
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP001157 | ND | Azotobacter vinelandii | Azotobacter vinelandii |
DNA section
IS Length : 1351 bp
Ends
IR Length : 12
IRL : taatgagATGGTCACCCCCACACCCTGCGAGGGCCATGCTTTCGCAGGGT
IRR : ----tctATGGTCACCCCCGTTTTTGCAATAGTGATCAGCGATGAGTATT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
DNA sequence
TAATGAGATGGTCACCCCCACACCCTGCGAGGGCCATGCTTTCGCAGGGTGTTTCATTTTGGAGACCATCACCATGAGCGACGTGGCACTGATAGGAATC
GACCTCGGCAAGCACAGCTTCCATCTTCACGGACAAGACAAGATCGGCCGAGAGGTCTTTCGCAGGAAGTCGTCGCGTCAGCAGATGATGCGGTTCTTCG
GCAACCTCCCGGCTTGCACCGTGGTGATGGAGGCTTGTGCCGGCTCCCATTTCATTGCCCGGGAGCTGATGGCCTTGGGGCATCAGGCCAAGCTGATCTC
CCCGCAATTCGTCCGGCCCTTCGTCAAGAGCAACAAGAACGACTTCGTGGATGCCGAAGCGATCTGCGAAGCCGCTTCACGTTCCTCAATGCGCTTCGTG
ACGCCCAGGACCGAGGCCCAGCAGACGCTCTCGGTCCTGCATCGCATGCGGGAATCCCTGGTGCGGGATCGGACCAAAACCGCCAATCAGATGCATGGTT
TTCTGCTGGAGTTCGGAGTCAGTCTGCCCAAAGGGTTGGCAGTCATGAGGCGCCTGCCCGGTGTCCTGAGCGAGCGGCCACTGCCTGTTCGGTTAAGCCT
GCTGCTGCAGCGCCTGCACGCGCACTTCGACTACCTCGACAAACAGATCAGGGAACTGGACAAGGAGGTTGCAGGCCAGCTCGCGAGCGATGACCTGGGC
AGCCGCTTGCTGACCATCCCATGCATCGGCCCCATCACCGCCAGCCTATTGGCTGCCGAAATAGGCGATGGCAGGCAATACGGATGCAGCCGGGACTTCG
CCGCGTCCGTCGGTCTGGTGCCCAGGCAGTACAGCACCGGTGGCAAAGCCAGGCTACTGGGCATCAGCAAGCGCGGCGACAAGAGGCTCAGGCAACTGCT
GGTCCAGTGCGCCAGGGTTTACCTGCAACGACTGGAGCATCAGCGCGGCGCCCTGGCCGACTGGGTACGCGCCCTGCGCAGTCGCCGCCACTCGAACGTG
GTCGCCTGCGCCTTGGCCAACAAGTTGGCTCGCATCGCCTGGTCGATCGCGGCCAACCACACGCAGTTCGAAGCGGGGCCAGACGCCTCGGCGGCCTGAC
CTTGCCGTTGCGCAGTACCCCGAACACCTTCCAGGTTTTGCGATAGCTGGACAAGCGATGACGTGAACGGCCCACCGGCCTGGCGAAAAACCTGGCGTAA
AAATCGGCTTTCGAAGCCGCCGACTTTTTCAGGATCGCCAGGCGCGACTCTCATCGTGGCGCGGAGCATGCTCCAAACGGACGCCGGATAGATTTAGGCA
AGCCCAATACTCATCGCTGATCACTATTGCAAAAACGGGGGTGACCATAGA
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1026 | 341 | 74 | 1099 | + | No |
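The ORF lengths follow from the Begin and End coordinates. As a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (neither is stated explicitly in this record), the arithmetic can be checked as follows. The variable names are illustrative only.

```python
# Coordinates copied from the ORF 1 table (assumed 1-based, inclusive).
orf_begin, orf_end = 74, 1099

# First 100 nt of the IS, copied from the "DNA sequence" block.
first_100_nt = (
    "TAATGAGATGGTCACCCCCACACCCTGCGAGGGCCATGCTTTCGCAGGGT"
    "GTTTCATTTTGGAGACCATCACCATGAGCGACGTGGCACTGATAGGAATC"
)

orf_length_bp = orf_end - orf_begin + 1   # inclusive span
codons = orf_length_bp // 3               # assumed to include the stop codon
protein_length_aa = codons - 1            # drop the stop codon

# Codon found at the Begin coordinate (convert to 0-based for slicing).
start_codon = first_100_nt[orf_begin - 1 : orf_begin + 2]

print(orf_length_bp, protein_length_aa, start_codon)
```

Under these assumptions the span is 1026 bp and 341 aa, matching the table, and the codon at position 74 is ATG.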
Chemistry : DEDD
ORF sequence :
MSDVALIGIDLGKHSFHLHGQDKIGREVFRRKSSRQQMMRFFGNLPACTVVMEACAGSHFIARELMALGHQAKLISPQFVRPFVKSNKNDFVDAEAICEA
ASRSSMRFVTPRTEAQQTLSVLHRMRESLVRDRTKTANQMHGFLLEFGVSLPKGLAVMRRLPGVLSERPLPVRLSLLLQRLHAHFDYLDKQIRELDKEVA
GQLASDDLGSRLLTIPCIGPITASLLAAEIGDGRQYGCSRDFAASVGLVPRQYSTGGKARLLGISKRGDKRLRQLLVQCARVYLQRLEHQRGALADWVRA
LRSRRHSNVVACALANKLARIAWSIAANHTQFEAGPDASAA
Blast result :
Comments
The IRs of this IS are not at its termini. In the IS sequence as given, 7 nt separate IRL from the left-hand end of the element and 3 nt separate IRR from the right-hand end. The first residue of the sequence may in fact belong at the right-hand end as the final residue, giving 6 nt on the left and 4 on the right. Some uninterrupted target sequences predict that the first T of the given sequence is missing, giving terminal sequences totalling 9 rather than 10 bp.
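The 7 nt and 3 nt offsets can be reproduced from the terminal sequences in this record. The sketch below is illustrative only: the two end fragments are copied from the DNA sequence block, the 12-nt repeat is the upper-case portion of IRL and IRR, and `reverse_complement` is a helper written for this example.

```python
# Terminal fragments copied from the "DNA sequence" block of this record.
LEFT_END = "TAATGAGATGGTCACCCCCACACC"    # first 24 nt of the IS
RIGHT_END = "GCAAAAACGGGGGTGACCATAGA"    # last 23 nt of the IS
IR = "ATGGTCACCCCC"                      # 12-nt repeat (upper case in IRL/IRR)

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

# Distance from each end of the element to the start of the repeat.
left_offset = LEFT_END.find(IR)
right_offset = reverse_complement(RIGHT_END).find(IR)

print(left_offset, right_offset)  # prints: 7 3
```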
The transposase protein is 72% identical to that of ISBcen1 and 38% identical to that of IS1111.
By analogy with IS4321, ISAzvi4 may exist in a circular form. If the terminal sequences total 10 bp, a -10 region is created by these abutted sequences at an appropriate distance from a -35 region located just inside the right-hand end of the element, forming a promoter.
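To make the abutted terminal sequences concrete, the sketch below joins the right-hand spacer to the left-hand spacer as they would meet in a circular form. It assumes the junction is read on the top strand of the sequence given in this record, and it does not attempt to identify the -10 hexamer itself. The function and variable names are hypothetical.

```python
# Spacers read from the top strand of the sequence in this record.
LEFT_SPACER = "TAATGAG"   # 7 nt between the left-hand end and IRL
RIGHT_SPACER = "AGA"      # 3 nt between IRR and the right-hand end

def junction(right_spacer, left_spacer):
    """Sequence between the two IRs once the ends are abutted in a circle."""
    return right_spacer + left_spacer

# Ends exactly as given in this record.
as_given = junction(RIGHT_SPACER, LEFT_SPACER)
# Alternative 1: the first residue (T) belongs at the right-hand end instead.
t_moved = junction(RIGHT_SPACER + LEFT_SPACER[0], LEFT_SPACER[1:])
# Alternative 2: the first T is not part of the element.
t_absent = junction(RIGHT_SPACER, LEFT_SPACER[1:])

for name, seq in [("as given", as_given), ("T moved", t_moved), ("T absent", t_absent)]:
    print(f"{name}: {seq} ({len(seq)} bp)")
```

Moving the first T to the right-hand end leaves the circular junction unchanged, so only the variant in which that T is absent alters the junction, shortening it from 10 to 9 bp.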
References
1] US Department of Energy Joint Genome Institute http://jgi.doe.gov/JGI_microbial/html/
2] Partridge and Hall (2003) J. Bacteriol. 185, 6371-6384