ISNeu3
- Family IS5
- Group IS427
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_004757 | ND | Nitrosomonas europaea | Nitrosomonas europaea ATCC 19718 |
DNA section
IS Length : 843 bp
Ends
IR Length : 14/16
IRL : GGCAAATCGACATTTACGGATTTCCAACATCTATAATTTTTGCTTTACTG
IRR : GTCGAATCGACATTTAAAGCCATAAAGTTGCTGCAGCCAGATGTATGAAG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCTTAAGAATTATTAA | CTAGGGACCGGACAGAT | 0 | |
GCCCTTGGGGTGTTAA | CTAGGGAACTACACAGC | 0 | |
ATGGCCAAATCGCTAA | CTAGTGTTTTAGATCAT | 0 | |
TCCGTCCCTAACGCGA | CTAG | ACACGATCTGACACCTC | 4 |
ATGCGGGACCAAACAG | CTAG | GCGTTTTATCAAAGCGG | 4 |
ATGGTTGGGTTTGGTA | CTAG | GTGTTTGGTCCCGCATT | 4 |
TTACTTAGAATTGTTC | CTAG | CGACATGGCTGCATCTA | 4 |
TATTCAGGAAAAAACG | CTAG | AGAAGAAGCAAGAAGAT | 4 |
DNA sequence
GGCAAATCGACATTTACGGATTTCCAACATCTATAATTTTTGCTTTACTGAAAGAGGTTTAACATTTTCAGTGAGGAATGTATGAAACGCTATGAATTGA
ACAGGGAGCAATGGCGTCGAATAGAGCCGTTTATACCGGGTAAAATTGGGGATCGTGGCCGACATGGCGCGGATAATCTATTGTTTATAAATGGTGTTTT
ATGGGTTTTACGCTCTGGTGCGCACTGGCATGACCTGCCGGAGCGGTATGGCAAATGGAAAACTGCTCACAAACGCTTTACGCGGTGGGCACAGGCCGGT
ATTTGGGAAAAGATATTCGATGTTTTGACCGAAGACCCGGACAATCAATATATTATGATCGACAGCACGATCGTGCGCGCTCATCAGCAGGCCGCCTGCG
GAAAAGGGGGGCGCGGCGTGAGGCTTTGGGGCGTTCCCGAGGCGGTCTGAGTACCAAAATCCATATGTGCGTCGATGCCTCTGGCCGACCCTTACGTTTT
ATACTGACGGGTGGACAGTGCAACGATTGCACGCAAGCTCTTGATCTGATCAGCGGGTTCAGGCCCTCGCATGTTCTGGCGGATAAAGGCTACGACAGCG
ATAATATTCTCGACGCCATTGCCTCCATGAAGGCCGTGCCTGTCATTCCACCGCGATCAAACCGCAAAATACGAAGAACGTATGATCGCGAAATCTATAA
ATGTAGAAATATCATTGAGCGCACATTCAACAAACTCAAGCATTGGCGCAGACTCTCAACCAGATATGACCGTAAAGCTATTTACTTCTCCGCCTTCATA
CATCTGGCTGCAGCAACTTTATGGCTTTAAATGTCGATTCGAC
ACAGGGAGCAATGGCGTCGAATAGAGCCGTTTATACCGGGTAAAATTGGGGATCGTGGCCGACATGGCGCGGATAATCTATTGTTTATAAATGGTGTTTT
ATGGGTTTTACGCTCTGGTGCGCACTGGCATGACCTGCCGGAGCGGTATGGCAAATGGAAAACTGCTCACAAACGCTTTACGCGGTGGGCACAGGCCGGT
ATTTGGGAAAAGATATTCGATGTTTTGACCGAAGACCCGGACAATCAATATATTATGATCGACAGCACGATCGTGCGCGCTCATCAGCAGGCCGCCTGCG
GAAAAGGGGGGCGCGGCGTGAGGCTTTGGGGCGTTCCCGAGGCGGTCTGAGTACCAAAATCCATATGTGCGTCGATGCCTCTGGCCGACCCTTACGTTTT
ATACTGACGGGTGGACAGTGCAACGATTGCACGCAAGCTCTTGATCTGATCAGCGGGTTCAGGCCCTCGCATGTTCTGGCGGATAAAGGCTACGACAGCG
ATAATATTCTCGACGCCATTGCCTCCATGAAGGCCGTGCCTGTCATTCCACCGCGATCAAACCGCAAAATACGAAGAACGTATGATCGCGAAATCTATAA
ATGTAGAAATATCATTGAGCGCACATTCAACAAACTCAAGCATTGGCGCAGACTCTCAACCAGATATGACCGTAAAGCTATTTACTTCTCCGCCTTCATA
CATCTGGCTGCAGCAACTTTATGGCTTTAAATGTCGATTCGAC
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
366 bp | 122 aa | 82 | 447 | + | No |
Description : First part of the transposase
ORF sequence :
MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWHDLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEAV
HQQAACGKGGRGVRLWGVPEAV
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
597 bp | 191 aa | 234 | 830 | + | No |
Description : Second part of the transposase
ORF sequence :
PAGAVWQMENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRKRGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCT
QALDLISGFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKCRNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
QALDLISGFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKCRNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
749 bp | 249 aa | 82 | 830 | + | Yes |
Chemistry : DDE
ORF sequence :
MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWHDLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLISGFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRT
YDREIYKCRNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
HQQAACGKGGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLISGFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRT
YDREIYKCRNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
Blast result :
Comments
ISNeu3 is 67% aa similar to ISAli12B.
ISNeu3 was found by screening completely sequenced genomes for seqences homologous to the ISScr1 transposase using BLASTP. It codes two overlapping ORFs. The overlap displayed two possible -1 frameshift sites, suggesting putative fusion peptides. Multiple alignments revealed conserved D(N3) and E(C1) amino acids separated by 44 residues in the C-terminal ORF. The copy number in Nitrosomonas europaea ATCC 19718 is 8.
ISNeu3 was found by screening completely sequenced genomes for seqences homologous to the ISScr1 transposase using BLASTP. It codes two overlapping ORFs. The overlap displayed two possible -1 frameshift sites, suggesting putative fusion peptides. Multiple alignments revealed conserved D(N3) and E(C1) amino acids separated by 44 residues in the C-terminal ORF. The copy number in Nitrosomonas europaea ATCC 19718 is 8.
References
1] Chain,P., Lamerdin,J., Larimer,F., Regala,W., Lao,V., Land,M., Hauser,L., Hooper,A., Klotz,M., Norton,J., Sayavedra-Soto,L., Arciero,D., Hommes,N., Whittaker,M. and Arp,D.(2003)J. Bacteriol. 185,2759-2773