ISGur5
- Family IS3
- Group IS3
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_009483 | ND | Geobacter uraniireducens | Geobacter uraniireducens Rf4 |
DNA section
IS Length : 1760 bp
Ends
IR Length : 21/28
IRL : CCCCCTGTGTCAGAATAGTTGTCGCCTCAGATGGGTTGTTTAGCCACGGT
IRR : CGGCGGGTGTCAAGGTAGTTGTCGCCTCTGTTGAATTTTTATGCGGCGTC
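The figure 21/28 can be read as 21 identical positions over the first 28 bases of the two ends. That reading is an assumption; the record does not define the notation. A minimal Python sketch of the count, using the IRL and IRR strings exactly as listed above (the function name `count_identities` is hypothetical):

```python
# IRL and IRR as listed in this record, each written reading inward
# from its own end of the element.
IRL = "CCCCCTGTGTCAGAATAGTTGTCGCCTCAGATGGGTTGTTTAGCCACGGT"
IRR = "CGGCGGGTGTCAAGGTAGTTGTCGCCTCTGTTGAATTTTTATGCGGCGTC"


def count_identities(left: str, right: str, span: int) -> int:
    """Count ungapped positions where both ends carry the same base."""
    return sum(1 for a, b in zip(left[:span], right[:span]) if a == b)


matches = count_identities(IRL, IRR, 28)
print(f"{matches}/28")  # prints 21/28
```

The comparison is ungapped and position by position; a gapped alignment could give a different count.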
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CAATGTATCC | GATC | TGTCCAATAG | 4 |
GGGAATACAG | CTAT | TATCCCGATG | 4 |
CGTGAGTCAA | GTTT | TGCTTTAACA | 4 |
TGTTAAAGCA | AGAC | ATGACTCACG | 4 |
TGTCCAGTTA | AAAC | CGGAATCGGT | 4 |
CTACTAGGTC | CAAG | TGGCTCCACC | 4 |
GGATGCCTTC | GTTC | TGTACAGCGA | 4 |
CCGTTTCGAT | CC | AATGCCCGAC | 2 |
DNA sequence
CCCCCTGTGTCAGAATAGTTGTCGCCTCAGATGGGTTGTTTAGCCACGGTCTTTCGAGCTGAATTGCAACGAGCGTAGCGAGCTTGCGATTCAGCTCGAA
AGACCGGGACCGGCAAAAGCCGGCGACCCGTATTTTTTCAACATCTGGGGGCAGCAAGGATCTATAATGACATACACACAAGGATTCAAATCCAGCCTGG
TTCGCAAGATGGCCTGCCCTAATGGCGTCTCGGCCTCAGCTCTATCCCGAGAAGTGGGAATTCCCCAACAATCACTCTCTCGCTGGCTTCGAGACGCAAA
TGTCGTGAATGAATCTGGTAACTCCCCAATTTCTGAAGCACCATGGAGAACAATGTCCCCAAAGCGCCCACAAGACAAGTCTGCCGAAGAAAAGCTCAAA
ATCGTCATCGAAGCCGAGACGGTTGCCGAAGATCAACTTGGCGCCTTCCTGCGTCGCAATGGAATTCATGAAGCACAGTTACGCGAGTGGCGCGCCATGA
TGCTTTCGGGGCTGCAAAAACCGTCTCGGCCTTCCTCAAAAACATCAGAGGAAACTCGCAAAATTCATGCACTGGAAAAAGAATTACTGCGCAAAGATAA
AGCATTGGCCGAGGCCGCCGCGATTTTGATCCTCAAAAAAAAAGTCCAATCGATCTGGGGGGGCGAGGACGAGCCCACGGACAAGAGGAACGGCAGATGA
TCGGGCAACTTATTGAAGAGGCAACCCATTCCGGAGCGAGACTTGAATCGGCCGTTGTCGCCATCGGGCTGAGCATTCGCACGTTCCAGCGTTGGAATCT
CCAGGATGACGGAGCAGATCGGCGACATGGGCCAGTCGCAGAACCGGCGAACAAGCTGGCTCCGTCGGAGCGGCAGAACATCATCGATATTGCAAACTCC
CCGGCGTTTCGGGATATTTCGCCGAAACAGATCGTGCCGCAATTGGCCGATCAGGGAATATACGTGGCGTCGGAATCGAGTTTCTACCGGGTGCTCAAAG
ACGATGGGCTGATGACTCACCGGGAACCCTCCCGGCCCGCCACCAGCCGCAAGCCAAAAGAACATGTGGCTACCGGTCCTTGCCAGGTCTGGTCATGGGA
CATCACCTACTTGAAGAGCCCGATTCTTGGGCAGTTCTTTTATCTCTACATGATCATGGATGTCTGGAGCCGCAAGATCGTAGCGTCCACGGTGTTTTTA
AAAGAATCCAATGACTATAGCGCCAGGCTGTTTCTGAAAGCCTGTATCAGGCTCGGCATCAATCCGGAAGGCCTGATTCTTCACTCCGACAACGGCGGCC
CGATGAAAGGGGCGACCATGCTGGCTACTCTCCAGCGCCTGGGCGTAATTCCTTCCTTCAGCAGGCCACAGGTCAGTGACGACAACCCTTTCTCAGAAGC
GCTGTTCCGAACCATGAAATATAGACCTGGTTATCCGAGTCGGCCTTTTTCAAGTCTGGCAGCGGCCCAGACTTGGGTTGATGGTTTCGTCGCTTGGTAT
AACACGGAGCATCTGCACAGCGGCATTCGCTTCGTCACCCCGGACGATCGCCACTTCGGCCGGGAGAAAGCCATCCTGTTAAACCGCCGGGAGATATACG
AGAAGGCCCGGCAACAAAACCCAAACCGCTGGTCAAAAAACATCCGAAACTGGGAGCCAGTGGAAACAGTTTATCTTAACCCTGAACCAACGGCGGAAGT
CGAACTTCTTGACGCCGCATAAAAATTCAACAGAGGCGACAACTACCTTGACACCCGCCG
Recoding section
- Recoding by frameshift
- Frame
- Type
- Experimentally demonstrated
Stimulators :
- Shine-Dalgarno sequence :
- Secondary structure :
Recoding motif :
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
534 bp | 177 aa | 167 | 700 | + | No |
Description : First part of the transposase
ORF sequence :
MTYTQGFKSSLVRKMACPNGVSASALSREVGIPQQSLSRWLRDANVVNESGNSPISEAPWRTMSPKRPQDKSAEEKLKIVIEAETVAEDQLGAFLRRNGI
HEAQLREWRAMMLSGLQKPSRPSSKTSEETRKIHALEKELLRKDKALAEAAAILILKKKVQSIWGGEDEPTDKRNGR
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1122 bp | 373 aa | 601 | 1722 | + | No |
Description : Second part of the transposase
ORF sequence :
SIGRGRRDFDPQKKSPIDLGGRGRAHGQEERQMIGQLIEEATHSGARLESAVVAIGLSIRTFQRWNLQDDGADRRHGPVAEPANKLAPSERQNIIDIANS
PAFRDISPKQIVPQLADQGIYVASESSFYRVLKDDGLMTHREPSRPATSRKPKEHVATGPCQVWSWDITYLKSPILGQFFYLYMIMDVWSRKIVASTVFL
KESNDYSARLFLKACIRLGINPEGLILHSDNGGPMKGATMLATLQRLGVIPSFSRPQVSDDNPFSEALFRTMKYRPGYPSRPFSSLAAAQTWVDGFVAWY
NTEHLHSGIRFVTPDDRHFGREKAILLNRREIYEKARQQNPNRWSKNIRNWEPVETVYLNPEPTAEVELLDAA
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1556 bp | 518 aa | 167 | 1722 | + | Yes |
Chemistry : DDE
ORF sequence :
MTYTQGFKSSLVRKMACPNGVSASALSREVGIPQQSLSRWLRDANVVNESGNSPISEAPWRTMSPKRPQDKSAEEKLKIVIEAETVAEDQLGAFLRRNGI
HEAQLREWRAMMLSGLQKPSRPSSKTSEETRKIHALEKELLRKDKALAEAAAILILKKKSPIDLGGRGRAHGQEERQMIGQLIEEATHSGARLESAVVAI
GLSIRTFQRWNLQDDGADRRHGPVAEPANKLAPSERQNIIDIANSPAFRDISPKQIVPQLADQGIYVASESSFYRVLKDDGLMTHREPSRPATSRKPKEH
VATGPCQVWSWDITYLKSPILGQFFYLYMIMDVWSRKIVASTVFLKESNDYSARLFLKACIRLGINPEGLILHSDNGGPMKGATMLATLQRLGVIPSFSR
PQVSDDNPFSEALFRTMKYRPGYPSRPFSSLAAAQTWVDGFVAWYNTEHLHSGIRFVTPDDRHFGREKAILLNRREIYEKARQQNPNRWSKNIRNWEPVE
TVYLNPEPTAEVELLDAA
Blast result :
Comments
ISGur5 shows 56% (ORFA) and 68% (ORFB) amino acid similarity to ISPpr7. The third ORF is a putative ORFAB transposase reconstructed in silico by a -1 frameshift.
File updated: September 13, 2013. The extremities were modified; the proposed ends for this sequence are based on similarity with closely related elements.
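The ORF coordinates in the Protein section can be checked against the stated lengths with simple arithmetic. The sketch below assumes 1-based inclusive coordinates that include the stop codon, and models the -1 frameshift as one nucleotide being read twice. Both points are assumptions, and the function names are hypothetical.

```python
def orf_length_bp(begin: int, end: int) -> int:
    """Length of an ORF given 1-based inclusive coordinates."""
    return end - begin + 1


def protein_length_aa(length_bp: int, minus_one_frameshift: bool = False) -> int:
    """Residue count, assuming the coordinates include the stop codon.

    Assumption: a -1 frameshift re-reads one nucleotide, so one more
    nucleotide is decoded than the DNA span contains.
    """
    decoded = length_bp + 1 if minus_one_frameshift else length_bp
    if decoded % 3 != 0:
        raise ValueError("decoded length is not a whole number of codons")
    return decoded // 3 - 1  # drop the stop codon


# (begin, end, fusion by -1 frameshift) as given in the ORF tables
orfs = {
    "ORF 1": (167, 700, False),
    "ORF 2": (601, 1722, False),
    "ORF 3": (167, 1722, True),
}
for name, (begin, end, frameshift) in orfs.items():
    bp = orf_length_bp(begin, end)
    print(name, bp, "bp", protein_length_aa(bp, frameshift), "aa")
```

Under these assumptions the computed values agree with the tables: 534 bp and 177 aa, 1122 bp and 373 aa, and 1556 bp and 518 aa.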
References
[1] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chertkov,O., Brettin,T., Bruce,D., Han,C., Schmutz,J., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Mikhailova,N., Shelobolina,E., Aklujkar,M., Lovley,D. and Richardson,P. (2007) Direct submission GenBank.