ISRhba1
- Family IS1595
- Group ISNwi1
Isoform :
Synonym(s) :
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_AAMT01000005 | ND | Rhodobacterales bacterium | Rhodobacterales bacterium HTCC2654 |
DNA section
IS Length : 2457 bp
Ends
IR Length : 23/27
IRL : GGGGCCTATATACTTAGCTAGCCAAAAATCTCTTGCGGCGGATGGCTAGT
IRR : GGGGCCTACATACTTAACTAGCCGCAATGACGGCTAGATCGGTTTTCCTA
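The IR figure can be checked directly from the two listed ends. The following is a minimal Python sketch, not part of the record. It assumes that IRR is written as the reverse complement of the element's right end, and that "23/27" means 23 identical positions across the first 27 bp of the two repeats. The names `reverse_complement` and `count_matches` are hypothetical helpers introduced only for this illustration.

```python
# Terminal sequences copied from the record (50 bp each).
IRL = "GGGGCCTATATACTTAGCTAGCCAAAAATCTCTTGCGGCGGATGGCTAGT"
IRR = "GGGGCCTACATACTTAACTAGCCGCAATGACGGCTAGATCGGTTTTCCTA"
# Last 50 bp of the DNA sequence in this record, read on the top strand.
RIGHT_END = "TAGGAAAACCGATCTAGCCGTCATTGCGGCTAGTTAAGTATGTAGGCCCC"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def count_matches(a: str, b: str, window: int) -> int:
    """Count identical positions over the first `window` bases of a and b."""
    return sum(x == y for x, y in zip(a[:window], b[:window]))


# Assumption: IRR is the right end shown in reverse-complement orientation.
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR
# Assumption: the IR length field is "identities / aligned length".
ir_identities = count_matches(IRL, IRR, 27)

print(irr_matches_right_end, ir_identities)
```

With the sequences above, the reverse complement of the right end reproduces IRR, and the two repeats agree at 23 of their first 27 positions, which is consistent with the stated IR length.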
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GACTCCCCCGAGGTT | TTTTGAAA | CAGAGAAGAGGG | 8 |
DNA sequence
GGGGCCTATATACTTAGCTAGCCAAAAATCTCTTGCGGCGGATGGCTAGTTAGATTAAACATGATGCAGAAGGAGCTTGCATCATGTCCAAACCCGAAAC
CCTCTCCACCTTCGAGTTCTTCAAGAAGTTCCCGGATGAGGAAGCCGCCCGCAGGTTTTTCGAGGCGCGCCGCTGGGGCGATGAGCCGGTTTGCGGCCAC
TGTGGTTCGGTCAGTGTGACCGAGTGCAAAGACCATAAGCCCATGCCCTACCGCTGCAAGGACTGCCGGAAGCATTTCAGCGTCCGCACGGGCACCGTGT
TGGCTGAGAGCCGCCTTCCCCTTCAGAAGTGGCTTCTTGCTATCTTCATGCTCACCAGCGCCCGTAAGGGCATACCGAGCACTCAGATGGCCCGTGAACT
GGGGGTCACGCAGAAAACCGCGTGGTTCCTCGCCCAGCGCATCCGAGAGACGTGGCTGAAGGATCGTGACGATCACATGGACGGCCAGATGCAGGTTGAC
GAAACCTACATTGGGGGGCGCGAAAAGAACAAACATGCCGACAAGAAACTTCGCGCTGGTCGTGGCGCGGTTGGCAAGACTGCGGTTGTTGGTGTCCGCG
ATGAAGTTGGTCAGGTCCGCGCCGTAGTGGTTGAAAACACCAAGGCTGCGACCCTTGAGAAGTTCGTTCGCCAACACTGCAAGAAGGGGGCAACCGTCGT
CACGGACACTCACGGCGGTTACATTGGACTGACGGGCGCGGGCTACCGTCACATTCGGATTAATCACTCTGCTGGCGAGTATGTCCGCGACATGGCGCAT
ACCAACGGCATCGAAAGCTTCTGGTCTCTGCTGAAGCGCGGCTACATCGGCATCTACCACTACATGAGTGCCAAGCACCTTCATCGCTACATCAAGGAAT
ACTCATTCCGCCATAACACGTCGCAGGTCGGCACCATGGATTTCATCAACATGACAATCGACCGCATGGACGGCAAGCGACTGACATACAGGAGGCTGAC
CAATGCCTGATGAGAAGAAAGACATCATCGAACCTATCGACGCCAACTTTGAGGAAGTTGTGAGCAAGGTAGCACCGCGCCTTAAAGCATCATCCGGGCC
TGAGATTATTCCTGCGTCTGAAAGGGACGCCCTCGCTGACAATGAGGCAACGCATCGGGGCAAGCTCAGGATTGGGCCCGTCGAGATTCCCTGCGCGGTC
TTGAAAGACGGTCGTAGAGTTCTGTCTGGGCACGGAATTGCCAGCGTCCTTGGCGGTCGCAGCGGTGCAGCAAAGAGATTGAAAACAGAGGCAGAGAAAG
ACGGGGCCCATATGCCCGTCTTTCTGGCATCAAAAAGTCTTTTGCCATACATTTCCAAGGAGTTAATGGACGGGCCCCTCAAGCCGATAACTTATAAATC
TGGAGACACCGAGGCGGAAGGCTATCCAGCCGAAGCACTTCCTGAAATCTGCAACATATGGCTACAGGCTAGACAGGATGGGGTCCTTAATCCGCAGCAG
GCGGACAGGGCCCAAGCAGCGGAAATCGTTATGCGCGGTCTTGCGGACCTTGGCATCATCGGCCTCGTTGACGAAGCCACAGGATACCAAAACACCCGAG
ACCATGACGCCCTTCAGGCGATTTTGGACAAATACCTGCAAAAGGAATTTGCCGCATGGGCTAAGCGGTTCCCCGATGCTTTCTATCGCGAGATATTCAG
GCTTCGCGGATGGAGTTGGAATGCAATGTCGGTCGCTAGGCCGGGCGTGGTCGGAAAATACACAAACGACATTGTCTATGAACGCCTAGCCCCCGGCATC
CTCGAAGAGCTGCAAGCCATGAACCCAACAAAGGACGATGGGGGGCGTTTGAGACGCCATCACCAATTTCTCACTGAAGATATCGGGCACCCGGCACTAG
CACAGCACCTGCACGCTGTTATTGGCCTCATGCGGGCATCGGCCACATGGGAACAGTTCAAAACCATGCTAGATCGCGCCTTCCCCAAGAAGGGGACGCA
GTTAGAACTGCTACTGGACGACGACCGATAGCCCAAACCGCGAACGCATCAAGAACAAGATATGACTCGACACGCAATCCTGCGATTCGATATAGGTTAA
GAGACGGCTACCGTCACCATATATAGCTCCACGCCCGATAAGCAGTAGTTCTGGGGGCGCAAGGGGTGGCCCCAGTGGGCCGGGACATACCCGGATCGCA
CGAAGGAGCCTAATATGGTAGCGTGTGTATGGCCGAAACCGGTGAGCGTTTGCTCATACACCCGGTTTCGGTTTGGAAAGTGGGAATACGTGGTCTCGCA
CTGCCGTTCCTACCCGTCCCGCTAAGGTTGTCTAGGCCAAGGGACACGTAAGAAGATGGTAGCCGTCACTCCCTAAACGTTCCTATCATTAGGCGGAATT
TGGTGTCTAGGAAAACCGATCTAGCCGTCATTGCGGCTAGTTAAGTATGTAGGCCCC
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
927 | 308 | 84 | 1010 | + | No |
Chemistry : DDE
ORF sequence :
MSKPETLSTFEFFKKFPDEEAARRFFEARRWGDEPVCGHCGSVSVTECKDHKPMPYRCKDCRKHFSVRTGTVLAESRLPLQKWLLAIFMLTSARKGIPST
QMARELGVTQKTAWFLAQRIRETWLKDRDDHMDGQMQVDETYIGGREKNKHADKKLRAGRGAVGKTAVVGVRDEVGQVRAVVVENTKAATLEKFVRQHCK
KGATVVTDTHGGYIGLTGAGYRHIRINHSAGEYVRDMAHTNGIESFWSLLKRGYIGIYHYMSAKHLHRYIKEYSFRHNTSQVGTMDFINMTIDRMDGKRL
TYRRLTNA
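The ORF 1 start coordinate can be cross-checked against the DNA sequence by translating the first codons from position 84. The sketch below is illustrative only. It assumes the standard genetic code (NCBI translation table 1) is adequate for these codons, and it uses only the first 100 bp of the sequence listed in this record. The function name `translate` and the constant names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First 100 bp of the DNA sequence in this record.
FIRST_100_BP = (
    "GGGGCCTATATACTTAGCTAGCCAAAAATCTCTTGCGGCGGATGGCTAGT"
    "TAGATTAAACATGATGCAGAAGGAGCTTGCATCATGTCCAAACCCGAAAC"
)

ORF1_BEGIN = 84  # 1-based coordinate from the ORF 1 table


def translate(dna: str) -> str:
    """Translate complete codons only; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Convert the 1-based Begin coordinate to a 0-based slice start.
peptide_start = translate(FIRST_100_BP[ORF1_BEGIN - 1:])
print(peptide_start)
```

The five complete codons available in this window translate to `MSKPE`, matching the first residues of the ORF 1 sequence.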
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1029 | 342 | 1003 | 2031 | + | No |
Annotation : Phage related protein
Description :
ORF sequence :
MPDEKKDIIEPIDANFEEVVSKVAPRLKASSGPEIIPASERDALADNEATHRGKLRIGPVEIPCAVLKDGRRVLSGHGIASVLGGRSGAAKRLKTEAEKD
GAHMPVFLASKSLLPYISKELMDGPLKPITYKSGDTEAEGYPAEALPEICNIWLQARQDGVLNPQQADRAQAAEIVMRGLADLGIIGLVDEATGYQNTRD
HDALQAILDKYLQKEFAAWAKRFPDAFYREIFRLRGWSWNAMSVARPGVVGKYTNDIVYERLAPGILEELQAMNPTKDDGGRLRRHHQFLTEDIGHPALA
QHLHAVIGLMRASATWEQFKTMLDRAFPKKGTQLELLLDDDR
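The coordinates reported for the two ORFs can be checked for internal consistency with simple arithmetic. The sketch below assumes that Begin and End are 1-based inclusive positions on the IS, and that the nucleotide length includes a terminal stop codon that is not counted in the amino acid length. The `Orf` class and helper functions are hypothetical names used only for this example.

```python
from dataclasses import dataclass

IS_LENGTH = 2457  # bp, from the DNA section


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int      # 1-based, inclusive
    end: int        # 1-based, inclusive
    length_bp: int
    length_aa: int


ORFS = [
    Orf("ORF 1", 84, 1010, 927, 308),
    Orf("ORF 2", 1003, 2031, 1029, 342),
]


def span_bp(orf: Orf) -> int:
    """Nucleotide span implied by the Begin/End coordinates."""
    return orf.end - orf.begin + 1


def expected_aa(orf: Orf) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    return span_bp(orf) // 3 - 1


def frame(orf: Orf) -> int:
    """Reading frame (0, 1 or 2) relative to position 1 of the IS."""
    return (orf.begin - 1) % 3


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two ORFs (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


for orf in ORFS:
    print(orf.name, span_bp(orf), expected_aa(orf), frame(orf))
print("overlap:", overlap_bp(ORFS[0], ORFS[1]))
```

Under these assumptions both ORFs are consistent with their stated lengths (927 bp / 308 aa and 1029 bp / 342 aa). The coordinates also imply that the two ORFs overlap by 8 bp (positions 1003 to 1010) and lie in different reading frames.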
Blast result :
Comments
ISRhba1 shares 50% amino acid similarity with ISSpo3.
The second ORF encodes a phage-related protein.
References
[1] Giovannoni, S.J., Cho, J.-C., Ferriera, S., Johnson, J., Kravitz, S., Halpern, A., Remington, K., Beeson, K., Tran, B., Rogers, Y.-H., Friedman, R. and Venter, J.C. (2006) Direct submission, GenBank.