ISRpa1
- Family IS1595
- Group ISNwi1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_007778 | ND | Rhodopseudomonas palustris | Rhodopseudomonas palustris HaA2 |
DNA section
IS Length : 2091 bp
Ends
IR Length : 27/28
IRL : CCCAATATTATACTTTGCGTACCCAAAATTCTGTGAGATATTGGCTTCAA
IRR : CCCAATATTATACTTTGCGTACCCACAAAACCCGGTGCTACCATGAGTAG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ACGAACGAGAGCTAGG | TTTTTTTT | CCTAAATTGCT | 8 |
DNA sequence
CCCAATATTATACTTTGCGTACCCAAAATTCTGTGAGATATTGGCTTCAACACGAGGTTGAAGCCATGAAGTCCTTTTTGAACGCCAAGCACCTGCAGAA
CGAAGAAGCGGCCTACGCTTGGGTTGAAGCCCGTATTTGGCCCAACGGCCCGGTTTGCCCGCACTGTGGCGGCGTTGACCGCATTTCCAAGATGCAGGGC
AAGTCCACTCGGATCGGCGCTTACAAGTGCTATCAGTGCCGCAAGCCCTTTACCGTCAAGGTTGGCACCATCTTCGAATCCAGCCACGTCCCCATGCACC
TGTGGCTGCAGGCGATCTTCCTGCTTTCGTCCTCAAAGAAGGGGATCAGCAGCAACCAGCTTCACCGCACCCTTGGCTGCACCCTCAAAACCGCGTGGTT
CATTTCTCACCGCGTCCGCGAAGCGATGCGCGATGGTGGCCTTGCCCCGATGGGCGGCGCTGGCGGAATTGTCGAGGTTGACGAAACCTATTTCGGCAAG
ACCAAAGAAAAGAAGCCGTCACCGCAGCGCAAGGGCCGTCCTTTCATTCATCGTGGCGGCGGCCCGTCCGGCAAGCGCGCTATCGTTTCGCTTGTCGAGC
GCGGCGGCAAAGTGCGCTCGTTCCATGTCGAGAACGCCGACAAGCCGACTGTCGTTGGCATCGTTACTGCCAACGTCGCGAAAGAAAGCCGCCTGCACAC
CGACGAAAGCCGCCTCTACATCGGCGCGGACCAACACTTTTCCTCTCACGAAACTATCAATCACACCGCCAAGGAATACGCTCGCGGCGATGTCACCACC
AATTCTGTTGAAGGCTTCTTTTCGATCTTCAAGCGCGGCATGAAGGGCGTCTATCAGCACTGCGGCGAAAACCATCTTCATCGCTATCTGTCGGAATACG
ATTTCCGCTATAACAACCGCGTCGCACTTGGCGTGAACGACTACGAGCGCGCCGACCGTGTGTTGGCTGGCGTGGTTGGCAAGCGTCTCACCTATCAAAC
GACTCGTTAAAGGCAAAATTCGATGGATAAGAAGCCAAAAGCTCGCGGCCCTACGAAGAAAGCAGCGAAGCCTAAAGCGAAAGATGAAAAGCAATTCGAG
CGGTTTATAGAAACCGCTCGTGAGCTTGATGTTGATGAGAGCGGGAAGAGCTTTGATGTAGCGTTTAGGAAGATTGCCACGCCGACAGTGAGGACTAAAT
CAAAATCGCCAACTGGTTCATGAAGGTGCGAACGCACTGAAATATATCTTTGCTTTGTTCCGGCGAGTAAGTCGAGATCGGGTGCATCGTGCTGTTACGC
CATGCTTGCTTCACATGGTAAAGATTTGCGTGACATTCTGACCATGCGTCGCGCTTCTTGTCTTTGGGCATCGCGCCGATTTTAGCGTGCAAGTCGCTCA
ATAGCTTACCCCACTCTCTATCGGGGTTAGGGATCTGCAAAGAGGTGCACAACTCCCCGACCGCACATTCCATCGCTCTCATGAGGTGAAATACTGAGGC
GGTGTCCTGACCAAGGGCGATGCACTTGCCCGCATTTTCAATGTCGGGAATCGAGTTGGGAAAACGCTCGGTTACTCGGTTTCCGAATTGGTCCTTATCT
TCGTAATACCGCACTAACTCTGGCCGAACGTAGTACATTTTGTGAGCAGGCAATTCGTCTTGGATGCGGTTTCTCAGATCGGTAACGAGTCCAAGAATTT
CACTAAGTTGATTGCTAGACATCGGAGGGGTAAACGAGCCTATGGCCCGCCAGACGCGTTGTACTTGAGCCTCAACCGACGGCATAGACATCGGCAGCTT
AAGGACGATGGTTTTGCTATCGTCCTCGTCCCTCGAAATTATTTTTTCCCAGATAGGGGCGAGGCGGGCTTCCAAAGTTGCCCCAGAAGATATGTCTAGC
GCCTTTACTACGTCAGGGTTCGATATTCCGCCTCCCTTTTCCATCTCGGCAATGAGTTGAGCGATAAAGAAGAATTCAGATACGTATGCCCGCAACATAT
CAAACAAGCTCCATAGCTTGCCGGGCTGGACGCCCCCGGCGCTACTCATGGTAGCACCGGGTTTTGTGGGTACGCAAAGTATAATATTGGG
CGAAGAAGCGGCCTACGCTTGGGTTGAAGCCCGTATTTGGCCCAACGGCCCGGTTTGCCCGCACTGTGGCGGCGTTGACCGCATTTCCAAGATGCAGGGC
AAGTCCACTCGGATCGGCGCTTACAAGTGCTATCAGTGCCGCAAGCCCTTTACCGTCAAGGTTGGCACCATCTTCGAATCCAGCCACGTCCCCATGCACC
TGTGGCTGCAGGCGATCTTCCTGCTTTCGTCCTCAAAGAAGGGGATCAGCAGCAACCAGCTTCACCGCACCCTTGGCTGCACCCTCAAAACCGCGTGGTT
CATTTCTCACCGCGTCCGCGAAGCGATGCGCGATGGTGGCCTTGCCCCGATGGGCGGCGCTGGCGGAATTGTCGAGGTTGACGAAACCTATTTCGGCAAG
ACCAAAGAAAAGAAGCCGTCACCGCAGCGCAAGGGCCGTCCTTTCATTCATCGTGGCGGCGGCCCGTCCGGCAAGCGCGCTATCGTTTCGCTTGTCGAGC
GCGGCGGCAAAGTGCGCTCGTTCCATGTCGAGAACGCCGACAAGCCGACTGTCGTTGGCATCGTTACTGCCAACGTCGCGAAAGAAAGCCGCCTGCACAC
CGACGAAAGCCGCCTCTACATCGGCGCGGACCAACACTTTTCCTCTCACGAAACTATCAATCACACCGCCAAGGAATACGCTCGCGGCGATGTCACCACC
AATTCTGTTGAAGGCTTCTTTTCGATCTTCAAGCGCGGCATGAAGGGCGTCTATCAGCACTGCGGCGAAAACCATCTTCATCGCTATCTGTCGGAATACG
ATTTCCGCTATAACAACCGCGTCGCACTTGGCGTGAACGACTACGAGCGCGCCGACCGTGTGTTGGCTGGCGTGGTTGGCAAGCGTCTCACCTATCAAAC
GACTCGTTAAAGGCAAAATTCGATGGATAAGAAGCCAAAAGCTCGCGGCCCTACGAAGAAAGCAGCGAAGCCTAAAGCGAAAGATGAAAAGCAATTCGAG
CGGTTTATAGAAACCGCTCGTGAGCTTGATGTTGATGAGAGCGGGAAGAGCTTTGATGTAGCGTTTAGGAAGATTGCCACGCCGACAGTGAGGACTAAAT
CAAAATCGCCAACTGGTTCATGAAGGTGCGAACGCACTGAAATATATCTTTGCTTTGTTCCGGCGAGTAAGTCGAGATCGGGTGCATCGTGCTGTTACGC
CATGCTTGCTTCACATGGTAAAGATTTGCGTGACATTCTGACCATGCGTCGCGCTTCTTGTCTTTGGGCATCGCGCCGATTTTAGCGTGCAAGTCGCTCA
ATAGCTTACCCCACTCTCTATCGGGGTTAGGGATCTGCAAAGAGGTGCACAACTCCCCGACCGCACATTCCATCGCTCTCATGAGGTGAAATACTGAGGC
GGTGTCCTGACCAAGGGCGATGCACTTGCCCGCATTTTCAATGTCGGGAATCGAGTTGGGAAAACGCTCGGTTACTCGGTTTCCGAATTGGTCCTTATCT
TCGTAATACCGCACTAACTCTGGCCGAACGTAGTACATTTTGTGAGCAGGCAATTCGTCTTGGATGCGGTTTCTCAGATCGGTAACGAGTCCAAGAATTT
CACTAAGTTGATTGCTAGACATCGGAGGGGTAAACGAGCCTATGGCCCGCCAGACGCGTTGTACTTGAGCCTCAACCGACGGCATAGACATCGGCAGCTT
AAGGACGATGGTTTTGCTATCGTCCTCGTCCCTCGAAATTATTTTTTCCCAGATAGGGGCGAGGCGGGCTTCCAAAGTTGCCCCAGAAGATATGTCTAGC
GCCTTTACTACGTCAGGGTTCGATATTCCGCCTCCCTTTTCCATCTCGGCAATGAGTTGAGCGATAAAGAAGAATTCAGATACGTATGCCCGCAACATAT
CAAACAAGCTCCATAGCTTGCCGGGCTGGACGCCCCCGGCGCTACTCATGGTAGCACCGGGTTTTGTGGGTACGCAAAGTATAATATTGGG
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
945 bp | 314 aa | 66 | 1010 | + | No |
Chemistry : DDE
ORF sequence :
MKSFLNAKHLQNEEAAYAWVEARIWPNGPVCPHCGGVDRISKMQGKSTRIGAYKCYQCRKPFTVKVGTIFESSHVPMHLWLQAIFLLSSSKKGISSNQLH
RTLGCTLKTAWFISHRVREAMRDGGLAPMGGAGGIVEVDETYFGKTKEKKPSPQRKGRPFIHRGGGPSGKRAIVSLVERGGKVRSFHVENADKPTVVGIV
TANVAKESRLHTDESRLYIGADQHFSSHETINHTAKEYARGDVTTNSVEGFFSIFKRGMKGVYQHCGENHLHRYLSEYDFRYNNRVALGVNDYERADRVL
AGVVGKRLTYQTTR
RTLGCTLKTAWFISHRVREAMRDGGLAPMGGAGGIVEVDETYFGKTKEKKPSPQRKGRPFIHRGGGPSGKRAIVSLVERGGKVRSFHVENADKPTVVGIV
TANVAKESRLHTDESRLYIGADQHFSSHETINHTAKEYARGDVTTNSVEGFFSIFKRGMKGVYQHCGENHLHRYLSEYDFRYNNRVALGVNDYERADRVL
AGVVGKRLTYQTTR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
855 bp | 284 aa | 2049 | 1195 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MSSAGGVQPGKLWSLFDMLRAYVSEFFFIAQLIAEMEKGGGISNPDVVKALDISSGATLEARLAPIWEKIISRDEDDSKTIVLKLPMSMPSVEAQVQRVW
RAIGSFTPPMSSNQLSEILGLVTDLRNRIQDELPAHKMYYVRPELVRYYEDKDQFGNRVTERFPNSIPDIENAGKCIALGQDTASVFHLMRAMECAVGEL
CTSLQIPNPDREWGKLLSDLHAKIGAMPKDKKRDAWSECHANLYHVKQAWRNSTMHPISTYSPEQSKDIFQCVRTFMNQLAILI
RAIGSFTPPMSSNQLSEILGLVTDLRNRIQDELPAHKMYYVRPELVRYYEDKDQFGNRVTERFPNSIPDIENAGKCIALGQDTASVFHLMRAMECAVGEL
CTSLQIPNPDREWGKLLSDLHAKIGAMPKDKKRDAWSECHANLYHVKQAWRNSTMHPISTYSPEQSKDIFQCVRTFMNQLAILI
Blast result :
Comments
ISRpa1 is 74% aa similar to ISNwi1. The second ORF is a passenger gene on the complementary strand.
References
1] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina,T., Hammon,N., Israni,S., Pitluck,S., Chain,P., Malfatti,S., Shin,M., Vergez,L., Schmutz,J., Larimer,F., Land,M., Hauser,L., Pelletier,D.A., Kyrpides,N., Anderson,I., Oda,Y., Harwood,C.S. and Richardson,P. US DOE Joint Genome Institute (2006) Direct submission GenBank.