ISRpa4
- Family IS1595
- Group ISNha5
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_007925 | ND | Rhodopseudomonas palustris | Rhodopseudomonas palustris BisB18 |
DNA section
IS Length : 4218 bp
Ends
IR Length : 25/26
IRL : CGGCCTTAGGTAGCAAATGCACCAAGCGCTAGCGAAGGGGGCCTTGCTGG
IRR : CGGCCTTAGGTAGCAGATGCACCAAGGCGTTTTGTGGATTTCCATCCCAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GATCCACGCGCT | GGATTATC | CGCGCGAGAAACT | 8 |
DNA sequence
CGGCCTTAGGTAGCAAATGCACCAAGCGCTAGCGAAGGGGGCCTTGCTGGCCCTATGTGGCCTCGTTACCGTCGGCGTCATGAGCCATGCCGCCGGGAGG
GTCGGTGGCTATGTTCCCGCAATACACAAGCCTGAACCGCGTTTCCCGGTTCTTGTTGAAGGCGTCCCTGTATCGAATTTCGCCGTAAACCCAAACACTT
GCCTTACCACTGGCGACAAGATCAAAATGTTGCTTGGAGAGTCTGTAGTTTGGTCCGAAGCCAGCGAAGGCTGTCGACATTTCCCCAGGCCCCAAGACTG
CCTTACGCGGGTCGCTATGGTCGATATCGTGTCTGACTTCATCAAATGGCCGATCACTCATTGCGCCATTCATACAATGGTGCAGCCCGTAAGCTGGGGT
TTGCCCAGTGTTCTTCATCGTGACCACTATTTCGACGGGCTCGCCAACCGCAAAGCGCCTGACCCTGCCTTGGCTAACGAGAACGTATGCCCGCAGTTGC
CGCTCTGCGTTGTCTCTAGTTTCCTCGAAGGTCTTGTCGGCAATTTCGGCGGACCTAACGGCGGCTTTGCCGGCAGTTATGCCAGCCTCAGCGGTGCGTC
TAGTTTCCCTGAACGTCTTGATGAGATACCAAAGGGTAAAAACCCCAATCGGCAACTGAGCGAGAGACACGAGAAACGAGTTCTGCGCCCAGTAAGTAGC
CCCAACCTGCGCAGCGGAACTTCTGTTCTCACGTTCCTCTTTTTTCTTTTCTGCTTCATCGCCAGCAGTTTTTTCTGCTCGGTATATCTCTATCGCTTCG
TCGAACATGGCGCGAAGAGTCGCCCGGTTGGCGATTATTTCGGTCTCATTAGTGGCGCCGCTATGATTCTCGCATTCTATTTGATCCTCGGTAGCGGCCC
CTAGGCTATATGGGCCGATCAGGCATCCTCCAATCGCAACGATGAGCAACCGACCGCCGATAAGCATTGGATCAAAACGCCGCTGTTAACCTGCCTCTAT
CGATCTTATTGGCGATGCTTCGCCATGTCTCTGCCGCGTCGCGCGCTGCCACTGGTATTTTCAACTCAAGCCATAGCAAGTTAGTCGCAACTCCCCATTT
ATGGGCCATATCCCATTGACTTGATTTGTCCGAAATACTGGAATGCCCCGCGATCTAGCGGGGGTGTATCTCATGGTTGGCGGGGCGGCTGGTAGTTCGA
GGGCAATATGGCAGGCAGCGAGAGAACCGAGGGAGCCGAGCCAACCGCAGCGACCGGAGCGGCCGGAGAGGAATCCGGAGCCGGCCCGGCACCCGGATCG
GGAGCCGGGCCGAACGAACGACCGGCCAGGACGGGAGATAAGGGAGAATACGGACCCGTCCCGTCACCCGGACCCGTGGCCTCCGCAACGCCGCTAGAGG
CTATAACGATTTCTTGCCTGGCAAACGCGCATTACCATTCAGGCCGGGAAGCGTTCTTAGACACTGTGCATCGCTGGTTTATGTTTTTGGTGATTGCGTT
CGGCGCGACCGCTTTGACGGACGTCTTGCCACGAATCTCGCATTCAGTGTTTGGGTTTAGCCTTGATGTCGGGGCGGTAAAGGAGGCTTGCGCGGCGAGC
GCGGCCATCATTGCGGCGCTTGATCTGACTTTTGATCTTTCCAATCGAGCCCGCTCGCATTCGATGATGAAGCGGCGGTACTACGAATTGCTGGCCGATC
TGAGAGAAGGCCAGAAAACGATTGAACATGGTCGGGTGTGCCTAGAGCGGTTTTCGGCTGACGAGGAGCCTATGTATCGGGTGCTATATCTAAGTTGCTG
GAACTCCGCCCAAATGACCGTATTCGGGAAACTGGCGAAAAAATTTGAAATCTCTTTTTGCGGGAATCTGTTCAAGAACTGGTTTCGAAGGCCATCTGCC
GTCTACCCCATGGTTGATCCGAGGGGCGATAGGGAGCGCGGGTTCGCCAGTTCTCAATCTGCCTGACAAGCCTATGTCTGCTTTTTGACAATCGGCCCCG
CAACAGAGTCCCCCGTAAACCTGATCCCGTCAGCCTCTAGCGCCCCGCGGATGGCCGCTAAGTTGTTGGCGATGAGGTTGCGCTTTCCGCTTTCGTAGTC
TTTCACGGTCGATAGTCCGACCTTGGCTCTAGTTGCCAATTCAGCTTGGGAGATATTTAACCAATTCCGCGCGGCGCGCGCTTGCTCTGGGCTCATTGTC
GGTCCTGCTATCGTGTTGTTCCGGCCACTATATATCCGTTTTGGTCAAGATCAACCATTTTAGTTGACGTAATCCAAATCAGCTAATATAGTCTATGTCA
TCCAAAAAGGATGACATTAGATGGCCCAGCACTTCCTTCTCTCGGCAAAAGCCCGCACTCTCTCGCTCAAGTCGATCTTCGCGGCTGGCGAGGAAAAGGC
TTACGAGACGTTCTGCAAGCTCCGCTGGCCTGCCACGGACGGCGAGCCCGTTTGCCCGTTCTGCGGTGGCCTGGACGCCTACAGGATCACCACGCGTCGT
CGCTTCAAATGTAAGGCTTGCCTGCGCCAGTTCTCGGTTACGAGCGGGACTATTCTCGCGTCGCGCAAGATGTCGTTCACCGACCTCTTGGCCGCGATCT
GCATCATCGTGAATGGCGCCAAGGGTATCTCTGCCCTGCAGCTCGCGCGCGACATCAACTGCCAGCACAAGACGGCTTTCGTGCTTTCGCACAAGCTGCG
CGAGGCTATGGCGTCGGAAGCAGCTAAGACGTTGGGTGGCGAAGTTGAGGTTGATGGGGCCTACTTTGGCGGCCACATTCGGCCCGCCAACTACGCCGAA
AACAGGATCGATCGGCGGTTAGCCGAGCACCAGACCGGCAAGCGCCGCGTCGTCATCGTTCTTCGCGAACGCAACGGCAGTACGGTTACATTCGTCCGCA
AGAGTGAAGCGGAAGGCGTCGATATCGCCAAGCGTATCGTTTCCCGCGACGCTATCATGCACGCTGACGAAGCGGCGCATTGGGACGCGCTGCGCGCCGG
TTGGCAGGTTCACAGGATTAATCACTCGGAGGCTTACTGCGATGACGGCGCGTGCACGAACCAAGCTGAAAGCTATTTCTCACGCCTTCGCCGCATGGTT
GACGGGCAACACCATTCCGTCTCGCCGCAATACCTGCACCAGTACGCAGGCCATGCCGCTTGGATTGAGGACAACCGCCGTCTCGACAACGGCGCGCTGG
CAAGCCGCATCGGCGGGTTGGCGATGGCTCATCCGGTTAGCCGGAATTGGAAGGGATATTGGCAGCGGGCGGCTTAATGTTGCGCATCTAAGCGCCATCG
ACTCGAACCCCGGCTTCATTATCTCTGGAGTGTTGAGTGCTGATCATCGTCGGGGTGCCGGATAGAGGTTATCGTTCTGGTAATGTATCCGCTGGCTAGT
TCTTTGATTGTCGTCTCTGTGAAGTCGTCAGCTTTAGAGAATATCCGCCTCGAAGTTAGGTTTGCTATTATCTTCTGCTCGAAGAGTCGCAACCTTTGCT
TTGATGTATCGACGTCCTGATCGAACGCAATCTCTCCGAGGGCGAACCTTACCATTTCTCCTAACGCAATTTGCATAGCGATGTCGTTGGCAATTTGTTC
TGGGGCGCGTATCTGAGATTTCAACTTTTCGACCTGCGACGTTAGGTCGGATATTTGCATCCTCAAGGTATCAATCTTTTGTTCGGTTGACTGCGACTCC
GGTTCCATGTTGCTAGCCCTCGTTGTTAAGAAGCCCAAACCCTAACCCCTTTCCGTTTCGGCAGGGCGGTAACTTTGCCTCGCTTGGCTGCCATATTTAG
GGCTTGGACTATACGGAAGGCTATTGAGGTTCTTAGCACCCTGTCGCCTTCGTCTAGGCCCTTGGCCCGGATGACGCGGATAGCGAGTTCACGGGTATCT
AGCGGCCCTTCCTTGGCAAGGGCTTCACGGCAAATCGTGGTCATTTCCCGAGGCTTAAATAGCCTCAAGGTGTTCATGTAGACCGGGAATTGGGTTCGTT
TGTCGGGGTGCTCAAACAGCACCAGCGTAGCGTTGATATGGTACAGATCGCGACGGCAGATTTCAATCTGCTTTTCATAGGCGCGGATCGTCCGCTCTAT
CTCTTTCCGCTTGGTTTCAAGCGTTCTGACTAGGTTTGGTTCGCCCATGGGCGGCATGGTGCCGTCAGTTGGGATGGAAATCCACAAAACGCCTTGGTGC
ATCTGCTACCTAAGGCCG
GTCGGTGGCTATGTTCCCGCAATACACAAGCCTGAACCGCGTTTCCCGGTTCTTGTTGAAGGCGTCCCTGTATCGAATTTCGCCGTAAACCCAAACACTT
GCCTTACCACTGGCGACAAGATCAAAATGTTGCTTGGAGAGTCTGTAGTTTGGTCCGAAGCCAGCGAAGGCTGTCGACATTTCCCCAGGCCCCAAGACTG
CCTTACGCGGGTCGCTATGGTCGATATCGTGTCTGACTTCATCAAATGGCCGATCACTCATTGCGCCATTCATACAATGGTGCAGCCCGTAAGCTGGGGT
TTGCCCAGTGTTCTTCATCGTGACCACTATTTCGACGGGCTCGCCAACCGCAAAGCGCCTGACCCTGCCTTGGCTAACGAGAACGTATGCCCGCAGTTGC
CGCTCTGCGTTGTCTCTAGTTTCCTCGAAGGTCTTGTCGGCAATTTCGGCGGACCTAACGGCGGCTTTGCCGGCAGTTATGCCAGCCTCAGCGGTGCGTC
TAGTTTCCCTGAACGTCTTGATGAGATACCAAAGGGTAAAAACCCCAATCGGCAACTGAGCGAGAGACACGAGAAACGAGTTCTGCGCCCAGTAAGTAGC
CCCAACCTGCGCAGCGGAACTTCTGTTCTCACGTTCCTCTTTTTTCTTTTCTGCTTCATCGCCAGCAGTTTTTTCTGCTCGGTATATCTCTATCGCTTCG
TCGAACATGGCGCGAAGAGTCGCCCGGTTGGCGATTATTTCGGTCTCATTAGTGGCGCCGCTATGATTCTCGCATTCTATTTGATCCTCGGTAGCGGCCC
CTAGGCTATATGGGCCGATCAGGCATCCTCCAATCGCAACGATGAGCAACCGACCGCCGATAAGCATTGGATCAAAACGCCGCTGTTAACCTGCCTCTAT
CGATCTTATTGGCGATGCTTCGCCATGTCTCTGCCGCGTCGCGCGCTGCCACTGGTATTTTCAACTCAAGCCATAGCAAGTTAGTCGCAACTCCCCATTT
ATGGGCCATATCCCATTGACTTGATTTGTCCGAAATACTGGAATGCCCCGCGATCTAGCGGGGGTGTATCTCATGGTTGGCGGGGCGGCTGGTAGTTCGA
GGGCAATATGGCAGGCAGCGAGAGAACCGAGGGAGCCGAGCCAACCGCAGCGACCGGAGCGGCCGGAGAGGAATCCGGAGCCGGCCCGGCACCCGGATCG
GGAGCCGGGCCGAACGAACGACCGGCCAGGACGGGAGATAAGGGAGAATACGGACCCGTCCCGTCACCCGGACCCGTGGCCTCCGCAACGCCGCTAGAGG
CTATAACGATTTCTTGCCTGGCAAACGCGCATTACCATTCAGGCCGGGAAGCGTTCTTAGACACTGTGCATCGCTGGTTTATGTTTTTGGTGATTGCGTT
CGGCGCGACCGCTTTGACGGACGTCTTGCCACGAATCTCGCATTCAGTGTTTGGGTTTAGCCTTGATGTCGGGGCGGTAAAGGAGGCTTGCGCGGCGAGC
GCGGCCATCATTGCGGCGCTTGATCTGACTTTTGATCTTTCCAATCGAGCCCGCTCGCATTCGATGATGAAGCGGCGGTACTACGAATTGCTGGCCGATC
TGAGAGAAGGCCAGAAAACGATTGAACATGGTCGGGTGTGCCTAGAGCGGTTTTCGGCTGACGAGGAGCCTATGTATCGGGTGCTATATCTAAGTTGCTG
GAACTCCGCCCAAATGACCGTATTCGGGAAACTGGCGAAAAAATTTGAAATCTCTTTTTGCGGGAATCTGTTCAAGAACTGGTTTCGAAGGCCATCTGCC
GTCTACCCCATGGTTGATCCGAGGGGCGATAGGGAGCGCGGGTTCGCCAGTTCTCAATCTGCCTGACAAGCCTATGTCTGCTTTTTGACAATCGGCCCCG
CAACAGAGTCCCCCGTAAACCTGATCCCGTCAGCCTCTAGCGCCCCGCGGATGGCCGCTAAGTTGTTGGCGATGAGGTTGCGCTTTCCGCTTTCGTAGTC
TTTCACGGTCGATAGTCCGACCTTGGCTCTAGTTGCCAATTCAGCTTGGGAGATATTTAACCAATTCCGCGCGGCGCGCGCTTGCTCTGGGCTCATTGTC
GGTCCTGCTATCGTGTTGTTCCGGCCACTATATATCCGTTTTGGTCAAGATCAACCATTTTAGTTGACGTAATCCAAATCAGCTAATATAGTCTATGTCA
TCCAAAAAGGATGACATTAGATGGCCCAGCACTTCCTTCTCTCGGCAAAAGCCCGCACTCTCTCGCTCAAGTCGATCTTCGCGGCTGGCGAGGAAAAGGC
TTACGAGACGTTCTGCAAGCTCCGCTGGCCTGCCACGGACGGCGAGCCCGTTTGCCCGTTCTGCGGTGGCCTGGACGCCTACAGGATCACCACGCGTCGT
CGCTTCAAATGTAAGGCTTGCCTGCGCCAGTTCTCGGTTACGAGCGGGACTATTCTCGCGTCGCGCAAGATGTCGTTCACCGACCTCTTGGCCGCGATCT
GCATCATCGTGAATGGCGCCAAGGGTATCTCTGCCCTGCAGCTCGCGCGCGACATCAACTGCCAGCACAAGACGGCTTTCGTGCTTTCGCACAAGCTGCG
CGAGGCTATGGCGTCGGAAGCAGCTAAGACGTTGGGTGGCGAAGTTGAGGTTGATGGGGCCTACTTTGGCGGCCACATTCGGCCCGCCAACTACGCCGAA
AACAGGATCGATCGGCGGTTAGCCGAGCACCAGACCGGCAAGCGCCGCGTCGTCATCGTTCTTCGCGAACGCAACGGCAGTACGGTTACATTCGTCCGCA
AGAGTGAAGCGGAAGGCGTCGATATCGCCAAGCGTATCGTTTCCCGCGACGCTATCATGCACGCTGACGAAGCGGCGCATTGGGACGCGCTGCGCGCCGG
TTGGCAGGTTCACAGGATTAATCACTCGGAGGCTTACTGCGATGACGGCGCGTGCACGAACCAAGCTGAAAGCTATTTCTCACGCCTTCGCCGCATGGTT
GACGGGCAACACCATTCCGTCTCGCCGCAATACCTGCACCAGTACGCAGGCCATGCCGCTTGGATTGAGGACAACCGCCGTCTCGACAACGGCGCGCTGG
CAAGCCGCATCGGCGGGTTGGCGATGGCTCATCCGGTTAGCCGGAATTGGAAGGGATATTGGCAGCGGGCGGCTTAATGTTGCGCATCTAAGCGCCATCG
ACTCGAACCCCGGCTTCATTATCTCTGGAGTGTTGAGTGCTGATCATCGTCGGGGTGCCGGATAGAGGTTATCGTTCTGGTAATGTATCCGCTGGCTAGT
TCTTTGATTGTCGTCTCTGTGAAGTCGTCAGCTTTAGAGAATATCCGCCTCGAAGTTAGGTTTGCTATTATCTTCTGCTCGAAGAGTCGCAACCTTTGCT
TTGATGTATCGACGTCCTGATCGAACGCAATCTCTCCGAGGGCGAACCTTACCATTTCTCCTAACGCAATTTGCATAGCGATGTCGTTGGCAATTTGTTC
TGGGGCGCGTATCTGAGATTTCAACTTTTCGACCTGCGACGTTAGGTCGGATATTTGCATCCTCAAGGTATCAATCTTTTGTTCGGTTGACTGCGACTCC
GGTTCCATGTTGCTAGCCCTCGTTGTTAAGAAGCCCAAACCCTAACCCCTTTCCGTTTCGGCAGGGCGGTAACTTTGCCTCGCTTGGCTGCCATATTTAG
GGCTTGGACTATACGGAAGGCTATTGAGGTTCTTAGCACCCTGTCGCCTTCGTCTAGGCCCTTGGCCCGGATGACGCGGATAGCGAGTTCACGGGTATCT
AGCGGCCCTTCCTTGGCAAGGGCTTCACGGCAAATCGTGGTCATTTCCCGAGGCTTAAATAGCCTCAAGGTGTTCATGTAGACCGGGAATTGGGTTCGTT
TGTCGGGGTGCTCAAACAGCACCAGCGTAGCGTTGATATGGTACAGATCGCGACGGCAGATTTCAATCTGCTTTTCATAGGCGCGGATCGTCCGCTCTAT
CTCTTTCCGCTTGGTTTCAAGCGTTCTGACTAGGTTTGGTTCGCCCATGGGCGGCATGGTGCCGTCAGTTGGGATGGAAATCCACAAAACGCCTTGGTGC
ATCTGCTACCTAAGGCCG
Protein section
ORF number : 4
ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
225 bp | 74 aa | 2196 | 1972 | - | No |
Annotation : Transcriptional regulatorDescription : Transcriptional Regulator factor
ORF sequence :
MSPEQARAARNWLNISQAELATRAKVGLSTVKDYESGKRNLIANNLAAIRGALEADGIRFTGDSVAGPIVKKQT
Blast result :ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
888 bp | 295 aa | 17 | 904 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MHQALAKGALLALCGLVTVGVMSHAAGRVGGYVPAIHKPEPRFPVLVEGVPVSNFAVNPNTCLTTGDKIKMLLGESVVWSEASEGCRHFPRPQDCLTRVA
MVDIVSDFIKWPITHCAIHTMVQPVSWGLPSVLHRDHYFDGLANRKAPDPALANENVCPQLPLCVVSSFLEGLVGNFGGPNGGFAGSYASLSGASSFPER
LDEIPKGKNPNRQLSERHEKRVLRPVSSPNLRSGTSVLTFLFFLFCFIASSFFCSVYLYRFVEHGAKSRPVGDYFGLISGAAMILAFYLILGSGP
MVDIVSDFIKWPITHCAIHTMVQPVSWGLPSVLHRDHYFDGLANRKAPDPALANENVCPQLPLCVVSSFLEGLVGNFGGPNGGFAGSYASLSGASSFPER
LDEIPKGKNPNRQLSERHEKRVLRPVSSPNLRSGTSVLTFLFFLFCFIASSFFCSVYLYRFVEHGAKSRPVGDYFGLISGAAMILAFYLILGSGP
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
759 bp | 252 aa | 1208 | 1966 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MAGSERTEGAEPTAATGAAGEESGAGPAPGSGAGPNERPARTGDKGEYGPVPSPGPVASATPLEAITISCLANAHYHSGREAFLDTVHRWFMFLVIAFGA
TALTDVLPRISHSVFGFSLDVGAVKEACAASAAIIAALDLTFDLSNRARSHSMMKRRYYELLADLREGQKTIEHGRVCLERFSADEEPMYRVLYLSCWNS
AQMTVFGKLAKKFEISFCGNLFKNWFRRPSAVYPMVDPRGDRERGFASSQSA
TALTDVLPRISHSVFGFSLDVGAVKEACAASAAIIAALDLTFDLSNRARSHSMMKRRYYELLADLREGQKTIEHGRVCLERFSADEEPMYRVLYLSCWNS
AQMTVFGKLAKKFEISFCGNLFKNWFRRPSAVYPMVDPRGDRERGFASSQSA
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
957 bp | 318 aa | 2321 | 3277 | + | No |
Chemistry : DDE
ORF sequence :
MAQHFLLSAKARTLSLKSIFAAGEEKAYETFCKLRWPATDGEPVCPFCGGLDAYRITTRRRFKCKACLRQFSVTSGTILASRKMSFTDLLAAICIIVNGA
KGISALQLARDINCQHKTAFVLSHKLREAMASEAAKTLGGEVEVDGAYFGGHIRPANYAENRIDRRLAEHQTGKRRVVIVLRERNGSTVTFVRKSEAEGV
DIAKRIVSRDAIMHADEAAHWDALRAGWQVHRINHSEAYCDDGACTNQAESYFSRLRRMVDGQHHSVSPQYLHQYAGHAAWIEDNRRLDNGALASRIGGL
AMAHPVSRNWKGYWQRAA
KGISALQLARDINCQHKTAFVLSHKLREAMASEAAKTLGGEVEVDGAYFGGHIRPANYAENRIDRRLAEHQTGKRRVVIVLRERNGSTVTFVRKSEAEGV
DIAKRIVSRDAIMHADEAAHWDALRAGWQVHRINHSEAYCDDGACTNQAESYFSRLRRMVDGQHHSVSPQYLHQYAGHAAWIEDNRRLDNGALASRIGGL
AMAHPVSRNWKGYWQRAA
Blast result :
Comments
The transposase is the fourth ORF, the others are passengers genes. ISRpa4 is 77% aa similar tyo ISAusp1.
References
1] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina
del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chain,P., Malfatti,S., Shin,M., Vergez,L., Schmutz,J., Larimer,F., Land,M., Hauser,L., Pelletier,D.A., Kyrpides,N., Anderson,I., Oda,Y., Harwood,C.S. and Richardson,P. (2006) Direct submission GenBank.
del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chain,P., Malfatti,S., Shin,M., Vergez,L., Schmutz,J., Larimer,F., Land,M., Hauser,L., Pelletier,D.A., Kyrpides,N., Anderson,I., Oda,Y., Harwood,C.S. and Richardson,P. (2006) Direct submission GenBank.