ISRpa4

  • Family IS1595
  • Group ISNha5
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) :
Accession number : NC_007925
Transposition : ND
Origin : Rhodopseudomonas palustris
Host : Rhodopseudomonas palustris BisB18
DNA section
IS Length : 4218 bp

Ends


IR Length : 25/26

IRL : CGGCCTTAGGTAGCAAATGCACCAAGCGCTAGCGAAGGGGGCCTTGCTGG
IRR : CGGCCTTAGGTAGCAGATGCACCAAGGCGTTTTGTGGATTTCCATCCCAA
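
The two terminal repeats listed above can be compared position by position. The following is a minimal Python sketch, assuming that "25/26" means 25 identical bases over the first 26 positions and that IRR is shown in the same orientation as IRL. The function names are illustrative and are not part of the record.

```python
# Terminal inverted repeats copied from the "Ends" section of this record.
IRL = "CGGCCTTAGGTAGCAAATGCACCAAGCGCTAGCGAAGGGGGCCTTGCTGG"
IRR = "CGGCCTTAGGTAGCAGATGCACCAAGGCGTTTTGTGGATTTCCATCCCAA"


def count_matches(left: str, right: str, span: int) -> int:
    """Count identical bases over the first `span` aligned positions."""
    return sum(1 for a, b in zip(left[:span], right[:span]) if a == b)


def mismatch_positions(left: str, right: str, span: int) -> list[int]:
    """Return the 1-based positions that differ within the first `span` bases."""
    return [i + 1 for i, (a, b) in enumerate(zip(left[:span], right[:span])) if a != b]


if __name__ == "__main__":
    span = 26  # assumed from "IR Length : 25/26"
    print(f"{count_matches(IRL, IRR, span)}/{span} identical, "
          f"mismatch at position(s) {mismatch_positions(IRL, IRR, span)}")
    # prints: 25/26 identical, mismatch at position(s) [16]
```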

Insertion site


Left flank, direct repeat, right flank : GATCCACGCGCTGGATTATCCGCGCGAGAAACT
DR Length : 8

DNA sequence

CGGCCTTAGGTAGCAAATGCACCAAGCGCTAGCGAAGGGGGCCTTGCTGGCCCTATGTGGCCTCGTTACCGTCGGCGTCATGAGCCATGCCGCCGGGAGG
GTCGGTGGCTATGTTCCCGCAATACACAAGCCTGAACCGCGTTTCCCGGTTCTTGTTGAAGGCGTCCCTGTATCGAATTTCGCCGTAAACCCAAACACTT
GCCTTACCACTGGCGACAAGATCAAAATGTTGCTTGGAGAGTCTGTAGTTTGGTCCGAAGCCAGCGAAGGCTGTCGACATTTCCCCAGGCCCCAAGACTG
CCTTACGCGGGTCGCTATGGTCGATATCGTGTCTGACTTCATCAAATGGCCGATCACTCATTGCGCCATTCATACAATGGTGCAGCCCGTAAGCTGGGGT
TTGCCCAGTGTTCTTCATCGTGACCACTATTTCGACGGGCTCGCCAACCGCAAAGCGCCTGACCCTGCCTTGGCTAACGAGAACGTATGCCCGCAGTTGC
CGCTCTGCGTTGTCTCTAGTTTCCTCGAAGGTCTTGTCGGCAATTTCGGCGGACCTAACGGCGGCTTTGCCGGCAGTTATGCCAGCCTCAGCGGTGCGTC
TAGTTTCCCTGAACGTCTTGATGAGATACCAAAGGGTAAAAACCCCAATCGGCAACTGAGCGAGAGACACGAGAAACGAGTTCTGCGCCCAGTAAGTAGC
CCCAACCTGCGCAGCGGAACTTCTGTTCTCACGTTCCTCTTTTTTCTTTTCTGCTTCATCGCCAGCAGTTTTTTCTGCTCGGTATATCTCTATCGCTTCG
TCGAACATGGCGCGAAGAGTCGCCCGGTTGGCGATTATTTCGGTCTCATTAGTGGCGCCGCTATGATTCTCGCATTCTATTTGATCCTCGGTAGCGGCCC
CTAGGCTATATGGGCCGATCAGGCATCCTCCAATCGCAACGATGAGCAACCGACCGCCGATAAGCATTGGATCAAAACGCCGCTGTTAACCTGCCTCTAT
CGATCTTATTGGCGATGCTTCGCCATGTCTCTGCCGCGTCGCGCGCTGCCACTGGTATTTTCAACTCAAGCCATAGCAAGTTAGTCGCAACTCCCCATTT
ATGGGCCATATCCCATTGACTTGATTTGTCCGAAATACTGGAATGCCCCGCGATCTAGCGGGGGTGTATCTCATGGTTGGCGGGGCGGCTGGTAGTTCGA
GGGCAATATGGCAGGCAGCGAGAGAACCGAGGGAGCCGAGCCAACCGCAGCGACCGGAGCGGCCGGAGAGGAATCCGGAGCCGGCCCGGCACCCGGATCG
GGAGCCGGGCCGAACGAACGACCGGCCAGGACGGGAGATAAGGGAGAATACGGACCCGTCCCGTCACCCGGACCCGTGGCCTCCGCAACGCCGCTAGAGG
CTATAACGATTTCTTGCCTGGCAAACGCGCATTACCATTCAGGCCGGGAAGCGTTCTTAGACACTGTGCATCGCTGGTTTATGTTTTTGGTGATTGCGTT
CGGCGCGACCGCTTTGACGGACGTCTTGCCACGAATCTCGCATTCAGTGTTTGGGTTTAGCCTTGATGTCGGGGCGGTAAAGGAGGCTTGCGCGGCGAGC
GCGGCCATCATTGCGGCGCTTGATCTGACTTTTGATCTTTCCAATCGAGCCCGCTCGCATTCGATGATGAAGCGGCGGTACTACGAATTGCTGGCCGATC
TGAGAGAAGGCCAGAAAACGATTGAACATGGTCGGGTGTGCCTAGAGCGGTTTTCGGCTGACGAGGAGCCTATGTATCGGGTGCTATATCTAAGTTGCTG
GAACTCCGCCCAAATGACCGTATTCGGGAAACTGGCGAAAAAATTTGAAATCTCTTTTTGCGGGAATCTGTTCAAGAACTGGTTTCGAAGGCCATCTGCC
GTCTACCCCATGGTTGATCCGAGGGGCGATAGGGAGCGCGGGTTCGCCAGTTCTCAATCTGCCTGACAAGCCTATGTCTGCTTTTTGACAATCGGCCCCG
CAACAGAGTCCCCCGTAAACCTGATCCCGTCAGCCTCTAGCGCCCCGCGGATGGCCGCTAAGTTGTTGGCGATGAGGTTGCGCTTTCCGCTTTCGTAGTC
TTTCACGGTCGATAGTCCGACCTTGGCTCTAGTTGCCAATTCAGCTTGGGAGATATTTAACCAATTCCGCGCGGCGCGCGCTTGCTCTGGGCTCATTGTC
GGTCCTGCTATCGTGTTGTTCCGGCCACTATATATCCGTTTTGGTCAAGATCAACCATTTTAGTTGACGTAATCCAAATCAGCTAATATAGTCTATGTCA
TCCAAAAAGGATGACATTAGATGGCCCAGCACTTCCTTCTCTCGGCAAAAGCCCGCACTCTCTCGCTCAAGTCGATCTTCGCGGCTGGCGAGGAAAAGGC
TTACGAGACGTTCTGCAAGCTCCGCTGGCCTGCCACGGACGGCGAGCCCGTTTGCCCGTTCTGCGGTGGCCTGGACGCCTACAGGATCACCACGCGTCGT
CGCTTCAAATGTAAGGCTTGCCTGCGCCAGTTCTCGGTTACGAGCGGGACTATTCTCGCGTCGCGCAAGATGTCGTTCACCGACCTCTTGGCCGCGATCT
GCATCATCGTGAATGGCGCCAAGGGTATCTCTGCCCTGCAGCTCGCGCGCGACATCAACTGCCAGCACAAGACGGCTTTCGTGCTTTCGCACAAGCTGCG
CGAGGCTATGGCGTCGGAAGCAGCTAAGACGTTGGGTGGCGAAGTTGAGGTTGATGGGGCCTACTTTGGCGGCCACATTCGGCCCGCCAACTACGCCGAA
AACAGGATCGATCGGCGGTTAGCCGAGCACCAGACCGGCAAGCGCCGCGTCGTCATCGTTCTTCGCGAACGCAACGGCAGTACGGTTACATTCGTCCGCA
AGAGTGAAGCGGAAGGCGTCGATATCGCCAAGCGTATCGTTTCCCGCGACGCTATCATGCACGCTGACGAAGCGGCGCATTGGGACGCGCTGCGCGCCGG
TTGGCAGGTTCACAGGATTAATCACTCGGAGGCTTACTGCGATGACGGCGCGTGCACGAACCAAGCTGAAAGCTATTTCTCACGCCTTCGCCGCATGGTT
GACGGGCAACACCATTCCGTCTCGCCGCAATACCTGCACCAGTACGCAGGCCATGCCGCTTGGATTGAGGACAACCGCCGTCTCGACAACGGCGCGCTGG
CAAGCCGCATCGGCGGGTTGGCGATGGCTCATCCGGTTAGCCGGAATTGGAAGGGATATTGGCAGCGGGCGGCTTAATGTTGCGCATCTAAGCGCCATCG
ACTCGAACCCCGGCTTCATTATCTCTGGAGTGTTGAGTGCTGATCATCGTCGGGGTGCCGGATAGAGGTTATCGTTCTGGTAATGTATCCGCTGGCTAGT
TCTTTGATTGTCGTCTCTGTGAAGTCGTCAGCTTTAGAGAATATCCGCCTCGAAGTTAGGTTTGCTATTATCTTCTGCTCGAAGAGTCGCAACCTTTGCT
TTGATGTATCGACGTCCTGATCGAACGCAATCTCTCCGAGGGCGAACCTTACCATTTCTCCTAACGCAATTTGCATAGCGATGTCGTTGGCAATTTGTTC
TGGGGCGCGTATCTGAGATTTCAACTTTTCGACCTGCGACGTTAGGTCGGATATTTGCATCCTCAAGGTATCAATCTTTTGTTCGGTTGACTGCGACTCC
GGTTCCATGTTGCTAGCCCTCGTTGTTAAGAAGCCCAAACCCTAACCCCTTTCCGTTTCGGCAGGGCGGTAACTTTGCCTCGCTTGGCTGCCATATTTAG
GGCTTGGACTATACGGAAGGCTATTGAGGTTCTTAGCACCCTGTCGCCTTCGTCTAGGCCCTTGGCCCGGATGACGCGGATAGCGAGTTCACGGGTATCT
AGCGGCCCTTCCTTGGCAAGGGCTTCACGGCAAATCGTGGTCATTTCCCGAGGCTTAAATAGCCTCAAGGTGTTCATGTAGACCGGGAATTGGGTTCGTT
TGTCGGGGTGCTCAAACAGCACCAGCGTAGCGTTGATATGGTACAGATCGCGACGGCAGATTTCAATCTGCTTTTCATAGGCGCGGATCGTCCGCTCTAT
CTCTTTCCGCTTGGTTTCAAGCGTTCTGACTAGGTTTGGTTCGCCCATGGGCGGCATGGTGCCGTCAGTTGGGATGGAAATCCACAAAACGCCTTGGTGC
ATCTGCTACCTAAGGCCG
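
The relationship between the sequence termini and the listed IRL and IRR can be checked with a short Python sketch. It assumes that IRR is reported as the reverse complement of the right end of the element; only the first and last 50 bases of the DNA sequence above are used, and the helper name is illustrative.

```python
# First and last 50 bases of the 4218 bp DNA sequence in this record.
LEFT_END = "CGGCCTTAGGTAGCAAATGCACCAAGCGCTAGCGAAGGGGGCCTTGCTGG"
RIGHT_END = "TTGGGATGGAAATCCACAAAACGCCTTGGTGCATCTGCTACCTAAGGCCG"

# Terminal repeats as listed in the "Ends" section.
IRL = "CGGCCTTAGGTAGCAAATGCACCAAGCGCTAGCGAAGGGGGCCTTGCTGG"
IRR = "CGGCCTTAGGTAGCAGATGCACCAAGGCGTTTTGTGGATTTCCATCCCAA"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print("left end equals IRL:", LEFT_END == IRL)
    print("reverse complement of right end equals IRR:",
          reverse_complement(RIGHT_END) == IRR)
```

Under that assumption, both comparisons hold for this record.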
Protein section
ORF number : 4

 

ORF 3
Length : 225 bp, 74 aa
Begin : 2196
End : 1972
Strand : -
Fusion ORF : No
ORF function : Passenger Gene
Annotation : Transcriptional regulator
Description : Transcriptional Regulator factor

ORF sequence :

MSPEQARAARNWLNISQAELATRAKVGLSTVKDYESGKRNLIANNLAAIRGALEADGIRFTGDSVAGPIVKKQT

 

ORF 1
Length : 888 bp, 295 aa
Begin : 17
End : 904
Strand : +
Fusion ORF : No
ORF function : Passenger Gene
Annotation : Hypothetical protein
Description :

ORF sequence :

MHQALAKGALLALCGLVTVGVMSHAAGRVGGYVPAIHKPEPRFPVLVEGVPVSNFAVNPNTCLTTGDKIKMLLGESVVWSEASEGCRHFPRPQDCLTRVA
MVDIVSDFIKWPITHCAIHTMVQPVSWGLPSVLHRDHYFDGLANRKAPDPALANENVCPQLPLCVVSSFLEGLVGNFGGPNGGFAGSYASLSGASSFPER
LDEIPKGKNPNRQLSERHEKRVLRPVSSPNLRSGTSVLTFLFFLFCFIASSFFCSVYLYRFVEHGAKSRPVGDYFGLISGAAMILAFYLILGSGP

 

ORF 2
Length : 759 bp, 252 aa
Begin : 1208
End : 1966
Strand : +
Fusion ORF : No
ORF function : Passenger Gene
Annotation : Hypothetical protein
Description :

ORF sequence :

MAGSERTEGAEPTAATGAAGEESGAGPAPGSGAGPNERPARTGDKGEYGPVPSPGPVASATPLEAITISCLANAHYHSGREAFLDTVHRWFMFLVIAFGA
TALTDVLPRISHSVFGFSLDVGAVKEACAASAAIIAALDLTFDLSNRARSHSMMKRRYYELLADLREGQKTIEHGRVCLERFSADEEPMYRVLYLSCWNS
AQMTVFGKLAKKFEISFCGNLFKNWFRRPSAVYPMVDPRGDRERGFASSQSA

 

ORF 4
Length : 957 bp, 318 aa
Begin : 2321
End : 3277
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MAQHFLLSAKARTLSLKSIFAAGEEKAYETFCKLRWPATDGEPVCPFCGGLDAYRITTRRRFKCKACLRQFSVTSGTILASRKMSFTDLLAAICIIVNGA
KGISALQLARDINCQHKTAFVLSHKLREAMASEAAKTLGGEVEVDGAYFGGHIRPANYAENRIDRRLAEHQTGKRRVVIVLRERNGSTVTFVRKSEAEGV
DIAKRIVSRDAIMHADEAAHWDALRAGWQVHRINHSEAYCDDGACTNQAESYFSRLRRMVDGQHHSVSPQYLHQYAGHAAWIEDNRRLDNGALASRIGGL
AMAHPVSRNWKGYWQRAA

 

Comments
The transposase is the fourth ORF; the others are passenger genes. ISRpa4 is 77% aa similar to ISAusp1.
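
The lengths and coordinates listed for the four ORFs can be cross-checked against each other. The sketch below is a minimal Python example, assuming that Begin and End are 1-based inclusive positions on the 4218 bp element and that each nucleotide length includes the stop codon; the function name and tuple layout are illustrative.

```python
# (name, length_bp, length_aa, begin, end, strand) as listed for each ORF.
ORFS = [
    ("ORF 1", 888, 295, 17, 904, "+"),
    ("ORF 2", 759, 252, 1208, 1966, "+"),
    ("ORF 3", 225, 74, 2196, 1972, "-"),
    ("ORF 4", 957, 318, 2321, 3277, "+"),  # transposase
]
IS_LENGTH = 4218


def check_orf(length_bp: int, length_aa: int, begin: int, end: int, strand: str) -> bool:
    """Check that one ORF entry is internally consistent."""
    span_ok = abs(end - begin) + 1 == length_bp          # inclusive coordinates
    codons_ok = length_bp == 3 * (length_aa + 1)         # residues plus stop codon
    strand_ok = (begin < end) == (strand == "+")         # minus strand runs high to low
    inside = 1 <= min(begin, end) and max(begin, end) <= IS_LENGTH
    return span_ok and codons_ok and strand_ok and inside


if __name__ == "__main__":
    for name, *fields in ORFS:
        print(name, "consistent" if check_orf(*fields) else "inconsistent")
```

With these assumptions, all four entries are consistent; ORF 3 is the only one on the minus strand, which is why its Begin is larger than its End.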
References
[1] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chain,P., Malfatti,S., Shin,M., Vergez,L., Schmutz,J., Larimer,F., Land,M., Hauser,L., Pelletier,D.A., Kyrpides,N., Anderson,I., Oda,Y., Harwood,C.S. and Richardson,P. (2006) Direct submission GenBank.