ISApr7

  • Family IS1595
  • Group ISNwi1
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
NZ_ABHC01000017 ND Alpha proteobacterium
Alpha proteobacterium BAL199
DNA section
IS Length : 3923 bp

Ends


IR Length : 25/30

IRL : GGCGACTATGCACTTGACGACACCACTCGGCGGGTGTAGTTAGAATGCAT
IRR : GGCGACTATGCACTTTGCAACACCAAGGCCGCGCAAGGCTACAACCGGCC

Insertion site


Left flankDirect repeatRight flankDR Length
TGGAATACGATAGAAACTCGCGCAGGAATCCCA8

DNA sequence

GGCGACTATGCACTTGACGACACCACTCGGCGGGTGTAGTTAGAATGCATGTCCAGCGACCTAACCCTTTTGGAAATCATGCGGCGCTTCTCCACCGAGG
AAGCGGCGCGTGCATATTTCGAGCGGATGCGCTGGCCGAACGGTCCGGTCTGCCCTCATTGCGGTAGCGCCGAAAAGAACTACGCGCTTACCCCCAACAA
GAAAGCCCGCATCCGCGAGGGCCTGTATAAGTGCGGAACCTGCGACGACCGATTTAGTGTGACCGTGGGGACGGTCATGGAGTCCTCGCATATCCCGCTG
CACAAGTGGCTGATCGCGTTCTATATGATGTGCGCCAGCAAAACGCAGATTTCCGCACTTCAGCTTCAGCGCCAGCTTGAGCTTGGTTCTTACCGGACCG
CGCACTTCCTATGTGCCCGCATTCGGTACGCCCTCAAGGATGCCGGCTCTGCTGGTCTGATTGGCGGCGAAGTCGAGGCCGACGAAACTTACATCGGCGG
CAAGGCCAAGGGCAAAGGTCGCGGCTACACCGGCAATAAGACCGCAGTCGTTTCGCTGGTCCAGCGCGGCGGTGAAGTTCGGTCGACCGTTGTTGCGGAG
CGCGTGACAGGTAAGACCATCGATACCCTGCTCCGACGCCACGTTACCGAGGAAGCCCACCTCAACACGGACGAGTCTCCCCTCTACAACAAGGCCGGTA
AGCGCTTCGCTTCGCATGCCCGCGTGAACCACTCCGCCGAAGAGTACGGCTATTACGATTACCGCTCGGGCCGCACCGTCACGACCAATACGGTCGAGGG
CTTCTTCGGCAACAGCAAGCGAAGCCTTGACGGTACGCACCACAACGTGAGCCGCCAGCATCTGCACCTATACACGGCGGAACTGGATTTCAAATACAAC
ACGCGGAAGTCGACGGACGGTGAGCGCACCGCCGAAGGCATCCGGCGCATTGAGGGGAAGCGCCTGATGTATAAACCTAAGGCTTCGGGCTGATGGCCCG
CGTTCGTGTTGCGGAAGTCCGCACGGAGGTTCTGCTAACCGAACTGCTCAAGGCCCAGGGATGGGATTGTCGGCGTCCGCCGAATGGCGAGATGCTGCGC
CAGCACGAATACAAAGACCATTCCCACCTGCGCGATGTGTTTCTGCACAGGAGCAAGGTGAGGATGATCGGGCATGGATTGCCCGAGGCCGTAGTGGTGG
ATCGGCAATCAATGCAGCCATTGATCGTGATCGAAGCGAAGGCGTCGATTTCAGACCTCGACAAAGCCCTGCGTGAGGCGACGGAGATTTACGGCAACGC
CTGTATCGACGCCGGTTACTCGCCCCTCGCCGTGGCTATCGCCGGGACCAGTGAGGATGACTTCGCAGTTCGCGTCCACAAGTGGAACGGCTCGGCGTGG
AAGGCCGTCACATACGAAGGCAACCCGATTGGGTGGATACCCAATCGTGTCGATGTCGAACGGCTCCGTGTGCCTTCCGCCACCCCCGAACTGCGCCCTT
CGGTCCCTAGTCCCGAAGTGCTGGCAAACTTCGCCGACGAGATTAACCGGCTGCTGCGCGAGTCCAACGTAAACGACCGCTCACGCCCCTCTGTCGTTGG
CGCGTGCATGCTCGCTCTCTGGCAGTCGAAGGGCGCCCTCCGCAAAGACCCGCGAAACATACTCGGCGACATAAATCAGGCGTGCGAAAAGGCGTTCTGG
AACGCGGGCAAAGCGGTGTTGGCCAAGAGCCTCCACGTTGACGAGGCGAATGACAAACTGGCGGTGAAGGCGCGGCGGATTATCAGTATCCTTGAGCGCC
TGAACGTCTCCGTTCTAACTGCCGAGCACGACTACCTTGGCCAGCTCTACGAGACGTTCTTCCGCTACGCTGGCGGCAATACGATTGGCCAGTATTTCAC
GCCGCGCCACATCGCGAGCTTCGGTGCCGATCTTCTCGGCGTTTCGATTGATGACGTAGTGCTCGACCCGACTTGCGGAACGGGCGGATTCCTCATCGCC
GCAATGGAGCGGGTCGCTCGCGAGCATCAGATTTCTCGCTCCGAAATGGTCAAGCTGGTAAGCACCCGGCTGATCGGCTTCGATGATGAACCCATCACGG
CTGCTCTTTGCGTCGCGAACATGATTTTGCGCGGCGATGGCTCATCTAGCGTGCATCGGGGCGATGCCTTCACGGCACCGGAGTATCCGATCGGCACGGC
GAGCGTTGTTCTCATGAACCCGCCGTACCCCCACAAGCAAACCGACACCCCTACCGAGGCGTTCGTGGAACGCGCGCTAGAGGGGTTGTCGCAAGGCTCG
CGCCTCGCTGCGGTCATTCCCCTGTCGCTGCTGGTCAAGAGCAACAAGGCAAGCTGGCGCAAGGCGATCCTGAAGAACAACACGCTAGAGGCAGCGATCA
AGCTCCCTGACGAGCTATTTCAGCCCTACGCGCAGCCCTACACGGTCATTGTCTATCTGCGGAAGGGCATCCCCCATCCGAAGGGCAAGCGCGCGTTCTT
CGCTCGTATCGAAAATGATGGCTTCCGCATTCGCAAGGCTGTTCGCGTTGCGTGCGAGGGATCTGAACTTCCCAAGATGCTGGCCCATTTTCAGGCCGGA
ACGAGCGAGGCAGGCGTCTGCGGTTGGTCCCAAGTGGACGAGGACGCAAGCTTTGGGCCGGGTGCATATATTCCCGCCAAGGAAATGACCGGCGAGGAAA
GCGACGACGCCACCCAAGAGGTAATTCGGGCACGCACATCGTTTGTCGCCTATCACGCCGCCGACCTGGTGCAGCTTTACACCGACAATCCGCTCGACGT
TCGTGCAATGCGCAAGAGGCCGTGGCAGTTCCAGGGTGTGAAGCTGGGCACCGTTGCCGCCTATTTCGACATCTACTACGGACAGAAAGAACTCCACAGC
AAAGATGGCCTGTTGCCGGGCCGCTCACTGGTCATATCGTCGTCCAAGTTCGATAACGGCTGCTACGGTCTCTTCGACTTCGAGCACATACTCAAACCGC
CTTTCGTGACCGTGCCGGGCAACGGCTCGATTGCCTACGCGCATGTTCAAGAGTGGCCCTGTGGCGTGTCCGATGATTGCATGCTCTTGCTTCCCAAGGA
GGGCGTGTCTCATTCGATGATGTACGTCGCCGCCGCAGCGATCCGAAACGAACGGTGGCGCTTCAGCTATGGGCGCAAGGCAACGCCGGACCGGATTGCG
GAGTTCCCGCTACCTCACACCGACGAACTGCTGGCGCGCGTTGACGAGTACCTTGCGCGAGCTGCTCGGGTTGAAGATCGCATGATCGAGGACGCAGAAG
ACGCGCTCGACAGCCAAACGGCACGCATGCGGCTAGCGGATTTGGGGAGCGGAAAAGCAACGGCAGTTTCGGGGGCGGAATTGGAGACGCGGCTTGCAGC
TATGATGGAGAACTAGGCGGGTGCCATACGTCGGCTTCGCCTTTACCACCGCCGCGCTGGACTTCTTAGCTACGCTGCCGCCGAAAATTCGGAAGCAGGT
CATTAAGAAGGCCAAGGCCCTGCATGCCAACCCGCATCCCCAAGGGTCGAAGAAGTTACACGGCGTGGTGACCGACGATGGCGATCCGGTGTATCGTGAG
CGATCTGGAGATTACCGCATCCTCTATGTGGTTCGCCCCGAAGAGGTGATGGTCCTCGATATTGACCATCGGAAGGACGTATATCGTATGCCTCAGACCA
AGGCAGAACCGGCCGACGAAATGAAAATGAAGGAAGCCGACTTCGACGCGATCATGAGCAAGGCGCTGGGCGTCGCTGCTCCACAGAACAAGGACGATGA
GCAGCCAGCTAAGCGGCTAAGCTCCTACCCGCCGAAGAAGCGCGGCACGTCCTAGGCCCAAAAAGCCGGTGTTGGCCGGTTGTAGCCTTGCGCGGCCTTG
GTGTTGCAAAGTGCATAGTCGCC
Protein section
ORF number : 3

 

ORF 1
LengthBeginEndStrandFusion ORF
915 bp304 aa79993+No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MRRFSTEEAARAYFERMRWPNGPVCPHCGSAEKNYALTPNKKARIREGLYKCGTCDDRFSVTVGTVMESSHIPLHKWLIAFYMMCASKTQISALQLQRQL
ELGSYRTAHFLCARIRYALKDAGSAGLIGGEVEADETYIGGKAKGKGRGYTGNKTAVVSLVQRGGEVRSTVVAERVTGKTIDTLLRRHVTEEAHLNTDES
PLYNKAGKRFASHARVNHSAEEYGYYDYRSGRTVTTNTVEGFFGNSKRSLDGTHHNVSRQHLHLYTAELDFKYNTRKSTDGERTAEGIRRIEGKRLMYKP
KASG

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
2424 bp807 aa9933416+No
ORF function : Passenger Gene
Annotation : N-6 DNA methylaseDescription :

ORF sequence :

MARVRVAEVRTEVLLTELLKAQGWDCRRPPNGEMLRQHEYKDHSHLRDVFLHRSKVRMIGHGLPEAVVVDRQSMQPLIVIEAKASISDLDKALREATEIY
GNACIDAGYSPLAVAIAGTSEDDFAVRVHKWNGSAWKAVTYEGNPIGWIPNRVDVERLRVPSATPELRPSVPSPEVLANFADEINRLLRESNVNDRSRPS
VVGACMLALWQSKGALRKDPRNILGDINQACEKAFWNAGKAVLAKSLHVDEANDKLAVKARRIISILERLNVSVLTAEHDYLGQLYETFFRYAGGNTIGQ
YFTPRHIASFGADLLGVSIDDVVLDPTCGTGGFLIAAMERVAREHQISRSEMVKLVSTRLIGFDDEPITAALCVANMILRGDGSSSVHRGDAFTAPEYPI
GTASVVLMNPPYPHKQTDTPTEAFVERALEGLSQGSRLAAVIPLSLLVKSNKASWRKAILKNNTLEAAIKLPDELFQPYAQPYTVIVYLRKGIPHPKGKR
AFFARIENDGFRIRKAVRVACEGSELPKMLAHFQAGTSEAGVCGWSQVDEDASFGPGAYIPAKEMTGEESDDATQEVIRARTSFVAYHAADLVQLYTDNP
LDVRAMRKRPWQFQGVKLGTVAAYFDIYYGQKELHSKDGLLPGRSLVISSSKFDNGCYGLFDFEHILKPPFVTVPGNGSIAYAHVQEWPCGVSDDCMLLL
PKEGVSHSMMYVAAAAIRNERWRFSYGRKATPDRIAEFPLPHTDELLARVDEYLARAARVEDRMIEDAEDALDSQTARMRLADLGSGKATAVSGAELETR
LAAMMEN

 

Blast result :
ORF 3
LengthBeginEndStrandFusion ORF
435 bp144 aa34213855+No
ORF function : Passenger Gene
Annotation : Hypothetical proteinDescription :

ORF sequence :

MPYVGFAFTTAALDFLATLPPKIRKQVIKKAKALHANPHPQGSKKLHGVVTDDGDPVYRERSGDYRILYVVRPEEVMVLDIDHRKDVYRMPQTKAEPADE
MKMKEADFDAIMSKALGVAAPQNKDDEQPAKRLSSYPPKKRGTS

 

Blast result :
Comments
ISApr7 is 57% aa similar to ISRpa1. The first ORF is the transposase, the second is a N-6 DNA methylase and the third is an hypothetical protein.
References
1] Hagstrom,A., Ferriera,S., Johnson,J., Kravitz,S., Beeson,K., Sutton,G., Rogers,Y.-H., Friedman,R., Frazier,M. and Venter,J.C. (2007) J Craig Venter Institute Direct submission GenBank.
2] ISfinder annotation (2008)