|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Internal Transposable Elements | |
|
|
Internal Repeats | |
|
|
|
|
|
|
|
|
|
|
|
Name: Tn6122 |
|
Family: Tn3 Group: Tn163 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Paracoccus halophilus JCM 14014 | Molecular Source: | plasmid pHAL1 |
Place of Origin: | South China Sea 2008, China | Date of Isolation: | 2012 |
| | Other Geographic Information: | marine sediment |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
| | |
IRL (Length: 39 bp) | | GGGGTGGATTCCGGTAGGGGGCAGGATCGTATCACAAGG |
IRR (Length: 39 bp) | | GGGGTGGATTCCGGTAGGGGGCAGGATCGTATCACAAGG |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTGGATT CCGGTAGGGG GCAGGATCGT ATCACAAGGA ACTCTGTCGC TGCCGCTTCA TAATATCGAG CACAGTATTT TTGCTGATCC CAAGGTCGCG 100
GGCGATCCAT CGGTATGATC GACCGTCTTC CACGGCTTGA ATGACCTTTG GCGCGAGCTT GTCGGATTTT GGGCGTTGTC CTGGCTGGCG ACCGAGCTTC 200
TTTCCTCGCG CCCTTGCGGC GGCGAGCCCG GATTTCACCC TTTCGCTGAG CAGATCGCGT TCAAACTGAG CAATCCCGGC CAGCATGGTG GCCATCATTC 300
GACCATGTGG CGTCTCTACT TCGAAGGTCA TGCCGCTCAT GGCTACAACG GAGACTTTCC AGCCGGCGAG CCTGTTCAGG GTATCCAGCA GGTCCTGAGT 400
GGACCGCCCC CACCGAGAAA GTTCGCTGAC CAGGATGGCG TCGATTTGTC GTGCCTGCGC CAGGTCGAGT ACCCGGTTGC GCGCAACGCG ATTGTCGGAC 500
GCGCCAGAGG CTTTTTCCTT GAACACTCCC ACCACGTCAT ACCCGCCGCG CTCGGCGAAC GCGGTGAGCT CCGTGACCTG CCGCTCGCAG GACTGGTCGG 600
ACGTGGAAAC CCGGGCATAG ATAGCGGCGC GCTGTCCCAT TTGAACCCCC TTTGAATCGG GCCTTGCAAG TCGTTGGTTT TGCTGGTGCG GGAATTGTCC 700
TGAACAGACT AATGACTAAC AAGGACAAAC TCGTATGCCA AGACGCTCTA TCCTGACCGC GCGCCAGCGC GCGGCGCTGT TCGATCTGCC GACAGACGAA 800
GCCACCATCC TCTACCACTA CACACTGGCG GACGATGACC TCGAATTCAT TCGAACCCGT CGCCATGCAC GCAATCGCCT CGGCTTCGCC CTTCAGCTTT 900
GTGCTTTCCG ATATCCTGGC CGACTGTTGG CGTCGGGTGA GGTCATTCCA GAGGCCATCA GCAGCTTCAT CGCAGCACAG CTTGGCGAAG AGATGGATGA 1000
GCTTTGCCGA TACGCCGAGA CCGACGTAAC GCGGCGCAGG CATTTGGTCG ACTTGCGCCA GCTCTATGGC TTCAAGATGT TCTCCGGGCA TCGCGCGCGG 1100
GAACTGAAGG CCTGGCTGGC AGGCGAGGCG GAGGTGGCAA CCTCCAACCA TGATCTGGCC CGCCGGTTTG TCGAAGAGTG TCGCCGGACC CAGACGATCT 1200
TGCCGGGTGT GTCCGTGATC GAGCGCCTCT GCGCGGATGC ACTCGTGGCG GCGGAACGCC GGATCGAAAG CGCCATCGTG AACCGGCTGG ATGAGGATAG 1300
AAAGGCCCGC CTGGACACCC TGCTCACGGA GATGGTCGAT GGCACGGTCA GCCGGTTCGT CTGGCTGCGC CAATTCGAGG TTGGTCAGAA CTCGGCCGAT 1400
GTTGGCCGCC TCCTCGATCG GCTGGAGGTT CTGCGGGATC TGAACATCTC GCTAGATATC CTCGCGGGCA TTCCACCCCA TCGGGTGACC GGCCTGCGGC 1500
GTCAGGGCGA ACGCTACTTT GCCGATGGTC TGCGTGACAT CAGCAGCGAC CGCCGCCTGG CCATTCTCGC GGTTTGCGCC GTCGAGTGGT GTGCCGCGAT 1600
TTCCGACGCG GTGGTCGAGA CCCATGACCG GATCGTCGGC AAGACCTGGC AGGGCGCGAA GAGGCTGTGC GACGCCAAAA TCGCCGACGC AAAGGCGGCG 1700
GTGCAGGACA CCCTGCGCGC CTTCAAATCT CTCGGGTCGG CCCTGCTGGA CGCCAGAAGC GATGGGTCTT CCTTGGATCA GGCGACAGAG CTGGCTTGCG 1800
GGTGGGGTCG GCTCGAAGGG CTGGTCGCCA CCGCCGCCGA GTTGACGCAC ACGATGGCCG CAGACCCATT GACCCATGTT GGCCAAGGTT ATCTTCGGTT 1900
CCGGCGCTAT GCCCCGCGCA TGCTGCGCGT TCTCGATATC GAGGCGGCGG GTATCGCAGC TCCCTTGCTG CAGGCGACCG CGCTGATCGC AGACGATGAA 2000
AAGCCCGAAA CCCGCCCAGT TGGCTTTCTG CGGCGCGGCT CGAAATGGCA CCGCCATCTG AACGCTGCCG CCGGTGACGG CCATCGGCTC TGGGAGGTTG 2100
CGGTGCTGTT TCATTTGCGG GAGGCGTTCC GGTCGGGTGA CATCTGGCTC CTCCATTCCC GGCGCTACGC TGATCTGAAG CAGGCCCTGG TTCCTGCGGA 2200
GGCGGTAGAG GGCGCGCCAA GGCTGACGGT GCCGCTCGAC CCGGAAACCT GGCTTGCCGA TCGCAAGGCG CGCATGACCG ACGGGCTGGC GCGACTGGCC 2300
AAGGCGGCTC GCGCAGGGGC GATCCCGGGC GGGTCGATCG AGAATGGCGT CTTGAAAACT GACCGCCTGA GTGCGGCGGT GCCGCAAGAG GCGGACGAGC 2400
TGGTGCTCGA TCTCTACGAC CGGTTGCCGA CCATACGGAT CACCGAACTG CTGCAGGAGG TCGATGCAGA TATTGGCTTC ACGGAAGCCT TCACACATCT 2500
GCAGACCGGC GCGCCCTGCA AAGACCAGAT CGGGATGCTG ACCGTGCTGT TGGCTGAAGG TTTGAATCTT GGGCTCAGCA AGATGGCCGA AGCGACCAGC 2600
ACACATGACT ACTTCCAACT CTCCCGCTTG TCGCGATGGC ATATGGAGAG CGATGCCATC AACCAGGCCC TCGCCTTGGT AATTGAGGCA CAGGCCCGGC 2700
TGCCCATGGC CCAATTCTGG GGCGGGGGCG TCACCGCGTC CAGTGACGGG CAATTCTTCC CCGCCGCCCG GCAGGGCGAA GCCATGAACC TCATCAACGC 2800
GAAATACGGC TCCGAGCCAG GCCTCAAGGC CTATACCCAT GTCTCCGACC AGTTCGGCCC CTTTGCCACC CAGAACATTC CGGCCACCGT GAGCGAGGCG 2900
CCCTACATTC TGGACGGGCT GCTGATGAAC GAGGCCGGAC AAAAGATCTC GGAGCAGTAT GCCGACACCG GTGGCTTTAC AGACCAGGTC TTTGCCGTGA 3000
CCGCGCTGCT GTCCTACCGG TTCATTCCGC GCATCCGGGA TCTGCCGTCA ACGCGGCTCT ATCTCTTCGA TCCCGCCACC GCGCCGAACG AGTTGCGCGG 3100
CTTGATCGGC AGCAAGATCC GCGAAGGCGT CATCGTCCAG AACTGGCCTG GCGTCCTGCG TGCCGTGGCC ACCATGGCGA CGGGCGTCCT GCCGCCCAGT 3200
CAGTTGCTCA AAAAGTTCGC CGCCTATCCG CGCCAGCACG AGCTGGCCGT GGCGCTCCGC GAAATCGGCC GCATCGAACG CACCCTCTTC ATCATCGAGT 3300
GGCTACTCGA CGCCGACATG CAGCGCCGGG CCCAGATCGG CTTGAACAAG GGCGAGGCCC ATCACGCCCT GAAAAACGCC CTGCGCATCG GGCGACAGGG 3400
CGAAATCCGT GATCGCACCG CCGAGGGGCA GCACTACCGC ATGGCCGGCC TCAACCTGCT CGCTGCCATC ATCATCTACT GGAACACCAA GCATCTCGGT 3500
CAGGCCGTCG CCGCGCGTCA GCGGGCCGGG TTGAATTGTG CGCCCAATTT ACTGGCGCAT ATCTCGCCGC TCGGCTGGGC ACACATCCTG CTCACTGGCG 3600
AATACCGGTG GCGGCGGCAA ACCGGGGCCA GCTTGTGAAG TAATCAGCCC TATTTCTTGC CACCGTCACG GCTCCCAAAC GGACGCACCG CGATCATCGC 3700
CACCACAAAG GCGGCACAGA GTTCAAAATG GAGGAAAGAA GGGGGTGTCA GGGCCTTGTG ATACGATCCT GCCCCCTACC GGAATCCACC CC
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
res |
641-734 |
94 |
TTGAACCCCC TTTGAATCGG GCCTTGCAAG TCGTTGGTTT TGCTGGTGCG GGAATTGTCC TGAACAGACT AATGACTAAC AAGGACAAAC TCGT |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
tnpR |
Tn6122 |
32-640 |
Accessory Gene |
Resolvase |
- |
tnpA |
Tn6122 |
735-3638 |
Transposase |
|
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpR |
TnpR |
Tn6122 |
609 |
32-640 |
- |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transpoase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MGQRAAIYAR VSTSDQSCER QVTELTAFAE RGGYDVVGVF KEKASGASDN RVARNRVLDL AQARQIDAIL VSELSRWGRS TQDLLDTLNR LAGWKVSVVA MSGMTFEVET PHGRMMATML AGIAQFERDL LSERVKSGLA AARARGKKLG RQPGQRPKSD KLAPKVIQAV EDGRSYRWIA RDLGISKNTV LDIMKRQRQS SL
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tnpA |
TnpA |
Tn6122 |
2904 |
735-3638 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Protein Sequence:
|
MPRRSILTAR QRAALFDLPT DEATILYHYT LADDDLEFIR TRRHARNRLG FALQLCAFRY PGRLLASGEV IPEAISSFIA AQLGEEMDEL CRYAETDVTR RRHLVDLRQL YGFKMFSGHR ARELKAWLAG EAEVATSNHD LARRFVEECR RTQTILPGVS VIERLCADAL VAAERRIESA IVNRLDEDRK ARLDTLLTEM VDGTVSRFVW LRQFEVGQNS ADVGRLLDRL EVLRDLNISL DILAGIPPHR VTGLRRQGER YFADGLRDIS SDRRLAILAV CAVEWCAAIS DAVVETHDRI VGKTWQGAKR LCDAKIADAK AAVQDTLRAF KSLGSALLDA RSDGSSLDQA TELACGWGRL EGLVATAAEL THTMAADPLT HVGQGYLRFR RYAPRMLRVL DIEAAGIAAP LLQATALIAD DEKPETRPVG FLRRGSKWHR HLNAAAGDGH RLWEVAVLFH LREAFRSGDI WLLHSRRYAD LKQALVPAEA VEGAPRLTVP LDPETWLADR KARMTDGLAR LAKAARAGAI PGGSIENGVL KTDRLSAAVP QEADELVLDL YDRLPTIRIT ELLQEVDADI GFTEAFTHLQ TGAPCKDQIG MLTVLLAEGL NLGLSKMAEA TSTHDYFQLS RLSRWHMESD AINQALALVI EAQARLPMAQ FWGGGVTASS DGQFFPAARQ GEAMNLINAK YGSEPGLKAY THVSDQFGPF ATQNIPATVS EAPYILDGLL MNEAGQKISE QYADTGGFTD QVFAVTALLS YRFIPRIRDL PSTRLYLFDP ATAPNELRGL IGSKIREGVI VQNWPGVLRA VATMATGVLP PSQLLKKFAA YPRQHELAVA LREIGRIERT LFIIEWLLDA DMQRRAQIGL NKGEAHHALK NALRIGRQGE IRDRTAEGQH YRMAGLNLLA AIIIYWNTKH LGQAVAARQR AGLNCAPNLL AHISPLGWAH ILLTGEYRWR RQTGASL
|
|
References |
|
|
Dziewit L, Baj J, Szuplewska M, Maj A, Tabin M, Czyzkowska A, Skrzypczyk G, Adamczuk M, Sitarek T, Stawinski P, Tudek A, Wanasz K, Wardal E, Piechucka E, Bartosik D. Insights into the transposable mobilome of Paracoccus spp. (Alphaproteobacteria). PLoS One. 2012;7(2):e32277. doi: 10.1371/journal.pone.0032277. Epub 2012 Feb 16. PubMed ID: 22359677
| |
| | |
|
|