Name: TnAli20 (Synonyms: Tn7136) |
|
Family: Tn3 Group: Tn163 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Azospirillum lipoferum 4B | Molecular Source: | plasmid AZO_p5 |
| | | |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
|
IRL (Length: 49 bp) | GAGGGCATCGCGACATTTGTTCGGCAGACAACGCTATAAGGTCGAACGC |
IRR (Length: 48 bp) | GAGGGCATCGCGACATTTGTTCGGCAGACAACGCTAAGGTCGAACGGA |
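The relationship between the two inverted repeats and the ends of the element can be checked directly. The following is a minimal sketch, not part of the original record: the helper names `reverse_complement` and `shared_prefix_length` are hypothetical, and `ELEMENT_TAIL` is the last 48 bases (positions 3781-3828) copied from the Sequence section. It assumes IRR is written reading inward from the right end, so that its reverse complement should match the 3' end of the listed strand.

```python
# Terminal inverted repeats as listed in this record.
IRL = "GAGGGCATCGCGACATTTGTTCGGCAGACAACGCTATAAGGTCGAACGC"
IRR = "GAGGGCATCGCGACATTTGTTCGGCAGACAACGCTAAGGTCGAACGGA"

# Last 48 bases of the element sequence (positions 3781-3828),
# copied from the Sequence section of this record.
ELEMENT_TAIL = "TCCGTTCGACCTTAGCGTTGTCTGCCGAACAAATGTCGCGATGCCCTC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def shared_prefix_length(a: str, b: str) -> int:
    """Count how many leading positions of a and b are identical."""
    count = 0
    for x, y in zip(a, b):
        if x != y:
            break
        count += 1
    return count


if __name__ == "__main__":
    print("IRL length:", len(IRL))
    print("IRR length:", len(IRR))
    print("IRR matches right end:", reverse_complement(IRR) == ELEMENT_TAIL)
    print("Identical leading positions:", shared_prefix_length(IRL, IRR))
```

With the values in this record, the two repeats are identical over their first 36 positions and diverge after that, which is consistent with the one-base difference in their reported lengths.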
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GAGGGCATCG CGACATTTGT TCGGCAGACA ACGCTATAAG GTCGAACGCA CCGGGAACAG CGATGGATGA TTGCCCAGCC TGATGCTGGT CAGGAGGTGT 100
ACGGCGGCCT CTGGAGGCCC GCCAAGCTCA TCTCCCGGTA AACGGTGGAT CTCCCCAACC CCAGCTGCTT GGCGGCTTCG GTCGGCGACA GGCCAGCTTC 200
CACCAGTTTC AGTGCCGCTT TGATCTTGTC CATGTCCACC GGCTCGCGAC CTGGGCGTTT GCCGCGGGCC CGGGCGGGCG CGATGCCGTC CTTCGTGCGC 300
TCGGCGATCA GCCGCTGTTC GAAATGCGCA ATCGCGCCGA AGACGTGGAA CACCAGTTCG CCGGCCGCCG AAGTGGTGTC GATCCTCTCC TCCAGGCTGA 400
GCAGGCCGAT CCCGCGCTCC TTCAGCATCG CCACTGTCGC CAGCAACTCA GCGAGCGATC GGCCGAGCCG GTCGAGGCGG ACAATGGCCA GCGTGTCGCC 500
ACGGCGCGCA TGGGCGAGCA ACGCCTCCAA CCCTGGCCGG TCCTTCGCCT TGCCCGAGCA CACGTCGGTG AAGACCTTCA AGGCGCCGGC CTGCTCCAGC 600
CGCAACCGTT GGCCGGCGAC ATCCTGGTCG CCGGTCGAGA CCCGTGCGTA GCCCAGCATG TCGCCCATGC CCGTCCCCCA ACCGGCCGTT CTGTGGACGG 700
TCGCGCCGAA CCCGATCAGA CTGCTGCGCC GCCGTCCACA GAATACGTCC CGTTAATCCC TGCCGTCCAC AGTCCAATCC GACGTTTTCT GGACAGGGAA 800
AGACCACGTG GCACGACGCC AATTGCTGAC GGAGGAGGAA CGCCGACTGT TGTTCGGCCT GCCCACGGAT CGGGACGCCT TGGCCCGCCA TTACACGTTC 900
ACCCGCTCGG ATTTGGACCT CATCGCCAGC CGACGCGGTA ATGCCAACCG CTTGGGCTTC GCCGTGCAAT TGGCGCTCTT GCGTTATCCC GGATTGCCTC 1000
TGCCCCATAT CGGCGAGCCG ATCGACGCCG TCGTCGGCTG GGTGGCCGAA CACCTCGAGC TGCCGGTGAC GGCCTTCGCC GAGTATGCGC GTCGATCACA 1100
AACGATGACC GATCACGCCC GCGACACTGT CGCCGCGCTT GGGCTGCGGT TCCCACGCGA GGCCGATTTG CCTGACTTGA TCGAGGCCGC AGCGCAGGCG 1200
GCCTGGATTT CCGACCAAGG AATGTCGATC ATGACCGGGA CCATCGCTGC ACTCCGGTCA GCGAAGATCG TCCTGCCGTC CCCGGCGGTG ATCGAGCGCG 1300
CCGCCCTGGC GGGCCGTGCT CGCGCCCGAA AGCGGGCGGC GGATGCCTTG GTGGCCGATC TCACGGCCGA GCAGCGGGAT AAACTCGACA AGCTGCTGGC 1400
CGTTGATCCA GCAACCGGGA TCACCTCGCT GACCTGGTTG AGGACTATTC CCACGGCGCC GAAAGCGGAT CACGTTCGCG ACGTGATCGA CAAACTTCAC 1500
GTCGTTCGCG GCATCGGAAT CGATGCCGAG GCGCAGGCGC GTGTCCATGA AACCCGCTTT CGCCAGTTTG CCCGCGAGGG CATGGCCTCG CCCACCTACC 1600
TGATCGAGCG CTGTGCGCCG AACCGGCGGC GCGCGACACT GGTGGCGTTG CTGATCGACC TGGAGAACCG GCTCACCGAC GCCGCGCTCG ACATGGCCGA 1700
CAAGCTGATC GGCGGTGCGT TCACGCGCGC GAAGAACAAC AAGGAAAAGA CCTGCGTCGC GAAGACGAAG GACGTCGGCC GTCTGATGCG TCTGTTTCAC 1800
CGCACCATCG AGGCGCTCAG CCTAGCCCAG GAAAGCGACG GGGACGCCTT CGCTCTCGTC AACGAGGCGG TGGGCTGGCC GCAGTTGCTG CGCGTGCGCG 1900
GCGAGGTGGC CAGCCTCGCC GAGCTCGCGG AAGAGGACCC GCTGGTCCGC GCCGCCGACC GGTACGTCAC CATCCGGAAG TTCGCCCCGG CGCTCCTCGA 2000
AGCGCTCACG TTCAAGGCTG CCCGGAGCAA GGACCCGATC CTGGCGGCGG TCGAGTTGCT CAAGGAGCTC AACCGATCCG GCAAGCGCGA CATCCCGGCG 2100
GACGCGCCGA TGCTGTTCCG CAAGGAGTGG CGGCGCCTCG TCACCAAGGA CGGCAAGCCC AACCGGCGGC TCTACGAGAC AGCGGTGCTC GCCACCCTGC 2200
GCAACAAACT GCGCTCGGGC GACGTGTGGG TGGAGCGGTC GTCCAACTAC CGCCGCTTCG ACAGCTATCT GCTGCCCGCG GCGGCGGCGG CCCCGATCTC 2300
AGCGGATTTG ACGCTGCCCG CGACGGCCGA AGAGTGGCTG GGGGCGCTGG GGCGCGACCT CGACGAACAG CTGAAGCGTT TTGCCCAGCG CCTGCGCGAC 2400
GGCCAGCTCG AGGGCGTCGA ATTGCGCGAT GAGCGGCTGC ACATCGCGGC GCTGAAGGCG ACCGCGCCGC CAGAAGCGGA CGTTCTCGCC GACCGGCTCG 2500
ACGCCCTTCT GCCGCGCGTG CGCATCACCG AACTGCTGCA CGAGGTCAAC CGCGCGACCG GCTTCGCGGC GGCGTTCACC AACCTGCGCA CCGGTGAATC 2600
CTGCGACAAC GAGAACGCGC TGCTCGCCGT CATCCTAGCC GACGGCACCA ACCTGGGCCT GACACGCATG GCGGAGGCCA GCCAGGGCGT GACCCGCGAC 2700
CAGCTCATCT GGACCGCCGA CGCCTGCATC CGGCCTGAAA CCTACCAGTC GGCCCTGGCC CGGATCATCG ACGCTCACCA TCGGCTGCCC ATGGCCGCCG 2800
TCTGGGGTGG CGGAACGACG TCCTCATCGG ACGGCCAGTT CTTCCGTTCC GGCAAGCGCG GCAACGTCGC CGGCGAGGTG AACGCCCGGT ATGGCGGCAG 2900
TCCCGGCTTC AGCTTCTACA CCCACGTCTC GGACCAGCAC GGTCCGTACC ATGTCCGGGT CATCTCGGCG GCGGCCCACG AGGCCCCCTA CGTTCTGGAC 3000
GGCCTGCTGC ACCATGGGAC CGGCTTGAAG CTCGACACCC ACTACGTCGA TACAGGCGGC ACCTCGGATC ACGTGTTCAT TTTGGCCGCC ATGCTCGGCT 3100
TCCGCTTCTG TCCTCGCCTG CGCGATTTTC CCGAACGTCG GCTGGCCAGC ATCGAGCCGT CGAGCTGTTA TCCGGACCTC CAGCCACTGC TGGGCCGGCG 3200
GGTCAAGGTG GACGTCATCC GTGAGCATTG GAACGACGTG GTGCGCCTGG TCGCGTCGCT GAAGGCCGGC ACCGTGGCGC CCTCGACCAT GTTGAAGAAG 3300
CTGGCCGCCT ACGAGCGACA AAACCAGCTT GATCTGGCGC TCCAGGAACT GGGCCGCATC GAGCGCACAC TCTTCATGAT CCGCTGGTTG GAAACACCCG 3400
AGCTCAGACG GAGCTGTCAC ATCGGGTTGA ACAAAGGGGA GCAGCGTCAC GCTCTGGCCC AGGCGATCTG CACGTTCAAA CAGGGCCGGA TCGCCGACCG 3500
CGGGTCCCAA GCGCAGCAGT ATCGCGCCTC GGGGCTGAAC CTGCTCATCG CCGCGATCGT CTATTGGAAC TCGACCTACA TGGCCGACGC GGTTGGTCAT 3600
CTGCGCGCCG TCGGCGGGAC CGTACCCGAC GACCTGCTCG TCCACACCTC ACCGGTCGGC TGGGAGCACA TCGGTCTGTC CGGCGATTTC CTGTGGGGCC 3700
GCGCCGCGGC CGTGCCCATC GGCAGGCGAC CGCTTAACCT GCGACGGGAC CGCCATGCCG CCTGAAAGCC AAGTTCCCGG TCCGTTCGAC CTTAGCGTTG 3800
TCTGCCGAAC AAATGTCGCG ATGCCCTC
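The sequence is laid out in blocks of 10 bases with a running coordinate at the end of each full line. The sketch below shows one way to turn such lines back into a contiguous string. The function name `parse_block_line` is hypothetical, and the rule that a purely numeric final token is a coordinate rather than sequence is an assumption about this layout. Only the first and last lines of the record are embedded here.

```python
def parse_block_line(line: str) -> str:
    """Strip block spacing and the trailing coordinate from one sequence line."""
    tokens = line.split()
    # Assumption: a purely numeric last token is the running coordinate.
    if tokens and tokens[-1].isdigit():
        tokens = tokens[:-1]
    return "".join(tokens)


# First and last sequence lines, copied from this record.
FIRST_LINE = (
    "GAGGGCATCG CGACATTTGT TCGGCAGACA ACGCTATAAG GTCGAACGCA "
    "CCGGGAACAG CGATGGATGA TTGCCCAGCC TGATGCTGGT CAGGAGGTGT 100"
)
LAST_LINE = "TCTGCCGAAC AAATGTCGCG ATGCCCTC"

# IRL as listed under Terminal Inverted Repeats.
IRL = "GAGGGCATCGCGACATTTGTTCGGCAGACAACGCTATAAGGTCGAACGC"

if __name__ == "__main__":
    first = parse_block_line(FIRST_LINE)
    last = parse_block_line(LAST_LINE)
    print(len(first), len(last))
    print("Starts with IRL:", first.startswith(IRL))
```

Applying the same function to every sequence line and concatenating the results gives a string that can be sliced with the 1-based ORF coordinates, after converting them to 0-based offsets.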
|
|
|
|
ORFs |
|
|
Gene Name | Associated TE | Coordinates | Class | Sub Class | Orientation |
tnpR | TnAli20 | 90-668 | Accessory Gene | Resolvase | - |
tnpA | TnAli20 | 808-3765 | Transposase | | + |
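The ORF coordinates, gene lengths, and protein lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: `orf_length` and `codon_count` are hypothetical helper names, coordinates are assumed to be 1-based and inclusive, and each ORF is assumed to end with a single stop codon that is not shown in the protein sequence. The TnpR protein string is copied from this record.

```python
# ORF coordinates (1-based, inclusive) and strand, as listed in this record.
ORFS = {
    "tnpR": (90, 668, "-"),
    "tnpA": (808, 3765, "+"),
}

# TnpR protein sequence copied from this record, with block spacing removed.
TNPR_PROTEIN = (
    "MGDMLGYARV STGDQDVAGQ RLRLEQAGAL KVFTDVCSGK AKDRPGLEAL LAHARRGDTL "
    "AIVRLDRLGR SLAELLATVA MLKERGIGLL SLEERIDTTS AAGELVFHVF GAIAHFEQRL "
    "IAERTKDGIA PARARGKRPG REPVDMDKIK AALKLVEAGL SPTEAAKQLG LGRSTVYREM "
    "SLAGLQRPPY TS"
).replace(" ", "")


def orf_length(start: int, end: int) -> int:
    """Length in bp of an ORF given 1-based inclusive coordinates."""
    return end - start + 1


def codon_count(start: int, end: int) -> int:
    """Number of codons in the ORF, including the assumed stop codon."""
    length = orf_length(start, end)
    if length % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return length // 3


if __name__ == "__main__":
    for name, (start, end, strand) in ORFS.items():
        print(name, strand, orf_length(start, end), "bp,",
              codon_count(start, end), "codons")
    # One codon is assumed to be the stop codon.
    print("TnpR residues:", len(TNPR_PROTEIN))
```

Under these assumptions, the 579 bp tnpR ORF corresponds to 193 codons, which agrees with the 192 residues listed for TnpR plus one stop codon.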
Gene Name | Protein Name | Associated TE | Gene Length (bp) | Coordinates | Strand |
tnpR | TnpR | TnAli20 | 579 | 90-668 | - |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transposase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Protein Sequence:
|
MGDMLGYARV STGDQDVAGQ RLRLEQAGAL KVFTDVCSGK AKDRPGLEAL LAHARRGDTL AIVRLDRLGR SLAELLATVA MLKERGIGLL SLEERIDTTS AAGELVFHVF GAIAHFEQRL IAERTKDGIA PARARGKRPG REPVDMDKIK AALKLVEAGL SPTEAAKQLG LGRSTVYREM SLAGLQRPPY TS
|
Gene Name | Protein Name | Associated TE | Gene Length (bp) | Coordinates | Strand |
tnpA | TnpA | TnAli20 | 2958 | 808-3765 | + |
Class: | Transposase |
Transposase Chemistry: | DDE |
Protein Sequence:
|
VARRQLLTEE ERRLLFGLPT DRDALARHYT FTRSDLDLIA SRRGNANRLG FAVQLALLRY PGLPLPHIGE PIDAVVGWVA EHLELPVTAF AEYARRSQTM TDHARDTVAA LGLRFPREAD LPDLIEAAAQ AAWISDQGMS IMTGTIAALR SAKIVLPSPA VIERAALAGR ARARKRAADA LVADLTAEQR DKLDKLLAVD PATGITSLTW LRTIPTAPKA DHVRDVIDKL HVVRGIGIDA EAQARVHETR FRQFAREGMA SPTYLIERCA PNRRRATLVA LLIDLENRLT DAALDMADKL IGGAFTRAKN NKEKTCVAKT KDVGRLMRLF HRTIEALSLA QESDGDAFAL VNEAVGWPQL LRVRGEVASL AELAEEDPLV RAADRYVTIR KFAPALLEAL TFKAARSKDP ILAAVELLKE LNRSGKRDIP ADAPMLFRKE WRRLVTKDGK PNRRLYETAV LATLRNKLRS GDVWVERSSN YRRFDSYLLP AAAAAPISAD LTLPATAEEW LGALGRDLDE QLKRFAQRLR DGQLEGVELR DERLHIAALK ATAPPEADVL ADRLDALLPR VRITELLHEV NRATGFAAAF TNLRTGESCD NENALLAVIL ADGTNLGLTR MAEASQGVTR DQLIWTADAC IRPETYQSAL ARIIDAHHRL PMAAVWGGGT TSSSDGQFFR SGKRGNVAGE VNARYGGSPG FSFYTHVSDQ HGPYHVRVIS AAAHEAPYVL DGLLHHGTGL KLDTHYVDTG GTSDHVFILA AMLGFRFCPR LRDFPERRLA SIEPSSCYPD LQPLLGRRVK VDVIREHWND VVRLVASLKA GTVAPSTMLK KLAAYERQNQ LDLALQELGR IERTLFMIRW LETPELRRSC HIGLNKGEQR HALAQAICTF KQGRIADRGS QAQQYRASGL NLLIAAIVYW NSTYMADAVG HLRAVGGTVP DDLLVHTSPV GWEHIGLSGD FLWGRAAAVP IGRRPLNLRR DRHAA
|
|
References |
|
|
Wisniewski-Dyé F, Borziak K, Khalsa-Moyers G, Alexandre G, Sukharnikov LO, Wuichet K, Hurst GB, McDonald WH, Robertson JS, Barbe V, Calteau A, Rouy Z, Mangenot S, Prigent-Combaret C, Normand P, Boyer M, Siguier P, Dessaux Y, Elmerich C, Condemine G, Krishnen G, Kennedy I, Paterson AH, González V, Mavingui P, Zhulin IB. Azospirillum genomes reveal transition of bacteria from aquatic to terrestrial environments. PLoS Genet. 2011 Dec;7(12):e1002430. doi: 10.1371/journal.pgen.1002430. Epub 2011 Dec 22. PubMed ID: 22216014