Transposon
Name: TnXc4.1       (Synonyms: Tn7211)
Family: Tn3        Group: Tn3
Evidence of Transposition: no
 Host     

Host Organism:Xanthomonas oryzae pv. oryzicola strain CFBP7331 Molecular Source:chromosome
Place of Origin:Niono, Mali Date of Isolation:2015
Other Geographic Information:strain MAI10

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 38 bp)GGGGTTCGGGGAGCAATGGAACAGGGAAGTCAGTTAAG
IRR (Length: 38 bp)GGGGTCCGGGGAGCAATGGAACAGGGAAGTCAGTTAAG

 Sequence     
DNA SequenceLength  5572 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTTCGGG GAGCAATGGA ACAGGGAAGT CAGTTAAGCC TTCACCAGTG CCGGTCGGTG CCCTCCATGC GCGCGTCGCT CTCCGAGAAG CAGTGCCGGT 100
CGTCCGCACC GCTGGCGATG TCGGTCAGCA ATTCGTTCAC AGCTTCATCC AGTTCTTCGT CGGTGTCATA GGGCACCTTC AGCTCATATT CGCCGTTCGG 200
CCGTCGCGTG GCGTCGTATT GGTCGAGGTA AAAGCTCTCG ACGTGCTCAA TCGTGCGCTT CTTGCCGCGC ACGAACTTGC TGTTGTTCTC GATGCGCAGC 300
GTCAGCAGGA TGGTGGCGAC TTTCGGCGGG GCGTCATTGC CGCTTTCAAG GGCCGACATG CTTGATGGTT GCTGCTCCGT TTTCTTGTAC GGACCGATCT 400
CTACACCGCG ATGTCGCAGG TAGCTGTACA GCGTGCTCTT GGACAGGTGC AGCTTCTGTG CGATGGCGGC GACCGACAGT TTCCGCTCGC GGTACAGGGT 500
CTCTGCCGCC AGCGCCGTCG CTTCGGCCTG CGGCGACAGT CCTGTGGGTC GCCCGCCGAC CCGACCGCGT GCCCGCGCGG CCGTCAGCCC GGCCTGGGTG 600
CGTTCGCGGA TCAGTTCGCG CTCGAACTCG GCCAGCGTAG CGAACAGGTT GAACACAAAC CGCCCCTGGG CGCTGGTGGT GTCGATGGGA TCGTTCAGAC 700
TGAGGAGTCC AACCTTGCGC TCCATCAGGT TGCCGACCAG CTCGACCAGG TGCTTGAGCG AGCGCCCCAT GCGGTCGAGC TTCCAGATCA CCAGCACGTC 800
GCCGGCGCGC AACTGACCCA GTAGCTCATC GAGCGCCGGA CGTGCGGTCT TCGCGCCGCT CGCCACATCA TGATAGATGC GCTCGCAACC GGCCGCCTTG 900
AGAGCGTCCA CCTGCAAGGC CGGGTTCTGT TCTCGCGTGG AGACGCGAGC GTAGCCGATT TTCATCCGAA AATTCCGTTT TACTCGTTGA TTAAGACAAT 1000
ATCAAACTTT GATTATGCAA ACCACTTTTA TAGACTGCTT GTTGGCCTGC GGGCATGTCA TGCCCGTTGC TTCGTTGTAG TTCAGATAAA CGATCGTTTT 1100
ATCGAACCAT TGCGCATGTA ACGAGTAAAG TGTATAAACA CCTATGTGCA CAAACACTGT TCTGGAGGGC ATCATGAATA CCGTGCGTTG GAACATCGCC 1200
GTGTCGCCGG ACGTGGATCA GTCCGTGCGC ATGTTCATTG CCGCCCAAGG CGGCGGCCGC AAGGGCGACC TGTCGCGCTT CATTGAGGAG GCGGTGCGCG 1300
CCTATCTTTT TGAACGGGCG GTCGAGCAAG CCAAGGCCGC GACGGCAGGT ATGGGCGAGG CAGAATTGAA CGACCTCATC GACGAGGCGG TGCAGTGGGC 1400
GCGTGAGCAC TGATGCGGGT CGTCCTTGAC ACCAACGTGC TGCTGGCCGC GCTGATCTCG TCGCACAGCC CTCCCGACAT CATCTATCGC GCCTGGCTTG 1500
CTGGACGTTT TGACCTGGTG ACAGGGGCGG AACAGCTCGA CGAACTGCGC CGCGTGAGCC GTTACCCAAA GATCAGGGCC ATTTTGCCCG CACATCGCAT 1600
CGGCACCATG ATTAACAACA TACAGCGCGC CGTCGTGCTG AACACACTGC CGCCGCTGCC GAACGGCATC GATGCCAACG ATCCGAATGA TGCCTTCCTG 1700
CTGGCGATGT CACTGGCCGG TGAGGCTGAT TACCTCGTCA CTGGCGACCG TCGCGCGGGG CTGCTGCAAC GCGGCAGTAT TGGCCGCACG CGTATCGTCA 1800
CCCCGGCCAC CTTTTGCGCC GAGGCGCTTT GACGCCATGC CGGTCAGCTT CCTGTCCACC ACGCAACGGG AACGCTATGG CCGCTATCCA GAGGCGCTTT 1900
CCAGCGAGGA ACTGGGGCGT TACTTCCACC TGGACGACGA CGACCGCGAG TTGATCGCCA CCAAGCGGCG CGACAGCAGC CGCCTCGGTT ACGCACTGCA 2000
ACTGACGACG GCGCGGTTTC TCGGCACCTT TCTGGAAGAC CCTACCGCCG TGCCAAGCCC GGTGCTGCAT ACGCTGTCGT CGCAACTTGG CATCGCCGAC 2100
CCTTCCGATT GTGTTATCGA CTACCGGACG ACCCGGCAGC GCTGGCAGCA CACGACCGAG ATTCGCGCTC GCTACGGCTA CCGCGAATTC GCCGAACGTG 2200
GCGTGCAGTT CCGCCTTGGC CGCTGGCTGT GCGCGCTGTG CTGGACGGGC ACCGACCGTC CGAGTGCGCT GTTCGACTAC GCCAACGGTT GGCTGGTCGG 2300
CCACAAGGTA CTGCTGCCCG GCGTCACGGT GCTGGAACGC TTTATCGCCG ATATACGCTC GCGCATGGAG TCGCGCCTGT GGCGTTTGCT GGTGCGCGGC 2400
GTGACGGTCG CACAGCGGCA GCGTCTCGAA GACTTGCTCA AGCCTGCCGA AGGCAGCCGC CAGTCCTGGC TGGATCGGCT GCGCAAGGGG CCGGTGCGCG 2500
TCAGCGCTCC GGCGCTTGTG ATGGCCTTGC TGCGCATCGA AACCGTGCGG GATCTGGGCA TCAAACTGCC CGGCACCCAT GTGCCACCAA GCCGGATCGC 2600
GGCACTCGCC CGCTTTGCCA GTACGGTCAA GGTATCCGCC GTGGCCAGGC TGCCGGAGGC GCGGCGCATC GCCACGCTGG TCGCCTTCGT GCATTGCCTG 2700
GAAGCCAGCG CTCAGGACGA TGCCCTTGAT GTGCTCGACC TGCTGCTGCG CGAACTGTTC ACCAAGGCTG AGAAGGAAGA CCGCAAGTTC AGGCAGCGCT 2800
CCCTCAAAGA TCTGGATCGG GCTGCCTCGA CGCTGGCTGA GGCGTGCCGG ATGCTGCTCG ATCCCGGTTT GCCGGACGGC GAACTACGCG AGCGTGTCTA 2900
TGCCGCCATC GGCCGCGATG AACTGGCCCA GGCGCTCAAC GAAGTTCGCG GCCTGGTGCG CCCGCCCAAC GATGTGTTCT ACACCGAACT GGAAGCCAGG 3000
AAGGCCACCG TCTCGCGCTT CCTGCCGACA TTGCTGCGCG TCATCCGCTT CGACGCCAAT CCAGCCGCGC AGCCTTTGGC GCAGGCGTTG AAATGGCTGC 3100
ATGAGAAGCC CGACCATGAT CCGCCCACGG CCATCGTCGG CAAAGCGTGG CAACGCCATG TCGTGCAGGA GGACGGCCGG ATCAATGCCA CGGCCTATTC 3200
TTTCTGCGCG CTCGACAAGC TGCGCAGCGC GATCCGCCGC CGCGACATGT TCATCAGCCC GAGCTGGCGT TACGCCGATC CGCGTGCCGG ACTGCTGGCA 3300
GGAGCCGAGT GGGAGGCCGC ACGGCCCATC GTCTGCCGCT CGCTGAGCCT GACGGCGCAA CCGGAAGCAA CGCTGGCGAC ACTCACGCGC GAACTGGACA 3400
AAACCTACCG GCGCGTCGCG GCTCGCCTGC CCGAGAACGA CGCGGTGCGC TTCGAGACGG TCGGCGACAA GACCGAACTG GTGCTCAGCC CCTTGGAAGC 3500
GTTGGAAGAA CCAACTTCGC TGATCGCGCT GCGCAACGAA ATCAAGGCGC GCATGCCGCG CGTCGATCTG CCGGAAATCC TGCTGGAAGT CGCCGCGCGT 3600
ACTGGCTGCA TGGATGCCTT CACGCACCTG ACCGAGCGCA CGGCGCGTGC GGCCGACCTG ACCACCAGCT TGTGCGCGGT GCTGATGGCT GAAGCCTGCA 3700
ACACCGGCCC GGAACCGCTG GTGCGGCAGG ACACCCCGGC GCTCAAACGC GACCGGCTGA TGTGGGTCGA TCAGAACTAT GTGCGTGATG ACACGCTGGT 3800
TGCCTGCAAC GCCGTGCTGG TGGCGGCGCA AAACCGCATC GCATTGGCGC GCACCTGGGG CGGCGGTGAC GTGGCCTCCG CCGACGGCAT GCGCTTTGTG 3900
GTGCCGGTAC GGACCATCCA CGCCGCGCCG AACCCGAAAT ACTTCAATCG CGGGCGTGGC GTCACCTGGT ACAACCTGCT GTCCGATCAA TGTACTGGGC 4000
TGAACACGAT CACCGTGCCC GGCACGCTGC GCGACAGCCT GGTCTTGCTG GCGGTCGTGC TGGAGCAGCA GACCGAGTTG CAGCCGACAC AGATCATGAC 4100
CGACACCGGT GCGTACAGCG ATTTGGTGTT TGGCCTGTTC AGGCTCTCCA ACTACCGCTT CTGCCCGCGC CTGGCCGATG TCGGCGGCAC ACGCTTCTGG 4200
CGTGTCGATC CCGACGCTGA CTATGGCGAG CTCAACGCGC TCGCCCGGCA GCGTGTGAAC CTCGACCGCA TCACGCCGCA TTGGGATGAC GTGCTGCGCC 4300
TGGTCGGCTC GCTCAAGCTC GGCCTGGTAC CGGCGATGAG CATCATGCGC ACCTTGCAGG TCGATGAACG GCCGACCAGC CTAGCGCAGG CCATCGCCGA 4400
AATCGGTCGC ATCGACAAGA CCATCCACAC GCTGAACTTC ATCGACGACG AGGCCCGCCG CCGCGCCACG CTTCTGCAAT TGAACCTCGG CGAAGGCCGC 4500
CACAGTTTGG CGCGCGAGGT TTTTCACGGC AAGCGCGGCG AACTGTTCCA GCGCTACCGC GAAGGACAGG AAGACCAGTT GAGCGCGCTC GGCCTGGTTG 4600
TGAACATGAT CGTGCTGTGG AACACGCTGT ACATGGACGC GGTACTGGCG CAGTTGCGCA GCGAGGGCTA CCCGATCCGC CCCGAAGACG AGGCGCGGTT 4700
GTCGGCGTTC GTCCACGAGC ACATCAATAT GCTCGGACGC TACTCGTTCT CGGTGCCTGA AGCAGTCGCG CGTGGCGAAC TGAGACCGTT GACCAAACAA 4800
AATGAACCTT AAAAACCATG GAAATTCACT ATGCAACAGA TGACGCAACA ATCCTTGAAC GATGCCGAAC TTGATCGGTT GGGCGACTTT CTCGAAGGAG 4900
TCGGCGCACC TGCAATGAAT CTCGAAATGC TCGATGGGTT CTTTGCCGCA CTCATTTGCG GCCCTGAAAC GGTATTGCCC AGCGAATACT TGCCACAGGT 5000
ACTTGGGGAA GGCCATTGCT TCGACAGCAA TGACCAAGCC GCGGAGATTC TTGGCTTGGT CATGCGGCAT TGGAACACGA TTGCATCAGA ACTGTTCCGC 5100
ACTCTGGAGA AAGACGATGT CTACCTCCCC GTGCTGCTCG AAGATGCGGA TGGGGCCGTA CACGGCAATG ACTGGGCACG TGGTTTCATG CGCGGCATTC 5200
AATTACGGCC CAATAGTTGG CAAGAGCTGA TCGGCAGCGA CGAGTTTGGC GGGCCAATGC TGCCAATTAT GATTTTGACC TATGAACATG ATCCTGATCC 5300
CGCCATGCGC CCGCCAGAGA TTGCGCCGGA CAAACGCGAT GAGTTGCTTC AGTCCCTGAT TGCCGGACTT ACACACATCT ATCGCTACTT CGCGTCACAT 5400
CGCCAATTGG CAACCCACGT GCCTTTACGC AGGCAAGGCC CTAAAGTTGG GAGAAACGAT CAATGTCCAT GTGGCAGTGG GCGGAAGTAC AAGCATTGCT 5500
GTGCTACCGG CGGGCCAATA TTTCATTGAT ATCGGCTTAA CTGACTTCCC TGTTCCATTG CTCCCCGGAC CCC

 Recombination Sites     

Name Coordinates Gene Sequence
res 972-1107 136 ATTCCGTTTT ACTCGTTGAT TAAGACAATA TCAAACTTTG ATTATGCAAA CCACTTTTAT
AGACTGCTTG TTGGCCTGCG GGCATGTCAT GCCCGTTGCT TCGTTGTAGT TCAGATAAAC
GATCGTTTTA TCGAAC
res_site_III 972-1000 29 ATTCCGTTTT ACTCGTTGAT TAAGACAAT
res_site_II 1004-1036 33 AAACTTTGAT TATGCAAACC ACTTTTATAG ACT
res_site_I 1079-1111 33 AGTTCAGATA AACGATCGTT TTATCGAACC ATT

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnpR TnXc4.1 42-965 Accessory Gene Resolvase -
RHH_6 TnXc4.1 1174-1413 Passenger Gene Antitoxin +
PIN_3 TnXc4.1 1413-1832 Passenger Gene Toxin +
tnpA TnXc4.1 1837-4812 Transposase   +
secA TnXc4.1 4831-5529 Passenger Gene Plant Pathogenicity +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpR TnpR TnXc4.1 924 42-965 -
Class:   Accessory Gene
Sub Class:   Resolvase
Transpoase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MKIGYARVST REQNPALQVD ALKAAGCERI YHDVASGAKT ARPALDELLG QLRAGDVLVI WKLDRMGRSL KHLVELVGNL MERKVGLLSL NDPIDTTSAQ
GRFVFNLFAT LAEFERELIR ERTQAGLTAA RARGRVGGRP TGLSPQAEAT ALAAETLYRE RKLSVAAIAQ KLHLSKSTLY SYLRHRGVEI GPYKKTEQQP
SSMSALESGN DAPPKVATIL LTLRIENNSK FVRGKKRTIE HVESFYLDQY DATRRPNGEY ELKVPYDTDE ELDEAVNELL TDIASGADDR HCFSESDARM
EGTDRHW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
RHH_6 RHH_6 TnXc4.1 240 1174-1413 +
Class:   Passenger Gene
Sub Class:   Antitoxin
Sequence Family:  RHH_6 (Pfam:PF16762)
Protein Sequence:  
MNTVRWNIAV SPDVDQSVRM FIAAQGGGRK GDLSRFIEEA VRAYLFERAV EQAKAATAGM GEAELNDLID EAVQWAREH

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
PIN_3 PIN_3 TnXc4.1 420 1413-1832 +
Class:   Passenger Gene
Sub Class:   Toxin
Target:   single stranded RNA
Sequence Family:  PIN_3 (Pfam:PF13470)
Comment:   tRNA(fMet)-specific endonuclease
Protein Sequence:  
MRVVLDTNVL LAALISSHSP PDIIYRAWLA GRFDLVTGAE QLDELRRVSR YPKIRAILPA HRIGTMINNI QRAVVLNTLP PLPNGIDAND PNDAFLLAMS
LAGEADYLVT GDRRAGLLQR GSIGRTRIVT PATFCAEAL

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnpA TnpA TnXc4.1 2976 1837-4812 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MPVSFLSTTQ RERYGRYPEA LSSEELGRYF HLDDDDRELI ATKRRDSSRL GYALQLTTAR FLGTFLEDPT AVPSPVLHTL SSQLGIADPS DCVIDYRTTR
QRWQHTTEIR ARYGYREFAE RGVQFRLGRW LCALCWTGTD RPSALFDYAN GWLVGHKVLL PGVTVLERFI ADIRSRMESR LWRLLVRGVT VAQRQRLEDL
LKPAEGSRQS WLDRLRKGPV RVSAPALVMA LLRIETVRDL GIKLPGTHVP PSRIAALARF ASTVKVSAVA RLPEARRIAT LVAFVHCLEA SAQDDALDVL
DLLLRELFTK AEKEDRKFRQ RSLKDLDRAA STLAEACRML LDPGLPDGEL RERVYAAIGR DELAQALNEV RGLVRPPNDV FYTELEARKA TVSRFLPTLL
RVIRFDANPA AQPLAQALKW LHEKPDHDPP TAIVGKAWQR HVVQEDGRIN ATAYSFCALD KLRSAIRRRD MFISPSWRYA DPRAGLLAGA EWEAARPIVC
RSLSLTAQPE ATLATLTREL DKTYRRVAAR LPENDAVRFE TVGDKTELVL SPLEALEEPT SLIALRNEIK ARMPRVDLPE ILLEVAARTG CMDAFTHLTE
RTARAADLTT SLCAVLMAEA CNTGPEPLVR QDTPALKRDR LMWVDQNYVR DDTLVACNAV LVAAQNRIAL ARTWGGGDVA SADGMRFVVP VRTIHAAPNP
KYFNRGRGVT WYNLLSDQCT GLNTITVPGT LRDSLVLLAV VLEQQTELQP TQIMTDTGAY SDLVFGLFRL SNYRFCPRLA DVGGTRFWRV DPDADYGELN
ALARQRVNLD RITPHWDDVL RLVGSLKLGL VPAMSIMRTL QVDERPTSLA QAIAEIGRID KTIHTLNFID DEARRRATLL QLNLGEGRHS LAREVFHGKR
GELFQRYREG QEDQLSALGL VVNMIVLWNT LYMDAVLAQL RSEGYPIRPE DEARLSAFVH EHINMLGRYS FSVPEAVARG ELRPLTKQNE P

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
secA SecA TnXc4.1 699 4831-5529 +
Class:   Passenger Gene
Sub Class:   Plant Pathogenicity
Comment:   contains 2 target sites for ISXac1 insertions
Protein Sequence:  
MQQMTQQSLN DAELDRLGDF LEGVGAPAMN LEMLDGFFAA LICGPETVLP SEYLPQVLGE GHCFDSNDQA AEILGLVMRH WNTIASELFR TLEKDDVYLP
VLLEDADGAV HGNDWARGFM RGIQLRPNSW QELIGSDEFG GPMLPIMILT YEHDPDPAMR PPEIAPDKRD ELLQSLIAGL THIYRYFASH RQLATHVPLR
RQGPKVGRND QCPCGSGRKY KHCCATGGPI FH