Transposon
Name: TnEc1       (Synonyms: Tn7158)
Family: Tn3        Group: Tn21
Evidence of Transposition: no
 Host     

Host Organism: Escherichia coli       Molecular Source: plasmid pMCR-11EC-P293
Place of Origin: Hong Kong            Date of Isolation: 2016
Other Geographic Information: Pig feces 2011

 Terminal Inverted Repeats (IR)     

IRL (Length: 50 bp)  GGGGTCAGTTTGGATATCGAAAATTATTGTACGTTAAGGCTACTTTTTAA
IRR (Length: 50 bp)  GGGGTCAGTTTGGATATCGAAAATTATTGTACGTTAAGGTTGTTTTTTGA
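
The two repeats can be compared position by position to see how closely they match. The Python sketch below is illustrative only: the helper names `reverse_complement` and `mismatch_positions` are hypothetical, and it assumes that both repeats are written 5' to 3' reading inward from their own end of the element, as they appear in this record.

```python
# Terminal inverted repeats copied from the record above.
IRL = "GGGGTCAGTTTGGATATCGAAAATTATTGTACGTTAAGGCTACTTTTTAA"
IRR = "GGGGTCAGTTTGGATATCGAAAATTATTGTACGTTAAGGTTGTTTTTTGA"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def mismatch_positions(a, b):
    """Return the 1-based positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


mismatches = mismatch_positions(IRL, IRR)
identity = 1 - len(mismatches) / len(IRL)

# IRR as it reads on the top strand, i.e. at the right-hand end of the element.
irr_on_top_strand = reverse_complement(IRR)

print(f"mismatches at positions {mismatches}, identity {identity:.0%}")
print(f"IRR on top strand: {irr_on_top_strand}")
```

For the two repeats in this record, the comparison reports four mismatches, at positions 40, 42, 43 and 49, so 46 of 50 positions are identical.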

 Sequence     
DNA Sequence Length: 3854 bp
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
GGGGTCAGTT TGGATATCGA AAATTATTGT ACGTTAAGGC TACTTTTTAA TCAGTCGTTG GTTCTCGATA AAGTGCCTGA ATACGGCAAA CCTGCGAAGT 100
GCTGTAGCCC GTAGCATCTG CTGTTTCTCT AATGCTGAGT TTTTTTACTT GTCTGTAGTA CAACACTTTC TGATGCCGTT CCTGATCGGC TTGTTTTCCC 200
CGGTATTTAC CCAACGTATG AGCTCGTTCA ATTCCCTGTT TTTGGCGTTG GCGACGACTC AGCCAGTCCT TATGCGACAT CGCGGCCATC AGGTCTATAA 300
GCATATTATT TATTGAAGTT ATCACTGCGC GGGTAATCGG ATCTGATTGT GAAGGATCTT TATCTGAAAG GGCTTGCCAT GATGTTGGAA CATCCAGACT 400
GACAATTCGC AATTCATGTT GTTCAATTTG CTTTTTCAGC GTTATCCAGT CGCTGTTGCT AAGACGGGTC AGGCGGTCTA TCTGCTCAAC CAGTAAAATA 500
TCATTACGGT GACTGTCCAT CAGTAATCGC CCCAGTTCCG GACGGTCAAG TTTTGTACCG CTTATGTTCT CCCGATAATA ACTGGCAATT TTATGCCCCC 600
GCTCCTGGAC GAACTGTTCC AGCATCTCTT TTGCCCGATC TGCAAACTGA TCTTCTGTTG ACGCCCTTAA ATAAGCTCTG ATGAACATTT TTTCACCGCA 700
TTATCGCATT TAGGTTGTTG CATATATTCT ATCGCATATA GGTTATGTCA TTAATTGTAT GAGGTCACAT TTTATATCTC GCATCAGGGT GTACCCTTAT 800
GCGATAACCT GTTTCGTTAA AAGTACTATG TGAGGTATGA ATATGGCACG GAGGCAGATT CTTTCGCAGT CGGAAAGAGA ATCTCTACTG GCATTGCCGG 900
ATGATGAATT GACCCTGACC CGAATGGCAT ATTTCAGCGA ACATGACCTG GCTCTTATTA ATGCTCATCG CAAACCAGCC AGTCGTTTTG GATTTGCAGT 1000
TCTTCTTTGC TATCTGAAAA ATGTCGGTTT TGCTCCTGAT AAGAAAACGC CCCCCTCTGA TGCATTGTTA AAACCCATAG CCAACAGGTT AAAACTTGCC 1100
GATGGGCTAT GGTCCTCTTA TATCAACGGC AGGGAGACTA CTCGACGCGA GCATTTAACC GAGTTGTACC GTTATCTTGG TGTAAAAGCA TTCACCAACA 1200
AAATACAGCA GAGTTGTATT ACACATCTGT TGCCAATGGC GACTCGTACT GACAAAGGCA TTCTCCTGGC GGAAGAAATG CTGGCCTACC TTCGGCAAAA 1300
CAAGGTTATG TTTCCTGCGA TTGATGTTGT GGAACGTACC TGTGCGGAAG CTATGGCTGG CGGCGATAAA ATAGTCTTTC AGACGCTAAA TGCGCCCTTA 1400
ACCTCTGCTC ACAGGGAAGC TTTGGATCGT TTGCTTGAAT CATCAGACAA TCAACCTTCC CGGCTGACCT GGCTTCTGCA ACCGCCAGGT AAAATCAACG 1500
GTAAGAATGT ACTGCAACAC ATTGATCGGC TGAACAGTAT TGAATCACTG GCATTGCCGG AAGGTATTGA TCGTACCATT CATCAAAACA GGTTACTGAA 1600
ACTTGCGCGA GAAGGGCGAA AGATGAGCAG TAGGGACCTG ACTCGATTTT CAGCAGCCCG ACGTCATGCA ATTCTGGTGT GTGTCCTAGA AGATGCCAGA 1700
GCTACGCTGA CAGATGAAAT TATTGAGTTG CATGAACGAA TATTGAACAG TCTGTTCAAC AAGGCCAAAA AAACACAGGC TGAACGTCTT CAACAAACAG 1800
GAAGGTTAAT CCAGTTAAAA TTAAAGCAGT ATATTGATAT CGGACAGGCT TTATCTGAGG CCCATGATTC CGGTGAAGAT CCATGGCTTG CTATAGAGAA 1900
AATATTGCCA TGGCCTGAAT TTATTGCCAG CCTGGAAGAA ACTCGTCATC TTGCCAGAAA GAATAATTTT GACCCGCTAC ATATTATTAC CGAAAAATAC 2000
AGCACATTAC GGAAATATGC TCCGCGTATG TTATCTACCT TGCAATTAGT TGCCACACCG GCAGCACAGG CGCTGGCTGA TGCGTTGCTT GTAATCAAAG 2100
ATATGTACAA GAAACAGTTA CGAAAAGTCC CGGCGACAGC TCCGCTTGAG TTTGTTCCTG AAAGCTGGCG AAAAGTGGTG ATAACTCCCA CTGGTATTGA 2200
CCGACAATAC TATGAATTTT GTGCACTTAG TGAGCTGAAA GGGGCGTTAC GTTCCGGGGA TATCTGGGTT AAAGGCTCAC GACGTTACAA AAATTTCGAC 2300
GATTACCTTA TTCCAGAGAA AGACTTTGAT AAGCTCTCAC CTGCATTACC ACTACCTGTT TCTGCTGATT ATCATGAATA TATTACAAAT CGAATGATAC 2400
TGCTTCAGTC AAAACTGGAA GAGGTCAACA AAATGGCCAC ACACGGTGAA TTGCCGGATG TGGAAATATC TGACAAAGGA GTAAAAATCT CCCCTCTGGA 2500
TAATTGTGTT CCTTTGCAGG TCTCACCGCT TTCAGAGTTA ATTTACAGTA TGTTACCGCG TCCTAAAATC ACAGAAATTC TCGACGAGGT AAACAGCTGG 2600
ACAGCGTTTA CCCGACATTT TTCACATATA AAAAATGATA TAACTCGTCC CGATACACGC CTGCTTCTTA CCACAATTCT TGCTGATGGC ATCAATCTTG 2700
GCCTGACGAA AATGGCAGAG GCCTGTCCCG GATGCACTAA ATCATCACTC GAAGATATTC AGGCCTGGTA TATCCGTGAT GAAACTTATT CCGCAGCCCT 2800
TGCAGAGCTG GTGAATGCTC AGGGAAAAAG ACCTCTGGCA GCATTCTGGG GAGACGGAAC AACTTCATCA TCAGATGGGC AGAACTTCAG AACTGGTAAT 2900
TCTGGTCGTT ATGCTGGGCA GATTAACCCG AAATATGGGC AAGAGCCTGG ATGCCAGTTT TATACACATA TATCAGATCA ATACAGTCCG TTTTACACCT 3000
GCATAATCAG TCGGGTCAGA GATTCAACAC ATGTGCTTGA TGGATTATTG TACCACGAAA GCGACCTGGA TATCAGGGAA CATTACACAG ATACAGCGGG 3100
TTTCACCGAT CATGTTTTTG CTCTGATGCA TCTGTTGGGT TTTGCATTTT GTCCCCGGAT TCGGGACCTT CACGATAAAA AACTTTTCAT TAAAGGTAAA 3200
GCGGATCAAT ATCCGGCACT TCATTCTCTG ATATCTCCGA CCCGCATTAA TCTGAAAGAA ATAGAAATTC ACTGGCGTGA AGTGCTACGT CTGGCGACCT 3300
CAATAAAACA GGGAACAGTG ACAGCATCCC TGATGCTGAG AAAACTTGCC AGTTACCCCA AACAAAATGG GTTGGCTAAA GCTCTGAGAG AGATTGGCCG 3400
TATAGAACGA ACACTGTTTA TGCTGGACTG GTTTCGGGAT CCGGCATTGC GGCGACGTGT CCAGGCCGGG CTGAACAAAG GAGAAGCCCG TAATGCTCTG 3500
GCCCGTGCTG TATTCATGCA TCGATTGGGG GAGATTAGAG ATCGGAAACC GGAAAATCAA AGTTATCGCG CCAGTGGATT GACTTTATTG ACGGCGGCTA 3600
TTTCCTTATG GAACACAGTT TACATGGAAA GAGCGGTTGA TGCTCTGAAG CGTAAGGGCA TGAAAATAAA TGCTCAACTA TTATCGCATT TATCACCATT 3700
AGGTTGGGAA CATATCAATC TGACAGGCGA TTATATCTGG AAAAACAACC GGATACCGAC TTCAGGTAAG TTTCGTCGTT TGAGATCAGT TAAGATCGAC 3800
AAGGTCAAAA AACAACCTTA ACGTACAATA ATTTTCGATA TCCAAACTGA CCCC
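
The sequence is laid out in groups of 10 nucleotides with a running count at the end of each full row, so it has to be cleaned before it can be used. The following Python sketch shows one way to do that, under the assumption that every token made only of A, C, G and T is sequence and everything else (ruler marks, counts) can be dropped. The function name `parse_numbered_block` is hypothetical, and only the first and last rows of the record are included to keep the example short.

```python
import re

RULER = ("--------10 --------20 --------30 --------40 --------50 "
         "--------60 --------70 --------80 --------90 -------100")
FIRST_LINE = ("GGGGTCAGTT TGGATATCGA AAATTATTGT ACGTTAAGGC TACTTTTTAA "
              "TCAGTCGTTG GTTCTCGATA AAGTGCCTGA ATACGGCAAA CCTGCGAAGT 100")
LAST_LINE = "AAGGTCAAAA AACAACCTTA ACGTACAATA ATTTTCGATA TCCAAACTGA CCCC"


def parse_numbered_block(lines):
    """Join the nucleotide groups of a numbered block into one string.

    Tokens that are not pure A/C/G/T (ruler marks, running counts) are skipped.
    """
    groups = []
    for line in lines:
        groups.extend(tok for tok in line.split() if re.fullmatch(r"[ACGT]+", tok))
    return "".join(groups)


first_row = parse_numbered_block([RULER, FIRST_LINE])
last_row = parse_numbered_block([LAST_LINE])

# The record shows 38 full rows of 100 nt followed by one partial row.
full_rows = 38
total_length = full_rows * len(first_row) + len(last_row)
print(total_length)
```

With the 38 full rows and the final partial row shown in this record, the total agrees with the stated length of 3854 bp.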

 ORFs     
ORF Summary
Gene Name  Associated TE  Coordinates  Class           Sub Class  Orientation
tnpR       TnEc1          47-688       Accessory Gene  Resolvase  -
tnpA       TnEc1          837-3821     Transposase                +
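
The coordinates in the summary are 1-based and inclusive, which is what makes them agree with the gene lengths listed under ORF Details. A minimal Python sketch of that arithmetic follows. The function names are hypothetical, and the protein-length calculation assumes that each coordinate range covers a complete ORF including a single stop codon.

```python
def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(start, end):
    """Residue count, assuming the range is a whole ORF ending in one stop codon."""
    length = gene_length(start, end)
    if length % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return length // 3 - 1


# (name, start, end, orientation) as listed in the ORF summary.
orfs = [("tnpR", 47, 688, "-"), ("tnpA", 837, 3821, "+")]

for name, start, end, strand in orfs:
    print(name, strand, gene_length(start, end), expected_protein_length(start, end))
```

For this record the calculation gives 642 bp and 213 residues for tnpR, and 2985 bp and 994 residues for tnpA.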

ORF Details
Gene Name  Protein Name  Associated TE  Gene Length (bp)  Coordinates  Strand
tnpR       TnpR          TnEc1          642               47-688       -
Class:   Accessory Gene
Sub Class:   Resolvase
Transposase Chemistry:   Serine
Sequence Family:  Serine Site-Specific Recombinase
Protein Sequence:  
MFIRAYLRAS TEDQFADRAK EMLEQFVQER GHKIASYYRE NISGTKLDRP ELGRLLMDSH RNDILLVEQI DRLTRLSNSD WITLKKQIEQ HELRIVSLDV
PTSWQALSDK DPSQSDPITR AVITSINNML IDLMAAMSHK DWLSRRQRQK QGIERAHTLG KYRGKQADQE RHQKVLYYRQ VKKLSIRETA DATGYSTSQV
CRIQALYREP TTD

Gene Name  Protein Name  Associated TE  Gene Length (bp)  Coordinates  Strand
tnpA       TnpA          TnEc1          2985              837-3821     +
Class:   Transposase
Transposase Chemistry:   DDE
Protein Sequence:  
MNMARRQILS QSERESLLAL PDDELTLTRM AYFSEHDLAL INAHRKPASR FGFAVLLCYL KNVGFAPDKK TPPSDALLKP IANRLKLADG LWSSYINGRE
TTRREHLTEL YRYLGVKAFT NKIQQSCITH LLPMATRTDK GILLAEEMLA YLRQNKVMFP AIDVVERTCA EAMAGGDKIV FQTLNAPLTS AHREALDRLL
ESSDNQPSRL TWLLQPPGKI NGKNVLQHID RLNSIESLAL PEGIDRTIHQ NRLLKLAREG RKMSSRDLTR FSAARRHAIL VCVLEDARAT LTDEIIELHE
RILNSLFNKA KKTQAERLQQ TGRLIQLKLK QYIDIGQALS EAHDSGEDPW LAIEKILPWP EFIASLEETR HLARKNNFDP LHIITEKYST LRKYAPRMLS
TLQLVATPAA QALADALLVI KDMYKKQLRK VPATAPLEFV PESWRKVVIT PTGIDRQYYE FCALSELKGA LRSGDIWVKG SRRYKNFDDY LIPEKDFDKL
SPALPLPVSA DYHEYITNRM ILLQSKLEEV NKMATHGELP DVEISDKGVK ISPLDNCVPL QVSPLSELIY SMLPRPKITE ILDEVNSWTA FTRHFSHIKN
DITRPDTRLL LTTILADGIN LGLTKMAEAC PGCTKSSLED IQAWYIRDET YSAALAELVN AQGKRPLAAF WGDGTTSSSD GQNFRTGNSG RYAGQINPKY
GQEPGCQFYT HISDQYSPFY TCIISRVRDS THVLDGLLYH ESDLDIREHY TDTAGFTDHV FALMHLLGFA FCPRIRDLHD KKLFIKGKAD QYPALHSLIS
PTRINLKEIE IHWREVLRLA TSIKQGTVTA SLMLRKLASY PKQNGLAKAL REIGRIERTL FMLDWFRDPA LRRRVQAGLN KGEARNALAR AVFMHRLGEI
RDRKPENQSY RASGLTLLTA AISLWNTVYM ERAVDALKRK GMKINAQLLS HLSPLGWEHI NLTGDYIWKN NRIPTSGKFR RLRSVKIDKV KKQP
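
The protein sequences above can be related back to the DNA sequence through the ORF coordinates and strand. The Python sketch below translates the first eight codons of each gene. It is an illustration under stated assumptions: the standard genetic code is used, the helper names `orf_dna` and `translate` are hypothetical, and only the two 100-nt rows of the sequence that contain the start codons (positions 601-700 and 801-900) are included, with an `offset` argument standing in for the bases that precede each row.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Rows copied from the DNA sequence block; spaces are stripped below.
ROW_601_700 = ("GCTCCTGGAC GAACTGTTCC AGCATCTCTT TTGCCCGATC TGCAAACTGA "
               "TCTTCTGTTG ACGCCCTTAA ATAAGCTCTG ATGAACATTT TTTCACCGCA").replace(" ", "")
ROW_801_900 = ("GCGATAACCT GTTTCGTTAA AAGTACTATG TGAGGTATGA ATATGGCACG "
               "GAGGCAGATT CTTTCGCAGT CGGAAAGAGA ATCTCTACTG GCATTGCCGG").replace(" ", "")


def orf_dna(seq, start, end, strand, offset=0):
    """Return coding-strand DNA for 1-based inclusive coordinates.

    `offset` is the number of bases of the full element that precede `seq`.
    """
    sub = seq[start - 1 - offset:end - offset]
    return sub if strand == "+" else sub.translate(_COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons of an uppercase DNA string."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# tnpA is on the + strand and starts at 837; take its first 24 nt (837-860).
tnpa_start = translate(orf_dna(ROW_801_900, 837, 860, "+", offset=800))
# tnpR is on the - strand, so its first codon ends at 688; take 665-688.
tnpr_start = translate(orf_dna(ROW_601_700, 665, 688, "-", offset=600))

print(tnpa_start, tnpr_start)
```

The two translated fragments, MNMARRQI and MFIRAYLR, match the first eight residues of the TnpA and TnpR protein sequences listed in this record.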