Internal Transposable Elements |
Name: TnpHS87a (Synonyms: Tn7223) |
|
Family: Tn402 |
|
Evidence of Transposition: Yes |
|
|
Host |
|
|
Host Organism: | Pseudomonas aeruginosa strain HS87 | Molecular Source: | plasmid pHS87a |
Place of Origin: | Shanghai, China | Date of Isolation: | 2016 |
| | Other Geographic Information: | sputum of an inpatient |
|
Terminal Inverted Repeats (IR) |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GACACCCCAC GGATTCCAGA ACAAGGCGTG GCCACTCTGC CTGATGAGGC 200
TTGGGAGCGT GCGCGCCGTC GTGCGGAGAT CATCAGTCCG TTGGCGCAGT CGGAGACGGT CGGGCACGAA GCGGCCGATA TGGCGGCTCA GGCGCTGGGC 300
TTGTCTCGGC GCCAGGTATA CGTTCTGATC CGGCGTGCCC GGCAAGGCAG CGGCCTCGTG ACGGATCTGG TGCCCGGCCA GTCCGGTGGA GGTAAAGGTA 400
AGGGGCGCTT GCCGGAACCG GTCGAGCGCG TCATCCACGA GCTACTGCAA AAGCGGTTCC TGACCAAGCA GAAGCGCAGC CTAGCGGCCT TTCACCGCGA 500
AGTCACTCAG GTGTGCAAGG CTCAAAAACT GCGAGTGCCG GCGCGCAATA CCGTGGCCTT ACGGATCGCT AGCCTTGACC CGCGCAAGGT CATCCGCCGG 600
CGGGAAGGCC AGGATGCCGC TCGTGACCTA CAAGGTGTGG GCGGCGAGCC TCCTGCCGTG ACCGCGCCGC TGGAGCAGGT GCAGATAGAC CATACGGTCA 700
TCGACCTGAT CGTGGTCGAT GACCGCGACC GGCAACCTAT TGGCCGCCCG TACCTGACCC TCGCCATCGA CGTGTTCACC CGCTGCGTGC TCGGCATGGT 800
CGTCACGCTG GAAGCGCCGT CTGCCGTTTC GGTTGGCCTG TGCCTCGTGC ATGTCGCCTG CGACAAGCGC CCTTGGCTGG AAGGACTGAA CGTGGAAATG 900
GATTGGCAGA TGAGCGGCAA GCCCTTGCTG CTCTACCTAG ACAACGCGGC CGAGTTCAAG AGCGAGGCCC TGCGCCGGGG TTGCGAGCAG CATGGCATCC 1000
GGCTGGACTA TCGCCCGCTG GGACAGCCGC ACTATGGCGG CATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATT CACGACGAAC TGCCGGGAAC 1100
GACCTTCTCC AACCCTGACC AGCGCGGCGA CTACGATTCC GAAAACAAGG CCGCCCTGAC GCTGCGCGAG CTAGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTACCACG GTTCGGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAG GCCGTGGCGC GTGTCGGCGT ACCGGCCGTC GTCACACGCG 1300
CTACTTCGTT CCTGGTCGAT TTTCTGCCGA TCCTCCGGCG CACGCTGACC CGCACCGGCT TTGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATT GCGCGGCGTG AACGCTGGCC GTCCTTTCTG ATCCGGCGCG ATCCGCGCGA CATCAGCCGT ATCTGGGTCC TGGAACCGGA GGGACAGCAT 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CATCCGGCTG TCACCCTCTG GGAACAACGG CAGGCGCTGG CGAAACTGCG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC GGCGCTGTTC CGCATGATCG GCCAGATGCG TGAGATTGTG ACCAGCGCGC AGAAGGCCAC ACGCAAGGCG CGGCGTGACG CGGATCGCCG 1700
CCAGCACCTC AAGACATCAG CTCGGCCGGA CAAGCCCGTT CCGCCGGATA CGGATATTGC CGACCCGCAG GCAGACAACT TGCCACCCGC CAAACCGTTC 1800
GACCAGATTG AGGAGTGGTA GCCGTGGACG AATATCCCAT CATCGACCTG TCCCACCTGC TGCCGGCGGC CCAGGGCTTG GCCCGTCTTC CGGCGGACGA 1900
GCGCATCCAG CGCCTTCGCG CCGACCGCTG GATCGGCTAT CCGCGCGCAG TCGAGGCGCT GAACCGGCTG GAAGCCCTTT ATGCGTGGCC AAACAAGCAA 2000
CGCATGCCCA ACCTGCTGCT GGTTGGCCCG ACCAACAATG GCAAGTCGAT GATCGTCGAG AAGTTCCGCC GCACCCACCC GGCCAGCTCC GACGCCGACC 2100
AGGAGCACAT CCCGGTGTTG GTCGTGCAGA TGCCGTCCGA GCCGTCCGTG ATCCGCTTCT ACGTCGCGCT GCTCGCCGCG ATGGGCGCGC CGCTGCGCCC 2200
ACGCCCACGG TTGCCGGAAA TGGAGCAACT GGCTCTGGCA CTGCTGCGCA AGGTCGGCGT GCGCATGCTG GTGATCGACG AGCTGCACAA CGTGCTGGCC 2300
GGCAACAGCG TCAACCGCCG GGAATTCCTC AACCTGCTGC GCTTCCTCGG CAACGAACTG CGCATCCCGT TGGTTGGGGT AGGCACGCGC GACGCCTACC 2400
TAGCCATCCG CTCCGATGAC CAGTTGGAAA ATCGCTTCGA GCCGATGATG CTGCCGGTAT GGGAGGCCAA CGACGATTGC TGCTCACTGC TGGCCAGCTT 2500
CGCCGCTTCG CTCCCGCTGC GCCGGCCTTC CCCAATTGCC ACGCTGGACA TGGCTCGCTA CCTGCTCACA CGCAGCGAGG GCACCATAGG GGAACTGGCG 2600
CACTTGCTGA TGGCGGCGGC CATCGTCGCC GTGGAGAGCG GCGAGGAAGC GATCAACCAT CGCACACTCA GCATGGCCGT TTACACCGGA CCCAGCGAGC 2700
GGCGGCGGCA ATTCGAGCGG GAACTGATGT GAAGCCTGCG CCGCGCTGGC CGCTGCATCC CGCCCCGAAA GAAGGCGAGG CGCTGTCCTC ATGGCTCAAC 2800
CGCGTGGCCC TTTGCTATCA CATGGAGGAG CCCGACCTGC TGGAGCACGA TCTTGGTCAC GGCCAGGTCG ATGACCTGGA CACCGCGCCA CCACTCTCGC 2900
TGCTGGCGTT GCTTTCCCAG CGGAGCGGCA TCGAGCTGGA CCGGCTGCGC TGTATGAGTT TCGCCGGATG GGTGCCTTGG CTACTGGACA GCCTTGATGA 3000
CCAGATTCCA GACGCCTTGG AAACCTATGC GTTTCAGCTC TCGGTGTTGC TGCCAAGACT CCGCCGTAAG ACGCGATCCA TCACGAGCTG GCGTGCCTGG 3100
CTGCCCAGCC AGCCGATAAA CCGCGCCTGT CCGCTCTGCC TGAGCGATCC GGAGAACCAA GCCGTACTGC TCGCGTGGAA GCTGCCCCTG ATGCTGAGCT 3200
GCCCGCTGCA TGGCTGCTGG CTGGAATCCT ATTGGGGCGT GCCAGGGCGG TTTCTCGGCT GGGAGAACGC CGACGCCGAA CCGCGCACCG CCAGCGACGC 3300
GATTGCGGCG ATGGACCAGC GTACCTGGCA GGCACTGACA ACCGGTCACG TGGAGCTGCC GCGCCGACGC ATCCACGCCG GATTGTGGTT TCGACTTCTT 3400
CGCACGCTGC TCGATGAGCT GAACACCCCG CTTTCCGCGT GCGGAACCTG CGCGGGGTAT CCCCGCCAAG TCTGGGAAGG CTGCGGGCAT CCGCTGCGTG 3500
CTGGGCAAAG TCTGTGGCGA CCGTATGAAA CCCTGAATCC GATAGTACGG TTACAGATGC TGGAGGCGGC GGCAACGGCA ATCAGCTTGA TTGAGGTGAG 3600
GGACATCAGC CCGCCAGGCG AGCAGGCAAA GCTATTCTGG TCCGAGCCCC AAACCGGGTT CACCAGTGGC CTGCCGACGA AAGCGCCGAA GCCCGAGCCC 3700
ATCAATCACT GGCAGCGTGC AGTCCAGGCC ATCGACGAGG CCATCATTGA AGCGCGACAC AACCCCGAGA CGGCACGCTC GCTGTTCGCG TTGGCTTCCT 3800
ATGGTCGGCG CGATCCCGCT TCCTTGGAAC GGTTGCGCGC CACCTTCGTG AAGGAAGGCA TCCCGCCGGA ATTTCTGTCA CATTACCTGC CTGATGCACC 3900
CTTTGCATGT CTTAAACAAA ATGACGGGTT AAGTGACAAA TTTTGACGGA TAGAGCTTTC CGGCTCACAC TGTCACATAA TCGAACGTAT ACGTGACGGG 4000
TGAAAAGGTG CTGATCGGCT ACATGCGGGT ATCGAAGGCG GACGGATCCC AGTCCACCAA TTTGCAACGC GATGCGCTCA TCGCCGCTGG TGTGAGCCTT 4100
GCGCACCTTT ACGAGGATCT GGCCTCGGGC AGGCGCGATG ATCGCCCAGG GTTGGCTGCT TGCCTGAAGG CGCTTCGTGA AGGGGACACG CTGATCGTGT 4200
GGAAGCTCGA TCGGCTTGGC CGTGATCTGC GCCACCTGAT CAACACCGTG CACGACCTAA CTGCGCGTAG CGTGGGCCTG AAGGTCCTGA CCGGTCACGG 4300
TGCGGCGGTC GACACGACGA CTGCCGCCGG CAAGCTTGTG TTCGGTATTT TTGCCGCGCT GGCCGAGTTC GAGCGTGAGT TGATTTCCGA GCGAACAGTC 4400
GCTGGACTTA TCTCGGCGCG CGCTCGCGGC AGGAAAGGGG GGCGCCCCTT CAAGATGACC GCCGCCAAGC TACGCCTGGC GATGGCCAGC ATGGGGCAAC 4500
CGGAAACCAA GGTGGGCGAT CTCTGCGAAG AACTCGGGAT TACCCGGCAG ACGCTCTACC GGCACGTGTC GCCCAAGGGC GAACTGCGGC CAGACGGCGT 4600
AAAGCTGCTC TCCCTCGGTT CAGCCGCATA AATGGAGGCG ACCTGGAACG GGGCGCTGTT CAGTGCGGCA ACGATCCGAT TACCGGTGTC GACCCAGAGC 4700
AGCCGTAGAG CTTTTGGGAA AGCTGTCGTT CAACGTTTGA CATGAGGGGC GGCCAAGGGC GCCAGCCCTT GGACGTCCCC CTCGATGGAA GGGTTAGGCA 4800
TCACTGCGTG TTCGCTCGAA TGCCTGGCGT GTTTGAACCA TGTACACGGC TGGACCATCT GGGGTGGTTA CGGTACCTTG CCTCTCAAAC CCCGCTTTCT 4900
CGTAGCATCG GATCGCTCGC AAGTTGCTCG GCGACGGGTC CGTTTGGATC TTGGTGACCT CGGGATCATT GAACAGCAAC TCAACCAGTG CTCGAACCAG 5000
CTTGGTTCCC AAGCCTTTGC CCAGTTGTGA TGCATTCGCC AGTGACTGGT CTATTCCGCG TACTCCTGGA TCGGTTTCTT CTTCCCACCA TCCGTCCCCG 5100
CTTCCAAGAG CAACGTACGA CTGGGCATAC CCAATCGGCT CTCCATTCAG CATTGCAATG TATGGAGTGA CGGACTCTTG CGCTAAAACG CTTGGCAAGT 5200
ACTGTTCCTG TACGTCAGCA AGTGTCGGGC GTGCTTCTTC TCCGCCCCAC CACTCGACGA TATGAGATCG ATTTAGCCAC TCATAGAGCA TCGCAAGGTC 5300
ATGCTCAGTC ATGAGGCGCA GTGTGACGGA ATCGTTGCTG TTGGTCACGA TGCTGTACTT TGTGATGCCT AACTTTGTTT TAGGGCGACT GCCCTGCTGC 5400
GTAACATCGT TGCTGCTCCA TAACATCAAA CATCGACCCA CGGCGTAACG CGCTTGCTGC TTGGATGCCC GAGGCATAGA CTGTACAAAA AAACAGTCAT 5500
AACAAGCCAT GAAAACCGCC ACTGCGCCGT TACCACCGCT GCGTTCGGTC AAGGTTCTGG ACCAGTTGCG TGAGCGCATA CGCTACTTGC ATTACAGTTT 5600
ACGAACCGAA CAGGCTTATG TCAACTGGGT TCGTGCCTTC ATCCGTTTCC ACGGTGTGCG TCACCCGGCA ACCTTGGGCA GCAGCGAAGT CGAGGCATTT 5700
CTGTCCTGGC TGGCGAACGA GCGCAAGGTT TCGGTCTCCA CGCATCGTCA GGCATTGGCG GCCTTGCTGT TCTTCTACGG CAAGGTGCTG TGCACGGATC 5800
TGCCCTGGCT TCAGGAGATC GGAAGACCTC GGCCGTCGCG GCGCTTGCCG GTGGTGCTGA CCCCGGATGA AGTGGTTCGC ATCCTCGGTT TTCTGGAAGG 5900
CGAGCATCGT TTGTTCGCCC AGCTTCTGTA TGGAACGGGC ATGCGGATCA GTGAGGGTTT GCAACTGCGG GTCAAGGATC TGGATTTCGA TCACGGCACG 6000
ATCATCGTGC GGGAGGGCAA GGGCTCCAAG GATCGGGCCT TGATGTTACC CGAGAGCTTG GCACCCAGCC TGCGCGAGCA GCTGTCGCGT GCACGGGCAT 6100
GGTGGCTGAA GGACCAGGCC GAGGGCCGCA GCGGCGTTGC GCTTCCCGAC GCCCTTGAGC GGAAGTATCC GCGCGCCGGG CATTCCTGGC CGTGGTTCTG 6200
GGTTTTTGCG CAGCACACGC ATTCGACCGA TCCACGGAGC GGTGTCGTGC GTCGCCATCA CATGTATGAC CAGACCTTTC AGCGCGCCTT CAAACGTGCC 6300
GTAGAACAAG CAGGCATCAC GAAGCCCGCC ACACCGCACA CCCTCCGCCA CTCGTTCGCG ACGGCCTTGC TCCGCAGCGG TTACGACATT CGAACCGTGC 6400
AGGATCTGCT CGGCCATTCC GACGTCTCTA CGACGATGAT TTACACGCAT GTGCTGAAAG TTGGCGGTGC CGGAGTGCGC TCACCGCTTG ATGCGCTGCC 6500
GCCCCTCACT AGTGAGAGGT AGGGCAGCGC AAGTCAATCC TGGCGGATTC ACTACCCCTG CGCGAAGGCC ATCGGTGCCG CATCGAACGG CCGGTTGCGG 6600
AAAGTCCTCC CTGCGTCCGC TGATGGCCGG CAGCAGCCCG TCGTTGCCTG ATGGATCCAA CCCCTCCGCT GCTATAGTGC AGTCGGCTTC TGACGTTCAG 6700
TGCAGCCGTC TTCTGAAAAC GACA
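
The numbered layout above (a ruler row, ten-base groups, and a running position count at the end of each row) has to be flattened before the coordinates in the feature tables can be applied to it. The following is a minimal Python sketch, assuming the coordinates in this record are 1-based and inclusive. The function names `parse_numbered_sequence` and `feature` are illustrative and not part of any database tooling.

```python
import re

# Ruler plus the first row of the sequence, copied verbatim from the record.
EXAMPLE_BLOCK = """\
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
"""


def parse_numbered_sequence(block: str) -> str:
    """Flatten a ruler-annotated sequence block into one uppercase string."""
    rows = []
    for line in block.splitlines():
        line = line.strip()
        # Skip blank rows and the ruler row, which starts with dashes.
        if not line or line.startswith("-"):
            continue
        # Keep nucleotide letters only; spaces and the trailing count are dropped.
        rows.append(re.sub(r"[^ACGTNacgtn]", "", line))
    return "".join(rows).upper()


def feature(seq: str, start: int, end: int) -> str:
    """Return the top-strand bases for 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = parse_numbered_sequence(EXAMPLE_BLOCK)
print(len(seq))             # 100
print(feature(seq, 9, 27))  # TCAGAAGACGACTGCACCA (listed in this record as repeat t1)
```

Running the same function over the full block should give one string whose length equals the last position in the listing, after which any coordinate pair in the record can be passed to `feature`.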
|
|
|
|
Recombination Sites |
|
|
Name | Coordinates | Length (bp) | Sequence |
r6 | 3824-3837 | 14 | TTGGAACGGT TGCG |
r5 | 3870-3883 | 14 | AATTTCTGTC ACAT |
r4 | 3875-3888 | 14 | CTGTCACATT ACCT |
r3 | 3926-3939 | 14 | GGGTTAAGTG ACAA |
res | 3967-4001 | 35 | ACACTGTCAC ATAATCGAAC GTATACGTGA CGGGT |
r2 | 3970-3983 | 14 | CTGTCACATA ATCG |
r1 | 3986-3999 | 14 | CGTATACGTG ACGG |
attC AAC(6')-Ib core | 4734-4799 | 66 | CGTTTGACAT GAGGGGCGGC CAAGGGCGCC AGCCCTTGGA CGTCCCCCTC GATGGAAGGG TTAGGC |
attI | 5373-5428 | 56 | CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA |
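
The third column of the table above equals the span length implied by each coordinate pair, and the r2 and r1 subsites lie inside the res span. Both points can be checked mechanically. This is a minimal Python sketch, assuming 1-based inclusive coordinates. The names `SITES`, `span_length` and `subsite` are illustrative.

```python
# name -> (start, end, sequence as printed in the table, grouping spaces kept)
SITES = {
    "r6": (3824, 3837, "TTGGAACGGT TGCG"),
    "r5": (3870, 3883, "AATTTCTGTC ACAT"),
    "r4": (3875, 3888, "CTGTCACATT ACCT"),
    "r3": (3926, 3939, "GGGTTAAGTG ACAA"),
    "res": (3967, 4001, "ACACTGTCAC ATAATCGAAC GTATACGTGA CGGGT"),
    "r2": (3970, 3983, "CTGTCACATA ATCG"),
    "r1": (3986, 3999, "CGTATACGTG ACGG"),
    "attC": (4734, 4799, "CGTTTGACAT GAGGGGCGGC CAAGGGCGCC AGCCCTTGGA "
                         "CGTCCCCCTC GATGGAAGGG TTAGGC"),
    "attI": (5373, 5428, "CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG "
                         "CTGCTCCATA ACATCA"),
}


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


def bases(name: str) -> str:
    """Sequence of a site with the display spaces removed."""
    return SITES[name][2].replace(" ", "")


def subsite(outer: str, inner: str) -> str:
    """Cut the inner site out of the outer site using only the coordinates."""
    o_start, _, _ = SITES[outer]
    i_start, i_end, _ = SITES[inner]
    offset = i_start - o_start
    return bases(outer)[offset:offset + span_length(i_start, i_end)]


for name, (start, end, _) in SITES.items():
    # Every printed sequence should be exactly as long as its span.
    assert len(bases(name)) == span_length(start, end), name

print(subsite("res", "r2"))  # CTGTCACATAATCG
print(subsite("res", "r1"))  # CGTATACGTGACGG
```

The two printed strings match the r2 and r1 rows, which is what one would expect if the subsites are annotated on the same strand and numbering as res.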
|
ORFs |
|
|
Gene Name | Associated TE | Coordinates | Class | Sub Class | Orientation |
tniA | TnpHS87a | 142-1821 | Transposase | | + |
tniB | TnpHS87a | 1824-2732 | Accessory Gene | | + |
tniQ | TnpHS87a | 2729-3946 | Accessory Gene | Target Site Selection | + |
tniR | TnpHS87a | 4008-4631 | Accessory Gene | Resolvase | + |
AAC(6')-Ib10 (ARO:3002581) | TnpHS87a | 4794-5348 | Passenger Gene | Antibiotic Resistance | - |
intI1 | TnpHS87a | 5509-6522 | Integron Integrase | Class 1 | + |
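
The ORF coordinates can be sanity-checked with simple arithmetic: each span should be a whole number of codons, and the residue count should be one less than the codon count if the span includes the stop codon. The sketch below assumes 1-based inclusive coordinates and a stop codon inside each span. Both are assumptions about this record, though the resulting counts agree with the protein sequences listed further down. The names `ORFS`, `residues` and `overlap` are illustrative.

```python
# (name, start, end, strand) copied from the ORF table.
ORFS = [
    ("tniA", 142, 1821, "+"),
    ("tniB", 1824, 2732, "+"),
    ("tniQ", 2729, 3946, "+"),
    ("tniR", 4008, 4631, "+"),
    ("AAC(6')-Ib10", 4794, 5348, "-"),
    ("intI1", 5509, 6522, "+"),
]


def orf_length(start: int, end: int) -> int:
    """Nucleotide length of a 1-based, inclusive span."""
    return end - start + 1


def residues(start: int, end: int) -> int:
    """Residue count, assuming the span ends with a stop codon."""
    length = orf_length(start, end)
    if length % 3 != 0:
        raise ValueError(f"span {start}-{end} is not a whole number of codons")
    return length // 3 - 1


def overlap(a, b) -> int:
    """Number of positions shared by two ORF tuples (0 if they are disjoint)."""
    return max(0, min(a[2], b[2]) - max(a[1], b[1]) + 1)


summary = {name: (orf_length(s, e), residues(s, e)) for name, s, e, _ in ORFS}
for name, (nt, aa) in summary.items():
    print(f"{name}: {nt} bp, {aa} residues")

# tniB and tniQ share a few positions at their junction.
print(overlap(ORFS[1], ORFS[2]))  # 4
```

The 4 bp shared by tniB and tniQ follows directly from the listed coordinates (2729 to 2732).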
Gene Name | Protein Name | Associated TE | Gene Length (bp) | Coordinates | Strand |
tniA | TniA | TnpHS87a | 1680 | 142-1821 | + |
Class: | Transposase |
Transposase Chemistry: | DDE |
Comment: | can be extended upstream by 12 amino acids; identical to tniA (Tn1721 and In2); 25% amino acid sequence identity to TnsB from Tn7 |
Protein Sequence:
|
MATDTPRIPE QGVATLPDEA WERARRRAEI ISPLAQSETV GHEAADMAAQ ALGLSRRQVY VLIRRARQGS GLVTDLVPGQ SGGGKGKGRL PEPVERVIHE LLQKRFLTKQ KRSLAAFHRE VTQVCKAQKL RVPARNTVAL RIASLDPRKV IRRREGQDAA RDLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDDRDRQPI GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLVHVAC DKRPWLEGLN VEMDWQMSGK PLLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAVARVGV PAVVTRATSF LVDFLPILRR TLTRTGFVID HIHYYADALK PWIARRERWP SFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR EIVTSAQKAT RKARRDADRR QHLKTSARPD KPVPPDTDIA DPQADNLPPA KPFDQIEEW
|
Gene Name | Protein Name | Associated TE | Gene Length (bp) | Coordinates | Strand |
tniB | TniB | TnpHS87a | 909 | 1824-2732 | + |
Class: | Accessory Gene |
Sequence Family: | ATP binding protein? |
Comment: | identical to tniB (Tn1721); similar function to Tn7 tnsC and MuB |
Protein Sequence:
|
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLMAAAIVAV ESGEEAINHR TLSMAVYTGP SERRRQFERE LM
|
Gene Name | Protein Name | Associated TE | Gene Length (bp) | Coordinates | Strand |
tniQ | TniQ | TnpHS87a | 1218 | 2729-3946 | + |
Class: | Accessory Gene |
Sub Class: | Target Site Selection |
Comment: | identical to tniQ (Tn1721); similar function to Tn7 tnsD? |
Protein Sequence:
|
MKPAPRWPLH PAPKEGEALS SWLNRVALCY HMEEPDLLEH DLGHGQVDDL DTAPPLSLLA LLSQRSGIEL DRLRCMSFAG WVPWLLDSLD DQIPDALETY AFQLSVLLPR LRRKTRSITS WRAWLPSQPI NRACPLCLSD PENQAVLLAW KLPLMLSCPL HGCWLESYWG VPGRFLGWEN ADAEPRTASD AIAAMDQRTW QALTTGHVEL PRRRIHAGLW FRLLRTLLDE LNTPLSACGT CAGYPRQVWE GCGHPLRAGQ SLWRPYETLN PIVRLQMLEA AATAISLIEV RDISPPGEQA KLFWSEPQTG FTSGLPTKAP KPEPINHWQR AVQAIDEAII EARHNPETAR SLFALASYGR RDPASLERLR ATFVKEGIPP EFLSHYLPDA PFACLKQNDG LSDKF
|
Gene Name | Protein Name | Associated TE | Gene Length (bp) | Coordinates | Strand |
tniR | TniR | TnpHS87a | 624 | 4008-4631 | + |
Class: | Accessory Gene |
Sub Class: | Resolvase |
Transposase Chemistry: | Serine |
Sequence Family: | Serine Site-Specific Recombinase |
Comment: | resolution of cointegrates; Protein: ACE81792.1; identical to tniR (Tn1721) |
Protein Sequence:
|
MLIGYMRVSK ADGSQSTNLQ RDALIAAGVS LAHLYEDLAS GRRDDRPGLA ACLKALREGD TLIVWKLDRL GRDLRHLINT VHDLTARSVG LKVLTGHGAA VDTTTAAGKL VFGIFAALAE FERELISERT VAGLISARAR GRKGGRPFKM TAAKLRLAMA SMGQPETKVG DLCEELGITR QTLYRHVSPK GELRPDGVKL LSLGSAA
|
Gene Name | Protein Name | Associated TE | Gene Length (bp) | Coordinates | Strand |
AAC(6')-Ib10 (ARO:3002581) | AAC(6')-Ib10 | TnpHS87a | 555 | 4794-5348 | - |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic inactivation (ARO:0001004) |
Transposase Chemistry: | aminoglycoside acetyltransferase |
Target: | aminoglycoside antibiotic (ARO:0000016) |
Sequence Family: | AAC(6') (ARO:3000345) |
Comment: | strict match to reference sequence for ARO:3002581 (bitscore: 377) |
Protein Sequence:
|
MTNSNDSVTL RLMTEHDLAM LYEWLNRSHI VEWWGGEEAR PTLADVQEQY LPSVLAQESV TPYIAMLNGE PIGYAQSYVA LGSGDGWWEE ETDPGVRGID QSLANASQLG KGLGTKLVRA LVELLFNDPE VTKIQTDPSP SNLRAIRCYE KAGFERQGTV TTPDGPAVYM VQTRQAFERT RSDA
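
Because AAC(6')-Ib10 is annotated on the minus strand, its coding sequence is the reverse complement of the top-strand span, so translation starts at the high coordinate (5348) and runs toward 4794. The sketch below illustrates this with the twelve top-strand bases at positions 5337-5348, copied from the sequence listing above under the assumption of 1-based inclusive coordinates. The helper names are illustrative.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the opposite strand, written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def codons(seq: str) -> list[str]:
    """Split a sequence into complete triplets, ignoring any trailing bases."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# Top-strand bases 5337-5348, i.e. the last twelve positions of the ORF span.
TOP_5337_5348 = "GCTGTTGGTCAC"

cds_start = reverse_complement(TOP_5337_5348)
print(cds_start)          # GTGACCAACAGC
print(codons(cds_start))  # ['GTG', 'ACC', 'AAC', 'AGC']
```

Under the standard genetic code ACC, AAC and AGC encode T, N and S, and a GTG initiation codon is read as methionine in bacteria, which is consistent with the protein sequence above beginning MTNS.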
|
Gene Name | Protein Name | Associated TE | Gene Length (bp) | Coordinates | Strand |
intI1 | IntI1 | TnpHS87a | 1014 | 5509-6522 | + |
Class: | Integron Integrase |
Sub Class: | Class 1 |
Transposase Chemistry: | Tyrosine |
Sequence Family: | Class 1 Integron Tyrosine Integrase |
Protein Sequence:
|
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVNW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER
|
|
Internal Repeat Elements |
|
|
Name | Associated Mobile Element | Coordinates | Sequence (Top Strand) |
repeat t1 | TnpHS87a | 9-27 | TCAGAAGACG ACTGCACCA |
repeat t2 | TnpHS87a | 49-67 | AACACGTCGG TCGAGGACT |
repeat t3 | TnpHS87a | 78-97 | TCAGAAGTGA TCTGCACCAA |
repeat t4 | TnpHS87a | 110-128 | TCAATACTCG TGTGCACCA |
repeat i4 | TnpHS87a | 6605-6623 | AGGAGGGACG CAGGCGACT |
repeat i3 | TnpHS87a | 6633-6651 | CGTCGGGCAG CAACGGACT |
repeat i2 | TnpHS87a | 6675-6693 | ATCACGTCAG CCGAAGACT |
IRi | TnpHS87a | 6692-6724 | CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT |
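
The i-side strings in this table appear to be written as the complement of the top strand, read left to right. Checking repeat i2 and IRi against the sequence listing supports this, though the record does not state the convention. Under that assumption, reversing a listed i-side string gives the bottom strand in the 5' to 3' direction, which can then be compared position by position with a t-side repeat. The following is a minimal Python sketch with illustrative names.

```python
def hamming(a: str, b: str) -> int:
    """Count mismatching positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


# Strings copied from the table; the grouping spaces are removed.
REPEAT_T1 = "TCAGAAGACG ACTGCACCA".replace(" ", "")
REPEAT_I2 = "ATCACGTCAG CCGAAGACT".replace(" ", "")

# Assumption: the listed i2 string is the complement strand written 3' to 5',
# so reversing it yields the bottom strand written 5' to 3'.
i2_bottom = REPEAT_I2[::-1]

print(i2_bottom)                      # TCAGAAGCCGACTGCACTA
print(hamming(REPEAT_T1, i2_bottom))  # 2
```

Two mismatches over 19 positions is what one would expect from imperfect inverted copies of the same repeat at the two ends of the element.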
|