|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Internal Transposable Elements | |
|
|
|
|
References | |
|
|
|
|
|
|
|
|
|
Name: In22 |
|
Family: Tn402 Group: Class 1 |
|
Evidence of Transposition: no |
|
|
Host |
|
|
Host Organism: | Escherichia coli SCU-164 | Molecular Source: | cchromosome |
Place of Origin: | USA | Date of Isolation: | 2020 |
|
Map |
|
Terminal Inverted Repeats (IR) |
|
Sequence |
|
|
|
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GACACCCCAC GGATTCCAGA ACAAGGCGTG GCCACTCTGC CTGATGAGGC 200
TTGGGAGCGT GCGCGCCGTC GTGCGGAGAT CATCAGTCCG TTGGCGCAGT CGGAGACGGT CGGGCACGAA GCGGCCGATA TGGCGGCTCA GGCGCTGGGC 300
TTGTCTCGGC GCCAGGTATA CGTTCTGATC CGGCGTGCCC GGCAAGGCAG CGGCCTCGTG ACGGATCTGG TGCCCGGCCA GTCCGGTGGA GGTAAAGGTA 400
AGGGGCGCTT GCCGGAACCG GTCGAGCGCG TCATCCACGA GCTACTGCAA AAGCGGTTCC TGACCAAGCA GAAGCGCAGC CTAGCGGCCT TTCACCGCGA 500
AGTCACTCAG GTGTGCAAGG CTCAAAAACT GCGAGTGCCG GCGCGCAATA CCGTGGCCTT ACGGATCGCT AGCCTTGACC CGCGCAAGGT CATCCGCCGG 600
CGGGAAGGCC AGGATGCCGC TCGTGACCTA CAAGGTGTGG GCGGCGAGCC TCCTGCCGTG ACCGCGCCGC TGGAGCAGGT GCAGATAGAC CATACGGTCA 700
TCGACCTGAT CGTGGTCGAT GACCGCGACC GGCAACCTAT TGGCCGCCCG TACCTGACCC TCGCCATCGA CGTGTTCACC CGCTGCGTGC TCGGCATGGT 800
CGTCACGCTG GAAGCGCCGT CTGCCGTTTC GGTTGGCCTG TGCCTCGTGC ATGTCGCCTG CGACAAGCGC CCTTGGCTGG AAGGACTGAA CGTGGAAATG 900
GATTGGCAGA TGAGCGGCAA GCCCTTGCTG CTCTACCTAG ACAACGCGGC CGAGTTCAAG AGCGAGGCCC TGCGCCGGGG TTGCGAGCAG CATGGCATCC 1000
GGCTGGACTA TCGCCCGCTG GGACAGCCGC ACTATGGCGG CATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATT CACGACGAAC TGCCGGGAAC 1100
GACCTTCTCC AACCCTGACC AGCGCGGCGA CTACGATTCC GAAAACAAGG CCGCCCTGAC GCTGCGCGAG CTAGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTACCACG GTTCGGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAG GCCGTGGCGC GTGTCGGCGT ACCGGCCGTC GTCACACGCG 1300
CTACTTCGTT CCTGGTCGAT TTTCTGCCGA TCCTCCGGCG CACGCTGACC CGCACCGGCT TTGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATT GCGCGGCGTG AACGCTGGCC GTCCTTTCTG ATCCGGCGCG ATCCGCGCGA CATCAGCCGT ATCTGGGTCC TGGAACCGGA GGGACAGCAT 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CATCCGGCTG TCACCCTCTG GGAACAACGG CAGGCGCTGG CGAAACTGCG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC GGCGCTGTTC CGCATGATCG GCCAGATGCG TGAGATTGTG ACCAGCGCGC AGAAGGCCAC ACGCAAGGCG CGGCGTGACG CGGATCGCCG 1700
CCAGCACCTC AAGACATCAG CTCGGCCGGA CAAGCCCGTT CCGCCGGATA CGGATATTGC CGACCCGCAG GCAGACAACT TGCCACCCGC CAAACCGTTC 1800
GACCAGATTG AGGAGTGGTA GCCGTGGACG AATATCCCAT CATCGACCTG TCCCACCTGC TGCCGGCGGC CCAGGGCTTG GCCCGTCTTC CGGCGGACGA 1900
GCGCATCCAG CGCCTTCGCG CCGACCGCTG GATCGGCTAT CCGCGCGCAG TCGAGGCGCT GAACCGGCTG GAAGCCCTTT ATGCGTGGCC AAACAAGCAA 2000
CGCATGCCCA ACCTGCTGCT GGTTGGCCCG ACCAACAATG GCAAGTCGAT GATCGTCGAG AAGTTCCGCC GCACCCACCC GGCCAGCTCC GACGCCGACC 2100
AGGAGCACAT CCCGGTGTTG GTCGTGCAGA TGCCGTCCGA GCCGTCCGTG ATCCGCTTCT ACGTCGCGCT GCTCGCCGCG ATGGGCGCGC CGCTGCGCCC 2200
ACGCCCACGG TTGCCGGAAA TGGAGCAACT GGCTCTGGCA CTGCTGCGCA AGGTCGGCGT GCGCATGCTG GTGATCGACG AGCTGCACAA CGTGCTGGCC 2300
GGCAACAGCG TCAACCGCCG GGAATTCCTC AACCTGCTGC GCTTCCTCGG CAACGAACTG CGCATCCCGT TGGTTGGGGT AGGCACGCGC GACGCCTACC 2400
TAGCCATCCG CTCCGATGAC CAGTTGGAAA ATCGCTTCGA GCCGATGATG CTGCCGGTAT GGGAGGCCAA CGACGATTGC TGCTCACTGC TGGCCAGCTT 2500
CGCCGCTTCG CTCCCGCTGC GCCGGCCTTC CCCAATTGCC ACGCTGGACA TGGCTCGCTA CCTGCTCACA CGCAGCGAGG GCACCATAGG GGAACTGGCG 2600
CACTTGCTGA TGGCGGCGGC CATCGTCGCC GTGGAGAGCG GCGAGGAAGC GATCAACCAT CGCACACTCA GCATGGCCTG TCGACAACCT CTCGCGCAAC 2700
CAAGACATCG CGGTCGGACT GCAAGTGATC TTGAAGCCAC GGGCCCGTCC CACCCCGACA TGGACCTCGA TGCCCGAACG GACGTTAGAT TTCGAGTTCT 2800
AGGCGTTCTG CGATGAAGGT TGGATCCCAG CCGGGATTGA AAGTGTCGAC GTGGGTGAAT CCGAGCCGCT CGTATAGGCC ACGCAGGTTC GGGTGGCAGT 2900
CGAGCCGCAG CTTGGCGCAC CCCTGCGTTC GCGCGGCATG GCGGCAAGCC TCGATCAGCG CGGAGCTGAC ACCCCGGCCC GCATGTGTCC GTCGCACCGC 3000
GAGCTTGTGC AGATATGCGG CCTCCCCCTT GAGGGCGTCG GGCCAGAACT CGGGATCCTC GGCCGACAAG GTGCAACAGC CGACGATGCC GTCGCTGCAA 3100
CTCGCGACTA GGAGCTCGGA TCTCAGGACG AAGGTCTCCG CGAATGTCCG GTCGATCCGC GCGACGTCCC AGGCGGGCGT TCCCTTGGCG GACATCCACG 3200
CCGCAGCGTC GTGCATCAGC CGCACAACCT CGTCGATATC ACCCGAGCAG GCGACCCGAA CGTTCGGAGG CTCCTCGCTG TCCATTCGCT CCCCTGGCGC 3300
GGTATGAACC GCCGCCTCAT AGTGCAGTTT GATCCTGACG AGCCCAGCAT GTCTGCGCCC ACCTTCGCGG AACCTGACCA GGGTCCGCTA GCGGGCGGCC 3400
GGAAGGTGAA TGCTAGGCAT GATCTAACCC TCGGTCTCTG GCGTCGCGAC TGCGAAATTT CGCGAGGGTT TCCGAGAAGG TGATTGCGCT TCGCAGATCT 3500
CCAGGCGCGT GGGTGCGGAC GTAGTCAGCG CCATTGCCGA TCGCGTGAAG TTCCGCCGCA AGGCTCGCTG GACCCAGATC CTTTACAGGA AGGCCAACGG 3600
TGGCGCCCAA GAAGGATTTC CGCGACACCG AGACCAATAG CGGAAGCCCC AACGCCGACT TCAGCTTTTG AAGGTTCGAC AGCACGTGCA GCGATGTTTC 3700
CGGTGCGGGG CTCAAGAAAA ATCCCATCCC CGGATCGAGG ATGAGCCGGT CGGCAGCGAC CCCGCTCCGT CGCAAGGCGG AAACCCGCGC CTCGAAGAAC 3800
CGCACAATCT CGTCGAGCGC GTCTTCGGGT CGAAGGTGAC CGGTGCGGGT GGCGATGCCA TCCCGCTGCG CTGAGTGCAT AACCACCAGC CTGCAGTCCG 3900
CCTCAGCAAT ATCGGGATAG AGCGCAGGGT CAGGAAATCC TTGGATATCG TTCAGGTAGC CCACGCCGCG CTTGAGCGCA TAGCGCTGGG TTTCCGGTTG 4000
GAAGCTGTCG ATTGAAACAC GGTGCATCTG ATCGGACAGG GCGTCTAAGA GCGGCGCAAT ACGTCTGATC TCATCGGCCG GCGATACAGG CCTCGCGTCC 4100
GGATGGCTGG CGGCCGGTCC GACATCCACG ACGTCTGATC CGACTCGCAG CATTTCGATC GCCGCGGTGA CAGCGCCGGC GGGGTCTAGC CGCCGGCTCT 4200
CATCGAAGAA GGAGTCCTCG GTGAGATTCA GAATGCCGAA CACCGTCACC ATGGCGTCGG CCTCCGCAGC GACTTCCACG ATGGGGATCG GGCGAGCAAA 4300
AAGGCAGCAA TTATGAGCCC CATACCTACA AAGCCCCACG CATCAAGCTT TTGCCCATGA AGCAACCAGG CAATGGCTGT AATTATGACG ACGCCGAGTC 4400
CCGACCAGAC TGCATAAGCA ACACCGACAG GGATGGATTT CAGAACCAGA GAAAGAAAAT AAAATGCGAT GCCATAACCG ATTATGACAA CGGCGGAAGG 4500
GGCAAGCTTA GTAAAGCCCT CGCTAGATTT TAATGCGGAT GTTGCGATTA CTTCGCCAAC TATTGCGATA ACAAGAAAAA GCCAGCCTTT CATGATATAT 4600
CTCCCAATTT GTGTAGGGCT TATTATGCAC GCTTAAAAAT AATAAAAGCA GACTTGACCT GATAGTTTGG CTGTGAGCAA TTATGTGCTT AGTGCATCTA 4700
ACGCCGAGTT CAGCGGCAGT TTTTAAGTTG TGGTTTTATG GAATACTTTT GCGCAGCAAA ACCATAAAAC CGCGACTTAA AAACTGTCCA AGGAGCGCAG 4800
CGACTGGTGC TGGAACGACT TGTTAGCCTT TTTTCCAAAT CTGATATGTG TAATTTATAT TAGACAAAAA AAACTGCTCA AAAACCAAAT TGAAATTCTC 4900
TGGAATTTTA GGAAAATTGA TATCACCTTC AACCTCAACG TGAACAGTAG ACAAATGAAT TATATCTGCT TTTTCAATAA GACTATTGTA GATTTGACCG 5000
CCACCAGAGA CATATAAATG ATCTGTAATT TTCGATAGTT CTTGCAAAGC GATTTCTATT GAAGGAAAGA CTAATACATT TTCATTTGAG CTTGAAATTC 5100
CTTTCCTCGA CACTACTGCA TATTTTCGAT TTGGAAGAAC ACCCATAGAG TCAAATGTTT TCCTTCCAAC AAGGAGCCAC TGATTATATG TGAGCGCTTT 5200
AAAGAGTAAC TGCTCACCTT TTGCTGACCA TGGGATATCA GGGCCATTAC CGATTACGCC ATTTTCTGAC GTTGCAGAAA TCAATGAAAT TTTCAATTCA 5300
ACCCCCGTAA TGGCTAACTT TGTTTTAGGG CGACTGCCCT GCTGCGTAAC ATCGTTGCTG CTCCATAACA TCAAACATCG ACCCACGGCG TAACGCGCTT 5400
GCTGCTTGGA TGCCCGAGGC ATAGACTGTA CAAAAAAACA GTCATAACAA GCCATGAAAA CCGCCACTGC GCCGTTACCA CCGCTGCGTT CGGTCAAGGT 5500
TCTGGACCAG TTGCGTGAGC GCATACGCTA CTTGCATTAC AGCTTACGAA CCGAACAGGC TTATGTCCAC TGGGTTCGTG CCTTCATCCG TTTCCACGGT 5600
GTGCGTCACC CGGCAACCTT GGGCAGCAGC GAAGTCGAGG CATTTCTGTC CTGGCTGGCG AACGAGCGCA AGGTTTCGGT CTCCACGCAT CGTCAGGCAT 5700
TGGCGGCCTT GCTGTTCTTC TACGGCAAGG TGCTGTGCAC GGATCTGCCC TGGCTTCAGG AGATCGGAAG ACCTCGGCCG TCGCGGCGCT TGCCGGTGGT 5800
GCTGACCCCG GATGAAGTGG TTCGCATCCT CGGTTTTCTG GAAGGCGAGC ATCGTTTGTT CGCCCAGCTT CTGTATGGAA CGGGCATGCG GATCAGTGAG 5900
GGTTTGCAAC TGCGGGTCAA GGATCTGGAT TTCGATCACG GCACGATCAT CGTGCGGGAG GGCAAGGGCT CCAAGGATCG GGCCTTGATG TTACCCGAGA 6000
GCTTGGCACC CAGCCTGCGC GAGCAGCTGT CGCGTGCACG GGCATGGTGG CTGAAGGACC AGGCCGAGGG CCGCAGCGGC GTTGCGCTTC CCGACGCCCT 6100
TGAGCGGAAG TATCCGCGCG CCGGGCATTC CTGGCCGTGG TTCTGGGTTT TTGCGCAGCA CACGCATTCG ACCGATCCAC GGAGCGGTGT CGTGCGTCGC 6200
CATCACATGT ATGACCAGAC CTTTCAGCGC GCCTTCAAAC GTGCCGTAGA ACAAGCAGGC ATCACGAAGC CCGCCACACC GCACACCCTC CGCCACTCGT 6300
TCGCGACGGC CTTGCTCCGC AGCGGTTACG ACATTCGAAC CGTGCAGGAT CTGCTCGGCC ATTCCGACGT CTCTACGACG ATGATTTACA CGCATGTGCT 6400
GAAAGTTGGC GGTGCCGGAG TGCGCTCACC GCTTGATGCG CTGCCGCCCC TCACTAGTGA GAGGTAGGGC AGCGCAAGTC AATCCTGGCG GATTCACTAC 6500
CCCTGCGCGA AGGCCATCGG TGCCGCATCG AACGGCCGGT TGCGGAAAGT CCTCCCTGCG TCCGCTGATG GCCGGCAGCA GCCCGTCGTT GCCTGATGGA 6600
TCCAACCCCT CCGCTGCTAT AGTGCAGTCG GCTTCTGACG TTCAGTGCAG CCGTCTTCTG AAAACGACA
|
|
|
|
Recombination Sites |
|
|
Name |
Coordinates |
Gene |
Sequence |
attC cmlA6 3'-end |
33-38 |
6 |
TTGGGC |
attC qacEdelta1_sul1 core |
3385-3418 |
34 |
CCGCTAGCGG GCGGCCGGAA GGTGAATGCT AGGC |
attC dfrA7 core |
4720-4783 |
64 |
TTTTTAAGTT GTGGTTTTAT GGAATACTTT TGCGCAGCAA AACCATAAAA CCGCGACTTA AAAA |
attI |
5318-5373 |
56 |
CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA |
|
ORFs |
|
|
Gene Name |
Associated TE |
Coordinates |
Class |
Sub Class |
Orientation |
urfM 5'-end |
Tn21 |
1-30 |
Passenger Gene |
Other |
+ |
tniA |
In22 |
142-1821 |
Transposase |
|
+ |
tniB delta1 |
In22 |
1824-2680 |
Accessory Gene |
|
+ |
GNAT_fam |
In22 |
2785-3285 |
Passenger Gene |
Antibiotic Resistance |
- |
sul1 (ARO:3000410) |
In22 |
3413-4252 |
Passenger Gene |
Antibiotic Resistance |
- |
qacEdelta1 (ARO:3005010) |
In22 |
4246-4593 |
Passenger Gene |
Antibiotic Resistance |
- |
dfrA7 (ARO:3002862) |
In22 |
4823-5296 |
Passenger Gene |
Antibiotic Resistance |
- |
intI1 |
In22 |
5454-6467 |
Integron Integrase |
Class 1 |
+ |
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
urfM 5'-end |
N |
Tn21 |
30 |
1-30 |
+ |
Class: | Passenger Gene |
Sub Class: | Other |
Comment: | urfM ORF interrupted by insertion of In2 |
Protein Sequence:
|
VIFRRRLHQ
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniA |
TniA |
In22 |
1680 |
142-1821 |
+ |
Class: | Transposase |
Transpoase Chemistry: | DDE |
Comment: | can be extended upstream by 12 amino acids| identical to tniA (Tn1721 and In2)| 25% amino acid sequence identity to TnsB from Tn7 |
Protein Sequence:
|
MATDTPRIPE QGVATLPDEA WERARRRAEI ISPLAQSETV GHEAADMAAQ ALGLSRRQVY VLIRRARQGS GLVTDLVPGQ SGGGKGKGRL PEPVERVIHE LLQKRFLTKQ KRSLAAFHRE VTQVCKAQKL RVPARNTVAL RIASLDPRKV IRRREGQDAA RDLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDDRDRQPI GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLVHVAC DKRPWLEGLN VEMDWQMSGK PLLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAVARVGV PAVVTRATSF LVDFLPILRR TLTRTGFVID HIHYYADALK PWIARRERWP SFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR EIVTSAQKAT RKARRDADRR QHLKTSARPD KPVPPDTDIA DPQADNLPPA KPFDQIEEW
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
tniB delta1 |
TniB delta1 |
In22 |
857 |
1824-2680 |
+ |
Class: | Accessory Gene |
Function: | probable ATP-binding protein. |
Comment: | probably truncated by insertion of IS1326::IS1353 |
Protein Sequence:
|
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLMAAAIVAV ESGEEAINHR TLSMA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
GNAT_fam |
GNAT_fam |
In22 |
501 |
2785-3285 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Target: | aminoglycoside antibiotic (ARO:0000016) |
Sequence Family: | GNAT |
Protein Sequence:
|
MDSEEPPNVR VACSGDIDEV VRLMHDAAAW MSAKGTPAWD VARIDRTFAE TFVLRSELLV ASCSDGIVGC CTLSAEDPEF WPDALKGEAA YLHKLAVRRT HAGRGVSSAL IEACRHAART QGCAKLRLDC HPNLRGLYER LGFTHVDTFN PGWDPTFIAE RLELEI
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
sul1 (ARO:3000410) |
Sul1 |
In22 |
840 |
3413-4252 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic target replacement (ARO:0001002) |
Transpoase Chemistry: | dihydropteroate synthase |
Target: | sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401) |
Sequence Family: | sulfonamide resistant sul (ARO:3004238) |
Comment: | perfect match to reference sequence for ARO:3000410 |
Protein Sequence:
|
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
qacEdelta1 (ARO:3005010) |
QacEdelta1 |
In22 |
348 |
4246-4593 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic efflux (ARO:0010000) |
Target: | disinfecting agents and antiseptics (ARO:3005386) |
Sequence Family: | major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002) |
Comment: | subunit of the qac multidrug efflux pump||perfect match to reference sequence for ARO:3005010 (bitscore:219) |
Protein Sequence:
|
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIIAAFLL ARSPSWKSLR RPTPW
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
dfrA7 (ARO:3002862) |
DfrA7 |
In22 |
474 |
4823-5296 |
- |
Class: | Passenger Gene |
Sub Class: | Antibiotic Resistance |
Function: | antibiotic target replacement (ARO:0001002) |
Target: | diaminopyrimidine antibiotic (ARO:3000171) |
Sequence Family: | trimethoprim resistant dihydrofolate reductase dfr (ARO:3001218) |
Comment: | 100% identity to reference sequence ARO:3002862 in Acinetobacter baumannii (bitscore: 319) |
Protein Sequence:
|
MKISLISATS ENGVIGNGPD IPWSAKGEQL LFKALTYNQW LLVGRKTFDS MGVLPNRKYA VVSRKGISSS NENVLVFPSI EIALQELSKI TDHLYVSGGG QIYNSLIEKA DIIHLSTVHV EVEGDINFPK IPENFNLVFE QFFLSNINYT YQIWKKG
|
Gene Name |
Protein Name |
Associated TE |
Gene Length |
Coordinates |
Strand |
intI1 |
IntI1 |
In22 |
1014 |
5454-6467 |
+ |
Class: | Integron Integrase |
Sub Class: | Class 1 |
Transpoase Chemistry: | Tyrosine |
Sequence Family: | Class 1 Integron Tyrosine Integrase |
Protein Sequence:
|
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER
|
|
Internal Repeat Elements |
|
|
Name |
Associated Mobile Element |
Coordinates |
Sequence (Top Strand) |
repeat t1 |
In22 |
9-27 |
TCAGAAGACG ACTGCACCA |
repeat t2 |
In22 |
49-67 |
AACACGTCGG TCGAGGACT |
repeat t3 |
In22 |
78-97 |
TCAGAAGTGA TCTGCACCAA |
repeat t4 |
In22 |
110-128 |
TCAATACTCG TGTGCACCA |
IRL |
IS1326::IS1353 |
2679-2680 |
TG |
repeat i4 |
In22 |
6550-6568 |
AGGAGGGACG CAGGCGACT |
repeat i3 |
In22 |
6578-6596 |
CGTCGGGCAG CAACGGACT |
repeat i2 |
In22 |
6620-6638 |
ATCACGTCAG CCGAAGACT |
IRi |
In22 |
6637-6669 |
CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT |
|
|