Transposon
Name: In22
Family: Integron        Group: Class 1
Evidence of Transposition: no
 Host     

Host Organism: Escherichia coli SCU-164    Molecular Source: chromosome
Place of Origin: USA    Date of Isolation: 2020


 Terminal Inverted Repeats (IR)     


 Sequence     
DNA Sequence    Length: 6669
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT 100
CGACTATGCT CAATACTCGT GTGCACCAAA GCGAGGTGAG CATGGCGACG GACACCCCAC GGATTCCAGA ACAAGGCGTG GCCACTCTGC CTGATGAGGC 200
TTGGGAGCGT GCGCGCCGTC GTGCGGAGAT CATCAGTCCG TTGGCGCAGT CGGAGACGGT CGGGCACGAA GCGGCCGATA TGGCGGCTCA GGCGCTGGGC 300
TTGTCTCGGC GCCAGGTATA CGTTCTGATC CGGCGTGCCC GGCAAGGCAG CGGCCTCGTG ACGGATCTGG TGCCCGGCCA GTCCGGTGGA GGTAAAGGTA 400
AGGGGCGCTT GCCGGAACCG GTCGAGCGCG TCATCCACGA GCTACTGCAA AAGCGGTTCC TGACCAAGCA GAAGCGCAGC CTAGCGGCCT TTCACCGCGA 500
AGTCACTCAG GTGTGCAAGG CTCAAAAACT GCGAGTGCCG GCGCGCAATA CCGTGGCCTT ACGGATCGCT AGCCTTGACC CGCGCAAGGT CATCCGCCGG 600
CGGGAAGGCC AGGATGCCGC TCGTGACCTA CAAGGTGTGG GCGGCGAGCC TCCTGCCGTG ACCGCGCCGC TGGAGCAGGT GCAGATAGAC CATACGGTCA 700
TCGACCTGAT CGTGGTCGAT GACCGCGACC GGCAACCTAT TGGCCGCCCG TACCTGACCC TCGCCATCGA CGTGTTCACC CGCTGCGTGC TCGGCATGGT 800
CGTCACGCTG GAAGCGCCGT CTGCCGTTTC GGTTGGCCTG TGCCTCGTGC ATGTCGCCTG CGACAAGCGC CCTTGGCTGG AAGGACTGAA CGTGGAAATG 900
GATTGGCAGA TGAGCGGCAA GCCCTTGCTG CTCTACCTAG ACAACGCGGC CGAGTTCAAG AGCGAGGCCC TGCGCCGGGG TTGCGAGCAG CATGGCATCC 1000
GGCTGGACTA TCGCCCGCTG GGACAGCCGC ACTATGGCGG CATCGTGGAA CGGATCATCG GCACGGCGAT GCAGATGATT CACGACGAAC TGCCGGGAAC 1100
GACCTTCTCC AACCCTGACC AGCGCGGCGA CTACGATTCC GAAAACAAGG CCGCCCTGAC GCTGCGCGAG CTAGAGCGCT GGCTCACATT GGCGGTCGGC 1200
ACCTACCACG GTTCGGTGCA CAACGGCCTG CTCCAACCGC CGGCCGCGCG CTGGGCCGAG GCCGTGGCGC GTGTCGGCGT ACCGGCCGTC GTCACACGCG 1300
CTACTTCGTT CCTGGTCGAT TTTCTGCCGA TCCTCCGGCG CACGCTGACC CGCACCGGCT TTGTCATCGA CCACATCCAC TACTACGCCG ATGCGCTCAA 1400
GCCGTGGATT GCGCGGCGTG AACGCTGGCC GTCCTTTCTG ATCCGGCGCG ATCCGCGCGA CATCAGCCGT ATCTGGGTCC TGGAACCGGA GGGACAGCAT 1500
TACCTGGAAA TTCCCTACCG TACCTTGTCG CATCCGGCTG TCACCCTCTG GGAACAACGG CAGGCGCTGG CGAAACTGCG GCAGCAAGGG CGCGAACAGG 1600
TGGATGAGTC GGCGCTGTTC CGCATGATCG GCCAGATGCG TGAGATTGTG ACCAGCGCGC AGAAGGCCAC ACGCAAGGCG CGGCGTGACG CGGATCGCCG 1700
CCAGCACCTC AAGACATCAG CTCGGCCGGA CAAGCCCGTT CCGCCGGATA CGGATATTGC CGACCCGCAG GCAGACAACT TGCCACCCGC CAAACCGTTC 1800
GACCAGATTG AGGAGTGGTA GCCGTGGACG AATATCCCAT CATCGACCTG TCCCACCTGC TGCCGGCGGC CCAGGGCTTG GCCCGTCTTC CGGCGGACGA 1900
GCGCATCCAG CGCCTTCGCG CCGACCGCTG GATCGGCTAT CCGCGCGCAG TCGAGGCGCT GAACCGGCTG GAAGCCCTTT ATGCGTGGCC AAACAAGCAA 2000
CGCATGCCCA ACCTGCTGCT GGTTGGCCCG ACCAACAATG GCAAGTCGAT GATCGTCGAG AAGTTCCGCC GCACCCACCC GGCCAGCTCC GACGCCGACC 2100
AGGAGCACAT CCCGGTGTTG GTCGTGCAGA TGCCGTCCGA GCCGTCCGTG ATCCGCTTCT ACGTCGCGCT GCTCGCCGCG ATGGGCGCGC CGCTGCGCCC 2200
ACGCCCACGG TTGCCGGAAA TGGAGCAACT GGCTCTGGCA CTGCTGCGCA AGGTCGGCGT GCGCATGCTG GTGATCGACG AGCTGCACAA CGTGCTGGCC 2300
GGCAACAGCG TCAACCGCCG GGAATTCCTC AACCTGCTGC GCTTCCTCGG CAACGAACTG CGCATCCCGT TGGTTGGGGT AGGCACGCGC GACGCCTACC 2400
TAGCCATCCG CTCCGATGAC CAGTTGGAAA ATCGCTTCGA GCCGATGATG CTGCCGGTAT GGGAGGCCAA CGACGATTGC TGCTCACTGC TGGCCAGCTT 2500
CGCCGCTTCG CTCCCGCTGC GCCGGCCTTC CCCAATTGCC ACGCTGGACA TGGCTCGCTA CCTGCTCACA CGCAGCGAGG GCACCATAGG GGAACTGGCG 2600
CACTTGCTGA TGGCGGCGGC CATCGTCGCC GTGGAGAGCG GCGAGGAAGC GATCAACCAT CGCACACTCA GCATGGCCTG TCGACAACCT CTCGCGCAAC 2700
CAAGACATCG CGGTCGGACT GCAAGTGATC TTGAAGCCAC GGGCCCGTCC CACCCCGACA TGGACCTCGA TGCCCGAACG GACGTTAGAT TTCGAGTTCT 2800
AGGCGTTCTG CGATGAAGGT TGGATCCCAG CCGGGATTGA AAGTGTCGAC GTGGGTGAAT CCGAGCCGCT CGTATAGGCC ACGCAGGTTC GGGTGGCAGT 2900
CGAGCCGCAG CTTGGCGCAC CCCTGCGTTC GCGCGGCATG GCGGCAAGCC TCGATCAGCG CGGAGCTGAC ACCCCGGCCC GCATGTGTCC GTCGCACCGC 3000
GAGCTTGTGC AGATATGCGG CCTCCCCCTT GAGGGCGTCG GGCCAGAACT CGGGATCCTC GGCCGACAAG GTGCAACAGC CGACGATGCC GTCGCTGCAA 3100
CTCGCGACTA GGAGCTCGGA TCTCAGGACG AAGGTCTCCG CGAATGTCCG GTCGATCCGC GCGACGTCCC AGGCGGGCGT TCCCTTGGCG GACATCCACG 3200
CCGCAGCGTC GTGCATCAGC CGCACAACCT CGTCGATATC ACCCGAGCAG GCGACCCGAA CGTTCGGAGG CTCCTCGCTG TCCATTCGCT CCCCTGGCGC 3300
GGTATGAACC GCCGCCTCAT AGTGCAGTTT GATCCTGACG AGCCCAGCAT GTCTGCGCCC ACCTTCGCGG AACCTGACCA GGGTCCGCTA GCGGGCGGCC 3400
GGAAGGTGAA TGCTAGGCAT GATCTAACCC TCGGTCTCTG GCGTCGCGAC TGCGAAATTT CGCGAGGGTT TCCGAGAAGG TGATTGCGCT TCGCAGATCT 3500
CCAGGCGCGT GGGTGCGGAC GTAGTCAGCG CCATTGCCGA TCGCGTGAAG TTCCGCCGCA AGGCTCGCTG GACCCAGATC CTTTACAGGA AGGCCAACGG 3600
TGGCGCCCAA GAAGGATTTC CGCGACACCG AGACCAATAG CGGAAGCCCC AACGCCGACT TCAGCTTTTG AAGGTTCGAC AGCACGTGCA GCGATGTTTC 3700
CGGTGCGGGG CTCAAGAAAA ATCCCATCCC CGGATCGAGG ATGAGCCGGT CGGCAGCGAC CCCGCTCCGT CGCAAGGCGG AAACCCGCGC CTCGAAGAAC 3800
CGCACAATCT CGTCGAGCGC GTCTTCGGGT CGAAGGTGAC CGGTGCGGGT GGCGATGCCA TCCCGCTGCG CTGAGTGCAT AACCACCAGC CTGCAGTCCG 3900
CCTCAGCAAT ATCGGGATAG AGCGCAGGGT CAGGAAATCC TTGGATATCG TTCAGGTAGC CCACGCCGCG CTTGAGCGCA TAGCGCTGGG TTTCCGGTTG 4000
GAAGCTGTCG ATTGAAACAC GGTGCATCTG ATCGGACAGG GCGTCTAAGA GCGGCGCAAT ACGTCTGATC TCATCGGCCG GCGATACAGG CCTCGCGTCC 4100
GGATGGCTGG CGGCCGGTCC GACATCCACG ACGTCTGATC CGACTCGCAG CATTTCGATC GCCGCGGTGA CAGCGCCGGC GGGGTCTAGC CGCCGGCTCT 4200
CATCGAAGAA GGAGTCCTCG GTGAGATTCA GAATGCCGAA CACCGTCACC ATGGCGTCGG CCTCCGCAGC GACTTCCACG ATGGGGATCG GGCGAGCAAA 4300
AAGGCAGCAA TTATGAGCCC CATACCTACA AAGCCCCACG CATCAAGCTT TTGCCCATGA AGCAACCAGG CAATGGCTGT AATTATGACG ACGCCGAGTC 4400
CCGACCAGAC TGCATAAGCA ACACCGACAG GGATGGATTT CAGAACCAGA GAAAGAAAAT AAAATGCGAT GCCATAACCG ATTATGACAA CGGCGGAAGG 4500
GGCAAGCTTA GTAAAGCCCT CGCTAGATTT TAATGCGGAT GTTGCGATTA CTTCGCCAAC TATTGCGATA ACAAGAAAAA GCCAGCCTTT CATGATATAT 4600
CTCCCAATTT GTGTAGGGCT TATTATGCAC GCTTAAAAAT AATAAAAGCA GACTTGACCT GATAGTTTGG CTGTGAGCAA TTATGTGCTT AGTGCATCTA 4700
ACGCCGAGTT CAGCGGCAGT TTTTAAGTTG TGGTTTTATG GAATACTTTT GCGCAGCAAA ACCATAAAAC CGCGACTTAA AAACTGTCCA AGGAGCGCAG 4800
CGACTGGTGC TGGAACGACT TGTTAGCCTT TTTTCCAAAT CTGATATGTG TAATTTATAT TAGACAAAAA AAACTGCTCA AAAACCAAAT TGAAATTCTC 4900
TGGAATTTTA GGAAAATTGA TATCACCTTC AACCTCAACG TGAACAGTAG ACAAATGAAT TATATCTGCT TTTTCAATAA GACTATTGTA GATTTGACCG 5000
CCACCAGAGA CATATAAATG ATCTGTAATT TTCGATAGTT CTTGCAAAGC GATTTCTATT GAAGGAAAGA CTAATACATT TTCATTTGAG CTTGAAATTC 5100
CTTTCCTCGA CACTACTGCA TATTTTCGAT TTGGAAGAAC ACCCATAGAG TCAAATGTTT TCCTTCCAAC AAGGAGCCAC TGATTATATG TGAGCGCTTT 5200
AAAGAGTAAC TGCTCACCTT TTGCTGACCA TGGGATATCA GGGCCATTAC CGATTACGCC ATTTTCTGAC GTTGCAGAAA TCAATGAAAT TTTCAATTCA 5300
ACCCCCGTAA TGGCTAACTT TGTTTTAGGG CGACTGCCCT GCTGCGTAAC ATCGTTGCTG CTCCATAACA TCAAACATCG ACCCACGGCG TAACGCGCTT 5400
GCTGCTTGGA TGCCCGAGGC ATAGACTGTA CAAAAAAACA GTCATAACAA GCCATGAAAA CCGCCACTGC GCCGTTACCA CCGCTGCGTT CGGTCAAGGT 5500
TCTGGACCAG TTGCGTGAGC GCATACGCTA CTTGCATTAC AGCTTACGAA CCGAACAGGC TTATGTCCAC TGGGTTCGTG CCTTCATCCG TTTCCACGGT 5600
GTGCGTCACC CGGCAACCTT GGGCAGCAGC GAAGTCGAGG CATTTCTGTC CTGGCTGGCG AACGAGCGCA AGGTTTCGGT CTCCACGCAT CGTCAGGCAT 5700
TGGCGGCCTT GCTGTTCTTC TACGGCAAGG TGCTGTGCAC GGATCTGCCC TGGCTTCAGG AGATCGGAAG ACCTCGGCCG TCGCGGCGCT TGCCGGTGGT 5800
GCTGACCCCG GATGAAGTGG TTCGCATCCT CGGTTTTCTG GAAGGCGAGC ATCGTTTGTT CGCCCAGCTT CTGTATGGAA CGGGCATGCG GATCAGTGAG 5900
GGTTTGCAAC TGCGGGTCAA GGATCTGGAT TTCGATCACG GCACGATCAT CGTGCGGGAG GGCAAGGGCT CCAAGGATCG GGCCTTGATG TTACCCGAGA 6000
GCTTGGCACC CAGCCTGCGC GAGCAGCTGT CGCGTGCACG GGCATGGTGG CTGAAGGACC AGGCCGAGGG CCGCAGCGGC GTTGCGCTTC CCGACGCCCT 6100
TGAGCGGAAG TATCCGCGCG CCGGGCATTC CTGGCCGTGG TTCTGGGTTT TTGCGCAGCA CACGCATTCG ACCGATCCAC GGAGCGGTGT CGTGCGTCGC 6200
CATCACATGT ATGACCAGAC CTTTCAGCGC GCCTTCAAAC GTGCCGTAGA ACAAGCAGGC ATCACGAAGC CCGCCACACC GCACACCCTC CGCCACTCGT 6300
TCGCGACGGC CTTGCTCCGC AGCGGTTACG ACATTCGAAC CGTGCAGGAT CTGCTCGGCC ATTCCGACGT CTCTACGACG ATGATTTACA CGCATGTGCT 6400
GAAAGTTGGC GGTGCCGGAG TGCGCTCACC GCTTGATGCG CTGCCGCCCC TCACTAGTGA GAGGTAGGGC AGCGCAAGTC AATCCTGGCG GATTCACTAC 6500
CCCTGCGCGA AGGCCATCGG TGCCGCATCG AACGGCCGGT TGCGGAAAGT CCTCCCTGCG TCCGCTGATG GCCGGCAGCA GCCCGTCGTT GCCTGATGGA 6600
TCCAACCCCT CCGCTGCTAT AGTGCAGTCG GCTTCTGACG TTCAGTGCAG CCGTCTTCTG AAAACGACA

 Recombination Sites     

Name Coordinates Length Sequence
attC cmlA6 3'-end 33-38 6 TTGGGC
attC qacEdelta1_sul1 core 3385-3418 34 CCGCTAGCGG GCGGCCGGAA GGTGAATGCT AGGC
attC dfrA7 core 4720-4783 64 TTTTTAAGTT GTGGTTTTAT GGAATACTTT TGCGCAGCAA AACCATAAAA CCGCGACTTA AAAA
attI 5318-5373 56 CTTTGTTTTA GGGCGACTGC CCTGCTGCGT AACATCGTTG CTGCTCCATA ACATCA
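
The coordinates in this table appear to be 1-based and inclusive positions on the DNA sequence listed above. The following is a minimal Python sketch under that assumption, using only the first 100-base row copied from the Sequence section; the helper name `slice_1based` is hypothetical and not part of the record.

```python
# First 100-base row of the In22 sequence, copied from the Sequence section.
ROW_1 = (
    "TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT "
    "GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT"
)


def slice_1based(grouped_seq, start, end):
    """Return bases start..end (1-based, inclusive) after removing the
    spaces that separate the 10-base groups."""
    seq = grouped_seq.replace(" ", "")
    return seq[start - 1:end]


# attC cmlA6 3'-end is listed at 33-38 with length 6.
site = slice_1based(ROW_1, 33, 38)
print(site, len(site))  # TTGGGC 6
```

The same helper applies to the other sites once the full 6669-base sequence is loaded into one string.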

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
urfM 5'-end Tn21 1-30 Passenger Gene Other +
tniA In22 142-1821 Transposase   +
tniB delta1 In22 1824-2680 Accessory Gene   +
GNAT_fam In22 2785-3285 Passenger Gene Antibiotic Resistance -
sul1 (ARO:3000410) In22 3413-4252 Passenger Gene Antibiotic Resistance -
qacEdelta1 (ARO:3005010) In22 4246-4593 Passenger Gene Antibiotic Resistance -
dfrA7 (ARO:3002862) In22 4823-5296 Passenger Gene Antibiotic Resistance -
intI1 In22 5454-6467 Integron Integrase Class 1 +
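
As a consistency check, the gene lengths given under ORF Details can be recomputed from the coordinates above as end - start + 1. The sketch below assumes inclusive coordinates; the tuple layout and function names are illustrative only.

```python
# (gene, start, end, listed length), copied from the ORF Summary and
# ORF Details tables of this record.
ORFS = [
    ("urfM 5'-end", 1, 30, 30),
    ("tniA", 142, 1821, 1680),
    ("tniB delta1", 1824, 2680, 857),
    ("GNAT_fam", 2785, 3285, 501),
    ("sul1", 3413, 4252, 840),
    ("qacEdelta1", 4246, 4593, 348),
    ("dfrA7", 4823, 5296, 474),
    ("intI1", 5454, 6467, 1014),
]


def span_length(start, end):
    """Length of an inclusive 1-based coordinate range."""
    return end - start + 1


def check_orfs(orfs):
    """Return (name, computed length, matches listed, whole codons) per ORF."""
    rows = []
    for name, start, end, listed in orfs:
        length = span_length(start, end)
        rows.append((name, length, length == listed, length % 3 == 0))
    return rows


for row in check_orfs(ORFS):
    print(row)
```

Every computed length matches the listed one. Only tniB delta1 (857 bp) is not a whole number of codons, which is consistent with the comment that it is probably truncated.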

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
urfM 5'-end N Tn21 30 1-30 +
Class:   Passenger Gene
Sub Class:   Other
Comment:   urfM ORF interrupted by insertion of In2
Protein Sequence:  
VIFRRRLHQ

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniA TniA In22 1680 142-1821 +
Class:   Transposase
Transposase Chemistry:   DDE
Comment:   can be extended upstream by 12 amino acids||identical to tniA (Tn1721 and In2)||25% amino acid sequence identity to TnsB from Tn7
Protein Sequence:  
MATDTPRIPE QGVATLPDEA WERARRRAEI ISPLAQSETV GHEAADMAAQ ALGLSRRQVY VLIRRARQGS GLVTDLVPGQ SGGGKGKGRL PEPVERVIHE
LLQKRFLTKQ KRSLAAFHRE VTQVCKAQKL RVPARNTVAL RIASLDPRKV IRRREGQDAA RDLQGVGGEP PAVTAPLEQV QIDHTVIDLI VVDDRDRQPI
GRPYLTLAID VFTRCVLGMV VTLEAPSAVS VGLCLVHVAC DKRPWLEGLN VEMDWQMSGK PLLLYLDNAA EFKSEALRRG CEQHGIRLDY RPLGQPHYGG
IVERIIGTAM QMIHDELPGT TFSNPDQRGD YDSENKAALT LRELERWLTL AVGTYHGSVH NGLLQPPAAR WAEAVARVGV PAVVTRATSF LVDFLPILRR
TLTRTGFVID HIHYYADALK PWIARRERWP SFLIRRDPRD ISRIWVLEPE GQHYLEIPYR TLSHPAVTLW EQRQALAKLR QQGREQVDES ALFRMIGQMR
EIVTSAQKAT RKARRDADRR QHLKTSARPD KPVPPDTDIA DPQADNLPPA KPFDQIEEW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tniB delta1 TniB delta1 In22 857 1824-2680 +
Class:   Accessory Gene
Function:   probable ATP-binding protein.
Comment:   probably truncated by insertion of IS1326::IS1353
Protein Sequence:  
MDEYPIIDLS HLLPAAQGLA RLPADERIQR LRADRWIGYP RAVEALNRLE ALYAWPNKQR MPNLLLVGPT NNGKSMIVEK FRRTHPASSD ADQEHIPVLV
VQMPSEPSVI RFYVALLAAM GAPLRPRPRL PEMEQLALAL LRKVGVRMLV IDELHNVLAG NSVNRREFLN LLRFLGNELR IPLVGVGTRD AYLAIRSDDQ
LENRFEPMML PVWEANDDCC SLLASFAASL PLRRPSPIAT LDMARYLLTR SEGTIGELAH LLMAAAIVAV ESGEEAINHR TLSMA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
GNAT_fam GNAT_fam In22 501 2785-3285 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Target:   aminoglycoside antibiotic (ARO:0000016)
Sequence Family:  GNAT
Protein Sequence:  
MDSEEPPNVR VACSGDIDEV VRLMHDAAAW MSAKGTPAWD VARIDRTFAE TFVLRSELLV ASCSDGIVGC CTLSAEDPEF WPDALKGEAA YLHKLAVRRT
HAGRGVSSAL IEACRHAART QGCAKLRLDC HPNLRGLYER LGFTHVDTFN PGWDPTFIAE RLELEI

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
sul1 (ARO:3000410) Sul1 In22 840 3413-4252 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Transposase Chemistry:   dihydropteroate synthase
Target:   sulfonamide antibiotic (ARO:3000282)||sulfone antibiotic (ARO:3003401)
Sequence Family:  sulfonamide resistant sul (ARO:3004238)
Comment:   perfect match to reference sequence for ARO:3000410
Protein Sequence:  
MVTVFGILNL TEDSFFDESR RLDPAGAVTA AIEMLRVGSD VVDVGPAASH PDARPVSPAD EIRRIAPLLD ALSDQMHRVS IDSFQPETQR YALKRGVGYL
NDIQGFPDPA LYPDIAEADC RLVVMHSAQR DGIATRTGHL RPEDALDEIV RFFEARVSAL RRSGVAADRL ILDPGMGFFL SPAPETSLHV LSNLQKLKSA
LGLPLLVSVS RKSFLGATVG LPVKDLGPAS LAAELHAIGN GADYVRTHAP GDLRSAITFS ETLAKFRSRD ARDRGLDHA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
qacEdelta1 (ARO:3005010) QacEdelta1 In22 348 4246-4593 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic efflux (ARO:0010000)
Target:   acridine dye (ARO:3000054)||quaternary ammonium salts
Sequence Family:  major facilitator superfamily (MFS) antibiotic efflux pump (ARO:0010002)
Comment:   subunit of the qac multidrug efflux pump||perfect match to reference sequence for ARO:3005010 (bitscore:219)
Protein Sequence:  
MKGWLFLVIA IVGEVIATSA LKSSEGFTKL APSAVVIIGY GIAFYFLSLV LKSIPVGVAY AVWSGLGVVI ITAIAWLLHG QKLDAWGFVG MGLIIAAFLL
ARSPSWKSLR RPTPW

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
dfrA7 (ARO:3002862) DfrA7 In22 474 4823-5296 -
Class:   Passenger Gene
Sub Class:   Antibiotic Resistance
Function:   antibiotic target replacement (ARO:0001002)
Target:   diaminopyrimidine antibiotic (ARO:3000171)
Sequence Family:  trimethoprim resistant dihydrofolate reductase dfr (ARO:3001218)
Comment:   100% identity to reference sequence ARO:3002862 in Acinetobacter baumannii (bitscore: 319)
Protein Sequence:  
MKISLISATS ENGVIGNGPD IPWSAKGEQL LFKALTYNQW LLVGRKTFDS MGVLPNRKYA VVSRKGISSS NENVLVFPSI EIALQELSKI TDHLYVSGGG
QIYNSLIEKA DIIHLSTVHV EVEGDINFPK IPENFNLVFE QFFLSNINYT YQIWKKG

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
intI1 IntI1 In22 1014 5454-6467 +
Class:   Integron Integrase
Sub Class:   Class 1
Transposase Chemistry:   Tyrosine
Sequence Family:  Class 1 Integron Tyrosine Integrase
Protein Sequence:  
MKTATAPLPP LRSVKVLDQL RERIRYLHYS LRTEQAYVHW VRAFIRFHGV RHPATLGSSE VEAFLSWLAN ERKVSVSTHR QALAALLFFY GKVLCTDLPW
LQEIGRPRPS RRLPVVLTPD EVVRILGFLE GEHRLFAQLL YGTGMRISEG LQLRVKDLDF DHGTIIVREG KGSKDRALML PESLAPSLRE QLSRARAWWL
KDQAEGRSGV ALPDALERKY PRAGHSWPWF WVFAQHTHST DPRSGVVRRH HMYDQTFQRA FKRAVEQAGI TKPATPHTLR HSFATALLRS GYDIRTVQDL
LGHSDVSTTM IYTHVLKVGG AGVRSPLDAL PPLTSER
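
For ORFs on the "-" strand, the protein sequence corresponds to the reverse complement of the listed coordinate range, read from the higher coordinate downward. The sketch below illustrates this with two 15-base fragments copied from the Sequence section (top-strand positions 3271-3285 for GNAT_fam and 5454-5468 for intI1). It assumes the standard codon table, whose amino acid assignments are the same as in the bacterial table; the function names are illustrative.

```python
# Standard genetic code in NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Complement each base, then reverse the string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate whole codons from the first base; trailing bases are ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Top-strand bases 3271-3285: the high-coordinate end of GNAT_fam ("-" strand).
GNAT_TOP_3271_3285 = "CTCCTCGCTGTCCAT"
# Top-strand bases 5454-5468: the start of intI1 ("+" strand).
INTI1_TOP_5454_5468 = "ATGAAAACCGCCACT"

print(translate(reverse_complement(GNAT_TOP_3271_3285)))  # MDSEE
print(translate(INTI1_TOP_5454_5468))                     # MKTAT
```

Both results agree with the first five residues of the corresponding protein sequences listed above.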

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
repeat t1 In22 9-27 TCAGAAGACG ACTGCACCA
repeat t2 In22 49-67 AACACGTCGG TCGAGGACT
repeat t3 In22 78-97 TCAGAAGTGA TCTGCACCAA
repeat t4 In22 110-128 TCAATACTCG TGTGCACCA
IRL IS1326::IS1353 2679-2680 TG
repeat i4 In22 6550-6568 AGGAGGGACG CAGGCGACT
repeat i3 In22 6578-6596 CGTCGGGCAG CAACGGACT
repeat i2 In22 6620-6638 ATCACGTCAG CCGAAGACT
IRi In22 6637-6669 CTGCAAGTCA CGTCGGCAGA AGACTTTTGC TGT
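
Comparing the listed repeat sequences with the DNA sequence suggests that some entries (for example repeat t1 and repeat t3) are written as the top strand, while others (for example repeat t2) are written as the base-by-base complement of the top strand at the same coordinates. The sketch below checks this for the three repeats that fall within the first 100 bases; it assumes 1-based inclusive coordinates, and the function names are illustrative.

```python
# First 100-base row of the In22 sequence, copied from the Sequence section.
ROW_1 = (
    "TGTCATTTTC AGAAGACGAC TGCACCAGTT GATTGGGCGT AATGGCTGTT "
    "GTGCAGCCAG CTCCTGACAG TTCAATATCA GAAGTGATCT GCACCAATCT"
).replace(" ", "")

# name -> (start, end, listed sequence with spaces removed)
REPEATS = {
    "repeat t1": (9, 27, "TCAGAAGACGACTGCACCA"),
    "repeat t2": (49, 67, "AACACGTCGGTCGAGGACT"),
    "repeat t3": (78, 97, "TCAGAAGTGATCTGCACCAA"),
}


def complement(seq):
    """Base-by-base complement without reversing."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))


def classify(top, start, end, listed):
    """Report how the listed sequence relates to the top strand."""
    region = top[start - 1:end]
    if listed == region:
        return "top strand"
    if listed == complement(region):
        return "complement"
    return "no match"


for name, (start, end, listed) in REPEATS.items():
    print(name, classify(ROW_1, start, end, listed))
```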