Transposon
Name: IS1326::IS1353
Family: IS21
Evidence of Transposition: no
 Host     

Host Organism:Shigella flexneri Molecular Source:plasmid NR1 (R100)
Place of Origin:Japan Date of Isolation:1950s

 Map     



 Terminal Inverted Repeats (IR)     

IRL (Length: 26 bp)TGTTGAGTTGCATCTAAAATTGACCC
IRR (Length: 26 bp)TGTTGATTTGCACCCAAATTTGACCC

 Sequence     
DNA SequenceLength  4086 
--------10 --------20 --------30 --------40 --------50 --------60 --------70 --------80 --------90 -------100
TGTTGAGTTG CATCTAAAAT TGACCCACTG GGGGTGCGGA CGATTTCTTG GACGGTTTAT ACGGACATCA ATCCGACCGC ATGACGATAC TCGATGGGAC 100
TACGCCCGCC AAGCGACACT TTGATGCGGC GCTCGTTGTA CCAGTGGATA TAGGCATCGA TTCGCGTCAT GAGGTCTTTC AGCGTCACGT GCTGCCAATT 200
CCTCGGGTAG ATTAGTTCGG TCTTCAATCG TCCGAAAAAG CCCTCGCATG CAGCATTGTC TGGCGAGCAG CCCTTTTTGG ACATCGACCG CGTTAATTGG 300
GCATTTTCAG TGCGGCGGAT CCACGCAGGC CAGCGATAAT GCGAGCCCCT GTCCGAATGG ATAACCGGAT GCTCACCGGG TCGCAGTGTC CGTACCGCGT 400
GATCCAGCAT GGTATTGACC AGGTTCGCAT CCGGGCTGGT GCCGATATTC CAGGCCACCA CCAGCCCATC GAAGCAATCG ACGATCGGCG AGACGTAGAC 500
CTTCCCTGCC GGAATGTGTA TTTCCGTCAG ATCGGTCAAC CATTTCGTAT TCGGCGCCGA CGCGTGAAAG TCGCGATTCA GCAGATTCGG GACCGCTGGT 600
GTCGGGTCGC CAGCATACGC CGAGAAGCGC CGGCGGCGCG GTGTTCTCAC GACCAGACGC TCTTGCGCCA TCAAGCGACG CACGACCTTC TCGGACACAC 700
GCATGCCACC AAGGCGCAAG GCACTATCAA TGCGTCGATA GCCATAGCAG CGGTAGTTGT CCTCGAAGAT AGTCCGAATG ACCTCACGCA CCTGCGTGTA 800
CTTGTCGGGC CGCGTCTGCC GCAGGCGTTG ATAGAAGTAT GTGCTGCGCG CCAGCTTCAG GCCGCACAAC AGATTGGCTA ATGGAAACGT GACTCTGAGG 900
GCATCAACCA CCTTCGTTTT TTCTCGGCTT GTCAGTTCGA GGGGGTTGAT GCCCATGTCT TTTTTTATCA ATTCACTCGC CTTCTCCAGA ATTGCATTCT 1000
CCATGCGAAG CCGCTGGTTC TGGCTCTCCA GTTCGGCCAG TTCCCTGAGT AGTGCCTCAT GCCGCTGCTC GAGCGAGGTG TCACCTTTCT TCTTTGTCAT 1100
GGGTTTTAGG GGCACTTTGC CAAGTAATCG ATGCTGCCAG TTATACAACG TTGGTCGCGA TACACCGACA GTGTCGGCCA CATCCTTTGC CGAACCTACG 1200
CGCAGGTTCA GTGCAATGAC GGCTTGCTGC TTCTCGAGGC GAGAGCGGGC GACTGTGGGA GCGCTGCTGC CGACGACCGT CCTAGCGAAT TCAGGGCGTA 1300
AATCACGGAT CCAGGCACGC AAGGCCTCGC GGCTTGGGTA GCCCAGGCTT CGGATTGTGT GACTCAGGCA GTAGCCTTGT TCGATATAGT GATCTACTGC 1400
CCGTTGCTTT TGCTCATCGG TGTACTGCCG TTTTATCCGT TGATAGCCTC GGCGAAGATC CTGATTCCGT TCGAATTCTG CCAACCAGGC CTTCAGCGAG 1500
TTCTTGGTGG GGTATCCCAG CTGCCGTAGT GTGGCGCTCA TCCGGCGCCC AAGCTTCAGG TACAACCTCA CGGCTCGAAG GCGATCTTCA TACGAATACA 1600
TGAACTACTC CTAAAGTAGT CCAAGATTTT GTCCGCACCC CAACTTAGGG TAAAGATTTG CGTCGAAATT TGACCCACGT ATGACACTGT TTCCCGTCTG 1700
GATATGGCGG GAGAAATCAA GGAGTGATAA ACGTGGCGAT ATTGAGCGCA ATTCGACGCT GGCATTTTCG CGATGGTGCG TCGATTCGGG AAATAGCCCG 1800
ACGAAGCGGC CTGTCCAGGA ACACCGTTCG CAAGTATTTG CAAAGCAAGG TGGTTGAACC GCAGTACCCA GCGCGAGACA GCGTTGGCAA GTTAAGTCCT 1900
TTTGAGCCCA AGTTAAGGCA GTGGCTCTCC ACCGAGCACA AAAAGACAAA GAAGCTGCGC AGAAACCTGC GCAGCATGTA CCGGGATTTG GTCGCTTTGG 2000
GCTTTACCGG GTCTTATGAC CGAGTGTGTG CCTTTGCCCG ACAGTGGAAA GATTCCGAAC AGTTCAAGGC GCAAACCTCG GGCAAGGGTT GTTTCATCCC 2100
CTTGCGCTTT GCTTGTGGCG AAGCCTTCCA ATTCGATTGG AGTGAGGACT TTGCCCGCAT AGCGGGCAAA CAGGTCAAAC TTCAGATTGC CCAGTTTAAG 2200
TTGGCCCACA GCCGGGCCTT TGTGCTTCGG GCTTACTACC AGCAAAAACA TGAAATGCTG TTTGATGCCC ACTGGCATGC CTTTCAAATC TTCGGTGGCA 2300
TTCCCAAGCG CGGCATCTAC GACAACATGA AGACCGCTGT GGATTCGGTG GGGCGTGGCA AAGAGCGCAG GGTCAATCAG CGGTTCACTG CCATGGTCAG 2400
CCACTACCTG TTTGATGCGC AGTTCTGTAA TCCAGCATCG GGTTGGGAGA AAGGCCAGAT TGAGAAGAAC GTGCAGGATT CCCGCCAACG CCTGTGGCAA 2500
GGGGCACCAG ACTTTCAAAG CCTTGCTGAT TTGAATGTGT GGCTTGAGCA TCGCTGCAAA GCGCTGTGGT CTGAGCTGCG CCACCCCGAA TTGGACCAAA 2600
CCGTGCAAGA GGCCTTTGCC GATGAACAAG GCGAGTTGAT GGCGCTACCC AATGCCTTTG ATGCATTCGT GGAGCAAACC AAGCGAGTCA CTTCAACCTG 2700
CCTTGTTCAC CACGAGGGCA ATCGCTACAG CGTTCCTGCC AGTTACGCCA ACAGGGCCAT CAGCCTTCGG ATTTATGCAG ACAAGCTGGT GATGGCTGCC 2800
GAAGGCCAAC ACATTGCCGA GCATCCAAGA TTGTTTGGCA GTGGCCACGC TCGGCGTGGC CACACACAAT ACGACTGGCA CCATTACTTG TCTGTGCTTC 2900
AGAAGAAACC TGGGGCGTTG CGCAATGGTG CGCCATTTGC TGAATTGCCA CCCGCGTTCA AGAAGCTTCA ATCCATCTTG CTGCAACGCC CCGGCGGTGA 3000
CCGTGACATG GTGGAAATTC TGGCCCTTGT ATTGCACCAC GATGAAGGTG CGGTACTCAG TGCTGTGGAA TTGGCATTGG AGTGTGGCAA GCCATCGAAG 3100
GAGCATGTGC TTAATCTGTT GGGACGTTTG ACCGAAGAAC CTCCACCCAA ACCGATTCCA ATTCCCAAGG GGTTAAGGCT GACATTGGAA CCACAGGCCA 3200
ACGTGAACCG CTATGACAGT TTAAGGAGAG CCCATGATGC AGCATGAAGG CCATGTGAGA ATCCTCAAAT CCTTGAAACT CTTTGGCATG GCACACGCCA 3300
TTGAGGAGTT GGGCAATCAG AATTCACCAG CATTTAATCA AGCCTTGCCC ATGCTGGACA GCTTGATTAA AGCTGAAGTG GCAGAGCGTG AAGTACGTTC 3400
GGTGAACTAT CAATTGCGGG TGGCCAAGTT CCCCGTGTAT CGGGACTTGG TGGGCTTTGA CTTCAGTCAA AGCCTGGTTA ATGAGGCCAC GGTCAAACAA 3500
TTGCACCGGT GCGACTTCAT GGAACAAGCC CAGAACGTGG TGCTGATTGG TGGGCCAGGC ACAGGCAAGA CTCACCTGGC CACAGCCATT GGTACACAAG 3600
CAGTGATGCA CTTGAACCGA CGGGTGCGTT TCTTCTCCAC CGTGGATTTG GTCAATGCAC TGGAGCAAGA GAAATCATCT GGGCGTCAGG GACAAATCGC 3700
AAACCGTCTG TTGTATGCCG ATTTGGTGAT TCTGGATGAG CTGGGATATT TGCCTTTTAG CCAAACCGGT GGGGCACTGC TGTTTCACCT GCTCTCAAAG 3800
CTGTACGAAA AAACCAGCGT GATACTGACC ACCAACTTGA GCTTCTCGGA ATGGAGCCGA GTGTTTGGCG ATGAAAAGAT GACAACAGCG TTGTTGGACC 3900
GACTAACCCA CCACTGCCAC ATCCTGGAAA CCGGCAATGA AAGTTACCGC TTCAAACACA GTTCAACTCA GAATAAGCAG GAGGAAAAAC AGACCCGCAA 4000
ACTGAAAATC GAGACATAAT TCTGACAACA AGGGGTGGGT CAAAATTCAA TGCAAATCCC GGGTCAAATT TGGGTGCAAA TCAACA

 ORFs     
ORF Summary
Gene Name Associated TE Coordinates Class Sub Class Orientation
tnp IS1353 57-1601 Transposase   -
istA IS1326::IS1353 1724-3247 Transposase   +
istB IS1326::IS1353 3234-4019 Accessory Gene ATPase Transposition Helper +

ORF Details
Gene Name Protein Name Associated TE Gene Length Coordinates Strand
tnp Tnp IS1353 1545 57-1601 -
Class:   Transposase
Function:   transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MYSYEDRLRA VRLYLKLGRR MSATLRQLGY PTKNSLKAWL AEFERNQDLR RGYQRIKRQY TDEQKQRAVD HYIEQGYCLS HTIRSLGYPS REALRAWIRD
LRPEFARTVV GSSAPTVARS RLEKQQAVIA LNLRVGSAKD VADTVGVSRP TLYNWQHRLL GKVPLKPMTK KKGDTSLEQR HEALLRELAE LESQNQRLRM
ENAILEKASE LIKKDMGINP LELTSREKTK VVDALRVTFP LANLLCGLKL ARSTYFYQRL RQTRPDKYTQ VREVIRTIFE DNYRCYGYRR IDSALRLGGM
RVSEKVVRRL MAQERLVVRT PRRRRFSAYA GDPTPAVPNL LNRDFHASAP NTKWLTDLTE IHIPAGKVYV SPIVDCFDGL VVAWNIGTSP DANLVNTMLD
HAVRTLRPGE HPVIHSDRGS HYRWPAWIRR TENAQLTRSM SKKGCSPDNA ACEGFFGRLK TELIYPRNWQ HVTLKDLMTR IDAYIHWYNE RRIKVSLGGR
SPIEYRHAVG LMSV

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istA IstA IS1326::IS1353 1524 1724-3247 +
Class:   Transposase
Transpoase Chemistry:   DDE
Protein Sequence:  
MINVAILSAI RRWHFRDGAS IREIARRSGL SRNTVRKYLQ SKVVEPQYPA RDSVGKLSPF EPKLRQWLST EHKKTKKLRR NLRSMYRDLV ALGFTGSYDR
VCAFARQWKD SEQFKAQTSG KGCFIPLRFA CGEAFQFDWS EDFARIAGKQ VKLQIAQFKL AHSRAFVLRA YYQQKHEMLF DAHWHAFQIF GGIPKRGIYD
NMKTAVDSVG RGKERRVNQR FTAMVSHYLF DAQFCNPASG WEKGQIEKNV QDSRQRLWQG APDFQSLADL NVWLEHRCKA LWSELRHPEL DQTVQEAFAD
EQGELMALPN AFDAFVEQTK RVTSTCLVHH EGNRYSVPAS YANRAISLRI YADKLVMAAE GQHIAEHPRL FGSGHARRGH TQYDWHHYLS VLQKKPGALR
NGAPFAELPP AFKKLQSILL QRPGGDRDMV EILALVLHHD EGAVLSAVEL ALECGKPSKE HVLNLLGRLT EEPPPKPIPI PKGLRLTLEP QANVNRYDSL
RRAHDAA

Gene Name Protein Name Associated TE Gene Length Coordinates Strand
istB IstB IS1326::IS1353 786 3234-4019 +
Class:   Accessory Gene
Sub Class:   ATPase Transposition Helper
Function:   stimulates transposition
Protein Sequence:  
MMQHEGHVRI LKSLKLFGMA HAIEELGNQN SPAFNQALPM LDSLIKAEVA EREVRSVNYQ LRVAKFPVYR DLVGFDFSQS LVNEATVKQL HRCDFMEQAQ
NVVLIGGPGT GKTHLATAIG TQAVMHLNRR VRFFSTVDLV NALEQEKSSG RQGQIANRLL YADLVILDEL GYLPFSQTGG ALLFHLLSKL YEKTSVILTT
NLSFSEWSRV FGDEKMTTAL LDRLTHHCHI LETGNESYRF KHSSTQNKQE EKQTRKLKIE T

 Internal Transposable Elements (TE)     

TnCentral Accession TE Name Type Coordinates Length
IS1353-AF071413 IS1353 Insertion Sequence 29-1642 1614

 Internal Repeat Elements     

Name Associated Mobile Element Coordinates Sequence (Top Strand)
IRR IS1353 29-41 TGGGGGTGCG GAC
IRL IS1353 1631-1642 CAGGCGTGGG GT