ISUnCu3
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_008055 | ND | Uncultured bacterium | Uncultured bacterium plasmid QKH54 |
DNA section
IS Length : 2596 bp
Ends
IR Length : 40/48
IRL : TGCGGATTCCACGCTGACTCGGACACCCATTCCACGCACATCCGGACAGT
IRR : TGCGGATTCCACGCCATTCGGACACTCAGCCCACGCTGATCCGGACACCT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GTTGAACGGC | TTGATACC | CAAGTGGTCA | 8 |
DNA sequence
TGCGGATTCCACGCTGACTCGGACACCCATTCCACGCACATCCGGACAGTGATTCCACGCTGATCCGGACACTCATTCCACGAGCATCCGGACACCGATT
TCACGGCCATCCGGACACTTTTCAGGCAGGCAGCTACGCAGGATTTCTTCACTACCATCGACCTCTTTTTCGAAGCAGAGAGGTCGTCGTGGAGCGTTTA
TCCATGCGTAAAATCCGAGAGGTGTTGCGCCTCAAGTTCGACTGCGGCCTGTCCGTGCGCAAGATCGCCCGCAGCCTGGGTACTGGCCACAGCAGTGCCG
GTGATTACCTCTGTCGTTTTGCCGCCAGCGGTCTCAGTTGGCCCTGTTCGTTGTCCGATGCCGAGTTGGAGCAGCAACTGTTCCCGCCGGCCCCGGCGGT
TGCCAGTGAGAAGCGGCCTTTACCCGATTGGGCATGGGTGCATGCCGAACTGCGCCGCCCCGGCGTAACCCTGGCGCTGCTCTGGCAGGAGTACCGCCTG
AGCCAGCCGCAGGGCTTTCAGTACAGCTGGTTCTGTGAGCACTACCGAGCCTGGCAGGGCAAGCTGGACGTGGTGATGCGTCAGGAGCACCGCGTCGGCG
AGAAACTGTTCGTCGACTACGCCGGGCAGACGGTGCCGGTGATCGACCGCCACAGCGGCGAAATCCGCCAGGCGCAGGTGTTCGTCGCGGTGCTTGGCGC
GTCCAGCTACACCTTCGCCGAGGCCACCTGGTCGCAGCAGCTGCCGGACTGGCTGGGCTCGCACGCCCGTTGCTTTGCCTTCCTCGGTGGGGTGCCGGAG
ATCGTGGTGCCGGACAACTTGCGCAGCGCGGTGAGCAAGGCCCATCGCTACGAGCCGGACATCAACCCCAGCTACCTCGACCTGGCCGAGCACTATGGCG
TGGCGGTAGTGCCGGCGCGGGCGCGCAAGCCGCGCGATAAGGCCAAGGCCGAGGTCGGCGTGCAGGTGGTCGAGCGCTGGATCCTCGCCGCGCTGAGGAA
CCGCCAGTTCTTCTCCCTGGACGAACTCAACCGGGCCATTGCCATGCTGCTGGAGCGGCTCAACCAACGCCCGTTCAAGAAGCTGCCGGGCTCACGCCAG
ACGGCCTTCGACAGCCTGGATCGTCCGGCCCTGCGCTCGCTGCCGGAACAGCCCTACGTCTATGCCGAGTGGAAGAAGGCGCGGGTGCACATCGACTACC
ACGTCGAGGTCGACGGGCATTACTACTCGGTGCCCTACCAACTGGTGAAGAAGCAGCTGGAGGTACGCCTGACGGCGCGCACTGTGGAGTGCTTCCACGC
CAACCAACGGGTGGCCAGCCACATACGCTCCCTGCACAAGGGCCGGCACAGCACGCAGACCGAGCACATGCCCAAGAGCCACCGCGAGCATGCCGAGTGG
ACGCCGCAACGGCTGATCCACTGGGCCGAGAAGACCGGGCCGAACACCGCCGGCGTGATCGGCCACATCCTCGAACGACGCGTCCATCCGCAGCATGGCT
ACCGCGCCTGCCTGGGCATCCTGCGTCTGGGCAAACAGCATGGCGAAGAGCGGCTGGAAGCCGCCTGCCAACGCGCCCTGAGCCTCGGGGCCTGTAGCTA
CAAGAGCCTCGAATCGATCCTGCGCCAGGGCCTGGAGAACCTGCCCTTGGCGCAACAGAGCCTGCCCCTGCTACCGGACGACCACGCCAACCTGCGTGGC
CCCGGCTACTACCACTGACCCCAAGGAATCCCACCATGCTGCCCCACCCGACCCTGGACAAGCTCCAGTCCCTGCGCCTGCACGGCATGCTCAAGGCTCT
CGCTGAGCAACTGAAAACCCCGGACATCGACAGCCTGAGCTTCGAGGAACGCCTCGGTCTGCTGGTCGACCGTGAGTTGACTGAGCGCGACGACAAGCGC
CTGAGCAGCCGCCTGCGCCAGGCCAGGCTGCGCCACAACGCCTGCCTCGAAGACATCGACTACCGCAGCCCGCGCGGCCTGGACAAGGCCCTGATCCTCC
AGCTACGCAGCGGCCAGTGGCTGCGCGACGGCCTCAACCTGATCATCGGCGGCCCCACCGGCGTAGGCAAAACCTGGCTGGCCTGCGCCCTGGCCCACCA
GGCATGCCGGGACGGCTACAGCGTGCGTTACCTACGCTTGCCGCGCCTGCTGGAAGAGTTGGGCCTTGCCCACGGTGACGGGCGCTTCGCCAAGCTGATG
AGCAGCTACGCCAAGACCGATCTGCTGATCCTCGATGACTGGGGGCTGGCCCCGTTCACTGCCGAACAACGGCGCGACATGCTGGAGCTGCTGGACGACC
GTTACGGCCAGCGCTCGACCCTGGTGACCAGTCAGATGCCGGTGGACAACTGGCACGAACTGATCGGCGACCCGACCCTGGCCGACGCCATCCTCGACCG
CCTGGTGCACAACGCTTACCGGATCAACCTCAAGGGCGAATCGATGCGCAAACGGGCGAAGAAATTGACGACGCCGGGCACCTCAGACTAACAATGCCAG
CCCTGCGTCGCTGCGCTCCGACTGCCTGTCCGAATGATCGTGGAACAGGTGTCCGGATCAGCGTGGGCTGAGTGTCCGAATGGCGTGGAATCCGCA
TCACGGCCATCCGGACACTTTTCAGGCAGGCAGCTACGCAGGATTTCTTCACTACCATCGACCTCTTTTTCGAAGCAGAGAGGTCGTCGTGGAGCGTTTA
TCCATGCGTAAAATCCGAGAGGTGTTGCGCCTCAAGTTCGACTGCGGCCTGTCCGTGCGCAAGATCGCCCGCAGCCTGGGTACTGGCCACAGCAGTGCCG
GTGATTACCTCTGTCGTTTTGCCGCCAGCGGTCTCAGTTGGCCCTGTTCGTTGTCCGATGCCGAGTTGGAGCAGCAACTGTTCCCGCCGGCCCCGGCGGT
TGCCAGTGAGAAGCGGCCTTTACCCGATTGGGCATGGGTGCATGCCGAACTGCGCCGCCCCGGCGTAACCCTGGCGCTGCTCTGGCAGGAGTACCGCCTG
AGCCAGCCGCAGGGCTTTCAGTACAGCTGGTTCTGTGAGCACTACCGAGCCTGGCAGGGCAAGCTGGACGTGGTGATGCGTCAGGAGCACCGCGTCGGCG
AGAAACTGTTCGTCGACTACGCCGGGCAGACGGTGCCGGTGATCGACCGCCACAGCGGCGAAATCCGCCAGGCGCAGGTGTTCGTCGCGGTGCTTGGCGC
GTCCAGCTACACCTTCGCCGAGGCCACCTGGTCGCAGCAGCTGCCGGACTGGCTGGGCTCGCACGCCCGTTGCTTTGCCTTCCTCGGTGGGGTGCCGGAG
ATCGTGGTGCCGGACAACTTGCGCAGCGCGGTGAGCAAGGCCCATCGCTACGAGCCGGACATCAACCCCAGCTACCTCGACCTGGCCGAGCACTATGGCG
TGGCGGTAGTGCCGGCGCGGGCGCGCAAGCCGCGCGATAAGGCCAAGGCCGAGGTCGGCGTGCAGGTGGTCGAGCGCTGGATCCTCGCCGCGCTGAGGAA
CCGCCAGTTCTTCTCCCTGGACGAACTCAACCGGGCCATTGCCATGCTGCTGGAGCGGCTCAACCAACGCCCGTTCAAGAAGCTGCCGGGCTCACGCCAG
ACGGCCTTCGACAGCCTGGATCGTCCGGCCCTGCGCTCGCTGCCGGAACAGCCCTACGTCTATGCCGAGTGGAAGAAGGCGCGGGTGCACATCGACTACC
ACGTCGAGGTCGACGGGCATTACTACTCGGTGCCCTACCAACTGGTGAAGAAGCAGCTGGAGGTACGCCTGACGGCGCGCACTGTGGAGTGCTTCCACGC
CAACCAACGGGTGGCCAGCCACATACGCTCCCTGCACAAGGGCCGGCACAGCACGCAGACCGAGCACATGCCCAAGAGCCACCGCGAGCATGCCGAGTGG
ACGCCGCAACGGCTGATCCACTGGGCCGAGAAGACCGGGCCGAACACCGCCGGCGTGATCGGCCACATCCTCGAACGACGCGTCCATCCGCAGCATGGCT
ACCGCGCCTGCCTGGGCATCCTGCGTCTGGGCAAACAGCATGGCGAAGAGCGGCTGGAAGCCGCCTGCCAACGCGCCCTGAGCCTCGGGGCCTGTAGCTA
CAAGAGCCTCGAATCGATCCTGCGCCAGGGCCTGGAGAACCTGCCCTTGGCGCAACAGAGCCTGCCCCTGCTACCGGACGACCACGCCAACCTGCGTGGC
CCCGGCTACTACCACTGACCCCAAGGAATCCCACCATGCTGCCCCACCCGACCCTGGACAAGCTCCAGTCCCTGCGCCTGCACGGCATGCTCAAGGCTCT
CGCTGAGCAACTGAAAACCCCGGACATCGACAGCCTGAGCTTCGAGGAACGCCTCGGTCTGCTGGTCGACCGTGAGTTGACTGAGCGCGACGACAAGCGC
CTGAGCAGCCGCCTGCGCCAGGCCAGGCTGCGCCACAACGCCTGCCTCGAAGACATCGACTACCGCAGCCCGCGCGGCCTGGACAAGGCCCTGATCCTCC
AGCTACGCAGCGGCCAGTGGCTGCGCGACGGCCTCAACCTGATCATCGGCGGCCCCACCGGCGTAGGCAAAACCTGGCTGGCCTGCGCCCTGGCCCACCA
GGCATGCCGGGACGGCTACAGCGTGCGTTACCTACGCTTGCCGCGCCTGCTGGAAGAGTTGGGCCTTGCCCACGGTGACGGGCGCTTCGCCAAGCTGATG
AGCAGCTACGCCAAGACCGATCTGCTGATCCTCGATGACTGGGGGCTGGCCCCGTTCACTGCCGAACAACGGCGCGACATGCTGGAGCTGCTGGACGACC
GTTACGGCCAGCGCTCGACCCTGGTGACCAGTCAGATGCCGGTGGACAACTGGCACGAACTGATCGGCGACCCGACCCTGGCCGACGCCATCCTCGACCG
CCTGGTGCACAACGCTTACCGGATCAACCTCAAGGGCGAATCGATGCGCAAACGGGCGAAGAAATTGACGACGCCGGGCACCTCAGACTAACAATGCCAG
CCCTGCGTCGCTGCGCTCCGACTGCCTGTCCGAATGATCGTGGAACAGGTGTCCGGATCAGCGTGGGCTGAGTGTCCGAATGGCGTGGAATCCGCA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1530 bp | 509 aa | 189 | 1718 | + | No |
Chemistry : DDE
ORF sequence :
MERLSMRKIREVLRLKFDCGLSVRKIARSLGTGHSSAGDYLCRFAASGLSWPCSLSDAELEQQLFPPAPAVASEKRPLPDWAWVHAELRRPGVTLALLWQ
EYRLSQPQGFQYSWFCEHYRAWQGKLDVVMRQEHRVGEKLFVDYAGQTVPVIDRHSGEIRQAQVFVAVLGASSYTFAEATWSQQLPDWLGSHARCFAFLG
GVPEIVVPDNLRSAVSKAHRYEPDINPSYLDLAEHYGVAVVPARARKPRDKAKAEVGVQVVERWILAALRNRQFFSLDELNRAIAMLLERLNQRPFKKLP
GSRQTAFDSLDRPALRSLPEQPYVYAEWKKARVHIDYHVEVDGHYYSVPYQLVKKQLEVRLTARTVECFHANQRVASHIRSLHKGRHSTQTEHMPKSHRE
HAEWTPQRLIHWAEKTGPNTAGVIGHILERRVHPQHGYRACLGILRLGKQHGEERLEAACQRALSLGACSYKSLESILRQGLENLPLAQQSLPLLPDDHA
NLRGPGYYH
EYRLSQPQGFQYSWFCEHYRAWQGKLDVVMRQEHRVGEKLFVDYAGQTVPVIDRHSGEIRQAQVFVAVLGASSYTFAEATWSQQLPDWLGSHARCFAFLG
GVPEIVVPDNLRSAVSKAHRYEPDINPSYLDLAEHYGVAVVPARARKPRDKAKAEVGVQVVERWILAALRNRQFFSLDELNRAIAMLLERLNQRPFKKLP
GSRQTAFDSLDRPALRSLPEQPYVYAEWKKARVHIDYHVEVDGHYYSVPYQLVKKQLEVRLTARTVECFHANQRVASHIRSLHKGRHSTQTEHMPKSHRE
HAEWTPQRLIHWAEKTGPNTAGVIGHILERRVHPQHGYRACLGILRLGKQHGEERLEAACQRALSLGACSYKSLESILRQGLENLPLAQQSLPLLPDDHA
NLRGPGYYH
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
756 bp | 251 aa | 1736 | 2491 | + | No |
AG : IS21 helper
ORF sequence :
MLPHPTLDKLQSLRLHGMLKALAEQLKTPDIDSLSFEERLGLLVDRELTERDDKRLSSRLRQARLRHNACLEDIDYRSPRGLDKALILQLRSGQWLRDGL
NLIIGGPTGVGKTWLACALAHQACRDGYSVRYLRLPRLLEELGLAHGDGRFAKLMSSYAKTDLLILDDWGLAPFTAEQRRDMLELLDDRYGQRSTLVTSQ
MPVDNWHELIGDPTLADAILDRLVHNAYRINLKGESMRKRAKKLTTPGTSD
NLIIGGPTGVGKTWLACALAHQACRDGYSVRYLRLPRLLEELGLAHGDGRFAKLMSSYAKTDLLILDDWGLAPFTAEQRRDMLELLDDRYGQRSTLVTSQ
MPVDNWHELIGDPTLADAILDRLVHNAYRINLKGESMRKRAKKLTTPGTSD
Blast result :
Comments
ISUnCu3 is 96%(istA) and 97%(istB) aa similar to ISPpu7.
References
1] Akhtar,P. (2002) Thesis
2] Haines,A.S. (2005) Direct Submission GenBank.
2] Haines,A.S. (2005) Direct Submission GenBank.