ISUnCu14
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AB266146 | ND | uncultured bacterium | uncultured bacterium |
DNA section
IS Length : 2442 bp
Ends
IR Length : 21/25
IRL : GTAAGCGATGTTCTGCTCCCACCTTCTCTGGCGTGTTCTGATCTGGGATC
IRR : GTAAGCGGTGTTCCGATCACACCTTGGCGGCGTAGTTCCACGGCAAGAGA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCTCAGCGAACC | GTTGATTA | CAAATCTGCC | 8 |
DNA sequence
GTAAGCGATGTTCTGCTCCCACCTTCTCTGGCGTGTTCTGATCTGGGATCGGGATGTCCTGTGATGGGCGGGAGGAAGTTTCTCCATGGCGGCTACATTG
GAGGTTCTCCCGCTCGGTTGCGGCGGTCGCCACCGGCGGACGTGGCCGGATGAGGTGAAGGCGCGGATCGTGGCGGAGACGCTGCGCCCGGGTGTGACGG
TGAACGAGGTCGCGGCACGGCACGGGCTGCAGGCCAACCATCTGTCGTCGTGGCGGTCGCTCGCCCGGGCGGGAAAGCTGGTGCTGCCTGCACCGGAGGG
TTCGGTCGAGTTCGCGGCCATGATCGTGGCACCCCCGGGCGAGTGCCCGCGGCGTTCCGAAGCCGTCGGCCCGGAGATCATGGTCGGTGCCGTGACGGTC
CGCCTTGAGGCCGGCGCCCCGGTCGAACGGATTGCGACCCTCGTGCGCGCGCTGTCGGGCCCGGCATGATCTTCCCGTCGCATCGGGTGCGGATCATGGT
GGCGACGAAGCCCATCGATTTCCGCAAGGGCCACGACAGCCTGGCGGCGTTGGTGAAGAACGAACTGCGCAAGGACCCGTTCACCGGGACGGTATTCGTG
TTCCGCGCCCGCAAGGCGGACCGGCTGAAGCTGCTTTACTGGGATGGCACCGGGCTGGTGATGGCCTACAAGCGGCTGGAGGAGCACAGCTTCAGCTGGC
CCCCGGTGAAGGACGGCCTGATGACGCTGAGCCACGCCCAGTTCGAGGCACTGTTCGCCGGGCTCGACTGGCGGCGGGTCCGGGCCGTCGAGGCCCGCGC
GCCCGAGGCGGTGGAGTGACTGCGACAGGATGACTCGGGTGGTGCTTTCGGCGGGCGCCCGGAGGGCGGATCTGATAGACCGCGCTCATGCTGGCCCCTG
CCGATCTGCCTGACGATATTGCTGCCCTGAAGGCGATGCTGATCGCCTCGGAGGCGCGCAACAATCGCAAGGATGAGCGGATCGAACGGCTGGAGAAGCT
GGTCGCGGCGTTCCGGCAGGCGGCCTTCGGGCGGCGCTCGGAGAAGAGCGATCCGGGCCAGTTCGAACTGGCACTCGAGGATCTGGAGACCGCCATCGCC
GCGATCCGTGCCGAAGAGGATGCCGAGGATCGCGCGGCGAAGCGCCCGTTCAGGCCGCGCGCCATCAATCGCGGATCGCTGCCCAGGCACCTGCCACGCA
TCGACGAGGTCATCGAGCCTGAGAGCCTGACCTGCGCCTGCGGCGGTTGCCTGCATTGCATCGGCGAGGACGTGTCGGAACGGCTGGACATCGTTCCCGC
TCAGTTCCGCGTCATCGTCACCCGTCGTCCGAAGTATGCCTGCCGCTCCTGCACCGACGGCGTGGTGCAGGCCCCGGCGCCGGCGCGGCTGATCCCGGGC
GGCATGCCGACCGAGGCGACGGTCGCCCATGTGCTGGTCAGCAAATACGCCGACCACCTTCCGCTTTACCGCCAGGCGCAGATTTACAGCCGCCAGGGCA
TCGATCTCGACCGGTCCACCCTGGCCGACTGGGTCGGCCGCGCCGCCTTCGAGCTGCGCCCGGTCTTTGACGCCCTGATGGCCGACCTGAAGCGGTCGAC
GAAGCTGTCCATGGACGAGACCCGCGCCCCGGTCCTCGATCCGGGGGCACGGAAGACGAAGACCGGATACTTCTGGGCACTGGCCCGCGATGATCGGCCC
TGGGGCGGCACCGCGCCGCCAGGCGTGGCCTTCACCTATGCTCCGGGTCGAGGCGGGCAGCACGCCGAACGGATCCTGCAGGGCTTCGGTGGCATCCTGC
AGGTCGATGGATACGCAGGTTACAACCGGCTGATCGCGCCCGACCGGGTCGGCCCAGGCATCCAACTGGCCTATTGCTGGGCCCATGCGCGCCGAAAGCT
CATAGAGATCACCCGCACCGGGCCCGCACCGATCGCCGAGGAGGGCGTCGACCTCATCCGCGATCTCTATCGTATCGAGGCTGACATTCGCGGCAGCGAC
CCCACCGCCCGCCTGGTCGCGCGGCAGGACCGTTCAGCCCCGATCCTCGCCCGCCTCGACGACTGGCTGTGCCATCACCGCGCCCGCGCGTCCGCAAAGT
CGCCACTGGGCGAGGCGCTCGCCTACATCGCCAGATACCGTGACGGCCTTGGACGTTTCCTGACCGATGGCCGCATCGAGATCGACTCCAATGCCGTCGA
ACGCACCATCCGTCCGATCGCGCTGAACCGGAAGAATGCCCTCTTCGCCGGGCACGACACCGGCGCCGAAAACTGGGCCGTCATTGCCTCGCTGATCGAG
ACCTGCAAACTCAACAGCGTCGATCCCCAGACCTGGCTGGCGAACACGCTCACCGCCATAGCCAATGGGCATAAGCAGAGCCAAATCAACGATCTCTTGC
CGTGGAACTACGCCGCCAAGGTGTGATCGGAACACCGCTTAC
GAGGTTCTCCCGCTCGGTTGCGGCGGTCGCCACCGGCGGACGTGGCCGGATGAGGTGAAGGCGCGGATCGTGGCGGAGACGCTGCGCCCGGGTGTGACGG
TGAACGAGGTCGCGGCACGGCACGGGCTGCAGGCCAACCATCTGTCGTCGTGGCGGTCGCTCGCCCGGGCGGGAAAGCTGGTGCTGCCTGCACCGGAGGG
TTCGGTCGAGTTCGCGGCCATGATCGTGGCACCCCCGGGCGAGTGCCCGCGGCGTTCCGAAGCCGTCGGCCCGGAGATCATGGTCGGTGCCGTGACGGTC
CGCCTTGAGGCCGGCGCCCCGGTCGAACGGATTGCGACCCTCGTGCGCGCGCTGTCGGGCCCGGCATGATCTTCCCGTCGCATCGGGTGCGGATCATGGT
GGCGACGAAGCCCATCGATTTCCGCAAGGGCCACGACAGCCTGGCGGCGTTGGTGAAGAACGAACTGCGCAAGGACCCGTTCACCGGGACGGTATTCGTG
TTCCGCGCCCGCAAGGCGGACCGGCTGAAGCTGCTTTACTGGGATGGCACCGGGCTGGTGATGGCCTACAAGCGGCTGGAGGAGCACAGCTTCAGCTGGC
CCCCGGTGAAGGACGGCCTGATGACGCTGAGCCACGCCCAGTTCGAGGCACTGTTCGCCGGGCTCGACTGGCGGCGGGTCCGGGCCGTCGAGGCCCGCGC
GCCCGAGGCGGTGGAGTGACTGCGACAGGATGACTCGGGTGGTGCTTTCGGCGGGCGCCCGGAGGGCGGATCTGATAGACCGCGCTCATGCTGGCCCCTG
CCGATCTGCCTGACGATATTGCTGCCCTGAAGGCGATGCTGATCGCCTCGGAGGCGCGCAACAATCGCAAGGATGAGCGGATCGAACGGCTGGAGAAGCT
GGTCGCGGCGTTCCGGCAGGCGGCCTTCGGGCGGCGCTCGGAGAAGAGCGATCCGGGCCAGTTCGAACTGGCACTCGAGGATCTGGAGACCGCCATCGCC
GCGATCCGTGCCGAAGAGGATGCCGAGGATCGCGCGGCGAAGCGCCCGTTCAGGCCGCGCGCCATCAATCGCGGATCGCTGCCCAGGCACCTGCCACGCA
TCGACGAGGTCATCGAGCCTGAGAGCCTGACCTGCGCCTGCGGCGGTTGCCTGCATTGCATCGGCGAGGACGTGTCGGAACGGCTGGACATCGTTCCCGC
TCAGTTCCGCGTCATCGTCACCCGTCGTCCGAAGTATGCCTGCCGCTCCTGCACCGACGGCGTGGTGCAGGCCCCGGCGCCGGCGCGGCTGATCCCGGGC
GGCATGCCGACCGAGGCGACGGTCGCCCATGTGCTGGTCAGCAAATACGCCGACCACCTTCCGCTTTACCGCCAGGCGCAGATTTACAGCCGCCAGGGCA
TCGATCTCGACCGGTCCACCCTGGCCGACTGGGTCGGCCGCGCCGCCTTCGAGCTGCGCCCGGTCTTTGACGCCCTGATGGCCGACCTGAAGCGGTCGAC
GAAGCTGTCCATGGACGAGACCCGCGCCCCGGTCCTCGATCCGGGGGCACGGAAGACGAAGACCGGATACTTCTGGGCACTGGCCCGCGATGATCGGCCC
TGGGGCGGCACCGCGCCGCCAGGCGTGGCCTTCACCTATGCTCCGGGTCGAGGCGGGCAGCACGCCGAACGGATCCTGCAGGGCTTCGGTGGCATCCTGC
AGGTCGATGGATACGCAGGTTACAACCGGCTGATCGCGCCCGACCGGGTCGGCCCAGGCATCCAACTGGCCTATTGCTGGGCCCATGCGCGCCGAAAGCT
CATAGAGATCACCCGCACCGGGCCCGCACCGATCGCCGAGGAGGGCGTCGACCTCATCCGCGATCTCTATCGTATCGAGGCTGACATTCGCGGCAGCGAC
CCCACCGCCCGCCTGGTCGCGCGGCAGGACCGTTCAGCCCCGATCCTCGCCCGCCTCGACGACTGGCTGTGCCATCACCGCGCCCGCGCGTCCGCAAAGT
CGCCACTGGGCGAGGCGCTCGCCTACATCGCCAGATACCGTGACGGCCTTGGACGTTTCCTGACCGATGGCCGCATCGAGATCGACTCCAATGCCGTCGA
ACGCACCATCCGTCCGATCGCGCTGAACCGGAAGAATGCCCTCTTCGCCGGGCACGACACCGGCGCCGAAAACTGGGCCGTCATTGCCTCGCTGATCGAG
ACCTGCAAACTCAACAGCGTCGATCCCCAGACCTGGCTGGCGAACACGCTCACCGCCATAGCCAATGGGCATAAGCAGAGCCAAATCAACGATCTCTTGC
CGTGGAACTACGCCGCCAAGGTGTGATCGGAACACCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
384 bp | 127 aa | 86 | 469 | + | No |
AG : IS66 TnpA
ORF sequence :
MAATLEVLPLGCGGRHRRTWPDEVKARIVAETLRPGVTVNEVAARHGLQANHLSSWRSLARAGKLVLPAPEGSVEFAAMIVAPPGECPRRSEAVGPEIMV
GAVTVRLEAGAPVERIATLVRALSGPA
GAVTVRLEAGAPVERIATLVRALSGPA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
354 bp | 117 aa | 466 | 819 | + | No |
AG : IS66 TnpB
ORF sequence :
MIFPSHRVRIMVATKPIDFRKGHDSLAALVKNELRKDPFTGTVFVFRARKADRLKLLYWDGTGLVMAYKRLEEHSFSWPPVKDGLMTLSHAQFEALFAGL
DWRRVRAVEARAPEAVE
DWRRVRAVEARAPEAVE
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1539 bp | 512 aa | 888 | 2426 | + | No |
Chemistry : DDE
ORF sequence :
MLAPADLPDDIAALKAMLIASEARNNRKDERIERLEKLVAAFRQAAFGRRSEKSDPGQFELALEDLETAIAAIRAEEDAEDRAAKRPFRPRAINRGSLPR
HLPRIDEVIEPESLTCACGGCLHCIGEDVSERLDIVPAQFRVIVTRRPKYACRSCTDGVVQAPAPARLIPGGMPTEATVAHVLVSKYADHLPLYRQAQIY
SRQGIDLDRSTLADWVGRAAFELRPVFDALMADLKRSTKLSMDETRAPVLDPGARKTKTGYFWALARDDRPWGGTAPPGVAFTYAPGRGGQHAERILQGF
GGILQVDGYAGYNRLIAPDRVGPGIQLAYCWAHARRKLIEITRTGPAPIAEEGVDLIRDLYRIEADIRGSDPTARLVARQDRSAPILARLDDWLCHHRAR
ASAKSPLGEALAYIARYRDGLGRFLTDGRIEIDSNAVERTIRPIALNRKNALFAGHDTGAENWAVIASLIETCKLNSVDPQTWLANTLTAIANGHKQSQI
NDLLPWNYAAKV
HLPRIDEVIEPESLTCACGGCLHCIGEDVSERLDIVPAQFRVIVTRRPKYACRSCTDGVVQAPAPARLIPGGMPTEATVAHVLVSKYADHLPLYRQAQIY
SRQGIDLDRSTLADWVGRAAFELRPVFDALMADLKRSTKLSMDETRAPVLDPGARKTKTGYFWALARDDRPWGGTAPPGVAFTYAPGRGGQHAERILQGF
GGILQVDGYAGYNRLIAPDRVGPGIQLAYCWAHARRKLIEITRTGPAPIAEEGVDLIRDLYRIEADIRGSDPTARLVARQDRSAPILARLDDWLCHHRAR
ASAKSPLGEALAYIARYRDGLGRFLTDGRIEIDSNAVERTIRPIALNRKNALFAGHDTGAENWAVIASLIETCKLNSVDPQTWLANTLTAIANGHKQSQI
NDLLPWNYAAKV
Blast result :
Comments
ISUnCu14 is 74% (ORFA) aa similar to IS71, 78% (ORFB) to ISXau4 and 71% (ORFc : the transposase) to ISApr6.
References
1] ISfinder annotation (2010)
2] Suenaga,H., Ohnuki,T. and Miyazaki,K. (2007) Environ. Microbiol. 9 (9), 2289-2297
3] Suenaga,H., Koyama,Y., Miyakoshi,M., Miyazaki,R., Yano,H., Sota,M., Ohtsubo,Y., Tsuda,M. and Miyazaki,K. (2009) ISME J 3 (12), 1335-1348.
2] Suenaga,H., Ohnuki,T. and Miyazaki,K. (2007) Environ. Microbiol. 9 (9), 2289-2297
3] Suenaga,H., Koyama,Y., Miyakoshi,M., Miyazaki,R., Yano,H., Sota,M., Ohtsubo,Y., Tsuda,M. and Miyazaki,K. (2009) ISME J 3 (12), 1335-1348.