ISGau7
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AP009153 | ND | Gemmatimonas aurantiaca | Gemmatimonas aurantiaca T-27 |
DNA section
IS Length : 2510 bp
Ends
IR Length : 50/56
IRL : CCTCGGTGGACACCCAAATCCGGCCACCAGTCGACACCTGAAAACCGGCC
IRR : CCTCGGTGGACACCTCAAATCCGGCCACTGATCGACACTTCAAAACCGGC
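The two inverted repeats above are both written reading inward from their own end of the element, so IRR has to be reverse-complemented before it can be compared with the right end of the DNA sequence. A minimal sketch of that check, assuming plain uppercase A/C/G/T input; the helper names `reverse_complement` and `shared_prefix_length` are illustrative and not part of the record:

```python
# Inverted repeats exactly as listed in the record.
IRL = "CCTCGGTGGACACCCAAATCCGGCCACCAGTCGACACCTGAAAACCGGCC"
IRR = "CCTCGGTGGACACCTCAAATCCGGCCACTGATCGACACTTCAAAACCGGC"

# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def shared_prefix_length(a: str, b: str) -> int:
    """Count identical leading positions before the first mismatch (no gaps)."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


# IRR as it would appear on the top strand at the right end of the element.
right_end = reverse_complement(IRR)
print(right_end)

# Number of terminal bases the two repeats share before they first diverge.
print(shared_prefix_length(IRL, IRR))
```

The reverse-complemented IRR ends in `TCCACCGAGG`, which is the final line of the DNA sequence in this record. The ungapped prefix comparison only covers the outermost identical bases; comparing the full repeats would need a gapped alignment, which this sketch does not attempt.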
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ACACGGCGAG | CAAC | AACCACGATT | 4 |
DNA sequence
CCTCGGTGGACACCCAAATCCGGCCACCAGTCGACACCTGAAAACCGGCCACCACAGGCGAGCTGAGGCGTAGCCCGGTGGGGCGTTCCGAGAGGGACGT
TTCCCGCCATGAGCAACGTCTTGGACGAGACCAAACAGCAGCAGGTGATTGCGTTGGGGCGACTGGGGTGGTCGCTCCGGCGCATCGAGGCGGCGACCGG
CGTCCGCCGCGAGACGATCAGTGGGTATTTGAAGGCGACCGGCGTCGCAGTCCGCCGGCGCGGCGGCCGGCCGACGCAGTGGCCACCACCAAATCCGGCC
ACCACGGGCGAGGTGTCCACCGACCCGGTGTCCATCGGGACCGTGGTGGCCACGGTCGCGCGGCCGGGCCGGGCGCCGACGGCGAGCGCCTGCGAGCCGT
ACCGCGAGCAGATTGTGGAGGCCGTCCGCCGCGGCCGGAACGCGATGGCGATCTGGCAGGACCTCGTCGACCAGCACGGCTTCGACGGCCGCTATGCGAG
TGTGCGGCGCTTCGTCGGTCGCCGGCGCGTGACGTCGCCCGAGCCGGCGGGGATCATCGAGACCGCCCCCGGCGAAGAAGCCCAAGTCGATTACGGCGAA
GGGCCGATGGTGCGCGATCCCGTGAGCGGCAAATACAAACGCACGCGGCTCTTCGTGCTGACGCTCGGCTACTCGCGCAAATCCGTGCGACTGCTCACCT
GGCGCTCCAGTAGTCAGCGGTGGGCGGCGCTGCATGAAGAGGCGTTTCGTCGGCTCGGCGGCACGCCCCGCGTGATTGTCCTCGACAATCTCCGTGAGGG
CGTGCTCACGCCCGATGTGTACGACGCCCAGCTCAATCCGCTCTATCGCGATGTGCTCGCGCACTACGGCGTGGTCGCGCTCCCGGGTCGCGTCCGCGAT
CCAGATCGGAAGGGGAAAGTGGAGTCGGGGATCGGCCACACGCAGCGCACGCCGCTCAAGGGGCTGCGTTTCGAGACCCTCGACGCCGCGCAAGCGTACC
TCGATCAGTGGGAGCTCCGCTGGGCGGACACGCGCATCCATGGCACCACGAAACGCCAGGTCAGCGTGATGTTTTCGGAGGAGCGCCCGCAGCTGCAGTC
GCTGCCCCTGGAACTCTTTCGCTACTACCGGCACGGCACGCGCGTGGTGCATCTCGACGGCTGTGTGGAAGTCGAGGCCGCGTACTACAGCGTGCCGCCG
GGCTGGATCGGGCAGCAAGTCGTCGTGCAGTGGGATGACCTCCACGTGCGGGTGCTCGATCCCAAGACGAGCGGGCTGCTGCGCGAACATCTGCGCACAC
GCCGTGGGCATCATCGCGTGGCCGATGCCGACCGTCCGACTCGCACACCGGCGAAGACGCTCGCCCTCCTCGATGTCGCGCGCAAAGCCGGGCCATCGAT
CAGCGCCGTGTGTGAGCACATCCACCGTACCGAGGGCGTCCTCGCGCCGCGTCGCATCCTCGGCGTGCTCGCGCTCGCGCGCAAACACGGGCCGGCCCTC
ACGGAGGACGCGGCCCATTTCGCGCTCGAAGCCGGCGCGCCGTCCTATCGCTTCCTCCGGCGCTATCTCGAACGCGTGAAGCTGCCAGCCACGGCACTGA
AGCAGATCGACCCGCTCATCCGCCATCTCACCGACTACCGCGCCCTCATCGAACAGCGAGCTGCCGCCTCATGAACGTCACCGAACTGGATCGCGCCCTC
CGCAAGCTCCGCCTCTCCGGCATGGCGGACGTACTCGAAACGCGCCTGCGCCATGCGCAGGCCGAGCGGCTGCCGCCGCTCGATCTCGTCGCGATGCTCG
TCAATGACGAACTGCAACGGCGCCAGGATCGCCTGCTCGAACGGCGGCGTGTACAGGCCCGCTTCCGCGATCCGAACCGCTCGCTCGACTCGTTTGATTT
TGCGTTTAACAAGAAGATGAATCGCGCGCTGATTTTTGAGCTCGCGACCGGCCGCTTCATCACGCAGCGCGAAGATCTGCTGCTCGTCGGTTCACCCGGT
ACGGGAAAGAGTCACGTGGCGCAGGCGCTCGGCCACGCGGCGATTCAGCAAGGCCATCGCGTGCTCTATCGCGAAGCGCATCTGCTCCTCGAAGAGCTCA
CCGACGCCACGCTCGACGGGACCCGGAAGGAGGTCTTCACGGAACTCAGCACCGTCCCACTCCTGATCATCGATGATCTCGGCATGCGAAAGCTGCCGCC
CACCGCCGCCGAAGATCTGCTCGAGCTGATCATGCGCCGCTATGAACGCGCCTCGACGATCCTCACCGCGAATCGCCCCGTCGATGACTGGGGCAAGCTG
CTCGGCGACACCGCCGCCGTCACGGCGCTGCTCGACCGGCTCCTCCATCACGCCCACGTGATCACTTGCGGCCCCCGGAGCTGGCGGACCAAACTCCATG
GGGCGACCGACACCACCGTCACGCCGACCCGATAACCAGCCTCAGCCTGTCGTCGTGGTGGCCGGTTTTGAAGTGTCGATCAGTGGCCGGATTTGAGGTG
TCCACCGAGG
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1566 bp | 521 aa | 109 | 1674 | + | No |
Chemistry : DDE
ORF sequence :
MSNVLDETKQQQVIALGRLGWSLRRIEAATGVRRETISGYLKATGVAVRRRGGRPTQWPPPNPATTGEVSTDPVSIGTVVATVARPGRAPTASACEPYRE
QIVEAVRRGRNAMAIWQDLVDQHGFDGRYASVRRFVGRRRVTSPEPAGIIETAPGEEAQVDYGEGPMVRDPVSGKYKRTRLFVLTLGYSRKSVRLLTWRS
SSQRWAALHEEAFRRLGGTPRVIVLDNLREGVLTPDVYDAQLNPLYRDVLAHYGVVALPGRVRDPDRKGKVESGIGHTQRTPLKGLRFETLDAAQAYLDQ
WELRWADTRIHGTTKRQVSVMFSEERPQLQSLPLELFRYYRHGTRVVHLDGCVEVEAAYYSVPPGWIGQQVVVQWDDLHVRVLDPKTSGLLREHLRTRRG
HHRVADADRPTRTPAKTLALLDVARKAGPSISAVCEHIHRTEGVLAPRRILGVLALARKHGPALTEDAAHFALEAGAPSYRFLRRYLERVKLPATALKQI
DPLIRHLTDYRALIEQRAAAS
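As a quick consistency check between the DNA and protein sections, the first codons of ORF 1 can be translated and compared with the start of the ORF sequence above. This is a minimal sketch that assumes the standard genetic code; the 30-nt string is copied from positions 109 to 138 of the DNA sequence in this record, and `translate` is an illustrative helper, not a tool used by the database:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an uppercase DNA string in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Positions 109-138 of the DNA sequence (start of ORF 1).
ORF1_START = "ATGAGCAACGTCTTGGACGAGACCAAACAG"
# Positions 1663-1674 of the DNA sequence (end of ORF 1, including the stop).
ORF1_END = "GCCGCCTCATGA"

print(translate(ORF1_START))
print(translate(ORF1_END))
```

The first call yields `MSNVLDETKQ`, the first ten residues of the ORF 1 sequence, and the second yields `AAS*`, matching its last three residues followed by the stop codon.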
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
765 bp | 254 aa | 1671 | 2435 | + | No |
AG : IS21 helper
ORF sequence :
MNVTELDRALRKLRLSGMADVLETRLRHAQAERLPPLDLVAMLVNDELQRRQDRLLERRRVQARFRDPNRSLDSFDFAFNKKMNRALIFELATGRFITQR
EDLLLVGSPGTGKSHVAQALGHAAIQQGHRVLYREAHLLLEELTDATLDGTRKEVFTELSTVPLLIIDDLGMRKLPPTAAEDLLELIMRRYERASTILTA
NRPVDDWGKLLGDTAAVTALLDRLLHHAHVITCGPRSWRTKLHGATDTTVTPTR
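The ORF coordinates in the two tables above can be cross-checked with simple arithmetic: length in bp is End minus Begin plus one, and the amino acid count is the codon count minus the stop codon. The sketch below also reports how far the two ORFs overlap. It assumes 1-based inclusive coordinates and that each ORF ends with a stop codon; the `Orf` class and `overlap_bp` function are illustrative names only:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    @property
    def length_bp(self) -> int:
        return self.end - self.begin + 1

    @property
    def length_aa(self) -> int:
        # Codon count minus one, assuming the last codon is a stop.
        return self.length_bp // 3 - 1


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two coordinate ranges (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


# Coordinates as listed in the record.
orf1 = Orf("ORF 1", 109, 1674)
orf2 = Orf("ORF 2", 1671, 2435)

for orf in (orf1, orf2):
    print(orf.name, orf.length_bp, "bp", orf.length_aa, "aa")

print("overlap:", overlap_bp(orf1, orf2), "bp")
# Reading-frame offset of ORF 2 relative to ORF 1 (0 would mean same frame).
print("frame offset:", (orf2.begin - orf1.begin) % 3)
```

With the listed coordinates this reproduces 1566 bp / 521 aa and 765 bp / 254 aa, and shows a 4 bp overlap (positions 1671 to 1674, which read `ATGA` in the DNA sequence) with ORF 2 in a different reading frame from ORF 1.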
Blast result :
Comments
ISGau7 is 58% (ORFA, the transposase) and 74% (ORFB) similar to ISCARN65 at the amino acid level.
References
[1] Ichikawa, N. (2010) Direct submission
[2] Oguchi, A., Ankai, A., Yashiro, I., Takahashi, M., Terui, Y., Fukui, S., Yokoyama, H., Ichikawa, N., Takasaki, K., Miura, H., Matsushita, S., Watanabe, Y., Sekiguchi, Y., Nakamura, K., Tanikawa, S., Hanada, S., Kamagata, Y., Fujita, N. and Kikuchi, H. (2006) Direct submission, GenBank