ISNamo20
- Family IS200/IS605
- Group IS1341
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
HF582854 | ND | Natronomonas moolapensis | Natronomonas moolapensis 8.8.11 |
DNA section
IS Length : 1536 bp
Ends
Left end : AGAGAGCAAGGAGAATACCTGCGGCTTTAGCCGCAGGATGAATCCGACAACTCCTCCACAACCCACCGCCGATAGCACGGCTGGATATTCCACCGTTCTC II struct. : Yes
Right end : CCAACCTCGGACAGAGGTGGTGGACTGCAAACCCTAATATCCCAATCCAGCGGTGCGGTGCCGTGGGATTCCGCCGCCTTCAGGCGGAGGAGGATGTCAA II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
ACGCGTTT | TTAT | TGGTGCCCCGGT | TCAA |
TGACGGGCGCGT | TCGACGAGGAAC | TCAA | |
CAGCCGACGCTG | TGCCCGTCGCGA | TCAA | |
CATCCGCCGTGT | CAGTGAAGTCCA | TCAA | |
TGGCCGCGTTTT | TTAT | GTACGGTCGAAC | TCAA |
CGTGTGCCGGTG | TTAT | TTGAGTTCGTCC | TCAA |
DNA sequence
AGAGAGCAAGGAGAATACCTGCGGCTTTAGCCGCAGGATGAATCCGACAACTCCTCCACAACCCACCGCCGATAGCACGGCTGGATATTCCACCGTTCTC
AAACCGAACCTTTACAAGAAACACAAGTATAAGTGATTATAGAGGCGTTCAGAATGCTGGAAATCCATCGAACCCACCGAGCGAAAATCCTCAACCACAA
CCAAGTGGCTGAGATGCTCGACCGGCACGGGTGGAGTGCCAGCAAACTCTGGAACGTCGCTAATTACCACTCTCGACAAAAGTGGGATGATACGGGTGAG
ATTCCAGACCACAGCGACCTCAAAGACGAGTTGAAAGGTCACTCAAAGTACAAGGGATTGCACTCGCAGTCCAGTCAGCGCGTTCTGGAGGAACTCGCTG
AAGCCTTCAACTCGTGGTACAAAAAGAGGAAGTCTGATAATCGAGCGAATCCGCCCGGCTACCGCAAAAAAAACTACTACGACGACCACGGCAACCGCGT
CCACGAGGAACATCCGCGTTCGACCGTGACGTGGAAGCAAAACGGCATCAAACACGACACGAACAACAACCGTGTCCGTCTCTCAAAGGGCGCGAATCAC
AAGGAACACCCGAAAGCATGGGAGTACATCCTTGTCGAATACGAGACGCGCCCCGGTGTCACCGTTGAGAACCTACAACAGGTCAGAGCCGTCTACGACA
AAGCAAAGAGGCGCTGGGAACTGCACCTCGTCTGTAAGGATGAAATCGAGACACCCACCGCACCCGGAAACGAGACAGCGGGTATCGACCTCGGCATCTG
TAACTTCGCGGCGGTCACGTACAGCACCGAGCAAGCTGACCTCTACCCCGGCAACCGGTTGAAACAGGACGGGTACTACTTCCCGAAAGAAATCGCCAAG
TGCGACGACTCGGGTGGTGAAGAAGCGACCCGTCTCCACGCGAAGTGGTCGGAGCGCCGCACCCACTTTTTCCACTCGTTAGCGAAACACATCGTCCAGC
GATGTATCGAGAACAGTGTTGGGCGTATCAACATCGGGAAGCTCGCTGGTGTCCGTGAAGACGACAACGGCGAGTCGAAGAACTGGGGCAAGCACGGGAA
CCTCGACCTGCACGGCTGGGCGTTCGACCGCTTCACCTCGATTCTCGAATACAAGGCCAAAGTCGAGGGTGTCGAAGTCGTAGAGGTCTCAGAGCGCGAC
ACGAGCAAGACGTGTTGCGTCTGCGGTAGAAAAGACGAGAGTCAGCGTGTCGAACGTGGCTTGTATGTCTGCGAGCCGTGTGACGCGGCGTTCAACGCTG
ACGTGAATGGGGCGGAGAACATCCGTCTCGAGTTGAAGCAAAGTAACTCCGAGTCTGCTCCCGATTTGGGTGGGGATAGGAGTACCGGCTGGTTGGCACA
GCCCGAAGTCTACCTTCATGACCTCTCCCGAGGATTCCAACCTCGGACAGAGGTGGTGGACTGCAAACCCTAATATCCCAATCCAGCGGTGCGGTGCCGT
GGGATTCCGCCGCCTTCAGGCGGAGGAGGATGTCAA
AAACCGAACCTTTACAAGAAACACAAGTATAAGTGATTATAGAGGCGTTCAGAATGCTGGAAATCCATCGAACCCACCGAGCGAAAATCCTCAACCACAA
CCAAGTGGCTGAGATGCTCGACCGGCACGGGTGGAGTGCCAGCAAACTCTGGAACGTCGCTAATTACCACTCTCGACAAAAGTGGGATGATACGGGTGAG
ATTCCAGACCACAGCGACCTCAAAGACGAGTTGAAAGGTCACTCAAAGTACAAGGGATTGCACTCGCAGTCCAGTCAGCGCGTTCTGGAGGAACTCGCTG
AAGCCTTCAACTCGTGGTACAAAAAGAGGAAGTCTGATAATCGAGCGAATCCGCCCGGCTACCGCAAAAAAAACTACTACGACGACCACGGCAACCGCGT
CCACGAGGAACATCCGCGTTCGACCGTGACGTGGAAGCAAAACGGCATCAAACACGACACGAACAACAACCGTGTCCGTCTCTCAAAGGGCGCGAATCAC
AAGGAACACCCGAAAGCATGGGAGTACATCCTTGTCGAATACGAGACGCGCCCCGGTGTCACCGTTGAGAACCTACAACAGGTCAGAGCCGTCTACGACA
AAGCAAAGAGGCGCTGGGAACTGCACCTCGTCTGTAAGGATGAAATCGAGACACCCACCGCACCCGGAAACGAGACAGCGGGTATCGACCTCGGCATCTG
TAACTTCGCGGCGGTCACGTACAGCACCGAGCAAGCTGACCTCTACCCCGGCAACCGGTTGAAACAGGACGGGTACTACTTCCCGAAAGAAATCGCCAAG
TGCGACGACTCGGGTGGTGAAGAAGCGACCCGTCTCCACGCGAAGTGGTCGGAGCGCCGCACCCACTTTTTCCACTCGTTAGCGAAACACATCGTCCAGC
GATGTATCGAGAACAGTGTTGGGCGTATCAACATCGGGAAGCTCGCTGGTGTCCGTGAAGACGACAACGGCGAGTCGAAGAACTGGGGCAAGCACGGGAA
CCTCGACCTGCACGGCTGGGCGTTCGACCGCTTCACCTCGATTCTCGAATACAAGGCCAAAGTCGAGGGTGTCGAAGTCGTAGAGGTCTCAGAGCGCGAC
ACGAGCAAGACGTGTTGCGTCTGCGGTAGAAAAGACGAGAGTCAGCGTGTCGAACGTGGCTTGTATGTCTGCGAGCCGTGTGACGCGGCGTTCAACGCTG
ACGTGAATGGGGCGGAGAACATCCGTCTCGAGTTGAAGCAAAGTAACTCCGAGTCTGCTCCCGATTTGGGTGGGGATAGGAGTACCGGCTGGTTGGCACA
GCCCGAAGTCTACCTTCATGACCTCTCCCGAGGATTCCAACCTCGGACAGAGGTGGTGGACTGCAAACCCTAATATCCCAATCCAGCGGTGCGGTGCCGT
GGGATTCCGCCGCCTTCAGGCGGAGGAGGATGTCAA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1320 bp | 439 aa | 154 | 1473 | + | No |
AG : TnpB
ORF sequence :
MLEIHRTHRAKILNHNQVAEMLDRHGWSASKLWNVANYHSRQKWDDTGEIPDHSDLKDELKGHSKYKGLHSQSSQRVLEELAEAFNSWYKKRKSDNRANP
PGYRKKNYYDDHGNRVHEEHPRSTVTWKQNGIKHDTNNNRVRLSKGANHKEHPKAWEYILVEYETRPGVTVENLQQVRAVYDKAKRRWELHLVCKDEIET
PTAPGNETAGIDLGICNFAAVTYSTEQADLYPGNRLKQDGYYFPKEIAKCDDSGGEEATRLHAKWSERRTHFFHSLAKHIVQRCIENSVGRINIGKLAGV
REDDNGESKNWGKHGNLDLHGWAFDRFTSILEYKAKVEGVEVVEVSERDTSKTCCVCGRKDESQRVERGLYVCEPCDAAFNADVNGAENIRLELKQSNSE
SAPDLGGDRSTGWLAQPEVYLHDLSRGFQPRTEVVDCKP
PGYRKKNYYDDHGNRVHEEHPRSTVTWKQNGIKHDTNNNRVRLSKGANHKEHPKAWEYILVEYETRPGVTVENLQQVRAVYDKAKRRWELHLVCKDEIET
PTAPGNETAGIDLGICNFAAVTYSTEQADLYPGNRLKQDGYYFPKEIAKCDDSGGEEATRLHAKWSERRTHFFHSLAKHIVQRCIENSVGRINIGKLAGV
REDDNGESKNWGKHGNLDLHGWAFDRFTSILEYKAKVEGVEVVEVSERDTSKTCCVCGRKDESQRVERGLYVCEPCDAAFNADVNGAENIRLELKQSNSE
SAPDLGGDRSTGWLAQPEVYLHDLSRGFQPRTEVVDCKP
Blast result :
Comments
ISNamo20 is 91% (TnpB) aa similar to ISHli3.
References
1] Pfeiffer F. (2013) Direct submission.
2] Dyall-Smith, M.L., Pfeiffer, F., Oberwinkler, T., Klee, K., Rampp, M., Palm, P., Gross, K., Schuster, S.C., Oesterhelt, D.(2013) Genome Announc. 1: e0009513
2] Dyall-Smith, M.L., Pfeiffer, F., Oberwinkler, T., Klee, K., Rampp, M., Palm, P., Gross, K., Schuster, S.C., Oesterhelt, D.(2013) Genome Announc. 1: e0009513