ISNma20
- Family IS607
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP001932 | ND | Natrialba magadii | Natrialba magadii ATCC 43099 |
DNA section
IS Length : 2469 bp
Ends
IR Length : 0
IRL : TAATGTAACTGTTACAGAATTCAGTATCTATAAGTATCTACCAGTAGTAA
IRR : ACCGTTCCAATTTTCAGTAGCTTTCAGTAGGACACCGTAAGAAACGGGTA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGCGGGCCG | GGG | AGTACGGCTGTG | 3 |
CGACGACTG | GGG | CGGGCTGCGCTA | 3 |
DNA sequence
TAATGTAACTGTTACAGAATTCAGTATCTATAAGTATCTACCAGTAGTAACCAGTAATATCAGAATGCCGCGGTCGGACTCGATTGGCGAGTTCGCAGAC
GAACTCGGTGTTCACCCCCAGACCGTCAAACGCTGGTGTCGCAACGACGATCTCGACTATACCCGAACACCCGGCGGTGAACGACGAATACCACATCGAG
AACTTCGCCGACTTGCTGGCGATACTCGTCCGACAGACCGTGTTGTACTCTACGCTCGCGTCTCCAGTCACGGGCAGAAAGATAACGGCGACCTCGACCA
CCAACTTGAGCGACTCACAGACTACGCTCACGACCACGGCTGGAGCGTCGAAAACACCTATACCGACGTTGGCAGTGGCCTCAACGAAGACCGCCGTGGC
CTCAACTCCCTCCTCGATGACCTACAGGAAGCCGACTACGGACGCATTCTCGTCACCTACGAAGATAGACTCACTCGTTTTGGGTTCTCGTATCTCAAAC
GGTATTTCGACTGCTACGGTGTCACAGTCACCGTCATCGAAGACGAGACGGACAAATCTGCACAGGAAGAACTCGTTGACGACCTTATCAAGCTCGTCGC
CAGCTTCAGCGGCAAACTCTACGGGATGCGCTCATCGAAAAAACAACAGGTCGTCAATACCGTCGAATCAGAGGTAAAACCCGATGAGTGACCACCTCTC
GCTCCCGATTCACCTCCCAGATGAGGACGACAAACGATTCAAGCGACTGGCAACACTCACCCGTCGTGTCGCCAACCATGTCCTCGAAGACCACTGGACA
CCAGTCCATCTGAGCGGCATTGCCGACGCATCGCATCAGGCGTGGAAATACTTCGATGAACACGAACCGTTCGAAGAACTCGACCTGTATCTCCCCTCGC
GGTTTCGACGGTGCATCTTGCAGAAAGTCGGGGAAACGCTCCGGAGTCACGCCGACCGCCGAGACGCCTTTCAGTCCATCCAAAGCGTGTTGCCCGACCA
CAAAATCCGACGCATCCACCGTCGTCGCATCAAAGAACAACTCTGGGATGACGGTGACTACCTCTCGTCGGGATACGTGGACATCCTCATCGACCAACTC
ACCAGCTACTGCGACCGTCATGGCACGCATCCTGCCACGTATTTCGAACTGCAGGACTGTCCAGAGTACGATAGCGGCGTGCTACCGTTTTCTGCGGACG
ACGGCCCAACGAGCGGCCAAGCCGTCAAATACCAGTATGACGCCGACAGTGAGACGCTGACCATTCGCCTCAAGACACCGGACACGCTGTCGCCGGAGAC
ACGAGGTGACTGGTCGTGGACGGAGTACGAGCGTGACGGCTACGAGGCGTTTCATGACCTCCTCGCTCACGGCGATCTCTCGGCTCCCGAGTTTCAGCCC
GCTCGCCGAAAGACCGGTGACACCTATTACGAACTGTCCTTCCCCGTCGAAGTCGACCACGCGGAGACGACTGACGGCACAGACTGCGTACTGGCGCTCG
ACGCCGGGATGCGAAAAGACATGACTGTCGTGGTGGCCACAGACGATGGCGAGCAACGATCCACGCCACAGTTCATCCAGTTCACAGACCGCGAGAAGAT
GCGACGACTCCACCGCGAACGTACCCGCCTGAACGACCGCCTTGCCGCGTTGCGCCGTGGCGCTCGCTCGCATACTGACGAGTTCGCCCACATCCACAGT
GAGTACGAGCGAGTGAACAGCAAGATCCAGCACAAGCGCGATCAACTAACACACGACGTCGCAAACCAAGTTCTCGCACTTGCACTCGCCTACGACGTAG
ACGCCATCGTCCACGAGGACTTGCGGTCGCTCTCCCCGCCGAGTGACGAAGGCACGTTGTCGTGGGAATTGTCGTCGTGGGCGCGGCGGGACATCATCGA
AAAAATCGAATATCGGGCAGAGTGTGCCGGTCTCGCTGTTGAGCGAGTCTATCCACAGGGAACAAGTCGGTCGTGTCCCCGGTGTGGCTCAACCGGCCAC
ACCTGCAAGTCACCCAACCACCACGAAGAACACTGGTGGGGCGGGCACTTCCGGTGTGACAACGCACGGTGTGGGTTCGAGGGCGACCGGGACTACATCG
GGACACTGAACGTAGCTCGCGTGTTCTTCAGCGAGACGGACGAGCTAGACCACGGTTTCACGTCCTCCTACACGGGGGATTCTGAAATCGTGCTAGCTGG
CCGTTCCGCTGGTGAGCAGTCCGATGGACTGCGATCCTCGTCAGAAGCGAGTTCTGACGCTGGCACGCGACTCACGTTCGGATCTGGCATCGTCGCCTAC
GAACCTGAACAGGCAGCGGCGACCACTGGTGGTGGGTCGGCTGTCATAGCACCCGCTGTCGCCTTGCCCGAGTCGAATGCGGATGGGAGTGATGGACGCG
GCCCAGTCGTCCAACAGTGTACCCGTTTCTTACGGTGTCCTACTGAAAGCTACTGAAAATTGGAACGGT
GAACTCGGTGTTCACCCCCAGACCGTCAAACGCTGGTGTCGCAACGACGATCTCGACTATACCCGAACACCCGGCGGTGAACGACGAATACCACATCGAG
AACTTCGCCGACTTGCTGGCGATACTCGTCCGACAGACCGTGTTGTACTCTACGCTCGCGTCTCCAGTCACGGGCAGAAAGATAACGGCGACCTCGACCA
CCAACTTGAGCGACTCACAGACTACGCTCACGACCACGGCTGGAGCGTCGAAAACACCTATACCGACGTTGGCAGTGGCCTCAACGAAGACCGCCGTGGC
CTCAACTCCCTCCTCGATGACCTACAGGAAGCCGACTACGGACGCATTCTCGTCACCTACGAAGATAGACTCACTCGTTTTGGGTTCTCGTATCTCAAAC
GGTATTTCGACTGCTACGGTGTCACAGTCACCGTCATCGAAGACGAGACGGACAAATCTGCACAGGAAGAACTCGTTGACGACCTTATCAAGCTCGTCGC
CAGCTTCAGCGGCAAACTCTACGGGATGCGCTCATCGAAAAAACAACAGGTCGTCAATACCGTCGAATCAGAGGTAAAACCCGATGAGTGACCACCTCTC
GCTCCCGATTCACCTCCCAGATGAGGACGACAAACGATTCAAGCGACTGGCAACACTCACCCGTCGTGTCGCCAACCATGTCCTCGAAGACCACTGGACA
CCAGTCCATCTGAGCGGCATTGCCGACGCATCGCATCAGGCGTGGAAATACTTCGATGAACACGAACCGTTCGAAGAACTCGACCTGTATCTCCCCTCGC
GGTTTCGACGGTGCATCTTGCAGAAAGTCGGGGAAACGCTCCGGAGTCACGCCGACCGCCGAGACGCCTTTCAGTCCATCCAAAGCGTGTTGCCCGACCA
CAAAATCCGACGCATCCACCGTCGTCGCATCAAAGAACAACTCTGGGATGACGGTGACTACCTCTCGTCGGGATACGTGGACATCCTCATCGACCAACTC
ACCAGCTACTGCGACCGTCATGGCACGCATCCTGCCACGTATTTCGAACTGCAGGACTGTCCAGAGTACGATAGCGGCGTGCTACCGTTTTCTGCGGACG
ACGGCCCAACGAGCGGCCAAGCCGTCAAATACCAGTATGACGCCGACAGTGAGACGCTGACCATTCGCCTCAAGACACCGGACACGCTGTCGCCGGAGAC
ACGAGGTGACTGGTCGTGGACGGAGTACGAGCGTGACGGCTACGAGGCGTTTCATGACCTCCTCGCTCACGGCGATCTCTCGGCTCCCGAGTTTCAGCCC
GCTCGCCGAAAGACCGGTGACACCTATTACGAACTGTCCTTCCCCGTCGAAGTCGACCACGCGGAGACGACTGACGGCACAGACTGCGTACTGGCGCTCG
ACGCCGGGATGCGAAAAGACATGACTGTCGTGGTGGCCACAGACGATGGCGAGCAACGATCCACGCCACAGTTCATCCAGTTCACAGACCGCGAGAAGAT
GCGACGACTCCACCGCGAACGTACCCGCCTGAACGACCGCCTTGCCGCGTTGCGCCGTGGCGCTCGCTCGCATACTGACGAGTTCGCCCACATCCACAGT
GAGTACGAGCGAGTGAACAGCAAGATCCAGCACAAGCGCGATCAACTAACACACGACGTCGCAAACCAAGTTCTCGCACTTGCACTCGCCTACGACGTAG
ACGCCATCGTCCACGAGGACTTGCGGTCGCTCTCCCCGCCGAGTGACGAAGGCACGTTGTCGTGGGAATTGTCGTCGTGGGCGCGGCGGGACATCATCGA
AAAAATCGAATATCGGGCAGAGTGTGCCGGTCTCGCTGTTGAGCGAGTCTATCCACAGGGAACAAGTCGGTCGTGTCCCCGGTGTGGCTCAACCGGCCAC
ACCTGCAAGTCACCCAACCACCACGAAGAACACTGGTGGGGCGGGCACTTCCGGTGTGACAACGCACGGTGTGGGTTCGAGGGCGACCGGGACTACATCG
GGACACTGAACGTAGCTCGCGTGTTCTTCAGCGAGACGGACGAGCTAGACCACGGTTTCACGTCCTCCTACACGGGGGATTCTGAAATCGTGCTAGCTGG
CCGTTCCGCTGGTGAGCAGTCCGATGGACTGCGATCCTCGTCAGAAGCGAGTTCTGACGCTGGCACGCGACTCACGTTCGGATCTGGCATCGTCGCCTAC
GAACCTGAACAGGCAGCGGCGACCACTGGTGGTGGGTCGGCTGTCATAGCACCCGCTGTCGCCTTGCCCGAGTCGAATGCGGATGGGAGTGATGGACGCG
GCCCAGTCGTCCAACAGTGTACCCGTTTCTTACGGTGTCCTACTGAAAGCTACTGAAAATTGGAACGGT
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
627 bp | 208 aa | 65 | 691 | + | No |
Chemistry : Serine
ORF sequence :
MPRSDSIGEFADELGVHPQTVKRWCRNDDLDYTRTPGGERRIPHRELRRLAGDTRPTDRVVLYARVSSHGQKDNGDLDHQLERLTDYAHDHGWSVENTYT
DVGSGLNEDRRGLNSLLDDLQEADYGRILVTYEDRLTRFGFSYLKRYFDCYGVTVTVIEDETDKSAQEELVDDLIKLVASFSGKLYGMRSSKKQQVVNTV
ESEVKPDE
DVGSGLNEDRRGLNSLLDDLQEADYGRILVTYEDRLTRFGFSYLKRYFDCYGVTVTVIEDETDKSAQEELVDDLIKLVASFSGKLYGMRSSKKQQVVNTV
ESEVKPDE
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1773 bp | 590 aa | 684 | 2456 | + | No |
AG : TnpB
ORF sequence :
MSDHLSLPIHLPDEDDKRFKRLATLTRRVANHVLEDHWTPVHLSGIADASHQAWKYFDEHEPFEELDLYLPSRFRRCILQKVGETLRSHADRRDAFQSIQ
SVLPDHKIRRIHRRRIKEQLWDDGDYLSSGYVDILIDQLTSYCDRHGTHPATYFELQDCPEYDSGVLPFSADDGPTSGQAVKYQYDADSETLTIRLKTPD
TLSPETRGDWSWTEYERDGYEAFHDLLAHGDLSAPEFQPARRKTGDTYYELSFPVEVDHAETTDGTDCVLALDAGMRKDMTVVVATDDGEQRSTPQFIQF
TDREKMRRLHRERTRLNDRLAALRRGARSHTDEFAHIHSEYERVNSKIQHKRDQLTHDVANQVLALALAYDVDAIVHEDLRSLSPPSDEGTLSWELSSWA
RRDIIEKIEYRAECAGLAVERVYPQGTSRSCPRCGSTGHTCKSPNHHEEHWWGGHFRCDNARCGFEGDRDYIGTLNVARVFFSETDELDHGFTSSYTGDS
EIVLAGRSAGEQSDGLRSSSEASSDAGTRLTFGSGIVAYEPEQAAATTGGGSAVIAPAVALPESNADGSDGRGPVVQQCTRFLRCPTESY
SVLPDHKIRRIHRRRIKEQLWDDGDYLSSGYVDILIDQLTSYCDRHGTHPATYFELQDCPEYDSGVLPFSADDGPTSGQAVKYQYDADSETLTIRLKTPD
TLSPETRGDWSWTEYERDGYEAFHDLLAHGDLSAPEFQPARRKTGDTYYELSFPVEVDHAETTDGTDCVLALDAGMRKDMTVVVATDDGEQRSTPQFIQF
TDREKMRRLHRERTRLNDRLAALRRGARSHTDEFAHIHSEYERVNSKIQHKRDQLTHDVANQVLALALAYDVDAIVHEDLRSLSPPSDEGTLSWELSSWA
RRDIIEKIEYRAECAGLAVERVYPQGTSRSCPRCGSTGHTCKSPNHHEEHWWGGHFRCDNARCGFEGDRDYIGTLNVARVFFSETDELDHGFTSSYTGDS
EIVLAGRSAGEQSDGLRSSSEASSDAGTRLTFGSGIVAYEPEQAAATTGGGSAVIAPAVALPESNADGSDGRGPVVQQCTRFLRCPTESY
Blast result :
Comments
ISNma20 is 66% (ORFA) aa similar to ISPfu4 and 42% (TnpB) aa similar to ISDge8.
References
1] Pfeiffer, F. (2015) Direct submission
2] Siddaramappa, S., Challacombe, J.F., Decastro, R.E., Pfeiffer, F., Sastre, D.E., Gimenez, M.I., Paggi, R.A., Detter, J.C., Davenport, K.W., Goodwin, L.A., Kyrpides, N., Tapia, R., Pitluck, S., Lucas, S., Woyke, T., Maupin-Furlow, J.A. BMC Genomics (2012) 13: 165
2] Siddaramappa, S., Challacombe, J.F., Decastro, R.E., Pfeiffer, F., Sastre, D.E., Gimenez, M.I., Paggi, R.A., Detter, J.C., Davenport, K.W., Goodwin, L.A., Kyrpides, N., Tapia, R., Pitluck, S., Lucas, S., Woyke, T., Maupin-Furlow, J.A. BMC Genomics (2012) 13: 165