ISNarch3
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_CP050695.1 | ND | Natrialbaceae archaeon | Natrialbaceae archaeon 2447 |
DNA section
IS Length : 2965 bp
Ends
IR Length : 29/34
IRL : TGCGTATTCCGGCCCATCGTGACCGCTCATTCCGATTGATCGTGACCGCT
IRR : TGCGTATTCCGGTGAACGTGACCGCTGATTCCGCTGTTCGTGACCGGTTC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GTTTGCCCTG | CCCTG | CCCTGCCCGA | 5 |
DNA sequence
TGCGTATTCCGGCCCATCGTGACCGCTCATTCCGATTGATCGTGACCGCTGTTTCCGGCGCATCGTGACCGGCGATTCCGGCCGGAGCGTGACCGATTTT
GGCCCTGGCGCCGGAATCATTGGTCACGGTCCCGGAATCGCTGGTCACGATGCCGGAATCGGCGTGGTGTGCGCTGGTAATTCCCTTTGACTGCAGGCAT
ATAGCTCTCTTTTCACTGTTCCCGGGGAGAGGGCATGCCGGCACCGAGGATTACCATGCGACAACTCAGCGAGATTCTGCGTCTCAAACACCAGGCCGGA
CTCAGCCACGCCCGCATCGCCGCCGTCCTGGATCTCTCCAAGGGCGTGGTCAACAAGTACGTCAGCCTCGCCCGTGCCAAGGGCGTGGGCTGGCCCCTGC
CCGAGGGCATGGACGAGGCCGCGCTGGAGCGGCTGCTGTTCCCCGCCCCAGTGCTGCCCGGGCGGGGGGCGGAGCCGGACCACAACACCATCCATCAGGA
GCTCAAGCGCAAGGAGGTGACGCTGCAGTTGCTCTGGGCGGAGTACGCGGCCGCCCACGGGGCGGGGGCGCTTCGCTACAGCCAGTTCTGCGAGCGCTAC
CGGCGCTGGCGCGCCCTGCAGCGGCGGTCCATGCGCCAGCAGCACCGGGCCGGGGAGAAGCTGTTCATCGACTACTGCGGCCCCACGGTCCCAGTGATCG
ACCCGGGTACCGGGGAGATCCACCCCGCGGCGATCTTCGTCGCCGTGTGGGGCGCGTCGAGCTACACCTATGCCGAGGCCACACGCACCCAGTCGCTGCC
CGACTGGATCGCCTCCCATCAGCGGGCGCTCACGTACTTCGGGGGCTGTCCCATGCTGCTGGTCCCCGACAACCTCAAAGCCGCCGTCACCCGGGCCTCG
CGTTACACCCCGGAGGTCAACGACACCTATATGGAGATGGCCCGGCACTACGGGATGGCGGTGCTGCCGGCACGGCCCTACAAGCCCCGGGACAAGGCCA
AGGTCGAGGTGGCCGTGCAGGTGGTCGAGCGCTGGATCCTGGCGCGGCTGCGCCACCACACGTTCCACTCCCTGGCCGCGCTCAACGCCGCCATCAAGGA
TCTGCTCGTCGAACTCAACGAACGCCGCCTCCAGCGCCAGCCCCACAGCCGCCGGGAGCTGTTCGAGCGCCTCGACCGCCCCGCCATGCAGCCCCTGCCC
GCAGAGCCCTACGTCTACGCCGAGTGGAAGTACGTCAAACCCGGCATCGACTACCACATCGAGATCGATCGCCGGTTCTACTCCGTCCCCCACGCCCTGG
TCGGCCACCGGATCGAGGCGAGGGTCACCGCCACCACGGTGGAGGTCTACCACAAGGGTCAGCGCGTCGCCGTTCACGCCCGCCACGGCACGGGGGCCCA
CAGCACCCTGGCCGAGCACATGCCCGCCGCCCACCGCAAGCACCAGCAGTGGACCCCGGGGCGGTTCCTCAACTGGGCGCAGGCCATCGGCCCGGCAACA
CGCCAGGTCATCCGCGCCCAGCTCGAAGGCCGCCCCCATCCCGAGCACGGCTACCGCGCCTGCCTCGGGCTGCTCAACCTCGCTCGCCACTACGGCAACG
CCCGCCTGGAGGCCGCATGTGCCAGGGCGGTGCACATCGGCTCGCCGGGCTACCGCAGCATCAAGTCGATCCTCAAAAGCGGCCTCGATCGCGCCACTGT
GGAGGCGACCACTGAGACCCAGGCCCTGCCGCTGCACGCCAACGTCCGCGGCCCCGGCTACTACCACTGACCTCGCACCAAGAGACACCGATCATGCTCA
ACGAACCGACTCTGGAGAAACTCCAGGCGCTGAAGCTCACCGGCATGCGCGAGGCCCTCGAAGACCAGATCGCCCAACCGGCCACGCAGGAGCTCGCCTT
CGAGGAACGCCTTGCGCTGCTGCTCGACCGCGAGATCCTCGCCCGTGACAACCGCCGACTGACGCGGCTGCTCAAGGCCGCCCGGCTGCGTATCCCCGGC
GCCTGCCTGGAGGACGTCGACTACCGCCACGCCCGCGGGCTGCAGCGCCCGCAGATGGCCCAGCTCGGCTCCTGCCAGTGGATCCACCACAAGCAGAACC
TGCTGCTCACCGGCCCCACCGGCACCGGCAAGACCTACCTCGCCTGCGCGCTCGGCAACCAGGCCTGTCGTCAGGGCCTGTCCACGCGCTACGTCCGCCT
CCCGCGGCTGCTCGAGGCCATCCACATCGCCCACGCCGACGGCTCCTACCCGCGGCTGATGCAGCAGCTCGCACGCACGGATCTGCTCATCCTTGACGAC
TGGGCCATCGCGCCGCTGACCGCCAGCCAGCGCCAGGACCTCATGGAGCTCATCGAGGACCGCCACGGCCTGCGATCGACACTCATCGCCAGCCAGCTCC
CCGTCGAGCACTGGCACGACTACCTCGGCGAGCCCACGCTCGCCGACGCCATCCTCGACCGGCTGCTGCACAACGCCCACCGCCTGCCCATGAAAGGCGC
GTCCATGCGCCAGACCACAACCATCGCGGAGGCCGACCAATGAACACCTGGGAACCCGGCCTGTCGCGCAACACGCGCTTCCATCTGCGCCTTGGTGAAC
GGCGCACCACCGTCACCCTGGATACACTGCTATCCAGCTACCTCGCCATCCGCCTGGGTCTTGAGCCCGAAACACCCCAGGCGCATCAGGCCGTGCGGCG
CTGGCTGCAGCACCGCCTCGATGAGCACAACGACCCCGGCCGCGTTGCCGTTAGCCAGTGGCTCCAGCGCGAGGTCCTCACCGTCGTGGCGGATACAAAA
CTATCCACCCACTACGCCAACTGGCTCCTCGACGGCACGCCTCCACCACCGGTCGCGCTTGACCCATCGTGACCGATCACGCTCAATGAACACGGCCCAG
CGCACACCGCGCCGCGAACCGGTCACGAACAGCGGAATCAGCGGTCACGTTCACCGGAATACGCA
GGCCCTGGCGCCGGAATCATTGGTCACGGTCCCGGAATCGCTGGTCACGATGCCGGAATCGGCGTGGTGTGCGCTGGTAATTCCCTTTGACTGCAGGCAT
ATAGCTCTCTTTTCACTGTTCCCGGGGAGAGGGCATGCCGGCACCGAGGATTACCATGCGACAACTCAGCGAGATTCTGCGTCTCAAACACCAGGCCGGA
CTCAGCCACGCCCGCATCGCCGCCGTCCTGGATCTCTCCAAGGGCGTGGTCAACAAGTACGTCAGCCTCGCCCGTGCCAAGGGCGTGGGCTGGCCCCTGC
CCGAGGGCATGGACGAGGCCGCGCTGGAGCGGCTGCTGTTCCCCGCCCCAGTGCTGCCCGGGCGGGGGGCGGAGCCGGACCACAACACCATCCATCAGGA
GCTCAAGCGCAAGGAGGTGACGCTGCAGTTGCTCTGGGCGGAGTACGCGGCCGCCCACGGGGCGGGGGCGCTTCGCTACAGCCAGTTCTGCGAGCGCTAC
CGGCGCTGGCGCGCCCTGCAGCGGCGGTCCATGCGCCAGCAGCACCGGGCCGGGGAGAAGCTGTTCATCGACTACTGCGGCCCCACGGTCCCAGTGATCG
ACCCGGGTACCGGGGAGATCCACCCCGCGGCGATCTTCGTCGCCGTGTGGGGCGCGTCGAGCTACACCTATGCCGAGGCCACACGCACCCAGTCGCTGCC
CGACTGGATCGCCTCCCATCAGCGGGCGCTCACGTACTTCGGGGGCTGTCCCATGCTGCTGGTCCCCGACAACCTCAAAGCCGCCGTCACCCGGGCCTCG
CGTTACACCCCGGAGGTCAACGACACCTATATGGAGATGGCCCGGCACTACGGGATGGCGGTGCTGCCGGCACGGCCCTACAAGCCCCGGGACAAGGCCA
AGGTCGAGGTGGCCGTGCAGGTGGTCGAGCGCTGGATCCTGGCGCGGCTGCGCCACCACACGTTCCACTCCCTGGCCGCGCTCAACGCCGCCATCAAGGA
TCTGCTCGTCGAACTCAACGAACGCCGCCTCCAGCGCCAGCCCCACAGCCGCCGGGAGCTGTTCGAGCGCCTCGACCGCCCCGCCATGCAGCCCCTGCCC
GCAGAGCCCTACGTCTACGCCGAGTGGAAGTACGTCAAACCCGGCATCGACTACCACATCGAGATCGATCGCCGGTTCTACTCCGTCCCCCACGCCCTGG
TCGGCCACCGGATCGAGGCGAGGGTCACCGCCACCACGGTGGAGGTCTACCACAAGGGTCAGCGCGTCGCCGTTCACGCCCGCCACGGCACGGGGGCCCA
CAGCACCCTGGCCGAGCACATGCCCGCCGCCCACCGCAAGCACCAGCAGTGGACCCCGGGGCGGTTCCTCAACTGGGCGCAGGCCATCGGCCCGGCAACA
CGCCAGGTCATCCGCGCCCAGCTCGAAGGCCGCCCCCATCCCGAGCACGGCTACCGCGCCTGCCTCGGGCTGCTCAACCTCGCTCGCCACTACGGCAACG
CCCGCCTGGAGGCCGCATGTGCCAGGGCGGTGCACATCGGCTCGCCGGGCTACCGCAGCATCAAGTCGATCCTCAAAAGCGGCCTCGATCGCGCCACTGT
GGAGGCGACCACTGAGACCCAGGCCCTGCCGCTGCACGCCAACGTCCGCGGCCCCGGCTACTACCACTGACCTCGCACCAAGAGACACCGATCATGCTCA
ACGAACCGACTCTGGAGAAACTCCAGGCGCTGAAGCTCACCGGCATGCGCGAGGCCCTCGAAGACCAGATCGCCCAACCGGCCACGCAGGAGCTCGCCTT
CGAGGAACGCCTTGCGCTGCTGCTCGACCGCGAGATCCTCGCCCGTGACAACCGCCGACTGACGCGGCTGCTCAAGGCCGCCCGGCTGCGTATCCCCGGC
GCCTGCCTGGAGGACGTCGACTACCGCCACGCCCGCGGGCTGCAGCGCCCGCAGATGGCCCAGCTCGGCTCCTGCCAGTGGATCCACCACAAGCAGAACC
TGCTGCTCACCGGCCCCACCGGCACCGGCAAGACCTACCTCGCCTGCGCGCTCGGCAACCAGGCCTGTCGTCAGGGCCTGTCCACGCGCTACGTCCGCCT
CCCGCGGCTGCTCGAGGCCATCCACATCGCCCACGCCGACGGCTCCTACCCGCGGCTGATGCAGCAGCTCGCACGCACGGATCTGCTCATCCTTGACGAC
TGGGCCATCGCGCCGCTGACCGCCAGCCAGCGCCAGGACCTCATGGAGCTCATCGAGGACCGCCACGGCCTGCGATCGACACTCATCGCCAGCCAGCTCC
CCGTCGAGCACTGGCACGACTACCTCGGCGAGCCCACGCTCGCCGACGCCATCCTCGACCGGCTGCTGCACAACGCCCACCGCCTGCCCATGAAAGGCGC
GTCCATGCGCCAGACCACAACCATCGCGGAGGCCGACCAATGAACACCTGGGAACCCGGCCTGTCGCGCAACACGCGCTTCCATCTGCGCCTTGGTGAAC
GGCGCACCACCGTCACCCTGGATACACTGCTATCCAGCTACCTCGCCATCCGCCTGGGTCTTGAGCCCGAAACACCCCAGGCGCATCAGGCCGTGCGGCG
CTGGCTGCAGCACCGCCTCGATGAGCACAACGACCCCGGCCGCGTTGCCGTTAGCCAGTGGCTCCAGCGCGAGGTCCTCACCGTCGTGGCGGATACAAAA
CTATCCACCCACTACGCCAACTGGCTCCTCGACGGCACGCCTCCACCACCGGTCGCGCTTGACCCATCGTGACCGATCACGCTCAATGAACACGGCCCAG
CGCACACCGCGCCGCGAACCGGTCACGAACAGCGGAATCAGCGGTCACGTTCACCGGAATACGCA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1536 bp | 512 aa | 235 | 1770 | + | No |
Chemistry : DDE
ORF sequence :
MPAPRITMRQLSEILRLKHQAGLSHARIAAVLDLSKGVVNKYVSLARAKGVGWPLPEGMDEAALERLLFPAPVLPGRGAEPDHNTIHQELKRKEVTLQLL
WAEYAAAHGAGALRYSQFCERYRRWRALQRRSMRQQHRAGEKLFIDYCGPTVPVIDPGTGEIHPAAIFVAVWGASSYTYAEATRTQSLPDWIASHQRALT
YFGGCPMLLVPDNLKAAVTRASRYTPEVNDTYMEMARHYGMAVLPARPYKPRDKAKVEVAVQVVERWILARLRHHTFHSLAALNAAIKDLLVELNERRLQ
RQPHSRRELFERLDRPAMQPLPAEPYVYAEWKYVKPGIDYHIEIDRRFYSVPHALVGHRIEARVTATTVEVYHKGQRVAVHARHGTGAHSTLAEHMPAAH
RKHQQWTPGRFLNWAQAIGPATRQVIRAQLEGRPHPEHGYRACLGLLNLARHYGNARLEAACARAVHIGSPGYRSIKSILKSGLDRATVEATTETQALPL
HANVRGPGYYH
WAEYAAAHGAGALRYSQFCERYRRWRALQRRSMRQQHRAGEKLFIDYCGPTVPVIDPGTGEIHPAAIFVAVWGASSYTYAEATRTQSLPDWIASHQRALT
YFGGCPMLLVPDNLKAAVTRASRYTPEVNDTYMEMARHYGMAVLPARPYKPRDKAKVEVAVQVVERWILARLRHHTFHSLAALNAAIKDLLVELNERRLQ
RQPHSRRELFERLDRPAMQPLPAEPYVYAEWKYVKPGIDYHIEIDRRFYSVPHALVGHRIEARVTATTVEVYHKGQRVAVHARHGTGAHSTLAEHMPAAH
RKHQQWTPGRFLNWAQAIGPATRQVIRAQLEGRPHPEHGYRACLGLLNLARHYGNARLEAACARAVHIGSPGYRSIKSILKSGLDRATVEATTETQALPL
HANVRGPGYYH
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
750 bp | 249 aa | 1794 | 2543 | + | No |
AG : IS21 helper
ORF sequence :
MLNEPTLEKLQALKLTGMREALEDQIAQPATQELAFEERLALLLDREILARDNRRLTRLLKAARLRIPGACLEDVDYRHARGLQRPQMAQLGSCQWIHHK
QNLLLTGPTGTGKTYLACALGNQACRQGLSTRYVRLPRLLEAIHIAHADGSYPRLMQQLARTDLLILDDWAIAPLTASQRQDLMELIEDRHGLRSTLIAS
QLPVEHWHDYLGEPTLADAILDRLLHNAHRLPMKGASMRQTTTIAEADQ
QNLLLTGPTGTGKTYLACALGNQACRQGLSTRYVRLPRLLEAIHIAHADGSYPRLMQQLARTDLLILDDWAIAPLTASQRQDLMELIEDRHGLRSTLIAS
QLPVEHWHDYLGEPTLADAILDRLLHNAHRLPMKGASMRQTTTIAEADQ
Blast result :
Comments
ISNarch3 is 68% aa (transposase) similar to ISAzo4.
References
1] Sarah Sonbol (2020) Direct submission.
2] Xue,Q. (2020) Direct GenBank submission.
2] Xue,Q. (2020) Direct GenBank submission.