ISNarch5
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_CP050695.1 | ND | Natrialbaceae archaeon | Natrialbaceae archaeon XQ-INN 246 strain 2447 |
DNA section
IS Length : 2312 bp
Ends
IR Length : 17/24
IRL : GTAACCGCTCCACGAACCCCATCTGTTGAGCGGATTGTGGTTCCGGGACA
IRR : GTAAGCGATCCGGGAACTGCACCTACTCGGTGGGTTTGAGCGTGTATAGG
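One way to read the 17/24 figure is as 17 identical positions over the first 24 bp of the two ends. That reading is an interpretation, not something the record states. The sketch below checks it, assuming both ends are written 5'→3' reading inward from the IS termini, as they appear above. `count_matches` is a hypothetical helper name.

```python
# Terminal sequences copied from the Ends section of this record.
IRL = "GTAACCGCTCCACGAACCCCATCTGTTGAGCGGATTGTGGTTCCGGGACA"
IRR = "GTAAGCGATCCGGGAACTGCACCTACTCGGTGGGTTTGAGCGTGTATAGG"


def count_matches(left: str, right: str, span: int) -> int:
    """Count identical bases over the first `span` positions (hypothetical helper)."""
    return sum(1 for a, b in zip(left[:span], right[:span]) if a == b)


# Assumption: the IR spans the first 24 bp of each end.
ir_span = 24
ir_matches = count_matches(IRL, IRR, ir_span)
print(f"{ir_matches}/{ir_span}")
```

Under these assumptions the count agrees with the IR Length field.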
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTGGAGCAGC | GGAGCAGC | GGAGCAGCGC | 8 |
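The insertion-site row can be sanity-checked with a short sketch: the direct repeat should close the left flank, open the right flank, and have the stated length. The variable names are illustrative only, and the check assumes the flanks are listed so that each one includes its copy of the repeat.

```python
# Values copied from the Insertion site table of this record.
left_flank = "CTGGAGCAGC"
direct_repeat = "GGAGCAGC"
right_flank = "GGAGCAGCGC"

# The repeat is expected at the 3' end of the left flank
# and at the 5' end of the right flank.
dr_length = len(direct_repeat)
left_ok = left_flank.endswith(direct_repeat)
right_ok = right_flank.startswith(direct_repeat)

print(dr_length, left_ok, right_ok)
```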
DNA sequence
GTAACCGCTCCACGAACCCCATCTGTTGAGCGGATTGTGGTTCCGGGACACTGAGCCTTCCGGATTGATGGGAGGCTTGGAGCATGTGGTTATCCCCAGT
ACGCGCCCGCTGGCTGGGCTATCTGTATCACGCGCACGCCAGCGGCCTGACGTTGAGCGAGTATGCCGCCCGGCAGGATGTCTCGCTTGCCGAGCTGATG
GACTGGGAGCGCCGGCTGCGTGAGGCCGGGATTGCGGTGCCGGAGCGCCACCGCCCGGCACGGTTCGTCGCCGTGGAGGTGGTGGCATGATCCGGCCCGG
GAGCGATGTCGCTGTTTATCTGTGCCGCGAGCCCGTGGATATGCGTAAGTCGATGGACGGATTGTCTTTGCTCGTCCAGGAGGTCATGGAGTGCGACCCG
TTCACCGCGGCGGTGTTCGTGTTCTGCAACCGGGCGCGGGATAAGGTGAAGATCCTATTCTGGGAGCGCAACGGCTTCGTGGTCTGGTATAAGCGCCTCG
AGCAGGAGCGGTTCAAGTGGCCGGAGTGCGGCGAAGGGGATCGGCTCACGCTCTCGGGCCAGGAGCTCAACTGGCTGCTTGACGGCATCGATATCACCCG
CATGCAGCCGCACAAAGCGCTGCATTTTCAGTCGGTTGGATGAAATTTTTGCTCGCCGCACCGGTGTCGTTTTGGTACAATTGTCGGCATGAAACGGGCC
GATACCAACACGTTGCCATCCAGTTCCGACCTCCAGCGCGAGCTCGACGAGCAGCGCGCTCTGGTCGAACGCCTCCAGGCCCAGCTCGCCGAGAAGGAGG
CCGCGTGGGCGGCGGAGAAGCGCTCGCTGTTCGAGCAGATCCGGCTGCTGCTCGATAACCGCTTCGGCCCCTCCACCGAGAAGTACAGCATCAAGCAGCA
GGACATGTTCTTCGATGAGGCCGAGAGCCTGGTGGAAGAGCCCGCCGAGTCTGATGAGACCGACGAGGCGGATGAGGACAACCAGCCAGTCCGCGGTAAG
CGCCGCCGTCGGGGCGGCCGCGCCCCGCTGCCACCGGAGTTGCCCCGCGTGGACATCGTCCACGACCTCCCCGAGGACGAACAGCAGTGCGCCTGTGGCT
GCGGTGCGCTCACCCGCATCGGCGAAGAAGTCACCGAGCAGCTCGACATCATCCCGGCCCAGATCCAGGTGCTGCGCCATGTGCGCATCAAGTACGCCTG
CCGGGCCTGCGAAGACGGCGTCCAGATCGCCGATCTGCCACCGCAGCCGCTGCCAAAGAGCAACGCGAGCCCCGGGCTGCTCGCTTATATCGCCACCGCC
AAGTACCAGGATGCGCTGCCACTGTACCGCCAGGAGCAGGTGTTCAAACGACTGGGCCTGGAGTTGCCACGGAACACGCTCGCCCGCTGGATGGTAGACA
TGGGCGCGCTGCTCGCCCCACTGGCCGAGCGCATGCGCGCCCATCTGCACAATGCGGAACTCATCCACATGGACGAGACCACCGTGCAGGTGAACACCGA
GCCCGGGCGGGCCGCCTCCAGCACCTCGTACATGTGGGTCCAGCGCGGCGGACCACCCGGTGCCGAGGTGGTGCTGTTCGACTACGATCCCAGCCGCTCG
GGCCAGGTGCCGCGCCGTCTGCTGGACGACTACAACGGCATCCTGCTCTCTGATGGCTACGAGGGCTATGCCCAGGTGGTGCGCGACAATGCGATCACTC
ACGCTGGCTGCTGGGCGCATGCGCGCCGCAAGTTCGTTGAGGCCCAGAAAGCCCAGCCCAAGGGCAAGACCGGCAAGGCCGACCGAGCTCTGGCGTCCAT
CGGCAAACTCTACCGTGTGGAGCGCGAGGCACAGGGTCTGCCCGTTGAGGAGCGTGAACGCCTGCGTGCCACGCACAGCCGGCCGCTGATCGAGGATCTG
CGCCAGTGGCTTGACCAGTCCCTGGAGAAGGTGCCGCCGAAGAGCGCCATCGGCAAGGCCGTGCACTACCTCAACAGCCAATGGCCCCGGCTCATCCGCT
TCCTGGAGGATGGCCGCATCCCGCTGGACAACAACCCTGCGGAGAACGCCATCCGGCCGTTCGTGGTGGGGCGCAAGAACTGGCTGTTCAGCCAGACGCC
GAGGGGTGCCCACGCCAGCGCAGCGATCTACAGCGTCATCGAGACGGCCAAGATCAACGGCCTGGAGCCCTACGCGTACCTGCTCGAGGTGTTAAAGAAC
CTGCCGGCGGCGGCCAGCGATGAGGCCATCGACGGCCTGCTGCCGTGGCATCAGGATGAGAGCCTATACACGCTCAAACCCACCGAGTAGGTGCAGTTCC
CGGATCGCTTAC
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
207 | 68 | 84 | 290 | + | No |
AG : IS66 TnpA
ORF sequence :
MWLSPVRARWLGYLYHAHASGLTLSEYAARQDVSLAELMDWERRLREAGIAVPERHRPARFVAVEVVA
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
357 | 118 | 287 | 643 | + | No |
AG : IS66 TnpB
ORF sequence :
MIRPGSDVAVYLCREPVDMRKSMDGLSLLVQEVMECDPFTAAVFVFCNRARDKVKILFWERNGFVVWYKRLEQERFKWPECGEGDRLTLSGQELNWLLDG
IDITRMQPHKALHFQSVG
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1578 | 525 | 713 | 2290 | + | No |
Chemistry : DDE
ORF sequence :
LPSSSDLQRELDEQRALVERLQAQLAEKEAAWAAEKRSLFEQIRLLLDNRFGPSTEKYSIKQQDMFFDEAESLVEEPAESDETDEADEDNQPVRGKRRRR
GGRAPLPPELPRVDIVHDLPEDEQQCACGCGALTRIGEEVTEQLDIIPAQIQVLRHVRIKYACRACEDGVQIADLPPQPLPKSNASPGLLAYIATAKYQD
ALPLYRQEQVFKRLGLELPRNTLARWMVDMGALLAPLAERMRAHLHNAELIHMDETTVQVNTEPGRAASSTSYMWVQRGGPPGAEVVLFDYDPSRSGQVP
RRLLDDYNGILLSDGYEGYAQVVRDNAITHAGCWAHARRKFVEAQKAQPKGKTGKADRALASIGKLYRVEREAQGLPVEERERLRATHSRPLIEDLRQWL
DQSLEKVPPKSAIGKAVHYLNSQWPRLIRFLEDGRIPLDNNPAENAIRPFVVGRKNWLFSQTPRGAHASAAIYSVIETAKINGLEPYAYLLEVLKNLPAA
ASDEAIDGLLPWHQDESLYTLKPTE
Blast result :
Comments : 78% similar to ISAeh1 transposase
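The ORF lengths can be cross-checked against the Begin and End coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that each span ends with a stop codon that is not counted in the amino acid length. Both are assumptions about how the record is laid out, and `implied_lengths` is a hypothetical helper.

```python
# (name, begin, end) copied from the ORF tables of this record.
orfs = [("ORF 1", 84, 290), ("ORF 2", 287, 643), ("ORF 3", 713, 2290)]


def implied_lengths(begin: int, end: int) -> tuple[int, int]:
    """Return (nucleotides, amino acids) implied by an inclusive 1-based span.

    Assumes the span includes a terminal stop codon (hypothetical helper).
    """
    nt = end - begin + 1
    codons, remainder = divmod(nt, 3)
    if remainder:
        raise ValueError("span is not a whole number of codons")
    return nt, codons - 1  # drop the stop codon


orf_lengths = {name: implied_lengths(begin, end) for name, begin, end in orfs}
for name, (nt, aa) in orf_lengths.items():
    print(f"{name}: {nt} bp, {aa} aa")
```

A mismatch between these implied values and a table entry would point to a transcription error in either the coordinates or the length.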
Comments
The ISNarch5 transposase shares 97% amino acid similarity with the ISNarch2 transposase.
References
[1] Sonbol, S. (2020) Direct submission.
[2] Xue, Q. (2020) Direct GenBank submission.