ISNarch6
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_CP050695.1 | ND | Natrialbaceae archaeon | Natrialbaceae archaeon 2447; Salinadaptatus halalkaliphilus 2447 |
DNA section
IS Length : 2314 bp
Ends
IR Length : 26/36
IRL : GTAACCGCTCCACGAACCCCATCTGTTGAGTCGATTCCGGTTCCGGGACA
IRR : GTAAGCGATCCGGGAACCCCAACTACTCAGCGGATTTGAGCGCGTATAGG
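The terminal repeat figures can be cross-checked with a short script. This is a minimal sketch, not part of the original record. It assumes that "26/36" means 26 identical positions in an ungapped comparison of the first 36 nt of IRL and IRR, and that IRR is written as the reverse complement of the right end of the element. `RIGHT_END` is the last 50 nt of the DNA sequence given in this entry; the function names are illustrative.

```python
# Terminal inverted repeats as listed in this record.
IRL = "GTAACCGCTCCACGAACCCCATCTGTTGAGTCGATTCCGGTTCCGGGACA"
IRR = "GTAAGCGATCCGGGAACCCCAACTACTCAGCGGATTTGAGCGCGTATAGG"

# Last 50 nt of the DNA sequence in this record (top strand).
RIGHT_END = "CCTATACGCGCTCAAATCCGCTGAGTAGTTGGGGTTCCCGGATCGCTTAC"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def identical_positions(a: str, b: str, length: int) -> int:
    """Count identical bases over the first `length` positions (no gaps)."""
    return sum(x == y for x, y in zip(a[:length], b[:length]))


# Assumption: IRR is reported as the reverse complement of the right end.
irr_from_sequence = reverse_complement(RIGHT_END)

# Assumption: "26/36" is matches / compared length.
matches = identical_positions(IRL, IRR, 36)

print(irr_from_sequence == IRR)
print(f"{matches}/36")
```

Under these assumptions the comparison reproduces the reported 26/36, and the listed IRR agrees with the right end of the sequence.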
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCAATTGAAC | AATTGAAC | AATTGAACAG | 8 |
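The direct repeat entry can be checked the same way. The sketch below is illustrative only. It assumes that each 10-nt flank as listed includes its copy of the target duplication, so the repeat should end the left flank and begin the right flank. The function name is hypothetical.

```python
# Values copied from the insertion site table of this record.
left_flank = "GCAATTGAAC"
direct_repeat = "AATTGAAC"
right_flank = "AATTGAACAG"


def flanks_carry_repeat(left: str, repeat: str, right: str) -> bool:
    """True if the repeat ends the left flank and starts the right flank."""
    return left.endswith(repeat) and right.startswith(repeat)


dr_ok = flanks_carry_repeat(left_flank, direct_repeat, right_flank)
dr_length = len(direct_repeat)

print(dr_ok, dr_length)
```

Both flanks carry the same 8-nt sequence, which agrees with the DR length in the table.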
DNA sequence
GTAACCGCTCCACGAACCCCATCTGTTGAGTCGATTCCGGTTCCGGGACACTGAGCCTTCCAGATGGATGGGAGGCCTGGAGTATGTGGTTATCCCCGGT
ACGAGCCCGCTGGTTGGGCCATTTGTATCACGCCCACGCATTGGGTTTGTCACTGAGTGAGTATGCCCGGCGCCAGGATGTCTCGCTGGCCGAGCTGATG
GACTGGGAGCGCCGGCTGCATGAGGCTGGGGTTCCGGTTCCGGAGCGTCACCGTCCCGCACGGTTCGTCGCCGTGGAGGTGGTGGCATGATCCGGCCCGG
GAGCGATGTGGCGGTGTATCTGTGCCGCGAGCCCGTGGACATGCGCAAGTCCATCGACGGGCTCTCGCTGCTCGTCCAAGAAGTCATGGAGTGCGATCCG
TTCACCGCGGCGGTGTTCGTGTTCTGCAACCGGGCGCAGGATAAGGTGAAGATCCTGTTCTGGGAGTGCAACGGCTTCGTGGTCTGGTACAAGCGCCTCG
AGCAGGAGCGGTTCAAGTGGCCAGTGTGCGGTGAGCAGGAGCGTCTGACACTCTCCGGCCAGGAGCTCAACTGGCTGCTCGATGGCATCGACATTAGCCG
CATGCGACCACACAAAGCGCTGCGTTATCAGTCGGTTGGCTGACATTTTGGTCGTCTCTCTAGTGCCTTTTTGGTACAATTACCGGCATGAAACGGGCCG
ATACCAACACGTTGCCATCCAGTTCCGATCTCCAGCGCGAGCTCGACGAACAGCGCGCTCTGGTCGAACGCCTCCAGGCCCAACTCGCCGAGAAGGAGGC
CGCGTGGGCGGCAGAGAAGCGCTCGCTGTTCGAGCAAATCCGGCTGCTGCTGGACAACCGCTTCGGCCCCTCCACCGAGAAGTACAGCATCAAGCAGCAG
GACCTGTTCTTCGACGAGGCCGAGAGCCTGGTGGAAGAGCCCGCCGAGTCAGGTGAGACTGCCGAGGCGGAAGAGGAAAACCAGCCGGCCCCCAGTGGCG
GCAAGCGTCGCCGGGGTGGGCGCGCCCCACTGCCGCCGGAGCTGCCTCGCGTGGACATCGTCCACGATCTCCCCGAGGACGAACAGCAGTGCGCCTGCGG
CTGCGGTGCGCTCACCCGCATCGGCGAAGAGGTCACCGAGCAGCTCGACATCATCCCGGCCCAGATCCAGGTGCTGCGCCATGTGCGCATCAAGTACGCC
TGCCGGGCCTGCGAGGACGGTGTCCAGATCGCCGATCTGCCGCCGCAGCCGCTGCCGAAGAGCAACGCGAGCCCCGGACTGCTTGCCTATATCGCCACCG
CCAAGTACCAGGACGCGCTGCCGCTGTACCGCCAGGAGCAGGTCTTCAAACGACTGGGCCTGGAGCTGCCACGGAACACGCTCGCCCGCTGGATGGTGGA
GATGGGCACGTTGCTCGCCCCACTGGCCGAGCGCCTGCGCGCCCATCTGCAAGGCGCGGAACTCATCCACATGGACGAGACGACCGTGCAGGTGAACACC
GAGCCCGGGCGTGCTGCCTCCAGCACCTCGTACATGTGGGTCCAGCGCGGCGGACCACCCGGTGCCGAGGTGGTGCTGTTCGACTACGACCCCAGCCGCT
CGGGCCAGGTCCCGCGCCGCCTGCTGGATGACTACGACGGCATCCTGCTCACCGACGGTTACGAGGGCTATGCCCAGGTGGTGCGCGACAATGCGATCAC
TCACGCTGGCTGCTGGGCGCACGCCAGGCGGCAGTTCGTCGAGGCCCAGAAAGCCCAGCCCAAGGGCAAGACCGGGAAGGCGGACCGAGCGCTGGCGTCC
ATCGGCAAACTCTACCGCGTGGAACGCGAGGCGCAGGGCCTGCCCGTTGAGGAGCGTGAACGCCTGCGCGCCACGCACAGCCAGCCGCTGATCAAGGATC
TGCGCCAGTGGCTTGACCAGTCACTGGAGAAGGTGCCGCCGAAGACCGCCATCGGCAAAGCCGTGCACTACCTCAACAGCCAGTGGCCCCGGCTCATCCG
GTTCCTGGACGATGGGCGCATCCCGCTGGACAACAACCCCGCGGAGAACGCCATCCGGCCGTTCGTGGTGGGGCGCAAGAACTGGCTTTTTAGCCACACA
CAGCGGGGCGCCCACGCCAGCGCCACGATCTACAGCGTCATCGAGACGGCCAAGATCAACGGCCTGGAGCCCTACGCGTACCTGCTCGAGGTGTTAAAGA
ACCTGCCGGCGGCGACAACCGACGAGGCCATCACGGATCTGCTGCCGTGGAACCAGGATGAGAGCCTATACGCGCTCAAATCCGCTGAGTAGTTGGGGTT
CCCGGATCGCTTAC
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
207 | 68 | 84 | 290 | + | No |
AG : IS66 TnpA
ORF sequence :
MWLSPVRARWLGHLYHAHALGLSLSEYARRQDVSLAELMDWERRLHEAGVPVPERHRPARFVAVEVVA
Blast result :
Comments : hypothetical protein
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
357 | 118 | 287 | 643 | + | No |
AG : IS66 TnpB
ORF sequence :
MIRPGSDVAVYLCREPVDMRKSIDGLSLLVQEVMECDPFTAAVFVFCNRAQDKVKILFWECNGFVVWYKRLEQERFKWPVCGEQERLTLSGQELNWLLDG
IDISRMRPHKALRYQSVG
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1605 | 534 | 688 | 2292 | + | No |
Chemistry : DDE
ORF sequence :
MKRADTNTLPSSSDLQRELDEQRALVERLQAQLAEKEAAWAAEKRSLFEQIRLLLDNRFGPSTEKYSIKQQDLFFDEAESLVEEPAESGETAEAEEENQP
APSGGKRRRGGRAPLPPELPRVDIVHDLPEDEQQCACGCGALTRIGEEVTEQLDIIPAQIQVLRHVRIKYACRACEDGVQIADLPPQPLPKSNASPGLLA
YIATAKYQDALPLYRQEQVFKRLGLELPRNTLARWMVEMGTLLAPLAERLRAHLQGAELIHMDETTVQVNTEPGRAASSTSYMWVQRGGPPGAEVVLFDY
DPSRSGQVPRRLLDDYDGILLTDGYEGYAQVVRDNAITHAGCWAHARRQFVEAQKAQPKGKTGKADRALASIGKLYRVEREAQGLPVEERERLRATHSQP
LIKDLRQWLDQSLEKVPPKTAIGKAVHYLNSQWPRLIRFLDDGRIPLDNNPAENAIRPFVVGRKNWLFSHTQRGAHASATIYSVIETAKINGLEPYAYLL
EVLKNLPAATTDEAITDLLPWNQDESLYALKSAE
Blast result :
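The ORF coordinates above can be checked for internal consistency. This is a minimal sketch under two assumptions not stated in the record: Begin and End are 1-based inclusive positions on the IS, and each span includes its stop codon, so the protein length is one less than the codon count. The names `ORFS` and `orf_summary` are illustrative.

```python
# (name, begin, end, reported length in aa), copied from the ORF tables.
ORFS = [
    ("ORF 1", 84, 290, 68),
    ("ORF 2", 287, 643, 118),
    ("ORF 3", 688, 2292, 534),
]


def orf_summary(begin: int, end: int) -> tuple[int, int, int]:
    """Return (length in bp, length in aa, leftover bases).

    Assumes 1-based inclusive coordinates and a span that includes
    the stop codon, hence codons - 1 amino acids.
    """
    length_bp = end - begin + 1
    codons, remainder = divmod(length_bp, 3)
    return length_bp, codons - 1, remainder


summaries = {name: orf_summary(begin, end) for name, begin, end, _ in ORFS}

# Number of bases shared by the end of ORF 1 and the start of ORF 2.
overlap_bp = ORFS[0][2] - ORFS[1][1] + 1

for name, begin, end, reported_aa in ORFS:
    length_bp, length_aa, remainder = summaries[name]
    print(name, length_bp, length_aa, remainder, length_aa == reported_aa)
print("ORF 1 / ORF 2 overlap:", overlap_bp, "bp")
```

With these assumptions all three spans are whole multiples of three and match the reported lengths, and ORF 1 and ORF 2 overlap by 4 bp (positions 287 to 290).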
Comments
The ISNarch6 transposase shares 98% amino acid similarity with that of ISNarch2.
References
1] Sarah Sonbol (2020) Direct submission.
2] Xue,Q. (2020) Direct GenBank submission.