ISNarch2
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_CP050695.1 | ND | Natrialbaceae archaeon | Natrialbaceae archaeon XQ-INN 246 strain 2447 |
DNA section
IS Length : 2315 bp
Ends
IR Length : 22/24
IRL : GTAAGCGCTCCACGAACCCCATCTGTTGAGTCGATTCCAGTTCCGCGACA
IRR : GTAAGCGCTCCGCGAACCCCACCTACTCAGCGGGTTTGAGCGCGTATAGG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGGAAGCCCT | GAAGCCCT | GAAGCCCTTG | 8 |
CCGTTCGTCG | GTTCGTCG | GTTCGTCGTC | 8 |
ATCTGATTTT | GGCTTCGGTG | 0 | |
AGCACCCGGC | CACCCGGC | CACCCGGCGG | 8 |
DNA sequence
GTAAGCGCTCCACGAACCCCATCTGTTGAGTCGATTCCAGTTCCGCGACACTGAGCCTTCCAGATTGATGGGAGGCTTGGAGCATGTGGTTATCCCCGGT
ACGCGCCCGCTGGCTGGGTCATTTGTATCACGCCCATGCCTTGGGGTTGTCGCTGAGTGAGTATGCGCGGCGCCAGGATGTCTCGCTCGCCGAGTTGATG
GACTGGGAGCGCCGGCTGCATGAGGCCGGAGTTCCGGTTCCGGAGCGTCACCGTCCCGCACGGTTCGTGGCCGTGGAGGTGGTGGCATGATCCGGCCCGG
GACGGATGTGGCGGTGTATCTGTGCCGCGAGCCCGTGGACATGCGCAAGTCGATTGACGGTTTGTCGCTGCTCGTCCAGGAGGTCATGGCGTGCGATCCG
TTCACCGCGGCGGTGTTCGTGTTCTGCAACCGGGCGCGGGATAAGGTGAAGATCCTGTTCTGGGAGCGCAACGGCTTTGTGGTCTGGTACAAACGCCTCG
AGCAGGAGCGGTTCAAGTGGCCGGCTTGCGGTGAGCGGGAGCGGCTCACGCTCTCGGGCCAGGAGCTCAACTGGTTGCTCGACGGCATCGACATCACCCG
TATGCAGCCGCACAAAGCGTTGCATTTTCAGTCGGTTGGATGAAATTTTTGCTCGCCGCACCGGTGGCGTTTTGGTACAATTACCCACATGAAACGGGCC
GATACCAACACGTTGCCATCCAGTTCCGATCTCCAGCGCGAGCTCGATGAGCAGCGCGCTCTGGTCGAACGCCTCCAGGCCCAACTCGCCGAGAAGGAGG
CCGCGTGGGCGGCAGAGAAGCGCTCGCTGTTCGAGCAAATCCGGCTGCTGCTGGACAACCGCTTCGGCCCCTCCACCGAGAAGTACAGCATCAAGCAGCA
GGACCTGTTCTTCGACGAGGCCGAGAGCCTGGTGGAAGAGCCCGCCGAGTCAGGTGAGACTGCCGAGGCGGAAGAGGAAAACCAGCCGGCCCCCAGTGGC
GGCAAGCGTCGCCGGGGTGGGCGCGCCCCACTGCCGCCGGAGCTGCCTCGCGTGGACATCGTCCACGATCTCCCCGAGGACGAACAGCAGTGTGCCTGCG
GCTGCGGTGCGCTCACCCGCATCGGCGAAGAGGTCACCGAGCAGCTCGACATCATCCCGGCCCAGATCCAGGTGCTGCGCCATGTGCGCATCAAGTACGC
CTGCCGGGCCTGCGAGGACGGTGTCCAGATCGCCGATCTGCCGCCGCAGCCGCTGCCGAAGAGCAACGCGAGCCCCGGACTGCTTGCCTATATCGCCACC
GCCAAGTACCAGGACGCGCTGCCGCTGTACCGCCAGGAGCAGGTCTTCAAACGACTGGGCCTGGAGCTGCCACGGAACACGCTCGCCCGCTGGATGGTGG
ACCTGGGTGCGTTGCTCGCGCCACTGGCCGAGCGCATGCGCGCCCATTTGCACAGTGCGGAGCTCATCCACATGGATGAGACCACCGTGCAGGTGAACAC
CGAGCCCGGGCGGGCCGCCTCCAGCACCTCGTACATGTGGGTCCAACGCGGCGGGCCGCCCGGAGCCGAGGTGGTGCTGTTCGACTACGATCCCAGCCGC
TCGGGCCAGGTCCCGCGGCGCCTGCTGGATGACTATGGTGGTATCCTGCTCACCGACGGCTACGAGGGCTATGCCCAGGTCGTGCGCGATAATGCCATCA
CCCATGCCGGGTGCTGGGCGCATGCGCGCCGCAAGTTCAAAGAGGCCCAGAAGGTCCAGCCCAAGGGCAAGACCGGCAAGGCCGACCGGGCGCTGGCGTC
CATCGGCAAGCTCTACCGGGTGGAGCGCGAAGCCCAGGGCCTGCCCGTTGAGAAGCGTGAACGCCTGCGCGCCACGCACAGCCGGCCGCTGATCGAGGAT
CTGCGCCAGTGGCTTGACCAGTCCCTGGAGAAGGTGCCGCCGAAGAGCGCCATCGGCAAGGCCGTGCACTACCTCAACAGCCAATGGCCCCGGCTCATCC
GCTTCCTGGAGGATGGCCGCATCCCGCTGGACAACAACCCCGCGGAGAACGCCATTCGGCCGTTCGTGGTGGGGCGCAAAAACTGGCTGTTCAGTCAGAC
GCCGCGGGGTGCGCACGCCAGTGCCACGATCTACAGCGTCATCGAGACGGCCAAGATCAACGGCCTGGAGCCCTACGCGTACCTGCTCGAGGTGTTAAAG
AACCTGCCGGGCGCGACAACCGGCGAGGCCATCGACCGACTGCTGCCGTGGCATCAGGACGAGAGCCTATACGCGCTCAAACCCGCTGAGTAGGTGGGGT
TCGCGGAGCGCTTAC
ACGCGCCCGCTGGCTGGGTCATTTGTATCACGCCCATGCCTTGGGGTTGTCGCTGAGTGAGTATGCGCGGCGCCAGGATGTCTCGCTCGCCGAGTTGATG
GACTGGGAGCGCCGGCTGCATGAGGCCGGAGTTCCGGTTCCGGAGCGTCACCGTCCCGCACGGTTCGTGGCCGTGGAGGTGGTGGCATGATCCGGCCCGG
GACGGATGTGGCGGTGTATCTGTGCCGCGAGCCCGTGGACATGCGCAAGTCGATTGACGGTTTGTCGCTGCTCGTCCAGGAGGTCATGGCGTGCGATCCG
TTCACCGCGGCGGTGTTCGTGTTCTGCAACCGGGCGCGGGATAAGGTGAAGATCCTGTTCTGGGAGCGCAACGGCTTTGTGGTCTGGTACAAACGCCTCG
AGCAGGAGCGGTTCAAGTGGCCGGCTTGCGGTGAGCGGGAGCGGCTCACGCTCTCGGGCCAGGAGCTCAACTGGTTGCTCGACGGCATCGACATCACCCG
TATGCAGCCGCACAAAGCGTTGCATTTTCAGTCGGTTGGATGAAATTTTTGCTCGCCGCACCGGTGGCGTTTTGGTACAATTACCCACATGAAACGGGCC
GATACCAACACGTTGCCATCCAGTTCCGATCTCCAGCGCGAGCTCGATGAGCAGCGCGCTCTGGTCGAACGCCTCCAGGCCCAACTCGCCGAGAAGGAGG
CCGCGTGGGCGGCAGAGAAGCGCTCGCTGTTCGAGCAAATCCGGCTGCTGCTGGACAACCGCTTCGGCCCCTCCACCGAGAAGTACAGCATCAAGCAGCA
GGACCTGTTCTTCGACGAGGCCGAGAGCCTGGTGGAAGAGCCCGCCGAGTCAGGTGAGACTGCCGAGGCGGAAGAGGAAAACCAGCCGGCCCCCAGTGGC
GGCAAGCGTCGCCGGGGTGGGCGCGCCCCACTGCCGCCGGAGCTGCCTCGCGTGGACATCGTCCACGATCTCCCCGAGGACGAACAGCAGTGTGCCTGCG
GCTGCGGTGCGCTCACCCGCATCGGCGAAGAGGTCACCGAGCAGCTCGACATCATCCCGGCCCAGATCCAGGTGCTGCGCCATGTGCGCATCAAGTACGC
CTGCCGGGCCTGCGAGGACGGTGTCCAGATCGCCGATCTGCCGCCGCAGCCGCTGCCGAAGAGCAACGCGAGCCCCGGACTGCTTGCCTATATCGCCACC
GCCAAGTACCAGGACGCGCTGCCGCTGTACCGCCAGGAGCAGGTCTTCAAACGACTGGGCCTGGAGCTGCCACGGAACACGCTCGCCCGCTGGATGGTGG
ACCTGGGTGCGTTGCTCGCGCCACTGGCCGAGCGCATGCGCGCCCATTTGCACAGTGCGGAGCTCATCCACATGGATGAGACCACCGTGCAGGTGAACAC
CGAGCCCGGGCGGGCCGCCTCCAGCACCTCGTACATGTGGGTCCAACGCGGCGGGCCGCCCGGAGCCGAGGTGGTGCTGTTCGACTACGATCCCAGCCGC
TCGGGCCAGGTCCCGCGGCGCCTGCTGGATGACTATGGTGGTATCCTGCTCACCGACGGCTACGAGGGCTATGCCCAGGTCGTGCGCGATAATGCCATCA
CCCATGCCGGGTGCTGGGCGCATGCGCGCCGCAAGTTCAAAGAGGCCCAGAAGGTCCAGCCCAAGGGCAAGACCGGCAAGGCCGACCGGGCGCTGGCGTC
CATCGGCAAGCTCTACCGGGTGGAGCGCGAAGCCCAGGGCCTGCCCGTTGAGAAGCGTGAACGCCTGCGCGCCACGCACAGCCGGCCGCTGATCGAGGAT
CTGCGCCAGTGGCTTGACCAGTCCCTGGAGAAGGTGCCGCCGAAGAGCGCCATCGGCAAGGCCGTGCACTACCTCAACAGCCAATGGCCCCGGCTCATCC
GCTTCCTGGAGGATGGCCGCATCCCGCTGGACAACAACCCCGCGGAGAACGCCATTCGGCCGTTCGTGGTGGGGCGCAAAAACTGGCTGTTCAGTCAGAC
GCCGCGGGGTGCGCACGCCAGTGCCACGATCTACAGCGTCATCGAGACGGCCAAGATCAACGGCCTGGAGCCCTACGCGTACCTGCTCGAGGTGTTAAAG
AACCTGCCGGGCGCGACAACCGGCGAGGCCATCGACCGACTGCTGCCGTGGCATCAGGACGAGAGCCTATACGCGCTCAAACCCGCTGAGTAGGTGGGGT
TCGCGGAGCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
207 bp | 68 aa | 84 | 290 | + | No |
AG : IS66 TnpA
ORF sequence :
MWLSPVRARWLGHLYHAHALGLSLSEYARRQDVSLAELMDWERRLHEAGVPVPERHRPARFVAVEVVA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
357 bp | 118 aa | 287 | 643 | + | No |
AG : IS66 TnpB
ORF sequence :
MIRPGTDVAVYLCREPVDMRKSIDGLSLLVQEVMACDPFTAAVFVFCNRARDKVKILFWERNGFVVWYKRLEQERFKWPACGERERLTLSGQELNWLLDG
IDITRMQPHKALHFQSVG
IDITRMQPHKALHFQSVG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1605 bp | 534 aa | 689 | 2293 | + | No |
Chemistry : DDE
ORF sequence :
MKRADTNTLPSSSDLQRELDEQRALVERLQAQLAEKEAAWAAEKRSLFEQIRLLLDNRFGPSTEKYSIKQQDLFFDEAESLVEEPAESGETAEAEEENQP
APSGGKRRRGGRAPLPPELPRVDIVHDLPEDEQQCACGCGALTRIGEEVTEQLDIIPAQIQVLRHVRIKYACRACEDGVQIADLPPQPLPKSNASPGLLA
YIATAKYQDALPLYRQEQVFKRLGLELPRNTLARWMVDLGALLAPLAERMRAHLHSAELIHMDETTVQVNTEPGRAASSTSYMWVQRGGPPGAEVVLFDY
DPSRSGQVPRRLLDDYGGILLTDGYEGYAQVVRDNAITHAGCWAHARRKFKEAQKVQPKGKTGKADRALASIGKLYRVEREAQGLPVEKRERLRATHSRP
LIEDLRQWLDQSLEKVPPKSAIGKAVHYLNSQWPRLIRFLEDGRIPLDNNPAENAIRPFVVGRKNWLFSQTPRGAHASATIYSVIETAKINGLEPYAYLL
EVLKNLPGATTGEAIDRLLPWHQDESLYALKPAE
APSGGKRRRGGRAPLPPELPRVDIVHDLPEDEQQCACGCGALTRIGEEVTEQLDIIPAQIQVLRHVRIKYACRACEDGVQIADLPPQPLPKSNASPGLLA
YIATAKYQDALPLYRQEQVFKRLGLELPRNTLARWMVDLGALLAPLAERMRAHLHSAELIHMDETTVQVNTEPGRAASSTSYMWVQRGGPPGAEVVLFDY
DPSRSGQVPRRLLDDYGGILLTDGYEGYAQVVRDNAITHAGCWAHARRKFKEAQKVQPKGKTGKADRALASIGKLYRVEREAQGLPVEKRERLRATHSRP
LIEDLRQWLDQSLEKVPPKSAIGKAVHYLNSQWPRLIRFLEDGRIPLDNNPAENAIRPFVVGRKNWLFSQTPRGAHASATIYSVIETAKINGLEPYAYLL
EVLKNLPGATTGEAIDRLLPWHQDESLYALKPAE
Blast result :
Comments
ISNarch2 is 77% (transposase) aa similar to ISAeh1.
References
1] Sarah Sonbol (2020) Direct submission.
2] Xue,Q (202) Direct GenBank submission.
2] Xue,Q (202) Direct GenBank submission.