ISNarch4

  • Family IS66
  • Group
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) :
Accession number : NZ_CP050695.1
Transposition : ND
Origin : Natrialbaceae archaeon
Host : Natrialbaceae archaeon 2447; Salinadaptatus halalkaliphilus 2447
DNA section
IS Length : 2455 bp

Ends


IR Length : 22/30

IRL : GTAAGCGCCAGCGAATCGGCACCTATTCCCTGGTCTGAGTACTGGTCACA
IRR : GTAAGCGCCCGTAAATCGACCCACTTGCCCCGCAGGCGGTCGTCAGCTCT
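
The IRR listed above appears to be written as the reverse complement of the element's right end, so that it reads in the same orientation as IRL. The following is a minimal Python sketch of that check, using the last 50 bases of the DNA sequence given in this record. The names `reverse_complement`, `IRR` and `RIGHT_END` are illustrative only and do not come from any database tooling.

```python
# Complement table for unambiguous DNA bases.
# Assumption: the sequence contains only A, C, G and T (no IUPAC ambiguity codes).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# IRR exactly as listed in the Ends section of this record.
IRR = "GTAAGCGCCCGTAAATCGACCCACTTGCCCCGCAGGCGGTCGTCAGCTCT"

# Last 50 bases of the DNA sequence in this record (top strand, 5' to 3').
RIGHT_END = "AGAGCTGACGACCGCCTGCGGGGCAAGTGGGTCGATTTACGGGCGCTTAC"

# True if the listed IRR is the reverse complement of the right end.
irr_matches_right_end = reverse_complement(RIGHT_END) == IRR
print(irr_matches_right_end)
```

Writing both repeats in the same orientation makes them directly comparable position by position, which is presumably how the IR Length figure is derived.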

Insertion site


Left flank / Direct repeat / Right flank (concatenated)    DR Length
CAGGGCGAAGGGACGAAGGGACGAAGCG    8
CCGGCAGTCCGGCAGTCCGGCAGTCCCG    8

DNA sequence

GTAAGCGCCAGCGAATCGGCACCTATTCCCTGGTCTGAGTACTGGTCACACTGGTTCCATCCCGTTGACCACTGGAGGAAATCGGTATGACCAGCGCCGA
GCTGCATCATTACTGGCGACAGACGCTGGATGCGTGGACGGCCTCCGGGTTGTCCGGCGCGGCGTTCTGCAAGCAACACTCACTCACCTACCACCAGTTT
GTCTACTGGCGGCGAAAGCTCCGTGGCCCGGGCGAGTCGCCTTCGCGGGCCGGCTTTGCCAGGGTGGCGCCGGTGGCACACGATGACGCCGCGGATGGGC
TGACCGTCTCGTTGCCCGGCGGTGTGTCGATCACCGGCCTGCACGCGGGCAACATCGAGTTGCTGGGCGCGGTGCTGAGGCAGTTGTGATGCGCAACCGG
TCTCTGCGCCCGTCCCGGCAGTTGCCGGAGATTTACCTGTACCGGGCCCCGGTGGATTTCCGCAAACAGGCCCATGGTCTCGCGCTGATCGTCGAGCAGG
AGCTCGGGCACAGCCCCTTCACCGGGGCGCTGTACGCCTTCACCAACCGCCAGCGCAACAAGATCAAGTGTCTGATGTGGGAAGACAACGGCTTCGTGCT
CTACTACAAGGCCCTGGCCGAGGAGCGGTTCAAGTGGCCGGCCCCGGGCGATGAGTTGATGAGCCTGAGCGGGGAGCAGATCAACTGGCTGCTCGACGGC
TACGACATCACGCTGCTGCGGGGGCACAAAAAGCTGCATTACGAGGCGCTTGGGTAGGCGTTTTTGCGTGCGCCAGGGCGCTGTTTTTGGTATGATTTCG
TAATGAAATCAACGCCCGATAACGCCCCTCCGGCTCCCGATCTCAGCGGCCTCTCCGCCGCTGAGATGATGGCCGTTATCGGTGACCTTCAGCAACAACT
GGCCTCGAAAGAGCAGGCCATCCGGCAACGCGATACGCGCATTGATCTGCTCGAAGAACTGCTGCGCCTGAAGACCCTCCAGAAGTTCGCCGCCAGTAGC
GAGAAGCATCAAAACCAGATCACGCTGTTCGACGAGGCGGAGGTGGAAGCCGAGATCGATGCCTTGCGCGAGGCACTCCCGGACGACGCTGAACCCGACC
CGGATGAGACGCCGCGCACCTCGGGCAAGCGGCGTGAGCGGGGCTTCTCGGACACGCTGGCGCGCAGGCGCGTTGAGCTCACGCTCAGCGACGAGGAGAA
AGCCGGTGCCAGCAAGACCTTCTTCACCAAGGTCAAGGAGGAGCTTGAGTTCATCCCCGCTCAGTTGAGCGTGCTGGAGTACTGGCAGGAGAAGGCCGTG
TTCGAGCACGACGACGGGGAGGAGTCCCTAGTGGCGGCGCCCCGGCCGGTCCACCCGCTGGGCAAATGCATTGCCACCACCGCGCTGCTCGCCTACATCA
TCACCTCGAAGTACGCCGACGGTCTGCCGCTGTACCGACTGGAGAACATGCTGGCGCGGCTCGGGCATTCGGTCAGTCGCACCAGCATGGCGCACTGGAT
CATCCGCCTGGATGCGGTGTTCAGCCCGCTGATCAACCTCATGCGCGAGGCGCAGAACACCAGCGACTACCTCCAGGCCGATGAGACCCGCATGCAGGTC
CTCAAGGAGGACGGCAAGGTCGCCCAGTCCGACAAATGGATCTGGGTGACCCGGGGTGGGCCACCTGGCCGGCCAACGGTGCTGTTCGCGTACGACCCCT
CACGTGCGGGGAGCGTGCCCGTGCGCCTGCTCGATGACTTCAGCGGCATCCTGCAGGCCGATGGCTACTCCGGCTACGGCCAGGTGTGTCGGGACAACGC
CATCACCCGGATCGGGTGCTGGGATCATGCCCGTCGCAAGTTTGTCGAGGCCTCCAAGGCGGCGCCGCCCAAGAAGAAGGGCAAAGGCAAACGCCAGAGC
GCCAAGGCGGATGTGGCGCTGGGGGCGATCCAAGAGCTCTACGCCATTGAGCGCCGAATCAAGGATCTCGGCGATGATGAGCGCTATCGCATCCGCCAGG
CCGAGAGCCTGCCCCGGCTCCAGGCGTTGAAAACCTGGCTGGAAGACAACGCCGGCAAGGTCGTGAAGGGCTCACTGACCCGCAAGGCGATGGACTACAC
CCTGAACCAGTGGGACACCCTGGTGGGCTACTGCGAGCGTGGGGATCTACAGATCAGTAACGCCCTGGCCGAGAACGCCATCCGCCCGTTCGCGCTCGGT
CGCAAGGCATGGCTGTTCGCCGATACCACCCAGGGCGCACGCGCCAGCGCGAGCTGCTACTCACTAATCGAGACCGCCAAGGCCAATGGCCTGGACCCCT
CGGCCTACATCCACCATGTGCTCACGCACATCGGCGAGGCGGACACCGTCGAGAAGCTCGAAGCGCTACTGCCCTGGAATACGGGCCTGGAGCCGGCTCC
GAAAAAGAGCTGACGACCGCCTGCGGGGCAAGTGGGTCGATTTACGGGCGCTTAC
Protein section
ORF number : 3

 

ORF 1
Length : 303 bp, 100 aa
Begin : 87
End : 389
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpA

ORF sequence :

MTSAELHHYWRQTLDAWTASGLSGAAFCKQHSLTYHQFVYWRRKLRGPGESPSRAGFARVAPVAHDDAADGLTVSLPGGVSITGLHAGNIELLGAVLRQL

 

ORF 2
Length : 369 bp, 123 aa
Begin : 389
End : 757
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : IS66 TnpB

ORF sequence :

MRNRSLRPSRQLPEIYLYRAPVDFRKQAHGLALIVEQELGHSPFTGALYAFTNRQRNKIKCLMWEDNGFVLYYKALAEERFKWPAPGDELMSLSGEQINW
LLDGYDITLLRGHKKLHYEALG

 

ORF 3
Length : 1611 bp, 536 aa
Begin : 803
End : 2413
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

MKSTPDNAPPAPDLSGLSAAEMMAVIGDLQQQLASKEQAIRQRDTRIDLLEELLRLKTLQKFAASSEKHQNQITLFDEAEVEAEIDALREALPDDAEPDP
DETPRTSGKRRERGFSDTLARRRVELTLSDEEKAGASKTFFTKVKEELEFIPAQLSVLEYWQEKAVFEHDDGEESLVAAPRPVHPLGKCIATTALLAYII
TSKYADGLPLYRLENMLARLGHSVSRTSMAHWIIRLDAVFSPLINLMREAQNTSDYLQADETRMQVLKEDGKVAQSDKWIWVTRGGPPGRPTVLFAYDPS
RAGSVPVRLLDDFSGILQADGYSGYGQVCRDNAITRIGCWDHARRKFVEASKAAPPKKKGKGKRQSAKADVALGAIQELYAIERRIKDLGDDERYRIRQA
ESLPRLQALKTWLEDNAGKVVKGSLTRKAMDYTLNQWDTLVGYCERGDLQISNALAENAIRPFALGRKAWLFADTTQGARASASCYSLIETAKANGLDPS
AYIHHVLTHIGEADTVEKLEALLPWNTGLEPAPKKS
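
The Begin and End coordinates given for the three ORFs can be cross-checked against the listed nucleotide lengths. The sketch below assumes the coordinates are 1-based and inclusive, so a span's length is End - Begin + 1. The values are copied from the ORF entries in this record. The tuple layout and the name `span_length` are illustrative only.

```python
# (name, begin, end, listed length in bp), copied from the ORF entries in this record.
ORFS = [
    ("ORF 1", 87, 389, 303),
    ("ORF 2", 389, 757, 369),
    ("ORF 3", 803, 2413, 1611),
]


def span_length(begin: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - begin + 1


for name, begin, end, listed in ORFS:
    computed = span_length(begin, end)
    # Report whether the computed length agrees with the listed one
    # and whether it is a whole number of codons.
    print(name, computed, computed == listed, computed % 3 == 0)
```

Under that assumption, all three listed lengths agree with their coordinates and are multiples of three. ORF 1 ends at the same position where ORF 2 begins, so the two reading frames overlap by one base.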

 

Comments
The ISNarch4 transposase shares 82% amino acid similarity with the ISAeme5 transposase.
References
[1] Sonbol, S. (2020) Direct submission.
[2] Xue, Q. (2020) Direct GenBank submission.