ISArsp16

  • Family ISNCY
  • Group
MGE type ISRelated element(s) :
Isoform Synonym(s)
Accession numberTranspositionOriginHost
MH067967 ND Arthrobacter sp.
Arthrobacter sp.
Arthrobacter sp. ANT_H19B pA19BH1
DNA section
IS Length : 4338 bp

Ends


IR Length : 24/27

IRL : TGAGAATGCTCATTGGTATGTCAGTAATCGGCGGAAATTGCTCATTGAGA
IRR : TGAGAATGCTCATCGAGATGTCAGTAAATTGCTCAATGAGATGTCAGTGA

Insertion site


Left flankDirect repeatRight flankDR Length
tgagaatgctcattggtatgtcagtaaGTTttactgacatctcgatgagcattctca3

DNA sequence

TGAGAATGCTCATTGGTATGTCAGTAATCGGCGGAAATTGCTCATTGAGATGTCAGTAATTTGCTCATTGAGATGTCAGTGGATCGGGGGCTCGTCCCAG
TAGGCTCTTTGACCATCTTGGTGATGTGGTGGTCGGCAGAAGGGGTCGATTTTGGGCGAGCAGGACCTGGTTGCGCGCGATCGGCTTCCTCGTGAGGCGG
CGGTGCGGCGGCTGATGGCCTTGGACGGTGAAGGCCGGCTCTCTACGGAGGACGTGCGGCTGGCGGCGGAGGGTCTGGCCGTGTCGGAGCGGACCGTGTG
GCGGTGGATTGGGCGGGCACGAGTCACTGATGACTTCGATGAGGCGGCGCGCGAGCATTTCACGGTGACGGATCTGGTGCGGGAACGGCTCGCGTTTTGG
CGAGGCAATATCTCGGCTGTGCACCGGGAGCTTGTTGAGGAGGCGGAGAGGGCGGGGACGGAGGCTGTAAGTCGACAGACCCTGCAGCGGGCCGTGGAAC
GGGACATTCTGCGGGGAGACCGGGCGGGTTTGCGCCATGGTGAACATGCCCGGAGGGCGCATGATGTTTTTCTGCAACGGCCGCGAACGCACCGAAATGG
AGCGTGGGAGTCGGATCATGTTGAGGCCCCCGTGGAGGTCGACGTCGAGGGCCGGCTGGTCAAACCGTGGGTGACCTGGTTCGTGGACGTGGGCACGAAC
GCGGTGTGCGGAACGGCCGTCACACCGGGGGCGCCGTCCAGGGAAAGCATTTTGGCGGCGCTGCGTGCAGCAATTGCCCTCGAGGCCCCATATGGGCCGC
CTGGTGGGTTGCCCGAGAGAGTACGGATTGACCGGGGGAAGGATTTCCTGTCCGAGGCGGTGCGGAGCGTGCTGGCAGGTTTCGCGGTGAGGGTTGTCCC
CCTGCCTCCGTACACACCACACCTGAAAGGCACGGTGGAGACGGTAAACGGGGCGGCCGGGCAGATGTTCTTCGCGGGCCTGCCCCGCTACACGGGCGCA
CAGACGCTGGCCAACGGCCGATCGATCGATCCGGACGCGCCAGCGTTGACATTCGAGGCGTTCGTTGCGCAGCTGCTGGGCTGGGTGAGCTGGTGGAACG
CCGAGCACCAGATGCCTGTGCTGGAAGGCAGGACGCCGCTGCAGGCGTGGTTGGATGACCCAACCCCGCTGAACACGGTGCCCGCCGGTGACCTGCGGCT
GCTTACTCTGGAAGACGACGGGCGTACCCGCAAGATCACGACGAAGGGTGTCTCGTGGCGGGCCAGTCAATACGTCGCCGCGTGGATGACCGGGCAGGTC
GGACGCCCGGTTCGGCTGCGATACATGCCCCACCACGAGCACGAGGTCGAGGTATTCGACGCCAACACCGGGGAACACCTGGGCGCCGCGAATCTCGCTG
ACAAGGCCAGCAGCGAGCAGATTGGTGAACTGCGCCGCTCCCGCGAGGCCCGCCGCCGGCAACTAAAGGCCGACCTACGTGCGGCAGAGAAAGCCCGCCG
GATCCGGTACGCCGCGGCGACCACCGCCGCTCCCCCGCAACCGATCAGCACCGTCACCGTCGCGCAGGCCACAGCTGAACTGTCTAACGCCGACGACCAG
CAGCTGCAGGCCGTAGCCCGGCCCCTGCTGGTGCCGTTACGGCCGCCAGCACCAGGCTGGGTCCTGCCCCGGTCCCCGTACGAGAAGACCAACCGTGCCG
TGGACGAAGGCATCGACGGGGGCCAAGACCAGTGAGCGCCTGGACAGACGTCGACGAGCGCGACGATCACTACCTCGGCCTGGCCGGGGCAAACGTCGTC
GCGACCGAGTCCCTGCTGGTGCTGCAGGACAACCTCGCCGACGTCATGACGGCCAAAGCGATGATGTGTGTGCACGGCGATGCCGGGCTGGGTAAGACCC
TGTCGGTCAACACGTCCCTGCGGGCGCTGGCGCCGGCCGACGTCTGCCGGGTGCAGTTCCGGGCCCGGCCCACCCCACGCGACATCCGTCACGTCCTCTT
CGATGCCCTCGGCGTCGGCGGAGCCCCACCCTCGCGACCGATCGAGTTCGACGCCCTACTCAAAGATGTTTTGTCGGAACGGTTCCGTGTCCTGGTGTGC
GACGAAGCTCAGTGGCTCTCACGGGAGTGCTTTGAGCTGTGGCGGCATCTGTGGGACGACCGACGGACCAACATCGCGATCGTATTCGTCGGCGGTGGCG
ACTGCTACCGGGTCTTGCGCCGCGAACCGATGCTCTCCAGCCGGGTCTACGTTTGGCAGGAGTTCCGCCGCCTGACCCGCGAGCAAATCCTGGCCGTGAT
CCCGGCCTACCACCGGGTTTGGGCCGAGGCCGACCCGGAAGACATCGTCTACACCGACGTGCACGCCGGACACGGCAACTTCCGGGCCTGGTCGAAGATC
ACCGCCCATCTGGTCACCGCGCTGGACCGCCTGGAAAGAAAACGCCCCGACCGGGAGGTCCTGCAATGGGTCTTCAGCCGGCTGGGCGGCAGCAGTGGGT
GAGCGCGCCAACCACCCGGATGAAACTGCGCCAGTTCGGATCCTGCTGGACCCCACAGACGACGTCCGGGTGACGTCGAGCCTGCTGGAGCGCCACAACC
CGGCCCACGGCCTCGCCGTCGTACATCCCACGCCCGCCACCTCCAGCCCCACGGCCCTCGCCTACGACGTGCTGGTCGCCCTTGACCGGCCGGTGAGCCG
GCTGGAGGCTGAACACCTGACCGGGACTGCACGCCCCTGGCAGGCCGCGGCCGTCTGGATGACCACGGATCAGGTCAAAGACCTGATCGTCCTCCGCGCC
GAGCGGCTCTCGGCCAGCACCTGGAACCACTTGATCCGGCTGTGCCGGGACACCGGCAGCCGCCTAACGCTGGTCTGCCACACCCGTCAGATCCCAGAAC
ACCTCAGGGGCGTCCTAACCGGAATCGAGCACCACCTGCTCACGGACCTGGCACAGGCCCGCACCCTGCACAAGAAGGCCCACCCACCCCCGCTCAGTAC
GGAGCCGCGGTCTGGCCGCCAGGACACCGACCAGCTGCCGGACCTGCCAGCCGCCGGCGTCGCGCACTTCCGGGCCGAGGCCTACCGAAGGCTCGACCCT
GCAGCATTCGCACGCGTCGATGCCGCCTATCGCTACGGGCGACACAGGGCCTACGACTGGTTGAGCGGCCCCGCACCCGAAGAGACCTACGCCGGCACGG
AACACGCGCAGCTGTTCCTGACCGGGCTGGTTCATGACAGCCCTACCCGTGCTCACACCCTGGCCCGGCTGCGCGGAGCCCAGGCCGGGTTCCTGGCACA
CGGGCTCCTGCTGGGTGTCCCCTCAGCTCACGACCTCATGAACGTCCTCAGCGGCCCGGGTTTGAACACCCTGCCGGTCAGTCAGGACGCCCTGCAGCGC
ATCCGAACGGGCGTGGCCCACCCCATGGTCGCCGCGGGCGTAGCAACCGCACTGTTCACAGGCATATCCACACAGGCGACGAGGCACGCAACACTGGCCG
ACCAGCACCCAAACCCCGCTGCCCTACGGGTGACTTGGCGGCCCCACATCACCAAAACGACGCCGAACCTGATCAACCAGACGGCACCGACCCTCTCTAC
CCCGGCCCTCTTTCACGTGCCCGCTGCGGCGCGGCCGCTGCTACGTGCCGCCGCCGACTTCGCCCTGCGGCAGCCCTCCGCCGCTGCCCGTGAGCGCATA
TTCGCCCCGCCGGCAGTGACGAACGAGCGGGTCCAGGCCGCAGCAGACCACTGTCAGATCGCGTTGCCTGTTCAGGCACCAACCCTCGAGGCCACCTGGC
AGATCAGAGTCACCTGCACCCAGATCAATGCCCCGCCCGTTCATGCCGCCCCGTCGCACCCAACCGGGCAACCACCACTCTCGCGGGTCCTCGGCCAACC
CATCCGACCGGCATCCGCCCGCCGCGGCTACCTGAAGCCTGCGGTGAACGGCCATGCCCACGACCGGTGGCACGGTCGACGCCCGCTGACAGAGCACACC
GCAGCCCTCGTCCTCGACCTGATCCATGACCACCTGGCCTCCCCCGCGCCGTCCGACCGGAACGAACGCTCGTCAGGGCCGAGCTGGATGCTGATCCGCC
GCCAGCTCGCCTGCTATCCACGCGACCCAGACGGGAACCTCACCGCACTGTCACCCCACCCTGATGTCCTGTACGCCCTGGAGCTCACTGATCGCCCAGC
CCAAACCGTCCCCGATCGAGCCCGCGTGATACATGAAGATCAATGAACCGCTCCCCCAGCAGTCCCCTAGTCGAGCGCACCGACTTGGTCACTGACATCT
CATTGAGCAATTTACTGACATCTCGATGAGCATTCTCA
Protein section
ORF number : 3

 

ORF 1
LengthBeginEndStrandFusion ORF
1584 bp527 aa1521735+No
ORF function : Transposase
Chemistry : Unknow

ORF sequence :

LGEQDLVARDRLPREAAVRRLMALDGEGRLSTEDVRLAAEGLAVSERTVWRWIGRARVTDDFDEAAREHFTVTDLVRERLAFWRGNISAVHRELVEEAER
AGTEAVSRQTLQRAVERDILRGDRAGLRHGEHARRAHDVFLQRPRTHRNGAWESDHVEAPVEVDVEGRLVKPWVTWFVDVGTNAVCGTAVTPGAPSRESI
LAALRAAIALEAPYGPPGGLPERVRIDRGKDFLSEAVRSVLAGFAVRVVPLPPYTPHLKGTVETVNGAAGQMFFAGLPRYTGAQTLANGRSIDPDAPALT
FEAFVAQLLGWVSWWNAEHQMPVLEGRTPLQAWLDDPTPLNTVPAGDLRLLTLEDDGRTRKITTKGVSWRASQYVAAWMTGQVGRPVRLRYMPHHEHEVE
VFDANTGEHLGAANLADKASSEQIGELRRSREARRRQLKADLRAAEKARRIRYAAATTAAPPQPISTVTVAQATAELSNADDQQLQAVARPLLVPLRPPA
PGWVLPRSPYEKTNRAVDEGIDGGQDQ

 

Blast result :
ORF 2
LengthBeginEndStrandFusion ORF
771 bp256 aa17322502+No
ORF function : Passenger Gene
Annotation : Hypothetical proteinDescription :

ORF sequence :

VSAWTDVDERDDHYLGLAGANVVATESLLVLQDNLADVMTAKAMMCVHGDAGLGKTLSVNTSLRALAPADVCRVQFRARPTPRDIRHVLFDALGVGGAPP
SRPIEFDALLKDVLSERFRVLVCDEAQWLSRECFELWRHLWDDRRTNIAIVFVGGGDCYRVLRREPMLSSRVYVWQEFRRLTREQILAVIPAYHRVWAEA
DPEDIVYTDVHAGHGNFRAWSKITAHLVTALDRLERKRPDREVLQWVFSRLGGSSG

 

Blast result :
ORF 3
LengthBeginEndStrandFusion ORF
1677 bp558 aa25704246+No
ORF function : Passenger Gene
Annotation : Hypothetical proteinDescription :

ORF sequence :

VTSSLLERHNPAHGLAVVHPTPATSSPTALAYDVLVALDRPVSRLEAEHLTGTARPWQAAAVWMTTDQVKDLIVLRAERLSASTWNHLIRLCRDTGSRLT
LVCHTRQIPEHLRGVLTGIEHHLLTDLAQARTLHKKAHPPPLSTEPRSGRQDTDQLPDLPAAGVAHFRAEAYRRLDPAAFARVDAAYRYGRHRAYDWLSG
PAPEETYAGTEHAQLFLTGLVHDSPTRAHTLARLRGAQAGFLAHGLLLGVPSAHDLMNVLSGPGLNTLPVSQDALQRIRTGVAHPMVAAGVATALFTGIS
TQATRHATLADQHPNPAALRVTWRPHITKTTPNLINQTAPTLSTPALFHVPAAARPLLRAAADFALRQPSAAARERIFAPPAVTNERVQAAADHCQIALP
VQAPTLEATWQIRVTCTQINAPPVHAAPSHPTGQPPLSRVLGQPIRPASARRGYLKPAVNGHAHDRWHGRRPLTEHTAALVLDLIHDHLASPAPSDRNER
SSGPSWMLIRRQLACYPRDPDGNLTALSPHPDVLYALELTDRPAQTVPDRARVIHEDQ

 

Blast result :
Comments
ISArsp16 is 78% (transposase) aa similar to ISAau5.
References
1] Romaniuk, K. (2018) Direct submission.