ISArsp12
- Family IS481
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
MH067968 | ND | Arthrobacter sp. | Arthrobacter sp. Arthrobacter sp. ANT_H2 pA2H1 |
DNA section
IS Length : 2615 bp
Ends
IR Length : 24/26
IRL : TGTACTGACCGGACACGTTGATCGATAGACAAGTCGCCACTTGTGACAGA
IRR : TGTACTGACCGGACACGTTGGTCAATCGTGAGAAGGGCGGCTGGTTGCTG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
tgtactgaccggacacgttg | cctaga | tgtactgaccggacacgttg | 6 |
DNA sequence
TGTACTGACCGGACACGTTGATCGATAGACAAGTCGCCACTTGTGACAGAAGGCCACGCTTGTCTATAGTGTGGGTATGACAGTTTCACTGGATGTTGCC
CTGCCGCGCACGGACCTGTCGGCGCGGGATCACGCGCGGATGCTTGCTCCGTTGTTGAAGCCTTTGTCAGACGAGAACAGACTGATGATCGTGCTGACGC
TTGCGGCGGGGGCGTGCTCGAACAAGAAGTTGCAGGAGGCCACCGGGTTGAGCCAGGCGCTGGTGAGTCACCATGTGGCAGCCTTGCGGCAAGCGCAGCT
GATCAGTGTGCGGGCCGAGGGCCGGTCCAACATCTACGCGCTGTGTTGTGAGCAGTTGGCGGCGCCGGTGCAGTGGCTCGCTCACTTGGCGACTCTCACT
CCAGAGGGACAGAAGGCGTGCTGCACTGCCCCAGCAGGGGAGGGTCAGGCATCGACTGAGGGTGCTGCGCGGTGATTCATGCGGTACAGACCTTCGGGTT
GATCCTGGTCGAGTTGCTCGTCCTGTTCTCGCTGATCTCGGTGCTCGTTGCGCTGGTGAACCGGCGGTTCGGGCCGGATCGGATCCAGAGCTGGATGGCC
GATGGGCGCCTTCCCGGCCCGCTCAAGGGGCTACTGCTGGGGGCGATCACGCCGTTCTGTTCCTGTTCGACGCTGCCGGTGCTGGCTGGGATGTTGAAGT
CGGGGGTGGCGTTCCGGACGTCGATGACCTTCCTGATCTCGTCACCGCTGCTGGATCCCATCATCGTGGCCGGGGTGGTGCTGCTGTTCGACTGGCGTAT
CGCGTTGGTCTACACGGTGGTCACCGCCGCATGGTCCCTGCTCGCCCCCCTAGTATGGGAACGACTCGGCATGGCCAGCCAGCTCAAGCGAGTCAAGGTC
GTCGGCGACGACAGCACCCCCCACCCCTGGGCAGGCCTGCGCAACGAGATCCGGCCCGCACTCGGCGAGGCCTGGGCCGATCTGAGACCGCTGCTCGTCC
CGATGGTGCTGGGTGTCTCGGTCGGTGCATTTATCTACGGTTTCGTGCCTCAGGACCAACTCGCCGCGGTCGCTGGCGACGACAAACCGTGGGCCGTGCC
GTTGGCCGCCGTGCTTGGCGTTCCGCTTTACGTGCGTATCGAGACGATGCTGCCCATCGGCTTGGCCCTCAGCTCGACCGGCATGGGGCTAGGTGCAGTG
TTCGCGCTGATGATCGGCGGCGCCGGAGCCTCGATCCCCGAGATCTCGATGCTCACCACCTTGTTCAAGCCCCGACTGGTCGTCACTTTCGTGGTCACTG
TCATCGGCACGGCCATCGCCGCCGGCTACCTCATCCCACTGATCGCCTGACCCAACCACCACATTCCCAACCACACGAGGAGAACAAGCAACCATGGGAC
TCAAGGACCTCTTCACCGCACGCAAGCAGGACAGCAGTTGCTGCGGGGCCCAGATCGTGCCCGACGATGACGACCAGCCCCAGAGCGAGCAGGGAACACC
CACCGAGACGGCTGCCTCTTCAGCGCCGAACAGCGAGGACACGGACGGCAACTGACATACCGAATCGATGGGGCTGCACCACAGGGTGCAGCCCCATCAC
TGTCTCCACCCGCTCGTCACCAGGAAGACCACCATGTCCCACGCCAATGCCGCCCTCACTCCGCGTCACCGCCTCAAGGTTGCCCAGCTGGTTGTTGACC
ACGGATGGCCGATCAGCGAGGTCGCTGCCCGGTTCCAGGTCTCCTGGCCCACCGTGAAGCGCTGGGCTGACCGCTACCGGGCCGGTCAGTCCATGCAGGA
CCGCAGCTCACGGCCTCACCGCTCCCCGAACAAGACCAGCCCCACGACTGCGAAGCGCTGTATCCAGCTCCGGCTGCGTCTGCGGGAAGGTCCGGTTCAG
TTGGCATACCGGATTGGCGTTGCTCCGTCGACAGTCCACCGGATCTTGGTCGACGTCCACTTGAACCGCCTGTCGCACGTCGACCGTGCCACTGGAGAGC
CTGTTCGTCGCTACGAGCACGACCACCCCGGGGCGATGCTGCACGTCGACGTGAAGAAGCTCGGCAACATCCCAGACGGCGGAGGCTGGCGCTACGTCGG
CCGACGACAAGGCGAGAAGAACCGCGCAGCCACACCCGACAAGCCCAAGAACAAGCACTACGACCCGTTGATGGGCAAGGCCTACGTCCACACCGTCATC
GACGACCACTCCCGCGTCGCCTACGCCGAGATCCACGACGACGAAACAGCCCACACCGCCACAGCGGTCCTGGCCAGGGCCGTCAAGTGGTTCAACGCCC
GCGGGGTGACCGTCGAGCGGGTCCTGTCGGACAACGGTGGCGCCTATCGCTCACACCTGTGGCGCGACACCTGCGCCGAGCTGGGAATCAGGCACAAACG
GACCAGGCCGTATCGCCCGCAGACCAACGGCAAGATCGAGCGCTTCCACCGCACCCTGGCCGACGGCTGGGCCTACGCCCGCTGCTACACCTCCGAGACG
GAGCGTCGCGGCGAACTCGACGGCTGGCTGCACTACTACAACCATCACCGGCCCCACACAGCCTGCAGCAACCAGCCGCCCTTCTCACGATTGACCAACG
TGTCCGGTCAGTACA
CTGCCGCGCACGGACCTGTCGGCGCGGGATCACGCGCGGATGCTTGCTCCGTTGTTGAAGCCTTTGTCAGACGAGAACAGACTGATGATCGTGCTGACGC
TTGCGGCGGGGGCGTGCTCGAACAAGAAGTTGCAGGAGGCCACCGGGTTGAGCCAGGCGCTGGTGAGTCACCATGTGGCAGCCTTGCGGCAAGCGCAGCT
GATCAGTGTGCGGGCCGAGGGCCGGTCCAACATCTACGCGCTGTGTTGTGAGCAGTTGGCGGCGCCGGTGCAGTGGCTCGCTCACTTGGCGACTCTCACT
CCAGAGGGACAGAAGGCGTGCTGCACTGCCCCAGCAGGGGAGGGTCAGGCATCGACTGAGGGTGCTGCGCGGTGATTCATGCGGTACAGACCTTCGGGTT
GATCCTGGTCGAGTTGCTCGTCCTGTTCTCGCTGATCTCGGTGCTCGTTGCGCTGGTGAACCGGCGGTTCGGGCCGGATCGGATCCAGAGCTGGATGGCC
GATGGGCGCCTTCCCGGCCCGCTCAAGGGGCTACTGCTGGGGGCGATCACGCCGTTCTGTTCCTGTTCGACGCTGCCGGTGCTGGCTGGGATGTTGAAGT
CGGGGGTGGCGTTCCGGACGTCGATGACCTTCCTGATCTCGTCACCGCTGCTGGATCCCATCATCGTGGCCGGGGTGGTGCTGCTGTTCGACTGGCGTAT
CGCGTTGGTCTACACGGTGGTCACCGCCGCATGGTCCCTGCTCGCCCCCCTAGTATGGGAACGACTCGGCATGGCCAGCCAGCTCAAGCGAGTCAAGGTC
GTCGGCGACGACAGCACCCCCCACCCCTGGGCAGGCCTGCGCAACGAGATCCGGCCCGCACTCGGCGAGGCCTGGGCCGATCTGAGACCGCTGCTCGTCC
CGATGGTGCTGGGTGTCTCGGTCGGTGCATTTATCTACGGTTTCGTGCCTCAGGACCAACTCGCCGCGGTCGCTGGCGACGACAAACCGTGGGCCGTGCC
GTTGGCCGCCGTGCTTGGCGTTCCGCTTTACGTGCGTATCGAGACGATGCTGCCCATCGGCTTGGCCCTCAGCTCGACCGGCATGGGGCTAGGTGCAGTG
TTCGCGCTGATGATCGGCGGCGCCGGAGCCTCGATCCCCGAGATCTCGATGCTCACCACCTTGTTCAAGCCCCGACTGGTCGTCACTTTCGTGGTCACTG
TCATCGGCACGGCCATCGCCGCCGGCTACCTCATCCCACTGATCGCCTGACCCAACCACCACATTCCCAACCACACGAGGAGAACAAGCAACCATGGGAC
TCAAGGACCTCTTCACCGCACGCAAGCAGGACAGCAGTTGCTGCGGGGCCCAGATCGTGCCCGACGATGACGACCAGCCCCAGAGCGAGCAGGGAACACC
CACCGAGACGGCTGCCTCTTCAGCGCCGAACAGCGAGGACACGGACGGCAACTGACATACCGAATCGATGGGGCTGCACCACAGGGTGCAGCCCCATCAC
TGTCTCCACCCGCTCGTCACCAGGAAGACCACCATGTCCCACGCCAATGCCGCCCTCACTCCGCGTCACCGCCTCAAGGTTGCCCAGCTGGTTGTTGACC
ACGGATGGCCGATCAGCGAGGTCGCTGCCCGGTTCCAGGTCTCCTGGCCCACCGTGAAGCGCTGGGCTGACCGCTACCGGGCCGGTCAGTCCATGCAGGA
CCGCAGCTCACGGCCTCACCGCTCCCCGAACAAGACCAGCCCCACGACTGCGAAGCGCTGTATCCAGCTCCGGCTGCGTCTGCGGGAAGGTCCGGTTCAG
TTGGCATACCGGATTGGCGTTGCTCCGTCGACAGTCCACCGGATCTTGGTCGACGTCCACTTGAACCGCCTGTCGCACGTCGACCGTGCCACTGGAGAGC
CTGTTCGTCGCTACGAGCACGACCACCCCGGGGCGATGCTGCACGTCGACGTGAAGAAGCTCGGCAACATCCCAGACGGCGGAGGCTGGCGCTACGTCGG
CCGACGACAAGGCGAGAAGAACCGCGCAGCCACACCCGACAAGCCCAAGAACAAGCACTACGACCCGTTGATGGGCAAGGCCTACGTCCACACCGTCATC
GACGACCACTCCCGCGTCGCCTACGCCGAGATCCACGACGACGAAACAGCCCACACCGCCACAGCGGTCCTGGCCAGGGCCGTCAAGTGGTTCAACGCCC
GCGGGGTGACCGTCGAGCGGGTCCTGTCGGACAACGGTGGCGCCTATCGCTCACACCTGTGGCGCGACACCTGCGCCGAGCTGGGAATCAGGCACAAACG
GACCAGGCCGTATCGCCCGCAGACCAACGGCAAGATCGAGCGCTTCCACCGCACCCTGGCCGACGGCTGGGCCTACGCCCGCTGCTACACCTCCGAGACG
GAGCGTCGCGGCGAACTCGACGGCTGGCTGCACTACTACAACCATCACCGGCCCCACACAGCCTGCAGCAACCAGCCGCCCTTCTCACGATTGACCAACG
TGTCCGGTCAGTACA
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
399 bp | 132 aa | 77 | 475 | + | No |
Annotation : ArsR family transcriptional regulatorDescription : Transcriptional Regulator factor
ORF sequence :
MTVSLDVALPRTDLSARDHARMLAPLLKPLSDENRLMIVLTLAAGACSNKKLQEATGLSQALVSHHVAALRQAQLISVRAEGRSNIYALCCEQLAAPVQW
LAHLATLTPEGQKACCTAPAGEGQASTEGAAR
LAHLATLTPEGQKACCTAPAGEGQASTEGAAR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
879 bp | 292 aa | 472 | 1350 | + | No |
Annotation : permeaseDescription :
ORF sequence :
VIHAVQTFGLILVELLVLFSLISVLVALVNRRFGPDRIQSWMADGRLPGPLKGLLLGAITPFCSCSTLPVLAGMLKSGVAFRTSMTFLISSPLLDPIIVA
GVVLLFDWRIALVYTVVTAAWSLLAPLVWERLGMASQLKRVKVVGDDSTPHPWAGLRNEIRPALGEAWADLRPLLVPMVLGVSVGAFIYGFVPQDQLAAV
AGDDKPWAVPLAAVLGVPLYVRIETMLPIGLALSSTGMGLGAVFALMIGGAGASIPEISMLTTLFKPRLVVTFVVTVIGTAIAAGYLIPLIA
GVVLLFDWRIALVYTVVTAAWSLLAPLVWERLGMASQLKRVKVVGDDSTPHPWAGLRNEIRPALGEAWADLRPLLVPMVLGVSVGAFIYGFVPQDQLAAV
AGDDKPWAVPLAAVLGVPLYVRIETMLPIGLALSSTGMGLGAVFALMIGGAGASIPEISMLTTLFKPRLVVTFVVTVIGTAIAAGYLIPLIA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
987 bp | 328 aa | 1634 | 2620 | + | No |
Chemistry : DDE
ORF sequence :
MSHANAALTPRHRLKVAQLVVDHGWPISEVAARFQVSWPTVKRWADRYRAGQSMQDRSSRPHRSPNKTSPTTAKRCIQLRLRLREGPVQLAYRIGVAPST
VHRILVDVHLNRLSHVDRATGEPVRRYEHDHPGAMLHVDVKKLGNIPDGGGWRYVGRRQGEKNRAATPDKPKNKHYDPLMGKAYVHTVIDDHSRVAYAEI
HDDETAHTATAVLARAVKWFNARGVTVERVLSDNGGAYRSHLWRDTCAELGIRHKRTRPYRPQTNGKIERFHRTLADGWAYARCYTSETERRGELDGWLH
YYNHHRPHTACSNQPPFSRLTNVSGQYT
VHRILVDVHLNRLSHVDRATGEPVRRYEHDHPGAMLHVDVKKLGNIPDGGGWRYVGRRQGEKNRAATPDKPKNKHYDPLMGKAYVHTVIDDHSRVAYAEI
HDDETAHTATAVLARAVKWFNARGVTVERVLSDNGGAYRSHLWRDTCAELGIRHKRTRPYRPQTNGKIERFHRTLADGWAYARCYTSETERRGELDGWLH
YYNHHRPHTACSNQPPFSRLTNVSGQYT
Blast result :
Comments
ISArsp12 is 92% aa similar to ISPfr21.
References
1] Romaniuk, K. (2018) Direct submission.