ISCsp4
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
KC771559 | ND | Comamonas sp | Comamonas sp Comamonas sp plasmid |
DNA section
IS Length : 2501 bp
Ends
IR Length : 23/32
IRL : TGTTGAATCACAACGAAAACTGGGCCATTTTTGGGGTGGAtcacactgaa
IRR : tgttgattcaccgtacaaactgagccagtgttcacgccgaatattgagcc
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCAGGGTCAA | GGATTTGTTC | 0 | |
GATCGTAGGC | ACCAAGCTCA | 0 |
DNA sequence
TGTTGAATCACAACGAAAACTGGGCCATTTTTGGGGTGGATCACACTGAAAATTGAGCCACGTTGACGGCTACGCTGCTGAATTTTTCAGCAAGGTGGTC
AGGAGTGATCAACGTGAGCACATTGAGTAAATTGCGCCGCCTCGTGCTGAGGGATGGCGTATCGGTGCGGGAGGCCAGCCGGCGGCTGGGGATCTCGCGC
AACACGGCCAGCAAGTGGCTGTCTGAGCCCGAAATGCGGGAGCCGAAGTACCCCGTGCGGGTGAAGGCGCAGGGCGTACTGACGCCTTATGAGGACACCT
TGGCCCGCTGGCTCAAGGCCGACCAGCACCGTAACAAGCGCGAGCGCCGAGGCATCAAGGCCATGTTCGAGGCCTTGCGCGCCATGGGCTACGGCGGCAG
CCGTGGGCCGGTCTACAGCTTTGCCAAGCAGTGGCGTCTGGCGCAGGACCACAGCGCCCGAGGGGCAGGCTTTGTGCCCCTGTCCTTTGAATTGGGCGAG
GCGTTCCAGTTTGACTGGAGCACCGAGTACGCGTTCATTGGCGGACTGCGCCGGCGCCTGGAGGTGGCGCACACCAAGCTGGCGGCCAGCCGGGCGTTTA
TGCTGTCGGCGTTCCACAGCCAGGCGCACGAGATGCTGTTTGAGGCCCATGCCCGGGCCTTTGCGGTGCTGGGCGGGGTGGCCAAGCGGGGTATCTACGA
CAACATGAAGACGGCAGTGGACAAGGTGGGCGCGGGCAAGCAGCGCAGCGTCAACGCCCGCTTCGAGGCGATGACCGGTCACTACCTCTTTGATCCCGAG
TTCTGCAACCGCGCAGCGGGCTGGGAGAAGGGCATCGTGGAGAAGAACGTGCAGGACCGCCGGCGTGGCATCTGGCGTGAAGCCAGCGAGCGACGCTGGG
CCAGCCTGAGCGAGCTCAACGCCTGGCTGCAGCAGGCGTGCGTGGACGCCTGGAGCGAGTTGCGCCACCCCGAGTGGCACGAGCTCACGGTGGCCGAGGT
GTGGCAGGACGAGAGGCTGCGCCTGACGCCGAACCCGCGTCCCTTCGATGGCTATGTGGAGGACCCGGTGCGGGTGACCTCGACCTCGCTGATCCACTTC
CAGCGCAACCGCTACAGCGTGCCGTGCGAATGGGTGCATGCGGTGGTCAGCCTGCGGGCGTACCACGACCGGCTGCTGGTGGTGGGCCCCAAGGGCCACA
GTGTGACGCTGCAGCGCTGCTTCGATCGTGACCAGACGATCTACGACTGGACCCACTACATTGCCCTGATCGAGCGCAAGCCCGGGGCGCTGCGTAATGG
CGCACCGTTCAAGACCATGCCCGAGCCGCTGCGACTGCTGCAAGACCAGTTGCTGCGTCATGCGGGCGGTGATCGGGTGATGGCCCAGGTGCTCAGCGCG
GTGACCGCCCACGGGCTGGAGGCGGTGCTGGTGGCGGTGGAGCTGGCCCTGCAGTCCGGGCGGGTCAGCGCCGAGCACGTACTCAATGTGCTGTCGCGCT
TGAAAGAACCCGAGCTGCGGGTGGAGCCGGCGACCACGTCCGTTGACCTGAAGACACCCTCACAGGCCGATCTGAAGCGCTACGACCGGCTGCGCCAGCC
CAAGGAGGTGCGCCATGCTTGAACTGGTCACCGACTTCAAGGCACTCGGTCTGCACGGCATGGCCAGTGCCTGGCCGGAGGTGCTGGGCACGGCACGCAT
CAAGGCGATGGACCACGAGGCGGTGCTGCGCCAGCTCATCAAGGCCGAGATGGCGCAGCGCGAGGTGCGCTCCATGGCCTACCAGATGCGCGTGGCCCGC
TTCCCCTCGCACCGGGACCTGGCGGGGTTTGACTTTGCCCAGGCGCATGTGCAGGAGGCACAGGTGCGAGAGCTGCACACCCTGCGCTTTACCGAATCCG
CGCACAACGTGGTCTTCGTCGGAGGTCCCGGAACGGGCAAGACGCACCTGGCCACCAGCCTGGGCATCGAGGCGATTCGGGTCCATGGCAAGCGGGTGCG
CTTCTTCTCGACGGTGGAGCTGGTCAACGCGCTGGAGTTGGAGAAGGCCCAGAACAAGGCTGGGCAGTTGGCCCACCGGCTGATGTACGTGGATCTGGTG
ATCCTGGACGAGATGGGCTATCTGCCCTTCACGCAGTCGGGTGGCGCCTTGCTGTTCCACCTGCTGTCCAAACTCTACGAGCGAACTAGCGTGGTCATCA
CCACCAACCTGAACTTCTCGGAGTGGAGCACCGTCTTTGGCGACGCCAAGATGACAACGGCGTTGCTGGACCGGCTGACCCACCACTGCCACATCGTGGA
GAGCGGCAACGAGAGCTGGCGCTTCAAGCATTCCAGTGCCGCTGCGGGGATCACCAAAACCGCACGGGCCAAAGCCCAGAAAGGAGCCGATCAAACGACC
CAGCCGTTAGACTTATCCACAACCAACTGACAAAACATCCTTAAAACTAGTGGCTCAATATTCGGCGTGAACACTGGCTCAGTTTGTACGGTGAATCAAC
A
AGGAGTGATCAACGTGAGCACATTGAGTAAATTGCGCCGCCTCGTGCTGAGGGATGGCGTATCGGTGCGGGAGGCCAGCCGGCGGCTGGGGATCTCGCGC
AACACGGCCAGCAAGTGGCTGTCTGAGCCCGAAATGCGGGAGCCGAAGTACCCCGTGCGGGTGAAGGCGCAGGGCGTACTGACGCCTTATGAGGACACCT
TGGCCCGCTGGCTCAAGGCCGACCAGCACCGTAACAAGCGCGAGCGCCGAGGCATCAAGGCCATGTTCGAGGCCTTGCGCGCCATGGGCTACGGCGGCAG
CCGTGGGCCGGTCTACAGCTTTGCCAAGCAGTGGCGTCTGGCGCAGGACCACAGCGCCCGAGGGGCAGGCTTTGTGCCCCTGTCCTTTGAATTGGGCGAG
GCGTTCCAGTTTGACTGGAGCACCGAGTACGCGTTCATTGGCGGACTGCGCCGGCGCCTGGAGGTGGCGCACACCAAGCTGGCGGCCAGCCGGGCGTTTA
TGCTGTCGGCGTTCCACAGCCAGGCGCACGAGATGCTGTTTGAGGCCCATGCCCGGGCCTTTGCGGTGCTGGGCGGGGTGGCCAAGCGGGGTATCTACGA
CAACATGAAGACGGCAGTGGACAAGGTGGGCGCGGGCAAGCAGCGCAGCGTCAACGCCCGCTTCGAGGCGATGACCGGTCACTACCTCTTTGATCCCGAG
TTCTGCAACCGCGCAGCGGGCTGGGAGAAGGGCATCGTGGAGAAGAACGTGCAGGACCGCCGGCGTGGCATCTGGCGTGAAGCCAGCGAGCGACGCTGGG
CCAGCCTGAGCGAGCTCAACGCCTGGCTGCAGCAGGCGTGCGTGGACGCCTGGAGCGAGTTGCGCCACCCCGAGTGGCACGAGCTCACGGTGGCCGAGGT
GTGGCAGGACGAGAGGCTGCGCCTGACGCCGAACCCGCGTCCCTTCGATGGCTATGTGGAGGACCCGGTGCGGGTGACCTCGACCTCGCTGATCCACTTC
CAGCGCAACCGCTACAGCGTGCCGTGCGAATGGGTGCATGCGGTGGTCAGCCTGCGGGCGTACCACGACCGGCTGCTGGTGGTGGGCCCCAAGGGCCACA
GTGTGACGCTGCAGCGCTGCTTCGATCGTGACCAGACGATCTACGACTGGACCCACTACATTGCCCTGATCGAGCGCAAGCCCGGGGCGCTGCGTAATGG
CGCACCGTTCAAGACCATGCCCGAGCCGCTGCGACTGCTGCAAGACCAGTTGCTGCGTCATGCGGGCGGTGATCGGGTGATGGCCCAGGTGCTCAGCGCG
GTGACCGCCCACGGGCTGGAGGCGGTGCTGGTGGCGGTGGAGCTGGCCCTGCAGTCCGGGCGGGTCAGCGCCGAGCACGTACTCAATGTGCTGTCGCGCT
TGAAAGAACCCGAGCTGCGGGTGGAGCCGGCGACCACGTCCGTTGACCTGAAGACACCCTCACAGGCCGATCTGAAGCGCTACGACCGGCTGCGCCAGCC
CAAGGAGGTGCGCCATGCTTGAACTGGTCACCGACTTCAAGGCACTCGGTCTGCACGGCATGGCCAGTGCCTGGCCGGAGGTGCTGGGCACGGCACGCAT
CAAGGCGATGGACCACGAGGCGGTGCTGCGCCAGCTCATCAAGGCCGAGATGGCGCAGCGCGAGGTGCGCTCCATGGCCTACCAGATGCGCGTGGCCCGC
TTCCCCTCGCACCGGGACCTGGCGGGGTTTGACTTTGCCCAGGCGCATGTGCAGGAGGCACAGGTGCGAGAGCTGCACACCCTGCGCTTTACCGAATCCG
CGCACAACGTGGTCTTCGTCGGAGGTCCCGGAACGGGCAAGACGCACCTGGCCACCAGCCTGGGCATCGAGGCGATTCGGGTCCATGGCAAGCGGGTGCG
CTTCTTCTCGACGGTGGAGCTGGTCAACGCGCTGGAGTTGGAGAAGGCCCAGAACAAGGCTGGGCAGTTGGCCCACCGGCTGATGTACGTGGATCTGGTG
ATCCTGGACGAGATGGGCTATCTGCCCTTCACGCAGTCGGGTGGCGCCTTGCTGTTCCACCTGCTGTCCAAACTCTACGAGCGAACTAGCGTGGTCATCA
CCACCAACCTGAACTTCTCGGAGTGGAGCACCGTCTTTGGCGACGCCAAGATGACAACGGCGTTGCTGGACCGGCTGACCCACCACTGCCACATCGTGGA
GAGCGGCAACGAGAGCTGGCGCTTCAAGCATTCCAGTGCCGCTGCGGGGATCACCAAAACCGCACGGGCCAAAGCCCAGAAAGGAGCCGATCAAACGACC
CAGCCGTTAGACTTATCCACAACCAACTGACAAAACATCCTTAAAACTAGTGGCTCAATATTCGGCGTGAACACTGGCTCAGTTTGTACGGTGAATCAAC
A
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1509 bp | 502 aa | 114 | 1622 | + | No |
Chemistry : DDE
ORF sequence :
MSTLSKLRRLVLRDGVSVREASRRLGISRNTASKWLSEPEMREPKYPVRVKAQGVLTPYEDTLARWLKADQHRNKRERRGIKAMFEALRAMGYGGSRGPV
YSFAKQWRLAQDHSARGAGFVPLSFELGEAFQFDWSTEYAFIGGLRRRLEVAHTKLAASRAFMLSAFHSQAHEMLFEAHARAFAVLGGVAKRGIYDNMKT
AVDKVGAGKQRSVNARFEAMTGHYLFDPEFCNRAAGWEKGIVEKNVQDRRRGIWREASERRWASLSELNAWLQQACVDAWSELRHPEWHELTVAEVWQDE
RLRLTPNPRPFDGYVEDPVRVTSTSLIHFQRNRYSVPCEWVHAVVSLRAYHDRLLVVGPKGHSVTLQRCFDRDQTIYDWTHYIALIERKPGALRNGAPFK
TMPEPLRLLQDQLLRHAGGDRVMAQVLSAVTAHGLEAVLVAVELALQSGRVSAEHVLNVLSRLKEPELRVEPATTSVDLKTPSQADLKRYDRLRQPKEVR
HA
YSFAKQWRLAQDHSARGAGFVPLSFELGEAFQFDWSTEYAFIGGLRRRLEVAHTKLAASRAFMLSAFHSQAHEMLFEAHARAFAVLGGVAKRGIYDNMKT
AVDKVGAGKQRSVNARFEAMTGHYLFDPEFCNRAAGWEKGIVEKNVQDRRRGIWREASERRWASLSELNAWLQQACVDAWSELRHPEWHELTVAEVWQDE
RLRLTPNPRPFDGYVEDPVRVTSTSLIHFQRNRYSVPCEWVHAVVSLRAYHDRLLVVGPKGHSVTLQRCFDRDQTIYDWTHYIALIERKPGALRNGAPFK
TMPEPLRLLQDQLLRHAGGDRVMAQVLSAVTAHGLEAVLVAVELALQSGRVSAEHVLNVLSRLKEPELRVEPATTSVDLKTPSQADLKRYDRLRQPKEVR
HA
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
816 bp | 271 aa | 1615 | 2430 | + | No |
AG : IS21 helper
ORF sequence :
MLELVTDFKALGLHGMASAWPEVLGTARIKAMDHEAVLRQLIKAEMAQREVRSMAYQMRVARFPSHRDLAGFDFAQAHVQEAQVRELHTLRFTESAHNVV
FVGGPGTGKTHLATSLGIEAIRVHGKRVRFFSTVELVNALELEKAQNKAGQLAHRLMYVDLVILDEMGYLPFTQSGGALLFHLLSKLYERTSVVITTNLN
FSEWSTVFGDAKMTTALLDRLTHHCHIVESGNESWRFKHSSAAAGITKTARAKAQKGADQTTQPLDLSTTN
FVGGPGTGKTHLATSLGIEAIRVHGKRVRFFSTVELVNALELEKAQNKAGQLAHRLMYVDLVILDEMGYLPFTQSGGALLFHLLSKLYERTSVVITTNLN
FSEWSTVFGDAKMTTALLDRLTHHCHIVESGNESWRFKHSSAAAGITKTARAKAQKGADQTTQPLDLSTTN
Blast result :
Comments
ISCsp4 is 73% (transposasse) and 79% (helper) similar to ISAav1.
References