ISLxc1
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
QD191803 | ND | Leifsonia xyli | Leifsonia xyli subsp. cynodontis |
DNA section
IS Length : 2630 bp
Ends
IR Length : 37/50
IRL : TGTCTGTGTCACTGTTAGCGGGCTCCGGGTTGTCGGAGTTGTCTGGCCCC
IRR : TGACTATGTCAAGGCTGGCGGGGATCGGGTTGTCGAAGTTAGTTGGCCCC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCTGGCG | ATCGCC | CGGCTCC | 6 |
DNA sequence
TGTCTGTGTCACTGTTAGCGGGCTCCGGGTTGTCGGAGTTGTCTGGCCCCGGCGGCGACTTGCCGGGCTGGTTGTGTCGAAGCTAAGAGGTGGTTTCAGT
AGTCGGCGTGGCGGTGGTTGTTGTGGCTGGTTAGCGGGGATGCTGGTCGAGTGTCGACGCTTGCTGAGGATCGGTTCATTACGGATATGTTGTGGGAGCG
GCTGGAGCCGTTGATTCCGCCTCGGCCGCCTGTGGTCAATGGGCGGGCTGGGCAGCCTCGGGTTCCTGACCGGAAGGTGTTCGCTGGGATCGTGTTCGTG
CTGCTGACGGGGATCCCGTGGAGAAGCTCCCGCCCGAGTTGGGGTATGGGTCCGGGGTCACTTGTTGGCGGCGTCTGCGTGAATGGTCCGAAGCGGGCGC
GTGGGATGCACTGCGGAAGATCATGCTCGACGAACTCGGCCAGGCTGGCATGATCGACTGGTCAAGAACCTGCCTGGACTCCGTAAGTGTCCGGGCGAAA
AGGGGGGGCGATCTCACTGGACCTAACCCCACGGATCGTGGGAAACGGGGCACCAAGTACCATGTCCTGACCGACCGCAACGGACTCCCGCTGCATGTGG
AGATCTCCGGCGCCAACCGACACGACTCCATGCTCGTGGAACCTGTGTTAGACAACATCACCGCGATCAAGGGCGTCGGCCGCGGTCGGCCCAGACGCCG
CCCGGTTATCTTCCACGCCGATAAGGCTTACGACAACCGCCGCGTCCGCTGTTACCTGCGTTGTCGTGGGATCAAGGCACGCATCGCACGAATCGGAGTC
GACTCCAAACAGTGACTGGGTAAACACTGTTGGGTAGTCGAACGCACCATGGCCTGGATCCTCGCCTTCCGGAAACTCGCCACTCGCTACGACCGCACCG
CCTCAACGATCACGGCGCTCGTCGCTCTAGCAATCGCGATCACCAGCGCCCGCAAACTCACCAAAAACGACTACTGAAACCATCTCTAAGTGGCCCCACC
TGAGTGTGGTTCCCGTCTGTTACTTCAGGATGGGAGGGCCCGGGTTGGGGTCGAGGTTGGATCAGTTCGCTGTGATCAGGAGAGATGCCCGGGTGGAGGG
TCTCTCTATTCGTGAGCTTGCTGTTCGTCATGGTGTTCACCGGCGGACGGTTCGTCAGGCTTTGGAGTCAGCGACGCCACCGTCGCGGAAGCCGAGAGTT
CAGTCGTCGCCGAAGCTGGATCTGGTTCGCGAGCTGATCGACGCGATTCTGCGTCAGGATCTCGGGGCGCCGAAGAAGCAGAAGCAGACGGCGACACGGA
CCTGGAAACGTCTCCTCGATGAGCACCAGGTCGATGTCTCCTACGGCACGGTCCGTGACTATGTGCGCTCCCGGCGCCCGCAGATCGACGCGGAGGCTGG
TCGTCTGCCGGAGGTGTTCGTCCCGCAGGAGCATGCTCCGGGGGCGGAGGCGGAGGTCGATTTCGGTGAGGTCTGGGTGATCCTGGCCGGGGTGAAGACC
AAGTGCCACATGTTCACTTTTCGGCTCTCGTACTCGGGGAAGGCGATTCACCGGGTCTACTCGACGCAGTCGCAAGAGGCGTTCCTGGAAGGCCACATCG
ACGCGTTCGAGGAGATTGGCGGGATGCCGACCCTTCACATCAAATACGACAACTTGGCCGCCGCGGTGAAGTCCGTGGTCAACGGCAAGGACCGCAAACG
GGTCGAGAATGACCGTTGGGTCCTGTTCCGTTCCCACTACGGATTCGACGCGTTCTACTGCCAGCCCGGCATTGACGGTGCGCACGAGAAGGGCGGCGTT
GAGGGCGAGGTCGGCCGGTTCCGCCGCACGTGGCTCTCACCCATGCCCGAAGTCGACTCCCTCGCGCAACTGAACGCGATGATCCGCCGCTGGGATGCCC
GTGACGAGCAGCGCCGGATCGCGCAGCGCCGAACCAAGGTCGAGGACGACTTCGCTACAGAACGCCCGCTCCTTCGGCCGCTGCCGGCGGAGCGATTTGA
TCCGGGACTGGTGCTGCATCCTCGGGTCGACCGGTCCGGGCTGATCACGGTCCGGATGGCGAAATACTCTGTCCCTGCCCGGCTCATCGGCCGTGAAGTC
CGCGTCTCTCCGCGCTTCCGAGGTCGTCGTCTTCGACGGGCGCGTTGAAGTCGCCCGGCACGAACGCGTGGTCGCCCGCGGAGGCGAGTCGATCCAGCTG
GATCACTACCTCGAGGTCCTCCGGCACAAGCCCGGCGCGTTCCCCGGCTCGACAGCACTTGCACGCGCCCGGGAGGCGGGAACCTTCACCGCCGCCCATG
AAGCGTTCTGGCAGGAGGCAAGGAAAGTCAATGGCGACACGGCCGGCACCAGAGAGCTCGTCGACGTGCTCCTGCTCCATCGCAGCATGCGCGCAGCCGA
CGTGATCGCCGGGATCCGCGCGGCACTCTCGGTCGGGGCCATCTCCGCTGACGTCGTCGCGGTTGAAGCACGACTGTACGCAGGTGGGGCCATCCAACAC
CGACAGCCCGTTGAACAACGCCCCGATCGCGAGCGACGAGTTGTCAGTCTCACCCAGCGCCGGCTCCGGGATCCGCAGGCTGTCATCGCCGGCCTGCCAC
AGGACAAACGCCCGCTCCCGACCGTCACCCAATACGACGAACTCCTCCAGCGCCGCCCCGTCTTCACCGACCCCACCACCGATGAGAGAGAAGGAACCAC
CGACACATGAGCCCAACCACCACGAACATCACCACCACCCTCCGCCGGCAACGCGGGATGACCCAGGAAGCCGCCGCGGCCGCCGTCGACCAAGCCTGCA
GACGCCTGCGACTACCGACCATTCGCGCCGTGATGGACGAAGCGATCCGGGTCGCCGAGCACGAGCAGCTGTCCTACCAAGGCTTCCTCGCCGAAGTGCT
GTTGGCCGAGTGCGACGACCGCGACCGCCGCTCCACCGTCCGCCGCGTCGCCTCCGCCGGCTTCCCACGTCAGAAATGGCTCGGCGACTTCGACTTCGAC
ACCAACCCGAACATCAACGCGGCGACCATCCACACGCTCGCCACCGGCGACTGGGTCAGACGCGGCGACCCGCTCTGCCTCATCGGGGACTCCGGCACCG
GCAAGAGCCACCTCCTCATCGGCCTCGGCACCGCCGCAGCCGAGAAGGGCTACCGAGTCAAATACACCCTCGCGACCAAGCTCGTGAACGAACTCGTCGA
AGCAGCAGATGAGAAGCAGTTGGCCCGCACGATCGCTCGCTACGGCCGCGTCGATCTGCTCTGCATCGACGAGCTCGGCTACATGGAACTCGACCGACGC
GGCGCCGAGCTCCTCTTCCAAGTCCTCACCGAACGCGAAGAGAAGAACTCCGTCGCGATCGCATCCAACCAGTCATTCACGGGATGGACGGACACCTTCA
CCGACCCCAGGCTCTGCGCTGCCATCATCGAGACCGGCACCACCTCCTACCGCCTCCAACACACCCGCAACACCGCACTCGCTGGGGCCAACTAACTTCG
ACAACCCGATCCCCGCCAGCCTTGACATAGTCA
AGTCGGCGTGGCGGTGGTTGTTGTGGCTGGTTAGCGGGGATGCTGGTCGAGTGTCGACGCTTGCTGAGGATCGGTTCATTACGGATATGTTGTGGGAGCG
GCTGGAGCCGTTGATTCCGCCTCGGCCGCCTGTGGTCAATGGGCGGGCTGGGCAGCCTCGGGTTCCTGACCGGAAGGTGTTCGCTGGGATCGTGTTCGTG
CTGCTGACGGGGATCCCGTGGAGAAGCTCCCGCCCGAGTTGGGGTATGGGTCCGGGGTCACTTGTTGGCGGCGTCTGCGTGAATGGTCCGAAGCGGGCGC
GTGGGATGCACTGCGGAAGATCATGCTCGACGAACTCGGCCAGGCTGGCATGATCGACTGGTCAAGAACCTGCCTGGACTCCGTAAGTGTCCGGGCGAAA
AGGGGGGGCGATCTCACTGGACCTAACCCCACGGATCGTGGGAAACGGGGCACCAAGTACCATGTCCTGACCGACCGCAACGGACTCCCGCTGCATGTGG
AGATCTCCGGCGCCAACCGACACGACTCCATGCTCGTGGAACCTGTGTTAGACAACATCACCGCGATCAAGGGCGTCGGCCGCGGTCGGCCCAGACGCCG
CCCGGTTATCTTCCACGCCGATAAGGCTTACGACAACCGCCGCGTCCGCTGTTACCTGCGTTGTCGTGGGATCAAGGCACGCATCGCACGAATCGGAGTC
GACTCCAAACAGTGACTGGGTAAACACTGTTGGGTAGTCGAACGCACCATGGCCTGGATCCTCGCCTTCCGGAAACTCGCCACTCGCTACGACCGCACCG
CCTCAACGATCACGGCGCTCGTCGCTCTAGCAATCGCGATCACCAGCGCCCGCAAACTCACCAAAAACGACTACTGAAACCATCTCTAAGTGGCCCCACC
TGAGTGTGGTTCCCGTCTGTTACTTCAGGATGGGAGGGCCCGGGTTGGGGTCGAGGTTGGATCAGTTCGCTGTGATCAGGAGAGATGCCCGGGTGGAGGG
TCTCTCTATTCGTGAGCTTGCTGTTCGTCATGGTGTTCACCGGCGGACGGTTCGTCAGGCTTTGGAGTCAGCGACGCCACCGTCGCGGAAGCCGAGAGTT
CAGTCGTCGCCGAAGCTGGATCTGGTTCGCGAGCTGATCGACGCGATTCTGCGTCAGGATCTCGGGGCGCCGAAGAAGCAGAAGCAGACGGCGACACGGA
CCTGGAAACGTCTCCTCGATGAGCACCAGGTCGATGTCTCCTACGGCACGGTCCGTGACTATGTGCGCTCCCGGCGCCCGCAGATCGACGCGGAGGCTGG
TCGTCTGCCGGAGGTGTTCGTCCCGCAGGAGCATGCTCCGGGGGCGGAGGCGGAGGTCGATTTCGGTGAGGTCTGGGTGATCCTGGCCGGGGTGAAGACC
AAGTGCCACATGTTCACTTTTCGGCTCTCGTACTCGGGGAAGGCGATTCACCGGGTCTACTCGACGCAGTCGCAAGAGGCGTTCCTGGAAGGCCACATCG
ACGCGTTCGAGGAGATTGGCGGGATGCCGACCCTTCACATCAAATACGACAACTTGGCCGCCGCGGTGAAGTCCGTGGTCAACGGCAAGGACCGCAAACG
GGTCGAGAATGACCGTTGGGTCCTGTTCCGTTCCCACTACGGATTCGACGCGTTCTACTGCCAGCCCGGCATTGACGGTGCGCACGAGAAGGGCGGCGTT
GAGGGCGAGGTCGGCCGGTTCCGCCGCACGTGGCTCTCACCCATGCCCGAAGTCGACTCCCTCGCGCAACTGAACGCGATGATCCGCCGCTGGGATGCCC
GTGACGAGCAGCGCCGGATCGCGCAGCGCCGAACCAAGGTCGAGGACGACTTCGCTACAGAACGCCCGCTCCTTCGGCCGCTGCCGGCGGAGCGATTTGA
TCCGGGACTGGTGCTGCATCCTCGGGTCGACCGGTCCGGGCTGATCACGGTCCGGATGGCGAAATACTCTGTCCCTGCCCGGCTCATCGGCCGTGAAGTC
CGCGTCTCTCCGCGCTTCCGAGGTCGTCGTCTTCGACGGGCGCGTTGAAGTCGCCCGGCACGAACGCGTGGTCGCCCGCGGAGGCGAGTCGATCCAGCTG
GATCACTACCTCGAGGTCCTCCGGCACAAGCCCGGCGCGTTCCCCGGCTCGACAGCACTTGCACGCGCCCGGGAGGCGGGAACCTTCACCGCCGCCCATG
AAGCGTTCTGGCAGGAGGCAAGGAAAGTCAATGGCGACACGGCCGGCACCAGAGAGCTCGTCGACGTGCTCCTGCTCCATCGCAGCATGCGCGCAGCCGA
CGTGATCGCCGGGATCCGCGCGGCACTCTCGGTCGGGGCCATCTCCGCTGACGTCGTCGCGGTTGAAGCACGACTGTACGCAGGTGGGGCCATCCAACAC
CGACAGCCCGTTGAACAACGCCCCGATCGCGAGCGACGAGTTGTCAGTCTCACCCAGCGCCGGCTCCGGGATCCGCAGGCTGTCATCGCCGGCCTGCCAC
AGGACAAACGCCCGCTCCCGACCGTCACCCAATACGACGAACTCCTCCAGCGCCGCCCCGTCTTCACCGACCCCACCACCGATGAGAGAGAAGGAACCAC
CGACACATGAGCCCAACCACCACGAACATCACCACCACCCTCCGCCGGCAACGCGGGATGACCCAGGAAGCCGCCGCGGCCGCCGTCGACCAAGCCTGCA
GACGCCTGCGACTACCGACCATTCGCGCCGTGATGGACGAAGCGATCCGGGTCGCCGAGCACGAGCAGCTGTCCTACCAAGGCTTCCTCGCCGAAGTGCT
GTTGGCCGAGTGCGACGACCGCGACCGCCGCTCCACCGTCCGCCGCGTCGCCTCCGCCGGCTTCCCACGTCAGAAATGGCTCGGCGACTTCGACTTCGAC
ACCAACCCGAACATCAACGCGGCGACCATCCACACGCTCGCCACCGGCGACTGGGTCAGACGCGGCGACCCGCTCTGCCTCATCGGGGACTCCGGCACCG
GCAAGAGCCACCTCCTCATCGGCCTCGGCACCGCCGCAGCCGAGAAGGGCTACCGAGTCAAATACACCCTCGCGACCAAGCTCGTGAACGAACTCGTCGA
AGCAGCAGATGAGAAGCAGTTGGCCCGCACGATCGCTCGCTACGGCCGCGTCGATCTGCTCTGCATCGACGAGCTCGGCTACATGGAACTCGACCGACGC
GGCGCCGAGCTCCTCTTCCAAGTCCTCACCGAACGCGAAGAGAAGAACTCCGTCGCGATCGCATCCAACCAGTCATTCACGGGATGGACGGACACCTTCA
CCGACCCCAGGCTCTGCGCTGCCATCATCGAGACCGGCACCACCTCCTACCGCCTCCAACACACCCGCAACACCGCACTCGCTGGGGCCAACTAACTTCG
ACAACCCGATCCCCGCCAGCCTTGACATAGTCA
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1119 bp | 372 aa | 127 | 1245 | + | No |
Description :
ORF sequence :
MGGPGLGSRLDQFAVIRRDARVEGLSIRELAVRHGVHRRTVRQALESATPPSRKPRVQSSPKLDLVRELIDAILRQDLGAPKKQKQTATRTWKRLLDEHQ
VDVSYGTVRDYVRSRRPQIDAEAGRLPEVFVPQEHAPGAEAEVDFGEVWVILAGVKTKCHMFTFRLSYSGKAIHRVYSTQSQEAFLEGHIDAFEEIGGMP
TLHIKYDNLAAAVKSVVNGKDRKRVENDRWVLFRSHYGFDAFYCQPGIDGAHEKGGVEGEVGRFRRTWLSPMPEVDSLAQLNAMIRRWDARDEQRRIAQR
RTKVEDDFATERPLLRPLPAERFDPGLVLHPRVDRSGLITVRMAKYSVPARLIGREVRVSPRFRGRRLRRAR
VDVSYGTVRDYVRSRRPQIDAEAGRLPEVFVPQEHAPGAEAEVDFGEVWVILAGVKTKCHMFTFRLSYSGKAIHRVYSTQSQEAFLEGHIDAFEEIGGMP
TLHIKYDNLAAAVKSVVNGKDRKRVENDRWVLFRSHYGFDAFYCQPGIDGAHEKGGVEGEVGRFRRTWLSPMPEVDSLAQLNAMIRRWDARDEQRRIAQR
RTKVEDDFATERPLLRPLPAERFDPGLVLHPRVDRSGLITVRMAKYSVPARLIGREVRVSPRFRGRRLRRAR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
615 bp | 205 aa | 1190 | 1804 | + | No |
Chemistry : DDE
ORF sequence :
VKSASLRASEVVVFDGRVEVARHERVVARGGESIQLDHYLEVLRHKPGAFPGSTALARAREAGTFTAAHEAFWQEARKVNGDTAGTRELVDVLLLHRSMR
AADVIAGIRAALSVGAISADVVAVEARLYAGGAIQHRQPVEQRPDRERRVVSLTQRRLRDPQAVIAGLPQDKRPLPTVTQYDELLQRRPVFTDPTTDERE
GTTDT
AADVIAGIRAALSVGAISADVVAVEARLYAGGAIQHRQPVEQRPDRERRVVSLTQRRLRDPQAVIAGLPQDKRPLPTVTQYDELLQRRPVFTDPTTDERE
GTTDT
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
786 bp | 262 aa | 1804 | 2589 | + | No |
AG : IS21 helper
ORF sequence :
MSPTTTNITTTLRRQRGMTQEAAAAAVDQACRRLRLPTIRAVMDEAIRVAEHEQLSYQGFLAEVLLAECDDRDRRSTVRRVASAGFPRQKWLGDFDFDTN
PNINAATIHTLATGDWVRRGDPLCLIGDSGTGKSHLLIGLGTAAAEKGYRVKYTLATKLVNELVEAADEKQLARTIARYGRVDLLCIDELGYMELDRRGA
ELLFQVLTEREEKNSVAIASNQSFTGWTDTFTDPRLCAAIIETGTTSYRLQHTRNTALAGAN
PNINAATIHTLATGDWVRRGDPLCLIGDSGTGKSHLLIGLGTAAAEKGYRVKYTLATKLVNELVEAADEKQLARTIARYGRVDLLCIDELGYMELDRRGA
ELLFQVLTEREEKNSVAIASNQSFTGWTDTFTDPRLCAAIIETGTTSYRLQHTRNTALAGAN
Blast result :
Comments
ISLxc1 is a transposon which inserted by IS1237 at 86nt after the begin of the sequence. ISLxc1 has three ORFs. ORF1 and ORF2 can translate an intact IstA protein when a frameshitting occured at 1207nt site. ORF3 translate IstB protein. The IstA is 68% similar to ISlxx3 and IstB is 78% similar to ISLxx3.
The ISLxc1 sequence was reconstructed by in silico deletion of the IS1237 and of one of the DR generated by the insertion of this sequence.
The ISLxc1 sequence was reconstructed by in silico deletion of the IS1237 and of one of the DR generated by the insertion of this sequence.
References
1] Hui Lin, Yong-ping Zhang, Tai-yuan Li, Yi Zhang (20005) Direct submission.