ISSeq2
- Family ISNCY
- Group IS1202
Isoform Synonym(s) ISSzo2
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_012470 | ND | Streptococcus equi | Streptococcus equi subsp. equi 4047 Streptococcus equi subsp. zooepidemicus H70 |
DNA section
IS Length : 1544 bp
Ends
IR Length : 21/25
IRL : TATCATTCGAAAGTGATAATGTCTTAGCTAGGTCGAATGTGAAAATGTCC
IRR : TGTCAATCGAAAATGACAATGTCTTGAAAAAATCGGTTAGATAAGTGAAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GAAAATACAA | CTAATTTCTTCTACTACCAAAGCCTCCC | CAGGCTTTTT | 28 |
CCCCTGTCAA | GCTCTATTTTCCTTTTTCAGCCCTTGT | ATACTGGGGA | 27 |
GCTCAAATGT | TTTATGACCATTATCAGCCTTCGG | CCTAAGGCTC | 24 |
TGGTTGTCAT | CTTTTTTTTGAAGGCTTCTCGTTATA | GGCCTTGGTA | 26 |
ATAATTTCAA | CTGTTTTCTTATTTTTTAGTCATTCTC | ATTCAACAAT | 27 |
CAGATGACAA | GCTATTTGATTTTTTTCTGAGACGAC | GTTACAAGGG | 26 |
TATTTTAAAA | GACTTTTTTCTAAAAAGATCACACCAT | ATCCATGAAG | 27 |
TAGAACTTTC | GACTTTTTTT | 0 | |
GAAAATACAA | CTAATTTCTTCTACTACCAAAGCCTCCC | CAGGCTTTTT | 28 |
GCCATGTCAA | TTATTTTTTCTGCTTTTTCCTGAATA | AATTGGACAA | 26 |
CTTGTGTCAA | ATGCTTTTTAAGCTAGGCTTAGCGTTTT | TTGCTGCCTT | 28 |
CTGCTCAAAG | CCCTTCATAATGGCTAGCAATTCTGG | CGGTCATCAA | 26 |
CTTTTGGCAA | GCTATTTTTTACATCTGCAGGCCTTAGCC | ACTAAGCAGC | 29 |
DNA sequence
TATCATTCGAAAGTGATAATGTCTTAGCTAGGTCGAATGTGAAAATGTCCCGTCTGTTAAACTAGAAAGATGAGGAAAATAGAACTAACTATGATAGAGT
CAAAGAAATACCTAGTCATCAAAGCTGTTTGTGAGGGTAAAAAACAGAAAAACAGGGCTTGTGTAGAATTAGGGTTAAGCAAGAGGCAGGTGAACCGATT
GATACTAGCCTATCAAGAGAAAGGAAAGTCAGCTTTCGTTCATGGTAATAGATCAAAAAGACCAACTCATGCCATGTCTTTAGAAACAAAGAGAAGGATT
ATCGAGAAATATCAAAGTTACGGTGATTTAAGACCAAATGTTGTTCATTTCTGCGAACTACTAGCGGAAGAAGAAAACATCGCCTATTCAGATACAACTG
TTAGAAAACTACTCTACCAAGCAGGGTTTCTATCTCCAAAGACTCAACGAGCAACGAAAAGACGATTGAAACAAGAAGCCAAACAAAAGAAAAGGGAGGC
AGAAAAAAGAGGAGCCAAACTCCCAACTGCTTCCAACTTCTTTGAAGAACCTGACAAAGCTCATCCCAGCCGAGCAAGAAAGAAATTCAAAGGGGAACTC
ATTCAAATGGATGCCAGTCAATTCCCTTGGTTTGGACAACAGGAAACACATCTTCATGTCGCTATTGACGATACCTCCGGTGATATTGTCGGGGCTTACT
TTGATACTCAGGAGACGCTAAACGGGTATTATCATGTCTTAGAACAGATTCTAGAGGTACATGGTATCCCTTTCCAGTTCCTTACTGATAAAAGGACTGT
ATTCACTTATGCTTCCAGTCAGTCTAAAAAGATCGAGGAAGATACCTTTACACAGTTTGGATATGCTTGCCATCAGCTTGGTATTGCCATTGAAACGTCT
TCTATCCCTCAAGCTAAAGGGCGTGTAGAACGTCTTAATCAGACCCTTCAATCGCGTCTACCAATTGATTTACAGAGGAATCAAATTACCAGCATCTCTC
AAGCGAATCGTTATCTTAAGAGATGGATTAAACGATTTAACAAGCAGTTTGGTGGACTGGCTAGTGAGTCTGTTTTTGAGAAAGCACCTAAACCAGCCCA
GCGAAACCTACTGTTAGCGAGAGTCTCTGAAAGAGTGATTGATAGCGGGCATCATATTCGATATCAAAACAACTTCTACTTGCCCGTCGAAGGGGATAAA
GAAATCTATTTTACGCGTAAGACAAAGGCACTTGTGATTGAGGCATTCGATGGAGACATCTACCTTAATATCGCAGACAATATTTATGCCACTAGAAAGT
TACCAAAACACGAGAAGCACTCCAAAGAATTCGAAATGGTGCCTAAAACTAAAAAAGAAAGACGCAAGTATATTCCACCACAATCCCATCCGTGGAAACT
TGCATCTTTCAAACAATACCTTCATAAAATCGGAAAATCTTATGAAGAATTCCAGCGTGAGAAGAATTCTTCTCACCCACAATTATAACAGATTTTTCAC
TTATCTAACCGATTTTTTCAAGACATTGTCATTTTCGATTGACA
CAAAGAAATACCTAGTCATCAAAGCTGTTTGTGAGGGTAAAAAACAGAAAAACAGGGCTTGTGTAGAATTAGGGTTAAGCAAGAGGCAGGTGAACCGATT
GATACTAGCCTATCAAGAGAAAGGAAAGTCAGCTTTCGTTCATGGTAATAGATCAAAAAGACCAACTCATGCCATGTCTTTAGAAACAAAGAGAAGGATT
ATCGAGAAATATCAAAGTTACGGTGATTTAAGACCAAATGTTGTTCATTTCTGCGAACTACTAGCGGAAGAAGAAAACATCGCCTATTCAGATACAACTG
TTAGAAAACTACTCTACCAAGCAGGGTTTCTATCTCCAAAGACTCAACGAGCAACGAAAAGACGATTGAAACAAGAAGCCAAACAAAAGAAAAGGGAGGC
AGAAAAAAGAGGAGCCAAACTCCCAACTGCTTCCAACTTCTTTGAAGAACCTGACAAAGCTCATCCCAGCCGAGCAAGAAAGAAATTCAAAGGGGAACTC
ATTCAAATGGATGCCAGTCAATTCCCTTGGTTTGGACAACAGGAAACACATCTTCATGTCGCTATTGACGATACCTCCGGTGATATTGTCGGGGCTTACT
TTGATACTCAGGAGACGCTAAACGGGTATTATCATGTCTTAGAACAGATTCTAGAGGTACATGGTATCCCTTTCCAGTTCCTTACTGATAAAAGGACTGT
ATTCACTTATGCTTCCAGTCAGTCTAAAAAGATCGAGGAAGATACCTTTACACAGTTTGGATATGCTTGCCATCAGCTTGGTATTGCCATTGAAACGTCT
TCTATCCCTCAAGCTAAAGGGCGTGTAGAACGTCTTAATCAGACCCTTCAATCGCGTCTACCAATTGATTTACAGAGGAATCAAATTACCAGCATCTCTC
AAGCGAATCGTTATCTTAAGAGATGGATTAAACGATTTAACAAGCAGTTTGGTGGACTGGCTAGTGAGTCTGTTTTTGAGAAAGCACCTAAACCAGCCCA
GCGAAACCTACTGTTAGCGAGAGTCTCTGAAAGAGTGATTGATAGCGGGCATCATATTCGATATCAAAACAACTTCTACTTGCCCGTCGAAGGGGATAAA
GAAATCTATTTTACGCGTAAGACAAAGGCACTTGTGATTGAGGCATTCGATGGAGACATCTACCTTAATATCGCAGACAATATTTATGCCACTAGAAAGT
TACCAAAACACGAGAAGCACTCCAAAGAATTCGAAATGGTGCCTAAAACTAAAAAAGAAAGACGCAAGTATATTCCACCACAATCCCATCCGTGGAAACT
TGCATCTTTCAAACAATACCTTCATAAAATCGGAAAATCTTATGAAGAATTCCAGCGTGAGAAGAATTCTTCTCACCCACAATTATAACAGATTTTTCAC
TTATCTAACCGATTTTTTCAAGACATTGTCATTTTCGATTGACA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1398 bp | 465 aa | 91 | 1488 | + | No |
Chemistry : DDE
ORF sequence :
MIESKKYLVIKAVCEGKKQKNRACVELGLSKRQVNRLILAYQEKGKSAFVHGNRSKRPTHAMSLETKRRIIEKYQSYGDLRPNVVHFCELLAEEENIAYS
DTTVRKLLYQAGFLSPKTQRATKRRLKQEAKQKKREAEKRGAKLPTASNFFEEPDKAHPSRARKKFKGELIQMDASQFPWFGQQETHLHVAIDDTSGDIV
GAYFDTQETLNGYYHVLEQILEVHGIPFQFLTDKRTVFTYASSQSKKIEEDTFTQFGYACHQLGIAIETSSIPQAKGRVERLNQTLQSRLPIDLQRNQIT
SISQANRYLKRWIKRFNKQFGGLASESVFEKAPKPAQRNLLLARVSERVIDSGHHIRYQNNFYLPVEGDKEIYFTRKTKALVIEAFDGDIYLNIADNIYA
TRKLPKHEKHSKEFEMVPKTKKERRKYIPPQSHPWKLASFKQYLHKIGKSYEEFQREKNSSHPQL
DTTVRKLLYQAGFLSPKTQRATKRRLKQEAKQKKREAEKRGAKLPTASNFFEEPDKAHPSRARKKFKGELIQMDASQFPWFGQQETHLHVAIDDTSGDIV
GAYFDTQETLNGYYHVLEQILEVHGIPFQFLTDKRTVFTYASSQSKKIEEDTFTQFGYACHQLGIAIETSSIPQAKGRVERLNQTLQSRLPIDLQRNQIT
SISQANRYLKRWIKRFNKQFGGLASESVFEKAPKPAQRNLLLARVSERVIDSGHHIRYQNNFYLPVEGDKEIYFTRKTKALVIEAFDGDIYLNIADNIYA
TRKLPKHEKHSKEFEMVPKTKKERRKYIPPQSHPWKLASFKQYLHKIGKSYEEFQREKNSSHPQL
Blast result :
Comments
ISSeq2 is 79% aa similar to IS1202.
References
1] ISfinder submission (2012)
2] Holden,M.T., Heather,Z., Paillot,R., Steward,K.F., Webb,K., Ainslie,F., Jourdan,T., Bason,N.C., Holroyd,N.E., Mungall,K., Quail,M.A., Sanders,M., Simmonds,M., Willey,D., Brooks,K., Aanensen,D.M., Spratt,B.G., Jolley,K.A., Maiden,M.C., Kehoe,M., Chanter,N., Bentley,S.D., Robinson,C., Maskell,D.J., Parkhill,J. (2009) PLoS Pathog. 5 (3), E1000346 .
2] Holden,M.T., Heather,Z., Paillot,R., Steward,K.F., Webb,K., Ainslie,F., Jourdan,T., Bason,N.C., Holroyd,N.E., Mungall,K., Quail,M.A., Sanders,M., Simmonds,M., Willey,D., Brooks,K., Aanensen,D.M., Spratt,B.G., Jolley,K.A., Maiden,M.C., Kehoe,M., Chanter,N., Bentley,S.D., Robinson,C., Maskell,D.J., Parkhill,J. (2009) PLoS Pathog. 5 (3), E1000346 .