ISSeq1
- Family ISL3
- Group
Isoform Synonym(s) ISSzo1
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_012470 | ND | Streptococcus equi | Streptococcus equi subsp. zooepidemicus H70; Streptococcus equi equi 4047; Streptococcus equi zooepidemicus 35246 |
DNA section
IS Length : 1415 bp
Ends
IR Length : 21/25
IRL : GGCTCTATAATTTCTGTAGTGGGTAAGTCCACTGCAGGGGTTATAGGGCT
IRR : GGCTCTATGTCAACTGTAGTGGGTAATTGACAAGCTAACATCTGGAGAGG
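The two terminal sequences above can be checked against the ends of the element. The sketch below assumes that IRR is reported as the reverse complement of the right end, which is what the data in this record suggest. `reverse_complement`, `SEQ_HEAD`, and `SEQ_TAIL` are illustrative names, and the two 50-nt strings are copied from the first and last 50 nt of the DNA sequence in this record.

```python
# Illustrative consistency check; the names below are hypothetical
# and are not part of any ISfinder tooling.
IRL = "GGCTCTATAATTTCTGTAGTGGGTAAGTCCACTGCAGGGGTTATAGGGCT"
IRR = "GGCTCTATGTCAACTGTAGTGGGTAATTGACAAGCTAACATCTGGAGAGG"

# First and last 50 nt of the 1415-bp DNA sequence of this record.
SEQ_HEAD = "GGCTCTATAATTTCTGTAGTGGGTAAGTCCACTGCAGGGGTTATAGGGCT"
SEQ_TAIL = "CCTCTCCAGATGTTAGCTTGTCAATTACCCACTACAGTTGACATAGAGCC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return dna.translate(_COMPLEMENT)[::-1]


# IRL should be the left end as written; IRR should match the right end
# once it is reverse-complemented back onto the top strand.
left_ok = SEQ_HEAD == IRL
right_ok = reverse_complement(IRR) == SEQ_TAIL
print(left_ok, right_ok)
```

Writing both ends in the same orientation is what makes the inverted-repeat positions directly comparable.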
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GAGGGGGTAA | TGATTATA | CCGTTAAAT | 8 |
CAATTACCCA | GCTTTTTA | TAGTATTAT | 8 |
CTCTTTAGAC | GTTAGTTA | TTAGTCAAC | 8 |
CTTCAATGGT | TAAAGTTT | TGGACTAAC | 8 |
AATCTCTTGA | CAAATAAA | AAAGGGGTG | 8 |
TCATCACCTA | CCTTTTTT | AACCTCCTG | 8 |
CTTTTGATTT | | TATGATTTTA | 0 |
CTCAGTCCAA | CAAAAAAG | CGCCCAAGAT | 8 |
CACACAAAAA | | CCAAAAAAAG | 0 |
AAAGATGACA | ATAAAAAA | TAACCGTCTT | 8 |
GCCATTCCAA | AGTATATG | TCATTCTAAT | 8 |
ATTCAACTAA | CAAAAAGC | AAGAGGACAT | 8 |
AAATTAGCAA | TACAAAAA | TGCCTACCAG | 8 |
AAATAAATAC | | CAAAAATTCT | 0 |
AGTAAACCAT | TAATTTAC | TGTTTCTTAA | 8 |
CATTCTAGTA | AAAAAGAA | TACCTAGCTT | 8 |
ATTTAAGACT | AGATTACA | CTGCTCTTTG | 8 |
GTGGAAATCA | | AAACTCCGCC | 0 |
GTGGAAATCA | | AAACTCCGCC | 0 |
ATGGAAATAA | | AGAATCCTGAT | 0 |
CCATTAGCCA | AAAAGTAC | TCACCCTCTAT | 8 |
CAATTACCCA | GCTTTTTA | TAGTATTATAA | 8 |
CAGAAATCGT | CTTTTAAT | AATGCCTAAAT | 8 |
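The direct repeats in the table above can be summarised with a short script. This is a minimal sketch: `DIRECT_REPEATS` holds only the rows with a reported 8-bp repeat (rows with a DR length of 0 are left out), and `at_fraction` is a hypothetical helper name introduced here for illustration.

```python
from collections import Counter

# Direct repeats copied from the insertion-site table, in table order.
# Rows reporting a DR length of 0 are omitted.
DIRECT_REPEATS = [
    "TGATTATA", "GCTTTTTA", "GTTAGTTA", "TAAAGTTT", "CAAATAAA",
    "CCTTTTTT", "CAAAAAAG", "ATAAAAAA", "AGTATATG", "CAAAAAGC",
    "TACAAAAA", "TAATTTAC", "AAAAAGAA", "AGATTACA", "AAAAGTAC",
    "GCTTTTTA", "CTTTTAAT",
]


def at_fraction(seqs):
    """Fraction of A/T bases across all sequences in seqs."""
    total = sum(len(s) for s in seqs)
    at = sum(s.count("A") + s.count("T") for s in seqs)
    return at / total


# Tally repeat lengths and the overall A+T content of the target sites.
length_counts = Counter(len(s) for s in DIRECT_REPEATS)
print(dict(length_counts))
print(round(at_fraction(DIRECT_REPEATS), 2))
```

Keeping the repeats as plain strings makes it easy to extend the tally, for example to per-position base counts.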
DNA sequence
GGCTCTATAATTTCTGTAGTGGGTAAGTCCACTGCAGGGGTTATAGGGCTTTATGAGTATCAAAAAAGTCCCATATGACCTATAATGAAAAGCGACTAAA
CTATCATTTAGAAAGACTCATATGGAACAATCACATGTTATCACACAACTACTGGGAATTAAAGACCCTCATATCACATTCTCCAAAGAAATACACGACA
TGAAAACTCATAAGGAATTGAAAGCTGTCCTTGACTACGATGCCCCACCTTGCCCTAACTGCCAAAGTCAGATGGGCAAGTACGACTTCCAACGGGAATC
CAAGGTCCCTTTTCTCGATTGCGCAGGCTACAAGACTTTGATTCGCCTCAAAAAGCGCCGTTTCAAATGTCAGTTTTGCGGAAAAATTACTGTCGCTGAG
ACTTCTCTAGTCCCTAAAAACCATCAAATACCAACCATCGTCAAACAGAAGGTAGCCCAGCTTCTCATTGAGAAAGTCTCCATGACCGCTATCGCTGATA
GACTATCCATCTCCACCTCAACCGTCATGCGAAAGCTCAACGAGTTCACGTTCAAGCCTCATTTGACTTATTTACCCGAACATATGTCCTGGGATGAATA
TGCCTTTAAGAAGGGCAAGATGAGCTTTATTGCTCAGGACTTTGACACCAACATCATCATCGCTATTTTGGATGGACGGACGAAAGCTGTCATTCGCAAC
CACTTCCTGAGATATCCTCGGCAGGTCAGAAACGACGTTAAACTCATCACCATGGATATGTTTACCCCCTATTACAACCTGGCTAAGATGCTTTTCCCAA
ACGCTCAGATTGTCCTTGATCGTTTCCACATTGTGCAACATTTGGGACGTGCCATGAACCGTATCCGTACTCAAATCATGAACTCTTTTGATCGAAAATC
CCACGAATACAAGGCTTTGAAACGTTACTGGAAGCTGATTCAACAAGATAGCAGCAAACTCAGTGACAAGCGTTTTTACCGCCCGACTTTTCGCATGCAT
TTGACAAACAAGGAGGTGGTCGAGCGTCTTTTGAGCTACTCTGACGAGCTTAGAAAGCATTATGACCTTTATCAGCTTCTGCTTTTTCACTTCCAGGAAA
AGCAAGCTGACCACTTTTTCGGCCTGATTGAGGAGCAGATAGACAGCAGCAATCCTCTCTTTCAGACCGTTTTCAAGACCTTTTTAAAAGACAGAGACAA
GATTGAGAACGCACTGGATTTGCCTTATTCTAACGCTAAACTGGAGGCTACCAATAATCTCATCAAAGTCATCAAGCGCAATGCTTTTGGATTTCGGAAC
TTTGAAAACTTTAAAAGACGGATTTTGATCGCTATCAATATGAAAAAAGAGAAGACCAAATTGGTCCTCTCCAGATGTTAGCTTGTCAATTACCCACTAC
AGTTGACATAGAGCC
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1260 | 419 | 122 | 1381 | + | No |
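The ORF coordinates and lengths in this table are mutually consistent, which can be confirmed with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the reported span includes the terminal stop codon. `orf_summary` is a hypothetical helper, not an ISfinder function.

```python
def orf_summary(begin: int, end: int, has_stop: bool = True):
    """Return (length in bp, length in aa) for 1-based inclusive coordinates.

    Assumption: when has_stop is True, the last codon of the span is a
    stop codon and is not counted as an amino acid.
    """
    length_bp = end - begin + 1
    if length_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    codons = length_bp // 3
    length_aa = codons - 1 if has_stop else codons
    return length_bp, length_aa


# ORF 1 of this record: begin 122, end 1381.
ORF1 = orf_summary(122, 1381)
print(ORF1)  # (1260, 419)
```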
Chemistry : Unknown
ORF sequence :
MEQSHVITQLLGIKDPHITFSKEIHDMKTHKELKAVLDYDAPPCPNCQSQMGKYDFQRESKVPFLDCAGYKTLIRLKKRRFKCQFCGKITVAETSLVPKN
HQIPTIVKQKVAQLLIEKVSMTAIADRLSISTSTVMRKLNEFTFKPHLTYLPEHMSWDEYAFKKGKMSFIAQDFDTNIIIAILDGRTKAVIRNHFLRYPR
QVRNDVKLITMDMFTPYYNLAKMLFPNAQIVLDRFHIVQHLGRAMNRIRTQIMNSFDRKSHEYKALKRYWKLIQQDSSKLSDKRFYRPTFRMHLTNKEVV
ERLLSYSDELRKHYDLYQLLLFHFQEKQADHFFGLIEEQIDSSNPLFQTVFKTFLKDRDKIENALDLPYSNAKLEATNNLIKVIKRNAFGFRNFENFKRR
ILIAINMKKEKTKLVLSRC
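The start of the ORF sequence can be reproduced from the DNA sequence by translating from position 122. The sketch below is a minimal illustration that assumes the standard codon assignments. It uses only the first 150 nt of the element, copied from this record, and `translate` and `SEQ_START` are illustrative names.

```python
# First 150 nt of the ISSeq1 DNA sequence, copied from this record.
SEQ_START = (
    "GGCTCTATAATTTCTGTAGTGGGTAAGTCCACTGCAGGGGTTATAGGGCT"
    "TTATGAGTATCAAAAAAGTCCCATATGACCTATAATGAAAAGCGACTAAA"
    "CTATCATTTAGAAAGACTCATATGGAACAATCACATGTTATCACACAACT"
)

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: _AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(_BASES)
    for j, b2 in enumerate(_BASES)
    for k, b3 in enumerate(_BASES)
}


def translate(dna: str, begin: int, n_codons: int) -> str:
    """Translate n_codons codons starting at 1-based position begin."""
    start = begin - 1  # convert to 0-based index
    codons = (dna[start + 3 * i : start + 3 * i + 3] for i in range(n_codons))
    return "".join(CODON_TABLE[c] for c in codons)


# ORF 1 begins at position 122; translate its first nine codons.
n_term = translate(SEQ_START, 122, 9)
print(n_term)  # MEQSHVITQ
```

The nine residues match the first nine residues of the ORF sequence listed above.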
Comments
ISSeq1 shares 86% amino acid (aa) similarity with ISSth1.
References
[1] ISfinder submission (2012)
[2] Holden,M.T., Heather,Z., Paillot,R., Steward,K.F., Webb,K., Ainslie,F., Jourdan,T., Bason,N.C., Holroyd,N.E., Mungall,K., Quail,M.A., Sanders,M., Simmonds,M., Willey,D., Brooks,K., Aanensen,D.M., Spratt,B.G., Jolley,K.A., Maiden,M.C., Kehoe,M., Chanter,N., Bentley,S.D., Robinson,C., Maskell,D.J., Parkhill,J. (2009) PLoS Pathog. 5 (3), e1000346.