ISSgsp1
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
HE578057 | ND | Shigella sp. | Citrobacter freundii Shigella sp. MO17 plasmid pMO17_54 Enterobacter hormaechei subsp. hormaechei |
DNA section
IS Length : 2438 bp
Ends
IR Length : 21/22
IRL : GTAAGCGTCCAGCGAGAGCCGTCTGGTCATAATCTGATGTATTCGACAAA
IRR : GTAAGCGTCCAGCGAGAACCGTATTGACGGGGATGCGTTATTCAATTGGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATGGGCGTTG | GCCTCAAC | ACGATTTTAC | 8 |
TATACATCTG | GATGAAAA | GGGTATCAGC | 8 |
CCAGTACCAG | GGGGGATC | AGGGCTTGTT | 8 |
DNA sequence
GTAAGCGTCCAGCGAGAGCCGTCTGGTCATAATCTGATGTATTCGACAAAGTGGTGTACACCAAATAGTGGTGGGAACCAAAGTGTCAGATATGCAGAAA
AATGTGACTTCCGGCAGGCGTAAAGGCTGCCCTAATTATTCTCCTGAGTTTAAGCAGCAGCTCGTTGCCGCCTCCTGCGAGCCCGGGATATCCATCTCAA
AACTGGCGCTCGAAAATGGCATTAACGCCAATCTGCTATTTAAATGGCGCCAGCAGTGGCGCGAGGGAAAGCTGCTATTACCTTCACCTGAGAGTCCTCA
TTTACTCCCTGTGACTCTCGATGCCACCGCCGTACAGCCAGAACCGCCCGCAGAGGACCCAGAGCTCAGTATCAGCTGTGAGGTAACGTTCCGGCACGGG
ACGCTTCGCCTCAACGGTACTGTCAGCGAAAAGCTCCTGACTCTGCTGATACAGGAACTGAAGCGATGATCCCGTTACCTTCCGGGACCAAAATCTGGCT
GGTTGCCGGCATTACCGACATGCGAAACGGCTTCAATGGTCTGGCCGCAAAGGTGCAGATGGCGCTGAAAGACGATCCGATGTCCGGCCACGTCTTCATC
TTCCGGGGCCGCAGCGGCAGCCAGGTAAAACTGCTGTGGTCTACCGGAGACGGGCTGTGCCTGCTGACCAAAAGACTGGAGCGTGGGCGCTTCGCCTGGC
CGTCAGCCCGTGATGGCAAAGTGTTCCTGACACCGGCACAGCTGGCAATGCTGCTGGAAGGCATCGACTGGCGACAGCCGAAACGGTTGATGACCTCCCT
GACTATGCTGTAGGTCTCTTTATCCTGGTTGTCGCAGAATAAGCCTGGTAAAATGCGGGCTTATGAACGACACCTCTTCTGACGACATCCTTCTGCTGAA
ACAGTGCCTGGCTGAACAGGAAGCGTTGATCCACGCCCTGCAGGAAAAGCTGGCCGACCGGGAGCGCGAAATAGACCATCTGCAGGCGCAACTGGATAAG
CTTCGCCGGATGAACTTCGGTAGCCGTTCCGAAAAGATCTCCCGCCGTATCGCACAGATGGAAGCCGACCTGAACCGGCTTCAGAAAGAGAGCGATACGC
TGACCGGTCGGGTGGATGACCCGGCCGTGCAACGCCCGTTGCGCCAGACCCGTACCCGCAAACCGTTCCCCGAATCACTCCCCCGTGACGAAAAGCGACT
GCTGCCAGCAGAGCCGTGTTGCCCGGACTGTGGCGGTGCGTTGAGCTACCTGGGTGAAGATACCGCTGAACAACTGGAGCTGATGCGCAGCGCCTTCCGG
GTTATCCGGACCGTGCGGGAAAAACATGCCTGTACTCAGTGCGATGCCATCGTGCAGGCCCCCGCGCCTTCACGTCCCATCGAGCGGGGTATCGCCGGAC
CGGGGCTGCTGGCCCGCGTGCTGAGTTCAAAGTATGCAGAACATACCCCGCTGTACCGGCAGTCAGAAATATACAGCCGCCAGGGCGTGGAGCTGAGCCG
CTCACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTGCTGTCACCGCTGGAAGAGGCGCTTCAGGGCTATGTCCTGACCGACGGTAAACTCCATGCC
GATGACACCCCGGTCCAGGTGCTGCTGCCGGGCAATAAGAAGACGAAGACCGGGCGGTTGTGGACGTATGTTCGTGATGACCGCAACGCTGGATCAGCAG
TCGCGCCGGCGGTGTGGTTCGCTTACAGCCCGGACAGAAAAGGCATCCACCCTCAGACCCATCTTGCAGGGTTCAGTGGTGTGCTGCAGGCGGATGCGTA
CGCCGGGTTCAACGAGCTGTACCGCGATGGCCAGATAACGGAAGCCGCCTGCTGGGCTCACGCCCGCCGTAAAATCCACGATGTGCACGTCCGCACACCG
TCGGCGCTGACGGATGAAGCCCTGAAGCGTATCGGCGAGCTGTATGCCGTTGAAGCGGAGATAAGGGGAATGCCGGCGGAGCAACGCCTTGCTGAACGTC
AGCAAAAAGCTAAACCACAGTTGAAAGCCCTGGAAAGCTGGCTGCGTGAAAAGATGAAAACGCTGTCGCGACACTCGGAGCTGGCGAAGGCGTTCACGTA
CGCCCTGAACCAGTGGCCGGCACTGACGTACTATGCGGAAGATGGCTGGGCCGAAGCCGATAACAACATCGCTGAAAATGCGCTACGGCTGGTCAGTCTG
GGGCGTAAAAACTGGCTGTTCTTCGGCTCTGACCACGGTGGTGAGCGAGGAGCGCTGCTGTACAGCCTGATCGGGACGTGCAAACTGAATGGCGTGGATC
CAGAAAGCTACCTCCGTCACGTCCTTAACGTCATAGCGGACTGGCCAGTCAACCGGGTCAGCGAACTGCTCCCCTGGCGCGTAGCACTGCCAATTGAATA
ACGCATCCCCGTCAATACGGTTCTCGCTGGACGCTTAC
AATGTGACTTCCGGCAGGCGTAAAGGCTGCCCTAATTATTCTCCTGAGTTTAAGCAGCAGCTCGTTGCCGCCTCCTGCGAGCCCGGGATATCCATCTCAA
AACTGGCGCTCGAAAATGGCATTAACGCCAATCTGCTATTTAAATGGCGCCAGCAGTGGCGCGAGGGAAAGCTGCTATTACCTTCACCTGAGAGTCCTCA
TTTACTCCCTGTGACTCTCGATGCCACCGCCGTACAGCCAGAACCGCCCGCAGAGGACCCAGAGCTCAGTATCAGCTGTGAGGTAACGTTCCGGCACGGG
ACGCTTCGCCTCAACGGTACTGTCAGCGAAAAGCTCCTGACTCTGCTGATACAGGAACTGAAGCGATGATCCCGTTACCTTCCGGGACCAAAATCTGGCT
GGTTGCCGGCATTACCGACATGCGAAACGGCTTCAATGGTCTGGCCGCAAAGGTGCAGATGGCGCTGAAAGACGATCCGATGTCCGGCCACGTCTTCATC
TTCCGGGGCCGCAGCGGCAGCCAGGTAAAACTGCTGTGGTCTACCGGAGACGGGCTGTGCCTGCTGACCAAAAGACTGGAGCGTGGGCGCTTCGCCTGGC
CGTCAGCCCGTGATGGCAAAGTGTTCCTGACACCGGCACAGCTGGCAATGCTGCTGGAAGGCATCGACTGGCGACAGCCGAAACGGTTGATGACCTCCCT
GACTATGCTGTAGGTCTCTTTATCCTGGTTGTCGCAGAATAAGCCTGGTAAAATGCGGGCTTATGAACGACACCTCTTCTGACGACATCCTTCTGCTGAA
ACAGTGCCTGGCTGAACAGGAAGCGTTGATCCACGCCCTGCAGGAAAAGCTGGCCGACCGGGAGCGCGAAATAGACCATCTGCAGGCGCAACTGGATAAG
CTTCGCCGGATGAACTTCGGTAGCCGTTCCGAAAAGATCTCCCGCCGTATCGCACAGATGGAAGCCGACCTGAACCGGCTTCAGAAAGAGAGCGATACGC
TGACCGGTCGGGTGGATGACCCGGCCGTGCAACGCCCGTTGCGCCAGACCCGTACCCGCAAACCGTTCCCCGAATCACTCCCCCGTGACGAAAAGCGACT
GCTGCCAGCAGAGCCGTGTTGCCCGGACTGTGGCGGTGCGTTGAGCTACCTGGGTGAAGATACCGCTGAACAACTGGAGCTGATGCGCAGCGCCTTCCGG
GTTATCCGGACCGTGCGGGAAAAACATGCCTGTACTCAGTGCGATGCCATCGTGCAGGCCCCCGCGCCTTCACGTCCCATCGAGCGGGGTATCGCCGGAC
CGGGGCTGCTGGCCCGCGTGCTGAGTTCAAAGTATGCAGAACATACCCCGCTGTACCGGCAGTCAGAAATATACAGCCGCCAGGGCGTGGAGCTGAGCCG
CTCACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTGCTGTCACCGCTGGAAGAGGCGCTTCAGGGCTATGTCCTGACCGACGGTAAACTCCATGCC
GATGACACCCCGGTCCAGGTGCTGCTGCCGGGCAATAAGAAGACGAAGACCGGGCGGTTGTGGACGTATGTTCGTGATGACCGCAACGCTGGATCAGCAG
TCGCGCCGGCGGTGTGGTTCGCTTACAGCCCGGACAGAAAAGGCATCCACCCTCAGACCCATCTTGCAGGGTTCAGTGGTGTGCTGCAGGCGGATGCGTA
CGCCGGGTTCAACGAGCTGTACCGCGATGGCCAGATAACGGAAGCCGCCTGCTGGGCTCACGCCCGCCGTAAAATCCACGATGTGCACGTCCGCACACCG
TCGGCGCTGACGGATGAAGCCCTGAAGCGTATCGGCGAGCTGTATGCCGTTGAAGCGGAGATAAGGGGAATGCCGGCGGAGCAACGCCTTGCTGAACGTC
AGCAAAAAGCTAAACCACAGTTGAAAGCCCTGGAAAGCTGGCTGCGTGAAAAGATGAAAACGCTGTCGCGACACTCGGAGCTGGCGAAGGCGTTCACGTA
CGCCCTGAACCAGTGGCCGGCACTGACGTACTATGCGGAAGATGGCTGGGCCGAAGCCGATAACAACATCGCTGAAAATGCGCTACGGCTGGTCAGTCTG
GGGCGTAAAAACTGGCTGTTCTTCGGCTCTGACCACGGTGGTGAGCGAGGAGCGCTGCTGTACAGCCTGATCGGGACGTGCAAACTGAATGGCGTGGATC
CAGAAAGCTACCTCCGTCACGTCCTTAACGTCATAGCGGACTGGCCAGTCAACCGGGTCAGCGAACTGCTCCCCTGGCGCGTAGCACTGCCAATTGAATA
ACGCATCCCCGTCAATACGGTTCTCGCTGGACGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
378 bp | 125 aa | 92 | 469 | + | No |
AG : IS66 TnpA
ORF sequence :
MQKNVTSGRRKGCPNYSPEFKQQLVAASCEPGISISKLALENGINANLLFKWRQQWREGKLLLPSPESPHLLPVTLDATAVQPEPPAEDPELSISCEVTF
RHGTLRLNGTVSEKLLTLLIQELKR
RHGTLRLNGTVSEKLLTLLIQELKR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
294 bp | 97 aa | 520 | 813 | + | No |
AG : IS66 TnpB
ORF sequence :
MRNGFNGLAAKVQMALKDDPMSGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTPAQLAMLLEGIDWRQPKRLMTSLTML
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1539 bp | 512 aa | 863 | 2401 | + | No |
Chemistry : DDE
ORF sequence :
MNDTSSDDILLLKQCLAEQEALIHALQEKLADREREIDHLQAQLDKLRRMNFGSRSEKISRRIAQMEADLNRLQKESDTLTGRVDDPAVQRPLRQTRTRK
PFPESLPRDEKRLLPAEPCCPDCGGALSYLGEDTAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLSSKYAEHTPLYRQ
SEIYSRQGVELSRSLLSGWVDACCRLLSPLEEALQGYVLTDGKLHADDTPVQVLLPGNKKTKTGRLWTYVRDDRNAGSAVAPAVWFAYSPDRKGIHPQTH
LAGFSGVLQADAYAGFNELYRDGQITEAACWAHARRKIHDVHVRTPSALTDEALKRIGELYAVEAEIRGMPAEQRLAERQQKAKPQLKALESWLREKMKT
LSRHSELAKAFTYALNQWPALTYYAEDGWAEADNNIAENALRLVSLGRKNWLFFGSDHGGERGALLYSLIGTCKLNGVDPESYLRHVLNVIADWPVNRVS
ELLPWRVALPIE
PFPESLPRDEKRLLPAEPCCPDCGGALSYLGEDTAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLSSKYAEHTPLYRQ
SEIYSRQGVELSRSLLSGWVDACCRLLSPLEEALQGYVLTDGKLHADDTPVQVLLPGNKKTKTGRLWTYVRDDRNAGSAVAPAVWFAYSPDRKGIHPQTH
LAGFSGVLQADAYAGFNELYRDGQITEAACWAHARRKIHDVHVRTPSALTDEALKRIGELYAVEAEIRGMPAEQRLAERQQKAKPQLKALESWLREKMKT
LSRHSELAKAFTYALNQWPALTYYAEDGWAEADNNIAENALRLVSLGRKNWLFFGSDHGGERGALLYSLIGTCKLNGVDPESYLRHVLNVIADWPVNRVS
ELLPWRVALPIE
Blast result :
Comments
ISSgsp1 is 91% (orfA), 98% (orfB) and 96% (orfC) aa similar to ISEc8.
References
1] ISfinder (2017) Direct submission
2] Norman A. (2011) Direct submission to GenBank
2] Norman A. (2011) Direct submission to GenBank