ISCep1
- Family ISKra4
- Group ISAzba1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_019753 | ND | Crinalium epipsammum | Crinalium epipsammum PCC 9333 |
DNA section
IS Length : 2958 bp
Ends
IR Length : 27/28
IRL : GGAGAGCGATTTTTTTTGGGGGAGATGGAGTACATAACAAGGTAAGATGA
IRR : GGAGAGCGATTTTTTTCGGGGGAGATGGGCAAGAATTAAGCAAGAGGTTT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AAAGCTGCTAAAGTCACT | ATCTCCATCACTAC | 0 |
DNA sequence
GGAGAGCGATTTTTTTTGGGGGAGATGGAGTACATAACAAGGTAAGATGACGAAATGTCCGATTCCCCTGAACCTGTCAACTATCAACTTAAGGTGGTAC
TGCTGGGTATCAGTCCGATGATCTGGCGTCGCATTCTAGTACTGAGCGACAGCACAATAGAAGATTTGCACTACACGCTACAAATATCAATGGGGTGGGA
AGACATTCACCTGCATCATTTCATCATTCATGGGAAGCAATACGGAGTAACTCAACCAGGTGGCACAACATTTAGTACTCGCGCCTGCGAGGTAAAGCTC
TCCACATTCGGGTGGCAGCTCAAAGAAAAATTCTTATACGAATATGACTTCAGCATCTGCCCATCAATGGGTACTTGGCGTTATTGGTGGCGACACCAAA
TCCGACTCGAAGCCATCTCAGCAGCACAAGCAAAGCAAACTTATCCGGTCTGCACGGGTGGTAAAGGTGTTTGTCCTCCAGAAGACTGTGGTGGAGCTTG
GGGATTTATGGAGACAAGGCAAGAATATTCAGGTTGGCATATATTAGAACGCTTTGGCTCAATGCTAGAAGAAGAAGACTGGACAGAATATAGAGAAGAA
TTATGCCAACTACAAACCTGGCTGTTAGTTGATCAAAATCATTTCGACCGCGCTCAAGTAAACAGCCGTTTACAGCAATATGCCAGCCGGGATAAGGAAC
TAATGTTATATGAGCAGGGGATAGGGTGAAACTTAAAATACAAGTTGTAATCGAATCAGATAATGGCGACAGAGAAATTATCCAAGAAATTACCCAAATA
GAGCGTGGTATTCTACAACCTGAAAACTTAGGATTAAGCCTAGTAGAAGCAAAAACTCTGCTCAACAAAGTCCAACGCACTTTAGTTCAGCAACAAGTTA
CTGAGTATGAAAAGCAACACGATTCATGCCAGCACTGTGAAAATAAACTCTTGCGTAAAGACAAACGGATAGTTGTATACAGAACCCTTTTTGGTAAGTT
AAAGCTACAATGCAATCGCCTATTTCAATGTACCTGCCAAGAGCAGTCAACCCGTACCTTTAAACCACTAGGTAAGTTACTGAAAGAACGAATCTCTCCA
GAATTGCTCTATTTAGAATCCAAGTTTGTATCTTTGATGTCTTACGGGCTTTCCGTTAAATTATTGGAGGAATTATTACCAATAGAAGGCGAAATAAGCG
TTACCACAGTTCGGAATAAGCTCCACGTCTGTGCCCAACGTTTAGAATCCGAATTGGGTGAAGAGAAAAGAGTATACATCGAAGGCACCCAAAGGGATTG
GGATAATTTGCCCAGACCGGAGCTACCGCTGGTAGTAGGAGTAGATGGAGGATATGTTCGTTTCTATGACAAGAAATCATCAACCAAAGGTAATTTTGAA
ATTATCGTTGGTAAGAGCATAAAATCAGACAATACCTCCAAGCGGTTTGGAGGAGTTTATTCCTACGATACCAAACCTCAGCGTCGGATCTTCGAGGTCT
TGAAATCCCAAGGGATGCAGATGAACCAACAGGTAACATTCCTGTCAGATGGTGATGAGAAGCTCCGTGACCTAATGTTTGGTCTCAACCCAAACACAGA
ATATCTCTTAGATTGGTTTCACATCACCATGCGGCTGACAGTGATAAATCAGATGGCAAAAGGAATTAATAAAAAAGATACCGAACTAAATACAGACATA
CCCAAAGATCTGGAACGGATTAAATGGTATTTGTGGCATGGAAACGTATTTAGAGCATTGCAACTATTAAAAGACTTTGTTGATGACCTAGAAGTTGAGG
TTTTTAATGGCAATGCTCAACGAGAGGTAAAAAAGCTGTTGAAAATGGTGCAGGAATTTGAAACATACATTTTTAACAATGGAGCTTGTATTCCTAACTA
TGGAGAACGTTGGCACAATGACGAGGCTATTGCAACTGGATTTGTCGAATCAACAGTTAATCAAGTTATTAGCAAACGATTTGTGAAAAAACAACAGATG
CGGTGGACACCCAAAGGTGCTCATCTTCTTCTCCAGATGCGAATGCTAGTTTTAAATGGAGAGTTGCGTCGCCAATTCGAGCAATGGTATTCTGGTTTAA
GGCTAGATAATGAGCCAAATCTACAGATACCTAATTCCGTTTAAATATACAGAAACAAAGTAATTCCTGTAAAAATCCCACAGTGTTTTTAGATCTACTG
GTTTACCAACTTTCGTGTATAAGTTGGTAAAATTAGCTCATATACCCCAAGTTTTTCAGGGGGTTGAGTTATGACAGCACAAACCCTCCAACTACCGAAA
GTTGGCAAAACAGAAAAACAAGAACGTCAGTCGCCTAACTCTCATAAGTATTTCGAGGTCAGGACTAGGGAGTATTTATTACCAGAAGAAGTCTCTGCGA
TCCGGTTAGCCATCAAAAAATCTAAGGGTCGCCACGCTCACCGAGACTCAACTCTAATTTTGCTTTGCTATCGTCACGGATTGCGCGTGGCAGAGGTGGC
ATCTTTGCGGTGGGAGCAAATAGATTGGAGTGGTGGCACAATTTACGTGAAACGAGTTAAAAAAGGGACACCCTGGGTTCAACCACTTTCTGGTCTGGAG
ATTCGCTCTCTCCGTCAGTTGCTTAGAAATTATCCTGCTAGTCCCTATATTTTCCAGTCGTCTCGACTTGGCCCGTTGGCACATGATACGATCTCCGGCA
TTGTTGAGCGGGCTGGGGAATTAGCTGGTTTGCCTTTTCCTATTCATGCTCATATGCTGCGGCATGGAACAGGTTATTATCTGGCGAATCGAGGTATTGA
TACCCGAACAATTCAGAGTTATTTGGGGCATAAAAATATTCAGCATACTGTTCATTACACCGAACTTGCATCTACCAAATTTCAAGGGCTTTGGGATGAT
TAATGTTTAAACCTCTTGCTTAATTCTTGCCCATCTCCCCCGAAAAAAATCGCTCTCC
TGCTGGGTATCAGTCCGATGATCTGGCGTCGCATTCTAGTACTGAGCGACAGCACAATAGAAGATTTGCACTACACGCTACAAATATCAATGGGGTGGGA
AGACATTCACCTGCATCATTTCATCATTCATGGGAAGCAATACGGAGTAACTCAACCAGGTGGCACAACATTTAGTACTCGCGCCTGCGAGGTAAAGCTC
TCCACATTCGGGTGGCAGCTCAAAGAAAAATTCTTATACGAATATGACTTCAGCATCTGCCCATCAATGGGTACTTGGCGTTATTGGTGGCGACACCAAA
TCCGACTCGAAGCCATCTCAGCAGCACAAGCAAAGCAAACTTATCCGGTCTGCACGGGTGGTAAAGGTGTTTGTCCTCCAGAAGACTGTGGTGGAGCTTG
GGGATTTATGGAGACAAGGCAAGAATATTCAGGTTGGCATATATTAGAACGCTTTGGCTCAATGCTAGAAGAAGAAGACTGGACAGAATATAGAGAAGAA
TTATGCCAACTACAAACCTGGCTGTTAGTTGATCAAAATCATTTCGACCGCGCTCAAGTAAACAGCCGTTTACAGCAATATGCCAGCCGGGATAAGGAAC
TAATGTTATATGAGCAGGGGATAGGGTGAAACTTAAAATACAAGTTGTAATCGAATCAGATAATGGCGACAGAGAAATTATCCAAGAAATTACCCAAATA
GAGCGTGGTATTCTACAACCTGAAAACTTAGGATTAAGCCTAGTAGAAGCAAAAACTCTGCTCAACAAAGTCCAACGCACTTTAGTTCAGCAACAAGTTA
CTGAGTATGAAAAGCAACACGATTCATGCCAGCACTGTGAAAATAAACTCTTGCGTAAAGACAAACGGATAGTTGTATACAGAACCCTTTTTGGTAAGTT
AAAGCTACAATGCAATCGCCTATTTCAATGTACCTGCCAAGAGCAGTCAACCCGTACCTTTAAACCACTAGGTAAGTTACTGAAAGAACGAATCTCTCCA
GAATTGCTCTATTTAGAATCCAAGTTTGTATCTTTGATGTCTTACGGGCTTTCCGTTAAATTATTGGAGGAATTATTACCAATAGAAGGCGAAATAAGCG
TTACCACAGTTCGGAATAAGCTCCACGTCTGTGCCCAACGTTTAGAATCCGAATTGGGTGAAGAGAAAAGAGTATACATCGAAGGCACCCAAAGGGATTG
GGATAATTTGCCCAGACCGGAGCTACCGCTGGTAGTAGGAGTAGATGGAGGATATGTTCGTTTCTATGACAAGAAATCATCAACCAAAGGTAATTTTGAA
ATTATCGTTGGTAAGAGCATAAAATCAGACAATACCTCCAAGCGGTTTGGAGGAGTTTATTCCTACGATACCAAACCTCAGCGTCGGATCTTCGAGGTCT
TGAAATCCCAAGGGATGCAGATGAACCAACAGGTAACATTCCTGTCAGATGGTGATGAGAAGCTCCGTGACCTAATGTTTGGTCTCAACCCAAACACAGA
ATATCTCTTAGATTGGTTTCACATCACCATGCGGCTGACAGTGATAAATCAGATGGCAAAAGGAATTAATAAAAAAGATACCGAACTAAATACAGACATA
CCCAAAGATCTGGAACGGATTAAATGGTATTTGTGGCATGGAAACGTATTTAGAGCATTGCAACTATTAAAAGACTTTGTTGATGACCTAGAAGTTGAGG
TTTTTAATGGCAATGCTCAACGAGAGGTAAAAAAGCTGTTGAAAATGGTGCAGGAATTTGAAACATACATTTTTAACAATGGAGCTTGTATTCCTAACTA
TGGAGAACGTTGGCACAATGACGAGGCTATTGCAACTGGATTTGTCGAATCAACAGTTAATCAAGTTATTAGCAAACGATTTGTGAAAAAACAACAGATG
CGGTGGACACCCAAAGGTGCTCATCTTCTTCTCCAGATGCGAATGCTAGTTTTAAATGGAGAGTTGCGTCGCCAATTCGAGCAATGGTATTCTGGTTTAA
GGCTAGATAATGAGCCAAATCTACAGATACCTAATTCCGTTTAAATATACAGAAACAAAGTAATTCCTGTAAAAATCCCACAGTGTTTTTAGATCTACTG
GTTTACCAACTTTCGTGTATAAGTTGGTAAAATTAGCTCATATACCCCAAGTTTTTCAGGGGGTTGAGTTATGACAGCACAAACCCTCCAACTACCGAAA
GTTGGCAAAACAGAAAAACAAGAACGTCAGTCGCCTAACTCTCATAAGTATTTCGAGGTCAGGACTAGGGAGTATTTATTACCAGAAGAAGTCTCTGCGA
TCCGGTTAGCCATCAAAAAATCTAAGGGTCGCCACGCTCACCGAGACTCAACTCTAATTTTGCTTTGCTATCGTCACGGATTGCGCGTGGCAGAGGTGGC
ATCTTTGCGGTGGGAGCAAATAGATTGGAGTGGTGGCACAATTTACGTGAAACGAGTTAAAAAAGGGACACCCTGGGTTCAACCACTTTCTGGTCTGGAG
ATTCGCTCTCTCCGTCAGTTGCTTAGAAATTATCCTGCTAGTCCCTATATTTTCCAGTCGTCTCGACTTGGCCCGTTGGCACATGATACGATCTCCGGCA
TTGTTGAGCGGGCTGGGGAATTAGCTGGTTTGCCTTTTCCTATTCATGCTCATATGCTGCGGCATGGAACAGGTTATTATCTGGCGAATCGAGGTATTGA
TACCCGAACAATTCAGAGTTATTTGGGGCATAAAAATATTCAGCATACTGTTCATTACACCGAACTTGCATCTACCAAATTTCAAGGGCTTTGGGATGAT
TAATGTTTAAACCTCTTGCTTAATTCTTGCCCATCTCCCCCGAAAAAAATCGCTCTCC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
675 bp | 224 aa | 55 | 729 | + | No |
Annotation : plasmid pRIA4b ORF-3 family proteinDescription :
ORF sequence :
MSDSPEPVNYQLKVVLLGISPMIWRRILVLSDSTIEDLHYTLQISMGWEDIHLHHFIIHGKQYGVTQPGGTTFSTRACEVKLSTFGWQLKEKFLYEYDFS
ICPSMGTWRYWWRHQIRLEAISAAQAKQTYPVCTGGKGVCPPEDCGGAWGFMETRQEYSGWHILERFGSMLEEEDWTEYREELCQLQTWLLVDQNHFDRA
QVNSRLQQYASRDKELMLYEQGIG
ICPSMGTWRYWWRHQIRLEAISAAQAKQTYPVCTGGKGVCPPEDCGGAWGFMETRQEYSGWHILERFGSMLEEEDWTEYREELCQLQTWLLVDQNHFDRA
QVNSRLQQYASRDKELMLYEQGIG
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1419 bp | 472 aa | 726 | 2144 | + | No |
Chemistry : DDE
ORF sequence :
MKLKIQVVIESDNGDREIIQEITQIERGILQPENLGLSLVEAKTLLNKVQRTLVQQQVTEYEKQHDSCQHCENKLLRKDKRIVVYRTLFGKLKLQCNRLF
QCTCQEQSTRTFKPLGKLLKERISPELLYLESKFVSLMSYGLSVKLLEELLPIEGEISVTTVRNKLHVCAQRLESELGEEKRVYIEGTQRDWDNLPRPEL
PLVVGVDGGYVRFYDKKSSTKGNFEIIVGKSIKSDNTSKRFGGVYSYDTKPQRRIFEVLKSQGMQMNQQVTFLSDGDEKLRDLMFGLNPNTEYLLDWFHI
TMRLTVINQMAKGINKKDTELNTDIPKDLERIKWYLWHGNVFRALQLLKDFVDDLEVEVFNGNAQREVKKLLKMVQEFETYIFNNGACIPNYGERWHNDE
AIATGFVESTVNQVISKRFVKKQQMRWTPKGAHLLLQMRMLVLNGELRRQFEQWYSGLRLDNEPNLQIPNSV
QCTCQEQSTRTFKPLGKLLKERISPELLYLESKFVSLMSYGLSVKLLEELLPIEGEISVTTVRNKLHVCAQRLESELGEEKRVYIEGTQRDWDNLPRPEL
PLVVGVDGGYVRFYDKKSSTKGNFEIIVGKSIKSDNTSKRFGGVYSYDTKPQRRIFEVLKSQGMQMNQQVTFLSDGDEKLRDLMFGLNPNTEYLLDWFHI
TMRLTVINQMAKGINKKDTELNTDIPKDLERIKWYLWHGNVFRALQLLKDFVDDLEVEVFNGNAQREVKKLLKMVQEFETYIFNNGACIPNYGERWHNDE
AIATGFVESTVNQVISKRFVKKQQMRWTPKGAHLLLQMRMLVLNGELRRQFEQWYSGLRLDNEPNLQIPNSV
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
633 bp | 210 aa | 2271 | 2903 | + | No |
Annotation : IntegraseDescription :
ORF sequence :
MTAQTLQLPKVGKTEKQERQSPNSHKYFEVRTREYLLPEEVSAIRLAIKKSKGRHAHRDSTLILLCYRHGLRVAEVASLRWEQIDWSGGTIYVKRVKKGT
PWVQPLSGLEIRSLRQLLRNYPASPYIFQSSRLGPLAHDTISGIVERAGELAGLPFPIHAHMLRHGTGYYLANRGIDTRTIQSYLGHKNIQHTVHYTELA
STKFQGLWDD
PWVQPLSGLEIRSLRQLLRNYPASPYIFQSSRLGPLAHDTISGIVERAGELAGLPFPIHAHMLRHGTGYYLANRGIDTRTIQSYLGHKNIQHTVHYTELA
STKFQGLWDD
Blast result :
Comments
ISCep1 is 65% aa (ORFB : the transposase) similar to ISAzs26.
ORFA and ORFC are passenger gene respectively annotated as Plasmid pRiA4b ORF-3-like protein and integrase family protein.
ORFA and ORFC are passenger gene respectively annotated as Plasmid pRiA4b ORF-3-like protein and integrase family protein.
References
1] ISfinder annotation (2013)
2] Gugger,M., Coursin,T., Rippka,R., Tandeau De Marsac,N., Huntemann,M., Wei,C.-L., Han,J., Detter,J.C., Han,C., Tapia,R., Davenport,K., Daligault,H., Erkkila,T., Gu,W., Munk,A.C.C., Teshima,H., Xu,Y., Chain,P., Chen,A., Krypides,N., Mavromatis,K., Markowitz,V., Szeto,E., Ivanova,N., Mikhailova,N., Ovchinnikova,G., Pagani,I., Pati,A., Goodwin,L., Peters,L., Pitluck,S., Woyke,T. and Kerfeld,C. (2012) Direct submission GenBank.
2] Gugger,M., Coursin,T., Rippka,R., Tandeau De Marsac,N., Huntemann,M., Wei,C.-L., Han,J., Detter,J.C., Han,C., Tapia,R., Davenport,K., Daligault,H., Erkkila,T., Gu,W., Munk,A.C.C., Teshima,H., Xu,Y., Chain,P., Chen,A., Krypides,N., Mavromatis,K., Markowitz,V., Szeto,E., Ivanova,N., Mikhailova,N., Ovchinnikova,G., Pagani,I., Pati,A., Goodwin,L., Peters,L., Pitluck,S., Woyke,T. and Kerfeld,C. (2012) Direct submission GenBank.