IS891
- Family IS200/IS605
- Group IS1341
Isoform Synonym(s) IS891N
Accession number | Transposition | Origin | Host |
---|---|---|---|
M24855 | Y | Anabaena sp. | Nostoc ellipsosporum CPB 311; Anabaena sp. M-131; Anabaena sp. PCC 7120 |
DNA section
IS Length : 1352 bp
Ends
Left end : GAGCCGTGAAGCTAAAGCCCCGTATTTTTAATCGGGGGATATAAGCGAATGACCGAATTTATTCGTCGTAACATGGTATAATTACGTCAGAGAGTTTGAC II struct. : Yes
Right end : TTGGGTGAGGTAACTCCTCCAAATAAGTCGAGTCGTGGAAAGAGAAAGCCCAAGAAGTGATTCTTGGAATCCCCGTTTTCTAAACGGGGGAGGATGTCAA II struct. : Yes
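The `II struct. : Yes` flags presumably mean that each end can fold into a secondary structure; that reading of the abbreviation is an assumption. As a rough illustration, the sketch below scans both 100 bp ends for a short perfect inverted repeat, meaning a stem whose second arm is the reverse complement of the first. The function names and the stem and loop limits are hypothetical choices made for this example, not parameters used by the database.

```python
# End sequences copied from the record above.
LEFT_END = "GAGCCGTGAAGCTAAAGCCCCGTATTTTTAATCGGGGGATATAAGCGAATGACCGAATTTATTCGTCGTAACATGGTATAATTACGTCAGAGAGTTTGAC"
RIGHT_END = "TTGGGTGAGGTAACTCCTCCAAATAAGTCGAGTCGTGGAAAGAGAAAGCCCAAGAAGTGATTCTTGGAATCCCCGTTTTCTAAACGGGGGAGGATGTCAA"

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def revcomp(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def find_hairpins(seq, stem=5, min_loop=3, max_loop=12):
    """List (start, stem, loop) for perfect stems of fixed length.

    start is the 1-based position of the first stem base. The stem
    length and loop range are illustrative assumptions.
    """
    hits = []
    for i in range(len(seq) - 2 * stem - min_loop + 1):
        left_arm = seq[i:i + stem]
        right_arm = revcomp(left_arm)
        for loop_len in range(min_loop, max_loop + 1):
            j = i + stem + loop_len
            if seq[j:j + stem] == right_arm:
                hits.append((i + 1, left_arm, seq[i + stem:j]))
    return hits


for name, end in (("left", LEFT_END), ("right", RIGHT_END)):
    for start, stem_seq, loop_seq in find_hairpins(end):
        print(name, start, stem_seq, loop_seq)
```

A perfect-match scan like this only finds candidate stems; it says nothing about folding stability or about which structure is biologically relevant.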
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
TAC | TTAC | TTGGGCG | tcaa |
DNA sequence
GAGCCGTGAAGCTAAAGCCCCGTATTTTTAATCGGGGGATATAAGCGAATGACCGAATTTATTCGTCGTAACATGGTATAATTACGTCAGAGAGTTTGAC
TTAAAAATGCTAGTATTTGAGACAAAACTTGAAGGAACAAACGAGCAGTATCAATTGCTGATGAGGCGATTAAAACTGCTCGTTTTGTCGAATGCTTGCC
TCCGTACTTGGATTGGACAACCAAACATCGGCAGGTATGATTTGAGTGCTTATTGCGCTGTCCTGCTGCCAATGAAAACTTTCCGTTCGTTGCCAAACTC
AACTCTATGGCTCGACAAGCTTCTGCTGAAAGAGCGTGGAGTGCAATTGCTCGGTTTTTTGACAATTGCAAGCAAAACAAAACCGGGAAGAAAGGTTATC
CACGCTTTAAAAAAGAACAGACGCATGGGAGTGTTGAGTATAAAACTAGCGGCTGGAAGCTTAGTAGTGACCGTCGCTTATGTCACTTTTAGCGACGGAT
TTAAAGCAGGAACTTTCAAACTCTGGGGAACTCGTGACTTGCATTTCTACCAGTTGAAACAGTTCAAGAGGGTGCGGGTTGTGCGTCGTGCCGATGGGTA
CTACGCGCAGTTTTGCATTGACCAAGAGCGAGTAGAAAGGCGAGAACCAACGCTTAAAACTATTGGGCTGGATGTGGGATTGAACCATTTCTTGACCGAT
AGCGAAGGCAATACAGTTGAGAACCCTAGACACTTGCGTAAAAGCGAAAAGTCTCTCAAGAGATTGCAACGCAGATTGTCTAAAACCAAGAAGGGTTCTA
ACAACAGAGTCAAGGCAAGAAATCGCTTGAGTAGAAAACACCTTAAAGTAAGTAGGCAGCGTAAAGACTTCGCCGTAAAGTTGGCGAGGTGCGTAGTCCA
GTCTAGCGACTTGGTAGCCTATGAGGATTTGCAGGTGCGGAACATGGTCAGGAATAGACATCTTGCCAAGTCGATTAGTGATGCAGCGTGGACGCAGTTT
CGGCAATGGGTTGAGTATTTCGGCAAAGTGTTTGGTGTAGTGACTGTTGCAGTCCCACCCCATCACACTTCGCAGAATTGTTCCAACTGTGGCGAAGTAG
TGAAAAAGTCGCTGAGTACAAGAACTCATGCTTGCCCTCACTGTGGACATATTCAAGACAGGGATTGGAACGCTGCACGGAACATACTTGAACTAGGACT
ACGTACTGTGGGACACACAGGATCTCAAGTCTCTGGAGATATCGACCTCTGTTTGGGTGAGGTAACTCCTCCAAATAAGTCGAGTCGTGGAAAGAGAAAG
CCCAAGAAGTGATTCTTGGAATCCCCGTTTTCTAAACGGGGGAGGATGTCAA
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1206 | 401 | 107 | 1312 | + | No |
AG : TnpB
ORF sequence :
MLVFETKLEGTNEQYQLLMRRLKLLVLSNACLRTWIGQPNIGRYDLSAYCAVLLPMKTFRSLPNSTLWLDKLLLKERGVQLLGFLTIASKTKPGRKVIHA
LKKNRRMGVLSIKLAAGSLVVTVAYVTFSDGFKAGTFKLWGTRDLHFYQLKQFKRVRVVRRADGYYAQFCIDQERVERREPTLKTIGLDVGLNHFLTDSE
GNTVENPRHLRKSEKSLKRLQRRLSKTKKGSNNRVKARNRLSRKHLKVSRQRKDFAVKLARCVVQSSDLVAYEDLQVRNMVRNRHLAKSISDAAWTQFRQ
WVEYFGKVFGVVTVAVPPHHTSQNCSNCGEVVKKSLSTRTHACPHCGHIQDRDWNAARNILELGLRTVGHTGSQVSGDIDLCLGEVTPPNKSSRGKRKPK
K
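The ORF 1 coordinates above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon of the span is a stop codon not counted in the protein length; neither convention is stated in the record. The two sequence rows are copied from the DNA sequence section, where each full row holds 100 bp; the helper names are hypothetical.

```python
# ORF 1 coordinates as listed (assumed 1-based, inclusive).
BEGIN, END = 107, 1312

# Row 2 of the DNA sequence (positions 101-200) and row 14 (positions 1301-1352).
ROW_2_START = 101
ROW_2 = "TTAAAAATGCTAGTATTTGAGACAAAACTTGAAGGAACAAACGAGCAGTATCAATTGCTGATGAGGCGATTAAAACTGCTCGTTTTGTCGAATGCTTGCC"
ROW_14_START = 1301
ROW_14 = "CCCAAGAAGTGATTCTTGGAATCCCCGTTTTCTAAACGGGGGAGGATGTCAA"


def span_bp(begin, end):
    """Length in bp of an inclusive 1-based interval."""
    return end - begin + 1


def codon_at(row, row_start, pos):
    """Return the 3 bases starting at 1-based position pos within a row."""
    offset = pos - row_start
    return row[offset:offset + 3]


length_bp = span_bp(BEGIN, END)      # span of the ORF in bp
n_codons = length_bp // 3            # codons in the span, stop included
protein_aa = n_codons - 1            # residues, assuming a terminal stop

start_codon = codon_at(ROW_2, ROW_2_START, BEGIN)
stop_codon = codon_at(ROW_14, ROW_14_START, END - 2)

print(length_bp, protein_aa, start_codon, stop_codon)
```

Under these assumptions the span works out to 1206 bp and 401 residues, matching the table, with ATG at the listed start and TGA as the final codon.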
Comments
Corrected sequence: insert a G between bp 985 and 986, and read the triplet at bp 1217-1219 as ACA instead of CAC (Donadio, 1993).
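As a quick sanity check on the second correction, the sketch below looks up the triplet at bp 1217-1219 in the sequence as listed in this record. It assumes the coordinates in the comment refer to the listed sequence with 1-based numbering, which the record does not state. The row is copied from the DNA sequence section (row 13, positions 1201-1300); the variable names are hypothetical.

```python
# Row 13 of the DNA sequence as listed (positions 1201-1300).
ROW_13_START = 1201
ROW_13 = "ACGTACTGTGGGACACACAGGATCTCAAGTCTCTGGAGATATCGACCTCTGTTTGGGTGAGGTAACTCCTCCAAATAAGTCGAGTCGTGGAAAGAGAAAG"

# Convert the 1-based inclusive interval 1217-1219 to a Python slice.
first, last = 1217, 1219
triplet = ROW_13[first - ROW_13_START:last - ROW_13_START + 1]

print(triplet)
```

With that numbering the listed sequence reads ACA at these positions, which suggests the sequence shown here already carries the correction, although the record itself does not say so explicitly.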
References
[1] Bancroft, I., and Wolk, P. (1989) J. Bacteriol. 171, 5949-5954.
[2] Donadio, S., and Staver, M.J. (1993) Gene 126, 147-151.
[3] Donadio, S. (1993) Personal communication.
[4] Wolk, C.P. (1996) GenBank direct submission.