ISAsp14
- Family IS200/IS605
- Group IS1341
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_019427 | ND | Anabaena sp. | Anabaena sp. 90 |
DNA section
IS Length : 1794 bp
Ends
Left end : ACAGTTTTGCGTAATTCCCCACCCTCAATGTACTCATAGGGTGGGGATGTAAGCAAGTCAATAACGAAACATCCTGAAAATGTTTTTTACTGTAAGTAAA II struct. : Yes
Right end : GTATGAAGCAACTAGGTCGTGCGAAGCGTCAAAAATCTCAGGCTCCGCTAGGGGATGTAGAAACCCCTGGCTCAAACGAAGTTAGCAAGGGGTAGTTCAT II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
TCTGATCTAG | TTTA | CTGCCAGACT | tcat |
DNA sequence
ACAGTTTTGCGTAATTCCCCACCCTCAATGTACTCATAGGGTGGGGATGTAAGCAAGTCAATAACGAAACATCCTGAAAATGTTTTTTACTGTAAGTAAA
AGACTTTTGAGGATAGTTGAGTTATTGACAACTATATTTTATTTATTCCAAATTATCAATGTATTTCTCGACTAATTCGCGTATCCAGCCGCCATAAGTC
TGTGACGGTGGTTTTATTTTGTCTATTTGTTCAGCTATCTCTTTCTTGAGAGTAACACTAACCTTGACTCTTGATGCTTCATACTTAGCCGTATAAGCCC
TAGTTTTATCGGCATTTCGTTGTTGCCATTCTTGATGTTTTTTATCTATCATTTTCATTGACACTCTAAGCTCTTGGATTATATAATAGTAGTATATTAT
CAAAGAAGATGAAGGTAATAGTTAATGGCGTTACGTAGAGCAACTTTCAGGCTATATCCTAATAAACAAGTCAGTGAGATGTTGCACTACCACCGCAAAC
TTCATAAAGATTTGTATAATGCTGCTGTATCTAACCGCATTACTTCATATAAAAAGTTTGGTAAAACTGTTTCTTACTTTGAACAACAGAACTGTTTACC
GGATTTCAAAGAAGTCTGGATAGAATATAAAGTAATTAATTCCCAAGCATTACAAGCCACATTAAAACGGGTTGATTTTGCTTTTGGAAGGTTTTTTAAG
GGACTAGGAAAGTACCCTAGATTCAAGCCAATCCGTCATTATTCTGGTTGGACTTATCCATCTTTTACAGGCTGGAAAGTACATAGCACAGGGGATAACG
GCTATTTAGAATTATCAAAGATTGGTCAAATCCAAATGCGTGGTAAGGCTAGACTTTGGGGGCATCCTAAAGCTTTAGATATCGTTAACCGCAATGGTCA
ATGGTATGCTTCCATCGTCTTAGAAATTGATGATACTTTGTTAAAGAATAGCCGCAAAACTGATAATGGTGTTATGGCAATTGATTTAGGTTGTAATGAT
GCAATTGCTTGGACAAATGGTGAAGAAAATGGTTTAGTGGCTGCACCTCGGTTTTTCCGAAAAGCAGAACAAAAAAATCAAGAGTTGGGTAAATCAAAAC
GTCGAAAACGTTCTCCCAATTTCAAAAAGAAAGTTAAGGCTTCTAGAAGGTGGAAGAAAGTTCAGAAGTTAGTTAGTAAACTTTCGAGAAAAGTTGCCAA
CCAAAGACAGAACTTTGTTCACCAAGAAACTACACGAATAATCAGCGGTAATAGCACGGTAGTTACTGAAAAATTAGAAGTCAAAAAAATGAGTGCCAAA
GCTAAGAAAGGCGATGCCCTGAGCTTGTCGAAGGGTAAACGTAAAAAACAAAAGGCAGGACTAAATAAATCAATTCTTGATGTGGGAATGGGGATGATAA
GGGACGCTCTAAAAGCGAAATTATCAGACATTGGTGGCTTATTTGTAGAAGTTCCTACAAAGAAGGTAAAACCATCTCAAACCTGTCCAAAATGCGGAAA
TCAAGAAAAGAAGAGTTTAGCCGCAAGAACTCATGTTTGCCATAACTGCGGATATACCCAACAGCGTGATATCGCCGCCGCAGAAGTAATGCTACTTTGG
TACTCAAATAATCTACAGGGGTTAGGAACTAGCCTCTTAGACGTAGATGATTCTAGCTCTACTTCAAACACCAGCGAACGCAAGAATGCTGGAAGTATGA
AGCAACTAGGTCGTGCGAAGCGTCAAAAATCTCAGGCTCCGCTAGGGGATGTAGAAACCCCTGGCTCAAACGAAGTTAGCAAGGGGTAGTTCAT
AGACTTTTGAGGATAGTTGAGTTATTGACAACTATATTTTATTTATTCCAAATTATCAATGTATTTCTCGACTAATTCGCGTATCCAGCCGCCATAAGTC
TGTGACGGTGGTTTTATTTTGTCTATTTGTTCAGCTATCTCTTTCTTGAGAGTAACACTAACCTTGACTCTTGATGCTTCATACTTAGCCGTATAAGCCC
TAGTTTTATCGGCATTTCGTTGTTGCCATTCTTGATGTTTTTTATCTATCATTTTCATTGACACTCTAAGCTCTTGGATTATATAATAGTAGTATATTAT
CAAAGAAGATGAAGGTAATAGTTAATGGCGTTACGTAGAGCAACTTTCAGGCTATATCCTAATAAACAAGTCAGTGAGATGTTGCACTACCACCGCAAAC
TTCATAAAGATTTGTATAATGCTGCTGTATCTAACCGCATTACTTCATATAAAAAGTTTGGTAAAACTGTTTCTTACTTTGAACAACAGAACTGTTTACC
GGATTTCAAAGAAGTCTGGATAGAATATAAAGTAATTAATTCCCAAGCATTACAAGCCACATTAAAACGGGTTGATTTTGCTTTTGGAAGGTTTTTTAAG
GGACTAGGAAAGTACCCTAGATTCAAGCCAATCCGTCATTATTCTGGTTGGACTTATCCATCTTTTACAGGCTGGAAAGTACATAGCACAGGGGATAACG
GCTATTTAGAATTATCAAAGATTGGTCAAATCCAAATGCGTGGTAAGGCTAGACTTTGGGGGCATCCTAAAGCTTTAGATATCGTTAACCGCAATGGTCA
ATGGTATGCTTCCATCGTCTTAGAAATTGATGATACTTTGTTAAAGAATAGCCGCAAAACTGATAATGGTGTTATGGCAATTGATTTAGGTTGTAATGAT
GCAATTGCTTGGACAAATGGTGAAGAAAATGGTTTAGTGGCTGCACCTCGGTTTTTCCGAAAAGCAGAACAAAAAAATCAAGAGTTGGGTAAATCAAAAC
GTCGAAAACGTTCTCCCAATTTCAAAAAGAAAGTTAAGGCTTCTAGAAGGTGGAAGAAAGTTCAGAAGTTAGTTAGTAAACTTTCGAGAAAAGTTGCCAA
CCAAAGACAGAACTTTGTTCACCAAGAAACTACACGAATAATCAGCGGTAATAGCACGGTAGTTACTGAAAAATTAGAAGTCAAAAAAATGAGTGCCAAA
GCTAAGAAAGGCGATGCCCTGAGCTTGTCGAAGGGTAAACGTAAAAAACAAAAGGCAGGACTAAATAAATCAATTCTTGATGTGGGAATGGGGATGATAA
GGGACGCTCTAAAAGCGAAATTATCAGACATTGGTGGCTTATTTGTAGAAGTTCCTACAAAGAAGGTAAAACCATCTCAAACCTGTCCAAAATGCGGAAA
TCAAGAAAAGAAGAGTTTAGCCGCAAGAACTCATGTTTGCCATAACTGCGGATATACCCAACAGCGTGATATCGCCGCCGCAGAAGTAATGCTACTTTGG
TACTCAAATAATCTACAGGGGTTAGGAACTAGCCTCTTAGACGTAGATGATTCTAGCTCTACTTCAAACACCAGCGAACGCAAGAATGCTGGAAGTATGA
AGCAACTAGGTCGTGCGAAGCGTCAAAAATCTCAGGCTCCGCTAGGGGATGTAGAAACCCCTGGCTCAAACGAAGTTAGCAAGGGGTAGTTCAT
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
216 bp | 71 aa | 358 | 143 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MKMIDKKHQEWQQRNADKTRAYTAKYEASRVKVSVTLKKEIAEQIDKIKPPSQTYGGWIRELVEKYIDNLE
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1365 bp | 454 aa | 425 | 1789 | + | No |
AG : TnpB
ORF sequence :
MALRRATFRLYPNKQVSEMLHYHRKLHKDLYNAAVSNRITSYKKFGKTVSYFEQQNCLPDFKEVWIEYKVINSQALQATLKRVDFAFGRFFKGLGKYPRF
KPIRHYSGWTYPSFTGWKVHSTGDNGYLELSKIGQIQMRGKARLWGHPKALDIVNRNGQWYASIVLEIDDTLLKNSRKTDNGVMAIDLGCNDAIAWTNGE
ENGLVAAPRFFRKAEQKNQELGKSKRRKRSPNFKKKVKASRRWKKVQKLVSKLSRKVANQRQNFVHQETTRIISGNSTVVTEKLEVKKMSAKAKKGDALS
LSKGKRKKQKAGLNKSILDVGMGMIRDALKAKLSDIGGLFVEVPTKKVKPSQTCPKCGNQEKKSLAARTHVCHNCGYTQQRDIAAAEVMLLWYSNNLQGL
GTSLLDVDDSSSTSNTSERKNAGSMKQLGRAKRQKSQAPLGDVETPGSNEVSKG
KPIRHYSGWTYPSFTGWKVHSTGDNGYLELSKIGQIQMRGKARLWGHPKALDIVNRNGQWYASIVLEIDDTLLKNSRKTDNGVMAIDLGCNDAIAWTNGE
ENGLVAAPRFFRKAEQKNQELGKSKRRKRSPNFKKKVKASRRWKKVQKLVSKLSRKVANQRQNFVHQETTRIISGNSTVVTEKLEVKKMSAKAKKGDALS
LSKGKRKKQKAGLNKSILDVGMGMIRDALKAKLSDIGGLFVEVPTKKVKPSQTCPKCGNQEKKSLAARTHVCHNCGYTQQRDIAAAEVMLLWYSNNLQGL
GTSLLDVDDSSSTSNTSERKNAGSMKQLGRAKRQKSQAPLGDVETPGSNEVSKG
Blast result :
Comments
ISAsp14 is 65% (TnpB) aa similar to ISCysp13. The first ORF is a passenger gene annotated as hypothetical protein.
There are 1 complete copy and 2 disrupted copies in the genome.
There are 1 complete copy and 2 disrupted copies in the genome.
References
1] Wang,H., Sivonen,K., Rouhiainen,L., Fewer,D.P., Lyra,C., Rantala-Ylinen,A., Vestola,J., Jokela,J., Rantasarkka,K., Li,Z. and Liu,B. (2012) BMC Genomics 13, 613.