ISFsp4
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
 | ND | Frankia sp. | Frankia sp. EAN1pec |
DNA section
IS Length : 1784 bp
Ends
Left end : CAGGAAAACGTAGCGCGCAACCCCCGTACTTCGGAGTGGGGTTAGCGGGTCGGGGCTGGTCGCCGTTGCTCCTCGATGTACTCGCGTACGAGTGCCAGTG II struct. : Yes
Right end : CTCCGGGCAGCTGGCTGTGAAACAGGAACCCACCGAGGTGCCGCGTGAACCCACGCCGCACGGCAGGAACCCCCAGCCTTCAAGCCAGGGAGGACGTCAA II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GCCCCCAGGAGTGCTCGTGC | TGAG | ATGACCCGGCGATCGGCGAGCC | TCAA |
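The table above can be read as two junctions between the element and its target DNA. The following is a minimal Python sketch, not part of the original record. It rests on two assumptions: the LE cleavage site lies in the flanking DNA immediately 5' of the left end and directly after the listed left flank, and the RE cleavage site is the last four nucleotides of the element itself. The second assumption is consistent with the right end listed under Ends, which finishes in TCAA. All names are illustrative.

```python
# Values copied from the Ends and Insertion site entries of this record.
LEFT_FLANK = "GCCCCCAGGAGTGCTCGTGC"
LE_CLEAVAGE_SITE = "TGAG"
RIGHT_FLANK = "ATGACCCGGCGATCGGCGAGCC"
RE_CLEAVAGE_SITE = "TCAA"
IS_FIRST_20 = "CAGGAAAACGTAGCGCGCAA"  # first 20 nt of the left end
IS_LAST_20 = "CAAGCCAGGGAGGACGTCAA"   # last 20 nt of the right end


def re_site_closes_element() -> bool:
    """True if the element's final nucleotides equal the RE cleavage site."""
    return IS_LAST_20.endswith(RE_CLEAVAGE_SITE)


def left_junction() -> str:
    """Flank + LE cleavage site, then the element; '|' marks the boundary.

    ASSUMPTION: the cleavage site directly follows the listed left flank.
    """
    return LEFT_FLANK + LE_CLEAVAGE_SITE + "|" + IS_FIRST_20


def right_junction() -> str:
    """End of the element, then the right flank; '|' marks the boundary."""
    return IS_LAST_20 + "|" + RIGHT_FLANK


if __name__ == "__main__":
    print(re_site_closes_element())
    print(left_junction())
    print(right_junction())
```

If the assumptions hold, the `|` in each printed junction marks where the element meets the target DNA.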
DNA sequence
CAGGAAAACGTAGCGCGCAACCCCCGTACTTCGGAGTGGGGTTAGCGGGTCGGGGCTGGTCGCCGTTGCTCCTCGATGTACTCGCGTACGAGTGCCAGTG
GGGCGTCTTCGGCCGAGACTGCGAGGTAGGACGGTGACCAGAAGTGGCCGCGTATGAGGTGCTGGTTGACCCGGTCGGTGAACTCGGTGCGCAGCCGTCG
GGCGGACACGCCTTTGAGGCTGTTCACCAGCGCGGAGACCGCGACCTTGGGTGGGTACTCGACCAGCAGGTGGACGTGATCGTCTTCTCCGTTGAACTCG
GTGAGGGCTGCGCCGAAGTCGCAGCACACGCTGCGCATGATCTGTTCGCAGCGGGTCAGCATCGCGTCGTCGAGTCCTCCACGCCGATACCTGGTCACGA
AGACCAAGTGGGCGTGCAGCGAGGAGACCACATGTCTGCCTCGCCTGTAGTCGTCTGCTTCCGTCATAGACCAAATATGCTGGATGGTGTGGCAGTCGTT
CGTTACCGCTACCGTGCCTACCCCGGGCCGGGGCAGGCACGGGCGTTGGCGCGGGCGTTCGGCTGCGCCCGGGGGTCGTGTTCAACGACGCGATCCGGGC
CCGCGACGAGGCGTACAAGGCCGGCGAGAAACTGTCGGACACCGAGGTTCAGCGCCGGGTGGTCACCCTCGCGAAACTCACCGACGAGCGAACCTGGCTG
TCCGAGGTGTCGTCGGTGGTACTCGTGCAGGCGTGCCAGGACGCACGCCGGGCGTTCCGGAACTGGTTCGACTCGCTGTCCGGGAAGCGGAAAGGCCGGC
AGGTCGGCCATCCGCGGTTCCGGTCACGGAAGGACAACCGGCAGTCGATCCGCCTCACCCGCAACGGCTTCACCGTCACGCCCCGAGGGGTGCGGGTGGC
GAAGGTCGGAGATCTGCGGCTGGCCTGGTCGCGTCCGCTGCCCTCGGTTCCGACGTCGGCGACGGTGATCCGGGAGGCGGACGGCAGGTACTACGTGTCG
TTCGTCGTCGACGTCGACGACGTCCCCTCCCCGGCGACAGGCGCCGAGATCGGCGTCGACCTCGGGTTGGACCGGCTCGCGACCCTGTCAACCGGACAGA
TCGTCGCGAACCCGCGTCCTCTGCGGTCGCGTCAGCGCAGGCTCGCCCGCGCACAGCGGGCACTGGCCCGCAAGCGGAAGGGTTCGGTGAACCGGCGCAA
GGCGGTCCGCCGGGTCGCGGTCGAACATCGGAAGGTACGGGACACCCGCCGGGATCATCATCACAAGCTCGCTGCTCGGCTGGTCCGCGACAACCAAGCG
GTCTACGTCGAGGATCTGGCGGTAGCCGGGCTGGCTCGTACGCGGCTGGCCCGGTCGGTGCACGACGCGGGCTGGTCGATGCTGGTCGGTCTGCTCGAGG
AGAAAGCGGCCCGGTGTGGCCGGGCCGTGGTGAGGGTGGGCCGGTTCTTCCCGTCGTCGCAGGTCTGCTCGGCCTGCGGCCACCGGGACGGCCCGAAGCC
TCTCCAGGTCCGGACGTGGACCTGTCCGGGGTGCGGTGTCAGCCACGACCGGGACCTGAATGCCGCGCGGAACATCCTCGTCGAGGGTCAGCGCCTGGTC
GCCGCCGGGCGGAAAGGCGTGGCTGCAATGCCACGTCAGGCGGAGACCGTAAACGCCTGCGGAGCCGACGTGAGACCCGGACCCCTCCGGGCAGCTGGCT
GTGAAACAGGAACCCACCGAGGTGCCGCGTGAACCCACGCCGCACGGCAGGAACCCCCAGCCTTCAAGCCAGGGAGGACGTCAA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
426 bp | 141 aa | 42 | 467 | - | No |
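The figures in this row can be cross-checked with simple coordinate arithmetic. The sketch below is illustrative and not part of the original record. It assumes Begin and End are 1-based, inclusive positions on the top strand of the IS sequence and that the span includes the stop codon. The two triplets are read by hand from the DNA section at positions 465-467 and 42-44.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def feature_length_bp(begin: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - begin + 1


def residues_encoded(length_bp: int) -> int:
    """Amino acids encoded, ASSUMING the span ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return length_bp // 3 - 1


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


# ORF 1 lies on the minus strand, so it is read from End (467) toward Begin (42).
orf1_bp = feature_length_bp(42, 467)
orf1_aa = residues_encoded(orf1_bp)

# Top-strand triplets copied from the DNA sequence of this record.
start_codon = reverse_complement("CAT")  # positions 465-467
stop_codon = reverse_complement("TTA")   # positions 42-44

if __name__ == "__main__":
    print(orf1_bp, orf1_aa, start_codon, stop_codon)
```

The same two functions applied to ORF 2 (578 to 1732) give 1155 bp and 384 residues, matching its table row.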
Chemistry : Y1
ORF sequence :
MTEADDYRRGRHVVSSLHAHLVFVTRYRRGGLDDAMLTRCEQIMRSVCCDFGAALTEFNGEDDHVHLLVEYPPKVAVSALVNSLKGVSARRLRTEFTDRV
NQHLIRGHFWSPSYLAVSAEDAPLALVREYIEEQRRPAPTR
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1155 bp | 384 aa | 578 | 1732 | + | No |
AG : TnpB
ORF sequence :
MFNDAIRARDEAYKAGEKLSDTEVQRRVVTLAKLTDERTWLSEVSSVVLVQACQDARRAFRNWFDSLSGKRKGRQVGHPRFRSRKDNRQSIRLTRNGFTV
TPRGVRVAKVGDLRLAWSRPLPSVPTSATVIREADGRYYVSFVVDVDDVPSPATGAEIGVDLGLDRLATLSTGQIVANPRPLRSRQRRLARAQRALARKR
KGSVNRRKAVRRVAVEHRKVRDTRRDHHHKLAARLVRDNQAVYVEDLAVAGLARTRLARSVHDAGWSMLVGLLEEKAARCGRAVVRVGRFFPSSQVCSAC
GHRDGPKPLQVRTWTCPGCGVSHDRDLNAARNILVEGQRLVAAGRKGVAAMPRQAETVNACGADVRPGPLRAAGCETGTHRGAA
Blast result :
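The ORF 2 coordinates can be checked against the DNA sequence by translating the first codons from position 578. The sketch below is illustrative and not part of the original record. It uses the standard codon table, which is sufficient for these codons, and assumes that a GTG or TTG initiation codon is read as methionine, as is usual in bacteria. The 21-nt string is copied from positions 578-598 of the DNA section.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# ASSUMPTION: non-ATG initiation codons are decoded as methionine at position 1.
ALTERNATIVE_STARTS = {"GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate an in-frame, upper-case DNA string; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in ALTERNATIVE_STARTS:
        protein[0] = "M"
    return "".join(protein)


# Positions 578-598 of the IS sequence in this record (plus strand).
ORF2_START = "GTGTTCAACGACGCGATCCGG"

if __name__ == "__main__":
    print(translate(ORF2_START))  # MFNDAIR
```

The result matches the first seven residues of the ORF 2 sequence, which suggests that this ORF starts with a GTG codon rather than ATG.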
Comments
ISFsp4 shows 62% amino acid similarity to IS606 (ORF A) and 54% to IS1136A (ORF B).
Contig accession number : NZ_AAII01000195
References
[1] US DOE Joint Genome Institute