ISFsp3
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AY008259 | ND | Frankia sp. | Frankia sp. Ar15 |
DNA section
IS Length : 2209 bp
Ends
IR Length : 27/34
IRL : TGTCGCCGCCTCCTGAAAACTGACCCCCCAGGGGGCCGCGAAAACTGACC
IRR : TGTCAAGGCTTCCTGAAAACTGACCCCGTGGGGGTCTCTGAAATTTGACC
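The value 27/34 can be read as 27 identical positions over the first 34 bp of the two terminal repeats. That reading is an assumption, as the record does not define the field. A minimal Python sketch (the helper names `reverse_complement` and `count_matches` are illustrative) checks it against the IRL and IRR strings above, and also checks that IRR is given as the reverse complement of the right end of the DNA sequence.

```python
# IRL and IRR copied from the record.
IRL = "TGTCGCCGCCTCCTGAAAACTGACCCCCCAGGGGGCCGCGAAAACTGACC"
IRR = "TGTCAAGGCTTCCTGAAAACTGACCCCGTGGGGGTCTCTGAAATTTGACC"

# Last 50 bp of the DNA sequence as listed in the record (top strand).
RIGHT_END = "GGTCAAATTTCAGAGACCCCCACGGGGTCAGTTTTCAGGAAGCCTTGACA"

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def count_matches(a: str, b: str, length: int) -> int:
    """Count identical positions in the first `length` bases (no gaps)."""
    return sum(x == y for x, y in zip(a[:length], b[:length]))


# Assumption: "27/34" means identical positions / compared length.
matches = count_matches(IRL, IRR, 34)
print(f"{matches}/34 identical positions")

# IRR should equal the reverse complement of the right end.
print(reverse_complement(RIGHT_END) == IRR)
```

The comparison is ungapped and position by position, so it only applies to repeats that align without indels, as these two appear to.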
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATCCGGGTCAACGTCCACCAC | | GGGCGGAGAGCAGCGCCGGG | 0 |
DNA sequence
TGTCGCCGCCTCCTGAAAACTGACCCCCCAGGGGGCCGCGAAAACTGACCCCCTGCTGGTGTAGGAGGGGTGCTGAAGGTGGAGGACTGGGCGGAGATCC
GCAGGTTGCGTCGAGCTGAGGGCGTTCCGATCAAGGAGATCGCTCGACGGCTGGGTGTGGCGCGGAACACGGTGCGGGCGGCGTTGGCGTCGGACCGGCC
GCCGCAGTACGAACGGGCGCCGCGCGGGTCGGTGGTCGATCCGTTCGAGCCGGTGATCCGGGCGTTGCTGGCGGAGTGGCCGCGGATGCCGGCGCCGGTG
ATCGCGCAGCGGATCGGCTGGCCGTATTCGCTGTCGCCGTTGAAGAAGCGATTGACGGTGATCCGGCCGGAGTATGTCGGGATCGACCCGGTGGACCGGA
TGGTGTATGAGCCGGGGGAGTTCGCGCATTGTGATCTGTGGTTCCCGGAGCCGGTGATTCCGGTGGGTGCGGGGCAGGAAAGGGTGCTTCCGGTGCTGGT
GATGACGTTGGCGTTCTCCCGGTTCCTGACCGCGACGATGATTCCCTCGCGGCAGGCCGGTGACATTCTCGCGGGCATGTGGCTGCTGATCGGCCGGGTC
GGGCGGGTCACGAAGACGCTGGTGTGGGATCGGGAATCGGCGATCGGGGGGACCGGCCGGGTGTCGGCGCCGGCGGCGGGGTTCGCGGGAACGTTGGCGA
CCCAGATCAGGCTTGCGCCGCCACGGGACCCGGAATATAAGGGTATCGTCGAGCGGGCCAACGGCTATTTCGAGACGTCGTTTCTGCCGGGCCGGCGGTT
CGTCTCCCCGGAGGATTTCAACATCCAGCTGGCTGAATGGCTGACGCTGGCGAATGCCCGCACCGTGCGGTCGGTCGGGGGCCGTCCGGTCGATCTGCTG
GAGACCGATCTGCGGTCGATGCTGGAACTGCCACCGGTCGACCCGCTGACCGGCCTTTCCGCCCGGGTCCGGCTCGGCCGGGACTACTACGTGCGGGTCG
ACACCGTCGACTACTCCGTCGACCCGCGGGCGATCGGCCGGTTCGTCGACGTGACCGCCTCGCTGGACACGGTGGCGGTGACCTGCGACGGCCAGCCCGT
GGCCCGTCATGCCCGCTCGTGGGCCCGCCACGGTGTCATCACCGATCCCGAGCATGCCGCGGCCGCCGCGCGGATGCGCCAGGCCCTGGCCGAGGACCGC
CGGCGCCGGGCCGCGGCAACACGCCACCACGGCGACGGCCACCCGGTCAGCATGCGGGCGCTGCCGGACTACGACGCCCTGTTCGGCGTCGACTTCACCC
CCACACCGTCCGAGAAGAAAGCGAGCAGCGAATGACCACAGCCACCATACCGGCGGCACCGGCCGGGAAAACCTCCGACGGCATGCCAGCAATGATCGCC
TATCTGACCCGGGTGTTGAAAACCCCGACGATCGGCGCGTTCTGGGAAGAACTCGCCATCCAGGCCCGCGAAGAGAACTGGTCCCACGAAGAATACCTCG
CCGCTCTGCTGCAGCGCCAGGTCGCCGACCGCGAGTCCAAAGGCACCGTCATGCGGATCCGCACCGCGCACTTCCCGACCGTCAAAACGTTGGAGGACTT
CAACCTCGACCACCTCCCCTCACTTCGCCGCGACGTTCTCGCCCATCTGGCCACCAGCACCTACATCGCCAAAGCGGGAAACGTCGTCCTCCTCGGCCCG
CCCGGCGTCGGGAAGACCCACCTTGCCATCGGCCTGGGCGTCAAAGCAACCCACGCCGGCTACTCCGTCCTGTTCGACACCGCCAGCAACTGGATAACCC
GCCTCGCCGACGCCCACCACGCCGGCCGTCTCGACGAAGAACTCAGAAAGATCCGCCGCTACAAACTTATCATCATCGACGAAGTCGGCTACATCCCCTT
CGACCAGGACGCAGCGAACCTGTTCTTCCAGCTCATCGCCTCCCGCTACGAACAGGGCTCGGTCCTGGTCACCTCGAACCTCCCCTTCGGCCGCTGGGGC
GAGACCTTCTCCGACGACGTCGTCGCCGCCGCCATGATCGACCGGCTCGTCCACCACGCCGAGGTCCTCACCCTCGCCGGCGACTCCTACCGCACCCGCC
AACGCCGCGAGCTCCTCGCCAAGGACCGACCCAACCACAACTGATCAACCAGGAATGGGGGTCAAATTTCAGAGACCCCCACGGGGTCAGTTTTCAGGAA
GCCTTGACA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
936 | 311 | 400 | 1335 | + | No |
Chemistry : DDE
ORF sequence :
MVYEPGEFAHCDLWFPEPVIPVGAGQERVLPVLVMTLAFSRFLTATMIPSRQAGDILAGMWLLIGRVGRVTKTLVWDRESAIGGTGRVSAPAAGFAGTLA
TQIRLAPPRDPEYKGIVERANGYFETSFLPGRRFVSPEDFNIQLAEWLTLANARTVRSVGGRPVDLLETDLRSMLELPPVDPLTGLSARVRLGRDYYVRV
DTVDYSVDPRAIGRFVDVTASLDTVAVTCDGQPVARHARSWARHGVITDPEHAAAAARMRQALAEDRRRRAAATRHHGDGHPVSMRALPDYDALFGVDFT
PTPSEKKASSE
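As a consistency check between the DNA sequence and the ORF 1 protein, the first 45 bases of ORF 1 (positions 400 to 444 of the DNA sequence) can be translated and compared with the start of the ORF sequence. This is only a sketch: it assumes the standard genetic code, and `CODON_TABLE` and `translate` are illustrative names, not part of the record.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an uppercase DNA string in frame 0; ignore trailing bases."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Positions 400-444 of the DNA sequence (start of ORF 1), copied from the record.
ORF1_START = "ATGGTGTATGAGCCGGGGGAGTTCGCGCATTGTGATCTGTGGTTC"

# First 15 residues of the ORF 1 protein as listed in the record.
ORF1_PROTEIN_START = "MVYEPGEFAHCDLWF"

print(translate(ORF1_START))
print(translate(ORF1_START) == ORF1_PROTEIN_START)
```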
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1308 | 435 | 837 | 2144 | + | No |
AG : IS21 helper
ORF sequence :
MADAGECPHRAVGRGPSGRSAGDRSAVDAGTATGRPADRPFRPGPARPGLLRAGRHRRLLRRPAGDRPVRRRDRLAGHGGGDLRRPARGPSCPLVGPPRC
HHRSRACRGRRADAPGPGRGPPAPGRGNTPPRRRPPGQHAGAAGLRRPVRRRLHPHTVREESEQRMTTATIPAAPAGKTSDGMPAMIAYLTRVLKTPTIG
AFWEELAIQAREENWSHEEYLAALLQRQVADRESKGTVMRIRTAHFPTVKTLEDFNLDHLPSLRRDVLAHLATSTYIAKAGNVVLLGPPGVGKTHLAIGL
GVKATHAGYSVLFDTASNWITRLADAHHAGRLDEELRKIRRYKLIIIDEVGYIPFDQDAANLFFQLIASRYEQGSVLVTSNLPFGRWGETFSDDVVAAAM
IDRLVHHAEVLTLAGDSYRTRQRRELLAKDRPNHN
Blast result :
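The Begin and End coordinates listed for ORF 1 and ORF 2 determine the lengths, reading frames and overlap of the two ORFs. The sketch below works through that arithmetic. It assumes 1-based inclusive coordinates and that the last codon of each ORF is a stop codon not counted in the protein length; the `Orf` class and `overlap_bp` helper are illustrative names only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orf:
    name: str
    begin: int  # 1-based, inclusive (assumed)
    end: int    # 1-based, inclusive (assumed)

    @property
    def length_bp(self) -> int:
        return self.end - self.begin + 1

    @property
    def length_aa(self) -> int:
        # Assumption: the final codon is a stop codon.
        return self.length_bp // 3 - 1

    @property
    def frame(self) -> int:
        # Reading frame (0, 1 or 2) relative to position 1 of the IS.
        return (self.begin - 1) % 3


def overlap_bp(a: Orf, b: Orf) -> int:
    """Number of positions shared by two ORFs on the same strand."""
    return max(0, min(a.end, b.end) - max(a.begin, b.begin) + 1)


orf1 = Orf("ORF 1", 400, 1335)
orf2 = Orf("ORF 2", 837, 2144)

for orf in (orf1, orf2):
    print(orf.name, orf.length_bp, "bp,", orf.length_aa, "aa, frame", orf.frame)
print("overlap:", overlap_bp(orf1, orf2), "bp")
```

With the coordinates given in the record, the computed lengths agree with the listed 936 bp / 311 aa and 1308 bp / 435 aa, and the two ORFs overlap in different reading frames.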
Comments
ISFsp3 shows 57% amino acid similarity to ISMt3 (ORF A) and 75% to ISLxx1 (ORF B).
References
[1] Bock, J.V., Battershell, T., Wiggington, J., John, T.R. and Johnson, J.D. (2001) Microbiology 147 (Pt 2), 499-506