ISCsp3
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
BBXH01000014 | ND | Comamonas sp. | Comamonas sp. E6 |
DNA section
IS Length : 2495 bp
Ends
IR Length : 15/16
IRL : tgttgatttccacgcagaagtgacccactaacccccaggatttccatcta
IRR : tgttgatttccatgcaatctgaacccaccaattccatctcgatctgaccc
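The 15/16 figure can be reproduced from the two end sequences listed above. The sketch below is illustrative only: it assumes the IR length is counted over the first 16 positions of IRL and IRR as printed, and that IRR is given as the reverse complement of the element's right end. `RIGHT_END` is the last 50 nt of the DNA sequence in this record; the helper names are hypothetical.

```python
# End sequences copied from this record.
IRL = "tgttgatttccacgcagaagtgacccactaacccccaggatttccatcta"
IRR = "tgttgatttccatgcaatctgaacccaccaattccatctcgatctgaccc"
# Last 50 nt of the DNA sequence in this record (top strand).
RIGHT_END = "GGGTCAGATCGAGATGGAATTGGTGGGTTCAGATTGCATGGAAATCAACA"

_COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (case preserved)."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_matches(left: str, right: str, span: int) -> int:
    """Count identical positions over the first `span` bases of both ends."""
    return sum(a == b for a, b in zip(left[:span], right[:span]))


# Assumption: the IR is scored over the first 16 positions.
matches = count_matches(IRL, IRR, 16)
print(f"IR identity: {matches}/16")

# Assumption: IRR is reported as the reverse complement of the right end.
irr_from_sequence = reverse_complement(RIGHT_END).lower()
print("IRR matches right end:", irr_from_sequence == IRR)
```

The single mismatch falls at position 13 (c in IRL, t in IRR).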
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GCATGATTTGGGC | GAAGCGA | TTCACGGAGA | 7 |
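As a reading aid for the insertion site table, the sketch below assembles the flanking context in the usual target site duplication layout (left flank, direct repeat, element, direct repeat, right flank). That layout is an assumption, since the record does not spell it out, and `insertion_context` is a hypothetical helper. A placeholder stands in for the 2495 bp element.

```python
# Values copied from the insertion site table in this record.
LEFT_FLANK = "GCATGATTTGGGC"
DIRECT_REPEAT = "GAAGCGA"
RIGHT_FLANK = "TTCACGGAGA"
IS_LENGTH = 2495  # bp, from the "IS Length" field


def insertion_context(element: str) -> str:
    """Assumed layout: the direct repeat flanks the element on both sides."""
    return LEFT_FLANK + DIRECT_REPEAT + element + DIRECT_REPEAT + RIGHT_FLANK


# The DR length reported in the table is the length of the repeat itself.
dr_length = len(DIRECT_REPEAT)
print("DR length:", dr_length)

# Placeholder element; substitute the DNA sequence from this record.
context = insertion_context("N" * IS_LENGTH)
print("Context length:", len(context))
```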
DNA sequence
TGTTGATTTCCACGCAGAAGTGACCCACTAACCCCCAGGATTTCCATCTAAATCTGATCCACGTATTAACCCTAGCCTGCTGCTTTCTTAGGCAGCAGGA
GACCAGGAGTGATAGACGTGGCAACACTGAGTGTCATCAGGCGATGGGCCCTGCGCGAGCAACTGTCCATCCGCGAGATCTCCCGCCGCACAGGCCTGTC
TCGCAACACCATTCGCAAGTACCTGCGCGCTGACTCTGCCGAGCCCAGCTATGCCAAGCGGGTCAGTCCCAGCAAGCTCGATCCGTTCGCCACCAAGCTG
GCCGACTGGCTCAAGACCGAGTCCACCCGGGGTCGCAAGCAGCGACGCACCGTCAAACAGATGTACTTTGACCTGAAGGTGCTGGGCTACAGCGGCTCTT
ACAACCGCGTGGCGGCCTTCGCCCGAGTCTGGCAAGCCCAGCAGATGCAGGCACAGCAAACGACCGGCCGCGGCACCTTCGTGCCCCTCCTATTCGATTG
CGGTGAGGCGTTCCAGTTTGACTGGTCCGAGGACTGGGCCGTCATCGCCGGTGTGCGCACCAAGCTGCAAGTGGCCCACTTCAAGCTCAGCCACAGTCGT
GCTTTTTATCTGCGTGCCTACCCCCTGCAGACCCATGAGATGCTGTTTGACGCTCACAACCACGCCTTTGAGGTGCTTGGTGGCGTACCTCGCAGGGGCA
TCTACGACAACATGCGCACCGCCGTGGATCGGGTCCGCAAGGGTAAGGACAGAGACGTCAATGCACGCTTTGCCGCCATGGTCAGCCACTTCCTGTTTGA
AGCCGAGTTCTGCAACCCGGCCTCCGGGTGGGAGAAGGGGCAGGTGGAGAAGAATGTGCGCGACGCTCGTCATCGCCTGTGGCAATCGGTGCCAGCGTTT
GCAACGCTGGACGACCTCAATGCCTGGTTAGAGCAGCGCTGCCAGGCGCTGTGGCACGAGATTGAGCATGGCAAGCTTCCCGGAACCGTTGCGGACGTCT
GGGCCCAGGAGCGCTCTGCACTGATGCCCGTGCCAAGGGCCTTTGATGGCTTCGTGGAGCACACCAAGCGAGTCTCACCCACCTGCCTGATTCACTTTGA
GCGCAACCGCTACAGTGTCCCGGCTCCATACGCCAACAGGCCCGTTAGCCTGCGTGTCTATGCCAGTCGGCTGGTTGTGGCGGCTGAGGGGCAGCTGCTC
TGCGAGCACCAGCGGGTCATTGAGCGCAATCATCACGGTGCGGGCCAGACGATCTACGACTGGCGTCACTACCTGGCCGTCCTCCAGCGCAAACCTGGCG
CTTTGCGCAACGGTGCCCCATTCCAGGAATTACCCCCAGCCTTCAAGAGGCTGCAGGCCTTGCTGCTCAAGCAGCGAGGCGGAGATCGGGAGATGGTGGA
GATCCTGGCCCTGGTGCTTCACCATGATGAGCAGGCGGTGCTCGCAGCAGTTGAGCTGGCGTTGGAGGCCGGGGCTCCCAGCAAGACACACATCCTGAAC
GTTCTGCATAGATTGGTCGATGGCAGACCCGCTCCGGCTCCGGTCACTTCCCCCCAAGCCTTGCGCCTGGTGGTGGAACCCATGGCCAATGTGCTGCGCT
ATGACCAGTTGAGGGAGGTGCGCCATGCGACATGACCCAGCTATTGCGGCCATCGTCATCATGTTGCGAGAGCTCAAGATGCATGGCATGGCTCAGGCGG
TCAATGAACTGGCTGAACAAGGGGCCCCAGCGTTTGACGCTGCGCAGCCCATCCTGTCGCAGTTGCTCAAGGCCGAGACTGCCGAACGGGAGGTACGCTC
GGTGGCCTATCAGCTCAAGGTGGCAAGGTTCCCAGCCTATCGAGACCTGACGGGCTTCAACTTCAGCCACAGCGAGGTCAATGAAGCCTTGGTTCGGCAG
TTGCACCGCTGTGAGTTCCTAGAAGATGCCAACAACGTGGTTCTCGTGGGCGGACCGGGAACGGGTAAGACCCACATTGCCACAGCCCTGGGCGTGCAAG
CGATTGAACATCACCACCGCAGGGTGCGGTTCTTCTCAACGGTGGAGTTGGTCAACGCATTGGAGGAAGAGAAGGCTCAGGGCAAGGCGGGACAGCTCGC
TCACCGGCTGGCGTATGCCGACCTGGTGATTCTGGATGAGCTGGGTTACCTGCCATTCAGTGCCTCTGGCGGGGCACTTCTGTTCCATCTGCTGTCCAAA
CTCTACGAACGCACCAGCGTCGTGATCACCACCAACCTGAGCTTCAGTGAATGGGCCAGCGTGTTCGGGGACGCAAAGATGACCACGGCACTGCTGGACC
GACTCACCCATCACTGCCATATTCTTGAAACCGGCAACGACAGCTACCGATTCAAGAACAGTTCTGCCCAGCAACCCCAATCTAAAAAGGAAAGGAACAC
CAAGAACTTATCCACAACGTGAGCTTCAAGTTCACGCAACAGGGTGGGTCAGATCGAGATGGAATTGGTGGGTTCAGATTGCATGGAAATCAACA
Protein section
ORF number : 2
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1527 | 508 | 109 | 1635 | + | No |
Chemistry : DDE
ORF sequence :
MIDVATLSVIRRWALREQLSIREISRRTGLSRNTIRKYLRADSAEPSYAKRVSPSKLDPFATKLADWLKTESTRGRKQRRTVKQMYFDLKVLGYSGSYNR
VAAFARVWQAQQMQAQQTTGRGTFVPLLFDCGEAFQFDWSEDWAVIAGVRTKLQVAHFKLSHSRAFYLRAYPLQTHEMLFDAHNHAFEVLGGVPRRGIYD
NMRTAVDRVRKGKDRDVNARFAAMVSHFLFEAEFCNPASGWEKGQVEKNVRDARHRLWQSVPAFATLDDLNAWLEQRCQALWHEIEHGKLPGTVADVWAQ
ERSALMPVPRAFDGFVEHTKRVSPTCLIHFERNRYSVPAPYANRPVSLRVYASRLVVAAEGQLLCEHQRVIERNHHGAGQTIYDWRHYLAVLQRKPGALR
NGAPFQELPPAFKRLQALLLKQRGGDREMVEILALVLHHDEQAVLAAVELALEAGAPSKTHILNVLHRLVDGRPAPAPVTSPQALRLVVEPMANVLRYDQ
LREVRHAT
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
798 | 265 | 1625 | 2422 | + | No |
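The ORF lengths follow from the Begin and End coordinates. The sketch below assumes the coordinates are 1-based and inclusive and that each span includes the stop codon; both assumptions agree with the lengths reported for ORF 1 and ORF 2 in this record. `orf_lengths` is a hypothetical helper.

```python
def orf_lengths(begin: int, end: int) -> tuple[int, int]:
    """Return (length in bp, length in aa) for a 1-based inclusive span.

    Assumption: the span ends with a stop codon, which is not counted
    as an amino acid.
    """
    bp = end - begin + 1
    codons, remainder = divmod(bp, 3)
    if remainder:
        raise ValueError("span is not a whole number of codons")
    return bp, codons - 1


# Coordinates copied from the ORF tables in this record.
orf1 = orf_lengths(109, 1635)
orf2 = orf_lengths(1625, 2422)
print("ORF 1:", orf1)
print("ORF 2:", orf2)

# ORF 2 starts before ORF 1 ends, so the two spans overlap.
overlap_bp = 1635 - 1625 + 1
# Reading frame of each start relative to position 1 of the element.
frame1 = (109 - 1) % 3
frame2 = (1625 - 1) % 3
print("Overlap (bp):", overlap_bp, "Frames:", frame1, frame2)
```

Under these assumptions the two ORFs overlap by 11 bp and lie in different reading frames on the same strand.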
AG : IS21 helper
ORF sequence :
MRHDPAIAAIVIMLRELKMHGMAQAVNELAEQGAPAFDAAQPILSQLLKAETAEREVRSVAYQLKVARFPAYRDLTGFNFSHSEVNEALVRQLHRCEFLE
DANNVVLVGGPGTGKTHIATALGVQAIEHHHRRVRFFSTVELVNALEEEKAQGKAGQLAHRLAYADLVILDELGYLPFSASGGALLFHLLSKLYERTSVV
ITTNLSFSEWASVFGDAKMTTALLDRLTHHCHILETGNDSYRFKNSSAQQPQSKKERNTKNLSTT
Comments
The ISCsp3 transposase and helper proteins are 94% and 97% similar at the amino acid level, respectively, to those of ISAlde1.