ISPpa7
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
EU909901 | ND | Paracoccus pantotrophus | Paracoccus pantotrophus DSM 65 |
DNA section
IS Length : 2731 bp
Ends
IR Length : 26/35
IRL : GTATGCGCCGTCTCCAGCCCATTGATATCGCGGGTTTCCTGCGAGCTGCG
IRR : GTAAGCGCCGTCCCCGTCCCATTGGGATTTGGGGTGCGCTCGGGAGCGCG
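The 26/35 figure can be reproduced from the two end sequences listed above. The following is a minimal Python sketch, not part of the original record. It assumes that the figure means identical positions in an ungapped comparison over the first 35 bp, and that IRR is given as the reverse complement of the element's right end. Both are a reading of the record rather than something it states. The helper names `reverse_complement` and `count_matches` are illustrative.

```python
# Terminal 50 bp at each end, as listed in the record.
IRL = "GTATGCGCCGTCTCCAGCCCATTGATATCGCGGGTTTCCTGCGAGCTGCG"
IRR = "GTAAGCGCCGTCCCCGTCCCATTGGGATTTGGGGTGCGCTCGGGAGCGCG"
# Last 31 bp of the element, copied from the final line of the DNA sequence.
RIGHT_END = "CAAATCCCAATGGGACGGGGACGGCGCTTAC"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def count_matches(a: str, b: str, span: int) -> int:
    """Count identical positions over the first `span` bases (no gaps)."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


# Assumption: "26/35" = identical positions / compared length.
ir_matches = count_matches(IRL, IRR, 35)
# Assumption: IRR is written 5'->3' on the complementary strand.
irr_is_revcomp_of_right_end = IRR.startswith(reverse_complement(RIGHT_END))

print(f"{ir_matches}/35 identical positions")
print("IRR matches reverse complement of right end:", irr_is_revcomp_of_right_end)
```

Under these assumptions the comparison gives 26 identical positions out of 35, which agrees with the listed IR length.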
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
DNA sequence
GTATGCGCCGTCTCCAGCCCATTGATATCGCGGGTTTCCTGCGAGCTGCGTAGCAGCGAGCCTCTTTGTGGTTTTCTGCAAGGGGGCGATGATGGCGGAT
GGTGGCGACGGGTTCGTTGGTCGTGTCGAGGTGGTGCGGCGGACGCGGGGGTATCGGCGCTGGCCTGAGGCGGTGAAGGCCCGGATCGTGGCGGAGAGTT
TCCAGCCCGGGGTGCGAGTTGTGGATGTCGCGCAGCGGCATGATCTTGCGCCGCATCAGCTCTCGGACTGGCGGCGCCAGGCGCGACAGGGGCTTCTGGC
GCTGCCGGCGGATGCCATGGAGGCGGTAGGCGCGACCGTGGTGCCGGCGTTCGTGCCGGTGTCGGTAGATGACGAGGTGGATGCCTCGCCTGATGCCGCG
CGTGGTGCCGAAGGGGTCGTCGGGATCGAGATTGGCGACGGGATTATTGTCCGCGTGCCGGGCGACGTGTCGGCGGAGCGAGCGGCGGCGCTTGTGCGTG
CGCTGCGCAGGTCGGCATGATCGTCGCCGGGCAGCGGCTGCCGATCCTGGTGGCGACCCGACCGGTGGACTTCCGTTGCGGTCATCAGGCGCTGGCACTG
ATCGTTCAGAACACGCTGCGGCTCGATCCGCATTCCGGGGTGATGGTGGTCTTCCGATCGAAGCGCGGCGACAGATTGAAGATCCTGGTGTGGGACGGCA
CCGGCATGGTGCTGGTGTATAAGGTCCTGGAGCAGGGCAAGTTCGCCTGGCCGAAGGTTCAGGACGGGGTGATGCGGCTGTCACGGGCGCAATGCGAGGC
GCTGGTCGAGGGCCTGGACTGGCGGCGGGTGATGGCGCAGCGGGTGACGGTGCCGACTGCGGCGGGGTGACTCAGGTCGTCGTTCCGGGGTTGTTTTGTT
GGGCTTCCCTGCCGGGATCGGGTAGAAAGGGCCATGTCGCAGCCCCTCGATCTGAGCCGCTTTCCTGACCTCCCGCCCGAGGTGGTGGCGGCCTTTGCGG
CGCAGCACGAGGCGCTGGAGGCCGCCCGGTTCGAGGCCTCGGTCGAACGCGCGGCACGCCAGCACGAGCAAGCGGTGGTGGCCGAGAAGGAGGCGTTCAT
CGCCGAGCTCAAGGAGCTGGTGGCGACGCTGGAAGGCCAGATCCAGCAGTATCGCCGCGCGAAGTTCGGGCCGAAGTCCGAGAAGCTGGACCCCGCCCAG
CTGGACCTGGCACTGGAAGACCTCGAGACTGCCATCGCCGAGACCGAGGCGCGCATCGCCGCGGTCGAGGAGAAGATCGCCGCCAGCACGCTCGATCCCG
AGAAGAAAGCTCGGCGTAAACCGCGCAAGGCCCGGGCGCTGCCCGAAGGCCTGCCGCATGTGGAGCGGGTGGTCGAGCCCGACAGCATCGCCTGTCCTTG
TGGCTGTGGCGACATGGTCCGGATCGGCGAGGACCGCAGCAAGCGGCTGGATTGCATCCCCGCGCGGTATCAGGTGATCGTCACGGTGCGACCCAGATAC
GCCTGCCCCAAGGGCCGGGCCGGTGTGGTGCAGGCAAAAGCCCCGGCGCATCTGCTCGAAGGCAGCTGGCCTACCGAGGCGCTGCTCGCGCAGATCGCGG
TCGCCAAGCATTCTGAACACATGCCCTTGAACCGGCAGTCGCTGGTCATGGCCCGGCACGGGGTGCCCATCGACCGCTCGGTGCTGGCCGACTGGATGGG
CCGGACCGGCGCGCTGATCGCGCCCGTGGTCGAGCGCATGACGGTGCTTCTGAAGACCGGCGGCACCCGCCTCTATGTCGACGAGACCACAGCCCCGGTC
CTGGACCCAGGGCGCGGCAAGACGAAGACCGGATATCTCTGGGCGGTGCTACGCGACGATCGCGGTTGGGGCGGTCCCGCCCCGCCCGGCGTGGTGTTCC
ACTATCGCCCGGGCCGCGCCGGCGAGAACGCCGACGAGATCCTCGACGGGTTCGACGGGACGATTCAGGTCGACGCCTATGGCGGCTACACCCATCTGGC
CAAGCCCGACCGGAAGGGCGGCAAGCCGCTTGCGCTGGCGTTCTGCTGGTCGCACGGCCGCAGGAAGCTCATCGCGGCCAGACCGAAGGCCGGCTCGCCC
ATTGTCGACGAGGCGCTGGCGCGCATCGCCGCGCTCTATCGGATCGAGGCCGCCATCCGCGGCAAGGATGCCGGCCTGCGCCGCACCATCCGGCAGGAGC
AGTCCCGCCCCCTGGTCGATAAATTCTTCGCCTGGCTCGCCGCCCAGGCCGACCGGACCTCGCGCAAGTCGGATCTCGGCAAGGCCCTGCACTACATGCT
GCGCCGCCAGGACGGCTTCCGGCTGTTCCTCGACGATGGCCATGTCGACATGGACTCCAACCTGGTCGAGAACGCCATCCGCAGCCCGGCCATGAACCGC
CGCAATGCGCTCTTCGCCGGCCACGACGAGGGTGGCCGCAACTGGGCCCGCTTCGCCAGCCTGATCGGCACCTGCAAGCTGAACGACGTGGAGCCTTACG
CCTACCTGCGCGATCTCTTCACAAGCATCGCCACCGGCCACCTCGACAAGGACATAGACGCCTTGATGCCATGGGCCTACGCGGCAGCGACCAAGACCTC
ACAATGAGCACATCCGGCACTCCCGGCTGAGCTCATTGTGAGCTTCACCAACCGGCACTCGCCCCCCCCCCGCACCCAACTCGCGCTCCCGAGCGCACCC
CAAATCCCAATGGGACGGGGACGGCGCTTAC
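The first line of the DNA sequence can be used to spot-check the ORF 1 start coordinate (begin 89) given in the Protein section. The following is a minimal Python sketch, not part of the original record. It assumes 1-based coordinates and the standard codon assignments. The names `FIRST_100`, `CODON_TABLE` and `translate` are illustrative.

```python
# First 100 bp of the element, copied from the first line of the DNA sequence.
FIRST_100 = (
    "GTATGCGCCGTCTCCAGCCCATTGATATCGCGGGTTTCCTGCGAGCTGCG"
    "TAGCAGCGAGCCTCTTTGTGGTTTTCTGCAAGGGGGCGATGATGGCGGAT"
)

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate complete codons only; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


ORF1_BEGIN = 89  # 1-based start listed for ORF 1 (assumed inclusive)
orf1_start_peptide = translate(FIRST_100[ORF1_BEGIN - 1:])
print(orf1_start_peptide)
```

The four codons available from position 89 on this line translate to MMAD, the first four residues of the ORF 1 sequence listed in the Protein section.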
Protein section
ORF number : 3
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
432 bp | 143 aa | 89 | 520 | + | No |
AG : IS66 TnpA
ORF sequence :
MMADGGDGFVGRVEVVRRTRGYRRWPEAVKARIVAESFQPGVRVVDVAQRHDLAPHQLSDWRRQARQGLLALPADAMEAVGATVVPAFVPVSVDDEVDAS
PDAARGAEGVVGIEIGDGIIVRVPGDVSAERAAALVRALRRSA
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
354 bp | 117 aa | 517 | 870 | + | No |
AG : IS66 TnpB
ORF sequence :
MIVAGQRLPILVATRPVDFRCGHQALALIVQNTLRLDPHSGVMVVFRSKRGDRLKILVWDGTGMVLVYKVLEQGKFAWPKVQDGVMRLSRAQCEALVEGL
DWRRVMAQRVTVPTAAG
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1674 bp | 557 aa | 934 | 2607 | + | No |
Chemistry : DDE
ORF sequence :
MSQPLDLSRFPDLPPEVVAAFAAQHEALEAARFEASVERAARQHEQAVVAEKEAFIAELKELVATLEGQIQQYRRAKFGPKSEKLDPAQLDLALEDLETA
IAETEARIAAVEEKIAASTLDPEKKARRKPRKARALPEGLPHVERVVEPDSIACPCGCGDMVRIGEDRSKRLDCIPARYQVIVTVRPRYACPKGRAGVVQ
AKAPAHLLEGSWPTEALLAQIAVAKHSEHMPLNRQSLVMARHGVPIDRSVLADWMGRTGALIAPVVERMTVLLKTGGTRLYVDETTAPVLDPGRGKTKTG
YLWAVLRDDRGWGGPAPPGVVFHYRPGRAGENADEILDGFDGTIQVDAYGGYTHLAKPDRKGGKPLALAFCWSHGRRKLIAARPKAGSPIVDEALARIAA
LYRIEAAIRGKDAGLRRTIRQEQSRPLVDKFFAWLAAQADRTSRKSDLGKALHYMLRRQDGFRLFLDDGHVDMDSNLVENAIRSPAMNRRNALFAGHDEG
GRNWARFASLIGTCKLNDVEPYAYLRDLFTSIATGHLDKDIDALMPWAYAAATKTSQ
Blast result :
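The lengths in the three ORF tables follow from the begin and end coordinates. The following is a minimal Python sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that each span includes the stop codon. The names `ORFS` and `orf_lengths` are illustrative.

```python
# (name, begin, end, listed bp, listed aa), copied from the ORF tables above.
ORFS = [
    ("ORF 1", 89, 520, 432, 143),
    ("ORF 2", 517, 870, 354, 117),
    ("ORF 3", 934, 2607, 1674, 557),
]


def orf_lengths(begin: int, end: int) -> tuple[int, int]:
    """Return (bp, aa) for an inclusive 1-based span that includes the stop codon."""
    bp = end - begin + 1
    aa = bp // 3 - 1  # subtract the stop codon
    return bp, aa


for name, begin, end, listed_bp, listed_aa in ORFS:
    bp, aa = orf_lengths(begin, end)
    status = "ok" if (bp, aa) == (listed_bp, listed_aa) else "MISMATCH"
    print(f"{name}: {bp} bp, {aa} aa ({status})")

# Spacing between consecutive ORFs on the + strand.
overlap_orf1_orf2 = ORFS[0][2] - ORFS[1][1] + 1  # bases shared by ORF 1 and ORF 2
gap_orf2_orf3 = ORFS[2][1] - ORFS[1][2] - 1      # bases between ORF 2 and ORF 3
print("ORF 1/ORF 2 overlap:", overlap_orf1_orf2, "bp")
print("ORF 2/ORF 3 gap:", gap_orf2_orf3, "bp")
```

Under these assumptions all three listed lengths are consistent with their coordinates, ORF 1 and ORF 2 overlap by 4 bp, and 63 bp separate ORF 2 from ORF 3.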
Comments
ISPpa7 shows 76% (ORFA), 90% (ORFB) and 87% (ORFC, the transposase) amino acid similarity to ISRosp4.
References
[1] Dziewit, L., Baj, J., Szuplewska, M., Maj, A., Tabin, M., Czyzkowska, A., Skrzypczyk, G., Adamczuk, M., Sitarek, T., Stawinski, P., Tudek, A., Wanasz, K., Wardal, E., Piechucka, E., Bartosik, D. (2012) PLoS ONE 7: e32277