ISDpr4
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_AAQF00000000 | ND | delta proteobacterium | delta proteobacterium MLMS-1 |
DNA section
IS Length : 2431 bp
Ends
IR Length : 19/20
IRL : GTAAGCGCTCAGCGAACCCCTTCTTTTTTTGTCTCCCCCACCATGGCAAT
IRR : GTAAGCGCTCAGCAAACCCCAGTCACCGGGCAACCACGAGGCTGCGGTCG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCACGGTGTC | CCAGGCATTACAGC | TGCTGACCCC | 14 |
CCTTATTTGC | CAAAATTCAG | 0 | |
GAGGAGCAAG | CCACCAAAGG | 0 | |
TTCGACAAAC | TTTGGTGGGG | 0 |
DNA sequence
GTAAGCGCTCAGCGAACCCCTTCTTTTTTTGTCTCCCCCACCATGGCAATATCTCCCCCGGTTGGCTGGCCGGGATATCCTCGGCCGCATTTTATCCGAG
GAGGAGATCATGGACAAGAACGGCAGGAGGAGCAAGCGTCGTTTCTGGGCCGCCCATGTGGCGGCCTGGGAAAAGAGCGGCCTGACCCAGACCGCGTATT
GCCGGGAGCACGGCCTGAGCCGGCACGCCTTTGGCTGGTGGCGGCGCAAATTTAGAGACCAGCCGGCAAGCGAGCAACCGCTGCTGGTACAGGTACCAAC
CGTAACCCGCCCGACAGCGGCCACCGCGGCTGCTGATTTTTCCGGCTTGCGGCTGTTGCTGCCCCGGGGGCTGCAGCTGGAAATCAACCAGGGGTTCGAT
GCCGCCACCCTGGGGCGGGTGATCGCCACCTTGGAGGCCGGCCAATGATCCCCGCCGAATTCGCTATCCAGGTGTATCTGGCCTTGGGCAGCACCGATAT
GCGTAAGAGTATCGACGGCTTGGCGCTGCTGGTCTCGGAGCAGCTGGGTTTGAACCCACTCAGTGGCCAGCTTTTCGCTTTCAGTAACCGCCGCCGGACC
ATGGTCAAGATCCTTTACTGGGATCGCAACGGTTTTTGTCTGTGGCAGAAGCGCCTGGAAAAGGAACGATTCCGCTGGCCCTGTAGCCGGCAGGAGGTTT
TGACCATCGGTCGCCGGGAATTGAGCTGGTTGCTCGACGGCCTTGAATTGGAGCAGCAACGGGCCCATAAAAAACTGGAATACAGCCGGATTTTTTGATG
CTAGAGATAGATATTCTCTTGACATTTCAGGCAGATGGATTACTATGACGGCATGCCTTCGATCCCCGCCGACATACCCGAAAATATCCGGGCCTATATC
CAGCAGCTGATCGCTGCCCATGAACAGGAGCGTGAACTCTTGTTGGAGCAGATCAAGCTGCTGCGGGCCCAGCTGTTCGGGCGTAAAAGCGAGCAGGTGG
CGGATATCTCGTCGCCGCAGCTGCCGCTTTTTGATGAGCAGGCGGCTGCCGGCGACCCAGCTGAAGAGGCGTACGAGCCTGAAGTTCAGGTGGACTCCCA
CAGCCGCCGCCGGAAGGGGCGCAAGCCACTGCCGGAAGATCTGCCCCGGGTGGAGGTGGTCCATGATGTTGACGCCGAGGCCAAAACCTGCGCTTGCGGT
TGCGAAAAGAGCCGGATCGGCGAGGATGTCTCCGAGCAGTTGGATATGATCCCGGCCAAAATGCGGGTAATCCGCCATGTCCGCCCCAAGTACGCCTGTC
GGCAGTGCGAAGGGGTGGAAGATGAGGGGCCGACCGTGGTCATTGCCCCGCCCCCGGAGCAGATGATCGCCAAGAGTATCGCCAGCCCCGGCTTGCTGGC
CCAGGTGTTGACCGCCAAGTTTGCCGATGCCCTGCCTTTTTATCGCCAGGAAGGGCAGTTTACCCGACTGGGAGTGGAAATCGGCCGGGGTACCATGTGC
GGTTGGGCCATGCAGGTGGCCGAGCGCTGCGGGCCGCTGCTTAGCTTGATGCAGGAGGAGATTCGTTCCGGGCCGCTGATCAATGTCGACGAGACCACCG
TCCAGGTGCTGGCCGAAATCGGCCGGGCCGCCAAAACCAAGTCCTACATGTGGGTCTTTCGCGGTGGTGCGCCCGAACGTCCGGTTTTGGTGTTCCAGTA
TCATCCGACCAGGAGTGGCGATCCGGCCGCAGTGTTCCTCCGTGGTTACCGGGGCTGTGTGCAGACCGACGGCTATACCGGCTATGACTTCCTTGATCAT
CAGCCGGGGGTGATTCATGCCGGCTGCTGGGCCCATGCCCGGCGCAAGTTTAACGATGTCCTCAAGGCGGCCGGTAAGTCTGCCAAGCGGGACGGTATTG
CCGGCGAGGCCCTGGACTGGATCGGCAAATTGTACAAAATCGAGGCGACCATGCGGAAGGCCGAGCTTGCTCCGGAGGCGATCCACCAGCGGCGCCAGGA
GCAGGTGGTGCCCTTGCTGGCTGATTTCCGGCAATGGCTGCTGGCCCGCCAGCGGGAGGTCCCGCCCAAGAGCTTGCTGGGCAAGGCCATTGCCTATACC
CTGAGCCAATGGAAAAGATTGCGACGTTACGTTGAGCATGGCCTCCTGCGACTGGATAACAACTTGGCGGAAAACGCCATCCGCCCCTTTGTCGTAGGTC
GGAAAAACTGGCTCTTCTCCGGCACCGCCCAAGGAGCCAAGGCCAGTGCCGCAATTTACAGCATTATCGAAACGGCCAAAGCCAACGGCCTGGAGCCTTA
TTGGTACCTGCGCGCTTTGTTCGAACGACTGCCGGCGGCCAAAACCCCGGAACAGCTCAAGGCGCTACTGCCCCAGTACATCGACCGCAGCCTCGTGGTT
GCCCGGTGACTGGGGTTTGCTGAGCGCTTAC
GAGGAGATCATGGACAAGAACGGCAGGAGGAGCAAGCGTCGTTTCTGGGCCGCCCATGTGGCGGCCTGGGAAAAGAGCGGCCTGACCCAGACCGCGTATT
GCCGGGAGCACGGCCTGAGCCGGCACGCCTTTGGCTGGTGGCGGCGCAAATTTAGAGACCAGCCGGCAAGCGAGCAACCGCTGCTGGTACAGGTACCAAC
CGTAACCCGCCCGACAGCGGCCACCGCGGCTGCTGATTTTTCCGGCTTGCGGCTGTTGCTGCCCCGGGGGCTGCAGCTGGAAATCAACCAGGGGTTCGAT
GCCGCCACCCTGGGGCGGGTGATCGCCACCTTGGAGGCCGGCCAATGATCCCCGCCGAATTCGCTATCCAGGTGTATCTGGCCTTGGGCAGCACCGATAT
GCGTAAGAGTATCGACGGCTTGGCGCTGCTGGTCTCGGAGCAGCTGGGTTTGAACCCACTCAGTGGCCAGCTTTTCGCTTTCAGTAACCGCCGCCGGACC
ATGGTCAAGATCCTTTACTGGGATCGCAACGGTTTTTGTCTGTGGCAGAAGCGCCTGGAAAAGGAACGATTCCGCTGGCCCTGTAGCCGGCAGGAGGTTT
TGACCATCGGTCGCCGGGAATTGAGCTGGTTGCTCGACGGCCTTGAATTGGAGCAGCAACGGGCCCATAAAAAACTGGAATACAGCCGGATTTTTTGATG
CTAGAGATAGATATTCTCTTGACATTTCAGGCAGATGGATTACTATGACGGCATGCCTTCGATCCCCGCCGACATACCCGAAAATATCCGGGCCTATATC
CAGCAGCTGATCGCTGCCCATGAACAGGAGCGTGAACTCTTGTTGGAGCAGATCAAGCTGCTGCGGGCCCAGCTGTTCGGGCGTAAAAGCGAGCAGGTGG
CGGATATCTCGTCGCCGCAGCTGCCGCTTTTTGATGAGCAGGCGGCTGCCGGCGACCCAGCTGAAGAGGCGTACGAGCCTGAAGTTCAGGTGGACTCCCA
CAGCCGCCGCCGGAAGGGGCGCAAGCCACTGCCGGAAGATCTGCCCCGGGTGGAGGTGGTCCATGATGTTGACGCCGAGGCCAAAACCTGCGCTTGCGGT
TGCGAAAAGAGCCGGATCGGCGAGGATGTCTCCGAGCAGTTGGATATGATCCCGGCCAAAATGCGGGTAATCCGCCATGTCCGCCCCAAGTACGCCTGTC
GGCAGTGCGAAGGGGTGGAAGATGAGGGGCCGACCGTGGTCATTGCCCCGCCCCCGGAGCAGATGATCGCCAAGAGTATCGCCAGCCCCGGCTTGCTGGC
CCAGGTGTTGACCGCCAAGTTTGCCGATGCCCTGCCTTTTTATCGCCAGGAAGGGCAGTTTACCCGACTGGGAGTGGAAATCGGCCGGGGTACCATGTGC
GGTTGGGCCATGCAGGTGGCCGAGCGCTGCGGGCCGCTGCTTAGCTTGATGCAGGAGGAGATTCGTTCCGGGCCGCTGATCAATGTCGACGAGACCACCG
TCCAGGTGCTGGCCGAAATCGGCCGGGCCGCCAAAACCAAGTCCTACATGTGGGTCTTTCGCGGTGGTGCGCCCGAACGTCCGGTTTTGGTGTTCCAGTA
TCATCCGACCAGGAGTGGCGATCCGGCCGCAGTGTTCCTCCGTGGTTACCGGGGCTGTGTGCAGACCGACGGCTATACCGGCTATGACTTCCTTGATCAT
CAGCCGGGGGTGATTCATGCCGGCTGCTGGGCCCATGCCCGGCGCAAGTTTAACGATGTCCTCAAGGCGGCCGGTAAGTCTGCCAAGCGGGACGGTATTG
CCGGCGAGGCCCTGGACTGGATCGGCAAATTGTACAAAATCGAGGCGACCATGCGGAAGGCCGAGCTTGCTCCGGAGGCGATCCACCAGCGGCGCCAGGA
GCAGGTGGTGCCCTTGCTGGCTGATTTCCGGCAATGGCTGCTGGCCCGCCAGCGGGAGGTCCCGCCCAAGAGCTTGCTGGGCAAGGCCATTGCCTATACC
CTGAGCCAATGGAAAAGATTGCGACGTTACGTTGAGCATGGCCTCCTGCGACTGGATAACAACTTGGCGGAAAACGCCATCCGCCCCTTTGTCGTAGGTC
GGAAAAACTGGCTCTTCTCCGGCACCGCCCAAGGAGCCAAGGCCAGTGCCGCAATTTACAGCATTATCGAAACGGCCAAAGCCAACGGCCTGGAGCCTTA
TTGGTACCTGCGCGCTTTGTTCGAACGACTGCCGGCGGCCAAAACCCCGGAACAGCTCAAGGCGCTACTGCCCCAGTACATCGACCGCAGCCTCGTGGTT
GCCCGGTGACTGGGGTTTGCTGAGCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
339 bp | 112 aa | 110 | 448 | + | No |
AG : IS66 TnpA
ORF sequence :
MDKNGRRSKRRFWAAHVAAWEKSGLTQTAYCREHGLSRHAFGWWRRKFRDQPASEQPLLVQVPTVTRPTAATAAADFSGLRLLLPRGLQLEINQGFDAAT
LGRVIATLEAGQ
LGRVIATLEAGQ
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
354 bp | 117 aa | 445 | 798 | + | No |
AG : IS66 TnpB
ORF sequence :
MIPAEFAIQVYLALGSTDMRKSIDGLALLVSEQLGLNPLSGQLFAFSNRRRTMVKILYWDRNGFCLWQKRLEKERFRWPCSRQEVLTIGRRELSWLLDGL
ELEQQRAHKKLEYSRIF
ELEQQRAHKKLEYSRIF
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1575 bp | 524 aa | 835 | 2409 | + | No |
Chemistry : DDE
ORF sequence :
MDYYDGMPSIPADIPENIRAYIQQLIAAHEQERELLLEQIKLLRAQLFGRKSEQVADISSPQLPLFDEQAAAGDPAEEAYEPEVQVDSHSRRRKGRKPLP
EDLPRVEVVHDVDAEAKTCACGCEKSRIGEDVSEQLDMIPAKMRVIRHVRPKYACRQCEGVEDEGPTVVIAPPPEQMIAKSIASPGLLAQVLTAKFADAL
PFYRQEGQFTRLGVEIGRGTMCGWAMQVAERCGPLLSLMQEEIRSGPLINVDETTVQVLAEIGRAAKTKSYMWVFRGGAPERPVLVFQYHPTRSGDPAAV
FLRGYRGCVQTDGYTGYDFLDHQPGVIHAGCWAHARRKFNDVLKAAGKSAKRDGIAGEALDWIGKLYKIEATMRKAELAPEAIHQRRQEQVVPLLADFRQ
WLLARQREVPPKSLLGKAIAYTLSQWKRLRRYVEHGLLRLDNNLAENAIRPFVVGRKNWLFSGTAQGAKASAAIYSIIETAKANGLEPYWYLRALFERLP
AAKTPEQLKALLPQYIDRSLVVAR
EDLPRVEVVHDVDAEAKTCACGCEKSRIGEDVSEQLDMIPAKMRVIRHVRPKYACRQCEGVEDEGPTVVIAPPPEQMIAKSIASPGLLAQVLTAKFADAL
PFYRQEGQFTRLGVEIGRGTMCGWAMQVAERCGPLLSLMQEEIRSGPLINVDETTVQVLAEIGRAAKTKSYMWVFRGGAPERPVLVFQYHPTRSGDPAAV
FLRGYRGCVQTDGYTGYDFLDHQPGVIHAGCWAHARRKFNDVLKAAGKSAKRDGIAGEALDWIGKLYKIEATMRKAELAPEAIHQRRQEQVVPLLADFRQ
WLLARQREVPPKSLLGKAIAYTLSQWKRLRRYVEHGLLRLDNNLAENAIRPFVVGRKNWLFSGTAQGAKASAAIYSIIETAKANGLEPYWYLRALFERLP
AAKTPEQLKALLPQYIDRSLVVAR
Blast result :
Comments
ISDpr4 (orf1) is 44% aa similar to ISAeme5 (orf1),
ISDpr4 (orf2) is 65% aa similar to ISAeh1 (orf1),
ISDpr4 (orf3) is 63% aa similar to ISPpu13 (orf2).
ISDpr4 (orf2) is 65% aa similar to ISAeh1 (orf1),
ISDpr4 (orf3) is 63% aa similar to ISPpu13 (orf2).
References
1] ISfinder annotation (2008).
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Bruce,D., Pitluck,S. and Richardson,P. (2006) Direct Submission GenBank.
2] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Bruce,D., Pitluck,S. and Richardson,P. (2006) Direct Submission GenBank.