ISCausp4
- Family IS1595
- Group ISNha5
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AKKF01000138 | ND | Caulobacter sp. | Caulobacter sp. AP07 |
DNA section
IS Length : 3070 bp
Ends
IR Length : 24
IRL : CGGTATTATGTAGCAAACACACCTGCTGCGATGCCGGGTCTCTAGAGGAG
IRR : CGGTATTATGTAGCAAACACACCTAGCCCAGCATCGTTCAGAAGTGAATG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CACCAGCAAC | CACTTCCG | CGCCGAAGCC | 8 |
DNA sequence
CGGTATTATGTAGCAAACACACCTGCTGCGATGCCGGGTCTCTAGAGGAGACCTGAGTTGACGAAACAGCCTCACGGTTGCGCTTCTGCGGGACTCCCGG
TCATTGTCTCGGGATGGGTGCGCCAGCCTCGGCATACGAGAAAATCACGGCCATCACGACGGGCGCGTTGGCGTTCGCGGTTAGCGGCGTGCTGACTGTT
CTCGATCACATGGGAGTCGAAGTCCCCAATTGGGCTTGGATGGCCTACGGCTGGGGATGCGGGTTCCTGGCGCTCGTGTGTGTCTCGCTGTCGATTCACA
TGGTCTTGGAGGGGGCGTTCAGCCGCAAGCGCCCCAAGGCTCCACCGTTGGTCCTAGAGCCCGTCACGCCGCCCGATCCCCAGGCGGCGCTTCTTGAGGT
GCAAGAGGCGATTGCGAATGAGCTATGGCTTCAGAGAGCGCATCGCGAGGGGGCTGACCGCCTGACGCGGGCAAGCTGGATGGTGTTCGGAAGACCCCCT
CACGAAATCAATCCAGCCGAAGAGTCGTCACCCCGATTGCCTGCAAAGCGGCGAGGAAGAACCAAGCGGGGAAGCCGCCCCGATTGATCTTGGCTTGAAC
TCCCGTCTCGGTCTCGGGAAGCCCCATCTCAGACAGCTTACGCGCCAGGTCGGCATTCTTCATGCCTGCCCGGACCATCTCGGCTTTCAGATGGCGCTTG
GCCTTGTCGGCCCATTCGTCGCTAGTCGCGGCCATCCGATTTTCTCCACTTGTGAATGCCAATCAAACATATCTGTCGCTTTTCCCATTGCAAGCGTCGA
TGGCATTTCCTATCTATCCGTTGTCGAACCATACGGATATGTCGGTTGTCACAACACTTCCTCCTCTCTGCCGCCGCCCGGACCCTGTCGCTCGGGAAGG
TGATGCGCATGTCGGAAGACGAGGCGTTCGAGACGTTCAAGACCATCCGTTGGGACGAGAACGAGGGTCAGCCCTATTGCGGCAAGTGCGGTTGCACGGC
CGTCTACACGTTCAAGGCTCGTCGCATCTTCAAGTGCAAGAGCTGCGAGGCTCAGTTCTCGGTGACGAGCGGGACCATCTTCGCCAGCCGCAAGCTCGCC
GTGCGCGACATTCTCGCGGCTATCGCGATCTTCGCCAACGGCGCCAAGGGCTATAGCGCCCTCCAATTGAGCCGGGATCTCGACGTCCAGTACAAGACGG
CGTTCGTGATGGCGCACAAGCTGCGCGAAGCCCTGGGCAAGGTGGCGGACCGGACCAAGCTCGATGGCGTGGTCGAGATCGATGGCGCCTACTTCGGCGG
CTACGTGAAGCCCGCGAACGAGAAGTGGGCTCGTCAGGACCGGCGCCTCCTCGAAAACCAGACGGGTAAGCGCCAGGTTGTCGTGGTGTTGCGCGAGCGT
GGCGGCCGGACGTTGCCGTTCGTCACCAAGGCTGAGCGTGAGGGTGTCCCCCTGGTGACCGCCAACGTGTCCCCGGGGACAACCGTCCACGCGGACGAGG
CTCCGCATTGGGACGTCCTGTCGGCCAAGTTCGCGACCAAGCGGATCAACCACTCGGAGGCCTATAGCCTCGATGGCGCCTGCACCAACTGGGCCGAGAG
CTTCTTCAGCCGCATCCGCCGGGCCGAAGCCGGCGTCCATCACAAGATCGCCGGCCGCTATCTGGAAGCCTATGCGGGCGAAATGGCGTGGCGCGAGGAT
CACCGCCGCATGAGCAATGGTGTCCAGTTCGCCCAGATCGTCGATGCGGCCATGTCGGCCCCCGTCTCGCGCCAGTGGAAGGGCTATTGGCAACGGCCCG
CTTAGGGCGGATAAAACACATGCCTCGTCCGAAGTTTGGTCACGGAAGACGAGGTGCGCTGTTGCGCGTGAACGCGGGCCCGCTCTGTCGGGCGGCCTGC
TCAATAAAATACCTTCGGGGAATAAGCAGGACGCGCTTCCCCGTGTTCGCGCGTGACCAACCGTCCCGGCACTCCGCCGGGCTTGGGAGCACGGGGGACT
ATGAGGAATCACGTCTGTATGGGGATGGCGCTTGCCATCGCCTTGAGCGCTGTCACGTCGATAGCTCACGCTCAGACCGCCACGACGCCGGGTGTGACGC
CCGAGACGGGAAAGTGGCGCTACACCGAAACGGCCTCCGCCCTGGACGGAGCTAAGAGCCGGATCGCCACCCTGACGGCGGAGACGCAAGTCGCGAACAT
CTTGGGCCGCATGGAGGCTCCGACCTTGGGCCTCACCTGCGACAAGAACGGCCTGGGCGTGGTCATGTCGTGGCCCGACTTCGTGGGCGAGGCCGGATTC
CTGACGCTTCCGGTCAAGTGGAAGATCGATGACGGCAAGGTCTACAAGACCGGGTGGTTCCCGGGGACCACGAGCGTCACTCTCATGGGCCCAGGAGCCC
AGGGATGGATAAGGCAGGTCAAGGACGCCAAGACCCTCGTGGTCGCCGTCCCTGATCGCCACGGAGGCCAGGAGGCCACCTTCGACCTCACGGGGATCGA
GGCCATCGGCCCGCGCTTCTCCGAGGTGTCCTGTGGCGGCGGAGCTGGCGCGCGCTAGACCGCCAGCCGCCAGACGAATTCACACCGCTTGGTGTCGTTC
TCCGAGATCGCAAGGCCTGCGTCGCGCATCTGGCGTAGCGCCTTGCTGATCCGCCGGACGACGTCGAGCATCATCCGGCGGTCCCGGCCGTCCTTGCCCT
CATAGTTGATGATCGTCAGGGCGAGTTGCCGGGCGGTCTGCGGCCCCTTCTCGCGAAGCTGGCCAATGAGGAACTGGCGGAGCTCGCCCCGGTAGAACAG
GACGATGCGCGGCGCCCTGACGGTCAGCCGGACCTCGCCGTCATACCCCAGGGTCTCTAGGACCCGGTCGACCGCCTCGACGTCGTTCGAGAGAACCGCC
AGCCTCTCCCGAGTGATGGCGATCTCCTCCATCATCTCCTCCCGCTTCTTGAGCAAGCCGGAGATGGCGTGAGCGAAAGTCTCGGTTCTGGCGGGTTTCA
TGCCCCAGAACGTACAAAATCATTCACTTCTGAACGATGCTGGGCTAGGTGTGTTTGCTACATAATACCG
TCATTGTCTCGGGATGGGTGCGCCAGCCTCGGCATACGAGAAAATCACGGCCATCACGACGGGCGCGTTGGCGTTCGCGGTTAGCGGCGTGCTGACTGTT
CTCGATCACATGGGAGTCGAAGTCCCCAATTGGGCTTGGATGGCCTACGGCTGGGGATGCGGGTTCCTGGCGCTCGTGTGTGTCTCGCTGTCGATTCACA
TGGTCTTGGAGGGGGCGTTCAGCCGCAAGCGCCCCAAGGCTCCACCGTTGGTCCTAGAGCCCGTCACGCCGCCCGATCCCCAGGCGGCGCTTCTTGAGGT
GCAAGAGGCGATTGCGAATGAGCTATGGCTTCAGAGAGCGCATCGCGAGGGGGCTGACCGCCTGACGCGGGCAAGCTGGATGGTGTTCGGAAGACCCCCT
CACGAAATCAATCCAGCCGAAGAGTCGTCACCCCGATTGCCTGCAAAGCGGCGAGGAAGAACCAAGCGGGGAAGCCGCCCCGATTGATCTTGGCTTGAAC
TCCCGTCTCGGTCTCGGGAAGCCCCATCTCAGACAGCTTACGCGCCAGGTCGGCATTCTTCATGCCTGCCCGGACCATCTCGGCTTTCAGATGGCGCTTG
GCCTTGTCGGCCCATTCGTCGCTAGTCGCGGCCATCCGATTTTCTCCACTTGTGAATGCCAATCAAACATATCTGTCGCTTTTCCCATTGCAAGCGTCGA
TGGCATTTCCTATCTATCCGTTGTCGAACCATACGGATATGTCGGTTGTCACAACACTTCCTCCTCTCTGCCGCCGCCCGGACCCTGTCGCTCGGGAAGG
TGATGCGCATGTCGGAAGACGAGGCGTTCGAGACGTTCAAGACCATCCGTTGGGACGAGAACGAGGGTCAGCCCTATTGCGGCAAGTGCGGTTGCACGGC
CGTCTACACGTTCAAGGCTCGTCGCATCTTCAAGTGCAAGAGCTGCGAGGCTCAGTTCTCGGTGACGAGCGGGACCATCTTCGCCAGCCGCAAGCTCGCC
GTGCGCGACATTCTCGCGGCTATCGCGATCTTCGCCAACGGCGCCAAGGGCTATAGCGCCCTCCAATTGAGCCGGGATCTCGACGTCCAGTACAAGACGG
CGTTCGTGATGGCGCACAAGCTGCGCGAAGCCCTGGGCAAGGTGGCGGACCGGACCAAGCTCGATGGCGTGGTCGAGATCGATGGCGCCTACTTCGGCGG
CTACGTGAAGCCCGCGAACGAGAAGTGGGCTCGTCAGGACCGGCGCCTCCTCGAAAACCAGACGGGTAAGCGCCAGGTTGTCGTGGTGTTGCGCGAGCGT
GGCGGCCGGACGTTGCCGTTCGTCACCAAGGCTGAGCGTGAGGGTGTCCCCCTGGTGACCGCCAACGTGTCCCCGGGGACAACCGTCCACGCGGACGAGG
CTCCGCATTGGGACGTCCTGTCGGCCAAGTTCGCGACCAAGCGGATCAACCACTCGGAGGCCTATAGCCTCGATGGCGCCTGCACCAACTGGGCCGAGAG
CTTCTTCAGCCGCATCCGCCGGGCCGAAGCCGGCGTCCATCACAAGATCGCCGGCCGCTATCTGGAAGCCTATGCGGGCGAAATGGCGTGGCGCGAGGAT
CACCGCCGCATGAGCAATGGTGTCCAGTTCGCCCAGATCGTCGATGCGGCCATGTCGGCCCCCGTCTCGCGCCAGTGGAAGGGCTATTGGCAACGGCCCG
CTTAGGGCGGATAAAACACATGCCTCGTCCGAAGTTTGGTCACGGAAGACGAGGTGCGCTGTTGCGCGTGAACGCGGGCCCGCTCTGTCGGGCGGCCTGC
TCAATAAAATACCTTCGGGGAATAAGCAGGACGCGCTTCCCCGTGTTCGCGCGTGACCAACCGTCCCGGCACTCCGCCGGGCTTGGGAGCACGGGGGACT
ATGAGGAATCACGTCTGTATGGGGATGGCGCTTGCCATCGCCTTGAGCGCTGTCACGTCGATAGCTCACGCTCAGACCGCCACGACGCCGGGTGTGACGC
CCGAGACGGGAAAGTGGCGCTACACCGAAACGGCCTCCGCCCTGGACGGAGCTAAGAGCCGGATCGCCACCCTGACGGCGGAGACGCAAGTCGCGAACAT
CTTGGGCCGCATGGAGGCTCCGACCTTGGGCCTCACCTGCGACAAGAACGGCCTGGGCGTGGTCATGTCGTGGCCCGACTTCGTGGGCGAGGCCGGATTC
CTGACGCTTCCGGTCAAGTGGAAGATCGATGACGGCAAGGTCTACAAGACCGGGTGGTTCCCGGGGACCACGAGCGTCACTCTCATGGGCCCAGGAGCCC
AGGGATGGATAAGGCAGGTCAAGGACGCCAAGACCCTCGTGGTCGCCGTCCCTGATCGCCACGGAGGCCAGGAGGCCACCTTCGACCTCACGGGGATCGA
GGCCATCGGCCCGCGCTTCTCCGAGGTGTCCTGTGGCGGCGGAGCTGGCGCGCGCTAGACCGCCAGCCGCCAGACGAATTCACACCGCTTGGTGTCGTTC
TCCGAGATCGCAAGGCCTGCGTCGCGCATCTGGCGTAGCGCCTTGCTGATCCGCCGGACGACGTCGAGCATCATCCGGCGGTCCCGGCCGTCCTTGCCCT
CATAGTTGATGATCGTCAGGGCGAGTTGCCGGGCGGTCTGCGGCCCCTTCTCGCGAAGCTGGCCAATGAGGAACTGGCGGAGCTCGCCCCGGTAGAACAG
GACGATGCGCGGCGCCCTGACGGTCAGCCGGACCTCGCCGTCATACCCCAGGGTCTCTAGGACCCGGTCGACCGCCTCGACGTCGTTCGAGAGAACCGCC
AGCCTCTCCCGAGTGATGGCGATCTCCTCCATCATCTCCTCCCGCTTCTTGAGCAAGCCGGAGATGGCGTGAGCGAAAGTCTCGGTTCTGGCGGGTTTCA
TGCCCCAGAACGTACAAAATCATTCACTTCTGAACGATGCTGGGCTAGGTGTGTTTGCTACATAATACCG
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
228 bp | 75 aa | 735 | 508 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MAATSDEWADKAKRHLKAEMVRAGMKNADLARKLSEMGLPETETGVQAKINRGGFPAWFFLAALQAIGVTTLRLD
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
903 bp | 300 aa | 903 | 1805 | + | No |
Chemistry : DDE
ORF sequence :
MRMSEDEAFETFKTIRWDENEGQPYCGKCGCTAVYTFKARRIFKCKSCEAQFSVTSGTIFASRKLAVRDILAAIAIFANGAKGYSALQLSRDLDVQYKTA
FVMAHKLREALGKVADRTKLDGVVEIDGAYFGGYVKPANEKWARQDRRLLENQTGKRQVVVVLRERGGRTLPFVTKAEREGVPLVTANVSPGTTVHADEA
PHWDVLSAKFATKRINHSEAYSLDGACTNWAESFFSRIRRAEAGVHHKIAGRYLEAYAGEMAWREDHRRMSNGVQFAQIVDAAMSAPVSRQWKGYWQRPA
FVMAHKLREALGKVADRTKLDGVVEIDGAYFGGYVKPANEKWARQDRRLLENQTGKRQVVVVLRERGGRTLPFVTKAEREGVPLVTANVSPGTTVHADEA
PHWDVLSAKFATKRINHSEAYSLDGACTNWAESFFSRIRRAEAGVHHKIAGRYLEAYAGEMAWREDHRRMSNGVQFAQIVDAAMSAPVSRQWKGYWQRPA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
540 bp | 179 aa | 2019 | 2558 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MGMALAIALSAVTSIAHAQTATTPGVTPETGKWRYTETASALDGAKSRIATLTAETQVANILGRMEAPTLGLTCDKNGLGVVMSWPDFVGEAGFLTLPVK
WKIDDGKVYKTGWFPGTTSVTLMGPGAQGWIRQVKDAKTLVVAVPDRHGGQEATFDLTGIEAIGPRFSEVSCGGGAGAR
WKIDDGKVYKTGWFPGTTSVTLMGPGAQGWIRQVKDAKTLVVAVPDRHGGQEATFDLTGIEAIGPRFSEVSCGGGAGAR
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
447 bp | 148 aa | 3001 | 2555 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MKPARTETFAHAISGLLKKREEMMEEIAITRERLAVLSNDVEAVDRVLETLGYDGEVRLTVRAPRIVLFYRGELRQFLIGQLREKGPQTARQLALTIINY
EGKDGRDRRMMLDVVRRISKALRQMRDAGLAISENDTKRCEFVWRLAV
EGKDGRDRRMMLDVVRRISKALRQMRDAGLAISENDTKRCEFVWRLAV
Blast result :
Comments
ISCausp3 is 79% (transposase) aa similar to ISNwi4.
References
1] ISfinder annotation (2017)
2] Brown,S.D., Utturkar,S.M., Klingeman,D.M., Johnson,C.M., Martin,S.L., Land,M.L., Lu,T.Y., Schadt,C.W., Doktycz,M.J. and Pelletier,D.A. (2012) J. Bacteriol. 194 (21), 5991-5993 .
2] Brown,S.D., Utturkar,S.M., Klingeman,D.M., Johnson,C.M., Martin,S.L., Land,M.L., Lu,T.Y., Schadt,C.W., Doktycz,M.J. and Pelletier,D.A. (2012) J. Bacteriol. 194 (21), 5991-5993 .