ISTico1
- Family IS1595
- Group ISNha5
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
FWZX01000001 | ND | Tistlia consotensis | Tistlia consotensis USBA 355 |
DNA section
IS Length : 4119 bp
Ends
IR Length : 21/23
IRL : CGGCATTATGCAGCAAGCACACCAAGGCTGCTGCACTGCCTCGCCGAGCC
IRR : CGGCATTATGCAGCAAACGCACCTTCCGCCCGATAGACCCTAAGTGGTTG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGGCGGGGGTTTT | GCTTTTGG | GCAGCGCCATATC | 8 |
DNA sequence
CGGCATTATGCAGCAAGCACACCAAGGCTGCTGCACTGCCTCGCCGAGCCTCTGGCCCAACCCCGCAAGCTAAGGAGGGAACGATGAGCTAGGGGCCGGG
GCGATGTTCGCCAAGGCGGCTGGGGCCGCGCTTGCGTTAATTGGACTTGTTGCCCTTTTCGCAGGGAACCAATTCGTTCTCGCCAGTCCGATCAATGTGG
AACGGGAGCCAATAACGGAAACGTGTAGTTCTCCGCCCTTTGAAAGCATCTTGATATTTGATTTCGCCAAAAACATACAGGTTATATGTTTTACTATCTG
CGAATCCCAGGAACATAGAGAGCAATTGACTCTTTATAGGAATTGTATGATAGATCTCTTGATTAAGAAAGATATCAATAACATTTTTAACACGCTCCCT
ATCAACCTCGAAAGTCTCCGATCCTACTTTGAAGACCGTGCTAGTTGCAAAATATTTTACTTTGTATGCCGGCGTAGATCCGTAGTTTTTAATGGATACA
ATAATATTAGCAGATTTCCCTCCATCATGGAGGGCCAAATTTACTTTCGAAAAGCCAACATAGGCGCGGATCTCTTTCTGGGTGTTCTTCCGCGCCTCTT
TAATGGAAACAACTGCAGCATTGGCAGTTCTTTTTGTCTCGTGCAATGTGTCTCGCACAAACAATACACCGACGAAGGTAAGAACTATCGTTATAAGCCC
GACGCTTCCCGCAACGATGACGGCGCCATACGTCCAATCAGCCATCGATTTCTGGGCATACAGATCCTCTTCGGCGCGCTGGGCCTCGCGATTGGCCTCT
ACCTTCAGTTTAAGGCAATCGAGAGCTTTCTGAGGTTCGCTGAGCCGCAGGCAGTCGCGTTGAAGTTCATACTGAGTACGCTGTTGTGTATTGGCAGCGG
CATAATCTTCGTACGCGCGGTTGTATTGCTGACTGGCCCCTGACCACCAAATGAACGGTGCTATGAAAGCAAGATAGACGACGAGAATCAGCCAATGCGC
ATTTCGCGAGTACCAATCGCTTCTAGGCATTGGAGCAGAAAGGCAGCGGTGAATTTTCCACGGCTGACCTTGTTCCTCAGGTTGCGTTCATCCTCGTGAA
CGCCAATTGATTCCAGCTTCTGCTGCAATTGGGAATAGGTAATCCCCTGACGCGCCATCTCGCTGCGGATGAGGCCGCGCGCCTTGGCTTCCCAATCGGT
CCTTTCTGGCATGGTCGCTGCTCCTATGATGCGATTTGTCATCATATAGCGTGCTTCTGCCCTTGAAAAGGTGATAGATCGTCATCATCTCTGATGCGTA
GGCACTGGAGACGATGACAATGACCCAACACTTCCTTCTCTCAGCGAAAGCCCGGACCCTCAGCGTGGTCCAGGTGGCTCGCATGTCCGAGGAAGAGGCT
CGGGCCGCATTCCGGGCGATCCGCTGGGCCACGACCGATGGCGACCCCGTCTGCCCGCGCTGCGGCTGCTGTGCTGTCTACGAGTACGCCAGCCGCCCGA
TCTTCAAGTGCAAGGCTTGCGGCCATCAGTTCAGCGTCACCAGCGGGACGATCTTCGCCAGCCGCAAGCTGCCGGTGCGCGACTACCTCCTGGCCATCGC
GATCTTCGTGAACGCCGCCAAGGGCATGTCCGCGCTGCAGCTCGGCCGCGACCTCGATGTGCAGTACCGCACGGCCTACGTCCTGGCGCACAAGCTGCGC
GAGGCGATGGCTGCCGAGAACGAGACCATGACCCTGAGCGGCACGGTCGAGGTTGATGGCGCCTACTTCGGCGGCTACGTCCGGCCGGAGAACCGCAAGG
AGGATCGCGTCGATCTCCGCCTCAAGGCCAACCAATCCGGCAAGCGCCGCGTCGTGGTCGTCATGCGCCAGCGTGGCGGCCGGACGCTGCCGTTCGTCTT
CCGCAGCGAGGGCGAGTCCCTGCCGTCGATCCTGGCCCGCGTCGAGCCGGGCAGCACGATCCACGCGGACGAAGCCCCGAGCTGGGATGAGCTGCACACC
CGGTACGAGACCCGCCGCATCAACCACAGCATTGCCTTCTCGGATGACGGCGCCTGCACCAATCAGGCGGAAAGCTACTTCTCCCGTCTGCGCCGCGCCG
AGTGGGGCCAGCACCACCACATCAGCGGCCGGTATCTCGGCTTCTATGCTGGCGGGGCCGCGTGGCGCGAGGACACGCGCCGGAAGCCCAATGGGGCGCT
CTACGGGCTGCTGATTGCCGCTGCCGCCGCCCATCCGGTTTCGCGGCAGTGGAAGGGGTACTGGCAACGGGCCCGCTAGTCTCGGGCGTACAGACACTTC
AATGCGAGGAATATGGCCATTTCCTCGCGTCTGGGAGATAGCAAGTAAACCATCAAGTCATGGAATAATTCAGGGGTCATTCTAAAGACAAAAATGTCTT
TTCTGTCGTCTCTTACACTCTCAAACCAGTTTATAACGTGTATTGTGAAGTGAAATGCGAGGTAAGTTCCGCGTCCTGCATTTAGGTCTAGCCTTGCTTC
TGCTTTCTCAGAATCGCTTCCATTGAAAATTATGCCATCAGCCGATGGGTCATACTCATAAATATCATTTACCCAATCCACTGTTGCTAGCTGGTATAGC
AATATAGCCATCATCTTTAAGTGGGAAAGTCCAGATGAATCAGACATGACCCACTCGCTTTCCCACAACTCGCAGCGGCTCCGCCAATTATCGTGGGCCA
CCTTCAGTGCTGATTCATCAAAATCAAAGTAATAACCAACAGCCTTTGCCATGATCTGGCAATGGCGGAGGAAATAATACTTAAATTCGGAAAAATTGCC
CAGCCACTCGTCTAGGCAATCCCGAGTAAGATGCTGATTGTGTGGAGATGCTTCGTTGGACATAAACGCCACTTACCCCATTTTAGGGTGTAGCGTCTAG
TCCAACAAGCAATGGTGGAAACGCGCTTTAGCTCGTCGCGGCCACTTTCTGCCGCTTACGTTCCCCAATCAAAACGCGAATCGTGCGCGTCTCCAGGGGA
GCGCTATAGCCCTCCGGAATGCTCGGACGGCCATTTGTGAAGAGGAACTTAAACCGCTCCTCATCGCGCTTAACTGGCTCGCCGCGACCTACGACGCCCT
GATGCTTCGCCATTGCACCACCCCTCCTCGCGCGCTCCGTGGAGTGTCCCGAAACGCTGCTTATTCGGGCTATATGCAGCATGCGGCCCGCCAGGTCAAA
CAAGGAGTGAGGGCTAAACCTCTTTTCCACTTGACAGCTCCGACGCGCCGGCGGGTGTTTCGCACACCGATTCGACGCTACTAAGACGTGCGTGTGACCA
CGAAAGTACGCCCAGGTCAACGGGCATTAAGTTAAGCCGTTCCCGATGTTTTTTCAAAAGACCTGTGACAATATGGCACGTGTACTACTCGATACAGCGC
CGGGAAGCCCGCAGCGCCTGCTTTCGCTTCTCGACCCTTCCGGCCGTCACGAGCTGGTCTGCCCGCTTGGCTTCGTCCGGATCGATCAGGCAATGCCAGT
AGCCGTCCGCCAGCTTGGCGAGGACGCGGGCCTCTAGGTCTTCCTCGGTGTCTGCCATGCGCTCCCTCTCGCATGAGAACGTAGAGGGGACAAGCGTGAT
TCACGCAAGGCCGCGTCGCACTGCCTACCGCTTCGGCTTCCACAGCACGACGCCATCCTCCGTCCAATACCGCCGGACGGTACCGCGCAGCTCCTGCCGC
TCCAGCATCTTGAGGACCTTGCGCCGTACCTCGATGTAGGCCGGTGAGGCGGGGTCGAGCGCCTTGCGCTCCAAGATGGCGTCGCGGAGATCGACGACGG
ACAGCGGGCCGCGCTCGCGCAGAAGATCGAAAATCAGGCGCACCAGCTCGCCGCGCTTGAAGTAGCCGCCGGGGTACGGGCGGATCGCCTTCACCGGCTT
GCCGTCCCGGTGCCCGAAGATGCCGAGCGTGGCCTCTACGTGGGCGACCTGGCGCTCAAGGACGTCCAGCTTCTCCCGCGTGACTTCCAGCTCGCCAAGC
AGCCTGGCGCGCTTGTCCATGAGCGCGGAGATGACGTGTGGCTCTCCCATGGCCCGATGGTAAAATTTTCAACCACTTAGGGTCTATCGGGCGGAAGGTG
CGTTTGCTGCATAATGCCG
GCGATGTTCGCCAAGGCGGCTGGGGCCGCGCTTGCGTTAATTGGACTTGTTGCCCTTTTCGCAGGGAACCAATTCGTTCTCGCCAGTCCGATCAATGTGG
AACGGGAGCCAATAACGGAAACGTGTAGTTCTCCGCCCTTTGAAAGCATCTTGATATTTGATTTCGCCAAAAACATACAGGTTATATGTTTTACTATCTG
CGAATCCCAGGAACATAGAGAGCAATTGACTCTTTATAGGAATTGTATGATAGATCTCTTGATTAAGAAAGATATCAATAACATTTTTAACACGCTCCCT
ATCAACCTCGAAAGTCTCCGATCCTACTTTGAAGACCGTGCTAGTTGCAAAATATTTTACTTTGTATGCCGGCGTAGATCCGTAGTTTTTAATGGATACA
ATAATATTAGCAGATTTCCCTCCATCATGGAGGGCCAAATTTACTTTCGAAAAGCCAACATAGGCGCGGATCTCTTTCTGGGTGTTCTTCCGCGCCTCTT
TAATGGAAACAACTGCAGCATTGGCAGTTCTTTTTGTCTCGTGCAATGTGTCTCGCACAAACAATACACCGACGAAGGTAAGAACTATCGTTATAAGCCC
GACGCTTCCCGCAACGATGACGGCGCCATACGTCCAATCAGCCATCGATTTCTGGGCATACAGATCCTCTTCGGCGCGCTGGGCCTCGCGATTGGCCTCT
ACCTTCAGTTTAAGGCAATCGAGAGCTTTCTGAGGTTCGCTGAGCCGCAGGCAGTCGCGTTGAAGTTCATACTGAGTACGCTGTTGTGTATTGGCAGCGG
CATAATCTTCGTACGCGCGGTTGTATTGCTGACTGGCCCCTGACCACCAAATGAACGGTGCTATGAAAGCAAGATAGACGACGAGAATCAGCCAATGCGC
ATTTCGCGAGTACCAATCGCTTCTAGGCATTGGAGCAGAAAGGCAGCGGTGAATTTTCCACGGCTGACCTTGTTCCTCAGGTTGCGTTCATCCTCGTGAA
CGCCAATTGATTCCAGCTTCTGCTGCAATTGGGAATAGGTAATCCCCTGACGCGCCATCTCGCTGCGGATGAGGCCGCGCGCCTTGGCTTCCCAATCGGT
CCTTTCTGGCATGGTCGCTGCTCCTATGATGCGATTTGTCATCATATAGCGTGCTTCTGCCCTTGAAAAGGTGATAGATCGTCATCATCTCTGATGCGTA
GGCACTGGAGACGATGACAATGACCCAACACTTCCTTCTCTCAGCGAAAGCCCGGACCCTCAGCGTGGTCCAGGTGGCTCGCATGTCCGAGGAAGAGGCT
CGGGCCGCATTCCGGGCGATCCGCTGGGCCACGACCGATGGCGACCCCGTCTGCCCGCGCTGCGGCTGCTGTGCTGTCTACGAGTACGCCAGCCGCCCGA
TCTTCAAGTGCAAGGCTTGCGGCCATCAGTTCAGCGTCACCAGCGGGACGATCTTCGCCAGCCGCAAGCTGCCGGTGCGCGACTACCTCCTGGCCATCGC
GATCTTCGTGAACGCCGCCAAGGGCATGTCCGCGCTGCAGCTCGGCCGCGACCTCGATGTGCAGTACCGCACGGCCTACGTCCTGGCGCACAAGCTGCGC
GAGGCGATGGCTGCCGAGAACGAGACCATGACCCTGAGCGGCACGGTCGAGGTTGATGGCGCCTACTTCGGCGGCTACGTCCGGCCGGAGAACCGCAAGG
AGGATCGCGTCGATCTCCGCCTCAAGGCCAACCAATCCGGCAAGCGCCGCGTCGTGGTCGTCATGCGCCAGCGTGGCGGCCGGACGCTGCCGTTCGTCTT
CCGCAGCGAGGGCGAGTCCCTGCCGTCGATCCTGGCCCGCGTCGAGCCGGGCAGCACGATCCACGCGGACGAAGCCCCGAGCTGGGATGAGCTGCACACC
CGGTACGAGACCCGCCGCATCAACCACAGCATTGCCTTCTCGGATGACGGCGCCTGCACCAATCAGGCGGAAAGCTACTTCTCCCGTCTGCGCCGCGCCG
AGTGGGGCCAGCACCACCACATCAGCGGCCGGTATCTCGGCTTCTATGCTGGCGGGGCCGCGTGGCGCGAGGACACGCGCCGGAAGCCCAATGGGGCGCT
CTACGGGCTGCTGATTGCCGCTGCCGCCGCCCATCCGGTTTCGCGGCAGTGGAAGGGGTACTGGCAACGGGCCCGCTAGTCTCGGGCGTACAGACACTTC
AATGCGAGGAATATGGCCATTTCCTCGCGTCTGGGAGATAGCAAGTAAACCATCAAGTCATGGAATAATTCAGGGGTCATTCTAAAGACAAAAATGTCTT
TTCTGTCGTCTCTTACACTCTCAAACCAGTTTATAACGTGTATTGTGAAGTGAAATGCGAGGTAAGTTCCGCGTCCTGCATTTAGGTCTAGCCTTGCTTC
TGCTTTCTCAGAATCGCTTCCATTGAAAATTATGCCATCAGCCGATGGGTCATACTCATAAATATCATTTACCCAATCCACTGTTGCTAGCTGGTATAGC
AATATAGCCATCATCTTTAAGTGGGAAAGTCCAGATGAATCAGACATGACCCACTCGCTTTCCCACAACTCGCAGCGGCTCCGCCAATTATCGTGGGCCA
CCTTCAGTGCTGATTCATCAAAATCAAAGTAATAACCAACAGCCTTTGCCATGATCTGGCAATGGCGGAGGAAATAATACTTAAATTCGGAAAAATTGCC
CAGCCACTCGTCTAGGCAATCCCGAGTAAGATGCTGATTGTGTGGAGATGCTTCGTTGGACATAAACGCCACTTACCCCATTTTAGGGTGTAGCGTCTAG
TCCAACAAGCAATGGTGGAAACGCGCTTTAGCTCGTCGCGGCCACTTTCTGCCGCTTACGTTCCCCAATCAAAACGCGAATCGTGCGCGTCTCCAGGGGA
GCGCTATAGCCCTCCGGAATGCTCGGACGGCCATTTGTGAAGAGGAACTTAAACCGCTCCTCATCGCGCTTAACTGGCTCGCCGCGACCTACGACGCCCT
GATGCTTCGCCATTGCACCACCCCTCCTCGCGCGCTCCGTGGAGTGTCCCGAAACGCTGCTTATTCGGGCTATATGCAGCATGCGGCCCGCCAGGTCAAA
CAAGGAGTGAGGGCTAAACCTCTTTTCCACTTGACAGCTCCGACGCGCCGGCGGGTGTTTCGCACACCGATTCGACGCTACTAAGACGTGCGTGTGACCA
CGAAAGTACGCCCAGGTCAACGGGCATTAAGTTAAGCCGTTCCCGATGTTTTTTCAAAAGACCTGTGACAATATGGCACGTGTACTACTCGATACAGCGC
CGGGAAGCCCGCAGCGCCTGCTTTCGCTTCTCGACCCTTCCGGCCGTCACGAGCTGGTCTGCCCGCTTGGCTTCGTCCGGATCGATCAGGCAATGCCAGT
AGCCGTCCGCCAGCTTGGCGAGGACGCGGGCCTCTAGGTCTTCCTCGGTGTCTGCCATGCGCTCCCTCTCGCATGAGAACGTAGAGGGGACAAGCGTGAT
TCACGCAAGGCCGCGTCGCACTGCCTACCGCTTCGGCTTCCACAGCACGACGCCATCCTCCGTCCAATACCGCCGGACGGTACCGCGCAGCTCCTGCCGC
TCCAGCATCTTGAGGACCTTGCGCCGTACCTCGATGTAGGCCGGTGAGGCGGGGTCGAGCGCCTTGCGCTCCAAGATGGCGTCGCGGAGATCGACGACGG
ACAGCGGGCCGCGCTCGCGCAGAAGATCGAAAATCAGGCGCACCAGCTCGCCGCGCTTGAAGTAGCCGCCGGGGTACGGGCGGATCGCCTTCACCGGCTT
GCCGTCCCGGTGCCCGAAGATGCCGAGCGTGGCCTCTACGTGGGCGACCTGGCGCTCAAGGACGTCCAGCTTCTCCCGCGTGACTTCCAGCTCGCCAAGC
AGCCTGGCGCGCTTGTCCATGAGCGCGGAGATGACGTGTGGCTCTCCCATGGCCCGATGGTAAAATTTTCAACCACTTAGGGTCTATCGGGCGGAAGGTG
CGTTTGCTGCATAATGCCG
Protein section
ORF number : 6
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
840 bp | 279 aa | 104 | 943 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MFAKAAGAALALIGLVALFAGNQFVLASPINVEREPITETCSSPPFESILIFDFAKNIQVICFTICESQEHREQLTLYRNCMIDLLIKKDINNIFNTLPI
NLESLRSYFEDRASCKIFYFVCRRRSVVFNGYNNISRFPSIMEGQIYFRKANIGADLFLGVLPRLFNGNNCSIGSSFCLVQCVSHKQYTDEGKNYRYKPD
ASRNDDGAIRPISHRFLGIQILFGALGLAIGLYLQFKAIESFLRFAEPQAVALKFILSTLLCIGSGIIFVRAVVLLTGP
NLESLRSYFEDRASCKIFYFVCRRRSVVFNGYNNISRFPSIMEGQIYFRKANIGADLFLGVLPRLFNGNNCSIGSSFCLVQCVSHKQYTDEGKNYRYKPD
ASRNDDGAIRPISHRFLGIQILFGALGLAIGLYLQFKAIESFLRFAEPQAVALKFILSTLLCIGSGIIFVRAVVLLTGP
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
225 bp | 74 aa | 1212 | 988 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MPERTDWEAKARGLIRSEMARQGITYSQLQQKLESIGVHEDERNLRNKVSRGKFTAAFLLQCLEAIGTREMRIG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
960 bp | 319 aa | 1320 | 2279 | + | No |
Chemistry : DDE
ORF sequence :
MTQHFLLSAKARTLSVVQVARMSEEEARAAFRAIRWATTDGDPVCPRCGCCAVYEYASRPIFKCKACGHQFSVTSGTIFASRKLPVRDYLLAIAIFVNAA
KGMSALQLGRDLDVQYRTAYVLAHKLREAMAAENETMTLSGTVEVDGAYFGGYVRPENRKEDRVDLRLKANQSGKRRVVVVMRQRGGRTLPFVFRSEGES
LPSILARVEPGSTIHADEAPSWDELHTRYETRRINHSIAFSDDGACTNQAESYFSRLRRAEWGQHHHISGRYLGFYAGGAAWREDTRRKPNGALYGLLIA
AAAAHPVSRQWKGYWQRAR
KGMSALQLGRDLDVQYRTAYVLAHKLREAMAAENETMTLSGTVEVDGAYFGGYVRPENRKEDRVDLRLKANQSGKRRVVVVMRQRGGRTLPFVFRSEGES
LPSILARVEPGSTIHADEAPSWDELHTRYETRRINHSIAFSDDGACTNQAESYFSRLRRAEWGQHHHISGRYLGFYAGGAAWREDTRRKPNGALYGLLIA
AAAAHPVSRQWKGYWQRAR
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
186 bp | 61 aa | 3113 | 2928 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MAKHQGVVGRGEPVKRDEERFKFLFTNGRPSIPEGYSAPLETRTIRVLIGERKRQKVAATS
Blast result :ORF 5
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
174 bp | 57 aa | 3558 | 3385 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MADTEEDLEARVLAKLADGYWHCLIDPDEAKRADQLVTAGRVEKRKQALRASRRCIE
Blast result :ORF 6
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
426 bp | 141 aa | 4050 | 3625 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MGEPHVISALMDKRARLLGELEVTREKLDVLERQVAHVEATLGIFGHRDGKPVKAIRPYPGGYFKRGELVRLIFDLLRERGPLSVVDLRDAILERKALDP
ASPAYIEVRRKVLKMLERQELRGTVRRYWTEDGVVLWKPKR
ASPAYIEVRRKVLKMLERQELRGTVRRYWTEDGVVLWKPKR
Blast result :
Comments
ISTico1 is 86% aa similar to ISBmo1.
References
1] ISfinder annotation (2017)
2] Varghese,N. and Submissions,S. (2017) Direct GenBank submission.
2] Varghese,N. and Submissions,S. (2017) Direct GenBank submission.