ISBos1
- Family IS1595
- Group ISNha5
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP014301 | ND | Bosea sp. | Bosea sp. PAMC 26642 |
DNA section
IS Length : 3731 bp
Ends
IR Length : 22/23
IRL : CGGCATTAGGTAGCAAGCTCACCAACTGCTGAGTCCGTTAACCATAGAGT
IRR : CGGCATTGGGTAGCAAGCTCACCTAAGAGAACGTATGCCAACGTTTTCCG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGTGCGCCCT | GGATTATC | CGCCGGCGAA | 8 |
DNA sequence
CGGCATTAGGTAGCAAGCTCACCAACTGCTGAGTCCGTTAACCATAGAGTGCCTGTGGATAACTTTCTGAGACTTCCCAATCTTGGCTTGGGCGCTTGCC
TTGGCTGCCGACTCGACGTATGGGTTGACTCAAGGAGAGGCAGCCATGACTACGCTCCAAGAATTCTTACACGGCCGGATGAACGAGCTTGCAACCGACG
ATGCTAGCTTAAGGCGCAAGCTTGCGCTCAATGAACAAGAGCGAGAGCAGCTTCAAAAGGCTGCAAAAGCGGCAGGGCTGCACCTTAAGCCCATTACCGA
AGCGCCGCCACCGCCGCCGGCTACTTATGTCTTGTCGGTCGGAACCTCCCAAGGGCGCGCGACTACCCAAGGGGTAGGCATGACAGTCTCGCGGCGGCCA
ATCCCCGAGAAGACGATTAAGGAAGCTGTGTTGGAAGTCCTTGGGGTTCTAGGCACAGGCCTTACAGCGTTGGATCTGTTGTCGGCCATCAACGCCAAAT
TCGATACTGACTACCCTAGGACTTCGCTGAGCCCGCAGTTGTCGCGGCTGAAGGCAGAGGGGAAAATTACCCGCTTGGGTAACTTGTGGAGTTTGGCGCC
CGATGCGCCAGAAACGAATGAGCCCGCTCATCCAACGTCCGAGGGGACGAATGAGCGGGCTCAGGTTCCTGAAACCAACAGCACGACCGTGGAGCCGGTC
GGGGAGGTGGCACATGAGAAGACGTTGACCGTCGACCCACTTGAATAACCACGCGGCGGCTCGGGTGTTCGCGCATCCGGGTCGCTCTGCGGGGGTAGAT
GGTCGGGCTAACGGTCTCCCACCAGCGAGCCCAGGGGTTCGTGTCCCCTTACCTCCACCACAGCGCCCGCTTTGCAGGTAACTGCAAAGTGGGCGCCCTT
TATATAGCGAACCTAGGCAGGGGCGTCTAGCGCCTCAACGCTTTTCGTTGGCGGGGCGTAAGTGATTCCACTCGCTGTGAAATTGATCCCGCCGCCTTCG
AGGGCGCGCCTCATCGCGTTTAAATTATTGGCGATTGGGACGCGACGCGCCTTCTCATAATCCCGCACGGTCGAAAGCGATACGCTTGCAGCGCTCGCTA
GCTGATCTTGCGTCCAATCCAACCACCCGCGCGCGGCGCGGCTCTGTTCTGGCGACATACACGATGACTAGTGACAAAAAGTTTTTCAGTCAACCCCAAC
GAAAAGTGTTGACGTCGCTGATTTCCATGGATATCACTGATGTCAACGATATTCGTTGGCGTGGGCATGGCACAGCATTTCCTCCTTTCCGCAGCGGCCC
GCACCCTCTCTTTGGTGACGGTCGCCCGCATGTCGGACGACGAAGCGCATAGCGCCTTTCGCCAGATCCGTTGGTGCGACACTGGCGGCGAGCCGGTTTG
CCCGCGTTGCAACTGCGGTGCGGTCTACACGTTCAAGACCCGCAAGCTCTACAAGTGCAAAGCCTGCACGCACCAGTTCTCGGTCACGTCTGGTACGATC
TTCGCCAGCCGCAAACTGGCTATCCGCGACTACCTCCTGGCCATCGCGATCTTCGTCAACGGCGCCAAGGGCCATTCGGCCTTGCAGCTCTCCCGCGACA
TCAACGTCCAGTACAAGACGGCCTTCGTCTTGACGCACAAGCTGCGCGAGGCGATGTCCGCCGAAATGGCCGACATGACTGTCTCGGGTGAGGTCGAGAT
CGACGGCGCCTATTTCGGTGGTCACGTTCGCCCGGCCAACTTCAAAGAGAACCGCGTCGATCGCCGTCTCGCCAAGAACCAGAACGGCAAGCGCCGCGTC
GTGGTCATCATGCGCGAGCGCGCCGGTCGCACGTTGCCATTCGTATTTAAGTCTGAGGGCGCCTCGCTGGCGACCATCGGCCGACGCGTGCACCCCTCGG
CTACGGTTCACGCTGACGAGGCGTTGCATTGGGATGAGCTTCACACGTTCTATCTGACGAAGCGCATCAACCACAGCGAGGCCTATTCGGACGGCCAATC
CTGCACCAACATGGCGGAAAGCTTTTTCTCCCGCCTGCGTCGCGCCGAGATCGGCACGCATCACCACATCGCCGGCCCGTACCTCAACGCCTACTCGTCA
GAAATGGCGTGGCGCGAAGATCACCGTCGTGTCAGCAATGGCGAACAGTACCTGATGATCGCGAGCGCGGCGCTCGCGCACCCCGTCAGCCGGCTGTGGA
AAGGCTATTGGCAGCGCCGCACGGGTTGACATTTGCCCGGGGGGATTCCCAGGCGAATCACGCCGTGTTACCTTCAGCTAAATGAAGCGATTTGTACCCA
ATATTCCGGAATGGTTTGAACCGCGCCGAGCGGCGCAGGTGACGGCGTTTTTTGCGCTTAAATCCGGTGGCAAAATCAACATTCTTAAGGCGACCAAGCT
CGTCTACCTGTCTGACCGGCTCAGTATGTCCCGACGCGATCACCCGATTACGGGGGACAATTTCGTTTCGATGCCGTTTGGACCCGTCAATACGTTTACT
TACAGCTACATGGACGGGGCAGCTCACTCCGACGCTTGGACGGAATTCGTAGCGCCGCGAAACGGCAACGAGCTTGAGCTGACCAAGAGGATTGATATTG
GCGATCTTGACGAGCTGAGCCGGTCAGACCTCAAAATTCTTGACGATACGTGGGAGGAATTCAAAGAGGTAGATCGCTTTGAATTAGCCGAGTGGACGCA
TAGATATTGCCCTGAGTGGAGAGACCCTGGCGGCTCCTCTATACCAATCGATTTTTCGACGGTTTTCAAAAACTTGAGCAAAGAATCGCCTGCTGAACTG
GCAGATGACATCCAAGCCGAGAGAGAACTGTTTATTCATCTCGCCGGTAAGTGATGTCATATAAGCCGTATCAAGGCGCAACGCTGCTGATACCTTATAA
CAACGTCCCTCATCTATTTTTCGTTCTCAACGAGCCATGCAAGGATGGCTTCTGCCTCCTAGTAATGGTGACATCAATAAATCCTAAGAAGTTACACGAC
GGCGCATGCGTTTTGCAGGCTGGTGATCACCCATTTGTCGTTCATCCAAGCTACTTGCTGTATCGTCTTGCGACTCAATCGCCGGCGCATCACATTCAGA
AGATGGTCGATAAGAAGTATTATGTTCCTAAAGAAGACTTGAGCAAAGCTGTCCTTGAGAGGGTCATTAATGGACTATGGTCTTCCGATGATACAAGGCC
TTCGATGCTTAGGTATGCAAAGGGCATTGGCCTAGAGCTTGATCCCAGTCCATAACCGTACGCCATCGCGCTTACCCTCTGAGACGATCAGCCCGCGCTT
AAGCCGCATGCTCAACGTCTGCACGACGCGATGCGCGACGGTACTGCGAAGCACCTTGTCGCTTTCGTCCATGCCCTTCGCGCGCATCACTCGCAGCGCC
AACTCGCGCGTGTCCAGAGGCGATTCGATCGCCAGCGCCTCATGGCAGAGCTTCCCGATCTCGCGGGGCCTGAACAGCCTTCCTATGTCCATGTAGACCG
GGAATTGAAGCGGCTCGTCTCCCACCTCGAATAGCCGGATCGTCGCGGCGATATGGGCCAGGTCGCGGCGTGCTGTCTCTAGCTTGTCCTCATACGCCTT
GATCGCGTCGTTGATGGCGTCGCGGCGGCGGCGAAGGGTGAGGAGAACGTTGGGGTCCGACATCAGAAAACACTAGACCAACGGAAAACGTTGGCATACG
TTCTCTTAGGTGAGCTTGCTACCCAATGCCG
TTGGCTGCCGACTCGACGTATGGGTTGACTCAAGGAGAGGCAGCCATGACTACGCTCCAAGAATTCTTACACGGCCGGATGAACGAGCTTGCAACCGACG
ATGCTAGCTTAAGGCGCAAGCTTGCGCTCAATGAACAAGAGCGAGAGCAGCTTCAAAAGGCTGCAAAAGCGGCAGGGCTGCACCTTAAGCCCATTACCGA
AGCGCCGCCACCGCCGCCGGCTACTTATGTCTTGTCGGTCGGAACCTCCCAAGGGCGCGCGACTACCCAAGGGGTAGGCATGACAGTCTCGCGGCGGCCA
ATCCCCGAGAAGACGATTAAGGAAGCTGTGTTGGAAGTCCTTGGGGTTCTAGGCACAGGCCTTACAGCGTTGGATCTGTTGTCGGCCATCAACGCCAAAT
TCGATACTGACTACCCTAGGACTTCGCTGAGCCCGCAGTTGTCGCGGCTGAAGGCAGAGGGGAAAATTACCCGCTTGGGTAACTTGTGGAGTTTGGCGCC
CGATGCGCCAGAAACGAATGAGCCCGCTCATCCAACGTCCGAGGGGACGAATGAGCGGGCTCAGGTTCCTGAAACCAACAGCACGACCGTGGAGCCGGTC
GGGGAGGTGGCACATGAGAAGACGTTGACCGTCGACCCACTTGAATAACCACGCGGCGGCTCGGGTGTTCGCGCATCCGGGTCGCTCTGCGGGGGTAGAT
GGTCGGGCTAACGGTCTCCCACCAGCGAGCCCAGGGGTTCGTGTCCCCTTACCTCCACCACAGCGCCCGCTTTGCAGGTAACTGCAAAGTGGGCGCCCTT
TATATAGCGAACCTAGGCAGGGGCGTCTAGCGCCTCAACGCTTTTCGTTGGCGGGGCGTAAGTGATTCCACTCGCTGTGAAATTGATCCCGCCGCCTTCG
AGGGCGCGCCTCATCGCGTTTAAATTATTGGCGATTGGGACGCGACGCGCCTTCTCATAATCCCGCACGGTCGAAAGCGATACGCTTGCAGCGCTCGCTA
GCTGATCTTGCGTCCAATCCAACCACCCGCGCGCGGCGCGGCTCTGTTCTGGCGACATACACGATGACTAGTGACAAAAAGTTTTTCAGTCAACCCCAAC
GAAAAGTGTTGACGTCGCTGATTTCCATGGATATCACTGATGTCAACGATATTCGTTGGCGTGGGCATGGCACAGCATTTCCTCCTTTCCGCAGCGGCCC
GCACCCTCTCTTTGGTGACGGTCGCCCGCATGTCGGACGACGAAGCGCATAGCGCCTTTCGCCAGATCCGTTGGTGCGACACTGGCGGCGAGCCGGTTTG
CCCGCGTTGCAACTGCGGTGCGGTCTACACGTTCAAGACCCGCAAGCTCTACAAGTGCAAAGCCTGCACGCACCAGTTCTCGGTCACGTCTGGTACGATC
TTCGCCAGCCGCAAACTGGCTATCCGCGACTACCTCCTGGCCATCGCGATCTTCGTCAACGGCGCCAAGGGCCATTCGGCCTTGCAGCTCTCCCGCGACA
TCAACGTCCAGTACAAGACGGCCTTCGTCTTGACGCACAAGCTGCGCGAGGCGATGTCCGCCGAAATGGCCGACATGACTGTCTCGGGTGAGGTCGAGAT
CGACGGCGCCTATTTCGGTGGTCACGTTCGCCCGGCCAACTTCAAAGAGAACCGCGTCGATCGCCGTCTCGCCAAGAACCAGAACGGCAAGCGCCGCGTC
GTGGTCATCATGCGCGAGCGCGCCGGTCGCACGTTGCCATTCGTATTTAAGTCTGAGGGCGCCTCGCTGGCGACCATCGGCCGACGCGTGCACCCCTCGG
CTACGGTTCACGCTGACGAGGCGTTGCATTGGGATGAGCTTCACACGTTCTATCTGACGAAGCGCATCAACCACAGCGAGGCCTATTCGGACGGCCAATC
CTGCACCAACATGGCGGAAAGCTTTTTCTCCCGCCTGCGTCGCGCCGAGATCGGCACGCATCACCACATCGCCGGCCCGTACCTCAACGCCTACTCGTCA
GAAATGGCGTGGCGCGAAGATCACCGTCGTGTCAGCAATGGCGAACAGTACCTGATGATCGCGAGCGCGGCGCTCGCGCACCCCGTCAGCCGGCTGTGGA
AAGGCTATTGGCAGCGCCGCACGGGTTGACATTTGCCCGGGGGGATTCCCAGGCGAATCACGCCGTGTTACCTTCAGCTAAATGAAGCGATTTGTACCCA
ATATTCCGGAATGGTTTGAACCGCGCCGAGCGGCGCAGGTGACGGCGTTTTTTGCGCTTAAATCCGGTGGCAAAATCAACATTCTTAAGGCGACCAAGCT
CGTCTACCTGTCTGACCGGCTCAGTATGTCCCGACGCGATCACCCGATTACGGGGGACAATTTCGTTTCGATGCCGTTTGGACCCGTCAATACGTTTACT
TACAGCTACATGGACGGGGCAGCTCACTCCGACGCTTGGACGGAATTCGTAGCGCCGCGAAACGGCAACGAGCTTGAGCTGACCAAGAGGATTGATATTG
GCGATCTTGACGAGCTGAGCCGGTCAGACCTCAAAATTCTTGACGATACGTGGGAGGAATTCAAAGAGGTAGATCGCTTTGAATTAGCCGAGTGGACGCA
TAGATATTGCCCTGAGTGGAGAGACCCTGGCGGCTCCTCTATACCAATCGATTTTTCGACGGTTTTCAAAAACTTGAGCAAAGAATCGCCTGCTGAACTG
GCAGATGACATCCAAGCCGAGAGAGAACTGTTTATTCATCTCGCCGGTAAGTGATGTCATATAAGCCGTATCAAGGCGCAACGCTGCTGATACCTTATAA
CAACGTCCCTCATCTATTTTTCGTTCTCAACGAGCCATGCAAGGATGGCTTCTGCCTCCTAGTAATGGTGACATCAATAAATCCTAAGAAGTTACACGAC
GGCGCATGCGTTTTGCAGGCTGGTGATCACCCATTTGTCGTTCATCCAAGCTACTTGCTGTATCGTCTTGCGACTCAATCGCCGGCGCATCACATTCAGA
AGATGGTCGATAAGAAGTATTATGTTCCTAAAGAAGACTTGAGCAAAGCTGTCCTTGAGAGGGTCATTAATGGACTATGGTCTTCCGATGATACAAGGCC
TTCGATGCTTAGGTATGCAAAGGGCATTGGCCTAGAGCTTGATCCCAGTCCATAACCGTACGCCATCGCGCTTACCCTCTGAGACGATCAGCCCGCGCTT
AAGCCGCATGCTCAACGTCTGCACGACGCGATGCGCGACGGTACTGCGAAGCACCTTGTCGCTTTCGTCCATGCCCTTCGCGCGCATCACTCGCAGCGCC
AACTCGCGCGTGTCCAGAGGCGATTCGATCGCCAGCGCCTCATGGCAGAGCTTCCCGATCTCGCGGGGCCTGAACAGCCTTCCTATGTCCATGTAGACCG
GGAATTGAAGCGGCTCGTCTCCCACCTCGAATAGCCGGATCGTCGCGGCGATATGGGCCAGGTCGCGGCGTGCTGTCTCTAGCTTGTCCTCATACGCCTT
GATCGCGTCGTTGATGGCGTCGCGGCGGCGGCGAAGGGTGAGGAGAACGTTGGGGTCCGACATCAGAAAACACTAGACCAACGGAAAACGTTGGCATACG
TTCTCTTAGGTGAGCTTGCTACCCAATGCCG
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
603 bp | 200 aa | 146 | 748 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MTTLQEFLHGRMNELATDDASLRRKLALNEQEREQLQKAAKAAGLHLKPITEAPPPPPATYVLSVGTSQGRATTQGVGMTVSRRPIPEKTIKEAVLEVLG
VLGTGLTALDLLSAINAKFDTDYPRTSLSPQLSRLKAEGKITRLGNLWSLAPDAPETNEPAHPTSEGTNERAQVPETNSTTVEPVGEVAHEKTLTVDPLE
VLGTGLTALDLLSAINAKFDTDYPRTSLSPQLSRLKAEGKITRLGNLWSLAPDAPETNEPAHPTSEGTNERAQVPETNSTTVEPVGEVAHEKTLTVDPLE
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
963 bp | 320 aa | 1267 | 2229 | + | No |
Chemistry : DDE
ORF sequence :
MAQHFLLSAAARTLSLVTVARMSDDEAHSAFRQIRWCDTGGEPVCPRCNCGAVYTFKTRKLYKCKACTHQFSVTSGTIFASRKLAIRDYLLAIAIFVNGA
KGHSALQLSRDINVQYKTAFVLTHKLREAMSAEMADMTVSGEVEIDGAYFGGHVRPANFKENRVDRRLAKNQNGKRRVVVIMRERAGRTLPFVFKSEGAS
LATIGRRVHPSATVHADEALHWDELHTFYLTKRINHSEAYSDGQSCTNMAESFFSRLRRAEIGTHHHIAGPYLNAYSSEMAWREDHRRVSNGEQYLMIAS
AALAHPVSRLWKGYWQRRTG
KGHSALQLSRDINVQYKTAFVLTHKLREAMSAEMADMTVSGEVEIDGAYFGGHVRPANFKENRVDRRLAKNQNGKRRVVVIMRERAGRTLPFVFKSEGAS
LATIGRRVHPSATVHADEALHWDELHTFYLTKRINHSEAYSDGQSCTNMAESFFSRLRRAEIGTHHHIAGPYLNAYSSEMAWREDHRRVSNGEQYLMIAS
AALAHPVSRLWKGYWQRRTG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
516 bp | 171 aa | 2339 | 2854 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MTAFFALKSGGKINILKATKLVYLSDRLSMSRRDHPITGDNFVSMPFGPVNTFTYSYMDGAAHSDAWTEFVAPRNGNELELTKRIDIGDLDELSRSDLKI
LDDTWEEFKEVDRFELAEWTHRYCPEWRDPGGSSIPIDFSTVFKNLSKESPAELADDIQAERELFIHLAGK
LDDTWEEFKEVDRFELAEWTHRYCPEWRDPGGSSIPIDFSTVFKNLSKESPAELADDIQAERELFIHLAGK
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
432 bp | 143 aa | 3663 | 3232 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MSDPNVLLTLRRRRDAINDAIKAYEDKLETARRDLAHIAATIRLFEVGDEPLQFPVYMDIGRLFRPREIGKLCHEALAIESPLDTRELALRVMRAKGMDE
SDKVLRSTVAHRVVQTLSMRLKRGLIVSEGKRDGVRLWTGIKL
SDKVLRSTVAHRVVQTLSMRLKRGLIVSEGKRDGVRLWTGIKL
Blast result :
Comments
ISBos1 is 84% (transposase) aa similar to ISNwi4.
References
1] ISfinder annotation (2017)
2] Park,H (2016) Direct GenBank submission.
2] Park,H (2016) Direct GenBank submission.