ISThsp3
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Thiomonas sp. | Thiomonas sp. 3As |
DNA section
IS Length : 2455 bp
Ends
IR Length : 18/24
IRL : GTAAGCGTTTGCCGAACCCATCTTTTCAATGAGTCTGAGCTGAGCGAAGC
IRR : GTAAGCGTGCAGGGAACCCCTCTTGACACCCGAAGTGAAGGTCAGGTGCT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TCGACGTTTG | GAAATACC | CTCTGGCCGA | 8 |
DNA sequence
GTAAGCGTTTGCCGAACCCATCTTTTCAATGAGTCTGAGCTGAGCGAAGCTTGGGTTTTTTCGGGGGCTCCATGCGATCAGACAAACTCAGTGAAGAGCG
GATTGATGAGATTCTGGCGATCCTCGATGAACTCAAAGCCAGCGGGATGAAGGCCGAAGCGTTTGCGCAAAGCAAGGGGCTGGGCTACGGGCAACTGCGC
GGCTGGCTGTCGAGCGCGCCGCGCTGGCGCGCCCAACGCGCGGGTCTGGCCCTGCCGCCTCGGCGCAGCGCCTTCGTGCGCGCCCACTGCAACCCGTCCC
CCGCTGCGGCTGCGGAGGGAAGCCCCCATCCCGGACACGCCCGCATCACCTGCGAGGGGGCCCAGCGCCGCGCCCAGATTGACTGGCCCGTGGCCCACGC
AGCCGAGTGCGCGCAGTGGCTGCAGGCCTGGCTGGGATAAGCCATGCTGCGCATTGAGGCCATCTGGCTGGCCGTGGGCGCCAGCGATCTGCGCGGCGGC
ATGGATAGCCTGCTGGGCCAGGTTGTGGCCCGGTTTGGCTCGGCCCAGCGCCACCATGCCTACGTGTTTGCCAACCGCCGCGCCACCCGGCTGAAGGTGC
TCGTCTTTGATGGCTCGGGGATCTGGCTGTGCACGCGCCGACTGCAAGAAGGCCGGTTTGCCTGGCCGCAGGAGGACCGTGAGGCGCTGCACCTGAGTGC
CGAGCAATGGAGTTGGCTGGCCGCCGGGCTGCCGTGGCAGCGCATGACCGCGCACGCCACCGCCAGCGCCATTGCCGTGGTGTAGTCCCCCGGACGTTTG
TCGATCGGGGCAACTCCCGATTCTCGCGCGCGCGTGCGCGCGAGAGAATTCGGCATGGTCAGCGCCATCGACGACGACAAACTCCAGTCCCTGGGCGACG
ATCCGGCGGCGCAGTACGCGCGCACGGTCATTGCGCAGTTCGGTGCACAGATCGCGCACCAGCGTGCCGAGCTCAAATTTCAATCGACCAAGATTGCGGC
CCTGAGCTTCGAGCTGGCGCGCCTCAAGCAATGGCGCTTCGGCCAGTCCAGCGAGAGCCTGGACACGCAAGGCCAGCTCTTCGACGCCAAGACGCAGGCG
CTGCTGCAAGCCGAGGAACAGGCCGAGGACCGCGCCGCCGATGCGGAGCGCACCGCCCCAGGCAAACGTCGCCCCAAACGCCAGCCCCTGCCCAGCCAGT
TGCCGCGCATCGAGCATCGCTACGAGATCGACTCCGGCCTGTGCCCGCAGGGCCACACCCTGCGCCGTATCGGGGAGGAGATCAGCGAGCAACTCGACTG
CGAGCCCACGCGTTTCTTCGTGCACCGGCACATCCGCGGCAAGTATGCCTGCGCCTGCTGCCAGACGGTGTTGGCTGCACCCCTGCCAGCGCAACTCATT
GACAAAGGCATTCCCGCCCCCGGCCTGCTGGCGCAGGTGGTGCTGGCCAAGCACGACGATCACCTGCCGCTGTACCGCCAGGAGGAGATTTACCGCCGCA
GCGGCGTGCACCTCCCGCGCTCGAGCCTGGCGCAGTGGGTGGGCCTGTGCGGGGTGCGCCTGGAACCGCTGGCCCAGGCCCTCAAAGACCATCTGCTGGA
GCAACCCGTGCTGCATGCCGATGAGACCCCGGTAGCCGAGCTGGCGCCGGGCACGGGCAAGACACACCGCGCTTATGTCTGGGTCTACCGCAGCGCGGCT
ACGCCGGCGGTGGTGTTTGACTATTGCGCCAGTCGTGCCGGTGCGCATGCGCGCGACTTCCTGCAGGATTGGTCAGGCACCTTGCTCACCGATGACTTCA
GCGGCTACAAGGCGCTGTATGCCCAGGGGAGCATTGTGGAGGCGGGGTGCTGGGCGCATGTACGCAGAAAGTTCTTCGAGGCGCACAAGCTGGCGGGCAG
CGCCATCGCGCAGGAGGCGCTTGAGCGCATCAAAGCCTTGTATGCCATTGAGCAGACCCTTCGGGAGCATCCGCCCGATGCGCGCACCGCGCTGCGCCAG
CGCCAAAGCCAACCCCTGCTCGAGGCCTTGCACGCCTGGCTGATCGAGCAACGCCCGCTTCTGGCCAAGGCCGACGCCACGGCGCGGGCCATCGACTATG
CGCTGGGCCGCTGGCGGGCGTTGTGTGTGTTTGCCACCGATGGGCGCGTGCCGATCGATAACAACGCGGTGGAAAACGCCATTCGGCCTCTCGCACTCGG
GCGCCGAAACTGGCTCTTCGTGGGCTCGCCCCAGGCCGGCCGCCGAGCCGCCGTGCTCATGACGCTGATCGAATCGGCCAAGCTCTGCGAGGTCGACCCC
TGGGCCTATCTCAAGGACGTGCTGACGAAGCTGCCCACCTGGCCCAACAGCCGTCTGGGCGAATTGCTGCCCCACAACTGGGCGAAAACCAATCCCCCTG
CACTCAGCACCTGACCTTCACTTCGGGTGTCAAGAGGGGTTCCCTGCACGCTTAC
GATTGATGAGATTCTGGCGATCCTCGATGAACTCAAAGCCAGCGGGATGAAGGCCGAAGCGTTTGCGCAAAGCAAGGGGCTGGGCTACGGGCAACTGCGC
GGCTGGCTGTCGAGCGCGCCGCGCTGGCGCGCCCAACGCGCGGGTCTGGCCCTGCCGCCTCGGCGCAGCGCCTTCGTGCGCGCCCACTGCAACCCGTCCC
CCGCTGCGGCTGCGGAGGGAAGCCCCCATCCCGGACACGCCCGCATCACCTGCGAGGGGGCCCAGCGCCGCGCCCAGATTGACTGGCCCGTGGCCCACGC
AGCCGAGTGCGCGCAGTGGCTGCAGGCCTGGCTGGGATAAGCCATGCTGCGCATTGAGGCCATCTGGCTGGCCGTGGGCGCCAGCGATCTGCGCGGCGGC
ATGGATAGCCTGCTGGGCCAGGTTGTGGCCCGGTTTGGCTCGGCCCAGCGCCACCATGCCTACGTGTTTGCCAACCGCCGCGCCACCCGGCTGAAGGTGC
TCGTCTTTGATGGCTCGGGGATCTGGCTGTGCACGCGCCGACTGCAAGAAGGCCGGTTTGCCTGGCCGCAGGAGGACCGTGAGGCGCTGCACCTGAGTGC
CGAGCAATGGAGTTGGCTGGCCGCCGGGCTGCCGTGGCAGCGCATGACCGCGCACGCCACCGCCAGCGCCATTGCCGTGGTGTAGTCCCCCGGACGTTTG
TCGATCGGGGCAACTCCCGATTCTCGCGCGCGCGTGCGCGCGAGAGAATTCGGCATGGTCAGCGCCATCGACGACGACAAACTCCAGTCCCTGGGCGACG
ATCCGGCGGCGCAGTACGCGCGCACGGTCATTGCGCAGTTCGGTGCACAGATCGCGCACCAGCGTGCCGAGCTCAAATTTCAATCGACCAAGATTGCGGC
CCTGAGCTTCGAGCTGGCGCGCCTCAAGCAATGGCGCTTCGGCCAGTCCAGCGAGAGCCTGGACACGCAAGGCCAGCTCTTCGACGCCAAGACGCAGGCG
CTGCTGCAAGCCGAGGAACAGGCCGAGGACCGCGCCGCCGATGCGGAGCGCACCGCCCCAGGCAAACGTCGCCCCAAACGCCAGCCCCTGCCCAGCCAGT
TGCCGCGCATCGAGCATCGCTACGAGATCGACTCCGGCCTGTGCCCGCAGGGCCACACCCTGCGCCGTATCGGGGAGGAGATCAGCGAGCAACTCGACTG
CGAGCCCACGCGTTTCTTCGTGCACCGGCACATCCGCGGCAAGTATGCCTGCGCCTGCTGCCAGACGGTGTTGGCTGCACCCCTGCCAGCGCAACTCATT
GACAAAGGCATTCCCGCCCCCGGCCTGCTGGCGCAGGTGGTGCTGGCCAAGCACGACGATCACCTGCCGCTGTACCGCCAGGAGGAGATTTACCGCCGCA
GCGGCGTGCACCTCCCGCGCTCGAGCCTGGCGCAGTGGGTGGGCCTGTGCGGGGTGCGCCTGGAACCGCTGGCCCAGGCCCTCAAAGACCATCTGCTGGA
GCAACCCGTGCTGCATGCCGATGAGACCCCGGTAGCCGAGCTGGCGCCGGGCACGGGCAAGACACACCGCGCTTATGTCTGGGTCTACCGCAGCGCGGCT
ACGCCGGCGGTGGTGTTTGACTATTGCGCCAGTCGTGCCGGTGCGCATGCGCGCGACTTCCTGCAGGATTGGTCAGGCACCTTGCTCACCGATGACTTCA
GCGGCTACAAGGCGCTGTATGCCCAGGGGAGCATTGTGGAGGCGGGGTGCTGGGCGCATGTACGCAGAAAGTTCTTCGAGGCGCACAAGCTGGCGGGCAG
CGCCATCGCGCAGGAGGCGCTTGAGCGCATCAAAGCCTTGTATGCCATTGAGCAGACCCTTCGGGAGCATCCGCCCGATGCGCGCACCGCGCTGCGCCAG
CGCCAAAGCCAACCCCTGCTCGAGGCCTTGCACGCCTGGCTGATCGAGCAACGCCCGCTTCTGGCCAAGGCCGACGCCACGGCGCGGGCCATCGACTATG
CGCTGGGCCGCTGGCGGGCGTTGTGTGTGTTTGCCACCGATGGGCGCGTGCCGATCGATAACAACGCGGTGGAAAACGCCATTCGGCCTCTCGCACTCGG
GCGCCGAAACTGGCTCTTCGTGGGCTCGCCCCAGGCCGGCCGCCGAGCCGCCGTGCTCATGACGCTGATCGAATCGGCCAAGCTCTGCGAGGTCGACCCC
TGGGCCTATCTCAAGGACGTGCTGACGAAGCTGCCCACCTGGCCCAACAGCCGTCTGGGCGAATTGCTGCCCCACAACTGGGCGAAAACCAATCCCCCTG
CACTCAGCACCTGACCTTCACTTCGGGTGTCAAGAGGGGTTCCCTGCACGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
369 bp | 122 aa | 72 | 440 | + | No |
AG : IS66 TnpA
ORF sequence :
MRSDKLSEERIDEILAILDELKASGMKAEAFAQSKGLGYGQLRGWLSSAPRWRAQRAGLALPPRRSAFVRAHCNPSPAAAAEGSPHPGHARITCEGAQRR
AQIDWPVAHAAECAQWLQAWLG
AQIDWPVAHAAECAQWLQAWLG
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
342 bp | 113 aa | 444 | 785 | + | No |
AG : IS66 TnpB
ORF sequence :
MLRIEAIWLAVGASDLRGGMDSLLGQVVARFGSAQRHHAYVFANRRATRLKVLVFDGSGIWLCTRRLQEGRFAWPQEDREALHLSAEQWSWLAAGLPWQR
MTAHATASAIAVV
MTAHATASAIAVV
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1560 bp | 519 aa | 855 | 2414 | + | No |
Chemistry : DDE
ORF sequence :
MVSAIDDDKLQSLGDDPAAQYARTVIAQFGAQIAHQRAELKFQSTKIAALSFELARLKQWRFGQSSESLDTQGQLFDAKTQALLQAEEQAEDRAADAERT
APGKRRPKRQPLPSQLPRIEHRYEIDSGLCPQGHTLRRIGEEISEQLDCEPTRFFVHRHIRGKYACACCQTVLAAPLPAQLIDKGIPAPGLLAQVVLAKH
DDHLPLYRQEEIYRRSGVHLPRSSLAQWVGLCGVRLEPLAQALKDHLLEQPVLHADETPVAELAPGTGKTHRAYVWVYRSAATPAVVFDYCASRAGAHAR
DFLQDWSGTLLTDDFSGYKALYAQGSIVEAGCWAHVRRKFFEAHKLAGSAIAQEALERIKALYAIEQTLREHPPDARTALRQRQSQPLLEALHAWLIEQR
PLLAKADATARAIDYALGRWRALCVFATDGRVPIDNNAVENAIRPLALGRRNWLFVGSPQAGRRAAVLMTLIESAKLCEVDPWAYLKDVLTKLPTWPNSR
LGELLPHNWAKTNPPALST
APGKRRPKRQPLPSQLPRIEHRYEIDSGLCPQGHTLRRIGEEISEQLDCEPTRFFVHRHIRGKYACACCQTVLAAPLPAQLIDKGIPAPGLLAQVVLAKH
DDHLPLYRQEEIYRRSGVHLPRSSLAQWVGLCGVRLEPLAQALKDHLLEQPVLHADETPVAELAPGTGKTHRAYVWVYRSAATPAVVFDYCASRAGAHAR
DFLQDWSGTLLTDDFSGYKALYAQGSIVEAGCWAHVRRKFFEAHKLAGSAIAQEALERIKALYAIEQTLREHPPDARTALRQRQSQPLLEALHAWLIEQR
PLLAKADATARAIDYALGRWRALCVFATDGRVPIDNNAVENAIRPLALGRRNWLFVGSPQAGRRAAVLMTLIESAKLCEVDPWAYLKDVLTKLPTWPNSR
LGELLPHNWAKTNPPALST
Blast result :
Comments
ISThsp3 is 30% (ORF1) and 54% (ORF2) aa similar to ISBcen19, 50% (ORF3) to ISBcen14.
References