ISAtsp2
- Family IS1634
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Arthrospira sp. | Arthrospira sp. PCC 8005 |
DNA section
IS Length : 1761 bp
Ends
IR Length : 27/30
IRL : CCTACATTCCGCAGGTTGACAGGGAATAAAAGATATGGTGCTAGTCTAGG
IRR : CCTACATTCCGCAGGTTGATAGGTAATTAACCGGATGCGTAGTAATAGCG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TGAAGCTGGC | ATGAA | TAGTCAAGAC | 5 |
GACAGTGGCG | GTGTAA | TAATTGAAGG | 6 |
AAGATTTCCG | TTTTTAA | AAACCTCCAC | 7 |
TTTCCAAATG | GTTAA | AACCTGTTGG | 5 |
ACCTGAAAGG | ATGGAA | GACTAAGGGG | 6 |
ATATAAACGGAACGTTTTAA | ATCTAAAGATGGAATGCCCA | 0 | |
AAATGGAACG | TTTAAA | CTCCTATTAG | 6 |
AATGCTCTGC | CTATAA | CTATAAACAA | 6 |
DNA sequence
CCTACATTCCGCAGGTTGACAGGGAATAAAAGATATGGTGCTAGTCTAGGCTAGAAAGGGTTCCTTCGAGCGGTAGGCTCTGGCGATGCAGATTAAAAAT
TTAGACCATCTGGGTTTAGTGGCTGGAGTGATTGACGAACTTGGCCTGGTCGAGTTGACGGATAAGCGGATTGAACCTCATAGCTTGGAGCATGTGAGTG
CCGGACAAGTGGTTAAAGCCATGATTTTGAACGCCCTAGGCTTTCTCAGTGCACCGTTGTATTTATTCAGCGAGTTTTTTGAGAGTAAAGCGGTGTCTCA
TCTGCTCGGGGAAGGGGTAGAGGCTCGTCACTTGAATGACGACCGCTTAGGTCGAGTGTTGGATGAACTCTACGCAGAAGGGACGACATCATTTTTTCTC
CAGGTGGCGCTCCAGGCTGTGGAACGATTTGGAATTGATATTCAACAGCGTCATCTCGATGCCACCTCGATCTCAGTAGAAGGGAAGTATCAGCGGTGCT
CGAAGGGGAAATCCGAGGTAGGACTTGAGTCAGCTCCCCCCGGTGAGACATCAGCAGAACCCAGCCCAATTCGGCTGTGTCGAGGCTATTCCCGAGACCA
TCGTCCAGATTTGAAACAGTTTTTGATGACTCTAGTCTGTGCCGCCGACGGTGGCGTGCCGCTATGGTTGCAGTTGGCCAGTGGCAATGAACAGGACACT
CAGCAGTTTGCAGAGGTGCTCAAGGCGTTTGGTGACCAATGGACTAGCGACGGTATCGTTGTGATGGATGCCGCCTTTTACACAGCAGCCAATCTGCAGC
AGATGGAGACCACGGGGTGGCTATCACGGGTGCCGCTGACCCTAAAAGCGGCTCAAGAGCTGGTGCACAGCGATGTCACCCGACTGACTGAAGTCCCCTG
CAACTCCAAGGATTACCGGATGTGGGAGATTGAGCAGACCTATGCCGGAGTGCGCCAGCGCTGGCGCCTCGTCGAAAGCCAAACCCGCAAAGCCAATGCC
GACCTCTGGCAACCCGAATTAGAGAAGCTCGAACACCGCCTCAACCGCCAATTGAAAAAGCTGACCCAGCGGGTCTTTGCTTGCAAACCCGATGCCCTCG
AGGCCTTGATGCAGTTTCAAGATGGACTCGAGGTGCATCAGCTCACTCAGGTCTCCCTGGAGACGGTGCGGGCCAAGCGACCCCCCGGTCGTCCCGCCAA
ATCCGCCGAACCCACCCCAGTTCAGGGCTATCGGCTCCAGGCCACGTTACAGCAGACCGCCACGGCGGAAGACCGCTTTAGCCGTCAGCGTAGTCGCTTT
ATTCTAGCCACCAATCAACTGGAGCAATCCCTCTGGCCGGCTCAGACCTGCTTGAGCGAATATAAAGGGCAACAGACCGTCGAGAGAGGCTTTCGCTTCC
TCCAAGACCCCCTCTTCTTTGCCAGTAGCGTCTTTGTCAAAAAGCCGCAGCGGGTCGAGGCCTTAGCTCTCATCATGGCCCTAACCCTTATGGTGTATAC
CCTCGCCGAACGCCAACTGCGACAGGCGCTAGATGCTCAGAAGCAAACGGTGCGCGACCAACGCCAACAACCCACCGCTAAACCGACCTTTCGCTGGATT
ATGCAGAAGTTTCAAGGAATCCACTGGGTTAATCTCGATGGGCAAAGGCAGATTAGCAATCTCAATGATGAACGGCGATTGATTATTCACCTCTTCGGTC
CACCCGTTGAGCGCTATTACTACGCATCCGGTTAATTACCTATCAACCTGCGGAATGTAGG
TTAGACCATCTGGGTTTAGTGGCTGGAGTGATTGACGAACTTGGCCTGGTCGAGTTGACGGATAAGCGGATTGAACCTCATAGCTTGGAGCATGTGAGTG
CCGGACAAGTGGTTAAAGCCATGATTTTGAACGCCCTAGGCTTTCTCAGTGCACCGTTGTATTTATTCAGCGAGTTTTTTGAGAGTAAAGCGGTGTCTCA
TCTGCTCGGGGAAGGGGTAGAGGCTCGTCACTTGAATGACGACCGCTTAGGTCGAGTGTTGGATGAACTCTACGCAGAAGGGACGACATCATTTTTTCTC
CAGGTGGCGCTCCAGGCTGTGGAACGATTTGGAATTGATATTCAACAGCGTCATCTCGATGCCACCTCGATCTCAGTAGAAGGGAAGTATCAGCGGTGCT
CGAAGGGGAAATCCGAGGTAGGACTTGAGTCAGCTCCCCCCGGTGAGACATCAGCAGAACCCAGCCCAATTCGGCTGTGTCGAGGCTATTCCCGAGACCA
TCGTCCAGATTTGAAACAGTTTTTGATGACTCTAGTCTGTGCCGCCGACGGTGGCGTGCCGCTATGGTTGCAGTTGGCCAGTGGCAATGAACAGGACACT
CAGCAGTTTGCAGAGGTGCTCAAGGCGTTTGGTGACCAATGGACTAGCGACGGTATCGTTGTGATGGATGCCGCCTTTTACACAGCAGCCAATCTGCAGC
AGATGGAGACCACGGGGTGGCTATCACGGGTGCCGCTGACCCTAAAAGCGGCTCAAGAGCTGGTGCACAGCGATGTCACCCGACTGACTGAAGTCCCCTG
CAACTCCAAGGATTACCGGATGTGGGAGATTGAGCAGACCTATGCCGGAGTGCGCCAGCGCTGGCGCCTCGTCGAAAGCCAAACCCGCAAAGCCAATGCC
GACCTCTGGCAACCCGAATTAGAGAAGCTCGAACACCGCCTCAACCGCCAATTGAAAAAGCTGACCCAGCGGGTCTTTGCTTGCAAACCCGATGCCCTCG
AGGCCTTGATGCAGTTTCAAGATGGACTCGAGGTGCATCAGCTCACTCAGGTCTCCCTGGAGACGGTGCGGGCCAAGCGACCCCCCGGTCGTCCCGCCAA
ATCCGCCGAACCCACCCCAGTTCAGGGCTATCGGCTCCAGGCCACGTTACAGCAGACCGCCACGGCGGAAGACCGCTTTAGCCGTCAGCGTAGTCGCTTT
ATTCTAGCCACCAATCAACTGGAGCAATCCCTCTGGCCGGCTCAGACCTGCTTGAGCGAATATAAAGGGCAACAGACCGTCGAGAGAGGCTTTCGCTTCC
TCCAAGACCCCCTCTTCTTTGCCAGTAGCGTCTTTGTCAAAAAGCCGCAGCGGGTCGAGGCCTTAGCTCTCATCATGGCCCTAACCCTTATGGTGTATAC
CCTCGCCGAACGCCAACTGCGACAGGCGCTAGATGCTCAGAAGCAAACGGTGCGCGACCAACGCCAACAACCCACCGCTAAACCGACCTTTCGCTGGATT
ATGCAGAAGTTTCAAGGAATCCACTGGGTTAATCTCGATGGGCAAAGGCAGATTAGCAATCTCAATGATGAACGGCGATTGATTATTCACCTCTTCGGTC
CACCCGTTGAGCGCTATTACTACGCATCCGGTTAATTACCTATCAACCTGCGGAATGTAGG
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1650 bp | 549 aa | 86 | 1735 | + | No |
Chemistry : DDE
ORF sequence :
MQIKNLDHLGLVAGVIDELGLVELTDKRIEPHSLEHVSAGQVVKAMILNALGFLSAPLYLFSEFFESKAVSHLLGEGVEARHLNDDRLGRVLDELYAEGT
TSFFLQVALQAVERFGIDIQQRHLDATSISVEGKYQRCSKGKSEVGLESAPPGETSAEPSPIRLCRGYSRDHRPDLKQFLMTLVCAADGGVPLWLQLASG
NEQDTQQFAEVLKAFGDQWTSDGIVVMDAAFYTAANLQQMETTGWLSRVPLTLKAAQELVHSDVTRLTEVPCNSKDYRMWEIEQTYAGVRQRWRLVESQT
RKANADLWQPELEKLEHRLNRQLKKLTQRVFACKPDALEALMQFQDGLEVHQLTQVSLETVRAKRPPGRPAKSAEPTPVQGYRLQATLQQTATAEDRFSR
QRSRFILATNQLEQSLWPAQTCLSEYKGQQTVERGFRFLQDPLFFASSVFVKKPQRVEALALIMALTLMVYTLAERQLRQALDAQKQTVRDQRQQPTAKP
TFRWIMQKFQGIHWVNLDGQRQISNLNDERRLIIHLFGPPVERYYYASG
TSFFLQVALQAVERFGIDIQQRHLDATSISVEGKYQRCSKGKSEVGLESAPPGETSAEPSPIRLCRGYSRDHRPDLKQFLMTLVCAADGGVPLWLQLASG
NEQDTQQFAEVLKAFGDQWTSDGIVVMDAAFYTAANLQQMETTGWLSRVPLTLKAAQELVHSDVTRLTEVPCNSKDYRMWEIEQTYAGVRQRWRLVESQT
RKANADLWQPELEKLEHRLNRQLKKLTQRVFACKPDALEALMQFQDGLEVHQLTQVSLETVRAKRPPGRPAKSAEPTPVQGYRLQATLQQTATAEDRFSR
QRSRFILATNQLEQSLWPAQTCLSEYKGQQTVERGFRFLQDPLFFASSVFVKKPQRVEALALIMALTLMVYTLAERQLRQALDAQKQTVRDQRQQPTAKP
TFRWIMQKFQGIHWVNLDGQRQISNLNDERRLIIHLFGPPVERYYYASG
Blast result :
Comments
ISAtsp2 is 62% aa similar to ISAva4.
References
1] ISfinder annotation (2009)
2] Morin Nicolas (2009) Direct submission
2] Morin Nicolas (2009) Direct submission