ISSpu21
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_AAWY01000007 | ND | Shewanella putrefaciens | Shewanella putrefaciens 200 |
DNA section
IS Length : 2740 bp
Ends
IR Length : 22
IRL : GTAAGCGTCCATTTTAGGACGTCCTATTCCCACTTAAGATCGCGCGATAA
IRR : GTAAGCGTCCATTTTAGGACGTATTGACGCCTTATAGATTTCGGCCAAGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
TTTCAAAGCA | TCTTTAGA | TGTTGCTTAC | 8 |
TCATTCTACT | GTAGGAGA | AGTAAAGCTA | 8 |
ATGCAATGAC | TTTTGACT | TACGCATCTA | 8 |
ACCACAATCA | GCAGATAG | CGCCCGTCAA | 8 |
DNA sequence
GTAAGCGTCCATTTTAGGACGTCCTATTCCCACTTAAGATCGCGCGATAAGCTGTGCACTCATCATTACAGGAGAGTGATATGGCAAAGCGAGCTTACCG
CAACGCTGCACAATGGCAGCAATTAATCGACTTATGGCACACATCAGAGCTGAGCATCACTGAGTTTTGCCAAACCCATCAACTGAGCACCATGTCATTT
TATAAATGGCGGCAACGCTTGGCTGAGCAAACTGAACAAGCGCCTGTTCGCCAGGATACACAGTTTATCGATTTATCCGCGCTTGCACATCACGGTAACA
CCCGCTGGCATATCGTCTTGTCACTGGGCAATGGCGTTGAACTTACCCTTAGCCAGCAATAATGTTTATCCCTGCTGCCAACCAAAAAATCTGGCTGTGC
ACCACACCCGTGGATATGCGTAAATCCTTTAACGGTTTAAGTGCCCTGGTGCGCAATAAACTTTGCCATAATCCACAAAGCGGCCAGTACTTTGTGTTTA
TCAATGCCCGCAAAACCCAGATGAAGGTGTTGTACTTCGAAAGCTCAGGCTATTGTGTCTGGGCTAAGCGCCTCGAGCAAGGCCAGTTTCCGGTGAAACC
GCATCCGAGTGGGATCCAATCACTCACCGGTTGCACCCTGCAAATGATCGTTGACGGAATTACCGTACTTCGCCAGAAACAAGCGAAGCGCTATGGGCAG
TCAATCGCTTGATAAGTCTGTATAATTGACGCTATGAAAATCAAGCCTTCATCCTCATCCAACGTGGCAACGGACGCCGAGACACGGGTTAATTTGACCA
GTGAAATCGCGCGCCTGCAGAGTTTGTTGCAACAGGCTTCTGCAGAGAATCAGGCCTGGTCAGAAAAGTACGGCAAGCTGGAAACTGAGCATGCGTCATT
GCGCGAAACCTTCGCAGGCATTCAGCAACGCTTAGCCTGGTTTGAGAAGCAGTTGTTTGGTCAGAAGTCAGAAAAGCGCGCCTTGGAATTGGGTATGCAG
TTAAGCTTACTGGGCGATATGGTGCCAGCACTGGCCCAACCCGAAGGTGAAACCGAATATACCACCTACACCCGCAGAAAAGGCAAACAGCGCCCGGATG
ATTGCGTTAATGACAGTGGTTTGCGATTTAACGATAACGTACCGGTGAAAGTGATTACGCTGATACCAGACGAGTTAAAAGGGGAAGATGCCGACCAGTA
TGAAGTGATTGGCGTAAAAAGCACCTTCCGCTTAGCACAGCGCCCAGCGAGCTTCGACGTGCTGCGTTATGACCGGCAAATCGTTAAACACAAAGGCAGC
GACACTATCCTGCCAAGCGCAGCGCCGTTTAATGTGCTGGATAAAAGCGTGGCGGATGTGAGCTTTATCGTCGGCATGTTGGTGGATAAATTCCAATACC
ACTTGCCGCTGTATCGCCAACACCAGCGCTTAGCCGCAGCGGGCATTACTTTAAGCCGCAGCACACTGGGCACCATCGTGGCACGCGGTATTGATTTACT
CTATCCGATTGTGGATGCGATGCTGACCAGCATCTTACAAAGCCAGGTGCTGGCAATGGACGAGACGCCGATTAAAGCAGGCAAAGCGGGCCCGGGCAAA
ATGAAGCAAAGTTACTTCTGGCCGGTGTATGGTGATAAGGATGAAATTGTTTTTACCTTCTCCACCAGCCGCGGGCGGCAACATATAGAAGAGATATTAA
AACATCGCTTTAAAGGGACTTTATTAAGCGATGGCTACAGCGCCTATGCCAGTTATATCAAAGCGAATGAGGGCTTAACCCATGCCCAGTGCTGGGTGCA
CAGTCGCCGGCAATTTATTGCAGCCGAAAACAGCTGGCCACAACCGGCCAAAGAAGCGATAAGCCTAATCGCCAAGCTGTATGAGATAGAAGAGACTATC
CAACGCCAAAAACTGACCGATGATAAAAAGCGCCAATACCGGCTTACCCACAGCAAACCGGTGGTTGACCGCTTCTTTACGTGGTGTGATGACACCTTGC
ATAACCTGACGCTGCTGCCAAAAGACCCCCTCTACAAAGCGATTGGCTATGTACAAAGCAAAGAAATGGCACTGCGTGTCTTCTTAGAAGACCCCGATGT
GCCGCTGGATACAAACCATCTGGAGCGGGCGCTCAGGCCCATACCCATGGGGAGAAAAAATTGGTTGTTCTGTTGGACAGAAATCGGTGCAGAGCATGTT
GGCGTTATCCAAAGCCTGATAGTCACTTGTCGATTGCATGATATCAATGTAAACGATTATCTGACCGATGTACTGCTACGTATCAGCCAACACCCTGCCA
GCCTGGTGCATGAACTGACGCCACGGTATTGGAAAACCCAGTTCGCCGATAATCCACTGCGTTCAGACCTCTTTGCTTTACCCGCCACTTCGGTAGAATA
GCCTCCAGACGGAGGCGATATGCGCATAGAAGACCTAACCTTAGACGAATTACATGACCTGAACGAGCTCATCTGCAAGCGGATAGACTACTTACGTTTG
CAAAACGACATCAATGTTCTAAGCCAGCTACGTTTAGGCCAAAAAGTCCACTTCAAGGCCAAAGAAGGCCAAGTGTTCGGCGTGGTTATCAAAATCAACA
AGAAATCAGTGATGGTGGTCAGCGATGATAACCGCCAGTGGAAAATCCCACCAGGACTGGTGCAGATCATGAAAGACATCTAGCCAGGATGCTTGGCCGA
AATCTATAAGGCGTCAATACGTCCTAAAATGGACGCTTAC
CAACGCTGCACAATGGCAGCAATTAATCGACTTATGGCACACATCAGAGCTGAGCATCACTGAGTTTTGCCAAACCCATCAACTGAGCACCATGTCATTT
TATAAATGGCGGCAACGCTTGGCTGAGCAAACTGAACAAGCGCCTGTTCGCCAGGATACACAGTTTATCGATTTATCCGCGCTTGCACATCACGGTAACA
CCCGCTGGCATATCGTCTTGTCACTGGGCAATGGCGTTGAACTTACCCTTAGCCAGCAATAATGTTTATCCCTGCTGCCAACCAAAAAATCTGGCTGTGC
ACCACACCCGTGGATATGCGTAAATCCTTTAACGGTTTAAGTGCCCTGGTGCGCAATAAACTTTGCCATAATCCACAAAGCGGCCAGTACTTTGTGTTTA
TCAATGCCCGCAAAACCCAGATGAAGGTGTTGTACTTCGAAAGCTCAGGCTATTGTGTCTGGGCTAAGCGCCTCGAGCAAGGCCAGTTTCCGGTGAAACC
GCATCCGAGTGGGATCCAATCACTCACCGGTTGCACCCTGCAAATGATCGTTGACGGAATTACCGTACTTCGCCAGAAACAAGCGAAGCGCTATGGGCAG
TCAATCGCTTGATAAGTCTGTATAATTGACGCTATGAAAATCAAGCCTTCATCCTCATCCAACGTGGCAACGGACGCCGAGACACGGGTTAATTTGACCA
GTGAAATCGCGCGCCTGCAGAGTTTGTTGCAACAGGCTTCTGCAGAGAATCAGGCCTGGTCAGAAAAGTACGGCAAGCTGGAAACTGAGCATGCGTCATT
GCGCGAAACCTTCGCAGGCATTCAGCAACGCTTAGCCTGGTTTGAGAAGCAGTTGTTTGGTCAGAAGTCAGAAAAGCGCGCCTTGGAATTGGGTATGCAG
TTAAGCTTACTGGGCGATATGGTGCCAGCACTGGCCCAACCCGAAGGTGAAACCGAATATACCACCTACACCCGCAGAAAAGGCAAACAGCGCCCGGATG
ATTGCGTTAATGACAGTGGTTTGCGATTTAACGATAACGTACCGGTGAAAGTGATTACGCTGATACCAGACGAGTTAAAAGGGGAAGATGCCGACCAGTA
TGAAGTGATTGGCGTAAAAAGCACCTTCCGCTTAGCACAGCGCCCAGCGAGCTTCGACGTGCTGCGTTATGACCGGCAAATCGTTAAACACAAAGGCAGC
GACACTATCCTGCCAAGCGCAGCGCCGTTTAATGTGCTGGATAAAAGCGTGGCGGATGTGAGCTTTATCGTCGGCATGTTGGTGGATAAATTCCAATACC
ACTTGCCGCTGTATCGCCAACACCAGCGCTTAGCCGCAGCGGGCATTACTTTAAGCCGCAGCACACTGGGCACCATCGTGGCACGCGGTATTGATTTACT
CTATCCGATTGTGGATGCGATGCTGACCAGCATCTTACAAAGCCAGGTGCTGGCAATGGACGAGACGCCGATTAAAGCAGGCAAAGCGGGCCCGGGCAAA
ATGAAGCAAAGTTACTTCTGGCCGGTGTATGGTGATAAGGATGAAATTGTTTTTACCTTCTCCACCAGCCGCGGGCGGCAACATATAGAAGAGATATTAA
AACATCGCTTTAAAGGGACTTTATTAAGCGATGGCTACAGCGCCTATGCCAGTTATATCAAAGCGAATGAGGGCTTAACCCATGCCCAGTGCTGGGTGCA
CAGTCGCCGGCAATTTATTGCAGCCGAAAACAGCTGGCCACAACCGGCCAAAGAAGCGATAAGCCTAATCGCCAAGCTGTATGAGATAGAAGAGACTATC
CAACGCCAAAAACTGACCGATGATAAAAAGCGCCAATACCGGCTTACCCACAGCAAACCGGTGGTTGACCGCTTCTTTACGTGGTGTGATGACACCTTGC
ATAACCTGACGCTGCTGCCAAAAGACCCCCTCTACAAAGCGATTGGCTATGTACAAAGCAAAGAAATGGCACTGCGTGTCTTCTTAGAAGACCCCGATGT
GCCGCTGGATACAAACCATCTGGAGCGGGCGCTCAGGCCCATACCCATGGGGAGAAAAAATTGGTTGTTCTGTTGGACAGAAATCGGTGCAGAGCATGTT
GGCGTTATCCAAAGCCTGATAGTCACTTGTCGATTGCATGATATCAATGTAAACGATTATCTGACCGATGTACTGCTACGTATCAGCCAACACCCTGCCA
GCCTGGTGCATGAACTGACGCCACGGTATTGGAAAACCCAGTTCGCCGATAATCCACTGCGTTCAGACCTCTTTGCTTTACCCGCCACTTCGGTAGAATA
GCCTCCAGACGGAGGCGATATGCGCATAGAAGACCTAACCTTAGACGAATTACATGACCTGAACGAGCTCATCTGCAAGCGGATAGACTACTTACGTTTG
CAAAACGACATCAATGTTCTAAGCCAGCTACGTTTAGGCCAAAAAGTCCACTTCAAGGCCAAAGAAGGCCAAGTGTTCGGCGTGGTTATCAAAATCAACA
AGAAATCAGTGATGGTGGTCAGCGATGATAACCGCCAGTGGAAAATCCCACCAGGACTGGTGCAGATCATGAAAGACATCTAGCCAGGATGCTTGGCCGA
AATCTATAAGGCGTCAATACGTCCTAAAATGGACGCTTAC
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
282 bp | 93 aa | 81 | 362 | + | No |
AG : IS66 TnpA
ORF sequence :
MAKRAYRNAAQWQQLIDLWHTSELSITEFCQTHQLSTMSFYKWRQRLAEQTEQAPVRQDTQFIDLSALAHHGNTRWHIVLSLGNGVELTLSQQ
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
351 bp | 116 aa | 362 | 712 | + | No |
AG : IS66 TnpB
ORF sequence :
MFIPAANQKIWLCTTPVDMRKSFNGLSALVRNKLCHNPQSGQYFVFINARKTQMKVLYFESSGYCVWAKRLEQGQFPVKPHPSGIQSLTGCTLQMIVDGI
TVLRQKQAKRYGQSIA
TVLRQKQAKRYGQSIA
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1668 bp | 555 aa | 734 | 2401 | + | No |
Chemistry : DDE
ORF sequence :
MKIKPSSSSNVATDAETRVNLTSEIARLQSLLQQASAENQAWSEKYGKLETEHASLRETFAGIQQRLAWFEKQLFGQKSEKRALELGMQLSLLGDMVPAL
AQPEGETEYTTYTRRKGKQRPDDCVNDSGLRFNDNVPVKVITLIPDELKGEDADQYEVIGVKSTFRLAQRPASFDVLRYDRQIVKHKGSDTILPSAAPFN
VLDKSVADVSFIVGMLVDKFQYHLPLYRQHQRLAAAGITLSRSTLGTIVARGIDLLYPIVDAMLTSILQSQVLAMDETPIKAGKAGPGKMKQSYFWPVYG
DKDEIVFTFSTSRGRQHIEEILKHRFKGTLLSDGYSAYASYIKANEGLTHAQCWVHSRRQFIAAENSWPQPAKEAISLIAKLYEIEETIQRQKLTDDKKR
QYRLTHSKPVVDRFFTWCDDTLHNLTLLPKDPLYKAIGYVQSKEMALRVFLEDPDVPLDTNHLERALRPIPMGRKNWLFCWTEIGAEHVGVIQSLIVTCR
LHDINVNDYLTDVLLRISQHPASLVHELTPRYWKTQFADNPLRSDLFALPATSVE
AQPEGETEYTTYTRRKGKQRPDDCVNDSGLRFNDNVPVKVITLIPDELKGEDADQYEVIGVKSTFRLAQRPASFDVLRYDRQIVKHKGSDTILPSAAPFN
VLDKSVADVSFIVGMLVDKFQYHLPLYRQHQRLAAAGITLSRSTLGTIVARGIDLLYPIVDAMLTSILQSQVLAMDETPIKAGKAGPGKMKQSYFWPVYG
DKDEIVFTFSTSRGRQHIEEILKHRFKGTLLSDGYSAYASYIKANEGLTHAQCWVHSRRQFIAAENSWPQPAKEAISLIAKLYEIEETIQRQKLTDDKKR
QYRLTHSKPVVDRFFTWCDDTLHNLTLLPKDPLYKAIGYVQSKEMALRVFLEDPDVPLDTNHLERALRPIPMGRKNWLFCWTEIGAEHVGVIQSLIVTCR
LHDINVNDYLTDVLLRISQHPASLVHELTPRYWKTQFADNPLRSDLFALPATSVE
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
264 bp | 87 aa | 2420 | 2683 | + | No |
Annotation : Description :
ORF sequence :
MRIEDLTLDELHDLNELICKRIDYLRLQNDINVLSQLRLGQKVHFKAKEGQVFGVVIKINKKSVMVVSDDNRQWKIPPGLVQIMKDI
Blast result :
Comments
ISSpu19 is 53% (ORFA) aa similar to ISAzo19_aa1, 64% (ORFB) to ISAzo19_aa2, and 67% (ORFC, the transposase) to ISCARN15.
The forth ORF is found in other IS66 family elements that are not currently present in the ISfinder database.
The forth ORF is found in other IS66 family elements that are not currently present in the ISfinder database.
References
1] Romine M. (2010) Direct submission.