ISUnCu6
- Family IS4
- Group ISPepr1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AY766185 | ND | uncultured murine | uncultured murine large bowel bacterium BAC 31B |
DNA section
IS Length : 1589 bp
Ends
IR Length : 16
IRL : TTTAAGTTTCGCAAGTCAGCATGAAAGTTTATAAGAGGGTATAAAAGAAG
IRR : TTTAAGTTTCGCAAGTTCAGCTTGCTAGTTTTAATTTGTAAATATCGCAT
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTTTGATACG | TATATTA | TCAATAGCAC | 7 |
DNA sequence
TTTAAGTTTCGCAAGTCAGCATGAAAGTTTATAAGAGGGTATAAAAGAAGGCATAGAGTTTTCTGTATATCGCAGAAAATGAGTATCTTTAAAGTGGTCA
AACAAAATAATACTCAACTCTATGCCAGTACACAAAGTTACAGACTTTCTTTCAGAAATCAGCGCATTCTTTAAAAAAGATGATGCGCACAATGCAATGT
ATTCCATTATGGACGTGATTAAATGGTTAAAAATGACAGAGTCTTCCCTGTTCGGGATGAAGAGTAAATGCAACAACATATATTCTTTACTGCAGGTATT
CCAGGCTCTGTTGTTATATCCTTGTTTCATGATTCGTAATCCTTATCATTTTCATGATTCTTCATTAAGCGGCTTGTTGAACTGCAAAAAAGATGTGTTC
TACCGTTTTATGAGCAACCCGAAGATTGATTGGCGTAAACTTGTCTATCATCTTAACATGCAGCTTTGGTCGAAAATCAAAGTCCGCTCCGAGCATAAGG
AAAACACCACCTGTCTGATAGTTGATGATACGGACTATCCCAAAACCGGACGCCGCTTCGAATATATCGGCAGGGTGCATTCCCATGTTCAGCACAGAAG
CATTCTGGGATTCAAGGCATTGTTTTTGGTCATAACAGACGGAACATCCCAAATGATTCTCGATTTTGCACTTCTTGGTGAAAAAGGGAGAAAAGGAAAT
TTTGGTATGTCCGCCAAGGAACTCAAAGACCGTTTCACCAAGCCCCGTGACGAACAGGATGCCTTGCAGGAACGAATCAACGAATATACGGCAAGCAAAA
TCAGTCTTATGATTGACATGATAAAACGAGCCATAGGTAAAGGAGTGAAATTCAGGTATGTTCTGGCTGACAGCTGGTTTGCATGCAAGGATATTATTCG
CTTTGTTCGTTCCCGCCACATGAAATGTGACTACTTGGGCATGATTAAAATCGGTGAGAGCGGAAGGACGAAATATCATTTTGAGCGCAAAGACTTTACG
GCACCGGCTCTGATAAAGTTGCTCTCCAAACGGAAACGGCGTAAATACAATAGAAAGCTGCGTTGCTATTATATGGTAGCTGATGTGGTCTTTGCAGACA
CTAAGGTTCGTTTGTTTTTCGTCAAGCGAAGTAAGAATGCCGCTTGGAATGGTTTGATAACCACTGACACCACTCTTGACTTCCTCTGCGCATATAAAAT
CTATGCTCAAAGATGGGCCTTGGAAGTCATCTTCAAGGAGGCAAAGGGATTGCTGGGACTCGGAAAATGCCAGGCAAACAATTTTGCCTCCCAGATTGCG
GCAACCTCCCTGACTGCATTGCAATACAACATTCTTTCCCTTGTGAAAAGATTTGCAGCCTATGAAACGATGGGAAAACTCTTTGAAAAAGTCTCCAAGG
ACTCTTTGGAACTTTCCATTATAGAAAGGATATGGGGAACTCTCCAAGAACTGATTATAGCAATAGCAAACCTCTTCGGTTTGGCCGATGAGGATATCTA
TGATGTGATGATTAACAGGTCGGAAGAGATGAACCATATATGCGATATTTACAAATTAAAACTAGCAAGCTGAACTTGCGAAACTTAAA
AACAAAATAATACTCAACTCTATGCCAGTACACAAAGTTACAGACTTTCTTTCAGAAATCAGCGCATTCTTTAAAAAAGATGATGCGCACAATGCAATGT
ATTCCATTATGGACGTGATTAAATGGTTAAAAATGACAGAGTCTTCCCTGTTCGGGATGAAGAGTAAATGCAACAACATATATTCTTTACTGCAGGTATT
CCAGGCTCTGTTGTTATATCCTTGTTTCATGATTCGTAATCCTTATCATTTTCATGATTCTTCATTAAGCGGCTTGTTGAACTGCAAAAAAGATGTGTTC
TACCGTTTTATGAGCAACCCGAAGATTGATTGGCGTAAACTTGTCTATCATCTTAACATGCAGCTTTGGTCGAAAATCAAAGTCCGCTCCGAGCATAAGG
AAAACACCACCTGTCTGATAGTTGATGATACGGACTATCCCAAAACCGGACGCCGCTTCGAATATATCGGCAGGGTGCATTCCCATGTTCAGCACAGAAG
CATTCTGGGATTCAAGGCATTGTTTTTGGTCATAACAGACGGAACATCCCAAATGATTCTCGATTTTGCACTTCTTGGTGAAAAAGGGAGAAAAGGAAAT
TTTGGTATGTCCGCCAAGGAACTCAAAGACCGTTTCACCAAGCCCCGTGACGAACAGGATGCCTTGCAGGAACGAATCAACGAATATACGGCAAGCAAAA
TCAGTCTTATGATTGACATGATAAAACGAGCCATAGGTAAAGGAGTGAAATTCAGGTATGTTCTGGCTGACAGCTGGTTTGCATGCAAGGATATTATTCG
CTTTGTTCGTTCCCGCCACATGAAATGTGACTACTTGGGCATGATTAAAATCGGTGAGAGCGGAAGGACGAAATATCATTTTGAGCGCAAAGACTTTACG
GCACCGGCTCTGATAAAGTTGCTCTCCAAACGGAAACGGCGTAAATACAATAGAAAGCTGCGTTGCTATTATATGGTAGCTGATGTGGTCTTTGCAGACA
CTAAGGTTCGTTTGTTTTTCGTCAAGCGAAGTAAGAATGCCGCTTGGAATGGTTTGATAACCACTGACACCACTCTTGACTTCCTCTGCGCATATAAAAT
CTATGCTCAAAGATGGGCCTTGGAAGTCATCTTCAAGGAGGCAAAGGGATTGCTGGGACTCGGAAAATGCCAGGCAAACAATTTTGCCTCCCAGATTGCG
GCAACCTCCCTGACTGCATTGCAATACAACATTCTTTCCCTTGTGAAAAGATTTGCAGCCTATGAAACGATGGGAAAACTCTTTGAAAAAGTCTCCAAGG
ACTCTTTGGAACTTTCCATTATAGAAAGGATATGGGGAACTCTCCAAGAACTGATTATAGCAATAGCAAACCTCTTCGGTTTGGCCGATGAGGATATCTA
TGATGTGATGATTAACAGGTCGGAAGAGATGAACCATATATGCGATATTTACAAATTAAAACTAGCAAGCTGAACTTGCGAAACTTAAA
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1449 bp | 483 aa | 122 | 1570 | + | No |
Chemistry : DDE
ORF sequence :
MPVHKVTDFLSEISAFFKKDDAHNAMYSIMDVIKWLKMTESSLFGMKSKCNNIYSLLQVFQALLLYPCFMIRNPYHFHDSSLSGLLNCKKDVFYRFMSNP
KIDWRKLVYHLNMQLWSKIKVRSEHKENTTCLIVDDTDYPKTGRRFEYIGRVHSHVQHRSILGFKALFLVITDGTSQMILDFALLGEKGRKGNFGMSAKE
LKDRFTKPRDEQDALQERINEYTASKISLMIDMIKRAIGKGVKFRYVLADSWFACKDIIRFVRSRHMKCDYLGMIKIGESGRTKYHFERKDFTAPALIKL
LSKRKRRKYNRKLRCYYMVADVVFADTKVRLFFVKRSKNAAWNGLITTDTTLDFLCAYKIYAQRWALEVIFKEAKGLLGLGKCQANNFASQIAATSLTAL
QYNILSLVKRFAAYETMGKLFEKVSKDSLELSIIERIWGTLQELIIAIANLFGLADEDIYDVMINRSEEMNHICDIYKLKLAS
KIDWRKLVYHLNMQLWSKIKVRSEHKENTTCLIVDDTDYPKTGRRFEYIGRVHSHVQHRSILGFKALFLVITDGTSQMILDFALLGEKGRKGNFGMSAKE
LKDRFTKPRDEQDALQERINEYTASKISLMIDMIKRAIGKGVKFRYVLADSWFACKDIIRFVRSRHMKCDYLGMIKIGESGRTKYHFERKDFTAPALIKL
LSKRKRRKYNRKLRCYYMVADVVFADTKVRLFFVKRSKNAAWNGLITTDTTLDFLCAYKIYAQRWALEVIFKEAKGLLGLGKCQANNFASQIAATSLTAL
QYNILSLVKRFAAYETMGKLFEKVSKDSLELSIIERIWGTLQELIIAIANLFGLADEDIYDVMINRSEEMNHICDIYKLKLAS
Blast result :
Comments
ISUnCu6 is 67% aa similar to IS493. ISUnCu6 was found by screening completely sequenced genomes for sequences homologous to the ISGur1 transposase using BLASTP. Multiple sequence alignments revealed a conserved DDE motif : D(N2)-113-D(N3)-117-E(C1).
References
1] De Palmenaer D, Siguier P, Mahillon J (2008) BMC Evol Biol , 8(1):18
2] Walter,J., Mangold,M. and Tannock,G.W. (2005) Appl. Environ. Microbiol. 71 (5), 2347-2354
2] Walter,J., Mangold,M. and Tannock,G.W. (2005) Appl. Environ. Microbiol. 71 (5), 2347-2354