ISHla4
- Family IS5
- Group ISH1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_012028 | ND | Halorubrum lacusprofundi | Halorubrum lacusprofundi ATCC 49239 chromosome 2 |
DNA section
IS Length : 1804 bp
Ends
IR Length : 17
IRL : GGCTCTGTTGAAATCCTCAGAGATGTACATGTTCGCCGTGTAATTCTATG
IRR : GGCTCTGTTGAAATCCTATTCATCAGACATATACACAGATGAAATAGCCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
ATACGAAGAT | GGACTACCTT | 0 |
DNA sequence
GGCTCTGTTGAAATCCTCAGAGATGTACATGTTCGCCGTGTAATTCTATGGGATGAAGTCCCTCCCAAAGTCGCAGATTCTCCGGTTTACTGAGAAGGCG
ATCCATCTGGCACGCCGGGCGGTCTCTCGGTACTCCTCGAAATTCTCTAAACACCGCTATACACTTCCGCAGCACGTTGTTCTGCTCTGTCTCAAAGTTC
GGAAGAACACGACCTACCGTGGTCTGCTTGACGAACTGATCGAGATGCCACGCATCCGTCGTGTTCTCGGGCTAGCCGAACTTCCTACTGCTTCAACGCT
TTGTAAGTCGTTCAGCCGGCTTGATATGGCTGTATGGCGTGTCATATTGACTCTCTCAGCGACACTACTTCCGACAAGCGGCGTTGTTGGTGTTGATGCG
TCAGGGTTCGACCGCAGTCACGCTTCGAAACACTACACGAAACGCGCTGAACTCACGATTCAGCAGCTCAAGGTGACGTTACTGGTCGATGCGAAGGTAA
ACGCGATACTCGATCTACACGTAACTACGACGCGGAAACACGATAGCCAGATCGCTCCGTCGTTGATCAAGCGCAATCCCGACGATATTGACGTTTTGCT
CGGTGACAAAGGGTACGACGATCAGAAGATCAGGCGGCTCGCCCGGCAACACGAAGTTCGACCACTGATCAAGCATCGTGAGTTCACGTCACTCCATAAG
GCATGGAACGTACGCTTAGACACTGATCTCTACGGTCAGCGGAGTCAATCCGAGACTGTCAACTCAACACTCAAGCGGAAGTACGGGGCGTTTGTCCGGT
CACGGCGCTGGTGGAAGCAGTTCCGTGAACTCACCATCGCCTGTCTCATTCATAACGTAGATCGATCACTCTGAGCGGTCAATACAGGAAACTCACGAAT
CGCTCAGTGCAGGAGACGCAGAGATCGCTCAGAGTGTTCTGTTTCGACCCACAAGACATACTCTTTGTCAGCAAAGCCGAATACGTATGTCCCCGACGAA
ACGTCTAATTGGCACCACTTTACGCGAAAAAATTACTGTGATCCTCTTGGCAGCGTCAATGCTAATCTCCGTGGTCGCAGTTGGCTTTGTTTCCGACCCC
GTCGCAGCCACGGCCGGTCATTCCCCAAGTTTTGGCGAAAGTGTTACTGATACAATCGCTGATACCAATGTACCAAATACAGCCAACGAGACCAACAAAA
GTGAGGTCTACTTTGAGATCGCTAACCTCACGCCTCAGGCTCGGGACACTACCGTGGCTCTTGACGACACATTAGCATTCACCGTTACGATCGAAAATAC
TGGTGATGCCAAAGGCGTTCAGACCGTTGAGTTTCGCATCGGCGACAACGCGGTCGCCAGCCGAAAAGTGACGCTTGATGTAAACAACAGCACCACTATT
GAATTTGATGAGATCAACGTGTCACACTTTGACACGGGTGAGTACGAGTACGGCTTCCTTACGAATGACGACAACCGGACTGTAACCATCACATTCCAGC
CTGCTGGTGACGACCGCGTTGTTGTCGATAGAGACAATGATACCGATAGTGAGAACGACTCAAACGGAAGTGACGGTAACGATGAACGGGACGGGGCTAC
CAGTAATGACACGCCCGGCTTCGGTGCGTCTGTGGCTCTCGTTGCTCTCGTCGTAGCTACGCTGTTCATGACCCGTCCCAGCGAGTAGGATCTTTGAATT
CCGGCTGAGTAGAGCGTACAAGGATGTTTTGAGCGATTCGCAGCTCTCACCAACTGGCTATTTCATCTGTGTATATGTCTGATGAATAGGATTTCAACAG
AGCC
ATCCATCTGGCACGCCGGGCGGTCTCTCGGTACTCCTCGAAATTCTCTAAACACCGCTATACACTTCCGCAGCACGTTGTTCTGCTCTGTCTCAAAGTTC
GGAAGAACACGACCTACCGTGGTCTGCTTGACGAACTGATCGAGATGCCACGCATCCGTCGTGTTCTCGGGCTAGCCGAACTTCCTACTGCTTCAACGCT
TTGTAAGTCGTTCAGCCGGCTTGATATGGCTGTATGGCGTGTCATATTGACTCTCTCAGCGACACTACTTCCGACAAGCGGCGTTGTTGGTGTTGATGCG
TCAGGGTTCGACCGCAGTCACGCTTCGAAACACTACACGAAACGCGCTGAACTCACGATTCAGCAGCTCAAGGTGACGTTACTGGTCGATGCGAAGGTAA
ACGCGATACTCGATCTACACGTAACTACGACGCGGAAACACGATAGCCAGATCGCTCCGTCGTTGATCAAGCGCAATCCCGACGATATTGACGTTTTGCT
CGGTGACAAAGGGTACGACGATCAGAAGATCAGGCGGCTCGCCCGGCAACACGAAGTTCGACCACTGATCAAGCATCGTGAGTTCACGTCACTCCATAAG
GCATGGAACGTACGCTTAGACACTGATCTCTACGGTCAGCGGAGTCAATCCGAGACTGTCAACTCAACACTCAAGCGGAAGTACGGGGCGTTTGTCCGGT
CACGGCGCTGGTGGAAGCAGTTCCGTGAACTCACCATCGCCTGTCTCATTCATAACGTAGATCGATCACTCTGAGCGGTCAATACAGGAAACTCACGAAT
CGCTCAGTGCAGGAGACGCAGAGATCGCTCAGAGTGTTCTGTTTCGACCCACAAGACATACTCTTTGTCAGCAAAGCCGAATACGTATGTCCCCGACGAA
ACGTCTAATTGGCACCACTTTACGCGAAAAAATTACTGTGATCCTCTTGGCAGCGTCAATGCTAATCTCCGTGGTCGCAGTTGGCTTTGTTTCCGACCCC
GTCGCAGCCACGGCCGGTCATTCCCCAAGTTTTGGCGAAAGTGTTACTGATACAATCGCTGATACCAATGTACCAAATACAGCCAACGAGACCAACAAAA
GTGAGGTCTACTTTGAGATCGCTAACCTCACGCCTCAGGCTCGGGACACTACCGTGGCTCTTGACGACACATTAGCATTCACCGTTACGATCGAAAATAC
TGGTGATGCCAAAGGCGTTCAGACCGTTGAGTTTCGCATCGGCGACAACGCGGTCGCCAGCCGAAAAGTGACGCTTGATGTAAACAACAGCACCACTATT
GAATTTGATGAGATCAACGTGTCACACTTTGACACGGGTGAGTACGAGTACGGCTTCCTTACGAATGACGACAACCGGACTGTAACCATCACATTCCAGC
CTGCTGGTGACGACCGCGTTGTTGTCGATAGAGACAATGATACCGATAGTGAGAACGACTCAAACGGAAGTGACGGTAACGATGAACGGGACGGGGCTAC
CAGTAATGACACGCCCGGCTTCGGTGCGTCTGTGGCTCTCGTTGCTCTCGTCGTAGCTACGCTGTTCATGACCCGTCCCAGCGAGTAGGATCTTTGAATT
CCGGCTGAGTAGAGCGTACAAGGATGTTTTGAGCGATTCGCAGCTCTCACCAACTGGCTATTTCATCTGTGTATATGTCTGATGAATAGGATTTCAACAG
AGCC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
822 bp | 273 aa | 53 | 874 | + | No |
Chemistry : DDE
ORF sequence :
MKSLPKSQILRFTEKAIHLARRAVSRYSSKFSKHRYTLPQHVVLLCLKVRKNTTYRGLLDELIEMPRIRRVLGLAELPTASTLCKSFSRLDMAVWRVILT
LSATLLPTSGVVGVDASGFDRSHASKHYTKRAELTIQQLKVTLLVDAKVNAILDLHVTTTRKHDSQIAPSLIKRNPDDIDVLLGDKGYDDQKIRRLARQH
EVRPLIKHREFTSLHKAWNVRLDTDLYGQRSQSETVNSTLKRKYGAFVRSRRWWKQFRELTIACLIHNVDRSL
LSATLLPTSGVVGVDASGFDRSHASKHYTKRAELTIQQLKVTLLVDAKVNAILDLHVTTTRKHDSQIAPSLIKRNPDDIDVLLGDKGYDDQKIRRLARQH
EVRPLIKHREFTSLHKAWNVRLDTDLYGQRSQSETVNSTLKRKYGAFVRSRRWWKQFRELTIACLIHNVDRSL
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
702 bp | 233 aa | 987 | 1688 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MSPTKRLIGTTLREKITVILLAASMLISVVAVGFVSDPVAATAGHSPSFGESVTDTIADTNVPNTANETNKSEVYFEIANLTPQARDTTVALDDTLAFTV
TIENTGDAKGVQTVEFRIGDNAVASRKVTLDVNNSTTIEFDEINVSHFDTGEYEYGFLTNDDNRTVTITFQPAGDDRVVVDRDNDTDSENDSNGSDGNDE
RDGATSNDTPGFGASVALVALVVATLFMTRPSE
TIENTGDAKGVQTVEFRIGDNAVASRKVTLDVNNSTTIEFDEINVSHFDTGEYEYGFLTNDDNRTVTITFQPAGDDRVVVDRDNDTDSENDSNGSDGNDE
RDGATSNDTPGFGASVALVALVVATLFMTRPSE
Blast result :
Comments
ISHla4 (orfA, the transposase) is 96% aa similar to ISHma13. The orfB is a passenger gene.
References
1] ISfinder annotation (2010).
2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Sims,D., Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ivanova,N., Anderson,I., DasSarma,S., Cavicchioli,R. and Richardson,P. (2009) Direct submission GenBank.
2] Lucas,S., Copeland,A., Lapidus,A., Glavina del Rio,T., Dalin,E., Tice,H., Bruce,D., Goodwin,L., Pitluck,S., Sims,D., Brettin,T., Detter,J.C., Han,C., Larimer,F., Land,M., Hauser,L., Kyrpides,N., Ivanova,N., Anderson,I., DasSarma,S., Cavicchioli,R. and Richardson,P. (2009) Direct submission GenBank.