ISNph7
- Family IS200/IS605
- Group IS605
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_007426 | ND | Natronomonas pharaonis | Natronomonas pharaonis DSM 2160 |
DNA section
IS Length : 2020 bp
Ends
Left end : AGAGAAAGCAGGCAGAGTGCCTCGGGGCTTGACCCCGAGGGTGAATGCCGTAAGCTCCATTAGACGTGTTCAGTGCGTTCGATATACTGCTCAATCGTCT II struct. : Yes
Right end : ACCCACCACACCGAGAGAGGCCACCAGACCGATGGTGTGTCGGGTGTGTCCGACCAATCCACGGGAAGAAGCTCCGGGGCTTGACCCCGGAGCAGTTCAC II struct. : Yes
Insertion site
Left flank | LE cleavage site | Right flank | RE cleavage site |
---|---|---|---|
GGCTTCCCGTCT | CTAC | GCCGCATGACGCAG | TCAC |
DNA sequence
AGAGAAAGCAGGCAGAGTGCCTCGGGGCTTGACCCCGAGGGTGAATGCCGTAAGCTCCATTAGACGTGTTCAGTGCGTTCGATATACTGCTCAATCGTCT
GGCTAGATACTTCACCAGCCGTTCCGATGTAGTATGATTCTTCCCAGAATCCGCCACCCCACAAGTAGTTTTCCAAGAGGTGTTCGTGTTGCTGCCACAT
CTCTCTGGCCGTGATGCTCTTGATTTTTCGTGCAATCTTACTTGGAGAATATTTTGGATGTGCTGAAAGAAACAAGTGAACGTGGTCTGGCGAGATATGC
AATGAGGGTATTTCATAATTGTACTCGCCGCAAACGTCTTTGAACGATGCTTCTAACGACTGTTCTATCTCATCAAGCACTGAGTGACGGTACTGAGCGG
GATCGGAGATCCCGCGAGGTACGCAAATCTTCGATTTGCTTGACTTCGGACACCACACGAAGTGGTAATTGATGTTGTAAACCGTGTGATTTGACCGTGT
TTCGCCCATATCGGCTAGAGTCATGCCAAAAAATTAAGTATAAGCGTATCTAAGTATATACTGTAGAGATGACCAAACGCCTCACCGTAGACTTCGAGGA
CGATTTGTACAAGGAGTTCTCGAAGAAATGTATCGACGTTGAACGTCCGAAATCAGAAGTCGTGCGTGAACTTGTCCAAGACTGGGTAAACGACCAAGAA
TGACGCAGACACAGGCTCTCACCAAGACGCTCGTGTTTCCGCTTGACGTGCAGAGTGGCAACGAGAGCCTGCTTCACGACGCCCGCTTGGAGTGTCGCCG
CGTGTTCAACGAGGTGCTACGTCTCAACTACGACGGGTGGAACTGGGACAAGATAGAAAACGTTGTTGAGCAGAACGCTGACCTCGTGCAAAACACCGCA
CAGCGTATCATCGACAAAGCCTTTGACGCACTCGATAACTACTACGACTACGACGACTGGGGACGACCGTGGTACAAACACGAGACGTTCCCACTGCGGA
TGAACTACGGCGAAGGCTACAACCTCTTTCTCGAAGACGAAACGGTACGGTTCCGCATTTCAGCAAAGCCGTACAATCACGTCAAAGGCGAGCTTCGCGG
TACGCAAGACCAATTCGACCTGCTCAGACAAGCAATCGAAGATGATGACTGGCACGTTGGCACTGCTGAGGCGCTTGTCCGACACGGACGCGAAGAGTTG
CACGCCACCGTCACGAACGAAACTGCCGAAGTCGGTGCGAAAACAACGACAGAAACGGTCACCGGCGTCGATATTAACGAAGACTGTGTTGCGCTAGCGG
CACTGACTGAACACGGTATCGAAGATTCCGTCGTCATTGACTACCCAGAAATCAAGGAACAACGCCACCGGTACTTCACGATGCGGAAACGCATGCAGGA
AGCCGGACAGACCGGGTTTAACGATGTGTTCCGCGACAAGGAACAACAGTTCGTTCACGACCAGCTACACACGGTGTCACGGCGCGTAGTCGAGTGGATT
CAACGGTTCGACAGTCCTGTAATTGTCTTTGAAGACCTCAAGGACATGAGAGACGACATTGAGTACGGGACGCGAATGAACCGGCGACTCCATTCACTAC
CGTTCGCCAAACTGCGGGACTTCATCACGTACAAGGCCGCATGGCGCGGCATTCCGTCAGATGACGTTGACCCGGAGTACACCAGCCAACAGTGTCCGGT
CTGTGGACACACAGAACGGGCGAACCGTCACAAGAAACGGTTCAAGTGCTGTGAGTGCGAGCATCAAGACCACGCTGACCGTGGTGCTGGTATCAGTGTC
GCACAGAAATGGTTGAGAACACAAGAGGATAGAAATGTGCCTGCTCTCAACACACTCCCGCAGGTGCGGAAATGGGAGTTGCGACGGCAGGCATCGGGGC
CTGTGGACGGCCCGACCGTGACCCACCACACCGAGAGAGGCCACCAGACCGATGGTGTGTCGGGTGTGTCCGACCAATCCACGGGAAGAAGCTCCGGGGC
TTGACCCCGGAGCAGTTCAC
GGCTAGATACTTCACCAGCCGTTCCGATGTAGTATGATTCTTCCCAGAATCCGCCACCCCACAAGTAGTTTTCCAAGAGGTGTTCGTGTTGCTGCCACAT
CTCTCTGGCCGTGATGCTCTTGATTTTTCGTGCAATCTTACTTGGAGAATATTTTGGATGTGCTGAAAGAAACAAGTGAACGTGGTCTGGCGAGATATGC
AATGAGGGTATTTCATAATTGTACTCGCCGCAAACGTCTTTGAACGATGCTTCTAACGACTGTTCTATCTCATCAAGCACTGAGTGACGGTACTGAGCGG
GATCGGAGATCCCGCGAGGTACGCAAATCTTCGATTTGCTTGACTTCGGACACCACACGAAGTGGTAATTGATGTTGTAAACCGTGTGATTTGACCGTGT
TTCGCCCATATCGGCTAGAGTCATGCCAAAAAATTAAGTATAAGCGTATCTAAGTATATACTGTAGAGATGACCAAACGCCTCACCGTAGACTTCGAGGA
CGATTTGTACAAGGAGTTCTCGAAGAAATGTATCGACGTTGAACGTCCGAAATCAGAAGTCGTGCGTGAACTTGTCCAAGACTGGGTAAACGACCAAGAA
TGACGCAGACACAGGCTCTCACCAAGACGCTCGTGTTTCCGCTTGACGTGCAGAGTGGCAACGAGAGCCTGCTTCACGACGCCCGCTTGGAGTGTCGCCG
CGTGTTCAACGAGGTGCTACGTCTCAACTACGACGGGTGGAACTGGGACAAGATAGAAAACGTTGTTGAGCAGAACGCTGACCTCGTGCAAAACACCGCA
CAGCGTATCATCGACAAAGCCTTTGACGCACTCGATAACTACTACGACTACGACGACTGGGGACGACCGTGGTACAAACACGAGACGTTCCCACTGCGGA
TGAACTACGGCGAAGGCTACAACCTCTTTCTCGAAGACGAAACGGTACGGTTCCGCATTTCAGCAAAGCCGTACAATCACGTCAAAGGCGAGCTTCGCGG
TACGCAAGACCAATTCGACCTGCTCAGACAAGCAATCGAAGATGATGACTGGCACGTTGGCACTGCTGAGGCGCTTGTCCGACACGGACGCGAAGAGTTG
CACGCCACCGTCACGAACGAAACTGCCGAAGTCGGTGCGAAAACAACGACAGAAACGGTCACCGGCGTCGATATTAACGAAGACTGTGTTGCGCTAGCGG
CACTGACTGAACACGGTATCGAAGATTCCGTCGTCATTGACTACCCAGAAATCAAGGAACAACGCCACCGGTACTTCACGATGCGGAAACGCATGCAGGA
AGCCGGACAGACCGGGTTTAACGATGTGTTCCGCGACAAGGAACAACAGTTCGTTCACGACCAGCTACACACGGTGTCACGGCGCGTAGTCGAGTGGATT
CAACGGTTCGACAGTCCTGTAATTGTCTTTGAAGACCTCAAGGACATGAGAGACGACATTGAGTACGGGACGCGAATGAACCGGCGACTCCATTCACTAC
CGTTCGCCAAACTGCGGGACTTCATCACGTACAAGGCCGCATGGCGCGGCATTCCGTCAGATGACGTTGACCCGGAGTACACCAGCCAACAGTGTCCGGT
CTGTGGACACACAGAACGGGCGAACCGTCACAAGAAACGGTTCAAGTGCTGTGAGTGCGAGCATCAAGACCACGCTGACCGTGGTGCTGGTATCAGTGTC
GCACAGAAATGGTTGAGAACACAAGAGGATAGAAATGTGCCTGCTCTCAACACACTCCCGCAGGTGCGGAAATGGGAGTTGCGACGGCAGGCATCGGGGC
CTGTGGACGGCCCGACCGTGACCCACCACACCGAGAGAGGCCACCAGACCGATGGTGTGTCGGGTGTGTCCGACCAATCCACGGGAAGAAGCTCCGGGGC
TTGACCCCGGAGCAGTTCAC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
450 bp | 149 aa | 509 | 60 | - | No |
Chemistry : Y1
ORF sequence :
MGETRSNHTVYNINYHFVWCPKSSKSKICVPRGISDPAQYRHSVLDEIEQSLEASFKDVCGEYNYEIPSLHISPDHVHLFLSAHPKYSPSKIARKIKSIT
AREMWQQHEHLLENYLWGGGFWEESYYIGTAGEVSSQTIEQYIERTEHV
AREMWQQHEHLLENYLWGGGFWEESYYIGTAGEVSSQTIEQYIERTEHV
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1305 bp | 434 aa | 700 | 2004 | + | No |
AG : TnpB
ORF sequence :
MTQTQALTKTLVFPLDVQSGNESLLHDARLECRRVFNEVLRLNYDGWNWDKIENVVEQNADLVQNTAQRIIDKAFDALDNYYDYDDWGRPWYKHETFPLR
MNYGEGYNLFLEDETVRFRISAKPYNHVKGELRGTQDQFDLLRQAIEDDDWHVGTAEALVRHGREELHATVTNETAEVGAKTTTETVTGVDINEDCVALA
ALTEHGIEDSVVIDYPEIKEQRHRYFTMRKRMQEAGQTGFNDVFRDKEQQFVHDQLHTVSRRVVEWIQRFDSPVIVFEDLKDMRDDIEYGTRMNRRLHSL
PFAKLRDFITYKAAWRGIPSDDVDPEYTSQQCPVCGHTERANRHKKRFKCCECEHQDHADRGAGISVAQKWLRTQEDRNVPALNTLPQVRKWELRRQASG
PVDGPTVTHHTERGHQTDGVSGVSDQSTGRSSGA
MNYGEGYNLFLEDETVRFRISAKPYNHVKGELRGTQDQFDLLRQAIEDDDWHVGTAEALVRHGREELHATVTNETAEVGAKTTTETVTGVDINEDCVALA
ALTEHGIEDSVVIDYPEIKEQRHRYFTMRKRMQEAGQTGFNDVFRDKEQQFVHDQLHTVSRRVVEWIQRFDSPVIVFEDLKDMRDDIEYGTRMNRRLHSL
PFAKLRDFITYKAAWRGIPSDDVDPEYTSQQCPVCGHTERANRHKKRFKCCECEHQDHADRGAGISVAQKWLRTQEDRNVPALNTLPQVRKWELRRQASG
PVDGPTVTHHTERGHQTDGVSGVSDQSTGRSSGA
Blast result :
Comments
ISNph7 is 79% (ORFA : the transposase) and 74% (ORFB) aa similar to ISHma7.
References
1] Falb M., Pfeiffer F., Palm P., Rodewald K., Hickmann V., Tittor J., Oesterhelt D. (2005)
Genome Res. 15: 1336-1343
Genome Res. 15: 1336-1343