ISTni4
- Family ISNCY
- Group ISDol1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP003989 | ND | Thioalkalivibrio nitratireducens | Thioalkalivibrio nitratireducens DSM 14787 |
DNA section
IS Length : 1909 bp
Ends
IR Length : 14/18
IRL : GTGCCTGCACGAGAAGCCAATAAAACGTGCAAAATCAATGGGGTGGACGA
IRR : GTGCCAGCACGGAAACCCTGCTAATTTTAAAATTGCTGGACCGCCGCAAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCGTATTCCAGTGTACCGTTATATA | TACACTACAAAGCCGCGCGT | 0 |
DNA sequence
GTGCCTGCACGAGAAGCCAATAAAACGTGCAAAATCAATGGGGTGGACGATTGTCGCGGCGTGAAGCGGGAGCCGGAGGTAGCGCCTTGCGCTGCGGTGT
GGAGACGGTAACATGCTGATCGGCAATGGGTCCCGGTGCGCCGGGTCGCGACGACGGGGGCGGTGGCCTGTTCCCGCTTCGGTTTCCAACACGATCTTCC
CCGGATGAAAGCACGATGCGCAAGGTGATCGAACCGCAAATGCAGTTGGGAGAACTCCCGATCGGCGGAATTGAACTGGATCCCCGATCCCGCGATGACA
TTCCCCAGATCCTGCGCGGACTGCAGCACCTCTACACGACCCCGGAGGTGCGTGCGGAGGTTTTCGCCATTCTGGAGGAGCTGGTACCCGAGCGCAGCAC
CGAGACGGGCCCCGAGAAGGTCGACGTCGAGAACGGCCGGCCGGGGATGTCGCAGTGGAACCTGCTGGTGCTCGGGGTGTTGCGCCTGGGCTTGAATGCC
GATTACGACCGGATCCGGGAACTGGCGAATGAGCACCGGACGATTCGGCAGTTTCTCGGGCACAGTGGCTGGGACGACGACACGTCGTACGGCTTGCAGA
CGGTGCGCGACAACCTGAGCCGGTTCACCCCCGAGGTGCTGGACCGGATCAACCAGGTGGTGGTACGGGCCGGGCACCGGGCGCTAAAAAAAAGCCTGGC
GGACGGTCTCGTCGGGCGCTGTGACTCGTTCGTGGTGGAGACCGATGTCCACTACCCGACCGACACCAACCTGCTGCTGGACGCGATTCGCAAGGTGATT
GGGCTGAGTGCTGAGTTGGCCGCGGCGAACGCTCGGACCGAGTGGCGCCAGCATGCCTACCACCAGCGGCAGTTCAAGCGCGCCTACCGCCGGCTGACGC
GCCTGCGGCATTCGACCTCGCAGGATCCGGAGCGGCAGCAGGCGCGGACCGAGGCGATCGCGCAGGCGTGCCGGGACTACCTGGCGGCGGCGGAGCTGCA
CCTGCAGCGCTCAGCCGCGACCGGTGCGGGCGTGGCCGCGGTCGATCCCGCCAATTTCGTGCTGCTGCAGGAGATTGCGGTGTTCAGCGACCATGCCCGG
CGCCAGATCGACCAGATCCGCCGGCGTATCCTGGAGGGCGAACGCATCCCGCATGAGGAGAAGGTGTTTTCCCTGTTTGAGCCGCACACGGAGTGGATCA
GCAAGGGCAAGGCGGGGGTCCCGGTGGAGCTGGGCCTGCGCGTATGCGTGATGGTCGAGGCGGACGGGTTCATCCTGCACCACCGGGTGATGGAACGCTG
CACCGACGATGCGGTCGCGGTGCCCATGGTCCAGGAGACCCGGGACCGCTTCCCGCTGGTGCGCGCGGTGAGCATGGACAAGGGGTTCCACAGCCCGAGC
AACCAGACCGAGCTACGCCAGATCGTCGGGACCGTGGTGTTGCCGAAGAAGGGGCGGTGCACCCAGGAAGAAGCGGAACGGGAGCGGGATCCCGAGTTCG
CGACCTTGCGCCGGCAGCATGCCGCGGTCGAGTCGGCGATCAACGCGCTGGAGGTGCACGGCCTGGACCGCTGCCGGGATCACGGGATCGACGGCTTTCG
TCGGTACGTGGGCCTGGCGGTGCTGGCCCGTAACATCCAGCACCTCGGTGCCATCCTGCGGCGCCAGGAGCGGGACGCGGAGAAACGACGACGTGGCCCC
TACCGCAAAGCGGCTTGATCGACGCCGAACATCGCACAGGGCTGGCCACACCCAGGGCTTCCTGTGCCCGCGAATCGCCCCCGTTGCGAGCGACTCTCAT
CCAGGTGCCCAACACAAACCTCAGCCTGGCGATTCCTGAGCACTCCGGCCGCCCGTGGATTTGCGGCGGTCCAGCAATTTTAAAATTAGCAGGGTTTCCG
TGCTGGCAC
GGAGACGGTAACATGCTGATCGGCAATGGGTCCCGGTGCGCCGGGTCGCGACGACGGGGGCGGTGGCCTGTTCCCGCTTCGGTTTCCAACACGATCTTCC
CCGGATGAAAGCACGATGCGCAAGGTGATCGAACCGCAAATGCAGTTGGGAGAACTCCCGATCGGCGGAATTGAACTGGATCCCCGATCCCGCGATGACA
TTCCCCAGATCCTGCGCGGACTGCAGCACCTCTACACGACCCCGGAGGTGCGTGCGGAGGTTTTCGCCATTCTGGAGGAGCTGGTACCCGAGCGCAGCAC
CGAGACGGGCCCCGAGAAGGTCGACGTCGAGAACGGCCGGCCGGGGATGTCGCAGTGGAACCTGCTGGTGCTCGGGGTGTTGCGCCTGGGCTTGAATGCC
GATTACGACCGGATCCGGGAACTGGCGAATGAGCACCGGACGATTCGGCAGTTTCTCGGGCACAGTGGCTGGGACGACGACACGTCGTACGGCTTGCAGA
CGGTGCGCGACAACCTGAGCCGGTTCACCCCCGAGGTGCTGGACCGGATCAACCAGGTGGTGGTACGGGCCGGGCACCGGGCGCTAAAAAAAAGCCTGGC
GGACGGTCTCGTCGGGCGCTGTGACTCGTTCGTGGTGGAGACCGATGTCCACTACCCGACCGACACCAACCTGCTGCTGGACGCGATTCGCAAGGTGATT
GGGCTGAGTGCTGAGTTGGCCGCGGCGAACGCTCGGACCGAGTGGCGCCAGCATGCCTACCACCAGCGGCAGTTCAAGCGCGCCTACCGCCGGCTGACGC
GCCTGCGGCATTCGACCTCGCAGGATCCGGAGCGGCAGCAGGCGCGGACCGAGGCGATCGCGCAGGCGTGCCGGGACTACCTGGCGGCGGCGGAGCTGCA
CCTGCAGCGCTCAGCCGCGACCGGTGCGGGCGTGGCCGCGGTCGATCCCGCCAATTTCGTGCTGCTGCAGGAGATTGCGGTGTTCAGCGACCATGCCCGG
CGCCAGATCGACCAGATCCGCCGGCGTATCCTGGAGGGCGAACGCATCCCGCATGAGGAGAAGGTGTTTTCCCTGTTTGAGCCGCACACGGAGTGGATCA
GCAAGGGCAAGGCGGGGGTCCCGGTGGAGCTGGGCCTGCGCGTATGCGTGATGGTCGAGGCGGACGGGTTCATCCTGCACCACCGGGTGATGGAACGCTG
CACCGACGATGCGGTCGCGGTGCCCATGGTCCAGGAGACCCGGGACCGCTTCCCGCTGGTGCGCGCGGTGAGCATGGACAAGGGGTTCCACAGCCCGAGC
AACCAGACCGAGCTACGCCAGATCGTCGGGACCGTGGTGTTGCCGAAGAAGGGGCGGTGCACCCAGGAAGAAGCGGAACGGGAGCGGGATCCCGAGTTCG
CGACCTTGCGCCGGCAGCATGCCGCGGTCGAGTCGGCGATCAACGCGCTGGAGGTGCACGGCCTGGACCGCTGCCGGGATCACGGGATCGACGGCTTTCG
TCGGTACGTGGGCCTGGCGGTGCTGGCCCGTAACATCCAGCACCTCGGTGCCATCCTGCGGCGCCAGGAGCGGGACGCGGAGAAACGACGACGTGGCCCC
TACCGCAAAGCGGCTTGATCGACGCCGAACATCGCACAGGGCTGGCCACACCCAGGGCTTCCTGTGCCCGCGAATCGCCCCCGTTGCGAGCGACTCTCAT
CCAGGTGCCCAACACAAACCTCAGCCTGGCGATTCCTGAGCACTCCGGCCGCCCGTGGATTTGCGGCGGTCCAGCAATTTTAAAATTAGCAGGGTTTCCG
TGCTGGCAC
Protein section
ORF number : 1
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1503 bp | 500 aa | 216 | 1718 | + | No |
Chemistry : DDE
ORF sequence :
MRKVIEPQMQLGELPIGGIELDPRSRDDIPQILRGLQHLYTTPEVRAEVFAILEELVPERSTETGPEKVDVENGRPGMSQWNLLVLGVLRLGLNADYDRI
RELANEHRTIRQFLGHSGWDDDTSYGLQTVRDNLSRFTPEVLDRINQVVVRAGHRALKKSLADGLVGRCDSFVVETDVHYPTDTNLLLDAIRKVIGLSAE
LAAANARTEWRQHAYHQRQFKRAYRRLTRLRHSTSQDPERQQARTEAIAQACRDYLAAAELHLQRSAATGAGVAAVDPANFVLLQEIAVFSDHARRQIDQ
IRRRILEGERIPHEEKVFSLFEPHTEWISKGKAGVPVELGLRVCVMVEADGFILHHRVMERCTDDAVAVPMVQETRDRFPLVRAVSMDKGFHSPSNQTEL
RQIVGTVVLPKKGRCTQEEAERERDPEFATLRRQHAAVESAINALEVHGLDRCRDHGIDGFRRYVGLAVLARNIQHLGAILRRQERDAEKRRRGPYRKAA
RELANEHRTIRQFLGHSGWDDDTSYGLQTVRDNLSRFTPEVLDRINQVVVRAGHRALKKSLADGLVGRCDSFVVETDVHYPTDTNLLLDAIRKVIGLSAE
LAAANARTEWRQHAYHQRQFKRAYRRLTRLRHSTSQDPERQQARTEAIAQACRDYLAAAELHLQRSAATGAGVAAVDPANFVLLQEIAVFSDHARRQIDQ
IRRRILEGERIPHEEKVFSLFEPHTEWISKGKAGVPVELGLRVCVMVEADGFILHHRVMERCTDDAVAVPMVQETRDRFPLVRAVSMDKGFHSPSNQTEL
RQIVGTVVLPKKGRCTQEEAERERDPEFATLRRQHAAVESAINALEVHGLDRCRDHGIDGFRRYVGLAVLARNIQHLGAILRRQERDAEKRRRGPYRKAA
Blast result :
Comments
ISTni4 is 74% aa similar to ISDeba1.
This IS is disrupted by ISTni3 (IS1595 family). It was reconstructed in silico by deletion of the ISTni3 sequence and of one of its direct repeat.
This IS is disrupted by ISTni3 (IS1595 family). It was reconstructed in silico by deletion of the ISTni3 sequence and of one of its direct repeat.
References
1] ISfinder annotation (2016)
2] Tikhonova,T.V., Pavlov,A.R., Beletsky,A.V., Mardanov,A.V., Sorokin,D.Y., Ravin,N.V. and Popov,V.O. (2015) Direct submission GenBank.
2] Tikhonova,T.V., Pavlov,A.R., Beletsky,A.V., Mardanov,A.V., Sorokin,D.Y., Ravin,N.V. and Popov,V.O. (2015) Direct submission GenBank.