ISCph4
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_AAIC01000403 | ND | Chlorobium phaeobacteroides | Chlorobium phaeobacteroides BS1 |
DNA section
IS Length : 2385 bp
Ends
IR Length : 21/27
IRL : GTATCCGCCCGACCAACCGCATATTTTGCCTGACAGATTTCCACTTTACT
IRR : GTATCCGCCCCACCAACCCCAGTCTGTCAAAGAAAATTTTTGTCGATCAG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTGGCTGGCG | ATCGCAGCCA | 0 |
DNA sequence
GTATCCGCCCGACCAACCGCATATTTTGCCTGACAGATTTCCACTTTACTTTTGTCTGAAAAAATCAGATGAAGAATCAGGATAGTAACATGTACCCCAA
AGTAGAAGCCTGGAAGCAAAGTGGTATGACGATGGCTGATTATGCACAAAGCATCGGCATGAACCGGACAGCCTTTGAGTATTGGGTTAGAAAATTCCGT
AATGCGCAAAACAAGAAAGATGCCGGATTTGTTGAGATCTTTCCCTCAGCTGCAAAGGCAAATCAGCAGAGGCCATTGGCGCCAAGAGCTATTGAGCAGT
CGCATAGTGAGATCGTTTTCAGCTTTGCCAATGGCATGAGCATTAAGGTGAGCTTTTAGCATGTTAGGTTTAAGCAACAAACTGCGCTATTTTCTGTGCG
TCCAACCTACAGATATGCGCAATGGCTTTGACGGACTAGCCGGTATTGTGCGTAATTATTTAATTCAAGACCCCATTTCAGGCGATGTTTTCGTGTTTTT
GAACAAGACACGCACCCACATCAAGCTGCTATACTGGGATGGCGATGGGTTCGCTTTGTATTACAAGCGATTGGAGAAGGGTAGATATAAGGCTCAGAGA
TCTGTTAACGGGCAATCATTGGAGCTCAAAAGAGATGAGCTTATGATGCTTTTGGAAGGTCTATCGATTGGCGATATGAGAAAAAGCAAGCGGTTTAAAA
TTGGATAAAAAAGACGGAATAATTCTCTGAACCCCCTGTCAATTCAGTGTTTTTGTGTATATTTGACACATGAACACACCTCAAGAAGTAGCCCATACTG
ATCCTAAAACACTCGCCCAAAGAGTGGCTTTGCTCGAGGCTACACTTGCCCAGAAAGAGCTTCTGTTGGAAGAGAAAGATATTCTACTTGAGGAGAAAGA
AACATCCATTTCCAACCTGAACGCATCCCTTGAAGCAGAGAAGTTTAAATATGCCCAGTTGCAGCGGTTAATCTTTGGTTCTAAACGCGAACGCTTTGTG
TCGTCTTTTAGCGCCGAACAGCTGCGTTTGGAGTTTGAGCCCAAGACCATCGAGATTGAGCAGGCAGTAGAGGCAGAACGCGAAACGATCAGAGTTGCCT
ATGAGCGCCAGAAGACAAAAAAGCCACATCCCGGACGCATGCCATTGCCTTCGCACCTGCCCGTTGTTGAGATCATCCTTGAACCAGAAGAAGACACAAC
CGGAATGGTATGTATTGGAAAAGAAGTGACCGAAGAACTGGATTTAACACCGGCCAAATTGCAAGTAAACCGCTATATCCGCTTCAAATACATCACCTCG
GAAGACGACAAGGCCAATCAGCGCCAGGTCATAGCCCCGCTTAACAGACCCATCAACAAATGCATTGCTAGTGCTGCATTGCTGGCAGCCATTTTCGTTG
ATAAATTTGTTTATCACCTCCCGTATTATCGGTTTCTTCAGCGCTTAAGCCAGGAAAAGGTGCACATACCCAAGAGCACCTTTGAATCGTGGGTCAAGCT
GGGGGCCGACTTAATCAGGCCGCTCTACCAGGTGCATCGCCTGTATGTATTCAGTCAGATCTACCAGCAGATTGACGAATCTCCCATCAAGGTTCAGGAA
ACCGAAAAACCAGGCTCCTTGCATCAGGGATATATGTGGGTAAGGTATGGTCCATTGACTAAAACCGTATTGTTTGAATATCACTATGGCCGATCAAAAG
AAAGTCCGCTGCGCGATTTATCTTCCTTCAAGGGCTATATCCAAACCGATGGGTATAGTGCTTACACACACCTTGCGCAAACAATGGGGATTACACACCT
TTCATGCTGGGTGCATGCCCGTCGCTATTTTGACCAGGCATTGTCCAATGATCGGCAGCGCGCCTCAAAAATACTCAAGCTCATACAGGTACTGTATGCC
GTGGAGGCGCTTGCCAGAGAGCAAAACATGACTGCGGGGCAACGTCATGAGCTACGACTGGAAAAATCATTACCTGTTATCAACGAAATTGGCGAGTATA
TTTATCAGCAAAGAGACAAAGTCTTGCCCAAGAGTCCCATCGGAACAGCTTTTAATTATTGCGCCAACAGGTGGGTCAGCCTGCAAAATTACTTAAATGA
CGGTATGCTGGAAATCGACAACAACCTGATTGAAAACTCTATCCGACCGCTGGCTTTAGGCCGCAAAAACTACCTCTTTGCCGGAAGTCATCAGGCAGCT
CAGGACATCGCTATGTTTTACAGTTTCTTCGGAACCTGCAAACAGCATGGTATCGACCCACAGAAATGGCTCACCTATGTGATCAACAATATCAACGATA
CTAAACCTTCGCAATATCACACCCTGCTGCCTCATCTGATCGACAAAAATTTTCTTTGACAGACTGGGGTTGGTGGGGCGGATAC
AGTAGAAGCCTGGAAGCAAAGTGGTATGACGATGGCTGATTATGCACAAAGCATCGGCATGAACCGGACAGCCTTTGAGTATTGGGTTAGAAAATTCCGT
AATGCGCAAAACAAGAAAGATGCCGGATTTGTTGAGATCTTTCCCTCAGCTGCAAAGGCAAATCAGCAGAGGCCATTGGCGCCAAGAGCTATTGAGCAGT
CGCATAGTGAGATCGTTTTCAGCTTTGCCAATGGCATGAGCATTAAGGTGAGCTTTTAGCATGTTAGGTTTAAGCAACAAACTGCGCTATTTTCTGTGCG
TCCAACCTACAGATATGCGCAATGGCTTTGACGGACTAGCCGGTATTGTGCGTAATTATTTAATTCAAGACCCCATTTCAGGCGATGTTTTCGTGTTTTT
GAACAAGACACGCACCCACATCAAGCTGCTATACTGGGATGGCGATGGGTTCGCTTTGTATTACAAGCGATTGGAGAAGGGTAGATATAAGGCTCAGAGA
TCTGTTAACGGGCAATCATTGGAGCTCAAAAGAGATGAGCTTATGATGCTTTTGGAAGGTCTATCGATTGGCGATATGAGAAAAAGCAAGCGGTTTAAAA
TTGGATAAAAAAGACGGAATAATTCTCTGAACCCCCTGTCAATTCAGTGTTTTTGTGTATATTTGACACATGAACACACCTCAAGAAGTAGCCCATACTG
ATCCTAAAACACTCGCCCAAAGAGTGGCTTTGCTCGAGGCTACACTTGCCCAGAAAGAGCTTCTGTTGGAAGAGAAAGATATTCTACTTGAGGAGAAAGA
AACATCCATTTCCAACCTGAACGCATCCCTTGAAGCAGAGAAGTTTAAATATGCCCAGTTGCAGCGGTTAATCTTTGGTTCTAAACGCGAACGCTTTGTG
TCGTCTTTTAGCGCCGAACAGCTGCGTTTGGAGTTTGAGCCCAAGACCATCGAGATTGAGCAGGCAGTAGAGGCAGAACGCGAAACGATCAGAGTTGCCT
ATGAGCGCCAGAAGACAAAAAAGCCACATCCCGGACGCATGCCATTGCCTTCGCACCTGCCCGTTGTTGAGATCATCCTTGAACCAGAAGAAGACACAAC
CGGAATGGTATGTATTGGAAAAGAAGTGACCGAAGAACTGGATTTAACACCGGCCAAATTGCAAGTAAACCGCTATATCCGCTTCAAATACATCACCTCG
GAAGACGACAAGGCCAATCAGCGCCAGGTCATAGCCCCGCTTAACAGACCCATCAACAAATGCATTGCTAGTGCTGCATTGCTGGCAGCCATTTTCGTTG
ATAAATTTGTTTATCACCTCCCGTATTATCGGTTTCTTCAGCGCTTAAGCCAGGAAAAGGTGCACATACCCAAGAGCACCTTTGAATCGTGGGTCAAGCT
GGGGGCCGACTTAATCAGGCCGCTCTACCAGGTGCATCGCCTGTATGTATTCAGTCAGATCTACCAGCAGATTGACGAATCTCCCATCAAGGTTCAGGAA
ACCGAAAAACCAGGCTCCTTGCATCAGGGATATATGTGGGTAAGGTATGGTCCATTGACTAAAACCGTATTGTTTGAATATCACTATGGCCGATCAAAAG
AAAGTCCGCTGCGCGATTTATCTTCCTTCAAGGGCTATATCCAAACCGATGGGTATAGTGCTTACACACACCTTGCGCAAACAATGGGGATTACACACCT
TTCATGCTGGGTGCATGCCCGTCGCTATTTTGACCAGGCATTGTCCAATGATCGGCAGCGCGCCTCAAAAATACTCAAGCTCATACAGGTACTGTATGCC
GTGGAGGCGCTTGCCAGAGAGCAAAACATGACTGCGGGGCAACGTCATGAGCTACGACTGGAAAAATCATTACCTGTTATCAACGAAATTGGCGAGTATA
TTTATCAGCAAAGAGACAAAGTCTTGCCCAAGAGTCCCATCGGAACAGCTTTTAATTATTGCGCCAACAGGTGGGTCAGCCTGCAAAATTACTTAAATGA
CGGTATGCTGGAAATCGACAACAACCTGATTGAAAACTCTATCCGACCGCTGGCTTTAGGCCGCAAAAACTACCTCTTTGCCGGAAGTCATCAGGCAGCT
CAGGACATCGCTATGTTTTACAGTTTCTTCGGAACCTGCAAACAGCATGGTATCGACCCACAGAAATGGCTCACCTATGTGATCAACAATATCAACGATA
CTAAACCTTCGCAATATCACACCCTGCTGCCTCATCTGATCGACAAAAATTTTCTTTGACAGACTGGGGTTGGTGGGGCGGATAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
291 bp | 96 aa | 69 | 359 | + | No |
AG : IS66 TnpA
ORF sequence :
MKNQDSNMYPKVEAWKQSGMTMADYAQSIGMNRTAFEYWVRKFRNAQNKKDAGFVEIFPSAAKANQQRPLAPRAIEQSHSEIVFSFANGMSIKVSF
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
348 bp | 115 aa | 361 | 708 | + | No |
AG : IS66 TnpB
ORF sequence :
MLGLSNKLRYFLCVQPTDMRNGFDGLAGIVRNYLIQDPISGDVFVFLNKTRTHIKLLYWDGDGFALYYKRLEKGRYKAQRSVNGQSLELKRDELMMLLEG
LSIGDMRKSKRFKIG
LSIGDMRKSKRFKIG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1590 bp | 529 aa | 770 | 2359 | + | No |
Chemistry : DDE
ORF sequence :
MNTPQEVAHTDPKTLAQRVALLEATLAQKELLLEEKDILLEEKETSISNLNASLEAEKFKYAQLQRLIFGSKRERFVSSFSAEQLRLEFEPKTIEIEQAV
EAERETIRVAYERQKTKKPHPGRMPLPSHLPVVEIILEPEEDTTGMVCIGKEVTEELDLTPAKLQVNRYIRFKYITSEDDKANQRQVIAPLNRPINKCIA
SAALLAAIFVDKFVYHLPYYRFLQRLSQEKVHIPKSTFESWVKLGADLIRPLYQVHRLYVFSQIYQQIDESPIKVQETEKPGSLHQGYMWVRYGPLTKTV
LFEYHYGRSKESPLRDLSSFKGYIQTDGYSAYTHLAQTMGITHLSCWVHARRYFDQALSNDRQRASKILKLIQVLYAVEALAREQNMTAGQRHELRLEKS
LPVINEIGEYIYQQRDKVLPKSPIGTAFNYCANRWVSLQNYLNDGMLEIDNNLIENSIRPLALGRKNYLFAGSHQAAQDIAMFYSFFGTCKQHGIDPQKW
LTYVINNINDTKPSQYHTLLPHLIDKNFL
EAERETIRVAYERQKTKKPHPGRMPLPSHLPVVEIILEPEEDTTGMVCIGKEVTEELDLTPAKLQVNRYIRFKYITSEDDKANQRQVIAPLNRPINKCIA
SAALLAAIFVDKFVYHLPYYRFLQRLSQEKVHIPKSTFESWVKLGADLIRPLYQVHRLYVFSQIYQQIDESPIKVQETEKPGSLHQGYMWVRYGPLTKTV
LFEYHYGRSKESPLRDLSSFKGYIQTDGYSAYTHLAQTMGITHLSCWVHARRYFDQALSNDRQRASKILKLIQVLYAVEALAREQNMTAGQRHELRLEKS
LPVINEIGEYIYQQRDKVLPKSPIGTAFNYCANRWVSLQNYLNDGMLEIDNNLIENSIRPLALGRKNYLFAGSHQAAQDIAMFYSFFGTCKQHGIDPQKW
LTYVINNINDTKPSQYHTLLPHLIDKNFL
Blast result :
Comments
ISCph4 (orf1) is 54% aa similar to ISDpr4 (orf1).
ISCph4 (orf2) is 61% aa similar to ISBlma9 (orf2).
ISCph4 (orf3) is 61% aa similar to ISPto8 (orf3).
ISCph4 (orf2) is 61% aa similar to ISBlma9 (orf2).
ISCph4 (orf3) is 61% aa similar to ISPto8 (orf3).
References
1] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,C., Glavina,T., Hammon,N., Israni,S., Pitluck,S. and Richardson,P. (2005) Direct submission GenBank.