ISApr6
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Alpha proteobacterium | Alpha proteobacterium BAL199 |
DNA section
IS Length : 2479 bp
Ends
IR Length : 18/24
IRL : GTAAGCGGCGTTCTGACCCCACCTACCGCTGATCGCTGATCCGGCGGAGT
IRR : GTAAGCGGTGTTCTGCTCACACGTCGGCGTTGGCGCGGTAGGCCCACGGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GAAATCTGCC | GATCCAGA | CGGACTCACC | 8 |
DNA sequence
GTAAGCGGCGTTCTGACCCCACCTACCGCTGATCGCTGATCCGGCGGAGTCTGCCGACATGGAGGCAAGATCCATGTCGAAGCCTGAGCTTATTGCTATG
TCGGAGCCGGTGCAGCGCTTCGAGGTGTTCACCGGCAGCGGCCGTCGGCGGCGCTGGCCTGACGAGGTGAAGGCGCTGATCGTCGCGGAGAGCTGCCGTC
CCGGGGAGACGGTGTGCGGGGTGGCTCGGCGTCATGGTCTGTTGCCGCAGCAGCTGTTTGCCTGGCGCCGGCTGGCGCGACGCAATGGTGTGGCGAGGGA
TGCCGTGCCGTTGTTCGCCGAGGTGGTCGCGGACGCTGTCGCCGGCAATTGGGCAGGGGCAGCGGCGGTAGGATCCGCGGACATCGTGATCGAGCTTGGC
GATTTGCGGTTGCGGCTGGGGTCGGGCGTGGCGGCAGACCGGGCGGCGGCACTGGTGTCGGCGCTGCGGGCGGCTCTGGCGGTTGGGCGATGATCCTGCC
GGGCGGGCCGCTGCGGGTCTATGTGGCGACGCGGCCGGTGGACTTCCGCAAGGGGATGGACGGGCTGGCGGCCCTGGCTGCGGCCGAACTGCGGCTGGAT
GCGTTCTCCGGGGTGGTGCTGGTGTTCCGGGCCAAGCGGGCGGACCGGGTGAAGATCCTGGTGTGGGACGGCAGCGGCCTTGTGCTGATCGCCAAGCGGC
TGGAGCAGGGCAAGTTCGCCTGGCCGGGCGTTCGCGACGGGGTGATGCGGCTGTCGTCGGCGCAGCTGGCGGCGCTGTTCGAGGGGCTCGACTGGCGGCG
GGTGTACACAGCGCGCACCGTGCGGCCGACGGCGGCCGGATAAGCTGCGGCGAAGTGACTCGGGGCGACTGATCGCAGGTCCCCGAGTGCCTCAGGGCAT
GATTCAATGGCGGCCATGACCGACGCCACCGCAGTCCTGGAGGCTTGCGAACTGCCGGACGACCCGGCGGCGTTGAAGGCGCTGCTGATCACCGAGCGTA
GCCTGCGGGCAGAGTTGGGCGCGGAGGTGGTGCGGCTGTCCGCGATCGTGGCGGCGTTCAAGCGGGCGCTGTTCGGCCGGCGCTCGGAGAAGTACGACCC
CGAGCAATTGGACTTGGCGCTGGAGGACGTCGAGCAGGAGTTCGGCGAGCAGCGTGCCGAGGCCGACGCCGGCGATCCGGCGCTCAAGACCGAACGCGCG
CGTCGGCGCCGCGCCAACCGCGGGTCGCTGCCCAGGCATCTGCCGCGGGTCGAGGTGGTGGTCGAGCCGGATAGCCGGACCTGTGGGTGCTGCGGCGAGG
CGCTGCAGGCCATCGGCGAGGACGTCAGCGAGCGGCTCGACGTCATCCCGGCGCAGTTCCGGGTGATCGTCACCCGCCGACCGAAATATGCCTGCCGCGG
CTGTGCCGAGGGCGTGGTCCAGGCGGCCGCACCGGAGCGGCTGATCGCCGGCGGGCTGCCGACCGAGGCCATGGTCGCCCATGTGCTGGTCGCGAAGTAT
GCCGACCACACGCCGCTCTATCGCCAGGCGCAGATCTACGCCCGCCAGGGGATCGCCCTCGACCGCTCGACGCTGGCCGACTGGGTCGGCCACGCGGCGC
GCGAGCTGCGGCCGGTCCATGCCCGGCTGATCGAGATCATGAAGGCCTCGGCGGTCGTGTTCGCCGACGAGACCCGGGCTCCGGTGCTCGATCCTGGGCG
CGGCCGGACCAAGGAGGGCTGGCTGTGGGCGCTGGCGCGCGATCCGCGGCCCTGGGGCGGTGCCGATCCGCCGGCGGTCGCATATCGCTACGCTCCGGGA
CGAGGTGCCGAGCACGCCGAAGTTCATCTCGACGGGTTCAAGGGCATCCTGCAGGTCGACGGCTATGCCGCCTACCGGCGCCTGGCCGGCCCGACCCTGG
CGTTCTGCTGGTCGCATGCCCGGCGCAAGTTCTACGAGATCGCCGAGGCCGGCCACGCGCCGGTCGCCGAGGAGGCGCTGCGCCGGATCGCCGCCCTCTA
CGCCGTCGAGGCGGAGATCCGCGGCAGTGATCCACAGACCCGCCACGCCGAGCGCCAGGCCCGCTCGCGGCCGCTGGTCGAGGACCTTCGTCCCTGGCTC
GAGGCCCAGCTCAGTCGGCTGTCCGGCAAGTCCCGCCTCGCCGAGGCGATCCGCTACACCCTCAAGCTCTGGGACGGGCTCTCCCGCTTCCTCGACGACG
GCCGCATCGAGCTCGACACCAACACCGTCGAGCGCGCCATCCGCCCGATCGCGCTGAACCGCAAGAACGCCCTGTTCGCCGGTTCCGACGGTGGCGGCGA
GCACTGGGCCGTCATCGCCTCCTTGGTCGAGACCTGCAAGCTCAACGACCTCAACCCGCACACCTACCTGACCGACGTCCTCGAACGTCTCGTTGCCGGC
CATCAGCAGAGCCGGATCGACGACCTCATGCCGTGGGCCTACCGCGCCAACGCCGACGTGTGAGCAGAACACCGCTTAC
TCGGAGCCGGTGCAGCGCTTCGAGGTGTTCACCGGCAGCGGCCGTCGGCGGCGCTGGCCTGACGAGGTGAAGGCGCTGATCGTCGCGGAGAGCTGCCGTC
CCGGGGAGACGGTGTGCGGGGTGGCTCGGCGTCATGGTCTGTTGCCGCAGCAGCTGTTTGCCTGGCGCCGGCTGGCGCGACGCAATGGTGTGGCGAGGGA
TGCCGTGCCGTTGTTCGCCGAGGTGGTCGCGGACGCTGTCGCCGGCAATTGGGCAGGGGCAGCGGCGGTAGGATCCGCGGACATCGTGATCGAGCTTGGC
GATTTGCGGTTGCGGCTGGGGTCGGGCGTGGCGGCAGACCGGGCGGCGGCACTGGTGTCGGCGCTGCGGGCGGCTCTGGCGGTTGGGCGATGATCCTGCC
GGGCGGGCCGCTGCGGGTCTATGTGGCGACGCGGCCGGTGGACTTCCGCAAGGGGATGGACGGGCTGGCGGCCCTGGCTGCGGCCGAACTGCGGCTGGAT
GCGTTCTCCGGGGTGGTGCTGGTGTTCCGGGCCAAGCGGGCGGACCGGGTGAAGATCCTGGTGTGGGACGGCAGCGGCCTTGTGCTGATCGCCAAGCGGC
TGGAGCAGGGCAAGTTCGCCTGGCCGGGCGTTCGCGACGGGGTGATGCGGCTGTCGTCGGCGCAGCTGGCGGCGCTGTTCGAGGGGCTCGACTGGCGGCG
GGTGTACACAGCGCGCACCGTGCGGCCGACGGCGGCCGGATAAGCTGCGGCGAAGTGACTCGGGGCGACTGATCGCAGGTCCCCGAGTGCCTCAGGGCAT
GATTCAATGGCGGCCATGACCGACGCCACCGCAGTCCTGGAGGCTTGCGAACTGCCGGACGACCCGGCGGCGTTGAAGGCGCTGCTGATCACCGAGCGTA
GCCTGCGGGCAGAGTTGGGCGCGGAGGTGGTGCGGCTGTCCGCGATCGTGGCGGCGTTCAAGCGGGCGCTGTTCGGCCGGCGCTCGGAGAAGTACGACCC
CGAGCAATTGGACTTGGCGCTGGAGGACGTCGAGCAGGAGTTCGGCGAGCAGCGTGCCGAGGCCGACGCCGGCGATCCGGCGCTCAAGACCGAACGCGCG
CGTCGGCGCCGCGCCAACCGCGGGTCGCTGCCCAGGCATCTGCCGCGGGTCGAGGTGGTGGTCGAGCCGGATAGCCGGACCTGTGGGTGCTGCGGCGAGG
CGCTGCAGGCCATCGGCGAGGACGTCAGCGAGCGGCTCGACGTCATCCCGGCGCAGTTCCGGGTGATCGTCACCCGCCGACCGAAATATGCCTGCCGCGG
CTGTGCCGAGGGCGTGGTCCAGGCGGCCGCACCGGAGCGGCTGATCGCCGGCGGGCTGCCGACCGAGGCCATGGTCGCCCATGTGCTGGTCGCGAAGTAT
GCCGACCACACGCCGCTCTATCGCCAGGCGCAGATCTACGCCCGCCAGGGGATCGCCCTCGACCGCTCGACGCTGGCCGACTGGGTCGGCCACGCGGCGC
GCGAGCTGCGGCCGGTCCATGCCCGGCTGATCGAGATCATGAAGGCCTCGGCGGTCGTGTTCGCCGACGAGACCCGGGCTCCGGTGCTCGATCCTGGGCG
CGGCCGGACCAAGGAGGGCTGGCTGTGGGCGCTGGCGCGCGATCCGCGGCCCTGGGGCGGTGCCGATCCGCCGGCGGTCGCATATCGCTACGCTCCGGGA
CGAGGTGCCGAGCACGCCGAAGTTCATCTCGACGGGTTCAAGGGCATCCTGCAGGTCGACGGCTATGCCGCCTACCGGCGCCTGGCCGGCCCGACCCTGG
CGTTCTGCTGGTCGCATGCCCGGCGCAAGTTCTACGAGATCGCCGAGGCCGGCCACGCGCCGGTCGCCGAGGAGGCGCTGCGCCGGATCGCCGCCCTCTA
CGCCGTCGAGGCGGAGATCCGCGGCAGTGATCCACAGACCCGCCACGCCGAGCGCCAGGCCCGCTCGCGGCCGCTGGTCGAGGACCTTCGTCCCTGGCTC
GAGGCCCAGCTCAGTCGGCTGTCCGGCAAGTCCCGCCTCGCCGAGGCGATCCGCTACACCCTCAAGCTCTGGGACGGGCTCTCCCGCTTCCTCGACGACG
GCCGCATCGAGCTCGACACCAACACCGTCGAGCGCGCCATCCGCCCGATCGCGCTGAACCGCAAGAACGCCCTGTTCGCCGGTTCCGACGGTGGCGGCGA
GCACTGGGCCGTCATCGCCTCCTTGGTCGAGACCTGCAAGCTCAACGACCTCAACCCGCACACCTACCTGACCGACGTCCTCGAACGTCTCGTTGCCGGC
CATCAGCAGAGCCGGATCGACGACCTCATGCCGTGGGCCTACCGCGCCAACGCCGACGTGTGAGCAGAACACCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
396 bp | 131 aa | 98 | 493 | + | No |
AG : IS66 TnpA
ORF sequence :
MSEPVQRFEVFTGSGRRRRWPDEVKALIVAESCRPGETVCGVARRHGLLPQQLFAWRRLARRNGVARDAVPLFAEVVADAVAGNWAGAAAVGSADIVIEL
GDLRLRLGSGVAADRAAALVSALRAALAVGR
GDLRLRLGSGVAADRAAALVSALRAALAVGR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
354 bp | 117 aa | 490 | 843 | + | No |
AG : IS66 TnpB
ORF sequence :
MILPGGPLRVYVATRPVDFRKGMDGLAALAAAELRLDAFSGVVLVFRAKRADRVKILVWDGSGLVLIAKRLEQGKFAWPGVRDGVMRLSSAQLAALFEGL
DWRRVYTARTVRPTAAG
DWRRVYTARTVRPTAAG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1557 bp | 518 aa | 907 | 2463 | + | No |
Chemistry : DDE
ORF sequence :
MAAMTDATAVLEACELPDDPAALKALLITERSLRAELGAEVVRLSAIVAAFKRALFGRRSEKYDPEQLDLALEDVEQEFGEQRAEADAGDPALKTERARR
RRANRGSLPRHLPRVEVVVEPDSRTCGCCGEALQAIGEDVSERLDVIPAQFRVIVTRRPKYACRGCAEGVVQAAAPERLIAGGLPTEAMVAHVLVAKYAD
HTPLYRQAQIYARQGIALDRSTLADWVGHAARELRPVHARLIEIMKASAVVFADETRAPVLDPGRGRTKEGWLWALARDPRPWGGADPPAVAYRYAPGRG
AEHAEVHLDGFKGILQVDGYAAYRRLAGPTLAFCWSHARRKFYEIAEAGHAPVAEEALRRIAALYAVEAEIRGSDPQTRHAERQARSRPLVEDLRPWLEA
QLSRLSGKSRLAEAIRYTLKLWDGLSRFLDDGRIELDTNTVERAIRPIALNRKNALFAGSDGGGEHWAVIASLVETCKLNDLNPHTYLTDVLERLVAGHQ
QSRIDDLMPWAYRANADV
RRANRGSLPRHLPRVEVVVEPDSRTCGCCGEALQAIGEDVSERLDVIPAQFRVIVTRRPKYACRGCAEGVVQAAAPERLIAGGLPTEAMVAHVLVAKYAD
HTPLYRQAQIYARQGIALDRSTLADWVGHAARELRPVHARLIEIMKASAVVFADETRAPVLDPGRGRTKEGWLWALARDPRPWGGADPPAVAYRYAPGRG
AEHAEVHLDGFKGILQVDGYAAYRRLAGPTLAFCWSHARRKFYEIAEAGHAPVAEEALRRIAALYAVEAEIRGSDPQTRHAERQARSRPLVEDLRPWLEA
QLSRLSGKSRLAEAIRYTLKLWDGLSRFLDDGRIELDTNTVERAIRPIALNRKNALFAGSDGGGEHWAVIASLVETCKLNDLNPHTYLTDVLERLVAGHQ
QSRIDDLMPWAYRANADV
Blast result :
Comments
Genome in progress. Accession number: NZ_ABHC01000008
ISApr6 orf1 is 49% aa similar to ISRm14 (orf1).
ISApr6 orf2 is 70% aa similar to ISRm14 (orf2).
ISApr6 orf3 is 65% aa similar to ISBrsp6 (orf3).
ISApr6 orf1 is 49% aa similar to ISRm14 (orf1).
ISApr6 orf2 is 70% aa similar to ISRm14 (orf2).
ISApr6 orf3 is 65% aa similar to ISBrsp6 (orf3).
References
1] Hagstrom,A., Ferriera,S., Johnson,J., Kravitz,S., Beeson,K., Sutton,G., Rogers,Y.-H., Friedman,R., Frazier,M. and Venter,J.C. (2007) Direct Submission GenBank.
2] ISfinder annotation (2008).
2] ISfinder annotation (2008).