ISAav1
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AF086815 | ND | Acidovorax avenae | Acidovorax avenae subsp. citrulli |
DNA section
IS Length : 2532 bp
Ends
IR Length : 27
IRL : TGTTGAATCACGAGCAAAACTGAGCCACTAAAGGGCATCCGTCACGTCCA
IRR : TGTTGAATCACGAGCAAAACTGAGCCAGGCGTCACGTCCAAAACTGAGCC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GATAGGCGAC | GC | TGCTGTACCA | 2 |
DNA sequence
TGTTGAATCACGAGCAAAACTGAGCCACTAAAGGGCATCCGTCACGTCCAATTTTGAGCCACGTTATCCACCTACCCTGCTGTTTTTTGCAGCAGTGGGG
TGCAGGAGTGATCAACGTGAGCACATTGAGCAAATTGCGCCGCATGGTCCACCGTGACCACCTGAGCGTGCGCGAAGCCAGCCGAAGGCTGGGCATCTCC
CGCAATACAGCCGCCAAGTGGCTCGAGGCCGACGAGATGGTCGAGCCCCGCTACCCCAAGCGGGCCTCTCTTCCCAGCGTTCTCGATCCCTACAAGGAAC
AGCTGGCCACCTGGCTGAAGGCCGACAGCCACCGCAACAAGCGCGAGCGCCGTGGCATCAAGGCCATGTTCCGGGCACTGCAGGCCATGGGCTACCCTGG
CAGCCGAGGCCCGGTCTATGAGTTCGCCAAGCGCTGGCAGCAGGCACAGTCCGACGCGCCAACCCGAATGGCCTTCGTACCCATGAGTTTCGAGATGGGC
GAGGCCTTCCAGTTCGACTGGAGCTGTGAGTACCTGTTCGTGGGCGGCATGCGCAAACGCCTGGAGGCTTCGCACACCAAGCTGGCGGCCAGTCGGGCCT
TCATGCTCACCGCCTACTTCAGCCAGGCCCACGAGATGTTGTTCGACGCTCATGCCCGAGCCTTTGCTGCTTTTGGTGGCGTGCCGCGCCGGGGCATCTA
CGACAACATGAAGACCGCCGTCGACAAGGTTGGTCAGGGCAAGAGCCGCACCGTCAATGCGCGCTTCGAGGCCATGACGGGTCACTACCTGTTCGAGCCA
GAGTTCTGCAACCGGGCCGCAGGCTGGGAGAAGGGTATCGTCGAGAAGAACGTGCAGGATCGCCGTCGTGGCATCTGGATGGAGGCAGCGGAACGCCGCT
GGCCCGACCTGGAGAGCCTCAATGCGTGGCTGCACCAGGCTTGCCTGGACGCCTGGCATGAGCTGCCTCACCCCGAGTGGCCAGAGCTGACGATCGCCGA
CGTCTGGCAGCAAGAGCAGACGCACCTGATGCCCAACCCTGCGCCATTTGATGGCTATGTTGAGCAGATGGTGCGGGTGACGGCCACCTCGCTGATCCAC
TACCAGCGCAATCGCTACAGCGTGCCATGCGAGTGGGCCCACACCAGTGTCAGCGTGCGGGCTTATCCCGACCGCCTGGTGGTGGTCGGCCCCAACTCGG
AACCGCCAGAACAGCCGGTGAGCCTGCCGCGCAGCTTCGAGCGTGGCCAGACCTTGTACGACTGGCGGCACTACGTCAGCTTGCTCGAACGCAAGCCCGG
CGCTTTGCGCAACGGCGCCCCCTTCAAGACCATGCCCGAGCCCTTGCAGCATCTGCAAGCGCACCTGCTTCGCCACCCTGGCGGCGACCGGGTGATGGCC
CAGGTGCTCATGGCCATCACCCTGCACGGGCTCGATGACGTGCTGGTGGCCGTGGAGTTGGCGCTGCAATCGGGCCGCGTGAGTGCCGACCATGTGCTCA
ACGTGCTGGCCAGGCTCAAGGAGCCCCAAGCCGTGCAAAGCCTGCCTGAAGCAGCCTTGCCTTCACTGACGCTGCACGAGCCGCCTCAGGCCGACGTGTC
GCGCTACGACAGCCTGCGCCAGTCCCAGGAGGATGACCATGTCCAATGACATCGCAGCAAGCCTCAAGGGCTTGAGCCTGCACGGCATGGCCAGTGCCTG
GCCGGAACTGCTGGGCATCGCCCGGCTCAAATCGCTCGACCACGAAGCCCTTCTGCATCAGCTCATCAAGGCCGAGGGCGCGCACCGGGAAGTGCGCTCC
ATGGCCTACCAGATGCGGGTGGCCCGATTCCCTCACCACCGCGATCTGGCTGGCTTTGCCTTTGATCAGGCCCAAGTGGACGAGGCGCTGGTGCGTCAGC
TGCACGAGTTCAAGTTCATCGACTCAGCCCACAACGTGGTCTTCGTGGGCGGCCCAGGCACGGGCAAAACACACCTTGCCACAAGCCTGGGCGTTCATGC
CATCCGTGCACATGGTAAGCGGGTGCGCTTCTTCTCGACCGTCGAGCTGGTCAATCTGCTGGAGGCCGAGAAAGCTCAAGGTAAGGCCGGGCAACTGGCA
CATCGGCTCATGTACGTGGATCTCGTCATCCTCGACGAGATGGGCTATTTGCCCTTCAGTCAAGCCGGTGGGGCCTTGCTGTTCCACCTGCTGTCCAAGC
TGTACGAGCGCACCAGCGTGGTCGTCACCACCAACCTGTCGTTCTCGGAGTGGGCCAGCGTGTTCGGCGACGCCAAGATGACCACCGCGCTACTCGACCG
ACTCACCCATCACTGCCACATCGTCGAAACCGGCAACCAGAGCTGGCGCTTCAGGCACTCGACTGCACAACCGTCTTCGATCATCAGAGCCACACGAACC
AAAACGGCCAAAGGAGCACCCCAGACAGATCACGCAGTAGACTTATCCACATCCGAACAGTCCATTTCTTCGTCAACTTAGTGGCTCAGTTTTGGACGTG
ACGCCTGGCTCAGTTTTGCTCGTGATTCAACA
TGCAGGAGTGATCAACGTGAGCACATTGAGCAAATTGCGCCGCATGGTCCACCGTGACCACCTGAGCGTGCGCGAAGCCAGCCGAAGGCTGGGCATCTCC
CGCAATACAGCCGCCAAGTGGCTCGAGGCCGACGAGATGGTCGAGCCCCGCTACCCCAAGCGGGCCTCTCTTCCCAGCGTTCTCGATCCCTACAAGGAAC
AGCTGGCCACCTGGCTGAAGGCCGACAGCCACCGCAACAAGCGCGAGCGCCGTGGCATCAAGGCCATGTTCCGGGCACTGCAGGCCATGGGCTACCCTGG
CAGCCGAGGCCCGGTCTATGAGTTCGCCAAGCGCTGGCAGCAGGCACAGTCCGACGCGCCAACCCGAATGGCCTTCGTACCCATGAGTTTCGAGATGGGC
GAGGCCTTCCAGTTCGACTGGAGCTGTGAGTACCTGTTCGTGGGCGGCATGCGCAAACGCCTGGAGGCTTCGCACACCAAGCTGGCGGCCAGTCGGGCCT
TCATGCTCACCGCCTACTTCAGCCAGGCCCACGAGATGTTGTTCGACGCTCATGCCCGAGCCTTTGCTGCTTTTGGTGGCGTGCCGCGCCGGGGCATCTA
CGACAACATGAAGACCGCCGTCGACAAGGTTGGTCAGGGCAAGAGCCGCACCGTCAATGCGCGCTTCGAGGCCATGACGGGTCACTACCTGTTCGAGCCA
GAGTTCTGCAACCGGGCCGCAGGCTGGGAGAAGGGTATCGTCGAGAAGAACGTGCAGGATCGCCGTCGTGGCATCTGGATGGAGGCAGCGGAACGCCGCT
GGCCCGACCTGGAGAGCCTCAATGCGTGGCTGCACCAGGCTTGCCTGGACGCCTGGCATGAGCTGCCTCACCCCGAGTGGCCAGAGCTGACGATCGCCGA
CGTCTGGCAGCAAGAGCAGACGCACCTGATGCCCAACCCTGCGCCATTTGATGGCTATGTTGAGCAGATGGTGCGGGTGACGGCCACCTCGCTGATCCAC
TACCAGCGCAATCGCTACAGCGTGCCATGCGAGTGGGCCCACACCAGTGTCAGCGTGCGGGCTTATCCCGACCGCCTGGTGGTGGTCGGCCCCAACTCGG
AACCGCCAGAACAGCCGGTGAGCCTGCCGCGCAGCTTCGAGCGTGGCCAGACCTTGTACGACTGGCGGCACTACGTCAGCTTGCTCGAACGCAAGCCCGG
CGCTTTGCGCAACGGCGCCCCCTTCAAGACCATGCCCGAGCCCTTGCAGCATCTGCAAGCGCACCTGCTTCGCCACCCTGGCGGCGACCGGGTGATGGCC
CAGGTGCTCATGGCCATCACCCTGCACGGGCTCGATGACGTGCTGGTGGCCGTGGAGTTGGCGCTGCAATCGGGCCGCGTGAGTGCCGACCATGTGCTCA
ACGTGCTGGCCAGGCTCAAGGAGCCCCAAGCCGTGCAAAGCCTGCCTGAAGCAGCCTTGCCTTCACTGACGCTGCACGAGCCGCCTCAGGCCGACGTGTC
GCGCTACGACAGCCTGCGCCAGTCCCAGGAGGATGACCATGTCCAATGACATCGCAGCAAGCCTCAAGGGCTTGAGCCTGCACGGCATGGCCAGTGCCTG
GCCGGAACTGCTGGGCATCGCCCGGCTCAAATCGCTCGACCACGAAGCCCTTCTGCATCAGCTCATCAAGGCCGAGGGCGCGCACCGGGAAGTGCGCTCC
ATGGCCTACCAGATGCGGGTGGCCCGATTCCCTCACCACCGCGATCTGGCTGGCTTTGCCTTTGATCAGGCCCAAGTGGACGAGGCGCTGGTGCGTCAGC
TGCACGAGTTCAAGTTCATCGACTCAGCCCACAACGTGGTCTTCGTGGGCGGCCCAGGCACGGGCAAAACACACCTTGCCACAAGCCTGGGCGTTCATGC
CATCCGTGCACATGGTAAGCGGGTGCGCTTCTTCTCGACCGTCGAGCTGGTCAATCTGCTGGAGGCCGAGAAAGCTCAAGGTAAGGCCGGGCAACTGGCA
CATCGGCTCATGTACGTGGATCTCGTCATCCTCGACGAGATGGGCTATTTGCCCTTCAGTCAAGCCGGTGGGGCCTTGCTGTTCCACCTGCTGTCCAAGC
TGTACGAGCGCACCAGCGTGGTCGTCACCACCAACCTGTCGTTCTCGGAGTGGGCCAGCGTGTTCGGCGACGCCAAGATGACCACCGCGCTACTCGACCG
ACTCACCCATCACTGCCACATCGTCGAAACCGGCAACCAGAGCTGGCGCTTCAGGCACTCGACTGCACAACCGTCTTCGATCATCAGAGCCACACGAACC
AAAACGGCCAAAGGAGCACCCCAGACAGATCACGCAGTAGACTTATCCACATCCGAACAGTCCATTTCTTCGTCAACTTAGTGGCTCAGTTTTGGACGTG
ACGCCTGGCTCAGTTTTGCTCGTGATTCAACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1533 bp | 510 aa | 117 | 1649 | + | No |
Chemistry : DDE
ORF sequence :
MSTLSKLRRMVHRDHLSVREASRRLGISRNTAAKWLEADEMVEPRYPKRASLPSVLDPYKEQLATWLKADSHRNKRERRGIKAMFRALQAMGYPGSRGPV
YEFAKRWQQAQSDAPTRMAFVPMSFEMGEAFQFDWSCEYLFVGGMRKRLEASHTKLAASRAFMLTAYFSQAHEMLFDAHARAFAAFGGVPRRGIYDNMKT
AVDKVGQGKSRTVNARFEAMTGHYLFEPEFCNRAAGWEKGIVEKNVQDRRRGIWMEAAERRWPDLESLNAWLHQACLDAWHELPHPEWPELTIADVWQQE
QTHLMPNPAPFDGYVEQMVRVTATSLIHYQRNRYSVPCEWAHTSVSVRAYPDRLVVVGPNSEPPEQPVSLPRSFERGQTLYDWRHYVSLLERKPGALRNG
APFKTMPEPLQHLQAHLLRHPGGDRVMAQVLMAITLHGLDDVLVAVELALQSGRVSADHVLNVLARLKEPQAVQSLPEAALPSLTLHEPPQADVSRYDSL
RQSQEDDHVQ
YEFAKRWQQAQSDAPTRMAFVPMSFEMGEAFQFDWSCEYLFVGGMRKRLEASHTKLAASRAFMLTAYFSQAHEMLFDAHARAFAAFGGVPRRGIYDNMKT
AVDKVGQGKSRTVNARFEAMTGHYLFEPEFCNRAAGWEKGIVEKNVQDRRRGIWMEAAERRWPDLESLNAWLHQACLDAWHELPHPEWPELTIADVWQQE
QTHLMPNPAPFDGYVEQMVRVTATSLIHYQRNRYSVPCEWAHTSVSVRAYPDRLVVVGPNSEPPEQPVSLPRSFERGQTLYDWRHYVSLLERKPGALRNG
APFKTMPEPLQHLQAHLLRHPGGDRVMAQVLMAITLHGLDDVLVAVELALQSGRVSADHVLNVLARLKEPQAVQSLPEAALPSLTLHEPPQADVSRYDSL
RQSQEDDHVQ
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
843 bp | 280 aa | 1639 | 2481 | + | No |
AG : IS21 helper
ORF sequence :
MSNDIAASLKGLSLHGMASAWPELLGIARLKSLDHEALLHQLIKAEGAHREVRSMAYQMRVARFPHHRDLAGFAFDQAQVDEALVRQLHEFKFIDSAHNV
VFVGGPGTGKTHLATSLGVHAIRAHGKRVRFFSTVELVNLLEAEKAQGKAGQLAHRLMYVDLVILDEMGYLPFSQAGGALLFHLLSKLYERTSVVVTTNL
SFSEWASVFGDAKMTTALLDRLTHHCHIVETGNQSWRFRHSTAQPSSIIRATRTKTAKGAPQTDHAVDLSTSEQSISSST
VFVGGPGTGKTHLATSLGVHAIRAHGKRVRFFSTVELVNLLEAEKAQGKAGQLAHRLMYVDLVILDEMGYLPFSQAGGALLFHLLSKLYERTSVVVTTNL
SFSEWASVFGDAKMTTALLDRLTHHCHIVETGNQSWRFRHSTAQPSSIIRATRTKTAKGAPQTDHAVDLSTSEQSISSST
Blast result :
Comments
There are two copies of ISAav1. ISAav1 is 66% (for ORF1) and 81% (for ORF2)aa similar to IS1600.
References
1] Eaton, R.W. (2001) Direct submission GenBank.