ISAtu6
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NC_010929 | ND | Agrobacterium tumefaciens | Agrobacterium tumefaciens Ti plasmid pTiBo542 Agrobacterium tumefaciens |
DNA section
IS Length : 2798 bp
Ends
IR Length : 23/24
IRL : GTGAGCGTCCGGTGAGCTGTTTTTGCTGATCGGGCAAATCAGGCGATGTT
IRR : GTGAGCGTCCGGCGAAAGCGTTTTCGGGTATCGAGAGATGGAAGCTATGA
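A note on orientation, offered as a hedged reading of this record rather than a statement of database policy: the IRR string above appears to be written as the reverse complement of the element's right-hand terminus, so that it reads in the same direction as IRL. The minimal sketch below checks that for the 50 nt shown. `RIGHT_END` is copied from the last 50 nt of the DNA sequence in this record; the function and variable names are illustrative and not part of the record.

```python
# Complement map for unambiguous DNA bases (IUPAC ambiguity codes are not handled).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


# IRR exactly as listed in the Ends section of this record.
IRR = "GTGAGCGTCCGGCGAAAGCGTTTTCGGGTATCGAGAGATGGAAGCTATGA"

# Last 50 nt of the DNA sequence in this record (top strand, 5' to 3').
RIGHT_END = "TCATAGCTTCCATCTCTCGATACCCGAAAACGCTTTCGCCGGACGCTCAC"

irr_matches_right_end = reverse_complement(RIGHT_END) == IRR
print(irr_matches_right_end)
```

For the strings above the comparison holds. IRL, by contrast, matches the first 50 nt of the DNA sequence directly, with no reverse complementing.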
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AGTAAATACC | GCCAATGA | GGTGCGCGAT | 8 |
CGAAAAGATA | GTCTTTAT | CACCGCGCTT | 8 |
GTAGAGCGTT | GTTCAGCG | AAATACTAGC | 8 |
DNA sequence
GTGAGCGTCCGGTGAGCTGTTTTTGCTGATCGGGCAAATCAGGCGATGTTCTGGCATTCGAACCAGCATTAGTGCTGACATTAATACCAGAACAAAGAGG
TTTCATGGCTCGCATGGAGATCATGTCCGGTACCGAGCGCAGGCGGCGCTGGTCGGACGAGGCGAAGCTCAGGATATTGGCGGAAGCTGATGAACCCGGT
GCTCGCATTGGTGAGGTGGCGCGCCGGCATGACGTTCATCCTGGCCAGATCCGCTTGTGGCGGAGGTCGTTCAACTATGTGGATCGACCCACGGTGTTCC
TTCCAGTGGAAATCATGGAGGAGGCTGGCGTAAGTCAGGCGTCTTCGACGGTAACGAGGCCGGCGATCGTCGAGATATTGCTGCGGAACGGTCGGTGCCT
GAAGGTTCCTGCTGACGTTGAGTTGAAGCTGCTCGGTCCGCTGATCGCTTGCGTGGAGGCGGCATGATCGGGCCATCGGGCAATGTAAGGGTCTATCTGG
CCTGCGGAGTGACCGATATGCGGCGTGGCATTGATGGCCTATCGGCGCTGGTCGAGACGGTCGTGAAGGAGGCGCCGGGCTCGGGCGCGATTTTCGGCTT
CCGCGGAAAGCGCGCCGACCGGATCAAGCTTCTCTGGTGGGACGGCCAGGGGTTCTGCCTGTTCTACAAGATTTTGGAGCGTGGATACTTTCCCTGGCCG
ACAGCGAAAGAAGGTGTCGCGCACCTGACGCAGGCGCAGCTTTCCATGCTGGTTGAGGGGATTGATTGGCGCCGCCCGGCATGGACGTCCGCTCCTGGCC
GAACAGGATAAAAGCTATATTTTGCAAGGGCATCCGAGGCTTAACGCTGGCGCGTCAGAGGCAAATCGGCTAGATTTGCGGATATGGAAACAGCGCCGCT
GGACAGTCAGGACGAACTCACTGCTTTGCGCGCACTGGTCGCCGAACAGGCGGCGAAGCTTGAGAGCCAGGAAGCCGAGGTCATCAAGCGAGACTCCATC
ATAGGGCTTCTTCGCGCGCAACTGGAGCTTCTCCGACATCGGCAGCATGGCGCCTCTTCGGAAAAGATCGACCGGAAGATAGAGCAGTTCGAACTGATGC
TGGAGGAGATTGAGGCTTCCCGTGCCGAGGCTGAACTGCGCTCCGGGAAAACACCCTTGCCGGATTTGGACGACGCGCCGGACAAGCCGAAGCGCAAACC
ATTGCCCGATGGTCTCGCCACCGAAGAGCTGATCTATGCGGCTCCCTGTAATTGCCCGACCTGCGGTGGCACCTCGTTCCTGAAGGCGCCCGACAGGGTG
GTTCAGGTGCTGGAACACGTGCCGGCGTCGGTCAAGATTGTCCGCCATGTCGAGAAGCGTATGATCTGCAAGGAATGCGATACGACAGTGGCTGGCGAGA
TGCCGACCTTGCCGATCGAGCGCGGCAAGCCCGGGCCTGGATTGCTCGCCCATATCATGGTCTCCAAGTTCGATGATCACATTCCGCTTTACCGTCTCTC
AGAGATGTATGATCGGCTGGGAATAGACATATCGCGCTCCGTGATGGCCGACTGGGTCGGCCGCGTATCCGCTTTGCTGACACCCATCGTCTTGTTGATC
AGGGCCCACATCGCCGCGCTTGACCGAATACATACGGACGATACCCCGGTCGATGTTCTCGACCCCGGACGGGGCAAGACAAAAACCGGCAGGGTCTGGG
TCTACGTCTTCGACGGCAGTGGCTATCAAGCCACCACTCCGGCAGCCATCGCCTATTACTACAGTCCTGATCGAAAGGGCACACATCCGGCTGACCACCT
GGCAAGCTTTAGCGGCGTCATGCATGCCGACGGTTATGGCGGCTACAGACAACTCTACGGCAACCAGATCGTTGAGGCCGCCTGCATGGCGCATGTACGT
CGCAAGTTCCATGACGTGATCAAGCTGAAGCCATCGCCGATCGCCGACGAAGCGCTGTCGCGCATCGGCGCGCTCTACGATATCGAGGATCGTATCCGCG
GCATGTCGGCTGATGAGCGTCGTACCCTGCGCCAACACCACGCCAGACCCATTCTGGACGAGCTGAAGACCTGGATCGAGGCGACACTCTCAACTTTGCC
ACAGAAGCAGAAGCTGGCCGAGGCAATGCGATATGCGCTGTCTCGATGGGCAGCCTTGAGCGTTTACATCGACGATGGCCGTGTCGAAATCGATAACAAC
ATAGCTGAGCGAGCGATGCGTCCGCTGGGCCTCGGCAGAAAAAACTGGTTATTCGCAGGCTCGGACAAGGGCGGTGAGCGCATCGCCAACATCCTGACCA
TCATCGAAACGGTCAAACTGCACGGCCATAATCCGGAGCTCTACCTGACAGATGTCCTGACCCGGATCCAGGATCACCCCAAAGACCGATTTGAAGATCT
GCTGCCCTGGAACTGGATGCCAGCAAAAGCTCGATGCGAGGCCGCCTGATGGCTCGCTCGAGGTTCATCTATACGCTCAGCCAAGTCGCCGGCATGATCG
GCGAAAACCTCGAACTGATCGAAGAAGTAACCGCAAACCCGGACAACATCTCCGAGGGCGAACTGGTTTACGTCAGCGATGGCAGTGAAGATGGCACGAA
GGGTCTGACCGGGAATGGCATCGAAGAACTTCAAAGCCTACTTGCCGACATCAGGACGTGGGACGGCGGTATCCGCGAGTTCCTCATCGACACGCAGTGC
GATCCTGAAATGATCGACCGCGTCATGGCCGATGAAATGAAACGCGGCTCATAGCTTCCATCTCTCGATACCCGAAAACGCTTTCGCCGGACGCTCAC
Protein section
ORF number : 4
ORF 1
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
354 bp | 117 aa | 114 | 467 | + | No |
AG : IS66 TnpA
ORF sequence :
MEIMSGTERRRRWSDEAKLRILAEADEPGARIGEVARRHDVHPGQIRLWRRSFNYVDRPTVFLPVEIMEEAGVSQASSTVTRPAIVEILLRNGRCLKVPA
DVELKLLGPLIACVEAA
Blast result :
ORF 2
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
348 bp | 115 aa | 464 | 811 | + | No |
AG : IS66 TnpB
ORF sequence :
MIGPSGNVRVYLACGVTDMRRGIDGLSALVETVVKEAPGSGAIFGFRGKRADRIKLLWWDGQGFCLFYKILERGYFPWPTAKEGVAHLTQAQLSMLVEGI
DWRRPAWTSAPGRTG
Blast result :
ORF 3
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1566 bp | 521 aa | 884 | 2449 | + | No |
Chemistry : DDE
ORF sequence :
METAPLDSQDELTALRALVAEQAAKLESQEAEVIKRDSIIGLLRAQLELLRHRQHGASSEKIDRKIEQFELMLEEIEASRAEAELRSGKTPLPDLDDAPD
KPKRKPLPDGLATEELIYAAPCNCPTCGGTSFLKAPDRVVQVLEHVPASVKIVRHVEKRMICKECDTTVAGEMPTLPIERGKPGPGLLAHIMVSKFDDHI
PLYRLSEMYDRLGIDISRSVMADWVGRVSALLTPIVLLIRAHIAALDRIHTDDTPVDVLDPGRGKTKTGRVWVYVFDGSGYQATTPAAIAYYYSPDRKGT
HPADHLASFSGVMHADGYGGYRQLYGNQIVEAACMAHVRRKFHDVIKLKPSPIADEALSRIGALYDIEDRIRGMSADERRTLRQHHARPILDELKTWIEA
TLSTLPQKQKLAEAMRYALSRWAALSVYIDDGRVEIDNNIAERAMRPLGLGRKNWLFAGSDKGGERIANILTIIETVKLHGHNPELYLTDVLTRIQDHPK
DRFEDLLPWNWMPAKARCEAA
Blast result :
ORF 4
DNA length | Protein length | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
306 bp | 101 aa | 2449 | 2754 | + | No |
Annotation : hypothetical protein
Description :
ORF sequence :
MARSRFIYTLSQVAGMIGENLELIEEVTANPDNISEGELVYVSDGSEDGTKGLTGNGIEELQSLLADIRTWDGGIREFLIDTQCDPEMIDRVMADEMKRG
S
Blast result :
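The ORF coordinates listed in this section can be cross-checked arithmetically. The sketch below is illustrative only and rests on two assumptions not stated in the record: Begin and End are 1-based inclusive positions, and the nucleotide length counts the stop codon (so it equals 3 × (aa + 1)). The `Orf` type and the helper names are hypothetical.

```python
from typing import List, NamedTuple, Tuple


class Orf(NamedTuple):
    name: str
    length_bp: int
    length_aa: int
    begin: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


# Values copied from the four ORF tables in this record.
ORFS = [
    Orf("ORF 1", 354, 117, 114, 467),
    Orf("ORF 2", 348, 115, 464, 811),
    Orf("ORF 3", 1566, 521, 884, 2449),
    Orf("ORF 4", 306, 101, 2449, 2754),
]


def is_consistent(orf: Orf) -> bool:
    """True if the span equals the bp length and the bp length is codons plus one stop."""
    span = orf.end - orf.begin + 1
    return span == orf.length_bp and orf.length_bp == 3 * (orf.length_aa + 1)


def overlaps(orfs: List[Orf]) -> List[Tuple[str, str, int]]:
    """Return (first, second, shared_bp) for consecutive ORFs that share positions."""
    shared_pairs = []
    for first, second in zip(orfs, orfs[1:]):
        shared = first.end - second.begin + 1
        if shared > 0:
            shared_pairs.append((first.name, second.name, shared))
    return shared_pairs


for orf in ORFS:
    print(orf.name, is_consistent(orf))
print(overlaps(ORFS))
```

Under those assumptions all four ORFs are internally consistent. The coordinates also imply that ORF 1 and ORF 2 share 4 bp, and that ORF 3 and ORF 4 share 1 bp.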
Comments
ISAtu6 orfA is 50% aa similar to ISPpa5_aa1.
ISAtu6 orfB is 77% aa similar to ISPpa5_aa2.
ISAtu6 orfC is 66% aa similar to ISCARN48_aa3.
ISAtu6 orfD is 55% aa similar to ISPpa5_aa4.
ISAtu6 was reconstructed by deletion of ISAtu7 sequence (family IS110).
References
[1] Oger, P.M., Farrand, S.K., Olsen, G.J. and Reich, C. (2005) Direct submission, GenBank.
[2] ISfinder annotation (2009)