ISRsp12
- Family Tn3
- Group
Isoform Synonym(s) TnRsp12
Accession number | Transposition | Origin | Host |
---|---|---|---|
FO082821 | ND | Rhizhobium sp. | Rhizhobium sp. NT-26 |
DNA section
IS Length : 3677 bp
Ends
IR Length : 37
IRL : GCGGCTCGCGCGCCGCAAAACGCTCATCTGGTCTAAGGAATAGCTTGGGC
IRR : GCGGCTCGCGCGCCGCAAAACGCTCATCTGGTCTAAGCCCGTTTCGGCCA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AACCACACTT | AAGTGTGGTT | 0 |
DNA sequence
GCGGCTCGCGCGCCGCAAAACGCTCATCTGGTCTAAGGAATAGCTTGGGCCGATTTCCGCTGTAGCGTCCTGTAGACGGTCGGCCTTGAGACCGAGAAGA
TCTCAGCGAGATCGCTGATAGAGTAGTCGCCGGAGGCGTGCATCCGGCCCAGTTCTTTTTGTTGTTTTTCCGAGAGTTTCGGCTGCTTTCCCCGGAGTTT
GCCCCTAGCGCGGGCGATGGCCATACCTTCGCGGGTGCGCATGCGGATGAGGTCGGCCTCGAACTCGGCGAAAGTGGCGAGGATGTTGAAGAACATCTTT
CCCATCGGGTCGGCCGGATCGTAGACCGATGCGCCGAGGGCAAGCTTGACGCCCTTCTTTTCCAACTGATCCGCGATGGCCCGGGCGTCTGGAACCGAGC
GCGCGAGACGGTCGAGCTTCGGTACGACCAGCACATCGCCTGCGCGCACGGCAGCGAGAGCTTGGTCGAGACCGGGCCGGGCGCGGTTTGTGCCCGTCAA
GCCGTGGTCGGTGTAGATGTGATCTTCGGTGACCCCAAGTTCTGCAAGGGCAGCGCGCTGCGCCGTCAGATCCTGACGGTCGGTGGAGCAGCGGGCATAG
CCAATCTTGGTTGCGGTCATGCAGCGAGTGTACGGATAAGGGGCCGCATGTGCGAATCAAATCGTACCATCTATACGAGACGTGGCGGGGTGCGGTTTCC
TTGCCGCCCTCCGCCACCTCGACCACGTCCGCTTATCGATCCCCTTACGGACGGTCGTCCATCACGATTTCCTGAAGGATCGCCATGCCAAGACGCAGCA
TTCTGACCGACCGCCAACGGGCGGCATTGTTCGACCTGCCGACCGATGACGCCTCGATGCTGCGCCACTACACGCTGGCAGACGACGATCTGGAGATCAT
CCATGCCCGTCGCCGCCCCCACAATCGCTTCGGTTTTGCGTTGCAACTCTGCGCCCTGCGATATCCCGGGCGGTTGTTGGCGCCGGGCGAGGTCATACCG
CTGCCAGTCACGCGCTTTCTGGCCGCGCAACTTGGGATAAAGCCTGACGATCTGGCTGGCTACGCCGCCCGCGAGGAGACCCGGCATGAGCACCTGGCCA
TCTTGCGCGAGATCTACGGATACAAGATGTTCACCGGGCGGGGCGCTCGGGACCTGAAAACCTGGCTTGAGGATACCGCAGAAACCACTCGGTCGAACGA
GGACCTGGCACGGCGCTTCGTCGAGCAGTGCCGGGCAAGCCAGACCATCCTGCCGGGGATCACGGTCATCGAGCGGCTTTGCGCGGATGCGCTCGTCGCG
GCTGAACGGCGGGTCGATGCCCGGATCGCCGGTCGGCTGAACGATGACATGCGAACGCGGCTCGATGCCCTGTTGACCGATGGTGCGGGCGGCGCGGTCA
CGCGGTTCGTCTGGCTGCGTCAGTTCGAGATCGGGCGGAACTCGGCGGACATGAACCGCCTTCTGGACCGCCTGGAATACCTGCAGACCTTGGAACTGGC
CGGGGCATTCTGGCTGACGTGCCTCCTCACCGGATTGCGCGCCTGCGCCGGCAGGGCGAGCGATACTTCGCGGGTGATCTGAGGGATATCTCGGGTGACC
GCCGCCTCGCCATTCTTGCGGTCTGCGCGCTGGAATGGCGCAGTTCCATTGCCGACGCCGTTGTCGAGACCCATGACCGGATCGTCGGCAAGACGTGGCG
CGAGGCGAAATCGCGATGTGACGCCCGGATGAACGATGCCAAATCTGCGCTGAAGGACACCCTGCAATCCTTCAAGACATTGGGCGCGGCGCTTCTGGAA
GCGCATGAGGACCAAGCCTCCCTTGAAGAGGCCATCGGTACTGCGGGCGGCTGGTCATCCCTCAAAGGGCTTGTCGCCACGGCCGCCCAACTGACCGATA
CATTCGCCGCGGACCCGCTGGCGCATGTTGTTCACGGATATCATCGCTTCCGGCGTTATGCGCCGCGCATGCTGCGGGCGCTCGACATTTGTGCCGCGCC
CGTGGCGGAACCGCTGCTGGCGGCAAGCAGGATCATCGCGGGCACGGAGACGACTGACGACCGACCGCTGACCTTCCTGCGCCGCACTTCAAAGTGGCAT
CGCCACCTGAATGGCGACGATGAGCACCGAGTATGGGAGGTCGCGGTCCTGTTCCATCTGCGCGACGCCTTCCGTTCCGGTGATATCTGGCTGGCGCATT
CGCGAAGGTATGGCGACCTGAAGGACGCGCTTGTCCCGGTCGAGGCCGCCCGCGACACGCCGAGGCTGGCCATGCCGTTCGAGCCCGAAACCTGGCTGGC
CGACCGAAAGGCGCGTCTGTCCGATGCTGTATGCCGACTGGCTCGCGCTGCAAAGGCCGGGGCCATACCGGGAGGTTCCATAGAGGATAGCACCTTGAAA
ATCGATCGGCTGACCGCAGCGGTTCCCGAGGAAGCCGACACCCTGGTGCTCGATCTCTATCACCGCCTGCCCGAAGTCAGGGTCACCGATCTCCTGCTCG
AAGTGGATGCGGAGGTCGGCTTCACCGAGGCCTTTACCCATCTGCGCACCGGAGTGCCTTGCAAGGACAGGATTGGCCTGCTGAATGTCCTGCTTGCCGA
GGGGCTGAATCTTGGCCTCAGCAAGATGGCCGGGGCCACGAACACGCACGACTTCTTCCAGCTCTCCCGCCTGTCGCGATGGCATGTCGAAAGCGACGCG
ATGGCTCGCGCCCTGGCCATGGTGATCGAAGGACAGTCGGCACTGCCGATGGCGCGCTTCTGGGGCGCCGGACAAACCGCTTCGAGCGACGGGCAATTCT
TCCCGACAACGCGTCAGGGTGAGGCGATGAACCTGATCAACGCCAAGTACGGCCATGAACCCGGTCTGAAGGCCTATACCCATGTCTCCGACCAGTTCGG
CCCTTTCGCCACACAGACCATTCCAGCCACCGTGAACGAGGCACCCTATATCCTTGACGGCCTGCTGATGACCGGTGCCGGCCAGAAGATCCGCGAACAG
TATGCCGATACAGGTGGCTTCACGGACCATGTCTTCGCGGTCACCGCCCTGCTGGGATTCCAGTTCATACCCCGCATCCGCGACCTGCCGTCAAAGCGCC
TTTACCTCTTCGATCCGGCAGCCTGCCCGAAAGAGCTGAAGGGGCTGATCGGGGGCACGATCAAGGAACGTCTTATCATGACAAACTGGCCTGATATCCT
GCGCAGCGTGGCGACCATGGCGTCAGGGGCCATGCCTCCCAGCCAGCTCCTGCGGAAGTTTGCATCCTATCCCAGGCAACATGAGCTGGCGGTCGCTTTG
CGGGAAATCGGACGGGTCGAACGGACGCTCTTCATCATCGACTGGCTGCTCGATGCCGACATGCAGCGCCGCGCCCAGATCGGCCTCAACAAGGGTGAGG
CCCATCACGCCCTGAAGAACGCTCTCCGCATCGGCCGCCAGGGTGAAATCCGCGATCGCACCTCCGAAGGCCAGCACTTCCGCATGGCCGGGCTGAACCT
CCTCGCCGCGATCGTCATCTACTGGAACACCAAGCATCTCGGTGTCGCCGTCTCCAATCGCCGCCGCGAAGGCCTGGACTGCTCTCCCCACCTCATGGCG
CATATCTCGCCCCTCGGTTGGGCACATATCCTGCTCACTGGCGAATACAGATGGCCGAAACGGGCTTAGACCAGATGAGCGTTTTGCGGCGCGCGAGCCG
C
TCTCAGCGAGATCGCTGATAGAGTAGTCGCCGGAGGCGTGCATCCGGCCCAGTTCTTTTTGTTGTTTTTCCGAGAGTTTCGGCTGCTTTCCCCGGAGTTT
GCCCCTAGCGCGGGCGATGGCCATACCTTCGCGGGTGCGCATGCGGATGAGGTCGGCCTCGAACTCGGCGAAAGTGGCGAGGATGTTGAAGAACATCTTT
CCCATCGGGTCGGCCGGATCGTAGACCGATGCGCCGAGGGCAAGCTTGACGCCCTTCTTTTCCAACTGATCCGCGATGGCCCGGGCGTCTGGAACCGAGC
GCGCGAGACGGTCGAGCTTCGGTACGACCAGCACATCGCCTGCGCGCACGGCAGCGAGAGCTTGGTCGAGACCGGGCCGGGCGCGGTTTGTGCCCGTCAA
GCCGTGGTCGGTGTAGATGTGATCTTCGGTGACCCCAAGTTCTGCAAGGGCAGCGCGCTGCGCCGTCAGATCCTGACGGTCGGTGGAGCAGCGGGCATAG
CCAATCTTGGTTGCGGTCATGCAGCGAGTGTACGGATAAGGGGCCGCATGTGCGAATCAAATCGTACCATCTATACGAGACGTGGCGGGGTGCGGTTTCC
TTGCCGCCCTCCGCCACCTCGACCACGTCCGCTTATCGATCCCCTTACGGACGGTCGTCCATCACGATTTCCTGAAGGATCGCCATGCCAAGACGCAGCA
TTCTGACCGACCGCCAACGGGCGGCATTGTTCGACCTGCCGACCGATGACGCCTCGATGCTGCGCCACTACACGCTGGCAGACGACGATCTGGAGATCAT
CCATGCCCGTCGCCGCCCCCACAATCGCTTCGGTTTTGCGTTGCAACTCTGCGCCCTGCGATATCCCGGGCGGTTGTTGGCGCCGGGCGAGGTCATACCG
CTGCCAGTCACGCGCTTTCTGGCCGCGCAACTTGGGATAAAGCCTGACGATCTGGCTGGCTACGCCGCCCGCGAGGAGACCCGGCATGAGCACCTGGCCA
TCTTGCGCGAGATCTACGGATACAAGATGTTCACCGGGCGGGGCGCTCGGGACCTGAAAACCTGGCTTGAGGATACCGCAGAAACCACTCGGTCGAACGA
GGACCTGGCACGGCGCTTCGTCGAGCAGTGCCGGGCAAGCCAGACCATCCTGCCGGGGATCACGGTCATCGAGCGGCTTTGCGCGGATGCGCTCGTCGCG
GCTGAACGGCGGGTCGATGCCCGGATCGCCGGTCGGCTGAACGATGACATGCGAACGCGGCTCGATGCCCTGTTGACCGATGGTGCGGGCGGCGCGGTCA
CGCGGTTCGTCTGGCTGCGTCAGTTCGAGATCGGGCGGAACTCGGCGGACATGAACCGCCTTCTGGACCGCCTGGAATACCTGCAGACCTTGGAACTGGC
CGGGGCATTCTGGCTGACGTGCCTCCTCACCGGATTGCGCGCCTGCGCCGGCAGGGCGAGCGATACTTCGCGGGTGATCTGAGGGATATCTCGGGTGACC
GCCGCCTCGCCATTCTTGCGGTCTGCGCGCTGGAATGGCGCAGTTCCATTGCCGACGCCGTTGTCGAGACCCATGACCGGATCGTCGGCAAGACGTGGCG
CGAGGCGAAATCGCGATGTGACGCCCGGATGAACGATGCCAAATCTGCGCTGAAGGACACCCTGCAATCCTTCAAGACATTGGGCGCGGCGCTTCTGGAA
GCGCATGAGGACCAAGCCTCCCTTGAAGAGGCCATCGGTACTGCGGGCGGCTGGTCATCCCTCAAAGGGCTTGTCGCCACGGCCGCCCAACTGACCGATA
CATTCGCCGCGGACCCGCTGGCGCATGTTGTTCACGGATATCATCGCTTCCGGCGTTATGCGCCGCGCATGCTGCGGGCGCTCGACATTTGTGCCGCGCC
CGTGGCGGAACCGCTGCTGGCGGCAAGCAGGATCATCGCGGGCACGGAGACGACTGACGACCGACCGCTGACCTTCCTGCGCCGCACTTCAAAGTGGCAT
CGCCACCTGAATGGCGACGATGAGCACCGAGTATGGGAGGTCGCGGTCCTGTTCCATCTGCGCGACGCCTTCCGTTCCGGTGATATCTGGCTGGCGCATT
CGCGAAGGTATGGCGACCTGAAGGACGCGCTTGTCCCGGTCGAGGCCGCCCGCGACACGCCGAGGCTGGCCATGCCGTTCGAGCCCGAAACCTGGCTGGC
CGACCGAAAGGCGCGTCTGTCCGATGCTGTATGCCGACTGGCTCGCGCTGCAAAGGCCGGGGCCATACCGGGAGGTTCCATAGAGGATAGCACCTTGAAA
ATCGATCGGCTGACCGCAGCGGTTCCCGAGGAAGCCGACACCCTGGTGCTCGATCTCTATCACCGCCTGCCCGAAGTCAGGGTCACCGATCTCCTGCTCG
AAGTGGATGCGGAGGTCGGCTTCACCGAGGCCTTTACCCATCTGCGCACCGGAGTGCCTTGCAAGGACAGGATTGGCCTGCTGAATGTCCTGCTTGCCGA
GGGGCTGAATCTTGGCCTCAGCAAGATGGCCGGGGCCACGAACACGCACGACTTCTTCCAGCTCTCCCGCCTGTCGCGATGGCATGTCGAAAGCGACGCG
ATGGCTCGCGCCCTGGCCATGGTGATCGAAGGACAGTCGGCACTGCCGATGGCGCGCTTCTGGGGCGCCGGACAAACCGCTTCGAGCGACGGGCAATTCT
TCCCGACAACGCGTCAGGGTGAGGCGATGAACCTGATCAACGCCAAGTACGGCCATGAACCCGGTCTGAAGGCCTATACCCATGTCTCCGACCAGTTCGG
CCCTTTCGCCACACAGACCATTCCAGCCACCGTGAACGAGGCACCCTATATCCTTGACGGCCTGCTGATGACCGGTGCCGGCCAGAAGATCCGCGAACAG
TATGCCGATACAGGTGGCTTCACGGACCATGTCTTCGCGGTCACCGCCCTGCTGGGATTCCAGTTCATACCCCGCATCCGCGACCTGCCGTCAAAGCGCC
TTTACCTCTTCGATCCGGCAGCCTGCCCGAAAGAGCTGAAGGGGCTGATCGGGGGCACGATCAAGGAACGTCTTATCATGACAAACTGGCCTGATATCCT
GCGCAGCGTGGCGACCATGGCGTCAGGGGCCATGCCTCCCAGCCAGCTCCTGCGGAAGTTTGCATCCTATCCCAGGCAACATGAGCTGGCGGTCGCTTTG
CGGGAAATCGGACGGGTCGAACGGACGCTCTTCATCATCGACTGGCTGCTCGATGCCGACATGCAGCGCCGCGCCCAGATCGGCCTCAACAAGGGTGAGG
CCCATCACGCCCTGAAGAACGCTCTCCGCATCGGCCGCCAGGGTGAAATCCGCGATCGCACCTCCGAAGGCCAGCACTTCCGCATGGCCGGGCTGAACCT
CCTCGCCGCGATCGTCATCTACTGGAACACCAAGCATCTCGGTGTCGCCGTCTCCAATCGCCGCCGCGAAGGCCTGGACTGCTCTCCCCACCTCATGGCG
CATATCTCGCCCCTCGGTTGGGCACATATCCTGCTCACTGGCGAATACAGATGGCCGAAACGGGCTTAGACCAGATGAGCGTTTTGCGGCGCGCGAGCCG
C
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
588 bp | 195 aa | 620 | 33 | - | No |
AG : Tn3 resolvase
ORF sequence :
MTATKIGYARCSTDRQDLTAQRAALAELGVTEDHIYTDHGLTGTNRARPGLDQALAAVRAGDVLVVPKLDRLARSVPDARAIADQLEKKGVKLALGASVY
DPADPMGKMFFNILATFAEFEADLIRMRTREGMAIARARGKLRGKQPKLSEKQQKELGRMHASGDYSISDLAEIFSVSRPTVYRTLQRKSAQAIP
DPADPMGKMFFNILATFAEFEADLIRMRTREGMAIARARGKLRGKQPKLSEKQQKELGRMHASGDYSISDLAEIFSVSRPTVYRTLQRKSAQAIP
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
2885 bp | 961 aa | 785 | 3669 | + | No |
Chemistry : DDE
ORF sequence :
MPRRSILTDRQRAALFDLPTDDASMLRHYTLADDDLEIIHARRRPHNRFGFALQLCALRYPGRLLAPGEVIPLPVTRFLAAQLGIKPDDLAGYAAREETR
HEHLAILREIYGYKMFTGRGARDLKTWLEDTAETTRSNEDLARRFVEQCRASQTILPGITVIERLCADALVAAERRVDARIAGRLNDDMRTRLDALLTDG
AGGAVTRFVWLRQFEIGRNSADMNRLLDRLEYLQTLGTGRGILADVPPHRIARLRRQGERYFAGDLRDISGDRRLAILAVCALEWRSSIADAVVETHDRI
VGKTWREAKSRCDARMNDAKSALKDTLQSFKTLGAALLEAHEDQASLEEAIGTAGGWSSLKGLVATAAQLTDTFAADPLAHVVHGYHRFRRYAPRMLRAL
DICAAPVAEPLLAASRIIAGTETTDDRPLTFLRRTSKWHRHLNGDDEHRVWEVAVLFHLRDAFRSGDIWLAHSRRYGDLKDALVPVEAARDTPRLAMPFE
PETWLADRKARLSDAVCRLARAAKAGAIPGGSIEDSTLKIDRLTAAVPEEADTLVLDLYHRLPEVRVTDLLLEVDAEVGFTEAFTHLRTGVPCKDRIGLL
NVLLAEGLNLGLSKMAGATNTHDFFQLSRLSRWHVESDAMARALAMVIEGQSALPMARFWGAGQTASSDGQFFPTTRQGEAMNLINAKYGHEPGLKAYTH
VSDQFGPFATQTIPATVNEAPYILDGLLMTGAGQKIREQYADTGGFTDHVFAVTALLGFQFIPRIRDLPSKRLYLFDPAACPKELKGLIGGTIKERLIMT
NWPDILRSVATMASGAMPPSQLLRKFASYPRQHELAVALREIGRVERTLFIIDWLLDADMQRRAQIGLNKGEAHHALKNALRIGRQGEIRDRTSEGQHFR
MAGLNLLAAIVIYWNTKHLGVAVSNRRREGLDCSPHLMAHISPLGWAHILLTGEYRWPKRA
HEHLAILREIYGYKMFTGRGARDLKTWLEDTAETTRSNEDLARRFVEQCRASQTILPGITVIERLCADALVAAERRVDARIAGRLNDDMRTRLDALLTDG
AGGAVTRFVWLRQFEIGRNSADMNRLLDRLEYLQTLGTGRGILADVPPHRIARLRRQGERYFAGDLRDISGDRRLAILAVCALEWRSSIADAVVETHDRI
VGKTWREAKSRCDARMNDAKSALKDTLQSFKTLGAALLEAHEDQASLEEAIGTAGGWSSLKGLVATAAQLTDTFAADPLAHVVHGYHRFRRYAPRMLRAL
DICAAPVAEPLLAASRIIAGTETTDDRPLTFLRRTSKWHRHLNGDDEHRVWEVAVLFHLRDAFRSGDIWLAHSRRYGDLKDALVPVEAARDTPRLAMPFE
PETWLADRKARLSDAVCRLARAAKAGAIPGGSIEDSTLKIDRLTAAVPEEADTLVLDLYHRLPEVRVTDLLLEVDAEVGFTEAFTHLRTGVPCKDRIGLL
NVLLAEGLNLGLSKMAGATNTHDFFQLSRLSRWHVESDAMARALAMVIEGQSALPMARFWGAGQTASSDGQFFPTTRQGEAMNLINAKYGHEPGLKAYTH
VSDQFGPFATQTIPATVNEAPYILDGLLMTGAGQKIREQYADTGGFTDHVFAVTALLGFQFIPRIRDLPSKRLYLFDPAACPKELKGLIGGTIKERLIMT
NWPDILRSVATMASGAMPPSQLLRKFASYPRQHELAVALREIGRVERTLFIIDWLLDADMQRRAQIGLNKGEAHHALKNALRIGRQGEIRDRTSEGQHFR
MAGLNLLAAIVIYWNTKHLGVAVSNRRREGLDCSPHLMAHISPLGWAHILLTGEYRWPKRA
Blast result :
Comments
ISRsp12 is 56% (ORFB: the transposase) aa similar to ISAcsp1. The first ORF is an integrase. The transposase had a frameshift (sequencing error ?) and was reconstructed in silico : join (785-1489;1489-3669).
ISRsp12 is flanked by two copies of ISRsp11 (IS110 family).
ISRsp12 is flanked by two copies of ISRsp11 (IS110 family).
References
1] Andres,J., Arsene-Ploetze,F., Barbe,V., Brochier-Armanet,C., Cleiss-Arnold,J., Coppee,J.Y., Dillies,M.A., Geist,L., Joublin,A., Koechler,S., Lassalle,F., Marchal,M., Medigue,C., Muller,D., Nesme,X., Plewniak,F., Proux,C., Ramirez-Bahena,M.H., Schenowitz,C., Sismeiro,O., Vallenet,D., Santini,J.M. and Bertin,P.N. (2013) Genome Biol Evol 5 (5), 934-953