ISRgn2
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
AF439554 | ND | Ruminococcus gnavus | Ruminococcus gnavus LEMB53 |
DNA section
IS Length : 2745 bp
Ends
IR Length : 19/23
IRL : GTAAGCGCAAAATAATTCGACGGATTGTGTTGCCCTAAAAATCATTCCAT
IRR : GTAAGCGCAAAATATTGTTACGCTGCCCATGAATCACGCTTGCACCAAAA
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
AATTTAGAAA | GATGGTAA | TATTACATGG | 8 |
DNA sequence
GTAAGCGCAAAATAATTCGACGGATTGTGTTGCCCTAAAAATCATTCCATAATAATCATTAGAAAATGCAAGCATTTCACTTTATCTTCTCTTGCCAGAA
ATGATAGATGCGAGCATTTTACTCTATCCGATTTATGGAGGTGTGCTCATGAGAAGCAAAAGAATACCTGCCGAAGAACAATACCGTCTCATCATGGAAT
GCCGTCAAAGTGGATTGACAGATCATCAATGGTGTGTGGAACACGACATCAAACCGGGAACTTTTTACAACTGGGTAAAAAGGCTGCGTCAGAAAGGTTG
TGTGGATTTGCCAGCGTCAACCGGACGCAGCTATCGTGCACCGGAAAACCAGGAAGTTGTCAGAGTGGATTTTCATGATACTGACCCGCTCCAATATGAA
CAGCCATTAAATGTGATTCCGGTAGCTACGGAAAGAAATAACCTTTCCGTAGCAGAGCCAATGAAGCTGTCTGTAGGAAGCTTTCATCTAACAATACCAA
ATGGAACAGATCCTCAGCTTCTGGCTCAAACGCTTCGCATTGTGAAGGAGTTGGAATGTTAGGGGATATCACAGCCGCCGATGAAATCTATATCGTAACA
GGCAGAACGGATATGCGGAAATCCATTGACGGGCTGTGTGCTATTGTAGAGGATCAACTCCATATGGATCCAAGGCGAAGTGCCCTGTATCTTTTCTGTG
GAAAACGTTGTGACAGGATCAAAGCTTTGCTCTGGGAATCTGACGGATTTGTGCTGCTGTATAAACGTATGGAAGTCCAGGGAAGATTCCGCTGGCCCAG
AAATCAGTTGGAAGTAAAACAATTGACCTGGCAGCAATTCGACTGGCTCATGTCCGGGCTTGAAATCGAACAACCAAAGGCATTCAAACCTACGGAATGA
TCGCGTAAAACATCTATGCCAACCAACAGAAACGGAGGGGGTTCTCACCCTATATCATGTCTGAAATCCCTTGTAAATGCTGGGTTTTCTGCTTCTTTTT
CATTGCTTTTTATGGTATAATAAGAGCAGTGAAAAAGGAGCAGAAAACATATGGCTGGTAATTCAAAAGATTCAAAAATCCTCGCGTACAAGGACCTGAT
CAACCAACTGAATAAGACGATTTCCACACAGACAGAACTGATTCAGTCTTTACAAAAAACATTGGAAGCAGACCGTCTGGAAAAAGAAAACCTTCGTCAG
CAGATTGAATATCTCACGAAGAAGCTTTTTGGTACTTCCAGTGAAAAACGGAAAGATATCGATGGTCAGCTGAATCTTTTTGATGAAGCCGAGCAGGAAG
CAGATCCGACATGGAAACAGGAACTTCCTGATGACATCACTGTTCCGGAACATAAGCGAAAAGCACGACGGACACATGCAGATCTCTTTAAAAATGTTCC
TTCCTGCGACGAGATCATTTCTCTTCCGGAGGAGGAGCGAAACTATCCAACCTGCGGAACGCAGATGGAATGTATTGGGAAAGAATTCGTCCGCCATGAA
TTTCGCTTTACCCCTGCCAAAGGAAAAGTAGTAAATATCTATCGTGAAACCTATAAATGCCCGGAATGTGCCATATCAGAGGAACACCCAGATGATCAGA
CATTTGTCAAAGCGCCTGTCCAGGAACCATTGATCCCAGAAAGTTATGCATCGGAATCCGTTGTAGGATGGGCAATGCACCAGAAGTACCAGAATGGTCT
GCCATTAAACCGGCAGGAATCGGAATGGAAGCAGCTGGGTGTCCCATTAAGCCGGGCTACGCTTGCTAACTGGATCATTTACTGTGCCGAGAATTACCTC
TGTCATGTTTATGATTATTTTCACCGTCAGTTACGGATGCGTAAATATCTGATGGCAGATGAAACCCGGGTTCAGGTACTGAATGAGCCGGAGCGCAATC
CTGAAACAGATTCCTGGATGTGGCTCTTCCGCAGTGGAGAAGATGGGCTTCCGCCGATCCTGCTCTATCATTACACAGAGACAAGGGCAAAGTTCCATGC
GGCATCTTTCCTACAGGGGTTCAGCGGATATCTGGAGACTGATGGATACCAGGGTTATAACAATCTGCCGGATATCAAACGATGCTCTTGCTGGGCACAT
GTGAGACGTTACTTCACAGATGCCATACCGAAAGGGAAAGAGTATGATTACAGCCTTCCGGCAGTGCAGGGAGTACAGTTCTGCTCCAAGCTGTTTGATT
GTGAGCGGTACTCAAAAGCAAAAAATCATACTGCGGAGCAGAGAAAACAGTTCCGTCTTGAAAAGGAGAAGCCGATACTGGAGGCATTCTGGAATTGGCT
GGATCAACAGCGTCCAAACAAGGGAACCCGTTTGGCGAAAGCGGTGAACTATGCCCAAAATCGGAAAGACACCCTGATGACCTATCTGGAAGACGGTCAT
TGCAGTTTATCCAATAATCTTAGCGAGAATGCAATCAGACCATTCACTGTTGGCCGGAAAAACTGGCTGTTCAGTGCCAGCCCGAAGGGAGCTGCCTCTA
GCGCTATTGTGTATACAATGGTTGAGATGGCAAAAGCGAATGACCTGAATACCTACAAATATCTGACATATCTCTTATCACAGCGGCCAGACGCTAAAAT
GTCAGATGAACAGTTGGAACAGCTTGCCCCATGGAGCGAGACTGCGAAAGCGAACTGTCAAAACTAAACATAGAGCAAAACGCTTGCATCTATCATTTTG
GTGCAAGCGTGATTCATGGGCAGCGTAACAATATTTTGCGCTTAC
ATGATAGATGCGAGCATTTTACTCTATCCGATTTATGGAGGTGTGCTCATGAGAAGCAAAAGAATACCTGCCGAAGAACAATACCGTCTCATCATGGAAT
GCCGTCAAAGTGGATTGACAGATCATCAATGGTGTGTGGAACACGACATCAAACCGGGAACTTTTTACAACTGGGTAAAAAGGCTGCGTCAGAAAGGTTG
TGTGGATTTGCCAGCGTCAACCGGACGCAGCTATCGTGCACCGGAAAACCAGGAAGTTGTCAGAGTGGATTTTCATGATACTGACCCGCTCCAATATGAA
CAGCCATTAAATGTGATTCCGGTAGCTACGGAAAGAAATAACCTTTCCGTAGCAGAGCCAATGAAGCTGTCTGTAGGAAGCTTTCATCTAACAATACCAA
ATGGAACAGATCCTCAGCTTCTGGCTCAAACGCTTCGCATTGTGAAGGAGTTGGAATGTTAGGGGATATCACAGCCGCCGATGAAATCTATATCGTAACA
GGCAGAACGGATATGCGGAAATCCATTGACGGGCTGTGTGCTATTGTAGAGGATCAACTCCATATGGATCCAAGGCGAAGTGCCCTGTATCTTTTCTGTG
GAAAACGTTGTGACAGGATCAAAGCTTTGCTCTGGGAATCTGACGGATTTGTGCTGCTGTATAAACGTATGGAAGTCCAGGGAAGATTCCGCTGGCCCAG
AAATCAGTTGGAAGTAAAACAATTGACCTGGCAGCAATTCGACTGGCTCATGTCCGGGCTTGAAATCGAACAACCAAAGGCATTCAAACCTACGGAATGA
TCGCGTAAAACATCTATGCCAACCAACAGAAACGGAGGGGGTTCTCACCCTATATCATGTCTGAAATCCCTTGTAAATGCTGGGTTTTCTGCTTCTTTTT
CATTGCTTTTTATGGTATAATAAGAGCAGTGAAAAAGGAGCAGAAAACATATGGCTGGTAATTCAAAAGATTCAAAAATCCTCGCGTACAAGGACCTGAT
CAACCAACTGAATAAGACGATTTCCACACAGACAGAACTGATTCAGTCTTTACAAAAAACATTGGAAGCAGACCGTCTGGAAAAAGAAAACCTTCGTCAG
CAGATTGAATATCTCACGAAGAAGCTTTTTGGTACTTCCAGTGAAAAACGGAAAGATATCGATGGTCAGCTGAATCTTTTTGATGAAGCCGAGCAGGAAG
CAGATCCGACATGGAAACAGGAACTTCCTGATGACATCACTGTTCCGGAACATAAGCGAAAAGCACGACGGACACATGCAGATCTCTTTAAAAATGTTCC
TTCCTGCGACGAGATCATTTCTCTTCCGGAGGAGGAGCGAAACTATCCAACCTGCGGAACGCAGATGGAATGTATTGGGAAAGAATTCGTCCGCCATGAA
TTTCGCTTTACCCCTGCCAAAGGAAAAGTAGTAAATATCTATCGTGAAACCTATAAATGCCCGGAATGTGCCATATCAGAGGAACACCCAGATGATCAGA
CATTTGTCAAAGCGCCTGTCCAGGAACCATTGATCCCAGAAAGTTATGCATCGGAATCCGTTGTAGGATGGGCAATGCACCAGAAGTACCAGAATGGTCT
GCCATTAAACCGGCAGGAATCGGAATGGAAGCAGCTGGGTGTCCCATTAAGCCGGGCTACGCTTGCTAACTGGATCATTTACTGTGCCGAGAATTACCTC
TGTCATGTTTATGATTATTTTCACCGTCAGTTACGGATGCGTAAATATCTGATGGCAGATGAAACCCGGGTTCAGGTACTGAATGAGCCGGAGCGCAATC
CTGAAACAGATTCCTGGATGTGGCTCTTCCGCAGTGGAGAAGATGGGCTTCCGCCGATCCTGCTCTATCATTACACAGAGACAAGGGCAAAGTTCCATGC
GGCATCTTTCCTACAGGGGTTCAGCGGATATCTGGAGACTGATGGATACCAGGGTTATAACAATCTGCCGGATATCAAACGATGCTCTTGCTGGGCACAT
GTGAGACGTTACTTCACAGATGCCATACCGAAAGGGAAAGAGTATGATTACAGCCTTCCGGCAGTGCAGGGAGTACAGTTCTGCTCCAAGCTGTTTGATT
GTGAGCGGTACTCAAAAGCAAAAAATCATACTGCGGAGCAGAGAAAACAGTTCCGTCTTGAAAAGGAGAAGCCGATACTGGAGGCATTCTGGAATTGGCT
GGATCAACAGCGTCCAAACAAGGGAACCCGTTTGGCGAAAGCGGTGAACTATGCCCAAAATCGGAAAGACACCCTGATGACCTATCTGGAAGACGGTCAT
TGCAGTTTATCCAATAATCTTAGCGAGAATGCAATCAGACCATTCACTGTTGGCCGGAAAAACTGGCTGTTCAGTGCCAGCCCGAAGGGAGCTGCCTCTA
GCGCTATTGTGTATACAATGGTTGAGATGGCAAAAGCGAATGACCTGAATACCTACAAATATCTGACATATCTCTTATCACAGCGGCCAGACGCTAAAAT
GTCAGATGAACAGTTGGAACAGCTTGCCCCATGGAGCGAGACTGCGAAAGCGAACTGTCAAAACTAAACATAGAGCAAAACGCTTGCATCTATCATTTTG
GTGCAAGCGTGATTCATGGGCAGCGTAACAATATTTTGCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
414 bp | 137 aa | 149 | 562 | + | No |
AG : IS66 TnpA
ORF sequence :
MRSKRIPAEEQYRLIMECRQSGLTDHQWCVEHDIKPGTFYNWVKRLRQKGCVDLPASTGRSYRAPENQEVVRVDFHDTDPLQYEQPLNVIPVATERNNLS
VAEPMKLSVGSFHLTIPNGTDPQLLAQTLRIVKELEC
VAEPMKLSVGSFHLTIPNGTDPQLLAQTLRIVKELEC
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
345 bp | 114 aa | 556 | 900 | + | No |
AG : IS66 TnpB
ORF sequence :
MLGDITAADEIYIVTGRTDMRKSIDGLCAIVEDQLHMDPRRSALYLFCGKRCDRIKALLWESDGFVLLYKRMEVQGRFRWPRNQLEVKQLTWQQFDWLMS
GLEIEQPKAFKPTE
GLEIEQPKAFKPTE
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1617 bp | 538 aa | 1051 | 2667 | + | No |
Chemistry : DDE
ORF sequence :
MAGNSKDSKILAYKDLINQLNKTISTQTELIQSLQKTLEADRLEKENLRQQIEYLTKKLFGTSSEKRKDIDGQLNLFDEAEQEADPTWKQELPDDITVPE
HKRKARRTHADLFKNVPSCDEIISLPEEERNYPTCGTQMECIGKEFVRHEFRFTPAKGKVVNIYRETYKCPECAISEEHPDDQTFVKAPVQEPLIPESYA
SESVVGWAMHQKYQNGLPLNRQESEWKQLGVPLSRATLANWIIYCAENYLCHVYDYFHRQLRMRKYLMADETRVQVLNEPERNPETDSWMWLFRSGEDGL
PPILLYHYTETRAKFHAASFLQGFSGYLETDGYQGYNNLPDIKRCSCWAHVRRYFTDAIPKGKEYDYSLPAVQGVQFCSKLFDCERYSKAKNHTAEQRKQ
FRLEKEKPILEAFWNWLDQQRPNKGTRLAKAVNYAQNRKDTLMTYLEDGHCSLSNNLSENAIRPFTVGRKNWLFSASPKGAASSAIVYTMVEMAKANDLN
TYKYLTYLLSQRPDAKMSDEQLEQLAPWSETAKANCQN
HKRKARRTHADLFKNVPSCDEIISLPEEERNYPTCGTQMECIGKEFVRHEFRFTPAKGKVVNIYRETYKCPECAISEEHPDDQTFVKAPVQEPLIPESYA
SESVVGWAMHQKYQNGLPLNRQESEWKQLGVPLSRATLANWIIYCAENYLCHVYDYFHRQLRMRKYLMADETRVQVLNEPERNPETDSWMWLFRSGEDGL
PPILLYHYTETRAKFHAASFLQGFSGYLETDGYQGYNNLPDIKRCSCWAHVRRYFTDAIPKGKEYDYSLPAVQGVQFCSKLFDCERYSKAKNHTAEQRKQ
FRLEKEKPILEAFWNWLDQQRPNKGTRLAKAVNYAQNRKDTLMTYLEDGHCSLSNNLSENAIRPFTVGRKNWLFSASPKGAASSAIVYTMVEMAKANDLN
TYKYLTYLLSQRPDAKMSDEQLEQLAPWSETAKANCQN
Blast result :
Comments
ISRgn2 is 45% (ORFA), 76% (ORFB) and 63% (ORFC, the transposase) aa similar to ISCth11.
References
1] Marcille,F., Gomez,A., Joubert,P., Ladire,M., Veau,G., Clara,A., Gavini,F., Willems,A. and Fons,M. (2002) Appl. Environ. Microbiol. 68 (7), 3424-3431