ISVsp4
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_ABIZ00000000 | ND | Verrucomicrobium spinosum | Verrucomicrobium spinosum DSM 4136 |
DNA section
IS Length : 2518 bp
Ends
IR Length : 18/22
IRL : GTAAGCAGATCAGCAAGGCCGTCTGGTGCGCGGAGCGCAGCCGGGCAAGT
IRR : GTATGCAGTTCAGCAATCCCGTGGCATAGCTTCAGGTTCGAGCCCTGACG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GAACCCTGCT | GTTCAACT | ACAGGGTCCG | 8 |
CACGAAGGCG | CTCCTTGT | CCTTCATGAA | 8 |
GCTCGATGTT | CGTTGACA | CGTTGCAAGT | 8 |
GGCGGCAGTC | TTCTTTGG | TGCGGTGCTG | 8 |
DNA sequence
GTAAGCAGATCAGCAAGGCCGTCTGGTGCGCGGAGCGCAGCCGGGCAAGTTTGCGCGTATGCCTCGATATACAGCCCTGCAACGTCGTCGTCTAGTGCGC
CAGTTTCAATCCAGCGGCCTGACCCTGGCAACTTTTTGCCGGCTTCATTCTTTGAGTGTCTCAAGCCTTTGTGCCTGGCGTCGCCATGTTGAACTTCAGA
AGCGCTCCTCTGATCCTGCGCCGCCTCCATGGCTGCCGGTGGAAGTTTCCTCACCTCCGGGGAGCACTGCTGCGGGACTTCCCGGCTTGGATTATCTCAT
CGAAGCGGGTCTCTGGCGTCTGCACGTCCCTTCAGGCTTTGGCCGCGATGAAGTGGCCGCCCTCCTTGAACTTCTCCATGCCAGGCAGGGAGGTGCCCCA
TGCTAAGCCTGGGTCCTGCCACCAAAGTCCATTTGCTGGCAGGCGCCACCGACATGCGCCTTGGGTTTGAGGGGCTGCTCGCCCTGGCTTCTGGTCTGTT
GCGCGAGGATCCTTTGAGCGGTCACCTCTTTGTCTTTTGCAACAAGCACCGCACCCGTCTCAAGGCCCTCTACTGGGACGGCTCGGGCTTGTGGGTGTGC
GCCAAGCGGCTGGAGAAGGGGCGCTTCCACTGGCCGCAGCCCCCAGCGCAAGGAGCTTCACGCAAGGTGGTGCTCACCCAGGCGGAGCTCACTTTGCTGC
TGGCGGGCATTGACCTGGAAGGCACCCAACGGAGGGCCTGGCACCGGGTCGGGTGAGGGTGCGGGAAGAGGTGTTTTTTAGGGTTCGCGGTAACAACTTG
TTGCAACGAGCCCGCTGCTGGGCATCATGCAGCTCCCGCTGCTCATGCCCGCCACTCCTGCCACCCCGGATCTGCCTGAAGCCCTGTTGCTTTCTTTGGG
ACCCCGGGAGCGTCTTGCCCTGGAGGAGATCCTGCGTCAGCGCGATGAGCGTATTGCCACCCTGGAACTGGAAGTGCGCCTGCGTGAGGAGCAGTTGCGG
CTGGCCCTCATCAAGAAGTACGGTCCCAAAGGCGAGGGCCTCGGCAAAGACCAGGCCCTGCTGCTGGATCTGGAGCCCGGCGTGCAGGAGGCCGAGGTGG
CCCTTGAAGTGGGCCTGGCCCTGGCGGACAAGACCCTGCCCGAGGCTGGACGCCTGGAACAGGAGCAGGCCGCCCGGCTCAAAAAGAAAAAGCCCGGCAC
CCCGCGCTACGCCCAAGTGCATCCTGGGCGGCACGAACTGCCCGCGCATCTGCCGCGGGTGGAAGTCATCCTACCATGCTTGGAAGCGGCGCAAGGCGAA
CTGGTGGGCTATGAGATCAAGGAAGAGCTCGTCATCAAGCCTGCGGAGTTCTTTGTGCGGGTGCTCAAGCGCGAGAAGCGAGTCATCCAGCTGGGGGACC
GTCGCACGGTGGCCACAGCCGCCTCCCCTGGGCGCATCGTCGACAAAGGGCAGCTGGCCAACGAGACGGTGGTGGAGCTGGTGGTGCGCAAGTACGCCGA
TTATCTGCCGGTGTACCGTCAGCTACAGGGGTGGGAGCGCGATCACAGTGTGACCGTGCGTCAGGCCACGGCGACGCGGGCGGTGATGGCCGCAGGAGCG
TTGCTGCAACCTCTGGCGCGGGCCATCGGCCAGGAGCTGCGGCAGGGGCCGCTCATTCAGGCCGATGAGACGAGGCTGCCGGTGCTGCAAGATCTGGGTA
AAGGCCGCAACGATGTGGCGTGGTTGTGGCAGTACTCCATCCCCGGAGGGCTGGTGTACTTCGAGTACCAGGACAACCGTGCCCAGGCGGGGGCGAGGGC
CTATTTGAAGGACTACGGAGGCATCCTGCAAAGCGACGGCTATGTGGTCTATGACTGTCTGGAGGGGCAGGTGCAGCGGCATGCGGGATGTTTTGCCCAC
GTGCGACGCAAGTTCGTGGAGGCGTGCCAGGCCGCCCCCAAGGAAGTGCCGTGCGAACCGGGGCTGGCGGTGGTGGCCAGCATCGGCGCGTTGTATGGAG
TTGAAGAGCGGGCGCGGGAAAAGCAGTTGAAGGGGCAGGCCCGTCTGGACTACCGGCAGGAGCAGGGTGTGGCGCAAAAGCTCTCCAGCCTCAAGGCGCA
GATCCTGGAGGTGCGGGCCAAGGCCCTCCTGCCGCAGAGCCTGCTGGCCAAGGCCTGTGACTATGCGCTCAACCAGTGGGAGAAGCTGGAGGTGTATGCC
AGCCACGGGGAGGTGGAGATAGACAACAACTGGTGTGAGAATGCCATGAGGCCGGTGGCGCTGGGGCGCAAGAACTGGCTGCATCTAGGAAGCCACGAGA
GCGGGCCCAAAGTAGCGGCGATCCTGACGGTGCTGGCCAGTGCTCAGCGGCTGGGGCTCAACGTGCGGGAGTATCTTGGGGAGGCGCTGGAGACCCTGTG
TGACGGTGAGGGGTTCAACATCACGCGCATCGGGGAACTGCTGCCCAGCCGCTGGAAGCCCAAACCTGCGTCAGGGCTCGAACCTGAAGCTATGCCACGG
GATTGCTGAACTGCATAC
CAGTTTCAATCCAGCGGCCTGACCCTGGCAACTTTTTGCCGGCTTCATTCTTTGAGTGTCTCAAGCCTTTGTGCCTGGCGTCGCCATGTTGAACTTCAGA
AGCGCTCCTCTGATCCTGCGCCGCCTCCATGGCTGCCGGTGGAAGTTTCCTCACCTCCGGGGAGCACTGCTGCGGGACTTCCCGGCTTGGATTATCTCAT
CGAAGCGGGTCTCTGGCGTCTGCACGTCCCTTCAGGCTTTGGCCGCGATGAAGTGGCCGCCCTCCTTGAACTTCTCCATGCCAGGCAGGGAGGTGCCCCA
TGCTAAGCCTGGGTCCTGCCACCAAAGTCCATTTGCTGGCAGGCGCCACCGACATGCGCCTTGGGTTTGAGGGGCTGCTCGCCCTGGCTTCTGGTCTGTT
GCGCGAGGATCCTTTGAGCGGTCACCTCTTTGTCTTTTGCAACAAGCACCGCACCCGTCTCAAGGCCCTCTACTGGGACGGCTCGGGCTTGTGGGTGTGC
GCCAAGCGGCTGGAGAAGGGGCGCTTCCACTGGCCGCAGCCCCCAGCGCAAGGAGCTTCACGCAAGGTGGTGCTCACCCAGGCGGAGCTCACTTTGCTGC
TGGCGGGCATTGACCTGGAAGGCACCCAACGGAGGGCCTGGCACCGGGTCGGGTGAGGGTGCGGGAAGAGGTGTTTTTTAGGGTTCGCGGTAACAACTTG
TTGCAACGAGCCCGCTGCTGGGCATCATGCAGCTCCCGCTGCTCATGCCCGCCACTCCTGCCACCCCGGATCTGCCTGAAGCCCTGTTGCTTTCTTTGGG
ACCCCGGGAGCGTCTTGCCCTGGAGGAGATCCTGCGTCAGCGCGATGAGCGTATTGCCACCCTGGAACTGGAAGTGCGCCTGCGTGAGGAGCAGTTGCGG
CTGGCCCTCATCAAGAAGTACGGTCCCAAAGGCGAGGGCCTCGGCAAAGACCAGGCCCTGCTGCTGGATCTGGAGCCCGGCGTGCAGGAGGCCGAGGTGG
CCCTTGAAGTGGGCCTGGCCCTGGCGGACAAGACCCTGCCCGAGGCTGGACGCCTGGAACAGGAGCAGGCCGCCCGGCTCAAAAAGAAAAAGCCCGGCAC
CCCGCGCTACGCCCAAGTGCATCCTGGGCGGCACGAACTGCCCGCGCATCTGCCGCGGGTGGAAGTCATCCTACCATGCTTGGAAGCGGCGCAAGGCGAA
CTGGTGGGCTATGAGATCAAGGAAGAGCTCGTCATCAAGCCTGCGGAGTTCTTTGTGCGGGTGCTCAAGCGCGAGAAGCGAGTCATCCAGCTGGGGGACC
GTCGCACGGTGGCCACAGCCGCCTCCCCTGGGCGCATCGTCGACAAAGGGCAGCTGGCCAACGAGACGGTGGTGGAGCTGGTGGTGCGCAAGTACGCCGA
TTATCTGCCGGTGTACCGTCAGCTACAGGGGTGGGAGCGCGATCACAGTGTGACCGTGCGTCAGGCCACGGCGACGCGGGCGGTGATGGCCGCAGGAGCG
TTGCTGCAACCTCTGGCGCGGGCCATCGGCCAGGAGCTGCGGCAGGGGCCGCTCATTCAGGCCGATGAGACGAGGCTGCCGGTGCTGCAAGATCTGGGTA
AAGGCCGCAACGATGTGGCGTGGTTGTGGCAGTACTCCATCCCCGGAGGGCTGGTGTACTTCGAGTACCAGGACAACCGTGCCCAGGCGGGGGCGAGGGC
CTATTTGAAGGACTACGGAGGCATCCTGCAAAGCGACGGCTATGTGGTCTATGACTGTCTGGAGGGGCAGGTGCAGCGGCATGCGGGATGTTTTGCCCAC
GTGCGACGCAAGTTCGTGGAGGCGTGCCAGGCCGCCCCCAAGGAAGTGCCGTGCGAACCGGGGCTGGCGGTGGTGGCCAGCATCGGCGCGTTGTATGGAG
TTGAAGAGCGGGCGCGGGAAAAGCAGTTGAAGGGGCAGGCCCGTCTGGACTACCGGCAGGAGCAGGGTGTGGCGCAAAAGCTCTCCAGCCTCAAGGCGCA
GATCCTGGAGGTGCGGGCCAAGGCCCTCCTGCCGCAGAGCCTGCTGGCCAAGGCCTGTGACTATGCGCTCAACCAGTGGGAGAAGCTGGAGGTGTATGCC
AGCCACGGGGAGGTGGAGATAGACAACAACTGGTGTGAGAATGCCATGAGGCCGGTGGCGCTGGGGCGCAAGAACTGGCTGCATCTAGGAAGCCACGAGA
GCGGGCCCAAAGTAGCGGCGATCCTGACGGTGCTGGCCAGTGCTCAGCGGCTGGGGCTCAACGTGCGGGAGTATCTTGGGGAGGCGCTGGAGACCCTGTG
TGACGGTGAGGGGTTCAACATCACGCGCATCGGGGAACTGCTGCCCAGCCGCTGGAAGCCCAAACCTGCGTCAGGGCTCGAACCTGAAGCTATGCCACGG
GATTGCTGAACTGCATAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
348 bp | 115 aa | 59 | 406 | + | No |
AG : IS66 TnpA
ORF sequence :
MPRYTALQRRRLVRQFQSSGLTLATFCRLHSLSVSSLCAWRRHVELQKRSSDPAPPPWLPVEVSSPPGSTAAGLPGLDYLIEAGLWRLHVPSGFGRDEVA
ALLELLHARQGGAPC
ALLELLHARQGGAPC
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
357 bp | 118 aa | 400 | 756 | + | No |
AG : IS66 TnpB
ORF sequence :
MLSLGPATKVHLLAGATDMRLGFEGLLALASGLLREDPLSGHLFVFCNKHRTRLKALYWDGSGLWVCAKRLEKGRFHWPQPPAQGASRKVVLTQAELTLL
LAGIDLEGTQRRAWHRVG
LAGIDLEGTQRRAWHRVG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1536 bp | 511 aa | 974 | 2509 | + | No |
Chemistry : DDE
ORF sequence :
MRLREEQLRLALIKKYGPKGEGLGKDQALLLDLEPGVQEAEVALEVGLALADKTLPEAGRLEQEQAARLKKKKPGTPRYAQVHPGRHELPAHLPRVEVIL
PCLEAAQGELVGYEIKEELVIKPAEFFVRVLKREKRVIQLGDRRTVATAASPGRIVDKGQLANETVVELVVRKYADYLPVYRQLQGWERDHSVTVRQATA
TRAVMAAGALLQPLARAIGQELRQGPLIQADETRLPVLQDLGKGRNDVAWLWQYSIPGGLVYFEYQDNRAQAGARAYLKDYGGILQSDGYVVYDCLEGQV
QRHAGCFAHVRRKFVEACQAAPKEVPCEPGLAVVASIGALYGVEERAREKQLKGQARLDYRQEQGVAQKLSSLKAQILEVRAKALLPQSLLAKACDYALN
QWEKLEVYASHGEVEIDNNWCENAMRPVALGRKNWLHLGSHESGPKVAAILTVLASAQRLGLNVREYLGEALETLCDGEGFNITRIGELLPSRWKPKPAS
GLEPEAMPRDC
PCLEAAQGELVGYEIKEELVIKPAEFFVRVLKREKRVIQLGDRRTVATAASPGRIVDKGQLANETVVELVVRKYADYLPVYRQLQGWERDHSVTVRQATA
TRAVMAAGALLQPLARAIGQELRQGPLIQADETRLPVLQDLGKGRNDVAWLWQYSIPGGLVYFEYQDNRAQAGARAYLKDYGGILQSDGYVVYDCLEGQV
QRHAGCFAHVRRKFVEACQAAPKEVPCEPGLAVVASIGALYGVEERAREKQLKGQARLDYRQEQGVAQKLSSLKAQILEVRAKALLPQSLLAKACDYALN
QWEKLEVYASHGEVEIDNNWCENAMRPVALGRKNWLHLGSHESGPKVAAILTVLASAQRLGLNVREYLGEALETLCDGEGFNITRIGELLPSRWKPKPAS
GLEPEAMPRDC
Blast result :
Comments
ISVsp3 orfA is 58% aa similar to ISSwo2, orfB is 65% aa similar toISSba7 and ISVsp3 orfC (the transposase) is 53% aa similar to ISThsp3.
There are 4 copies of ISVsp4 in the chromsome of Verrucomicrobium spinosum DSM 4136. One copy carries an insertion of a gene for reverse transcriptase within transposase orf3.
There are 4 copies of ISVsp4 in the chromsome of Verrucomicrobium spinosum DSM 4136. One copy carries an insertion of a gene for reverse transcriptase within transposase orf3.
References
1] DeBoy, R. (2010) Direct submission