ISVsp3
- Family IS66
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_ABIZ00000000 | ND | Verrucomicrobium spinosum | Verrucomicrobium spinosum DSM 4136 |
DNA section
IS Length : 2446 bp
Ends
IR Length : 17/23
IRL : GTAAGCGTCAAGCCGATCCCCGTCTGATGTCGCTATGAAGCGTTATCACC
IRR : GTAAGCGAGTAACCAAGCCCCGTGGGTGCAGGGCTTGACTTTTACTGGTC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CCGGGCGGGA | GCTTGACA | ACGCGGCCGC | 8 |
DNA sequence
GTAAGCGTCAAGCCGATCCCCGTCTGATGTCGCTATGAAGCGTTATCACCATCAGTCATAGCTGAGGGGTATGACTCATAGTGATCCTTCTTCCATAGCG
TCAGCTCCACTCAAGCGTGATGTCAGGTCCAGGGTGCGCAGCACCCCGGCACAACGACACGCCGCGCTGGAGGCCTATGCCCGCTCAGGCCTGAGCGGGC
CCCAGTTCGCCAGATCTGCTGGCATCAAGTACCAAACGCTGGTCTCCTGGCGCAGGCAGGCCAAAATCTCCACCTGCATTGCCACTCAGGCTGCCTCCCC
TCCCATCGTGGCGTTTCTGGAGGCCGTCCCCATCTCCCCGCTGCCTTCGACACGCCTGGATCTTCTGCTGCCCGGCACCGCCCGCCTGCACCTGTCCTGC
TCCTCGCAGATGCCCCTGGTTGCAGCGCTGCTGCGTCAACTCGAACTCAAGCATTCCCCGGACCTCCACTCATGCTGAGCTTCTCAGGCAGCCTCAAGGT
CTTCATCGCCCTTCAACCCTGTGACATGAGGGCCGGGGTGGGCACCCTTCAGGGCATGGTCAGTGCGCAGTTGCAGGACGATCCCCGCAGCGGCTCGCTC
TTTGTCTTCAGCAACAGGCGCCACTCGGTGCTCAAGATCCTCTACTGGGACGGTTCGGGCTGGTGGCTGCTGGCCAAACGGCTCGAACAAGGCACCTTCA
GCTGGCCGGCTGTCGCCGATCCTGCACAGACCAAGCTGGCCCTGGCCCCGCAGGCCCTGGCCATGCTCACCGACGGCATTGATCTGCGGGGCGCGAAGAT
GCGCCCCTGGTATGAACGTGAGACGGGATAGTTTCCTGCTTCCTGCATCTGAAGCTGGACAAGGGCGCGGCTTGAGGCCACATCACGAGCCTCGTGACTC
CTGAACCTGACGCCGCCACTTTGATCGCAGAGCTTCGTGGGGAGCTTGCGGCGGTGCGTCTGGAAAACAAGCTGCTGCGCCTCAAGCTCGATGCCCTCTC
GCGCCGGATGTTCGGCAAAAGCAGCGAGAAGCTGGATGCAGAGCAGATGCAGCTGCTGCTCGACGGGATCGAGGAGCTTACGCTGGCCGAAGAGTCGGCC
CGTCAGACCCGGCCGGGCAGGGAGCCGTCCACCCCGGAGCCAGCCCGGGAGCGCAAGCCCCGCATCCCGGAGCACCTGCCGGTTAAAGAAGTGTTCATCG
ACCCCGAAGAGGTCAAGGCCTGTCCTGAGGACTGGGTGCACATCGGCGAAGAGGTCACCGAGCAGCTTGAGTACACCCCGGCAAGCTTCGAGCGCTTGCG
CATCATCCGGCGCAAGTATGTGCGCAAGGACCAGCGGCATCTGCCTCCGGTGGTGGCTCCGTTGCAGCCCTGCCTGCAGGAGCGCTGCGTGGCAGCGCCC
TCGCTGCTGGCCCACAGCATGGTCTCACGTTACCGGGACCATCTGCCCTGGCACCGGCTGGAGGGCATCTACGCCGGGCTGGGCGTGGAGATCTCGCGCC
AGACGCTGGCCAACTGGAGCGGGATGACCGCAGAGGCCTGCGGGCTGCTCATGCGCGAGATCCATCAGAGCGTGTTTGCCAGCGGGTATGTCCAGCTTGA
TGAGACACCCATTGAGTACTTGAGCCCCGGTCATGGCCAGACCAAAACGGGTTACCTGTGGGTGGCGCACTCGCCGCTGACCCAGGAGACGTTCTTCCAG
TGGCACACCGGACGGGCGGCCTTTTGTCTGGAGAGCCTGGTGCCTGCCGGGTTCGAGGGCATCATCCAGTGCGATGGGTATGCCGCTTACGCGTCTTTTG
CCCAGAGCCGGGAACGTGCGGGCACGATCCGGCTGGCGGGCTGCTGGGCGCATGCCCGGCGGAAGTTTTTTGAGGCCAGCACCTACTGCCAGGATGCCCT
GTGGGTGCTGGTGCAGATCCGGGAGCTTTATGCCGTGGAGGAAGAGTTGCGCGAGATCCGGGCTGGACCGGAGCAAAGACAGGCGGCCCGAGAAGCCCGA
AGCCGCCTTGTGATGGGGCAGGTCTATGAGAAGCTGGAGCAATGGCAGCAGCAACGCAAACACCTGCCCAAGAGTCCGACAGGGGCTGCGATACGCTATG
CCCTCAACCAGCGTGCGAGCCTTGAAGTGTTCCTTGAGGACGGACGCGTGGAGGTGGACAACAACCTGGTGGAGAACACGATCCGGCCCAGTGCCATCGG
CAAGAAGAACTGGTTGTTCGTGGGCGACGCGCAGGCGGGAGTGCGTGCGGCGACGTTTTACACGCTGTTGGACAATGCGAAGCGGGCTGGGGCGGATGCC
TACGAGTATCTCAAGGACCTGTTCACGAAGCTGCCAGCGATGACCAATCAACAGATGAAGGAAATCACGCCCCGGGCGTGGATGGCACGGCGTGCCGACC
AGTAAAAGTCAAGCCCTGCACCCACGGGGCTTGGTTACTCGCTTAC
TCAGCTCCACTCAAGCGTGATGTCAGGTCCAGGGTGCGCAGCACCCCGGCACAACGACACGCCGCGCTGGAGGCCTATGCCCGCTCAGGCCTGAGCGGGC
CCCAGTTCGCCAGATCTGCTGGCATCAAGTACCAAACGCTGGTCTCCTGGCGCAGGCAGGCCAAAATCTCCACCTGCATTGCCACTCAGGCTGCCTCCCC
TCCCATCGTGGCGTTTCTGGAGGCCGTCCCCATCTCCCCGCTGCCTTCGACACGCCTGGATCTTCTGCTGCCCGGCACCGCCCGCCTGCACCTGTCCTGC
TCCTCGCAGATGCCCCTGGTTGCAGCGCTGCTGCGTCAACTCGAACTCAAGCATTCCCCGGACCTCCACTCATGCTGAGCTTCTCAGGCAGCCTCAAGGT
CTTCATCGCCCTTCAACCCTGTGACATGAGGGCCGGGGTGGGCACCCTTCAGGGCATGGTCAGTGCGCAGTTGCAGGACGATCCCCGCAGCGGCTCGCTC
TTTGTCTTCAGCAACAGGCGCCACTCGGTGCTCAAGATCCTCTACTGGGACGGTTCGGGCTGGTGGCTGCTGGCCAAACGGCTCGAACAAGGCACCTTCA
GCTGGCCGGCTGTCGCCGATCCTGCACAGACCAAGCTGGCCCTGGCCCCGCAGGCCCTGGCCATGCTCACCGACGGCATTGATCTGCGGGGCGCGAAGAT
GCGCCCCTGGTATGAACGTGAGACGGGATAGTTTCCTGCTTCCTGCATCTGAAGCTGGACAAGGGCGCGGCTTGAGGCCACATCACGAGCCTCGTGACTC
CTGAACCTGACGCCGCCACTTTGATCGCAGAGCTTCGTGGGGAGCTTGCGGCGGTGCGTCTGGAAAACAAGCTGCTGCGCCTCAAGCTCGATGCCCTCTC
GCGCCGGATGTTCGGCAAAAGCAGCGAGAAGCTGGATGCAGAGCAGATGCAGCTGCTGCTCGACGGGATCGAGGAGCTTACGCTGGCCGAAGAGTCGGCC
CGTCAGACCCGGCCGGGCAGGGAGCCGTCCACCCCGGAGCCAGCCCGGGAGCGCAAGCCCCGCATCCCGGAGCACCTGCCGGTTAAAGAAGTGTTCATCG
ACCCCGAAGAGGTCAAGGCCTGTCCTGAGGACTGGGTGCACATCGGCGAAGAGGTCACCGAGCAGCTTGAGTACACCCCGGCAAGCTTCGAGCGCTTGCG
CATCATCCGGCGCAAGTATGTGCGCAAGGACCAGCGGCATCTGCCTCCGGTGGTGGCTCCGTTGCAGCCCTGCCTGCAGGAGCGCTGCGTGGCAGCGCCC
TCGCTGCTGGCCCACAGCATGGTCTCACGTTACCGGGACCATCTGCCCTGGCACCGGCTGGAGGGCATCTACGCCGGGCTGGGCGTGGAGATCTCGCGCC
AGACGCTGGCCAACTGGAGCGGGATGACCGCAGAGGCCTGCGGGCTGCTCATGCGCGAGATCCATCAGAGCGTGTTTGCCAGCGGGTATGTCCAGCTTGA
TGAGACACCCATTGAGTACTTGAGCCCCGGTCATGGCCAGACCAAAACGGGTTACCTGTGGGTGGCGCACTCGCCGCTGACCCAGGAGACGTTCTTCCAG
TGGCACACCGGACGGGCGGCCTTTTGTCTGGAGAGCCTGGTGCCTGCCGGGTTCGAGGGCATCATCCAGTGCGATGGGTATGCCGCTTACGCGTCTTTTG
CCCAGAGCCGGGAACGTGCGGGCACGATCCGGCTGGCGGGCTGCTGGGCGCATGCCCGGCGGAAGTTTTTTGAGGCCAGCACCTACTGCCAGGATGCCCT
GTGGGTGCTGGTGCAGATCCGGGAGCTTTATGCCGTGGAGGAAGAGTTGCGCGAGATCCGGGCTGGACCGGAGCAAAGACAGGCGGCCCGAGAAGCCCGA
AGCCGCCTTGTGATGGGGCAGGTCTATGAGAAGCTGGAGCAATGGCAGCAGCAACGCAAACACCTGCCCAAGAGTCCGACAGGGGCTGCGATACGCTATG
CCCTCAACCAGCGTGCGAGCCTTGAAGTGTTCCTTGAGGACGGACGCGTGGAGGTGGACAACAACCTGGTGGAGAACACGATCCGGCCCAGTGCCATCGG
CAAGAAGAACTGGTTGTTCGTGGGCGACGCGCAGGCGGGAGTGCGTGCGGCGACGTTTTACACGCTGTTGGACAATGCGAAGCGGGCTGGGGCGGATGCC
TACGAGTATCTCAAGGACCTGTTCACGAAGCTGCCAGCGATGACCAATCAACAGATGAAGGAAATCACGCCCCGGGCGTGGATGGCACGGCGTGCCGACC
AGTAAAAGTCAAGCCCTGCACCCACGGGGCTTGGTTACTCGCTTAC
Protein section
ORF number : 3
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
408 bp | 135 aa | 71 | 478 | + | No |
AG : IS66 TnpA
ORF sequence :
MTHSDPSSIASAPLKRDVRSRVRSTPAQRHAALEAYARSGLSGPQFARSAGIKYQTLVSWRRQAKISTCIATQAASPPIVAFLEAVPISPLPSTRLDLLL
PGTARLHLSCSSQMPLVAALLRQLELKHSPDLHSC
PGTARLHLSCSSQMPLVAALLRQLELKHSPDLHSC
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
360 bp | 119 aa | 472 | 831 | + | No |
AG : IS66 TnpB
ORF sequence :
MLSFSGSLKVFIALQPCDMRAGVGTLQGMVSAQLQDDPRSGSLFVFSNRRHSVLKILYWDGSGWWLLAKRLEQGTFSWPAVADPAQTKLALAPQALAMLT
DGIDLRGAKMRPWYERETG
DGIDLRGAKMRPWYERETG
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1485 bp | 494 aa | 921 | 2405 | + | No |
Chemistry : DDE
ORF sequence :
MIAELRGELAAVRLENKLLRLKLDALSRRMFGKSSEKLDAEQMQLLLDGIEELTLAEESARQTRPGREPSTPEPARERKPRIPEHLPVKEVFIDPEEVKA
CPEDWVHIGEEVTEQLEYTPASFERLRIIRRKYVRKDQRHLPPVVAPLQPCLQERCVAAPSLLAHSMVSRYRDHLPWHRLEGIYAGLGVEISRQTLANWS
GMTAEACGLLMREIHQSVFASGYVQLDETPIEYLSPGHGQTKTGYLWVAHSPLTQETFFQWHTGRAAFCLESLVPAGFEGIIQCDGYAAYASFAQSRERA
GTIRLAGCWAHARRKFFEASTYCQDALWVLVQIRELYAVEEELREIRAGPEQRQAAREARSRLVMGQVYEKLEQWQQQRKHLPKSPTGAAIRYALNQRAS
LEVFLEDGRVEVDNNLVENTIRPSAIGKKNWLFVGDAQAGVRAATFYTLLDNAKRAGADAYEYLKDLFTKLPAMTNQQMKEITPRAWMARRADQ
CPEDWVHIGEEVTEQLEYTPASFERLRIIRRKYVRKDQRHLPPVVAPLQPCLQERCVAAPSLLAHSMVSRYRDHLPWHRLEGIYAGLGVEISRQTLANWS
GMTAEACGLLMREIHQSVFASGYVQLDETPIEYLSPGHGQTKTGYLWVAHSPLTQETFFQWHTGRAAFCLESLVPAGFEGIIQCDGYAAYASFAQSRERA
GTIRLAGCWAHARRKFFEASTYCQDALWVLVQIRELYAVEEELREIRAGPEQRQAAREARSRLVMGQVYEKLEQWQQQRKHLPKSPTGAAIRYALNQRAS
LEVFLEDGRVEVDNNLVENTIRPSAIGKKNWLFVGDAQAGVRAATFYTLLDNAKRAGADAYEYLKDLFTKLPAMTNQQMKEITPRAWMARRADQ
Blast result :
Comments
ISVsp3 orfC (the transposase) is 52% aa similar to ISThsp3.
There is 1 intact copy of ISVsp3 in the chromosome of Verrucomicrobium spinosum DSM 4136. At least 2 other copies also carry an insertion of a gene encoding a putative reverse transcriptase.
There is 1 intact copy of ISVsp3 in the chromosome of Verrucomicrobium spinosum DSM 4136. At least 2 other copies also carry an insertion of a gene encoding a putative reverse transcriptase.
References
1] DeBoy, R. (2010) Direct submission