ISApr4
- Family IS1595
- Group ISNwi1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_ABHC01000005 | ND | Alpha proteobacterium | Alpha proteobacterium BAL199 |
DNA section
IS Length : 3691 bp
Ends
IR Length : 20/25
IRL : GGGGATTATCCCCTTGACGCAAGTACGACCCGGACCTATATACAGGGTGT
IRR : GGAGATTATCCCCTGAGAGCAAGTAGCGCGTAGCGCCTCGCCGCAGACTT
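The "20/25" figure can be reproduced from the two end sequences above. The sketch below assumes the notation means 20 identical positions within the first 25 bases of IRL and IRR as listed; that reading, and the helper name `count_matches`, are assumptions and not part of the record.

```python
# Terminal inverted repeats exactly as listed in the Ends section.
IRL = "GGGGATTATCCCCTTGACGCAAGTACGACCCGGACCTATATACAGGGTGT"
IRR = "GGAGATTATCCCCTGAGAGCAAGTAGCGCGTAGCGCCTCGCCGCAGACTT"


def count_matches(left: str, right: str, window: int) -> int:
    """Count identical positions in the first `window` bases of two ends."""
    return sum(a == b for a, b in zip(left[:window], right[:window]))


if __name__ == "__main__":
    # Assumed reading of "20/25": matches / window length.
    print(f"{count_matches(IRL, IRR, 25)}/25")
```

The comparison is ungapped and position by position, which is enough here because the two ends align without indels over the first 25 bases.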
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CAGATAGATTGGCCT | TAATTTTT | TGCCCGTTC | 8 |
DNA sequence
GGGGATTATCCCCTTGACGCAAGTACGACCCGGACCTATATACAGGGTGTCAACGGGAGGCACCCATGTCCGAGAAGTTCACAGTTCGCGATTTCTTCAA
GCGGTTCCCGGATGACGACGCGTGCCTCCAGCATGTGATGGAAGTCCGCTTCGGTGCCCGCCACGTTTGCGCGGCTTGCGGCGTCGAGAGCACCTTTCAC
CGGATCGCCAAGCGTAAGGCGTATGCCTGCGCCGCTTGTGGTGCCCACCTGTACCCGTGCGCCGGTACGATCTTTCAGGACAGCCGCACGTCCCTGCAGA
CCTGGTTCTACGTGATCTTCCTGTTCGTCACGACCCGGCACGGTGTGTCAGGCAAGGAGATCGAGCGGACGGTCGGCGTGACGTACAAGACCGCGTGGCG
GATGGGCCAGCAGATCCGTCAACTGATGGAGAAGGCTGACGGCTTCGACCTGATGCGCGGTCACGTCGAACTGGACGAGGCTTACGTCGGCGGCCATCGC
CCCGGCAAGCGCGGTCGCGGTGCGGCTGGCAAGACCATCGTCATGGGCATGAAAGAGCGTGGCGGCCGGATGTCGGCGGAAGTCATCCCGAACATCAAGA
AGGCCACCCTGCGCGAAGTAACCCTGCGCAACGTCGAGCCGGGCTCCATCGTGTCGACCGACGAACTGATGTCCTACGGTTTACTGGACGGCGACGGGTT
CAAGCACGGGACCGTGAAGCACGGCGCCAAGGAGTTCGCCTATTACGACTATCGGCACGACGCTACGCACCACACCAACAACGTCGAGTCGTTCTGGTGG
CTGTTCAAGCGCTCCATCGCGAGCACCCACATCCACGTCTCTCAGAAGTACATGAACCGGTATCTGGCCGAATTCACGTTCCGCTCGAACTATCGCCAGA
TGCGGAACGCGATGTTCGATCTGCTGATCGGGGCGGTGTAACCACCGTCTTGATCAGTCGATCGAATGAGGCGCGGTTAGCCACAACGCCCCGCGCATCC
TCCTGCGCCATGAACTCGTCTAATCGGCCCGTCCGACGGGCCTCTTCCAGAGTAAGTCTCATGATGCCTCCGCAAAGCGCCTGAATTGGTCGTCGGTCGC
CTGCATCGGGACGATGTTTGCGACATCGCCGACCATTTGGGCCTTCTCGCCAAGTTCATCGACTGATTTGACCACAGTGTTCAACGATATCGATTCATCA
ACTGCTCTAATGTCGCTAAACATCATGAACTTGTTTGACGCCGAGTTGGCGTGATTTGTCATATATTCGAATATCGCGCGCCTTCTGTTGGGGAACACGA
CCACGTTGTGTGCCGGCCACGCTACATGGCGACCAGAAATCTCCGCGCTCCGCGTGACGATTTTCTTCCCGAACACCTTAGCAATCCGGTCAAAAATCCG
TTCATTCTGGCTTCTGCTCTGAACGTCGGCAGCACGGCGTACAGCTTCGGCTGCGGCCCGGCTCGATGCGTTGCCAACACAGGCAACTGCCGATTCAAGG
CGCCCAATCGGCACCCACAATGCGAAGATCGCATAGCCGTCAAATCCGACGCTGAACTCATCTGCCGCCTTAGCCGCAGACGACCCATAGAACTCTTCGG
CCGCCGACAGTTCCGCTTCGACAAGCCCGTAGCCCATATCGGACACAAGAACCTTGTCGCCGTTCCGCTCCACCTGCACAACGACCATGGCGCCGCTCGG
ATACATGATCGGCAGGTCGACGACGACGCGCCCATCCATTTCGCGCGCGCCTGCGAGCCGCACCACGGCTTCCCGGACGCGCTCAATAAGTACCTGAGTG
CTCGTCGTGATGTTCATAGCAAGCGCCCCTGATCTCCAGGGTTGGGGATGTCGCTCCCATTTATGATGTTGTTTCTACTACATACTAACGCCAATGCGGT
AGGGAAATCGGCGGGTGGAGTCTCGATCCGTCGAGCCTGTTCAACAGCGCTTGCACGCAACGATTTGTCGGATCGAAGGCTGTCGTAGAACAGATGCTCG
TGGGTCTCGCCGGCAGCGAGGATCAGGCCCGGAAGATCCCCGGTGCCATACAACTTGTTCATGTGCCCGCGACGCGGGCCAAGTTCAAGGCGGTAGAGCG
GTACGTGCGTCCGGCCAGCCGGCAGATCGCATTCTAGCTGGAATGTCGCCCTAGCCAGCGAACCTGGATAGACGGTTATCCTGAACTGCACGCCACGAGG
AACGCCCCCTGCCACATCAACTCGGCACTCTAGGCAGCGCCGCTGCTCGTTCATATTCTTGGATTCAACGCTGCGCCATCTGAATGGTCGCAGCAGTTGC
TTCTCTTCTCGAAAGATCCGGTCGGCTATCTCAAACGCAATCGGCATCCGCCGATGGAACCACAACCGAGGATTTTACGCGAATCCAAACCGATGGGCGC
AACCTATAGCGTCGACAGAAACCCCGTCAGAGGTGACGGGCCTCCCCTTGCGTTTGATAGGCCGAATCTATCGCAGTCAGGACCAGTCCTGATCGTAAGG
AAGACCAAGGCGGATCGCCAATGCCTGCTTCGAAACGCCAAGAGCTTTCGAAAGCGTCCCTAGTTCCGTTTTCCCGTCATCCTGAAGACGATTCACGAGC
GGCCAAGGCATCAAAATCTCGGCGGCGTAGCGGTTCGCTTCCCGCTCAAGTTCGTTTGAGAGGCAACGGTAAAATTCGTCGTCCTCAATGCCGCTCGCTG
CTTTCCCTCGATGCAGCACAAAGTGCGCAATCTCATGAGCAACCGTGAACCTCTTACGGGCTGGCGGATGGGCAGAATTAACGATGATCGCATAGCCAGA
GTCCCCGCCGATCTCCTCGTCACGGCGAAGGACGCCGGAGACATGTCGACCGAGATCATCTTGGTAGACATTGATGCCCAGATCACGCGCAAGCGCGCCT
ACGTCAACGGGGGCGGCGCTTTGGTGCCGCCTTATTACCTCCAGCGCGCTGTCTGCTGTCTCCCCCGGGAACCTGGGGGAAGTCGGCATCCGGGCCAGGG
CCGCCACCATCATCGTCTCCCTCATCTACTTTCTTTGTCGTTCGGCGAGCCGCATCCTCACGCCTAAGATCATCCAACTGAGCGCGGACCATCTCTTTGA
ACCCATCATGCTCAAGAGCCTTTGCCACTGCCTCGGGAACGGCTTCATCCGCCTTCTTCAAGGCTAAACTCTGAATCGTATTGTATCCCCACACCCCGAG
AATGGCTATTCCGATACCCAGCGCCGTGATCACGACGACTGCCGTCGTTAGCAATATGGTGATCAGATCGATGAACTTCCAACCTTCCGGCGCGTCATCA
ATCACGTGAGAGCCTACAAACACCAAACCCAAGCCGATGATGACTAGGGCGGTTATGTAGACGATGAACTCAAGCGGCTTCATGCGGGGAGTGTGCAGCA
ATCGCCTTGGGAGCGTCAATTTCGTGAGATGCGGGCTCGCCTCGATCTCATAAACCTGGGTTCAAGCGAGTGCACCCCGGTGCACCGATCAGCCTGATAT
GGCTTGCATGCGTTGGTGCCAAGGCGATACGCCGCACCACCGACAATGCCAGCATTTAGACAGGTGAAATGCTACCATTGAATCCGACACCGGATCGGAC
TCGGCTCTCATTGATCTCATGAGACGACCTTGGTGCGACGCAAGTCTGCGGCGAGGCGCTACGCGCTACTTGCTCTCAGGGGATAATCTCC
Protein section
ORF number : 5
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
876 bp | 276 aa | 66 | 941 | + | No |
Chemistry : DDE
ORF sequence :
MSEKFTVRDFFKRFPDDDACLQHVMEVRFGARHVCAACGVESTFHRIAKRKAYACAACGAHLYPCAGTIFQDSRTSLQTWFYVIFLFVTTRHGVSGKEIE
RTVGVTYKTAWRMGQQIRQLMEKADGFDLMRGHVELDEAYVGGHRPGKRGRGAAGKTIVMGMKERGGRMSAEVIPNIKKATLREVTLRNVEPGSIVSTDE
LMSYGLLDGDGFKHGTVKHGAKEFAYYDYRHDATHHTNNVESFWWLFKRSIASTHIHVSQKYMNRYLAEFTFRSNY
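As a quick check that the Begin coordinate of ORF 1 lines up with the listed protein, the first line of the DNA sequence can be translated from position 66 on the plus strand. This is a minimal sketch that assumes 1-based coordinates and the standard genetic code; the `translate` helper and the compact codon table are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First 100 bases of the element, copied from the DNA sequence section.
FIRST_LINE = (
    "GGGGATTATCCCCTTGACGCAAGTACGACCCGGACCTATATACAGGGTGT"
    "CAACGGGAGGCACCCATGTCCGAGAAGTTCACAGTTCGCGATTTCTTCAA"
)


def translate(dna: str, begin: int) -> str:
    """Translate complete codons of `dna` starting at 1-based position `begin`."""
    start = begin - 1  # convert to 0-based index
    codons = (dna[i:i + 3] for i in range(start, len(dna) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


if __name__ == "__main__":
    print(translate(FIRST_LINE, 66))
```

The result can be compared with the first residues of the ORF 1 sequence above; only complete codons within the 100-base fragment are translated.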
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
759 bp | 252 aa | 1059 | 1817 | - | No |
Annotation : Hypothetical protein
Description :
ORF sequence :
MNITTSTQVLIERVREAVVRLAGAREMDGRVVVDLPIMYPSGAMVVVQVERNGDKVLVSDMGYGLVEAELSAAEEFYGSSAAKAADEFSVGFDGYAIFAL
WVPIGRLESAVACVGNASSRAAAEAVRRAADVQSRSQNERIFDRIAKVFGKKIVTRSAEISGRHVAWPAHNVVVFPNRRRAIFEYMTNHANSASNKFMMF
SDIRAVDESISLNTVVKSVDELGEKAQMVGDVANIVPMQATDDQFRRFAEAS
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
534 bp | 177 aa | 1814 | 2347 | - | No |
Annotation : Hypothetical protein
Description :
ORF sequence :
MPIAFEIADRIFREEKQLLRPFRWRSVESKNMNEQRRCLECRVDVAGGVPRGVQFRITVYPGSLARATFQLECDLPAGRTHVPLYRLELGPRRGHMNKLY
GTGDLPGLILAAGETHEHLFYDSLRSDKSLRASAVEQARRIETPPADFPTALALVCSRNNIINGSDIPNPGDQGRLL
Blast result :
ORF 4
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
549 bp | 182 aa | 2477 | 3025 | - | No |
Annotation : Hypothetical protein
Description :
ORF sequence :
MRETMMVAALARMPTSPRFPGETADSALEVIRRHQSAAPVDVGALARDLGINVYQDDLGRHVSGVLRRDEEIGGDSGYAIIVNSAHPPARKRFTVAHEIA
HFVLHRGKAASGIEDDEFYRCLSNELEREANRYAAEILMPWPLVNRLQDDGKTELGTLSKALGVSKQALAIRLGLPYDQDWS
Blast result :
ORF 5
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
123 bp | 40 aa | 3026 | 3148 | - | No |
Annotation : Hypothetical protein
Description :
ORF sequence :
MKPFPRQWQRLLSMMGSKRWSALSWMILGVRMRLAERQRK
Blast result :
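The ORF tables above relate coordinates and lengths in a simple way, which can be checked mechanically. The sketch below assumes that Begin and End are 1-based and inclusive, and that the bp length includes the stop codon, so the residue count would be bp / 3 - 1. Both conventions are assumptions about how the entry was built, and the tuple layout and function names are hypothetical.

```python
# (ORF number, begin, end, listed bp, listed aa), copied from the ORF tables.
ORFS = [
    (1, 66, 941, 876, 276),
    (2, 1059, 1817, 759, 252),
    (3, 1814, 2347, 534, 177),
    (4, 2477, 3025, 549, 182),
    (5, 3026, 3148, 123, 40),
]


def span_bp(begin: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    return end - begin + 1


def expected_aa(bp: int) -> int:
    """Residue count if the bp length includes one stop codon (assumption)."""
    return bp // 3 - 1


if __name__ == "__main__":
    for number, begin, end, bp, aa in ORFS:
        print(
            f"ORF {number}: span={span_bp(begin, end)} bp (listed {bp}), "
            f"expected {expected_aa(bp)} aa (listed {aa})"
        )
```

Under these assumptions the coordinate spans reproduce the listed bp lengths for all five ORFs, and the residue counts agree for ORFs 2 to 5. For ORF 1 the formula gives 291 where the table lists 276, so that protein length may have been derived differently.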
Comments
ISApr4 is 81% aa similar to ISSpo3.
The transposase is encoded by the first ORF; the other ORFs are passenger genes.
References
[1] Hagstrom, A., Ferriera, S., Johnson, J., Kravitz, S., Beeson, K., Sutton, G., Rogers, Y.-H., Friedman, R., Frazier, M. and Venter, J.C. (2007) Direct submission, GenBank.