ISNha5
- Family IS1595
- Group ISNha5
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
CP000319 | ND | Nitrobacter hamburgensis | Nitrobacter hamburgensis X14 |
DNA section
IS Length : 3904 bp
Ends
IR Length : 27
IRL : CGGTATTAGGTAGCAAACTCACCAAGCGACTTATTAGCCACCCCACAGGC
IRR : CGGTATTAGGTAGCAAACTCACCAAGCAATGGATGGTCTAGCGAGCCGCG
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CAGAACCGCGGTG | CGGAAGTG | GTTCGACGATCCG | 8 |
DNA sequence
CGGTATTAGGTAGCAAACTCACCAAGCGACTTATTAGCCACCCCACAGGCCGCAACGCGCGGCCATTTTGATTCTGTGAATCTGTTAGAAATCATCGCGT
TACCTTCTTTGGGACGCATCAACTTGAACATGTTGCGAAATTCGCAACACAATCAACGGTGTTGATGATGGAAAAGTGCATTGAAATATCCGTTGCAGGG
GAACCCGACTCACCATATACGGGTGAAGTCTTAAAACCGGAAGCCGATTCCATGTCCAAAACGGTCGATCAGGTCATCAGCGAGCTGGCCGCCGAAATCA
CGGCGGAAGAAGCGTCGCTGCGAAAGAAGAAAGAGACGGTCAACACGCTCTGCGGCGCTATCGGCCGTCCGCCAGCCTATTCATTGGAGAGCGCTGCTAC
GGCGCTGCCTACTCAGATTCGGTCAGATCAGTTTTACGGCCAGCCACTCGCTGCATCGGTTCGGACGATTCTGGAGATGAGGCGCGCACAGAACCTCGGC
GCGGCAGCTAATCGTGAAATCTACGATTCGTTGGTTGCCGGTGGGTACGAGTTCGATACGAAGTCGGAGGACATCGCGCAAAAATCTCTGCGCAATTCGC
TCGCGAAGAATACGGCATTGTTTCACAAGCTGCCGAACGGCCAATTTGGACTGTTGGCGTGGTACCCGAATGTGAAGAAGCCAAAAGCTAGCGCAGCCGT
TCGCGGCGAAGACGAGATTGAAGAGCCGCCGACGAATGACAATACGGGCGAGTCGATCCTTGACTGACCGTTTAAACGAAAACCGGACCCGCAAGGGTCC
GGCTGATCTCTAACCAAGTGCGGCACCCGTTGAAGGCGGGTCGAAAGGTAGGTGGTACATGATAATATAAGGCTATTCAGTACCGTCAGAGGCTATCGGT
CGCGTTGGCGCGTGACTGGTAGCTTCTGAGACGGGCCGGGGCTGGTGCCGCTAGCTGGATTTCCACCCGCGATGCTTGGGTCCAATCCCCAACCGGTCCA
CCATCATTTCCTTATCAGGATCAGAGATTCATGTCGAGCGTGGGGCCTTTATCCCGCGCTCCTTTTCGTCATCAACAAATGAAATACCGCCAGTTTCCAA
AGCAGCTTTCATTGCCGTTAGATTGTTCCGTGTCGGCTCCCGCCGACCGGCCTCAAAGTCTCGCACGGTTGAAACGCCCACGTTCGCGGCTTTTGCAAGC
GCGTCTTGAGGCCAATTTAGCCATGCTCGGGCGGCTCGGCACTGTTCCGGGGTCATTGCCCGAATCTACGGACCTAGATTATTTTCGTCAAGGTCATAGA
TTTTCGTTGACGCCATCACTTTCATTACTTATCAATGAGGTCAACGGAAAACGTTGGCCTTATGTCTCAGCACTTCCTCCTCTCAGCAAAGGCGCGGACG
CTCAGTCTGGCGAAAGTCGCGCGCCTTTCTGACGACGAAGCTTATGAGACGTTCCGGCTGATCCGCTGGGCCGCAACCGATGGCACTCCGGTCTGCCCCC
GCTGTGACTGCGCCGGCGTCTACACCTACACCACGCGCCGCCTGTTCAAGTGCAAAGCCTGCTCGCATCAGTTCTCGGTTACGTCTGGCACGATCTTCGC
GAGCCGCAAACTGCCGGTCCGCGACTATCTCTTGGCCATTGCGATCTTCGTCAACGGCGCAAAGGGCCATAGCGCGCTTCAACTGAGCCGCGATCTCGAT
TGCCAGTATAAGACTGTCTTCGTCATGGCTCACAAAATCCGCGAGGCCCTTGCTGCGGAAGCCAGCAGCGCCACGGCCTCGGGTGAGGTCGAAGTTGATG
GTGCTTACTTCGGCGGTTACGTGAAGCCTGCAAACCACAAAGGTAATCGCCGCGATCGCCGCCGCATCATCAATCAGAACGGCAAGCGCCGCGTCGTGGT
CATCATGCGTGAGCGTGGCGGTCGCACGTTGCCTTTCGTGTTCAGGAGCGAAGATGCTTCGCTTGTGAAGATCGCGCGAACCGTTGCGCCGGGCAGCGTC
ATCCACGCCGATGAAGCAACGCATTGGGACGCGCTACACGCGCGCTTCCTGACCAAGCGTATCAATCATAGCGAAAGCTATTCGGACGGCGATGCTTGCA
CCAATCAAGCCGAGTCGTTTTTCTCTCGGATGCGCCGCGCCGAAATCGGCATTCACCACCGTATCGCTGGCGATTACCTATCAGCCTACGCGGGCGAAAT
GGCTTGGCGCGAAAATAACCGTCGTGTCAGCAACGGCGAGCAATATTTGATGGTTGCGGACGCGGCGTTGCATCATCCGGTTTCGCGGCAATGGAAGGGT
TATTGGCAGCGAAAGGGCGCCTAGCGACCTTCCTGCTCTGCAATTGTGAATTTCGAATACATGACAAAATCATCGACAAGGAGATTAAGCAACGTCTCCT
CGGGAGAGGTCAGGGGAATCTCCCAGTCGGAGATTATCGCACCGCGGTTAGCCCGTGCTTCCGCAAGGATTGGTTCTATGGATGGCAAGCTAAAATCAAA
AAATGCTCGATAAACGCGATGCCTGTCATCGAACGGTCGTTTATGGAAGGGATGATTCACTATGCTGCATGAACACGACATTGCGAGCATGGGATTGGCG
TATGCGGCTTCTATCGCCTTCTCTTCCGAGTTCGAGTCAGGATTTTGGCGAATAGGAGCGATCGCAACGATCGTGGCGCACGTCAGAGCTGCTCGTTTCG
AGCTATCTGTCCGGTGCCCATTGACCAAATAGATTTTCTTGTACGCCTCTGAGATTGCGAAATAGCAATGAGCCAAGATATCCGGCATGGAGTTATGATA
AAATAGCTCCGCTCCCCGCAGAGCGGGCGACGCCAGTTCATTATCGAGCATTTGCCGGAAATACTTCGCACGCGCGATAATATCATCCGCCGCGACATTT
ACGACAGGCACGCTTACCTCAAAATTGCTAAATTACTATCGGGTGCGCCTGCGAAGTTCCGTAAGAAAGTCGACTTGGCTTGCCGAAATTTCAGCCCGTT
GAAGCCGAGCAGCTTCAAGCCACAAATCTTCGCGTGTTTTCTCGCTGTTATTGTCATTTCGGATTTCACTTTCCTGCTTATAAGCATGGGAAGGGAAAGC
CTTTGGGGCGAGAAGGCGAAGGTCTGTTGAGTTAGGCATGGTGTCCACTCCAGGTAAGGACACTACGATCTGTAGCAGCCGAACGTAAAAAAACAAAAAC
CAGCCGCAAAGGTTCCAGCTCTCTATTGGAACCTCAACGATTTGAGATGCTTAAGGGAACAGGCGCAACGAATCAACTCCAAGCATCAAAGATCGCTCAG
AATGTCAATGATTCGTTGGAGTGAAGATACCCCTATTCCAATGCGCGAGCAGGTAGGTTTTAGGATAGAAGTTTCAAGACTGCAACGTCCATCGCAAATT
ACCCTTGCAATCCCCGAGAGACCGTACCGACCCGCTCTCTTTGAGCATCCGAAGCGCCTTGCTGACCCGCCTTGTCAGGTCCGTTACGTACTTACGATCC
CGCGCGTCCTGACCGCTCAGCGCAACAATTCCTTGAGCTATCTCTCGGCTTCCTAACGGTCGCGGCGCATCCCGTAGCTCGTCCAAAATCGCCCGCGTTA
GCTCTCCGCGCCCGAACAGGACTTGCCGCTTCTGGCGTGGCATTTCGGCGTCCAGATCGCCGGTATAACCGAGCGTCCCTAGCACCCGATCCAGCGCGCT
AATGTCGTTTTTGATTTCGGCCATTCGATCGCGGATGCGCTCGGCTTCGTTGAACAGATCGGCGCGCTTTTTGAGCAAGCCCGAAATGGTGTGCTCGTAG
GTATCGGTGCGGGCAAGGCGGATGGAGTCGGACATAGGCAGGAAGGTCGCAAAACGCGGCTCGCTAGACCATCCATTGCTTGGTGAGTTTGCTACCTAAT
ACCG
TACCTTCTTTGGGACGCATCAACTTGAACATGTTGCGAAATTCGCAACACAATCAACGGTGTTGATGATGGAAAAGTGCATTGAAATATCCGTTGCAGGG
GAACCCGACTCACCATATACGGGTGAAGTCTTAAAACCGGAAGCCGATTCCATGTCCAAAACGGTCGATCAGGTCATCAGCGAGCTGGCCGCCGAAATCA
CGGCGGAAGAAGCGTCGCTGCGAAAGAAGAAAGAGACGGTCAACACGCTCTGCGGCGCTATCGGCCGTCCGCCAGCCTATTCATTGGAGAGCGCTGCTAC
GGCGCTGCCTACTCAGATTCGGTCAGATCAGTTTTACGGCCAGCCACTCGCTGCATCGGTTCGGACGATTCTGGAGATGAGGCGCGCACAGAACCTCGGC
GCGGCAGCTAATCGTGAAATCTACGATTCGTTGGTTGCCGGTGGGTACGAGTTCGATACGAAGTCGGAGGACATCGCGCAAAAATCTCTGCGCAATTCGC
TCGCGAAGAATACGGCATTGTTTCACAAGCTGCCGAACGGCCAATTTGGACTGTTGGCGTGGTACCCGAATGTGAAGAAGCCAAAAGCTAGCGCAGCCGT
TCGCGGCGAAGACGAGATTGAAGAGCCGCCGACGAATGACAATACGGGCGAGTCGATCCTTGACTGACCGTTTAAACGAAAACCGGACCCGCAAGGGTCC
GGCTGATCTCTAACCAAGTGCGGCACCCGTTGAAGGCGGGTCGAAAGGTAGGTGGTACATGATAATATAAGGCTATTCAGTACCGTCAGAGGCTATCGGT
CGCGTTGGCGCGTGACTGGTAGCTTCTGAGACGGGCCGGGGCTGGTGCCGCTAGCTGGATTTCCACCCGCGATGCTTGGGTCCAATCCCCAACCGGTCCA
CCATCATTTCCTTATCAGGATCAGAGATTCATGTCGAGCGTGGGGCCTTTATCCCGCGCTCCTTTTCGTCATCAACAAATGAAATACCGCCAGTTTCCAA
AGCAGCTTTCATTGCCGTTAGATTGTTCCGTGTCGGCTCCCGCCGACCGGCCTCAAAGTCTCGCACGGTTGAAACGCCCACGTTCGCGGCTTTTGCAAGC
GCGTCTTGAGGCCAATTTAGCCATGCTCGGGCGGCTCGGCACTGTTCCGGGGTCATTGCCCGAATCTACGGACCTAGATTATTTTCGTCAAGGTCATAGA
TTTTCGTTGACGCCATCACTTTCATTACTTATCAATGAGGTCAACGGAAAACGTTGGCCTTATGTCTCAGCACTTCCTCCTCTCAGCAAAGGCGCGGACG
CTCAGTCTGGCGAAAGTCGCGCGCCTTTCTGACGACGAAGCTTATGAGACGTTCCGGCTGATCCGCTGGGCCGCAACCGATGGCACTCCGGTCTGCCCCC
GCTGTGACTGCGCCGGCGTCTACACCTACACCACGCGCCGCCTGTTCAAGTGCAAAGCCTGCTCGCATCAGTTCTCGGTTACGTCTGGCACGATCTTCGC
GAGCCGCAAACTGCCGGTCCGCGACTATCTCTTGGCCATTGCGATCTTCGTCAACGGCGCAAAGGGCCATAGCGCGCTTCAACTGAGCCGCGATCTCGAT
TGCCAGTATAAGACTGTCTTCGTCATGGCTCACAAAATCCGCGAGGCCCTTGCTGCGGAAGCCAGCAGCGCCACGGCCTCGGGTGAGGTCGAAGTTGATG
GTGCTTACTTCGGCGGTTACGTGAAGCCTGCAAACCACAAAGGTAATCGCCGCGATCGCCGCCGCATCATCAATCAGAACGGCAAGCGCCGCGTCGTGGT
CATCATGCGTGAGCGTGGCGGTCGCACGTTGCCTTTCGTGTTCAGGAGCGAAGATGCTTCGCTTGTGAAGATCGCGCGAACCGTTGCGCCGGGCAGCGTC
ATCCACGCCGATGAAGCAACGCATTGGGACGCGCTACACGCGCGCTTCCTGACCAAGCGTATCAATCATAGCGAAAGCTATTCGGACGGCGATGCTTGCA
CCAATCAAGCCGAGTCGTTTTTCTCTCGGATGCGCCGCGCCGAAATCGGCATTCACCACCGTATCGCTGGCGATTACCTATCAGCCTACGCGGGCGAAAT
GGCTTGGCGCGAAAATAACCGTCGTGTCAGCAACGGCGAGCAATATTTGATGGTTGCGGACGCGGCGTTGCATCATCCGGTTTCGCGGCAATGGAAGGGT
TATTGGCAGCGAAAGGGCGCCTAGCGACCTTCCTGCTCTGCAATTGTGAATTTCGAATACATGACAAAATCATCGACAAGGAGATTAAGCAACGTCTCCT
CGGGAGAGGTCAGGGGAATCTCCCAGTCGGAGATTATCGCACCGCGGTTAGCCCGTGCTTCCGCAAGGATTGGTTCTATGGATGGCAAGCTAAAATCAAA
AAATGCTCGATAAACGCGATGCCTGTCATCGAACGGTCGTTTATGGAAGGGATGATTCACTATGCTGCATGAACACGACATTGCGAGCATGGGATTGGCG
TATGCGGCTTCTATCGCCTTCTCTTCCGAGTTCGAGTCAGGATTTTGGCGAATAGGAGCGATCGCAACGATCGTGGCGCACGTCAGAGCTGCTCGTTTCG
AGCTATCTGTCCGGTGCCCATTGACCAAATAGATTTTCTTGTACGCCTCTGAGATTGCGAAATAGCAATGAGCCAAGATATCCGGCATGGAGTTATGATA
AAATAGCTCCGCTCCCCGCAGAGCGGGCGACGCCAGTTCATTATCGAGCATTTGCCGGAAATACTTCGCACGCGCGATAATATCATCCGCCGCGACATTT
ACGACAGGCACGCTTACCTCAAAATTGCTAAATTACTATCGGGTGCGCCTGCGAAGTTCCGTAAGAAAGTCGACTTGGCTTGCCGAAATTTCAGCCCGTT
GAAGCCGAGCAGCTTCAAGCCACAAATCTTCGCGTGTTTTCTCGCTGTTATTGTCATTTCGGATTTCACTTTCCTGCTTATAAGCATGGGAAGGGAAAGC
CTTTGGGGCGAGAAGGCGAAGGTCTGTTGAGTTAGGCATGGTGTCCACTCCAGGTAAGGACACTACGATCTGTAGCAGCCGAACGTAAAAAAACAAAAAC
CAGCCGCAAAGGTTCCAGCTCTCTATTGGAACCTCAACGATTTGAGATGCTTAAGGGAACAGGCGCAACGAATCAACTCCAAGCATCAAAGATCGCTCAG
AATGTCAATGATTCGTTGGAGTGAAGATACCCCTATTCCAATGCGCGAGCAGGTAGGTTTTAGGATAGAAGTTTCAAGACTGCAACGTCCATCGCAAATT
ACCCTTGCAATCCCCGAGAGACCGTACCGACCCGCTCTCTTTGAGCATCCGAAGCGCCTTGCTGACCCGCCTTGTCAGGTCCGTTACGTACTTACGATCC
CGCGCGTCCTGACCGCTCAGCGCAACAATTCCTTGAGCTATCTCTCGGCTTCCTAACGGTCGCGGCGCATCCCGTAGCTCGTCCAAAATCGCCCGCGTTA
GCTCTCCGCGCCCGAACAGGACTTGCCGCTTCTGGCGTGGCATTTCGGCGTCCAGATCGCCGGTATAACCGAGCGTCCCTAGCACCCGATCCAGCGCGCT
AATGTCGTTTTTGATTTCGGCCATTCGATCGCGGATGCGCTCGGCTTCGTTGAACAGATCGGCGCGCTTTTTGAGCAAGCCCGAAATGGTGTGCTCGTAG
GTATCGGTGCGGGCAAGGCGGATGGAGTCGGACATAGGCAGGAAGGTCGCAAAACGCGGCTCGCTAGACCATCCATTGCTTGGTGAGTTTGCTACCTAAT
ACCG
Protein section
ORF number : 5
ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
227 bp | 75 aa | 1029 | 1255 | - | No |
Annotation : Transcriptional regulator, XRE familyDescription : Transcriptional Regulator factor
ORF sequence :
MTPEQCRAARAWLNWPQDALAKAANVGVSTVRDFEAGRREPTRNNLTAMKAALETGGISFVDDEKERGIKAPRST
Blast result :ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
600 bp | 199 aa | 168 | 767 | + | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MEKCIEISVAGEPDSPYTGEVLKPEADSMSKTVDQVISELAAEITAEEASLRKKKETVNTLCGAIGRPPAYSLESAATALPTQIRSDQFYGQPLAASVRT
ILEMRRAQNLGAAANREIYDSLVAGGYEFDTKSEDIAQKSLRNSLAKNTALFHKLPNGQFGLLAWYPNVKKPKASAAVRGEDEIEEPPTNDNTGESILD
ILEMRRAQNLGAAANREIYDSLVAGGYEFDTKSEDIAQKSLRNSLAKNTALFHKLPNGQFGLLAWYPNVKKPKASAAVRGEDEIEEPPTNDNTGESILD
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
990 bp | 329 aa | 1335 | 2324 | + | No |
Chemistry : DDE
ORF sequence :
MRSTENVGLMSQHFLLSAKARTLSLAKVARLSDDEAYETFRLIRWAATDGTPVCPRCDCAGVYTYTTRRLFKCKACSHQFSVTSGTIFASRKLPVRDYLL
AIAIFVNGAKGHSALQLSRDLDCQYKTVFVMAHKIREALAAEASSATASGEVEVDGAYFGGYVKPANHKGNRRDRRRIINQNGKRRVVVIMRERGGRTLP
FVFRSEDASLVKIARTVAPGSVIHADEATHWDALHARFLTKRINHSESYSDGDACTNQAESFFSRMRRAEIGIHHRIAGDYLSAYAGEMAWRENNRRVSN
GEQYLMVADAALHHPVSRQWKGYWQRKGA
AIAIFVNGAKGHSALQLSRDLDCQYKTVFVMAHKIREALAAEASSATASGEVEVDGAYFGGYVKPANHKGNRRDRRRIINQNGKRRVVVIMRERGGRTLP
FVFRSEDASLVKIARTVAPGSVIHADEATHWDALHARFLTKRINHSESYSDGDACTNQAESFFSRMRRAEIGIHHRIAGDYLSAYAGEMAWRENNRRVSN
GEQYLMVADAALHHPVSRQWKGYWQRKGA
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
591 bp | 196 aa | 2321 | 2911 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MPVVNVAADDIIARAKYFRQMLDNELASPALRGAELFYHNSMPDILAHCYFAISEAYKKIYLVNGHRTDSSKRAALTCATIVAIAPIRQNPDSNSEEKAI
EAAYANPMLAMSCSCSIVNHPFHKRPFDDRHRVYRAFFDFSLPSIEPILAEARANRGAIISDWEIPLTSPEETLLNLLVDDFVMYSKFTIAEQEGR
EAAYANPMLAMSCSCSIVNHPFHKRPFDDRHRVYRAFFDFSLPSIEPILAEARANRGAIISDWEIPLTSPEETLLNLLVDDFVMYSKFTIAEQEGR
Blast result :ORF 5
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
462 bp | 153 aa | 3374 | 3835 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MSDSIRLARTDTYEHTISGLLKKRADLFNEAERIRDRMAEIKNDISALDRVLGTLGYTGDLDAEMPRQKRQVLFGRGELTRAILDELRDAPRPLGSREIA
QGIVALSGQDARDRKYVTDLTRRVSKALRMLKESGSVRSLGDCKGNLRWTLQS
QGIVALSGQDARDRKYVTDLTRRVSKALRMLKESGSVRSLGDCKGNLRWTLQS
Blast result :
Comments
The third ORF is the transposase, others are passengers genes. ISNha5 is 86% aa similar to ISAzca1.
References
1] Copeland,A., Lucas,S., Lapidus,A., Barry,K., Detter,J.C., Glavina del Rio,T., Hammon,N., Israni,S., Dalin,E., Tice,H., Pitluck,S., Chain,P., Malfatti,S., Shin,M., Vergez,L., Schmutz,J., Larimer,F., Land,M., Kyrpides,N., Ivanova,N., Ward,B., Arp,D., Klotz,M., Stein,L., O'Mullan,G., Starkenburg,S., Sayavedra,L., Poret-Peterson,A.T., Gentry,M.E. and Richardson,P. (2006) Direct submission GenBank.