ISNisp2
- Family IS1595
- Group ISNwi1
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_AAMY01000002 | ND | Nitrobacter sp. | Nitrobacter sp. Nb-311A |
DNA section
IS Length : 2637 bp
Ends
IR Length : 20/24
IRL : GGCGATTGTGTAGTTTACAAACCCAGAAATCGTGGTAGTATCAACTTACT
IRR : GGCGATTATGCACTTTGCAAACCCTACGCGATTCACTTAGGTTGATGGAA
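The "IR Length : 20/24" value can be checked against the two terminal sequences above. The following is a minimal sketch, assuming that "20/24" means 20 identical positions within the first 24 bp of IRL and IRR, and that IRR is written as the reverse complement of the element's right end. The helper names `reverse_complement` and `count_matches` are illustrative and are not part of the record.

```python
# Terminal sequences copied from the "Ends" fields of this record.
IRL = "GGCGATTGTGTAGTTTACAAACCCAGAAATCGTGGTAGTATCAACTTACT"
IRR = "GGCGATTATGCACTTTGCAAACCCTACGCGATTCACTTAGGTTGATGGAA"
# Final 37 bp of the element, copied from the last line of the DNA sequence.
RIGHT_END = "AGTGAATCGCGTAGGGTTTGCAAAGTGCATAATCGCC"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def count_matches(a: str, b: str, span: int) -> int:
    """Count identical positions over the first `span` bases of a and b."""
    return sum(x == y for x, y in zip(a[:span], b[:span]))


# Assumption: the second number in "20/24" is the compared span.
matches = count_matches(IRL, IRR, 24)

# Assumption: IRR is reported on the strand complementary to the DNA sequence.
irr_matches_right_end = IRR.startswith(reverse_complement(RIGHT_END))

print(f"{matches}/24 matching positions")
print("IRR is the reverse complement of the right end:", irr_matches_right_end)
```

Under these assumptions the count agrees with the listed value, which suggests the first number is the match count and the second the compared span.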
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTTACAACTC | TCCCGAGC | ACAAGAACATTAAGGGA | 8 |
DNA sequence
GGCGATTGTGTAGTTTACAAACCCAGAAATCGTGGTAGTATCAACTTACTTACTGAAAACACTAGGTTTGTCATATGTCCGTTCTTTCCAAGAAGTATTT
CCACGATGAAGCAGCCGCATTCCGCCATCTAGAGAAATTGCTGTGGGGCAATGGGGTTATTTGCCCGAAATGCGGTGAGGTTGATCGCGCGGGTAGGCTG
GAGGGCGTGAGGGGCAAGAACGGCAAGGCTCGTCTGGGTCTTTGGAAGTGCTACGGCTGCCGGAAACAGTTCACGGTGCGCGTCGGTACGGTTTTCGAGT
CGGCGCATATTCCCCTGCATAAGTGCTTGCAGGCTGTGCACCTCATGGTGTCGAGCAAGAAGGGTGTTTCGTCCCATCAACTGCATCGCATTCTTGAAAT
TCAGTACAAGTCGGCGTGGTTTTTGGCGCACCGCATCCGTGAAGCGATGCGCGATGGCAAGCTCGCGCCGATGGGCGGCGAAGGCAAAATCATCGAGGCG
GACGAAACTTTCACGGGCCGTCTGGCCGGCCAGCCCAAGAATAAAAAAGGCGGTTGGGCTCATAAAAACGTGGTTCTAACGCTCGTGGAACGGGGCGGTA
TAGCCCGAAGCTTCCATGTCGATGGTACGCGCGTTGCGGATATTGTGCCGATCGTGCGAGCCAATATTCGCCGCGAAAGCAGTCTGATGACCGATGAGGC
TCGCCACTACATCAGCGTTGGTAAAGAGTTTGCCAGCCACGACAGTGTGAACCACAAAGAAGAGGAATACGTTCGCGGCAACGTCTCAACCAATACAGTT
GAGGGCTATTACAGCATCTTCAAGCGCGGCATGAAGGGTGTCTATCAGCACTGCTCGGAGAAGCATCTGCACCGTTATCTAGCAGAGTTCGACTTCCGGT
ATAGCAACCGCATTGCACTGGGCGTGGACGATCAGGATAGGGCGGACGAGGCCATAAGGGGAATGGTAGGCAAGCGCCTAACATATCGGGTCACTGACAG
CGCGGCGGCCCGGTAGATGGGAGAGGAAGGCGGCGCGAATCAGTGCAAGAGGTGAGCAGCTAGAATTCGGGTTCCCCCGGAGAGAAAAGTGATTCGCGGT
CAATGAGTTATGTAACACATTGATTCAACTTGCAGAATCATATATAAAATAGCGGGACTCCGGAAGGGGGGTCGCGAGATTTATCTCGCGGCGCTGGATG
GTACGCTTGGCACCACTAGAAGCTGGGCTATTGATCATCCGGAAAACGCCTCCGGGTCCCACCATTGTGTGGGGTATCCGTAGTATCGGAAGATTTCCGC
CTGATCCGAGTCTAGCTTCGCTTTAGAGAAGCTGGGCAATAGCTTAACTCCATAGCCAGATATCTTATCAGATTCCCTTGCCATGCGAGGGCGCAGCAGT
TTTGCGTATTCGGGAGCGCAACCGCTGGTGTCGTATTTGTCGCAAGCCTTGAAAAACGATGCAGCTACCACGTCGGCGAGCTGCAACCCTTCCCGCTCCA
AATGCGGATAGACGCGAAGTAAGTTCATATGCAGCACTTCGAAGCACAATTTACCCCAAGGTAGAAAAGGCTTGTTCCGCTTCATTCGAAGCCATTCATA
GTATGCGTGCATCTGATCATATCGGAGTCCACCACGCGCGCTGTACTCAACGCGAAGATATCGAGGTTCGCCAAAGTCCAGCATAGAGCGATATAAGACG
AACCGTGTAACGCGCTCCAATAGAACGCGAGTCATCCAACAATAAAACCAGCACTGCGATGGAATCTTCGCTGCATCAGGGTTCGTATAGCCTTCCATGT
TCTTCTTATTGGAAGCCACCACGAAACAGCGAAGTTCCATTTCTGAAATTTTCCTGCAGACGGCGACCTTATTGGGATGGGACAGATCGGCGAAATGTAA
TGCCTTTGCTTGCTTGCTACGGAAGCCGGAACGAATATCTGTCACCCATTCGGCGACTTTGGACTCGTTGGTGGCACGAACCACTACGGCAGACAGCATT
AGCCATTCGCTAGAACCTGGAAAGTGAAGCGGTTTTACCGCCTTAAGGCCGTCGTCGCCGGACTCATCAATGTAGGCGATGTAATCATAACGGGGTGTGG
ACATGGCAAAAGTGCAGACTCAAATCGAAAAATTCAAACAAGCCGCCCGCGAGCTCGAAACGGACAACGACGAAGATCGCTTCAATGAGAAGTTAGAGAA
GATCGCTCGGCGAAAACCCGGCGCAAATCCAAAATGGAGCGGGCAATTTCCGCCCGTTCTAGGCGGCCCTAACGAGGCTCCGGAGGATTAGCTTCCAGCC
ATTTGTTTGTGCCCGCCATCGCGTTGACGAACCAGCGCTCGCCACGGAATACTTTTGTGACCCATATGCGGTCATTGTGGTCAACAAAGTCTCGAAAATG
TTCGACTAGTTCCTGCGGGGTATTTGTGACGTTGATCAACCACGTCGAGAGCTGCGTTCGGTGTGCGCCGAGCCGCCGAAGTTCGTCCCACAGTCTTTGA
TAATCTATTTGTGCGTTCTTCTTCTCTTCGACCAAGTCATACGCGATCAAATAAACGCTCATCAGAATCTCCGGTTGAGGGATCAAGTTCCATCAACCTA
AGTGAATCGCGTAGGGTTTGCAAAGTGCATAATCGCC
Protein section
ORF number : 4
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
942 bp | 313 aa | 75 | 1016 | + | No |
Chemistry : DDE
ORF sequence :
MSVLSKKYFHDEAAAFRHLEKLLWGNGVICPKCGEVDRAGRLEGVRGKNGKARLGLWKCYGCRKQFTVRVGTVFESAHIPLHKCLQAVHLMVSSKKGVSS
HQLHRILEIQYKSAWFLAHRIREAMRDGKLAPMGGEGKIIEADETFTGRLAGQPKNKKGGWAHKNVVLTLVERGGIARSFHVDGTRVADIVPIVRANIRR
ESSLMTDEARHYISVGKEFASHDSVNHKEEEYVRGNVSTNTVEGYYSIFKRGMKGVYQHCSEKHLHRYLAEFDFRYSNRIALGVDDQDRADEAIRGMVGK
RLTYRVTDSAAAR
Blast result :
ORF 2
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
870 bp | 289 aa | 2104 | 1235 | - | No |
Annotation : Phage related protein
Description :
ORF sequence :
MSTPRYDYIAYIDESGDDGLKAVKPLHFPGSSEWLMLSAVVVRATNESKVAEWVTDIRSGFRSKQAKALHFADLSHPNKVAVCRKISEMELRCFVVASNK
KNMEGYTNPDAAKIPSQCWFYCWMTRVLLERVTRFVLYRSMLDFGEPRYLRVEYSARGGLRYDQMHAYYEWLRMKRNKPFLPWGKLCFEVLHMNLLRVYP
HLEREGLQLADVVAASFFKACDKYDTSGCAPEYAKLLRPRMARESDKISGYGVKLLPSFSKAKLDSDQAEIFRYYGYPTQWWDPEAFSG
Blast result :
ORF 3
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
189 bp | 62 aa | 2103 | 2291 | + | No |
Annotation : Hypothetical protein
Description :
ORF sequence :
MAKVQTQIEKFKQAARELETDNDEDRFNEKLEKIARRKPGANPKWSGQFPPVLGGPNEAPED
Blast result :
ORF 4
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
318 bp | 105 aa | 2586 | 2269 | - | No |
Annotation : Hypothetical protein
Description :
ORF sequence :
MIPQPEILMSVYLIAYDLVEEKKNAQIDYQRLWDELRRLGAHRTQLSTWLINVTNTPQELVEHFRDFVDHNDRIWVTKVFRGERWFVNAMAGTNKWLEAN
PPEPR
Blast result :
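The ORF tables above list a nucleotide length, a residue count, and begin/end coordinates for each ORF. The following is a minimal sketch of how those values relate, assuming 1-based inclusive coordinates, that a minus-strand ORF is listed with Begin greater than End, and that each span includes one stop codon after the listed residues. The names `span_bp` and `check_orf` are illustrative and are not part of the record.

```python
# (name, begin, end, strand, listed_bp, listed_aa), copied from the ORF tables.
ORFS = [
    ("ORF 1", 75, 1016, "+", 942, 313),
    ("ORF 2", 2104, 1235, "-", 870, 289),
    ("ORF 3", 2103, 2291, "+", 189, 62),
    ("ORF 4", 2586, 2269, "-", 318, 105),
]


def span_bp(begin: int, end: int) -> int:
    """Inclusive length of a feature given 1-based begin and end coordinates."""
    return abs(end - begin) + 1


def check_orf(begin: int, end: int, strand: str,
              listed_bp: int, listed_aa: int) -> bool:
    """Check that coordinates, strand, and listed lengths agree."""
    length = span_bp(begin, end)
    # Assumption: "+" ORFs have begin < end, "-" ORFs have begin > end.
    strand_ok = (strand == "+") == (begin < end)
    # Assumption: the span covers the listed residues plus one stop codon.
    return length == listed_bp and length == 3 * (listed_aa + 1) and strand_ok


results = {name: check_orf(*fields) for name, *fields in ORFS}
for name, ok in results.items():
    print(name, "consistent" if ok else "inconsistent")
```

Under these assumptions all four ORFs are internally consistent; for example, ORF 1 spans 1016 - 75 + 1 = 942 bp, which is 3 × (313 + 1).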
Comments
ISNisp2 shares 72% amino acid similarity with ISRpa1.
References
[1] Waterbury, J., Ferriera, S., Johnson, J., Kravitz, S., Halpern, A., Remington, K., Beeson, K., Tran, B., Rogers, Y.-H., Friedman, R. and Venter, J.C. (2006) Direct submission, GenBank.