ISNov3
- Family IS1595
- Group ISNha5
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NKIS01000002 | ND | Novosphingobium sp. | Novosphingobium sp. PASSN1 NODE_24 |
DNA section
IS Length : 3622 bp
Ends
IR Length : 25
IRL : CGGCATTATGTAGCAAACGCACCAAGCCTGAGGCGAACGTCATCGCCGGT
IRR : CGGCATTATGTAGCAAACGCACCAACGCAATGCTAGATGTAATTGCCTGC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CACGTGGAAC | ATCACTTG | CCGATGCAGA | 8 |
DNA sequence
CGGCATTATGTAGCAAACGCACCAAGCCTGAGGCGAACGTCATCGCCGGTGATGGCAACGGTGGTTGCTGTGATGATGCCTTGATCGTGAGGCACGCATG
ATTACCGCAATTCTTTTAGCGCTAAGCTGTATCGAAGGAGCTGGTGTGATCACCCCCACCAGTGACCCCATTCATTACTGCCTCAATGCCTCTCCACGCA
TCTGGATCCGTGATAATTTCCAAAGGAAAGCAGTGCACAGTTTTTCGCCTATCCTGCGAAAAAACATCATTATAGAAGATGGAAAAATAAACAAACAATC
TAGTTTCTCTGTGGAACGCTTTTCTCACGGATTCCATCGGGATGCAGCAATATTGAGTTTCGATTTGGGTTTGCGGAAAAAGAGGAGCACCGAACCCCAA
GTCCGCATTTTCTATCATTGGGGGCTTGCCGCTATCCGAGAAAGCCGAGAGTACAGAACATTGAACAGCCGGAGTATTTCCGACATTTCTGAATTGAGCA
GAAACCACCAAGTAGTTCCCAAAAACACCTTCAACTATGCTGTCGGTTGCAAGGCCCTTGTTCAGGCTGTGAAGCGATACGTAAGCTCGGTTTTGCGCCG
AGTGTTGCTCACTGGCGATTTGAATCGCTTGGGTAGCCAAGTCGCCAGCCCGTTTGGCCTCATTCGCGCTCGTTTCAGTATGTCTGGCAGCCTCGCGGGC
CCAGTGAGCCGCGCGCCAAGCAGCCGCGAGGGTGAAGCTCCCTATAGCCAACCCAATAATGCCAGCCCAAAACGTCCGAAAAGCCCACAATGCAGAATCA
GCAGCGGCATCTGCGGCCTTCCATTGGGCGCAAAGATCGCTTTGACGTTCATCCTGTCCGGGCTGGCAACCACCATCTGGTCGCGCAGCTTTTGGGAGTT
CTTCGACCGCCGCAGCAATCCGCTCAAGCTGGCTGGTGGCGTTCTGTTGGGATGGGGTCTGATCGCCTTGAGTTTCTTTGCATTCGCATGGACTGGATAG
GATCAGTCCAAGTGCAAGGATGAGCAGCCGATAGCTTGAAGACATTGGACCAAGAACACCGCTGTGAATTTGCCACGGCTGATCTTGTTCCTGATATTCG
GCTCAGAGTCCATCACGCCAACGTCTGCCAGCTTACCGACAAGCTGGGCGTAGGTAATCCCCTTGCGGGCAAGCTCAGCCTTGAGCAGCCGCTTCACCTT
GTCTTCCCATTCCTTTTCCTGAACCGGCATGTGCGCTCCTATGATGGTCAATGTCACCATATAGAGTGCATTTGCCATTGACAAGCGTGCATTACGTCAT
CATATAAAGTGCATAAGCACTGGAAACGATGACATGCAACACTTTCTCCTCTCTTCCGCAGCCCGCACACTGAGCCTCAAGGCCGTGTTCCGCATGGGCG
AGGACAAGGCGTATCGGACCTTCTGCGAAATGCGCTGGCCTGAGACGGACGGCGAAGCCGTGTGCCCGCGCTGCGGCTGCACTGAGACGTACAACATCAC
CTCGCGCCGCAAGTTCAAGTGCGTCGCCTGCTATCACCAGTTCAGCGTGACCAGCGGCACGATTTTTGCCTCGCGCAAAATGAGCTTCACCGATCTGCTC
GCCGCCATCGTGATCTTTGTGAACGGCGCGAAGGGCGTTGCCGCGCTTCACCTAAGCCGCGATTTGGATTGCCAGTATAAGACGGCCTTTGTCCTTACGC
ACAAGCTGCGCGAGGCGATGGCGCGCGAGGATGCCACGCAGACCCTGCAGGGCGAGATCGAAGTCGATGGCGCTTACTTTGGCGGCTACGTGAAACCTGC
CAACGAGAAGGAAAACCGCCGCGACCGTCGCAAGGTCGTCAACCAGAATGGCAAGCGCCAGGTTGTCGTGGTTGGCCGCGAGCGCGATGGGGAATCATTC
ACCGTGGTTGCCTCGACGGAAGCCAAAGGTGGCGAAGTGGTCGCCGCTCGCATCCATCATATGTCGACTGTCCACGCCGACGAAGCTTCGCACTGGGATG
GGTTGCATGCTAAGTTTGATACGCGCCGCATCAACCACACGGTCGAGTATTCGAATGGTCATGCTTGCACCAATCAGGCCGAATCGTTCTTCTCACGCCT
TCGCCGTATGGAAATCGGTACACACCACCACATAGCAGGGCCGTACCTCGCCAACTACGCTGCTGAGGCTTCTTGGCGCGAGGACAATCGCCGCATTGCC
AACGGCGCTCAAACTGCGATGGTTGGTGTTGCTGCTCTGGATGCGCCTGTCAGCCGCCAGTGGAAGGGGTACTGGCAGCGGTAGTTAGCCGAGGTTCAAT
ATAAACTGACCCTCAGCCCTAAATTCAAAATCAAAATTCTGTAGTACGTCCATGCCGAGAATCACATCAAACGAGTGATTGTCTGCAATATTGATTGCCT
CGACAGGGAACTCAACTCCAAAAAAGGAACGAATTGGTTCGCCGCCATCCAGCCTATCCGTCTCAAAAATAAAACCGACATTTATCCAATAAAGTAGGTG
CGCCTGAACACTATGAACATTCTGGATTGGTCGTTTTGCATGGCTGGCTAGACCCTCACGGGCTATTGTCGCGTGGGAGAGGCAAGTGCGTTGAGCACCT
GTGTCAATAAGTGCGCGATATGCGTTTACGGGGAATGGATTGGCGCTGTTCTGCGATGGCACTGAGACCGGAGTCGGAACAAACTTCTGCAATCCGACAG
GGACGATTGCCTGCCTATTCTCAAGCCTGCAACGCACTGCCCGCATAGGAATAGAACCCCAGGTTATCGACCTGCTCGGTAACTTCCTGCACGGAAAATC
GTTGCTCTCCGAATTGGGTCGTACCGGCGACAAACGCAGCGATAGAATCGTCGTGCAATTCAATCAACCGCTCATCGTGCAGCAGTGCATACCTACCGGG
CGCACGCTCAAGTAGGTCAGGCAGCATATGCTGGAATGCCTCGAAATTCCGATCAACCTCTGTGTCGATAGAATGCTGCATAAAAGCTCTCCCCGCTCTA
CAGAAAAAACTCTTATCACACGCTTCCGATTCGTGTCACGCTGCGATCTTGACAAAACTGAGAGAATGAGTCCCCGGATTCTGTTTGGAATCCGATCGAC
TCGATTCTTGAAATATTATTTCAACGGCGCGCAAGCGACCATTTAGCCGGGTACTGACTGGCATCCAGAATCACCGCGCCATCCCTTCTCCGGCTGAAGG
TGCTGTGAATGGTATTGGCGAGCATCCCACGAGATTCTGGTGTGTCTTCCCTTCCCTGCTTGGCCAGTACGCGAATGGCAATTTCCATTGATGTGAGCGG
CTCATTGACCTCGCGCAGAATAGTGAGCGCTTCACGGCTGCCTGAGCCACGAGGTGTGCCAGCGGGATTCTTCGAAAAGCTGCGTTTGGTCGCAATCTTT
GAAATGTCGTAGCTTGGGCTGAACATTTTGATGACGGCATCAATATGCGCCAACTCGCACTCAAGACGCATGATCTGAAACCGCCGGGCTTGAATTTCCC
CGTCGATGCGAGCGCGCTTGTCAGTCAAAGCGCTGATTGCGTATGTGTCTGCCATATGCCTACATACGCATCGCAGGCAATTACATCTAGCATTGCGTTG
GTGCGTTTGCTACATAATGCCG
ATTACCGCAATTCTTTTAGCGCTAAGCTGTATCGAAGGAGCTGGTGTGATCACCCCCACCAGTGACCCCATTCATTACTGCCTCAATGCCTCTCCACGCA
TCTGGATCCGTGATAATTTCCAAAGGAAAGCAGTGCACAGTTTTTCGCCTATCCTGCGAAAAAACATCATTATAGAAGATGGAAAAATAAACAAACAATC
TAGTTTCTCTGTGGAACGCTTTTCTCACGGATTCCATCGGGATGCAGCAATATTGAGTTTCGATTTGGGTTTGCGGAAAAAGAGGAGCACCGAACCCCAA
GTCCGCATTTTCTATCATTGGGGGCTTGCCGCTATCCGAGAAAGCCGAGAGTACAGAACATTGAACAGCCGGAGTATTTCCGACATTTCTGAATTGAGCA
GAAACCACCAAGTAGTTCCCAAAAACACCTTCAACTATGCTGTCGGTTGCAAGGCCCTTGTTCAGGCTGTGAAGCGATACGTAAGCTCGGTTTTGCGCCG
AGTGTTGCTCACTGGCGATTTGAATCGCTTGGGTAGCCAAGTCGCCAGCCCGTTTGGCCTCATTCGCGCTCGTTTCAGTATGTCTGGCAGCCTCGCGGGC
CCAGTGAGCCGCGCGCCAAGCAGCCGCGAGGGTGAAGCTCCCTATAGCCAACCCAATAATGCCAGCCCAAAACGTCCGAAAAGCCCACAATGCAGAATCA
GCAGCGGCATCTGCGGCCTTCCATTGGGCGCAAAGATCGCTTTGACGTTCATCCTGTCCGGGCTGGCAACCACCATCTGGTCGCGCAGCTTTTGGGAGTT
CTTCGACCGCCGCAGCAATCCGCTCAAGCTGGCTGGTGGCGTTCTGTTGGGATGGGGTCTGATCGCCTTGAGTTTCTTTGCATTCGCATGGACTGGATAG
GATCAGTCCAAGTGCAAGGATGAGCAGCCGATAGCTTGAAGACATTGGACCAAGAACACCGCTGTGAATTTGCCACGGCTGATCTTGTTCCTGATATTCG
GCTCAGAGTCCATCACGCCAACGTCTGCCAGCTTACCGACAAGCTGGGCGTAGGTAATCCCCTTGCGGGCAAGCTCAGCCTTGAGCAGCCGCTTCACCTT
GTCTTCCCATTCCTTTTCCTGAACCGGCATGTGCGCTCCTATGATGGTCAATGTCACCATATAGAGTGCATTTGCCATTGACAAGCGTGCATTACGTCAT
CATATAAAGTGCATAAGCACTGGAAACGATGACATGCAACACTTTCTCCTCTCTTCCGCAGCCCGCACACTGAGCCTCAAGGCCGTGTTCCGCATGGGCG
AGGACAAGGCGTATCGGACCTTCTGCGAAATGCGCTGGCCTGAGACGGACGGCGAAGCCGTGTGCCCGCGCTGCGGCTGCACTGAGACGTACAACATCAC
CTCGCGCCGCAAGTTCAAGTGCGTCGCCTGCTATCACCAGTTCAGCGTGACCAGCGGCACGATTTTTGCCTCGCGCAAAATGAGCTTCACCGATCTGCTC
GCCGCCATCGTGATCTTTGTGAACGGCGCGAAGGGCGTTGCCGCGCTTCACCTAAGCCGCGATTTGGATTGCCAGTATAAGACGGCCTTTGTCCTTACGC
ACAAGCTGCGCGAGGCGATGGCGCGCGAGGATGCCACGCAGACCCTGCAGGGCGAGATCGAAGTCGATGGCGCTTACTTTGGCGGCTACGTGAAACCTGC
CAACGAGAAGGAAAACCGCCGCGACCGTCGCAAGGTCGTCAACCAGAATGGCAAGCGCCAGGTTGTCGTGGTTGGCCGCGAGCGCGATGGGGAATCATTC
ACCGTGGTTGCCTCGACGGAAGCCAAAGGTGGCGAAGTGGTCGCCGCTCGCATCCATCATATGTCGACTGTCCACGCCGACGAAGCTTCGCACTGGGATG
GGTTGCATGCTAAGTTTGATACGCGCCGCATCAACCACACGGTCGAGTATTCGAATGGTCATGCTTGCACCAATCAGGCCGAATCGTTCTTCTCACGCCT
TCGCCGTATGGAAATCGGTACACACCACCACATAGCAGGGCCGTACCTCGCCAACTACGCTGCTGAGGCTTCTTGGCGCGAGGACAATCGCCGCATTGCC
AACGGCGCTCAAACTGCGATGGTTGGTGTTGCTGCTCTGGATGCGCCTGTCAGCCGCCAGTGGAAGGGGTACTGGCAGCGGTAGTTAGCCGAGGTTCAAT
ATAAACTGACCCTCAGCCCTAAATTCAAAATCAAAATTCTGTAGTACGTCCATGCCGAGAATCACATCAAACGAGTGATTGTCTGCAATATTGATTGCCT
CGACAGGGAACTCAACTCCAAAAAAGGAACGAATTGGTTCGCCGCCATCCAGCCTATCCGTCTCAAAAATAAAACCGACATTTATCCAATAAAGTAGGTG
CGCCTGAACACTATGAACATTCTGGATTGGTCGTTTTGCATGGCTGGCTAGACCCTCACGGGCTATTGTCGCGTGGGAGAGGCAAGTGCGTTGAGCACCT
GTGTCAATAAGTGCGCGATATGCGTTTACGGGGAATGGATTGGCGCTGTTCTGCGATGGCACTGAGACCGGAGTCGGAACAAACTTCTGCAATCCGACAG
GGACGATTGCCTGCCTATTCTCAAGCCTGCAACGCACTGCCCGCATAGGAATAGAACCCCAGGTTATCGACCTGCTCGGTAACTTCCTGCACGGAAAATC
GTTGCTCTCCGAATTGGGTCGTACCGGCGACAAACGCAGCGATAGAATCGTCGTGCAATTCAATCAACCGCTCATCGTGCAGCAGTGCATACCTACCGGG
CGCACGCTCAAGTAGGTCAGGCAGCATATGCTGGAATGCCTCGAAATTCCGATCAACCTCTGTGTCGATAGAATGCTGCATAAAAGCTCTCCCCGCTCTA
CAGAAAAAACTCTTATCACACGCTTCCGATTCGTGTCACGCTGCGATCTTGACAAAACTGAGAGAATGAGTCCCCGGATTCTGTTTGGAATCCGATCGAC
TCGATTCTTGAAATATTATTTCAACGGCGCGCAAGCGACCATTTAGCCGGGTACTGACTGGCATCCAGAATCACCGCGCCATCCCTTCTCCGGCTGAAGG
TGCTGTGAATGGTATTGGCGAGCATCCCACGAGATTCTGGTGTGTCTTCCCTTCCCTGCTTGGCCAGTACGCGAATGGCAATTTCCATTGATGTGAGCGG
CTCATTGACCTCGCGCAGAATAGTGAGCGCTTCACGGCTGCCTGAGCCACGAGGTGTGCCAGCGGGATTCTTCGAAAAGCTGCGTTTGGTCGCAATCTTT
GAAATGTCGTAGCTTGGGCTGAACATTTTGATGACGGCATCAATATGCGCCAACTCGCACTCAAGACGCATGATCTGAAACCGCCGGGCTTGAATTTCCC
CGTCGATGCGAGCGCGCTTGTCAGTCAAAGCGCTGATTGCGTATGTGTCTGCCATATGCCTACATACGCATCGCAGGCAATTACATCTAGCATTGCGTTG
GTGCGTTTGCTACATAATGCCG
Protein section
ORF number : 4
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
228 bp | 75 aa | 1230 | 1003 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MPVQEKEWEDKVKRLLKAELARKGITYAQLVGKLADVGVMDSEPNIRNKISRGKFTAVFLVQCLQAIGCSSLHLD
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
951 bp | 316 aa | 1334 | 2284 | + | No |
Chemistry : DDE
ORF sequence :
MQHFLLSSAARTLSLKAVFRMGEDKAYRTFCEMRWPETDGEAVCPRCGCTETYNITSRRKFKCVACYHQFSVTSGTIFASRKMSFTDLLAAIVIFVNGAK
GVAALHLSRDLDCQYKTAFVLTHKLREAMAREDATQTLQGEIEVDGAYFGGYVKPANEKENRRDRRKVVNQNGKRQVVVVGRERDGESFTVVASTEAKGG
EVVAARIHHMSTVHADEASHWDGLHAKFDTRRINHTVEYSNGHACTNQAESFFSRLRRMEIGTHHHIAGPYLANYAAEASWREDNRRIANGAQTAMVGVA
ALDAPVSRQWKGYWQR
GVAALHLSRDLDCQYKTAFVLTHKLREAMAREDATQTLQGEIEVDGAYFGGYVKPANEKENRRDRRKVVNQNGKRQVVVVGRERDGESFTVVASTEAKGG
EVVAARIHHMSTVHADEASHWDGLHAKFDTRRINHTVEYSNGHACTNQAESFFSRLRRMEIGTHHHIAGPYLANYAAEASWREDNRRIANGAQTAMVGVA
ALDAPVSRQWKGYWQR
Blast result :ORF 3
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
261 bp | 86 aa | 2981 | 2721 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MQHSIDTEVDRNFEAFQHMLPDLLERAPGRYALLHDERLIELHDDSIAAFVAGTTQFGEQRFSVQEVTEQVDNLGFYSYAGSALQA
Blast result :ORF 4
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
435 bp | 144 aa | 3555 | 3121 | - | No |
Annotation : Hypothetical proteinDescription :
ORF sequence :
MADTYAISALTDKRARIDGEIQARRFQIMRLECELAHIDAVIKMFSPSYDISKIATKRSFSKNPAGTPRGSGSREALTILREVNEPLTSMEIAIRVLAKQ
GREDTPESRGMLANTIHSTFSRRRDGAVILDASQYPAKWSLARR
GREDTPESRGMLANTIHSTFSRRRDGAVILDASQYPAKWSLARR
Blast result :
Comments
ISNov3 is 76% aa similar to ISBos1.
References
1] ISfinder annotation (2017)
2] Kojadinovic,M., Villain,A., Puppo,C., Fon Sing,S., Prioretti,L., Hubert,P., Gregori,G., Zhang,Y., Sassi,J.-F., Claverie,J.-M., Blanc,G. and Gontero,B. (2017) Direct GenBank submission.
2] Kojadinovic,M., Villain,A., Puppo,C., Fon Sing,S., Prioretti,L., Hubert,P., Gregori,G., Zhang,Y., Sassi,J.-F., Claverie,J.-M., Blanc,G. and Gontero,B. (2017) Direct GenBank submission.