ISPaen4
- Family IS1595
- Group ISPna2
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
FNSW01000001 | ND | Paenibacillus sp. | Paenibacillus sp. GP183 |
DNA section
IS Length : 1936 bp
Ends
IR Length : 24/31
IRL : GGCTATGTTAATTCTTAATGTTGATATTTGATATATACGAACAAATGATC
IRR : GGCTCTGTTTTCGTTTAATGTTGATTTTTGACTGAAAGAAAACCGCCAAC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CTCCTGCCGCT | TTTCTTTA | AAAAATGATA | 8 |
DNA sequence
GGCTATGTTAATTCTTAATGTTGATATTTGATATATACGAACAAATGATCTATAATTGTGGGTATAATCATAGTTCGGAGATGATTTACTTATGGATTTA
AAGGAATTGAAGAAGCTCGCTGACGCTTTAGACGATAGTTGTAAGCAAGAACTTATATACTTTTTGCAATCTCGAACGAAAAGCTTAGCGGCTCCAAATC
GTGCAATTGATGAAATCCAGGAACAAAAGCATAAAGATGGGCTTGTTTGTCCGCATTGCAACAATCATTCAGTTGTGCGGTTTGGAAAATACGTTATTAA
GACACGTACTGGGGAGGTCAAACGGCAACGTTACCGCTGTAAATCCTGCCGTCAAACCTTCAACGACCTCACCAACACCCCTCTTCAACGCACCAGAAGA
CCTCACCTTTGGGTTCGATTCATCGAATGTATGATTGAGGGTTTTTCACTGAGGAAATGTGCCGAACTGTTGCATGATGAGGTAACTCATGTAACCCTGT
TTTACTGGAGGCATAAGATTCTTGCCGCTCTGAAACAAATTCCAACCGAAACATTCCAGGGCATCGTCGAAATGGATGAGACCTATTTCTTGTTCTCTGA
AAAAGGCAAACGGACTATCGCTGACCGCAAGTCCCGTAAACGTGGTGGCAAAGCTAAATATCGTGGCATCAGTAACGACCAAATATGTGTATTGGTGGCT
CGTGACCGTCAAAAGATGACTTTCTCTGGCGTTCTTGGACGTGGACGTATTCGGAAAGCTAAATTGGATGAGGCTATCGGCTGTCATTTAACCGAATCAA
ACGTACTTTGTACCGATTCATGGAGGGCATTCAGTTCCTATGCGAATACGAAAGGATTAGCCCATTACCGCTTCAAGTCGGATGGGAAACAACGTGTTAA
AGGAGTTTACCACATTCAGAATGTAAACAACTATCATAGTCGGTTGAAACGCTGGATTTACCGCTTTAATGGTGTAGCAACCAAGTATTTACAACATTAT
CTGGCTTGGTTCCGTTACTTAACAGCAAGGAATATGAGAACACAACGTCGAATAAGAAAAATATGCTGGTCACCTCTTGCCTGTTTACTGTACATGAGAC
AAACACCAATCTTCGCAGAACATCTTTTTCCTCCTAAACTAACTGGCAGATAGATCAATGTTAATATCTGCGGTACAAATGTGTACATATAGATACGCCT
GATTAAACCGCTATAAGATGCCAAAGGTCATAATGGCTTTAGAATAAATGATTTCAATGATTTTAAGGGGGGCTCAACCCTTCTGTTGTCTTTAAATAGA
ATAAGGAGTTGTTACGTTAGATGAAACAGATTACTCTTCGCCAATCCCAAAATTCCGATGTTGAGACAATTGCTAATTTACGGGCAATTGTACTACGTAA
TGATTTAACTAGGTTAGGAAGGTTTGATGAAGAGAAAGTTCGGCAACGCTTCCGTAATGCATTTGACTCAGTTCATACTTGGATCATCGAGGCAGATTCT
TCTTTTGTTGGCTGCATAGCTTTTAAACCGACATTAGATGGTTATTTATTGGAACATTTTTATATTCATCCCAATTACCAAGGTAAAGGGGTCGGCAGTC
AAGTATTAAAAAATCTGCTTGAACAAAATTATGTAAAAGGAAAACGTGTAACATTAAATGTCCTACAAGGAAGCTCTGCTAGACGTCTTTATGAACGGTT
TGGTTTTAAAGTTGAAAGTGAGGATCTTATAGACGTTTACATGTCTGTGATTGTAGAGGAAAATTCCAGAACCGTCGAAGGATTGTAATACTGCTCTCTC
TAAAGTGGAATATTCGCTAACTTGAATCACAATAAAGGTGTTGGTTATTACGATGGAAAACGATAGTTAGATTAACCACCTTTTCAGTTGGCGGTTTTCT
TTCAGTCAAAAATCAACATTAAACGAAAACAGAGCC
AAGGAATTGAAGAAGCTCGCTGACGCTTTAGACGATAGTTGTAAGCAAGAACTTATATACTTTTTGCAATCTCGAACGAAAAGCTTAGCGGCTCCAAATC
GTGCAATTGATGAAATCCAGGAACAAAAGCATAAAGATGGGCTTGTTTGTCCGCATTGCAACAATCATTCAGTTGTGCGGTTTGGAAAATACGTTATTAA
GACACGTACTGGGGAGGTCAAACGGCAACGTTACCGCTGTAAATCCTGCCGTCAAACCTTCAACGACCTCACCAACACCCCTCTTCAACGCACCAGAAGA
CCTCACCTTTGGGTTCGATTCATCGAATGTATGATTGAGGGTTTTTCACTGAGGAAATGTGCCGAACTGTTGCATGATGAGGTAACTCATGTAACCCTGT
TTTACTGGAGGCATAAGATTCTTGCCGCTCTGAAACAAATTCCAACCGAAACATTCCAGGGCATCGTCGAAATGGATGAGACCTATTTCTTGTTCTCTGA
AAAAGGCAAACGGACTATCGCTGACCGCAAGTCCCGTAAACGTGGTGGCAAAGCTAAATATCGTGGCATCAGTAACGACCAAATATGTGTATTGGTGGCT
CGTGACCGTCAAAAGATGACTTTCTCTGGCGTTCTTGGACGTGGACGTATTCGGAAAGCTAAATTGGATGAGGCTATCGGCTGTCATTTAACCGAATCAA
ACGTACTTTGTACCGATTCATGGAGGGCATTCAGTTCCTATGCGAATACGAAAGGATTAGCCCATTACCGCTTCAAGTCGGATGGGAAACAACGTGTTAA
AGGAGTTTACCACATTCAGAATGTAAACAACTATCATAGTCGGTTGAAACGCTGGATTTACCGCTTTAATGGTGTAGCAACCAAGTATTTACAACATTAT
CTGGCTTGGTTCCGTTACTTAACAGCAAGGAATATGAGAACACAACGTCGAATAAGAAAAATATGCTGGTCACCTCTTGCCTGTTTACTGTACATGAGAC
AAACACCAATCTTCGCAGAACATCTTTTTCCTCCTAAACTAACTGGCAGATAGATCAATGTTAATATCTGCGGTACAAATGTGTACATATAGATACGCCT
GATTAAACCGCTATAAGATGCCAAAGGTCATAATGGCTTTAGAATAAATGATTTCAATGATTTTAAGGGGGGCTCAACCCTTCTGTTGTCTTTAAATAGA
ATAAGGAGTTGTTACGTTAGATGAAACAGATTACTCTTCGCCAATCCCAAAATTCCGATGTTGAGACAATTGCTAATTTACGGGCAATTGTACTACGTAA
TGATTTAACTAGGTTAGGAAGGTTTGATGAAGAGAAAGTTCGGCAACGCTTCCGTAATGCATTTGACTCAGTTCATACTTGGATCATCGAGGCAGATTCT
TCTTTTGTTGGCTGCATAGCTTTTAAACCGACATTAGATGGTTATTTATTGGAACATTTTTATATTCATCCCAATTACCAAGGTAAAGGGGTCGGCAGTC
AAGTATTAAAAAATCTGCTTGAACAAAATTATGTAAAAGGAAAACGTGTAACATTAAATGTCCTACAAGGAAGCTCTGCTAGACGTCTTTATGAACGGTT
TGGTTTTAAAGTTGAAAGTGAGGATCTTATAGACGTTTACATGTCTGTGATTGTAGAGGAAAATTCCAGAACCGTCGAAGGATTGTAATACTGCTCTCTC
TAAAGTGGAATATTCGCTAACTTGAATCACAATAAAGGTGTTGGTTATTACGATGGAAAACGATAGTTAGATTAACCACCTTTTCAGTTGGCGGTTTTCT
TTCAGTCAAAAATCAACATTAAACGAAAACAGAGCC
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1062 bp | 353 aa | 92 | 1153 | + | No |
Chemistry : DDE
ORF sequence :
MDLKELKKLADALDDSCKQELIYFLQSRTKSLAAPNRAIDEIQEQKHKDGLVCPHCNNHSVVRFGKYVIKTRTGEVKRQRYRCKSCRQTFNDLTNTPLQR
TRRPHLWVRFIECMIEGFSLRKCAELLHDEVTHVTLFYWRHKILAALKQIPTETFQGIVEMDETYFLFSEKGKRTIADRKSRKRGGKAKYRGISNDQICV
LVARDRQKMTFSGVLGRGRIRKAKLDEAIGCHLTESNVLCTDSWRAFSSYANTKGLAHYRFKSDGKQRVKGVYHIQNVNNYHSRLKRWIYRFNGVATKYL
QHYLAWFRYLTARNMRTQRRIRKICWSPLACLLYMRQTPIFAEHLFPPKLTGR
TRRPHLWVRFIECMIEGFSLRKCAELLHDEVTHVTLFYWRHKILAALKQIPTETFQGIVEMDETYFLFSEKGKRTIADRKSRKRGGKAKYRGISNDQICV
LVARDRQKMTFSGVLGRGRIRKAKLDEAIGCHLTESNVLCTDSWRAFSSYANTKGLAHYRFKSDGKQRVKGVYHIQNVNNYHSRLKRWIYRFNGVATKYL
QHYLAWFRYLTARNMRTQRRIRKICWSPLACLLYMRQTPIFAEHLFPPKLTGR
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
468 bp | 155 aa | 1321 | 1788 | + | No |
Annotation : N-acetylglutamate synthase, GNAT familyDescription :
ORF sequence :
MKQITLRQSQNSDVETIANLRAIVLRNDLTRLGRFDEEKVRQRFRNAFDSVHTWIIEADSSFVGCIAFKPTLDGYLLEHFYIHPNYQGKGVGSQVLKNLL
EQNYVKGKRVTLNVLQGSSARRLYERFGFKVESEDLIDVYMSVIVEENSRTVEGL
EQNYVKGKRVTLNVLQGSSARRLYERFGFKVESEDLIDVYMSVIVEENSRTVEGL
Blast result :
Comments
ISPaen4 is 77% aa similar to ISLca5.
References
1] ISfinder annotation (2017)
2] Varghese,N. and Submissions,S. (2016) Direct GenBank submission.
2] Varghese,N. and Submissions,S. (2016) Direct GenBank submission.