ISGdi17
- Family IS21
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
ND | Gluconacetobacter diazotrophicus | Gluconacetobacter diazotrophicus |
DNA section
IS Length : 2482 bp
Ends
IR Length : 9/10
IRL : TGTTGATTTCCAGCGAGAACTGACCCTGTAGGGGCGGAAATTTTCATTGA
IRR : TGTTAATTTCTGCTGAGATTTGACCCTGGGTTTTCATCGAGAACTGACCC
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
GGTCAGGCCGAAGGCGTGGC | TGCTCGCCATAGAAGGCTTC | 0 | |
CGGTCACCAGGGTCCCGATC | GTTCAGGGGCGCGTGGCCTC | 0 | |
TCCAGCGTGACATTGTTGGC | CCCAGACGGCGATAGCCAAA | 0 |
DNA sequence
TGTTGATTTCCAGCGAGAACTGACCCTGTAGGGGCGGAAATTTTCATTGAGAATTGACCCGTGTCTGACACTTCCCCGGAGCAACTGTCCGGGGGTTAAA
GGAGTGATCAGCATGGAATTGTTGAGTGTGATCCGGCGGTGGCATTACCGGGATCATGTACCGATCCGCGAGATTGAGCGTCGGACGCGATTGTCGCGCA
ACACGATCCGCAAGTATCTCCGGGCGGAGACGGTTGAGCCGCAGTTCAAGGTTGCCGAGCGTCCGAGCCGGCTGGACCCGTTTGCGGAGAAACTGGCGAC
CTGGCTTGCTCTGGAGACGTCAAAGTCGCGCAAGCAGCGCCGCACGGGGCGGCGCCTGCATGTGGATCTTGTAGCGCTGGGTTATGACGGATCGTATGGA
CGGGTTGCGGCCTTTATCCGGAACTGGAAGGCGGAACAGCAAAGGGCGCGGCAGACGACGGGACGCGGCGTGTTCGTTCCGTTGCGCTTTCAACCCGGAG
AGGCCTTCCAGTTCGACTGGGGAGAGGACTGGGCGGTGATCGGGGGGCGGCGCGTCAAGCTGCAGGTCGCCCACACCAAGCTGTCCTACAGCCGGGCCTT
CATTCTGCGGGCGTATCCGCTCCAGACTCACGAGATGCTGTTCGACGCGCTGACGGAAGCCTTTCGCGTTTTGGGCGGGGTGCCGCGTCGAGGTATTTTT
GACAATATGAAAACCGCCGTGGACCGGGTTGGGCCCGGCAAGGTCCGCCAGGTCAATCTGCGTTTTTCGGCCCTGGTGAGCCATTATCTGTTCGAGGCGG
AGTTCTGCAATCCGGCAGCGGGTTGGGAGAAAGGTCAAATCGAGAAGACCGTCCAGGACGCCCGGCGGCAGATCTGGCAGGAGATGCCGCATTTTCCTGA
TCTGGCCTCCCTGAACGTCTGGCTCGAGGCGCGTTGCCGGGAACGTTGGACTATCTTGAGGCATGTCGAATTGCCCGGCAGCCTCGCCGAGGCCCATGCG
GCGGAAGTGCCTCACCTGATGGTTCCAGGGCGCCCGTTCGACGGGTTCGTCGAACATACCAAACGGGTTTCGCCGACTTGCCTCGTGCAGTTCGAAAGCA
ACCGCTACAGTGTGCCCGCCTCTTTTGCCAATCGGCCGGTCAGCCTGCGCGTCTATCCCGACCGGTTGGTGATCGCGGCCGAGGGGCGGATCCTATGCGA
ACATCCCCGGATCGTCGAGCGGTCCCACGGCGTGCCCGGTCGCACGATCTATGACTGGCGGCATTACTTGGCGGTGCTCCAGCGCAAACCCGGGGCCTTG
CGCAATGGCGCGCCCTTCTCTGAATTGCCCGAGGCGTTCCGGACGCTGCAGACGCACCTCCTCCGGCGCACGGGCGGCGACCGGGAAATGGTCGAGATCC
TCGCCCTGGTGCTGCAGCATGATGAGCAGGCCGTGCTTTGCGCGGTTGAACTCGCGCTCGAGGAGGGAGTAGCCACCAAGACACACGTCCTCAATACGTT
GCATCGTCTGACGGACGCCAAGAAAACAGGAGCACCCAGGCTCGACGCGCCGCAGGCATTGGTGCTCGAACGCGAGCCTCAGGCCGATACCGGACGGTAT
GACGCCCTGCGCGGGGAGGCCCGTCATGCGTCATGATCCCGCGGCCGGTGCCCTCGTCGTCATGCTGCGCGGCCTGCGGATGTATGGCATGGCCCAGGCC
ACGGCCGAACTGACCGAACAGGGTGCGCCGGCATTCGAGGCCGCCATCCCCGTCCTCTCCCAGCTTTTGAAGGCGGAACTCGCCGAGCGAGAGGTGCGCT
CCATCGCCTATCAAACCAAGACTGCCAGGTTCCCGGCCTACAAAGATTTGGCAGGGTTCGATTTCTCGGCCGCCGAGGTCAACGAGGCCATGGTCCGTCA
ACTCCATGCCGGGGATTTCATCGACCGTGCCGACAACGTCGTCCTCATTGGTGGCCCAGGAACCGGCAAGACCCATCTGGCCACCGCACTTGCCGTGCAG
GCGATCGAACATCACCGCAAGAAGATACGGTTCTGGTCCACGGTCGACCTCGTCAACGCCCTCGAACAGGAAAAAACCGCCAATCGCGCAGGACAGATCG
CGGAACGTCTCCTGCGCCTCGATCTCGTGATCCTGGACGAACTTGGCTATTTGCCGTTCAGCGCATCAGGCGGTGCCCTGCTGTTCCATCTCCTCAGCCG
TCTCTACGAGCGCACCAGCGTCATCAATCTGAGCTTCAGCGAATGGGGCGAAGTCTTCGGTGATCCCAAAATGACGACAGCCCTGCTCGATCGCCTTACC
CACCACTGTCATATCCTCGAAACCGGAAATGACAGCTACCGGTTCCGCGCAAGCTCCGCCGCCCCCAGGAACCGGAAGGAAAAGGCAACCGCTTGACCAG
CCCATAAAACATAGCAGAGATAACCAAAGGCCGGGTCAGTTCTCGATGAAAACCCAGGGTCAAATCTCAGCAGAAATTAACA
GGAGTGATCAGCATGGAATTGTTGAGTGTGATCCGGCGGTGGCATTACCGGGATCATGTACCGATCCGCGAGATTGAGCGTCGGACGCGATTGTCGCGCA
ACACGATCCGCAAGTATCTCCGGGCGGAGACGGTTGAGCCGCAGTTCAAGGTTGCCGAGCGTCCGAGCCGGCTGGACCCGTTTGCGGAGAAACTGGCGAC
CTGGCTTGCTCTGGAGACGTCAAAGTCGCGCAAGCAGCGCCGCACGGGGCGGCGCCTGCATGTGGATCTTGTAGCGCTGGGTTATGACGGATCGTATGGA
CGGGTTGCGGCCTTTATCCGGAACTGGAAGGCGGAACAGCAAAGGGCGCGGCAGACGACGGGACGCGGCGTGTTCGTTCCGTTGCGCTTTCAACCCGGAG
AGGCCTTCCAGTTCGACTGGGGAGAGGACTGGGCGGTGATCGGGGGGCGGCGCGTCAAGCTGCAGGTCGCCCACACCAAGCTGTCCTACAGCCGGGCCTT
CATTCTGCGGGCGTATCCGCTCCAGACTCACGAGATGCTGTTCGACGCGCTGACGGAAGCCTTTCGCGTTTTGGGCGGGGTGCCGCGTCGAGGTATTTTT
GACAATATGAAAACCGCCGTGGACCGGGTTGGGCCCGGCAAGGTCCGCCAGGTCAATCTGCGTTTTTCGGCCCTGGTGAGCCATTATCTGTTCGAGGCGG
AGTTCTGCAATCCGGCAGCGGGTTGGGAGAAAGGTCAAATCGAGAAGACCGTCCAGGACGCCCGGCGGCAGATCTGGCAGGAGATGCCGCATTTTCCTGA
TCTGGCCTCCCTGAACGTCTGGCTCGAGGCGCGTTGCCGGGAACGTTGGACTATCTTGAGGCATGTCGAATTGCCCGGCAGCCTCGCCGAGGCCCATGCG
GCGGAAGTGCCTCACCTGATGGTTCCAGGGCGCCCGTTCGACGGGTTCGTCGAACATACCAAACGGGTTTCGCCGACTTGCCTCGTGCAGTTCGAAAGCA
ACCGCTACAGTGTGCCCGCCTCTTTTGCCAATCGGCCGGTCAGCCTGCGCGTCTATCCCGACCGGTTGGTGATCGCGGCCGAGGGGCGGATCCTATGCGA
ACATCCCCGGATCGTCGAGCGGTCCCACGGCGTGCCCGGTCGCACGATCTATGACTGGCGGCATTACTTGGCGGTGCTCCAGCGCAAACCCGGGGCCTTG
CGCAATGGCGCGCCCTTCTCTGAATTGCCCGAGGCGTTCCGGACGCTGCAGACGCACCTCCTCCGGCGCACGGGCGGCGACCGGGAAATGGTCGAGATCC
TCGCCCTGGTGCTGCAGCATGATGAGCAGGCCGTGCTTTGCGCGGTTGAACTCGCGCTCGAGGAGGGAGTAGCCACCAAGACACACGTCCTCAATACGTT
GCATCGTCTGACGGACGCCAAGAAAACAGGAGCACCCAGGCTCGACGCGCCGCAGGCATTGGTGCTCGAACGCGAGCCTCAGGCCGATACCGGACGGTAT
GACGCCCTGCGCGGGGAGGCCCGTCATGCGTCATGATCCCGCGGCCGGTGCCCTCGTCGTCATGCTGCGCGGCCTGCGGATGTATGGCATGGCCCAGGCC
ACGGCCGAACTGACCGAACAGGGTGCGCCGGCATTCGAGGCCGCCATCCCCGTCCTCTCCCAGCTTTTGAAGGCGGAACTCGCCGAGCGAGAGGTGCGCT
CCATCGCCTATCAAACCAAGACTGCCAGGTTCCCGGCCTACAAAGATTTGGCAGGGTTCGATTTCTCGGCCGCCGAGGTCAACGAGGCCATGGTCCGTCA
ACTCCATGCCGGGGATTTCATCGACCGTGCCGACAACGTCGTCCTCATTGGTGGCCCAGGAACCGGCAAGACCCATCTGGCCACCGCACTTGCCGTGCAG
GCGATCGAACATCACCGCAAGAAGATACGGTTCTGGTCCACGGTCGACCTCGTCAACGCCCTCGAACAGGAAAAAACCGCCAATCGCGCAGGACAGATCG
CGGAACGTCTCCTGCGCCTCGATCTCGTGATCCTGGACGAACTTGGCTATTTGCCGTTCAGCGCATCAGGCGGTGCCCTGCTGTTCCATCTCCTCAGCCG
TCTCTACGAGCGCACCAGCGTCATCAATCTGAGCTTCAGCGAATGGGGCGAAGTCTTCGGTGATCCCAAAATGACGACAGCCCTGCTCGATCGCCTTACC
CACCACTGTCATATCCTCGAAACCGGAAATGACAGCTACCGGTTCCGCGCAAGCTCCGCCGCCCCCAGGAACCGGAAGGAAAAGGCAACCGCTTGACCAG
CCCATAAAACATAGCAGAGATAACCAAAGGCCGGGTCAGTTCTCGATGAAAACCCAGGGTCAAATCTCAGCAGAAATTAACA
Protein section
ORF number : 2
ORF 1
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
1524 bp | 507 aa | 113 | 1636 | + | No |
Chemistry : DDE
ORF sequence :
MELLSVIRRWHYRDHVPIREIERRTRLSRNTIRKYLRAETVEPQFKVAERPSRLDPFAEKLATWLALETSKSRKQRRTGRRLHVDLVALGYDGSYGRVAA
FIRNWKAEQQRARQTTGRGVFVPLRFQPGEAFQFDWGEDWAVIGGRRVKLQVAHTKLSYSRAFILRAYPLQTHEMLFDALTEAFRVLGGVPRRGIFDNMK
TAVDRVGPGKVRQVNLRFSALVSHYLFEAEFCNPAAGWEKGQIEKTVQDARRQIWQEMPHFPDLASLNVWLEARCRERWTILRHVELPGSLAEAHAAEVP
HLMVPGRPFDGFVEHTKRVSPTCLVQFESNRYSVPASFANRPVSLRVYPDRLVIAAEGRILCEHPRIVERSHGVPGRTIYDWRHYLAVLQRKPGALRNGA
PFSELPEAFRTLQTHLLRRTGGDREMVEILALVLQHDEQAVLCAVELALEEGVATKTHVLNTLHRLTDAKKTGAPRLDAPQALVLEREPQADTGRYDALR
GEARHAS
FIRNWKAEQQRARQTTGRGVFVPLRFQPGEAFQFDWGEDWAVIGGRRVKLQVAHTKLSYSRAFILRAYPLQTHEMLFDALTEAFRVLGGVPRRGIFDNMK
TAVDRVGPGKVRQVNLRFSALVSHYLFEAEFCNPAAGWEKGQIEKTVQDARRQIWQEMPHFPDLASLNVWLEARCRERWTILRHVELPGSLAEAHAAEVP
HLMVPGRPFDGFVEHTKRVSPTCLVQFESNRYSVPASFANRPVSLRVYPDRLVIAAEGRILCEHPRIVERSHGVPGRTIYDWRHYLAVLQRKPGALRNGA
PFSELPEAFRTLQTHLLRRTGGDREMVEILALVLQHDEQAVLCAVELALEEGVATKTHVLNTLHRLTDAKKTGAPRLDAPQALVLEREPQADTGRYDALR
GEARHAS
Blast result :ORF 2
Length | Begin | End | Strand | Fusion ORF | |
---|---|---|---|---|---|
771 bp | 256 aa | 1626 | 2396 | + | No |
AG : IS21 helper
ORF sequence :
MRHDPAAGALVVMLRGLRMYGMAQATAELTEQGAPAFEAAIPVLSQLLKAELAEREVRSIAYQTKTARFPAYKDLAGFDFSAAEVNEAMVRQLHAGDFID
RADNVVLIGGPGTGKTHLATALAVQAIEHHRKKIRFWSTVDLVNALEQEKTANRAGQIAERLLRLDLVILDELGYLPFSASGGALLFHLLSRLYERTSVI
NLSFSEWGEVFGDPKMTTALLDRLTHHCHILETGNDSYRFRASSAAPRNRKEKATA
RADNVVLIGGPGTGKTHLATALAVQAIEHHRKKIRFWSTVDLVNALEQEKTANRAGQIAERLLRLDLVILDELGYLPFSASGGALLFHLLSRLYERTSVI
NLSFSEWGEVFGDPKMTTALLDRLTHHCHILETGNDSYRFRASSAAPRNRKEKATA
Blast result :
Comments
ISGdi17 is 84% (ORFA) aa similar to ISRsp3 and % (ORFB) to IS
References
1] Miriam Land (2008) Direct submission.