ISEc20
- Family IS110
- Group
Isoform Synonym(s)
Accession number | Transposition | Origin | Host |
---|---|---|---|
NZ_AAJU01000001 | ND | Escherichia coli | Escherichia coli |
DNA section
IS Length : 1459 bp
Ends
Left end : CCCCGTTACTTTATTCGTACCCCTTATAATGGGGTGTTAGCCAGCCAGACCCGGCATGATTACTGCCCCCAGTCGTCCATGATCCGGGGGGTGATGTCAC
Right end : AGACGACGCTGCGACGTTCTGTTCGCCATGATGCGCGACGGGACTTTTTATACCCCGCAGGCGTCATAACATGCTTGACAACTTAATAGGGGCACCCCCC
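A quick way to inspect whether the two ends resemble inverted repeats is to compare the left end with the reverse complement of the right end. The sketch below is a minimal, ungapped position-by-position comparison, not an alignment. The function names and the 30 bp `window` are illustrative assumptions and are not part of the record.

```python
def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def matching_positions(a: str, b: str) -> int:
    """Count positions at which two strings carry the same base (no gaps)."""
    return sum(x == y for x, y in zip(a, b))


# The 100 bp ends exactly as listed in the record.
left_end = (
    "CCCCGTTACTTTATTCGTACCCCTTATAATGGGGTGTTAGCCAGCCAGAC"
    "CCGGCATGATTACTGCCCCCAGTCGTCCATGATCCGGGGGGTGATGTCAC"
)
right_end = (
    "AGACGACGCTGCGACGTTCTGTTCGCCATGATGCGCGACGGGACTTTTTA"
    "TACCCCGCAGGCGTCATAACATGCTTGACAACTTAATAGGGGCACCCCCC"
)

# Orient the right end so that its outermost base comes first.
rc_right = reverse_complement(right_end)

window = 30  # hypothetical comparison window, chosen only for illustration
matches = matching_positions(left_end[:window], rc_right[:window])
print(f"{matches} of {window} outermost positions match")
```

A low match count would suggest that the ends are not simple inverted repeats; a proper assessment would need a gapped alignment.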
Insertion site
Left flank | Direct repeat | Right flank | DR Length |
---|---|---|---|
CGGAAGCATCTTTTCGCAC | | GGGATTCCGCTA | 0 |
DNA sequence
CCCCGTTACTTTATTCGTACCCCTTATAATGGGGTGTTAGCCAGCCAGACCCGGCATGATTACTGCCCCCAGTCGTCCATGATCCGGGGGGTGATGTCAC
CGGGTCTGGTGGGGCGCTGGTAACCGCTAATAGGGGTCAGGTCAGGCACTTTTGCCGGGACCGTCTGTAACGTGGATGCCGGTACCTGCTCCCCGTGGTT
ATCTGGTTAACCCATATACAAGGGAGACAGAATGACCGAATCCAGCGATTACGAATCCGTCCAGGTCTTTATCGGCGTTGATGTCGGTAAAGATACGCAT
CACGCTGTAGCCATTAATCGTTCAGGTAAACGCCTGTTCGATAAAGCATTACCCAACGACGAAAACAAACTCAGGTCGCTAATATCTGACCTGAAACAAC
ATGGTCAGATACTGCTGGTTGTTGATCAGCCAGCTACCATCGGTGCGTTACCTGTCGCCGTTGCCCGCTCAGAAGGAGTCCTTGTCGGATACCTCCCTGG
ACTGGCCATGCGCCGCATAGCCGACTTACACGCCGGTGAAGCTAAAACTGATGCTCGTGACGCTGCCATCATTGCCGAAGCTGCCCGTACCCTGCCTCAC
GCGCTACGCACGCTGAAACTGGCTGACGAGCAAATCGCCGAACTCTCCATGCTCTGCGGCTTCGATGATGATCTTGCCGCACAGACAACGCAGGCCAGCA
ACCGTATCCGCGGCCTTCTGACCCAGATACATCCGGCACTGGAGCGCGTTCTCGGTCCGAGACTTGATCACCCGGCGGTACTCGATCTTCTCCAGCGATA
TCCCTCACCAGAAAAACTCGCTTCGCTGGGTGAGAAGAAGCTGGCAGCCCAGCTCTGCAAACTTGCGCCTCGTCTGGGTAAACGCCTTGCAGCAGACATA
GCTCAGGCACTGGCCGAACAAACCGTCGTCGTTCCCGGCACGAATGCCGCTGCCGTAGTACTGCCACGTCTGGCACTCCCGCTCATCACGCTGCGTAAGC
AAAGAGACGAGGTGGCGCTTGCGGTAGAACAGCGAGTTCTTGCTCACCCTCTTTACCCGGTCCTGACCAGTATGCCCGGAGTCGGTGTCAGGACCGCAGC
CAGACTCCTCACCGAGGTCGCCTGCCGCGCCTTCGCCTCTGTCGCACATCTCGCTGCTTATGCTGGCCTTGCGCCGGTAACTCGGCGATCCGGCTCGTCA
ATACGCGGTGAGCATCCCTCGCGACGGGGTAATAAAGCTCTCAAACGGGCGTTGTTCCTGTCGGCCTTCGCCGCGCTCAGGGATCCGCTCTCCAGGGCTT
ACTACACCCGCAAAATGAGTCAGGGAAAACGACACAATCAGGCGCTTATCGCCCTGGCGAGACGACGCTGCGACGTTCTGTTCGCCATGATGCGCGACGG
GACTTTTTATACCCCGCAGGCGTCATAACATGCTTGACAACTTAATAGGGGCACCCCCC
Protein section
ORF number : 1
ORF 1
Length (bp) | Length (aa) | Begin | End | Strand | Fusion ORF |
---|---|---|---|---|---|
1197 bp | 398 aa | 232 | 1428 | + | No |
Chemistry : DEDD
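The ORF 1 coordinates can be cross-checked with simple arithmetic. The sketch below assumes that Begin and End are 1-based, inclusive positions and that the ORF ends with a single stop codon; the record does not state either convention, and the variable names are illustrative.

```python
# Coordinates taken from the ORF 1 table; 1-based inclusive positions are assumed.
begin, end = 232, 1428

orf_length_bp = end - begin + 1      # inclusive span on the + strand
codons = orf_length_bp // 3          # whole codons in the ORF
protein_length_aa = codons - 1       # assumes the final codon is a stop

print(orf_length_bp, orf_length_bp % 3, protein_length_aa)  # prints: 1197 0 398
```

Under these assumptions the span is a whole number of codons and agrees with the listed 1197 bp and 398 aa.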
ORF sequence :
MTESSDYESVQVFIGVDVGKDTHHAVAINRSGKRLFDKALPNDENKLRSLISDLKQHGQILLVVDQPATIGALPVAVARSEGVLVGYLPGLAMRRIADLH
AGEAKTDARDAAIIAEAARTLPHALRTLKLADEQIAELSMLCGFDDDLAAQTTQASNRIRGLLTQIHPALERVLGPRLDHPAVLDLLQRYPSPEKLASLG
EKKLAAQLCKLAPRLGKRLAADIAQALAEQTVVVPGTNAAAVVLPRLALPLITLRKQRDEVALAVEQRVLAHPLYPVLTSMPGVGVRTAARLLTEVACRA
FASVAHLAAYAGLAPVTRRSGSSIRGEHPSRRGNKALKRALFLSAFAALRDPLSRAYYTRKMSQGKRHNQALIALARRRCDVLFAMMRDGTFYTPQAS
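The start of the ORF sequence can be reproduced from the DNA sequence. The sketch below translates positions 232 to 300 of the IS, using the third 100 bp line of the DNA sequence and the standard codon assignments. The helper `translate` and the variable names are illustrative assumptions, and 1-based coordinates are assumed.

```python
from itertools import product

# Standard codon assignments, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate whole codons from the start of dna; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Positions 201-300 of the IS, copied from the third line of the DNA sequence.
segment_201_300 = (
    "ATCTGGTTAACCCATATACAAGGGAGACAGAATGACCGAATCCAGCGATT"
    "ACGAATCCGTCCAGGTCTTTATCGGCGTTGATGTCGGTAAAGATACGCAT"
)

offset = 232 - 201  # 0-based index of IS position 232 within this segment
peptide = translate(segment_201_300[offset:])
print(peptide)  # prints: MTESSDYESVQVFIGVDVGKDTH
```

The 23 residues obtained this way match the first 23 residues of the listed ORF sequence, which supports reading Begin = 232 as the first base of the ATG start codon.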
Comments
ISEc20 is 97% similar at the amino acid level to ISSf14 (IS110 family).
References
[1] NCBI Microbial Genomes Annotation Project (2005). Direct Submission, GenBank.