ISAli20

  • Family Tn3
  • Group
MGE type : IS
Related element(s) :
Isoform :
Synonym(s) : TnAli20
Accession number :
Transposition : ND
Origin : Azospirillum lipoferum
Host : Azospirillum lipoferum plasmid p5AZO 552
DNA section
IS Length : 3828 bp

Ends


IR Length :

IRL : GAGGGCATCGCGACATTTGTTCGGCAGACAACGCTATAAGGTCGAACGCA
IRR : GAGGGCATCGCGACATTTGTTCGGCAGACAACGCTAAGGTCGAACGGACC
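
The two inverted repeats can be compared directly. The following is a minimal Python sketch, not part of the ISfinder record, that counts how many leading positions IRL and IRR share and checks that IRR is written as the reverse complement of the right end of the element. The helper names `reverse_complement` and `shared_prefix_length` are hypothetical, and `RIGHT_END` is simply the last 28 nt of the DNA sequence listed in this record.

```python
# Illustrative sketch only; helper names are hypothetical, not ISfinder tooling.
IRL = "GAGGGCATCGCGACATTTGTTCGGCAGACAACGCTATAAGGTCGAACGCA"
IRR = "GAGGGCATCGCGACATTTGTTCGGCAGACAACGCTAAGGTCGAACGGACC"

# Last 28 nt of the element as listed in the DNA sequence (top strand).
RIGHT_END = "TCTGCCGAACAAATGTCGCGATGCCCTC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def shared_prefix_length(a: str, b: str) -> int:
    """Count identical positions from the start until the first mismatch."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


if __name__ == "__main__":
    print("IRL/IRR shared prefix (nt):", shared_prefix_length(IRL, IRR))
    print("IRR matches right end:",
          reverse_complement(RIGHT_END) == IRR[:len(RIGHT_END)])
```

Run as a script, this reports a shared prefix of 36 nt and confirms that the start of IRR is the reverse complement of the final 28 nt of the DNA sequence. A gapped alignment would be needed to compare the repeats beyond the first mismatch.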

Insertion site


Left flank :
Direct repeat :
Right flank :
DR Length :

DNA sequence

GAGGGCATCGCGACATTTGTTCGGCAGACAACGCTATAAGGTCGAACGCACCGGGAACAGCGATGGATGATTGCCCAGCCTGATGCTGGTCAGGAGGTGT
ACGGCGGCCTCTGGAGGCCCGCCAAGCTCATCTCCCGGTAAACGGTGGATCTCCCCAACCCCAGCTGCTTGGCGGCTTCGGTCGGCGACAGGCCAGCTTC
CACCAGTTTCAGTGCCGCTTTGATCTTGTCCATGTCCACCGGCTCGCGACCTGGGCGTTTGCCGCGGGCCCGGGCGGGCGCGATGCCGTCCTTCGTGCGC
TCGGCGATCAGCCGCTGTTCGAAATGCGCAATCGCGCCGAAGACGTGGAACACCAGTTCGCCGGCCGCCGAAGTGGTGTCGATCCTCTCCTCCAGGCTGA
GCAGGCCGATCCCGCGCTCCTTCAGCATCGCCACTGTCGCCAGCAACTCAGCGAGCGATCGGCCGAGCCGGTCGAGGCGGACAATGGCCAGCGTGTCGCC
ACGGCGCGCATGGGCGAGCAACGCCTCCAACCCTGGCCGGTCCTTCGCCTTGCCCGAGCACACGTCGGTGAAGACCTTCAAGGCGCCGGCCTGCTCCAGC
CGCAACCGTTGGCCGGCGACATCCTGGTCGCCGGTCGAGACCCGTGCGTAGCCCAGCATGTCGCCCATGCCCGTCCCCCAACCGGCCGTTCTGTGGACGG
TCGCGCCGAACCCGATCAGACTGCTGCGCCGCCGTCCACAGAATACGTCCCGTTAATCCCTGCCGTCCACAGTCCAATCCGACGTTTTCTGGACAGGGAA
AGACCACGTGGCACGACGCCAATTGCTGACGGAGGAGGAACGCCGACTGTTGTTCGGCCTGCCCACGGATCGGGACGCCTTGGCCCGCCATTACACGTTC
ACCCGCTCGGATTTGGACCTCATCGCCAGCCGACGCGGTAATGCCAACCGCTTGGGCTTCGCCGTGCAATTGGCGCTCTTGCGTTATCCCGGATTGCCTC
TGCCCCATATCGGCGAGCCGATCGACGCCGTCGTCGGCTGGGTGGCCGAACACCTCGAGCTGCCGGTGACGGCCTTCGCCGAGTATGCGCGTCGATCACA
AACGATGACCGATCACGCCCGCGACACTGTCGCCGCGCTTGGGCTGCGGTTCCCACGCGAGGCCGATTTGCCTGACTTGATCGAGGCCGCAGCGCAGGCG
GCCTGGATTTCCGACCAAGGAATGTCGATCATGACCGGGACCATCGCTGCACTCCGGTCAGCGAAGATCGTCCTGCCGTCCCCGGCGGTGATCGAGCGCG
CCGCCCTGGCGGGCCGTGCTCGCGCCCGAAAGCGGGCGGCGGATGCCTTGGTGGCCGATCTCACGGCCGAGCAGCGGGATAAACTCGACAAGCTGCTGGC
CGTTGATCCAGCAACCGGGATCACCTCGCTGACCTGGTTGAGGACTATTCCCACGGCGCCGAAAGCGGATCACGTTCGCGACGTGATCGACAAACTTCAC
GTCGTTCGCGGCATCGGAATCGATGCCGAGGCGCAGGCGCGTGTCCATGAAACCCGCTTTCGCCAGTTTGCCCGCGAGGGCATGGCCTCGCCCACCTACC
TGATCGAGCGCTGTGCGCCGAACCGGCGGCGCGCGACACTGGTGGCGTTGCTGATCGACCTGGAGAACCGGCTCACCGACGCCGCGCTCGACATGGCCGA
CAAGCTGATCGGCGGTGCGTTCACGCGCGCGAAGAACAACAAGGAAAAGACCTGCGTCGCGAAGACGAAGGACGTCGGCCGTCTGATGCGTCTGTTTCAC
CGCACCATCGAGGCGCTCAGCCTAGCCCAGGAAAGCGACGGGGACGCCTTCGCTCTCGTCAACGAGGCGGTGGGCTGGCCGCAGTTGCTGCGCGTGCGCG
GCGAGGTGGCCAGCCTCGCCGAGCTCGCGGAAGAGGACCCGCTGGTCCGCGCCGCCGACCGGTACGTCACCATCCGGAAGTTCGCCCCGGCGCTCCTCGA
AGCGCTCACGTTCAAGGCTGCCCGGAGCAAGGACCCGATCCTGGCGGCGGTCGAGTTGCTCAAGGAGCTCAACCGATCCGGCAAGCGCGACATCCCGGCG
GACGCGCCGATGCTGTTCCGCAAGGAGTGGCGGCGCCTCGTCACCAAGGACGGCAAGCCCAACCGGCGGCTCTACGAGACAGCGGTGCTCGCCACCCTGC
GCAACAAACTGCGCTCGGGCGACGTGTGGGTGGAGCGGTCGTCCAACTACCGCCGCTTCGACAGCTATCTGCTGCCCGCGGCGGCGGCGGCCCCGATCTC
AGCGGATTTGACGCTGCCCGCGACGGCCGAAGAGTGGCTGGGGGCGCTGGGGCGCGACCTCGACGAACAGCTGAAGCGTTTTGCCCAGCGCCTGCGCGAC
GGCCAGCTCGAGGGCGTCGAATTGCGCGATGAGCGGCTGCACATCGCGGCGCTGAAGGCGACCGCGCCGCCAGAAGCGGACGTTCTCGCCGACCGGCTCG
ACGCCCTTCTGCCGCGCGTGCGCATCACCGAACTGCTGCACGAGGTCAACCGCGCGACCGGCTTCGCGGCGGCGTTCACCAACCTGCGCACCGGTGAATC
CTGCGACAACGAGAACGCGCTGCTCGCCGTCATCCTAGCCGACGGCACCAACCTGGGCCTGACACGCATGGCGGAGGCCAGCCAGGGCGTGACCCGCGAC
CAGCTCATCTGGACCGCCGACGCCTGCATCCGGCCTGAAACCTACCAGTCGGCCCTGGCCCGGATCATCGACGCTCACCATCGGCTGCCCATGGCCGCCG
TCTGGGGTGGCGGAACGACGTCCTCATCGGACGGCCAGTTCTTCCGTTCCGGCAAGCGCGGCAACGTCGCCGGCGAGGTGAACGCCCGGTATGGCGGCAG
TCCCGGCTTCAGCTTCTACACCCACGTCTCGGACCAGCACGGTCCGTACCATGTCCGGGTCATCTCGGCGGCGGCCCACGAGGCCCCCTACGTTCTGGAC
GGCCTGCTGCACCATGGGACCGGCTTGAAGCTCGACACCCACTACGTCGATACAGGCGGCACCTCGGATCACGTGTTCATTTTGGCCGCCATGCTCGGCT
TCCGCTTCTGTCCTCGCCTGCGCGATTTTCCCGAACGTCGGCTGGCCAGCATCGAGCCGTCGAGCTGTTATCCGGACCTCCAGCCACTGCTGGGCCGGCG
GGTCAAGGTGGACGTCATCCGTGAGCATTGGAACGACGTGGTGCGCCTGGTCGCGTCGCTGAAGGCCGGCACCGTGGCGCCCTCGACCATGTTGAAGAAG
CTGGCCGCCTACGAGCGACAAAACCAGCTTGATCTGGCGCTCCAGGAACTGGGCCGCATCGAGCGCACACTCTTCATGATCCGCTGGTTGGAAACACCCG
AGCTCAGACGGAGCTGTCACATCGGGTTGAACAAAGGGGAGCAGCGTCACGCTCTGGCCCAGGCGATCTGCACGTTCAAACAGGGCCGGATCGCCGACCG
CGGGTCCCAAGCGCAGCAGTATCGCGCCTCGGGGCTGAACCTGCTCATCGCCGCGATCGTCTATTGGAACTCGACCTACATGGCCGACGCGGTTGGTCAT
CTGCGCGCCGTCGGCGGGACCGTACCCGACGACCTGCTCGTCCACACCTCACCGGTCGGCTGGGAGCACATCGGTCTGTCCGGCGATTTCCTGTGGGGCC
GCGCCGCGGCCGTGCCCATCGGCAGGCGACCGCTTAACCTGCGACGGGACCGCCATGCCGCCTGAAAGCCAAGTTCCCGGTCCGTTCGACCTTAGCGTTG
TCTGCCGAACAAATGTCGCGATGCCCTC
Protein section
ORF number : 2

 

ORF 1
Length : 579 bp (191 aa)
Begin : 90
End : 668
Strand : +
Fusion ORF : No
ORF function : Accessory Gene
AG : Tn3 resolvase

ORF sequence :

MGDMLGYARVSTGDQDVAGQRLRLEQAGALKVFTDVCSGKAKDRPGLEALLAHARRGDTLAIVRLDRLGRSLAELLATVAMLKERGIGLLSLEERIDTTS
AAGELVFHVFGAIAHFEQRLIAERTKDGIAPARARGKRPGREPVDMDKIKALKLVEAGLSPTEAAKQLGLGRSTVYREMSLAGLQRPPYTS

 

ORF 2
Length : 2958 bp (985 aa)
Begin : 808
End : 3765
Strand : +
Fusion ORF : No
ORF function : Transposase
Chemistry : DDE

ORF sequence :

VARRQLLTEEERRLLFGLPTDRDALARHYTFTRSDLDLIASRRGNANRLGFAVQLALLRYPGLPLPHIGEPIDAVVGWVAEHLELPVTAFAEYARRSQTM
TDHARDTVAALGLRFPREADLPDLIEAAAQAAWISDQGMSIMTGTIAALRSAKIVLPSPAVIERAALAGRARARKRAADALVADLTAEQRDKLDKLLAVD
PATGITSLTWLRTIPTAPKADHVRDVIDKLHVVRGIGIDAEAQARVHETRFRQFAREGMASPTYLIERCAPNRRRATLVALLIDLENRLTDAALDMADKL
IGGAFTRAKNNKEKTCVAKTKDVGRLMRLFHRTIEALSLAQESDGDAFALVNEAVGWPQLLRVRGEVASLAELAEEDPLVRAADRYVTIRKFAPALLEAL
TFKAARSKDPILAAVELLKELNRSGKRDIPADAPMLFRKEWRRLVTKDGKPNRRLYETAVLATLRNKLRSGDVWVERSSNYRRFDSYLLPAAAAAPISAD
LTLPATAEEWLGALGRDLDEQLKRFAQRLRDGQLEGVELRDERLHIAALKATAPPEADVLADRLDALLPRVRITELLHEVNRATGFAAAFTNLRTGESCD
NENALLAVILADGTNLGLTRMAEASQGVTRDQLIWTADACIRPETYQSALARIIDAHHRLPMAAVWGGGTTSSSDGQFFRSGKRGNVAGEVNARYGGSPG
FSFYTHVSDQHGPYHVRVISAAAHEAPYVLDGLLHHGTGLKLDTHYVDTGGTSDHVFILAAMLGFRFCPRLRDFPERRLASIEPSSCYPDLQPLLGRRVK
VDVIREHWNDVVRLVASLKAGTVAPSTMLKKLAAYERQNQLDLALQELGRIERTLFMIRWLETPELRRSCHIGLNKGEQRHALAQAICTFKQGRIADRGS
QAQQYRASGLNLLIAAIVYWNSTYMADAVGHLRAVGGTVPDDLLVHTSPVGWEHIGLSGDFLWGRAAAVPIGRRPLNLRRDRHAA
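
The Begin and End coordinates of ORF 2 can be checked against the DNA sequence. The sketch below is illustrative and not part of the ISfinder record. It assumes 1-based inclusive coordinates and that each full line of the DNA sequence holds 100 nt, so that the ninth line starts at position 801. It translates the first 14 codons from position 808 with the standard genetic code; `FRAGMENT`, `FRAGMENT_START`, `orf_length`, and `translate` are hypothetical names.

```python
# Illustrative sketch; names and the 100-nt-per-line assumption are not from ISfinder.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Standard genetic code, built from the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First 49 nt of the ninth DNA sequence line, assumed to cover positions 801-849.
FRAGMENT = "AGACCACGTGGCACGACGCCAATTGCTGACGGAGGAGGAACGCCGACTG"
FRAGMENT_START = 801


def orf_length(begin: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - begin + 1


def translate(dna: str) -> str:
    """Translate complete codons literally (no special start-codon handling)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    begin, end = 808, 3765
    print("ORF 2 length (bp):", orf_length(begin, end))
    print("First residues:", translate(FRAGMENT[begin - FRAGMENT_START:]))
```

The computed length is 2958 bp, and the literal translation begins VARRQLLTEEERRL, matching the start of the ORF 2 sequence above. The initial GTG codon is rendered as V rather than M, as it is in the record.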

 

Comments
The ISAli20 proteins show 62% (ORFA) and 58% (ORFB) amino acid similarity to those of ISSod9. The second ORF encodes the transposase.
References
[1] ISfinder annotation