4/12 Emily Johnson
Purpose: The goal for today was to use bioinformatics tools to compare our HNH Endonuclease sequences
Materials:
PhagesDB
Phamerator
NCBI Multiple Align
NCBI Global Align
EBI Clustal-Omega
EBI MUSCLE Alignment
Methods:
We divided up the work today: Navya was in charge of using PhagesDB and Phamerator to gather data about our gene, such as GC content, pham numbers, etc. Alex used NCBI Multiple Align, and Global Align to run comparisons on the genes, and I used EBI (the European NCBI) Clustal-Omega, and MUSCLE Alignment to compare two genes at once and three genes at once.
First, I used DNA Master to copy the nucleotide and amino acid sequences from each gene so I could run them through the databases.
Then, I used Clustal-Omega to compare the nucleotide sequences of: Caterpillar v Nubia, Caterpillar v Shrooms, and Nubia v Shrooms
I then used Clustal-Omega again to compare the amino acid sequences of: Caterpillar v Nubia, Caterpillar v Shrooms, and Nubia v Shrooms
After I did that, I realized that I could use MUSCLE Align on EBI to compare all three genomes, so I then did that where Sequence 1=Caterpillar, Sequence 2= Nubia and Sequence 3= Shrooms and obtained a percent identity matrix.
Results:
Nucleotide Similarity:
Caterpillar vs Nubia Nucleotide Similarity; Sequence 1=Caterpillar, 2=Nubia
Caterpillar vs Shrooms Nucleotide Similarity ;Sequence 1=Caterpillar, 2=Nubia
Nubia vs Shrooms Nucleotide Similarity; Sequence 1= Nubia, 2=Shrooms
Amino Acid Similarity:
Caterpillar vs Shrooms Amino Acid Similarity; Sequence 1=Caterpillar, 2=Shrooms
Nubia vs Shrooms Amino Acid Similarity;Sequence 1=Nubia, 2=Shrooms
Identity Matrix:
Sequence 1=Caterpillar, 2=Nubia, 3=Shrooms
Discussion:
So, from this we learned that the percent similarity of the nucleotide sequences is much higher because it can only be made of 4 options: A,G,T, or C while there are 20 options for each amino acid. Also, as we can see here, Caterpillar and Nubia have the highest percent amino acid similarity of 27%. Event though this is the highest similarity, it is not a significant percent similarity.
Next lab, I will hopefully be able to compare the structures of the proteins, not just the sequences. Because the sequences of Caterpillar and Nubia are the most similar, I would expect that the structures of the two are also the most similar.