April 19

4-15-19 — AQ Data Collection and Project Outline

AQ Data Collection and Project Outline

Date: 4-17-19

  • Rationale
    • The rational for this lab is to continue gathering data to use for our independent research project as well as plan an outline for the project.
  • Procedure
    1. DNAMaster was used to determine the start codon used for each gene by each phage in the AQ cluster.
    2. Findings were recorded on a Google Sheets page and tendencies were calculated.
    3. The group convened and discussed the parts of the project that needed to be worked on and an outline was formed.
  • Results
    • The outline was made as follows:
    • Title:
      Subclustering of bacteriophage determined by start codon preference
      Guiding Question:
      Every phage has a different tendency to choose one of the following base pair sequences as start codons: ATG, GTG, or TTG. Given the percentage of each codon’s use in phages from clusters that infect Arthrobacter, can this preference be used to further cluster phages within existing clusters?
      Abstract:
      Start codons are nucleotide sequences that are found at the beginning of protein-coding genes of DNA. In bacteriophage specifically, they start with ATG, GTG or TTG start codons. In our experiment, we looked at the preferred start codons of bacteriophage that infect Arthrobacter in different clusters to examine the percentage of each start codon preferred in and between clusters. We found that within clusters, there were repeated patterns of preferred start codons that were unique to the cluster. In the AM cluster, the last four genes preferred the start codons GTG, GTG, ATG, and ATG in all 14 phages we looked at. With these discoveries, we can more easily characterize and group phages into their clusters based on their start codons as well as use these known patterns to reassure the clusterization of the phage.
      List of tools used:
      PhagesDB
      DNA Master
      Excel
      Splitstree
      Introduction (Background Information):
      In all DNA, genes start with specific three nucleotide sequences that promote transcription by a ribosome. These sequences are known as start codons and organisms can have specific preferences in start codons. Protein-coding genes of bacteriophages all start with either an ATG, GTG, or TTG start codon. On the Guiding Principles of Bacteriophage Genome Annotation, it mentions that, “… TTG is rarely used (about 7% of all genes). ATG and GTG are used at almost equivalent frequencies.”
      Types of Data Collected:
      Phage genomic sequences for clusters AM, AR, AQ
      Phylogenetic interrelatedness between phages within and between clusters
      Start codons for each gene of each phage within each cluster sampled
      Start codon usage percentage for the above phages
      Results (to date):
      Similarities of phages in each cluster were found. For example for all AM cluster phages the start codons for the last four genes were GTG ,GTG, ATG, and ATG. Phage Molivia, which belongs to AQ Cluster appears to be more closely related to AR and AM phages rather than to other AQ cluster phages. The average ATG, GTG, and TTG percentage within clusters don’t exactly align with the guiding principles mentioned by PhagesDB. The percentage of ATG and GTG were not similar; ATG was the most predominantly preferred start codon.
      Conclusions:
      No conclusions have been made to date
    • The following is an example of the start-codon preferences recorded for the AQ phage Anansi
  • Future Plans
    • The next step is to continue gathering data for the AU cluster and compare the start codon data to phylogenetic data for each cluster.
April 19

Project Abstract

4/17/19

Rational:

To write the abstract for the research porject.

Procedure:

  • Found the catalytic region of a lysin A
  • Researched the difference between lysin A and endolysin
  • Wrote the abstract for the project

Observations:

  • Found that proteins called as Lysin A’s have the same catalytic structure as endolysins
  • Found that endolysin is probably a general term that includes lysin A

Conclusion:

Looked at the differences between other lysins to see if they had the same catalytic region. Wrote the abstract for the project. Next lab we will polish the abstract and start to create the presentation for the project.

April 19

Project Outline

4/15/19

Rational:

To compare the hydropathy plots of possible holin genes in other Arthrobacter phages. To create the outline for the researh presentation.

Procedure:

  • Took the amino acid sequences of the predicted holin genes of AM cluster phages and two outside the cluster
  • Created hydropathy plots using TMHMM with these sequences
  • Added the project title and guiding question to the outline
  • Created a basic outline for the abstract

Observations:

  • The hydopathy plots of the AM phages were very similar to NapoleonB and at first glance some look almost identical

Fig.15 – NapoleonB’s hydropathy plot (left) looks almost identcal to Arcadia’s (right) another AM cluster phage

  • The differences between NapoleonB’s hydropathy plot and phages from other clusters were more obvious than with phages from the AM cluster

Fig.16 – NapoleonB’s hydropathy plot (left) shows more differences with Madamato (right) a BI cluster phage

Conclusion:

Created hydropathy plots for all of the AM phage predicted holins and a few others to compare. Finishd the outline for the research project presentation. Next lab we will write the abstact for our research project.

 

April 19

4.17.19 IRP

Rationale:

To create an abstract and to work on the background information of the presentation.

Procedure:

  1. I worked on finishing the results figures.
  2. I also worked on creating the background information and introduction slides of the powerpoint.
  3. I assisted in creating the abstract for the project.

Conclusions/Next Steps:

We can conclude that the NMT is apart of the NADH metabolic pathway. We also finished with the creation of the abstract. The next step will be to finish creating our presentation. We will also attempt to improve our phylogenetic tree, as it currently shows little more than the relation between the phages in the AM cluster.

April 19

4.15.19 IRP

Rationale:

Our goal was to work on figures for the results and to acquire further background information.

Procedures:

  1. I looked at the MSA data and created a figure to show the alignment of the amino acids from other organisms and phages that matched with NapoleonB.
  2. I looked at background information for NMT and NMN in order to begin working on the background part of our project.

Data:

This is the graphic that was created to show the alignment between amino acids.

Conclusions/Next Steps:

From the graphic I created, it can be seen that there are certain amino acids and amino acid sequences that are conserved throughout most of the genomes analyzed. The background information also served to provide a greater understanding of what NMT is actually for. The next step will be to continue working on the aspects required for the presentation and the abstract.

 

April 19

Independent research 4/17

Rationale: Start codons for all Arthrobacter phage tape measure proteins, minor tail proteins, and major tail proteins were analyzed.

Procedure: DNA Master and PhagesDB were used to gather data. Obtained DNA Master file from PhagesDB and uploaded onto DNA Master. Protein sequence was analyzed, size of gene, and the cluster were all documented onto a google sheet. Abstract for independent research was written using Word.

Results:

Future work/Conclusions: Couple of reverse genes that is uncharacteristic of tail proteins. Majority of the major tail proteins have the start “ATG” with a couple of “GTG” with only one “TTG”. Make graphs for presentation representing the hard data, and look into the reverse genes, and what makes them unique. Only one reverse gene from the major tail proteins 1/154 genes that were analyzed. Further work will be needed to be done to analyze this reverse gene as long as other reverse genes found.

 

April 19

04/17/19 PGRP Domain and Lysin A Structure

Rationale:

The purpose of today’s lab was to create an abstract for the powerpoint presentation as well as look at lysin A catalytic regions.

Tools:

  • JMOL
  • Word
  • NCBI

Procedure:

  • Drafted the abstract for the QTM
  • Used JMOL to look at Lysin A structures to determine the possible catalytic region
  • Researched the PGRP superfamily

Results:

The Written abstract

The Folded Lysin A Protein

Conclusions:

The lysin A protein has the same structurally conserved catalytic region as the annotated “endolysin” in NapoleonB. Both proteins contain the same beta sheet structure, while the lysin A is missing the alpha helices at the end of the molecule that NapoleonB’s endolysin possesses.

Next Steps:

Fold the PGRP proteins and determine catalytic region as well as any other lysin variation we find.

April 19

04/15/19 Project Outline and Data Gathering

Rationale:

The purpose of today’s lab was to create a project outline for our presentation as well as compare phage endolysin proteins to determine whether the M23 peptidase domain is highly conserved.

Tools:

  • Word
  • PhagesDB
  • BLASTp
  • RaptorX

Procedure

  • An outline that highlighted the rest of the independent project was created.
  • After that, multiple Arthrobacter phage lysin proteins were compared using PhagesDB, BLASTp, and RaptorX to compare their sequences.

Results

The following outline was created:

In addition to this, the following comparisons were drawn between different phage holin and endolysin proteins

Conclusions:

After viewing a wide array of different clusters of Arthrobacter phages it was discovered that only AM phages have the annotated endolysin protein containing the specific “HLH” sequence we have been looking for. The M23 Peptidase domain is not as highly conserved as expected, with the majority of phages, such as cluster AN, having a PGRP domain within their lysin proteins. It was also noted that there is variation of annotation comments regarding these supposedly similar proteins ranging from “Lysin A” to “endolysin”. It is unclear what decides these variations.

Next Steps:

Continue looking for the “HLH” sequence in M23 Peptidase domains in the rest of the listed phages. Also determine the defining characteristics of the PGRP domain.

April 19

Independent Research Projects 4.17.19

Rationale:

To understand more about phage, the class was divided into several small groups to generate research questions that utilize bioinformatic tools to learn more about questions about NapoleonB and/or related phages.

Materials:
  • Laptop
  • Canvas Bio-Lab info page
  • DNA Master
  • NCBI & Phagesdb Database
  • Phire
Procedure & Results:

The NapoleonB genome was analyzed on DNA Master for promoters(sigma-70), the complete result

hasn’t been analyzed yet, however online promoter finders didn’t back up the hypothesis of the sequence being a promoter. Phage promoters don’t have a complete database so the result is expected.

The group has written an abstract for the project during lab after discussion:

Repeats in genomics have a wide variety of significance, from signifying the start of a transposable element to being the terminator for a gene. In a newly sequenced and annotated bacteriophage, NapoleonB, a 46-base pair sequence that was repeated twice was found at around 25,500bp and 26,165bp in a non-coding region. The sequence was analyzed to see if evidence was present for multiple functional elements, as well as compared to other AM cluster phages to search for similarity. Tools such as Gepard, NCBI’s BLAST, Clustal Omega, IS Finder, attP site finders, and others were used to try and match the sequence of interest to a potential function. The sequence was also found to have 4-6 base pair flanking inverted repeats, which piqued research interest towards a possible transposable element or promoter. Using the previously mentioned tools, we were able to rule out the notion that the sequence was a transposon, attP site, or an overlooked ORF or protein. However, with the sequence found to have been conserved across 11 of 14 AM cluster phages, it was concluded that the likelihood of the sequence occurring as observed at random was extremely low. Therefore, the leading remaining hypothesis that this repeat could function as a regulatory sequence or promoter seemed promising. In order to definitively confirm or rule out the possibility of a regulator sequence at this stage, the sequence would have to be removed in an in vitro setting and the following genes observed.

The Next Step:

The next lab would be to analyze the data in DNA Master and hopefully find supporting evidence for the hypothesis.

April 19

Independent Research Projects 4.15.19

Rationale:

To understand more about phage, the class was divided into several small groups to generate research questions that utilize bioinformatic tools to learn more about questions about NapoleonB and/or related phages.

Materials:
  • Laptop
  • Canvas Bio-Lab info page
  • DNA Master
  • NCBI & Phagesdb Database
Procedure & Results:

The sequence was not backed by transposon finder software(IS finder), the project was then discussed with coaches and the remaining possibility for a conserved sequence within a cluster might be a recognition site for gene transcription such as a promoter.

The sequences were all found in front of the gene of the same pham in the AM cluster, which further supports the hypothesis.

The Next Step:

The next step would be to try to find supporting evidence that backs up the new hypothesis.