How can bioinformatics be used as a tool to determine evolutionary relationships and to better understand genetic diseases? In this laboratory investigation, you will use BLAST to compare several genes, and then use the information to construct a cladogram.
A cladogram is a treelike diagram in which the endpoint of each branch represents a specific species. The closer two species are located to each other, the more recently they shared a common ancestor. In the cladogram below, you will also see the shared derived characters. Note that the placement of a shared derived character corresponds to when (in a general, not a specific, sense) that character evolved; every species above the character label possesses that structure.
“What did T. rex taste like?”
Little is known about the fossil. It appears to be a new species. Upon careful examination of the fossil, small amounts of soft tissue have been discovered. Normally, soft tissue does not survive fossilization; however, rare cases of such preservation do occur.
Scientists were able to extract DNA nucleotides from the tissue and use the information to sequence several genes. Your task is to use BLAST to analyze these genes and determine the most likely placement of the fossil species on Figure 4.
A team of scientists has uncovered the fossil specimen in Figure 3 near Liaoning Province, China. Make some general observations about the morphology (physical structure) of the fossil, and then record your observations in your notebook.
Step 1 Form an initial hypothesis as to where you believe the fossil specimen should be placed on the cladogram based on the morphological observations you made earlier.
Step 2 Locate and download gene files.
Step 3 Upload a gene file into BLAST. Click on “Saved Strategies” from the menu at the top of the page. Under “Upload Search Strategy,” click on “Browse” and locate one of the gene files you saved onto your computer. A screen will appear with the parameters for your query already configured. NOTE: Do not alter any of the parameters. Scroll down the page and click on the “BLAST” button at the bottom. After collecting and analyzing all of the data for that particular gene (see instructions below), repeat this procedure for the other two gene sequences.
Step 4 The results page has two sections. The first section is a graphical display of the matching sequences. Scroll down to the section titled “Sequences producing significant alignments.” The species listed under this heading are those with sequences identical to or most similar to the gene of interest. The most similar sequences are listed first, and as you move down the list, the sequences become less similar to your gene of interest.
Recall that species with common ancestry will share similar genes. The more similar genes two species have in common, the more recent their common ancestor and the closer the two species will be located on a cladogram. As you collect information from BLAST for each of the gene files, you should be thinking about your original hypothesis and whether the data support your original placement of the fossil species on the cladogram or cause you to reject it.
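The idea that more similar sequences point to a closer relationship can be sketched in a few lines of Python. This is only an illustration, not how BLAST works internally: BLAST performs gapped local alignment and reports its own statistics, whereas the sketch below simply counts matching positions between short sequences that are assumed to be already aligned and of equal length. The function names and the toy sequences are hypothetical and are not taken from the lab's gene files.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of positions that match between two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Compare position by position, ignoring letter case.
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


def rank_by_similarity(query: str, candidates: dict[str, str]) -> list[tuple[str, float]]:
    """Return (name, percent identity) pairs, most similar first."""
    scored = [(name, percent_identity(query, seq)) for name, seq in candidates.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical toy data for illustration only (not real gene sequences).
QUERY = "ATGGCCTTAGACCTGA"
CANDIDATES = {
    "species_C": "ATGACATTCGACTTGT",  # several mismatches
    "species_A": "ATGGCCTTAGACCTGA",  # identical to the query
    "species_B": "ATGGCATTAGACCTGT",  # two mismatches
}

if __name__ == "__main__":
    for name, identity in rank_by_similarity(QUERY, CANDIDATES):
        print(f"{name}: {identity:.1f}% identical")
```

The ranked list mirrors the ordering of the BLAST results list: the species at the top is the one you would expect to sit closest to the query species on a cladogram, all else being equal.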
The higher the score, the closer the alignment. The lower the e value, the closer the alignment.
1. What species in the BLAST result has the most similar gene sequence to the gene of interest?
2. Where is that species located on your cladogram?
3. How similar is that gene sequence?
4. What species has the next most similar gene sequence to the gene of interest?
5. Based on what you have learned from the sequence analysis and what you know from the structure, decide where the new fossil species belongs on the cladogram with the other organisms. If necessary, redraw the cladogram you created before.
Compare and discuss your cladogram with your classmates. Does everyone agree with the placement of the fossil specimen? If not, what is the basis of the disagreement?
On the main page of BLAST, click on the link “List All Genomic Databases.” How many genomes are currently available for making comparisons using BLAST?
How does this limitation impact the proper analysis of the gene data used in this lab? What other data could be collected from the fossil specimen to help properly identify its evolutionary history?
Now that you’ve completed this investigation, you should feel more comfortable using BLAST. The next step is to learn how to find and BLAST your own genes of interest. Once you have found the gene on the website, you can copy the gene sequence and input it into a BLAST query. Your starting question may be: What is the function of actin in humans? Do other organisms have actin? If so, which ones?
3. Under “mRNA and Proteins,” click on the first file name. It will be named “NM_001100.3” or something similar. These standardized numbers make cataloging sequence files easier. Do not worry about the file number for now.
4. Just below the gene title, click on “FASTA.” This is the name for a particular format for displaying sequences.
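To make the FASTA format concrete: a FASTA file is plain text in which each record starts with a header line beginning with ">", followed by one or more lines of sequence. The sketch below reads such text into a dictionary. It is a minimal illustration rather than a full parser; the function name, record names, and sequences are hypothetical and are not the actual actin record from the website.

```python
def parse_fasta(text: str) -> dict[str, str]:
    """Map each FASTA header (without the leading '>') to its full sequence."""
    records: dict[str, str] = {}
    header = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A '>' line starts a new record.
            header = line[1:].strip()
            if header in records:
                raise ValueError(f"duplicate header: {header}")
            records[header] = ""
        elif header is None:
            raise ValueError("sequence data found before any '>' header line")
        else:
            # Sequence may be wrapped over several lines; join them.
            records[header] += line
    return records


# Hypothetical example text for illustration only.
EXAMPLE = """>example_record_1 hypothetical description
ATGGATGATGATATC
GCCGCGCTCGTC
>example_record_2
TTGACC
"""

if __name__ == "__main__":
    for name, sequence in parse_fasta(EXAMPLE).items():
        print(name, len(sequence), sequence)
```

The joined sequence string is what you would paste into a BLAST query box; the header line is only a label for the record.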