homework 4 - polygenic risk scores
bios25328
homework
A. Complete the following assessments
- Lecture 6 assessment quiz (10 points)
- Lecture 7 assessment quiz (10 points)
B. Complete and knit Lab 4 (70 points)
In this section, you will complete the practical implementation of polygenic risk scoring with the PRSice software, as in Lab 04. As a reminder, the lab involves:
- Setting up the environment and downloading necessary data:
  - Download and install PRSice software
  - Set up the Personal Genome Project data
  - Install required R packages (tidyverse, RSQLite, glue)
- Data exploration and preparation:
  - Explore the phenotype database
  - Analyze demographic distributions
  - Create phenotype data files for analysis
- Running PRSice analysis:
  - Compute polygenic risk scores for height
  - Use UK Biobank height GWAS results as the base data
  - Apply various p-value thresholds for score calculation
- Results analysis and visualization:
  - Plot observed versus predicted height values
  - Perform regression analysis to assess prediction accuracy
  - Interpret the results in the context of polygenic prediction
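The scoring and evaluation steps above can be sketched in a few lines. The example below is written in Python purely for illustration (the lab itself uses R and PRSice), and every SNP, effect size, genotype, and height value in it is made up. It assumes the simplest form of a polygenic score, a sum of effect size times effect-allele count over the SNPs that pass a p-value threshold, and uses the squared correlation between score and observed height as the measure of prediction accuracy. PRSice also handles steps such as clumping and score normalization, which this sketch does not attempt to reproduce.

```python
# Hypothetical GWAS summary statistics: (snp_id, effect_size, p_value).
gwas = [
    ("rs1", 0.30, 1e-9),
    ("rs2", -0.20, 1e-4),
    ("rs3", 0.10, 0.03),
    ("rs4", 0.05, 0.40),
]

# Hypothetical effect-allele counts (0, 1, or 2) per individual, in the SNP order above.
genotypes = {
    "ind1": [2, 0, 1, 1],
    "ind2": [1, 1, 0, 2],
    "ind3": [0, 2, 2, 0],
    "ind4": [2, 1, 1, 1],
    "ind5": [1, 0, 2, 2],
}

# Hypothetical observed heights in cm.
height = {"ind1": 182.0, "ind2": 171.0, "ind3": 160.0, "ind4": 176.0, "ind5": 175.0}


def prs(dosages, threshold):
    """Sum effect size x allele count over SNPs with p-value <= threshold."""
    return sum(beta * g for (_, beta, p), g in zip(gwas, dosages) if p <= threshold)


def r_squared(x, y):
    """Squared Pearson correlation, i.e. R^2 of a simple linear regression of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy ** 2 / (sxx * syy)


ids = sorted(genotypes)
observed = [height[i] for i in ids]

# Recompute the score at several p-value thresholds and compare accuracy.
results = {}
for threshold in (5e-8, 1e-3, 0.05, 1.0):
    scores = [prs(genotypes[i], threshold) for i in ids]
    results[threshold] = r_squared(scores, observed)
    print(f"p <= {threshold:g}: R^2 = {results[threshold]:.3f}")
```

Comparing the R^2 values across thresholds mirrors what the lab asks you to interpret: a stricter threshold keeps fewer but more reliable SNPs, while a looser one adds more signal along with more noise.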
The lab will help you understand:
- The practical implementation of PRS calculations
- The relationship between GWAS results and phenotype prediction
- The challenges and limitations of polygenic risk scoring
- The interpretation of PRS results in real-world data
Make sure to document your code, results, and interpretations clearly in your knitted document. The grading will focus on:
- Correct implementation of the analysis pipeline
- Quality of data visualization
- Clarity of interpretation
- Proper documentation of the process