### Plan an investigation to collect bivariate data on an

Goal: Your goal is to educate students about a bivariate variables. Choose between the following:
• COVID rates and GDP of a country
• COVID rates and vaccination rates
• COVID rates and age
• COVID deaths over time of one country
• COVID deaths in the world over time
• If you have another other ideas, please have them approved by your teacher first before beginning your research.

They will need to understand why data is so important to this process and how the development of critical questions informs the data gathering. Your specific goals are: Prepare a report for the information drive using a PowerPoint.

Role: You are part of a team of analysts who have been asked by your company to help educate students about COVID and the contributing factors that affect it.

Audience: Your audience will be students and individuals interested in vaccines and wanting to learn more about COVID to produce a more effective vaccine.

Situation:
The number of COVID cases have been on the rise again in the world. Some countries are handling it better than others. You need to research the different factors that could affect why COVID rates vary so greatly from place to place.

Now you will research your topic. Some suggestions for research sites are
- Our World in Data
- Worldometers
- WHO International
- Financial Times
- World Bank

Product: Report to be released to the Public
Your task is to plan an investigation to collect bivariate data on an aspect involving the factors that affect COVID, an epidemic the world has not seen in so long. Remember that bivariate data is data that deals with two variables, where each value of one of the variables is paired with a value of the other variable. Since you will be creating a scatterplot with your data as part of the report you will be releasing to the public so accuracy is a must. It would be helpful if you choose variables that are numerical or quantitative.

To arrive at the report, you need to?
• Determine what numerical data you need.
• Determine how you will gather the data. What sampling procedure will you use? (Try to gather at least 50 data points)
• After collecting the data, organize it based on the relationship you have observed when comparing two numerical data. You may use a relative frequency table to observe the data you collected and make initial inference.
• Compute the correlation coefficient between two numerical data at a time and use a scatterplot to analyze any clustering or outlier that may affect the line of best fit.
• Present your findings using a PowerPoint. Describe the behavior of the distribution by stating the measure of center and spread.You may use a relative frequency table to show the behavior of your data.
• As you need to be accurate in what you will report, your generalization must determine if the data set suggest correlation or causation.
• Use words that are mathematically accurate but can be understood by the public when you release the report.

Standards:?
S.ID.1 Represent data with plots on the real number line (dot plots, histograms, and box plots).
S.ID.2 Use statistics?appropriate to?the shape of the data distribution to compare center (median, mean) and spread (interquartile range, standard deviation) of two or more different data sets.?
S.ID.3 Interpret differences in shape, center, and spread in the context of the data sets, accounting for possible effects?of extreme data points (outliers).
S.ID.6 Represent data on two quantitative variables on a scatter plot and describe how the variables are related.?
S.ID.8 Compute (using technology) and interpret the correlation coefficient of a linear