Data & Statistics Projects: Project 2 Minitab
Group Expectations
Meetings: Each group should meet at least twelve hours before each report is due.
1st Draft: Before meetings and at least 48 hours before the report's due date on the syllabus, each group member should post a draft version of their part of the report in their D2L group discussions. Before meetings and 24 to 36 hours before each report is due, the group leader should post a draft of the entire report to the group project's dropbox. All group
Evaluate your peers by submitting one peer review for each member of your group. Individual grades will greatly depend on turning in your own evaluations by the due date and on the evaluations of you by fellow group members.
The group leader will immediately (within 2 to 3 days of each previous project or pre-project being due) assign all parts to the various group members, keeping the Wrapping It Up part for himself or herself and splitting the remaining parts as equally as possible based on the point distribution in the rubric and the part divisions already assigned in the project description. The group leader is responsible for ensuring group members post their respective assigned parts of the project at least two days prior to the project due date - or even earlier if the leader specifies an earlier time when assigning parts, at the leader's discretion. If a group member is late, the leader will reassign that member's part to active group members and will let the instructor know immediately.
Other
Report Expectations
To be successful with this project, students should:
***This video was made in a previous term using a previous version of Minitab. Some of the functions you will use have been updated since then. Be sure to read the instructions and follow them exactly.
Instructions for Minitab Only
Note: Minitab should be completed all at once by one member of the group - or possibly by the group as a whole during a face-to-face meeting. Splitting parts of Minitab is far from ideal since Minitab parts cannot be copied and pasted from file to file.
Please note that Minitab 18 will only work on a PC. If you have no access to a PC where you can install Minitab or where Minitab is already installed (such as the Austin Peay library or the lab in the Austin Peay building), then please email your instructor. If you are having trouble following any of these instructions, then please contact your instructor.
1 - Look at all of your data from your appendix in Project 1. For all of the quantitative data, make sure the cells contain nothing but numbers: if you have dollar signs, dashes, or any other special characters, you must remove them. Data ranges such as 18-22, 23-27, 28-32, and so on are not acceptable. Replace each range with a single value, its class midpoint, by averaging the lower limit of the class (e.g., 18) and the upper limit (e.g., 22): (18 + 22) / 2 = 20.
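If your data set is large, the cleanup in step 1 can be sketched in Python before you paste the values into Minitab. This is only an illustration: the function names and the sample values are made up, and it assumes ranges are written as two numbers separated by a single dash.

```python
def clean_number(text):
    """Strip dollar signs, commas, and spaces, then convert to a number."""
    return float(text.replace("$", "").replace(",", "").strip())


def class_midpoint(range_text):
    """Average the lower and upper limits of a class written like '18-22'."""
    lower, upper = (float(part) for part in range_text.split("-"))
    return (lower + upper) / 2


# Hypothetical survey answers recorded as age ranges.
responses = ["18-22", "23-27", "18-22", "28-32"]
midpoints = [class_midpoint(r) for r in responses]
print(midpoints)                   # [20.0, 25.0, 20.0, 30.0]
print(clean_number("$1,250.50"))   # 1250.5
```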
Once all of your quantitative data is purely numerical with no symbols or letters at all, highlight all of your
Once all data are in and all columns correctly have -T or no -T, you should add concise column labels to represent the data. For example, I'll transform my hypothetical pre-project report questions into column titles. Here are the questions:
1. On a scale of 0 to 10 with 0 being
2. On that same scale, how would you rate the book, Deathly Hallows, if you've read the book?
3. On a scale of 0 to 10 with 0 being indifferent and 10 being hopelessly devoted, how big of a Harry Potter fan are you?
4. Who is your favorite character in the movie?
5. How old are you, if you don't mind disclosing?
6. Harry/Hermione or Harry/Ginny?
7. If you could do one spell, what would it be?
And my column titles would
2 - Stats, Histograms, and Boxplots: Do Stat > Basic Statistics > Display Descriptive Statistics. Inside variables, select all possible quantitative variables listed. Click on Statistics and additionally select Interquartile range,
This step will give you 1) a histogram, 2) a boxplot, and 3) columns of detailed statistics in the session window for every quantitative variable. Each histogram and boxplot will have its own window.
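If you want to sanity-check the session window numbers outside Minitab, the same kinds of statistics can be sketched in Python. The `ages` list is made up, and the quartile rule chosen here (`method="exclusive"`) is an assumption that may not match Minitab's output exactly for every data set.

```python
import statistics

# Hypothetical quantitative variable (ages of nine respondents).
ages = [19, 21, 22, 22, 24, 25, 27, 30, 34]

mean = statistics.mean(ages)
median = statistics.median(ages)
stdev = statistics.stdev(ages)  # sample standard deviation (divides by n - 1)

# Quartiles; the interquartile range is Q3 - Q1.
q1, q2, q3 = statistics.quantiles(ages, n=4, method="exclusive")
iqr = q3 - q1

print(f"mean={mean:.2f} median={median} stdev={stdev:.2f}")
print(f"Q1={q1} Q3={q3} IQR={iqr}")  # Q1=21.5 Q3=28.5 IQR=7.0
```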
2a - Heading: Right-click the previous Descriptive Statistics output in the Minitab session window, choose the option to add a note, and type the following:
Your Group Name
List of Group Members' Names
MATH 1530 Elements of Statistics
Project #2: Minitab
Due Date: (actually type the due date)
3 - Bar Charts: Graph > Bar Chart
5 - Pie Chart: Graph > Pie Chart. Leave Chart counts of unique values selected. Under Categorical variables, choose all of your categorical variables. Select the Pie Options button and choose Decreasing volume. Select Labels, choose the Slice Labels tab, and select Category name and Percent. Select the Multiple Graphs button and choose On separate graphs.
6 - To determine the linear regression model and to see it drawn on a scatterplot of data, select Stat > Regression > Fitted Line Plot. We want to model the relationship between two of your quantitative variables. So, in the window, double-click
The linear regression model equation for your variables will be displayed at the top of your graph window, and a scatterplot of the data (red points) will be drawn with the regression line (blue line). Remember that the points on the regression line are the model's predicted values, not your observed data.
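To see where the equation at the top of the fitted line plot comes from, here is a minimal Python sketch of a least-squares line. The data and the names `fit_line`, `xs`, and `ys` are hypothetical; Minitab does this calculation for you, so this is only for understanding.

```python
def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares regression line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Hypothetical paired data for two quantitative variables.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

intercept, slope = fit_line(xs, ys)
print(f"y = {intercept:.2f} + {slope:.2f} x")  # y = 2.20 + 0.60 x

# Points on the line are the model's predictions, not the observed data.
predicted = [intercept + slope * x for x in xs]
```

Comparing `predicted` with `ys` shows how far each observed point sits from the line.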
7 - Compute the correlation of your
In this part of the project, choose your quantitative data that looks the most
8 - Construct a normal probability plot of this data by selecting Graph > Probability
9 - Prepare a column for receiving the standardized data by clicking in the gray cell just below the first empty column and typing z. To standardize, select Calc > Standardize, choose your best bell-shaped quantitative variable for input columns, choose the z column for storing results, and click OK. You should now have new data in the z column, which are the standardized values (z-scores) of every data value listed in your best bell-shaped quantitative data variable.
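The standardizing in step 9 subtracts the mean from each value and divides by the standard deviation. A minimal Python sketch of that calculation follows; the data are made up, and using the sample standard deviation (dividing by n - 1) is an assumption about what you want to reproduce.

```python
import statistics


def standardize(values):
    """Return the z-score of every value: (value - mean) / standard deviation."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation
    return [(v - mean) / sd for v in values]


# Hypothetical bell-shaped quantitative variable.
data = [2, 4, 4, 4, 5, 5, 7, 9]
z_scores = standardize(data)

for value, z in zip(data, z_scores):
    print(f"{value:>3}  z = {z:+.2f}")
```

A quick check on any z column: its mean should be 0 and its standard deviation should be 1, apart from rounding.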
10 - Assume your best bell-shaped quantitative data are normally distributed with a mean equal to the computed sample mean and a standard deviation equal to the computed sample standard deviation. Click Calc > Probability Distributions >
Save your project as Project2 (File > Save Project As > Project2_GroupName)
After loading your file, reopen straight from the dropbox to make sure that:
The Report
PART A
Choose the same quantitative variable from Project 1. Make a bolded heading for the first variable - the variable that you analyzed in Part B of Project 1. Under that heading, copy and paste that variable's frequency table from Minitab. Copy the statistics row for that data from the Descriptive Statistics in the session window of Minitab, including column headings. (Descriptive Statistics should have rows for each data variable, but delete all of the rows except for this first quantitative variable you've chosen.) Copy the histogram and boxplot for this data. Do the frequency table, stats, histogram, and boxplot in this Project 2 agree with what you found in Project 1? If not, what is different and why?
Next, copy the bar chart for this data. For the bar chart, tell whether there seems to be an association by looking at the clustered bars. Note: When no association exists, the clustered bars will be about the same height (a uniform distribution). If bar heights differ substantially, an association between the displayed variables is implied.
Choose the same categorical variable from Project 1. Make a bolded heading for the categorical variable. Under that heading, copy and paste that variable's frequency table from Minitab. Does the frequency table agree with what you found in Project 1? If not, what is different and why? Copy the pie chart for this variable. Does the pie chart look similar to Project 1's pie chart? If not, what is different and why?
PART B
Copy and paste the scatterplot. Describe what the scatterplot looks like, the strength of the association of the variables from the scatterplot, and the direction of the association (positive or negative).
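Strength and direction can also be expressed as a single number, the correlation coefficient r. The sketch below is a hand calculation in Python on made-up data, with hypothetical names, so you can see what the number measures; use Minitab's value in your report.

```python
import math


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical paired data.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

r = pearson_r(xs, ys)
direction = "positive" if r > 0 else "negative"
print(f"r = {r:.3f} ({direction} association)")  # r = 0.775 (positive association)
```

Values of r near 1 or -1 indicate a strong linear association; values near 0 indicate a weak one.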
Copy and paste the correlation analysis. The correlation coefficient, r, for your
PART C
In order to use the normal distribution and its associated area under the curve to compute expected percentages or probability, we must assume the data are reasonably normally distributed: unimodal, symmetric, without skew. However, a histogram is not reliable enough when the number of data
Copy and paste in the histogram from the data you used for Part C, which is the quantitative data that was your most bell-shaped. Describe this histogram in terms of 1) modality, 2) symmetry, and 3) skewness.
Copy and paste your normal probability plot. Do the data in this plot line up well with each other in one single line? (Note: Lining up in multiple, perfectly straight lines is not what we are talking about here. Rather, do the data roughly form one single, approximately straight line?) Does the normal probability plot indicate the data are normal, approximately normal, or not normal?
Copy and paste the standardized values (z-scores) from the z column you created. Using the Empirical Rule's definition of outliers, are any of the data values potentially extreme values (outliers)? If so, which ones are outliers? Explain why you were able to conclude that you do or do not have outliers.
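Checking z-scores against a cutoff can be sketched in Python as follows. The cutoff of 3 standard deviations is an assumption made for illustration (2 is also commonly used); apply the definition given in your course. The data and the name `flag_outliers` are hypothetical.

```python
import statistics


def flag_outliers(values, threshold=3.0):
    """Return the values whose z-score is beyond +/- threshold."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation
    return [v for v in values if abs((v - mean) / sd) > threshold]


# Hypothetical data with one value far from the rest.
data = [20, 21, 22, 22, 23, 23, 24, 24, 25, 25, 26, 26, 27, 28, 60]

print(flag_outliers(data))  # [60]
```

Note that with very small samples no value can reach a z-score of 3, so an empty result does not by itself prove the data are free of unusual values.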
Copy and paste the cumulative distribution functions for Q1 and Q3. The probability that a data value selected at random would be less than Q1 is ???
If the data were perfectly normal, what would the probability be of selecting a data value less than Q1? More than Q3? (Hint: In other words, what proportion of the data do we expect to be below Q1 and above Q3 by the definitions of Q1 and Q3?)
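The cumulative probabilities for Q1 and Q3 can be sketched in Python with a normal model built from the sample mean and sample standard deviation. The data are made up and the quartile rule is Python's default, so the numbers will not match your Minitab output; the point is only to show what the cumulative distribution function returns.

```python
from statistics import NormalDist, mean, quantiles, stdev

# Hypothetical bell-shaped quantitative variable.
data = [19, 21, 22, 22, 24, 25, 27, 30, 34]

# Normal model with the sample mean and sample standard deviation.
model = NormalDist(mu=mean(data), sigma=stdev(data))

q1, _, q3 = quantiles(data, n=4)

p_below_q1 = model.cdf(q1)       # P(X < Q1) under the model
p_above_q3 = 1 - model.cdf(q3)   # P(X > Q3) under the model

print(f"P(X < Q1) = {p_below_q1:.3f}")
print(f"P(X > Q3) = {p_above_q3:.3f}")
```

Comparing these two model probabilities with the proportions you expect from the definitions of Q1 and Q3 gives one more check on how normal the data are.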
Wrapping It Up
Have a paragraph at the end detailing what exactly each group member did to contribute to the entire group effort.
You will want the document to have a title page with a title for the paper, Math 1530, the date, the name of the group, and a list of the group members.
Save your MS Word document as Project2_GroupName.doc or Project2_GroupName.docx.
Load the files to the
After grading and correcting your report, load the report to D2L's Project 2 Report dropbox. Every single group member needs to reopen the MS Word report straight from the dropbox to make sure that all parts are completely answered and that this report is the latest,
Evaluate your peers by submitting one peer review for each member of your group. Individual grades will greatly depend on turning in your own evaluations by the due date, on the evaluations of you by fellow group members, and on the evidence of what you did in that last report paragraph and in the discussion forum.