UNC-Chapel Hill SOCI 052 - Measures of Variability Lab

Unformatted text preview:

Measures of Variability Lab Lab Objective: The objectives of this lab are: • Learn to calculate measures of variability in SPSS. • Learn to use box plots in SPSS. • Learn to use measures of variability to evaluate a research question. Directions: Use the data from the GSS to perform the following tasks. Demonstration: Calculating Measures of Variability and Constructing Box Plots with SPSS I will use the variable “hrs1” to demonstrate how to calculate the range, 25th percentile, 75th percentile, variance, and standard deviation using SPSS. a. What is this variable measuring? b. What is the level of measurement of this variable? To calculate measures of variability, do the following: a. From the pull-down menu, choose Analyze… Descriptive Statistics… Frequencies b. Choose the variable “hrs1” (NUMBER OF HOURS WORKED LAST WEEK) and use the arrow to move it to the blank space underneath the Variable(s) area. c. Click on Statistics d. In the area marked “dispersion”, make sure that there are checks next to the boxes with the following statistics: i. Std. deviation ii. Variance iii. Range iv. Minimum v. Maximum b. In the area marked “Percentile Values” put a check in the box next to Percentile(s) i. You can use this to add the 25th and 75th percentile 1. To do so, type in 25 and click add 2. Repeat the process with 75 instead of 25 c. You may also want to include measures of central tendency like: i. Mean ii. Median iii. Mode d. Click Continue… OK SPSS will calculate several statistics a. What are the standard deviation and variance of this variable? i. How do we interpret the variance or the standard deviation? 1b. What is the range? i. How do we interpret it? c. What is the Interquartile Range (IQR)? i. How do we interpret it? Suppose we wanted to look at the distribution of this variable. We can do this using a box plot or a histogram. To construct a box plot, do the following: a. From the pull-down menus select Graphs… Boxplot b. Select Simple c. Make sure that the bubble next to “Summaries of separate variables” is selected d. Click on “Define” e. Select the variable hrs1 (NUMBER OF HOURS WORKED LAST WEEK) and use the arrow to move it underneath the box marked “Boxes Represent” f. Click OK SPSS will produce a box plot of the variable a. In general, how do we interpret a box plot? i. What is being represented at the end of the vertical lines? ii. What is being represented inside the rectangular box? iii. What is being represented by the black band inside the rectangular box. b. What do you notice about the box plot produced by SPSS? i. Statistical packages rarely produce traditional box plots; instead they attempt to alert you to sample values which may be unusually distant from the bulk of the data. 1. These sample values are represented as circles or asterisks 2. They extend beyond the whiskers. This means that whiskers do not extend to the minimum and maximum of the sample, but to the smallest and largest values inside a “reasonable” distance from the end of the box. 3. The extra information provided by the flagging process enables you to distinguish between a truly skewed sample, and one whose apparent skewness is attributable to a single point. ii. SPSS has a two stage flagging process. 1. Values which are more than three box lengths from either end of the box receive an asterisk. 2. Values which are between one and a half and three box lengths from either end of the box receive a circle. 3. All other values are equivalent to a traditional box plot iii. The following diagram illustrates how SPSS calculates a box plot: 2Given this how do we interpret the statistics presented in the box plot? Suppose we wanted to compare the box plot output to the actual distribution of the data. We could use a histogram to achieve this. To do this, try the following: a. From the pull-down menu select Graphs… Histogram b. Select the variable hrs1 (NUMBER OF HOURS WORKED LAST WEEK) and use the arrow to move it underneath the box marked “Variable” c. Make sure that box next to “Display normal curve” is selected – this will fit a normal curve to the histogram SPSS will produce a histogram. How does the distribution of the data compare to the box plot? Lab Exercise We often hear that there is an earnings gap between men and women. Let’s explore this gap in income more closely by using data from the GSS. We will combine what we learned about measures of central tendency with what we have learned about measures of variability to investigate income differentials between men and women. We will use the following variables in our analysis: inccod98 and sex. a. If you do not already know what these variables are, then use the GSS webpage to help you identify the variables (http://webapp.icpsr.umich.edu/GSS/). b. What is the level of measurement of each variable? 3To look at differences between the variables, first look at the distribution of the variable sex. Unfortunately, SPSS does not calculate some measures of variability such as the index of qualitative variation (IQV), so this will have to be done by hand. a. Calculate the appropriate measure of variability for the variable sex i. What does this tell us about the distribution of this variable? ii. Is this distribution likely to be reflective of the larger population? b. Construct a box plot for both variables and compare the box plots to each other i. To plot two box plots on the same axis do the following: 1. From the pull-down menus select Graphs… Boxplot 2. Choose “Simple” 3. In the “Data in Chart Area” portion of the window, make sure the bubble next to “Summaries for groups of cases” is selected 4. Not “Summaries of Separate Variables” as in the last example. 5. Put the inccod98 variable in the blank under he “Variable” heading 6. Under the “Category Axis” heading, put the sex variable 7. Click OK ii. Compare the box plots to each other 1. Compare and contrast the two box plots. a. What is the approximate range of the data? b. What is the approximate IQR of the data? i. Where is most of the data located in each distribution? c. What is the median income of both groups? d. Are there any outliers or extreme cases (as defined by SPSS)? c. Construct separate histograms of income for men and women and compare them to the box plots. i. To plot two histograms


View Full Document

UNC-Chapel Hill SOCI 052 - Measures of Variability Lab

Download Measures of Variability Lab
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Measures of Variability Lab and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Measures of Variability Lab 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?