Front Back
Descriptive Statistics
to summarize and describe a set of numbers
Inferential Statistics
to draw conclusions and make inferences based on the scores collected in a study but going beyond them
Variable
any characteristic that varies among individuals (like height)
Value
possible number or category that a score can have
Nominal (categorical) Variable
variable that has values that are names or categories
Numeric (quantitative) Variable
variable that has values that are numbers
Rank-order (ordinal)
numeric variable in which values give relative position of things measured (like place in a race or differences between small, medium, and large)
Equal-Interval
numeric variable in which differences between values correspond to differences in the underlying thing being measured (age)
When making histograms...
put the frequencies on the y-axis and the values of a numerical variable on the x-axis
Frequency Polygon
a histogram made by joining the middle-top points of a histogram's columns, special line graphs
Compare histograms to bar graphs
Histograms- y-axis is frequencies, x-axis is numeric, no gaps between the columns, inherent order, can interpret shape Bar graphs- y-axis is variables, x-axis is frequencies, gaps between columns, no inherent order, cannot be interpreted
Name modalities
unimodal-one peak bimodal- two peaks multimodal- two or more peaks rectangular- approximately equal
What does skewed to the left mean?
low distribution in lower numbers, negatively (look at tail)
What does skewed to the right mean?
low distribution in higher numbers, positively (look at tail)
Mean
The arithmetic average, best guess in long run, sensitive to outliers, balances the scores, but not in the middle (most of the time) M=∑x÷N 1. Add up all the scores 2. Get total number of scores 3. Divide the sum by the number of scores in total
Deviation scores
distances from the mean
Mode
Most frequent score in distribution, preferred measure measure of central tendency for nominal variables
Median
middle score when arranged from lowest to highest 1. Sort numbers from lowest to highest 2. Calculate the position of the middle scores using (N+1)÷2 3. Determine the value of the middle score by counting inwards towards the middle of the sorted numbers
When are mean, median, and mode the same?
When the graph is symmetrical
Resistance to outliers
Median and mode are resistant to outliers. The mean is not.
Which measure of central tendency should we use
use the mean, unless there are extreme outliers, then use the median. Use the mode for nominal variables
Variability
how spread out or scattered the scores in a distribution are; the great the variability, the more spread are the scores
Variance
"how spread out scores are around the mean" the average of each score's SQUARED deviation from the mean
Calculating Variance
∑(x-M)2÷N=SD2 1. Calculate the mean 2. Calculate the deviation scores (M-Raw score) 3. Square the deviation scores 4. Sun of the squares 5. Divide that sum by the total number of scores Top part of formula is the Sum of Squares (SS), CAN NEVER BE NEGATIVE
Standard deviation
the positive square root of the variance. It is the average amount that scores differ from the mean. The standard deviation expresses variability in the same unit as the raw data CAN NEVER BE NEGATIVE SD=√SD2
Why do we usually prefer the standard deviation as a measure of variability?
Variance= average of each score's SQUARED deviation from the mean Standard deviation= average amount that scores differ from the mean
Resistant to outliers part 2
Variance and standard deviation are NOT resistant to outliers because they consider all scores- avoid using them for skewed deviations
Z Score
describes distance of single score to mean in standard deviation units. The number of standard deviations a score above or below the mean,
Converting raw scores into Z scores
Z=(X-M)÷SD 1. Calculate M and SD, if not provided. 2. Plug in numbers into formula
Converting Z scores into raw scores
X=Z×SD+M
Correlation
relationship between two interval variables.
Describing the relationship between 2 variables
describe type, direction, and strength of the relationship. Also look out for outliers
Types of relationships
Linear- pattern of dots fall roughly in a straight line *will only use linear Curvilinear- pattern of dots fall roughly on a curve
When do we get a positive, negative, or 0 Z score?
Positive= raw score above M 0= raw score equal to M Negative= raw score below M
Positive linear relationships
values on both variable go in the same direction
Negative linear relationships
values on the two variables go in opposite directions
Strength of a relationship
determined by how closely the points follow a clear form
Pearson Correlation coefficient
measures the direction and strength of a linear relationship between the two equal interval variable X and y r=∑ZxZy÷N r is the average of the cross-products of Z scores
Cross Products of Z scores
the result of multiplying a person's Z score on one variable by the person's Z score on another variable
Calculating r
1. Convert raw scores into Z scores if necessary 2. Multiply to get cross products 3. Sum up the cross products 4. Divide by N 5. Describe the results IN WORDS *remember to state strength and direction of the relationship and to express relationship in words
Interpreting r
r ranges from -1 to +1 The closer r is to 1, the stronger the relationship/ correlation.
Why use Z scores to calculate r?
Z scores are used since they bring the scores of the two variables onto the same scale. If Z scores within a pair tend to have: 1. the same signs: positive correlation 2. different signs: negative correlation The strength of the correlation is determined by how consistent these trends…
r and outliers
r is not resistant to outliers
Predicting scores
If there is a relationship between 2 variables, then we can predict the criterion variable (Y) from the predictor variable (X).
Predicting Z scores from Z scores
Zy= β×Zx β=r
Prediction error
prediction error= observed value-predicted value r2 expresses the proportion of variance(or variability) in one variable can be explained by the other variable. *proportion of variance accounted for* r2 expresses by what proportions the variability of our errors in predicting Y can be r…
Scatter Plots
1. Draw the axes and decide which variable goes on which axis *The variable that is doing the predicting or causing goes on the horizontal axis 2. Determine the range of values to use for each variable and mark them on the axes 3. Mark a dot for each pair of scores
β
standardized regression coefficient

Access the best Study Guides, Lecture Notes and Practice Exams

Login

Join to view and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?