DOC PREVIEW
UT Knoxville STAT 201 - 1) basic_stats_review

This preview shows page 1-2-3-4 out of 13 pages.

Save
View full document
Premium Document
Do you want full access? Go Premium and unlock all 13 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Course Analysis of Variance Topic Basic Statistics Review 1 BASIC STATISTICS REVIEW We ll begin with a review of basic statistics Hopefully this will bring us all up to date so that we are on the same page when we discuss more complex issues This basic review will briefly highlight important background information If you feel that you require more information I suggest that you review an undergraduate statistics book such as Gravetter F J Wallnau L B 2000 Statistics for the behavioral sciences 5th ed Australia Wadsworth CHARACTERISTICS OF DISTRIBUTIONS Distributions of data are characterized in terms of shape central tendency and variability Shape The height Y Axis of a distribution typically reflects the frequency or relative frequency of particular scores X axis in the distribution Symmetric Distributions A symmetrical distribution is one in which a straight line divides the distribution into mirror images Skewed distributions A skewed distribution has scores that are more frequent at one end of the distribution than the other A positively skewed distribution has the majority of scores on the negative end of the distribution and a few scores trail to the positive end e g a distribution of income most persons earn 20 000 80 000 a small number of individuals earn millions per year A negatively skewed distribution has the majority of scores on the positive end and few scores trail to the negative end e g a distribution of self esteem most persons feel positively about themselves a small portion of persons feel negatively about themselves Course Analysis of Variance Topic Basic Statistics Review 2 Central Tendency Measures of central tendency provide an estimate of the center of the distribution Two commonly used measures of central tendency are the mean and the mode Mean The mean is simply the average score That is the mean is computed by summing the values of each score and dividing by the total number of scores Greek letters represents population parameters and the mean of the population is expressed as X N Sample statistics are represented by Arabic letters and the sample statistic is expressed as X X N E g For a population of five scores 2 5 8 9 10 the mean is X 2 5 8 9 10 6 8 N 5 An important characteristic of the mean is its sensitivity to extreme scores For example if the value of 10 in the above example were changed to a value of 100 the mean would shift from 6 8 to 24 8 This suggests that extreme scores can have a dramatic affect on the average score Weighted mean Often researchers combine information from multiple samples and are interested in the average score of the combined sample A weighted mean adjusts the mean score across samples by the number of scores in each sample That is larger samples are weighted more heavily than are smaller samples another way of thinking of this which is reflected in the right most formula below is that the weighted mean simply sums across the scores in each sample and divides by the total number of scores when reading the formula keep in mind that via algebra X X n E g Bob watches on average 2 hours of television Monday through Firday and 4 hours of television on the weekend What is the average amount of television Bob watches in a week If your answer is 3 then you did not take into account the relative size of the samples The mean for the weekdays M 2 is based upon a sample size of 5 i e Mon Fri whereas the mean for the weekend M 4 is based upon a sample size of 2 i e Sat Sun A weighted mean takes into account the number of scores in each sample n1 n2 5 2 X 1 X 2 5 2 2 4 2 57 weightedmean X1 X 2 2 4 n1 n 2 n1 n2 7 7 n1 n 2 7 Median The median is the score that divides the distribution in half That is it is the value at which 50 of the scores are above and below i e the 50th percentile E g In the population 2 5 8 9 10 the median is 8 There are numerous formulas which provide different values for the median when there are repeated values in the middle of the distribution An important characteristic of the median is that unlike the mean it is unaffected by extreme scores Notice in the above example if the 10 were changed to a 100 the median would remain 8 Course Analysis of Variance Topic Basic Statistics Review 3 Mode The mode is the most frequently occurring score There can be several modes E g In the population 2 5 5 8 9 9 5 and 9 are the modes and the distribution is bimodal In the population 2 5 5 5 9 9 5 is the mode and the distribution is unimodal Variability Variability reflects the variation in the scores That is are all of the scores within a distribution similar or are they different E g Population A 5 6 7 8 9 and Population B 7 7 7 7 7 both have a mean of 7 however the populations differ in terms of the extent to which the scores within the distribution are similar There are several measures of variation such as range and semi interquartile range In this class however we will focus primarily on standard deviation and it s squared value the variance Standard deviation measures the average distance of the scores from the mean There are separate formulas for the standard deviation of a population and sample s X 2 N s X X 2 n 1 The difference between the two formulas aside from the notation for the mean is the denominator of the formula The sample formula divides by n 1 which adjusts for the tendency for samples to underestimate the variability of populations and the population formula divides by N The numerator of both formulas sums the squared deviations of each score from the mean Conceptually the formula provides the average deviation of scores from the mean The numerator of the formula is also known as sums of squares SS So standard deviation can also be expressed as SS N and s SS n 1 The square root function puts the standard deviation in the same unit of measurement as the scores in the distribution Variance is simply the standard deviation squared 2 SS N s2 SS n 1 Z SCORES A Z transformation enables comparisons of scores from different scales by transforming the scores to a common metric z X In particular the mean of the distribution is subtracted from the raw score X and this difference is divided by the standard deviation of the distribution This transformed z score is now in standard deviation units That is the Z score indicates how many standard deviations the raw score deviates from the mean For example a Z score of 5 indicates that the corresponding raw score is half a standard deviation below the mean For example


View Full Document

UT Knoxville STAT 201 - 1) basic_stats_review

Documents in this Course
Chapter 8

Chapter 8

43 pages

Chapter 7

Chapter 7

30 pages

Chapter 6

Chapter 6

43 pages

Chapter 5

Chapter 5

23 pages

Chapter 3

Chapter 3

34 pages

Chapter 2

Chapter 2

18 pages

Chapter 1

Chapter 1

11 pages

Load more
Download 1) basic_stats_review
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view 1) basic_stats_review and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view 1) basic_stats_review and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?