Unformatted text preview:

6C: The Normal Distribution6C: The Normal DistributionThe normal distribution (also called a Gaussian distribution) is a symmetric, bell-shaped distribution with a single peak. Its peak corresponds to the mean, median, and mode of the distribution.Each normal distribution is characterized by two numbers:the mean gives the location of the peak, andthe standard deviation gives the width of the peak.A data set that satisfies the following four criteria is likely to have a nearly normal distribution:1. Most data values are clustered near the mean, giving the distribution a well-defined single peak.2. Data values are spread evenly around the mean, making the distribution symmetric.3. Larger deviations from the mean become increasingly rare, producing the tapering tails of the distribution.4. Individual data values result from a combination of many different factors, such as genetic and environmental factors.Note for the savvy consumer of statistics: The two best-known “measurements” of people’s mental abilities (SAT scores, IQ) satisfy the normal distribution because they are designed (i.e., rigged) to have this characteristic. This rigging will hide any irregularities (e.g., non-unimodality) thatmight indicate flaws in the measurement process.The 68-95-99.7 Rule for a Normal Distribution:* About 68.3% of the data in a normallydistributed data set will fall within1 standard deviation of the mean.* About 95.4% of the data in a normallydistributed data set will fall within2 standard deviations of the mean.* About 99.7% of the data in a normallydistributed data set will fall within3 standard deviations of the mean.Example: The SAT is scored so that its meanis 500 and its standard deviation is 100. What percent of students score 600 or lower?Answer: The mean SAT score is 500. 68% of students score between 500–100=400 and 500+100=600. The remaining 32% score outside of this range.In each of these two groups (the 68% and the 32%), half score above the mean and half score below the mean (by symmetry):200 to 400: 1/2  32% = 16%400 to 500: 1/2  68% = 34%500 to 600: 1/2  68% = 34%600 to 800: 1/2  32% = 16%So (16+34+34)% = 84% score 600 or lower.(More accurately, 84.1%.)Example (continued): What percent of students score 700 or lower?Answer: 95% of students score between 500–2100=300 and 500+2100=700;the other 5% score outside this range.200 to 300: 1/2  5% = 2.5%300 to 500: 1/2  95% = 47.5%500 to 700: 1/2  95% = 47.5%700 to 800: 1/2  5% = 2.5%So (100-2.5)%=97.5% score below 700.(More accurately, 97.7%.)Combining this with the previous result, we see that 97.7%–84.1%=13.6% of students score between 600 and 700.Standard scores (aka “z-scores”):3 st.dev. above mean  z-score = 32 st.dev. above mean  z-score = 21 st.dev. above mean  z-score = 10 st.dev. above mean  z-score = 01 st.dev. below mean  z-score = –12 st.dev. below mean  z-score = –23 st.dev. below mean  z-score = –3The nth percentile of a data set is the smallest value in the set with the property that n% of the data values are less than or equal to it.(Examples: The 50th percentile = the median.The 25th percentile = the first (or lower) quartile.The 75th percentile = the third (or upper) quartile.)A data value that lies between two adjacent percentiles is said to lie in the lower percentile.(Example: The people who score between the 72nd and73rd percentiles are also said to have scored in the 72ndpercentile.)For normally distributed data with mean  and standard deviation : + 2 = the 98th percentile +  = the 84th percentile = the 50th percentile –  = the 16th percentile – 2 = the 2nd percentileFor normally distributed data, we can convert z-scores to percentiles and vice versa:z-score percentile--------------------- –2 2 –1 160 501 842 98Usually we use tables that have more entries and give more precision; see page 408.Example:A space colony is populated by 1,000,000 Martians and 1,000,000 Venusians. Colonists with an IQ of 140 or more get to serve on the Council of Leaders.Only 20% of the Council are from Venus.Is this evidence of discrimination?The Martians have an average IQ of 105;the Venusians have an average IQ of 95.The standard deviation is 16 for both groups.Martians:Mean = 105, standard deviation = 16 IQ pts140 = 35 pts above the mean= 35/16 s.d.’s above the mean= z-score of 35/16 = 2.1875According to Table 6.3, a z-score of 2.2 is at the 98.6th percentile.So 1.4% of the 1,000,000 Martians, i.e. 14,000 Martians, score over 140.Venusians:Mean = 95, standard deviation = 16 IQ pts140 = 45 pts above the mean= 45/16 s.d.’s above the mean= z-score of 45/16 = 2.8125According to Table 6.3, a z-score of 2.8 is at the 99.7th percentile.So 0.3% of the 1,000,000 Venusians, i.e. 3,000 Venusians, score over 140.So (3000)/(14000+3000) = 18% of the Council should be from Venus.So there is no evidence of discrimination, if our model is correct and our assumptions apply (specifically, if intelligence is governed by a normal distribution, and IQ measures intelligence).Mathematical fact: Small differences in the mean lead to great imbalances in the far tails of two normal distributions.Some statisticians use similar arguments to explain the absence of women and minorities in top-paying jobs, top-tier universities, etc.But... how realistic is this model?Real world fact: IQ tests are rigged to be normal. This rigging requires special delicacy to get the tails to work out right. Sothe use of the normal distribution is inappropriate for studying both those with very high IQ’s and those with very low IQ’s.Also: The model assumes that intelligence can be measured by a single number. If thereare different kinds of intelligence, with no clear way to measure any single one of them by a number, and no obvious way to combine those numbers to get a unified measure of intelligence, then it is even less clear that mathematical properties of the normal distribution have much to say about the real-world issue of discrimination and


View Full Document

UW-Madison MATH 141 - The Normal Distribution

Download The Normal Distribution
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view The Normal Distribution and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view The Normal Distribution 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?