DOC PREVIEW
UCSB ECON 240a - Exploratory Data Analysis and JMP

This preview shows page 1-2 out of 5 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 5 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 5 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 5 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

September 28, 2010 Lecture #2 240A-1 L. PhillipsExploratory Data Analysis and JMPI. Open the JMP program by going to Start, Programs, Statistics, JMP 5.0.1 (select).II. Open the data file students by clicking on the open data table button in the JMP starter window and scrolling over to the file students.jmp in the folder Sample Data.The five columns contain the five variables:Age: an ordinal variableSex: a nominal or categorical variableHeight: a cardinal or numeric variableWeight: a cardinal or numeric variable:Idnum: id number, a nominal variableNote: there are 233 observations or rowsIII. To display ordinal and nominal variables, from the menu bar choose analyze/distributionsIn the distribution dialog box, select the variables age and sex and drag to the y, columns window. Hit the OK button.You can see there are more boys than girls and more twelve year olds than other ages. The graph on the left for the variable age is a histogram, plotting the frequency or number of observations for each age category. The graph on its right is a mosaic bar chart , showing the fraction of observations in each category. BySeptember 28, 2010 Lecture #2 240A-2 L. PhillipsExploratory Data Analysis and JMPhitting the red triangle button to the left of the word age, and choosing histogram options, you can add a count axis to the histogram.IV. To display a numerical variable, click on the data window to make it active and , from the menu bar choose analyze/distributionsIn the distribution dialog box, select the variables height and weight and drag to the y, columns window. Hit the OK button.Use the hand icon and drag to the right on the histogram columns to obtain finer categories of height. You can see that the mode is 62 inches. The maximum height is 72 inches and the minimum height is 51 inches. The graph on the left for the variable height is a histogram, plotting the frequency or number of observations for each height. The graph on its right is an outlier box chart . The ends of the box are the 25th and 75th quantiles (quartiles), 58 and 64, respectively. The difference between these quartiles, 6, is the inter-quartile range, a measure of dispersion. Once again, for height, the 75th quartile is 64, with 25% of the observations lying above this height, and the 25th quartile is 58 with 25% of the observations lying below this height, so the inter-quartile range is 6. The median height is 61 inches, with 50 % 0f the observations above this height. The median is illustrated in the box by a line. The lines on either end of the box are whiskers, and extend to the outermost data points within the distance, for example,September 28, 2010 Lecture #2 240A-3 L. PhillipsExploratory Data Analysis and JMP75th quartile + 1.5* inter-quartile range, i.e. 64 + 1.5*6, or 73. Since the maximumheight is 72 inches, the whisker ends there. Thus there are no outliers, or heights to plot beyond this whisker. The 25th quartile is 58, so the whisker will potentially extend down to 49, but the minimum height is 51 inches, so the whisker ends at 51, and there are no outlier heights below this .The diamond is the called the means diamond. Note the mean or average height is 61.33 inches, above the median of 61. The extent of the diamond is a 95% confidence interval around the mean, i.e. the probabilty of the mean height lying above or below the diamond is only 5%. We will study the calculation of these confidence intervals in the weeks ahead.Note there is an outlier observation for the weight variable, so this may be an individual that requires medical diagnosis. The red bracket in the box plot designates the range of the shortest half of the data, i.e. the 50% of the observations that are most dense, i.e clustered around the central tendency.In the moments list are the mean and standard deviation of the observationvalues, for example for height.V. The Spinning PlotSelect the data window and from the graph menu choose spinning plot. In the dialog box, (use the control key to) select the height, weight, and age variablesand drag to the y, column box. Click OK.September 28, 2010 Lecture #2 240A-4 L. PhillipsExploratory Data Analysis and JMPNote the positive relationship or correlation between weight and height as age increases. Use the hand icon to rotate the three-dimensional data plot. Try using the white background( red triangle to the right of the rotation icons). You can use the lasso icon to select the outlier point and from the data table identify the idnum of this individual.VI. Help MenuThe manuals are available online and provide instructions for using the JMP program. Select help from the menu bar and select contents.VII. Analysis of a Subset of Female StudentsUse the students window and repeat the instructions at the beginning of section III above, i.e. from the menu bar choose analyze/distributions and select age and sex and drag to the y, columns window. Highlight females in the histogram. Note that all of the observations for females are now selected in the data window. From the Tables menu in the bar, select subset. In the dialog box, choose a name such as female subset of students. This data file can then be used to conduct analysis on the height, weight, and age variables, as before, including producing histograms and box plots, as well as a rotating plot, but restricted to females.September 28, 2010 Lecture #2 240A-5 L. PhillipsExploratory Data Analysis and


View Full Document

UCSB ECON 240a - Exploratory Data Analysis and JMP

Documents in this Course
Final

Final

8 pages

power_16

power_16

64 pages

final

final

8 pages

power_16

power_16

64 pages

Power One

Power One

63 pages

midterm

midterm

6 pages

power_16

power_16

39 pages

Lab #9

Lab #9

7 pages

Power 5

Power 5

59 pages

Final

Final

13 pages

Final

Final

11 pages

Midterm

Midterm

8 pages

Movies

Movies

28 pages

power_12

power_12

53 pages

midterm

midterm

4 pages

-problems

-problems

36 pages

lecture_7

lecture_7

10 pages

final

final

5 pages

power_4

power_4

44 pages

power_15

power_15

52 pages

group_5

group_5

21 pages

power_13

power_13

31 pages

power_11

power_11

44 pages

lecture_6

lecture_6

12 pages

power_11

power_11

42 pages

lecture_8

lecture_8

11 pages

midterm

midterm

9 pages

power_17

power_17

13 pages

power_14

power_14

55 pages

Final

Final

13 pages

Power One

Power One

53 pages

Summary

Summary

54 pages

Midterm

Midterm

6 pages

Lab #7

Lab #7

5 pages

powe 14

powe 14

32 pages

Lab #7

Lab #7

5 pages

Midterm

Midterm

8 pages

Power 17

Power 17

13 pages

Midterm

Midterm

6 pages

Lab Five

Lab Five

30 pages

power_16

power_16

64 pages

power_15

power_15

52 pages

Power One

Power One

64 pages

Final

Final

14 pages

Load more
Download Exploratory Data Analysis and JMP
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Exploratory Data Analysis and JMP and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Exploratory Data Analysis and JMP 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?