Unformatted text preview:

HRP 223 Data Management and Statistical Programming file:///c:/Documents%20and%20Settings/Brent/My%20Documents/Stanf...1 of 13 1/12/2006 4:13 PMHRP 223 - Data Management and Statistical Programming - 2005/2006 EditionIf you would like to see the last teaching please use the gold links at the bottom of this page.The goal of the course is to provide hands on instruction in data management and analysis techniques. Topics discussed include:Working with large databases - what makes a good database turn bad1.Data cleaning techniques2.Generating numerical and graphical presentations3.Descriptive statistics4.Inferential statistics including, but not limited to, analyses of contingency tables, logistic regression andsurvival analysis5.Contact information Professor Teaching Assistant(s)Raymond R. Balise Redwood Bldg. T115, MC 5092 Stanford, California 94305-5405 balise at stanford Voice (650) 724-2602 Fax (650) 725-6951Rebecca (aka Becca) Newton Thompsonrsnewton at post dot harvardPrerequisitesAdmission to Health Research and Policy and a comfortable knowledge of a Windows 2000/XP or UNIX.LecturesMonday and Wednesday 3:00-4:30 from September 26th through December 7st in room M108 in the medicalschool.Office HoursBy appointment in Redwood T226.NewsgroupsHRP 223 Data Management and Statistical Programming file:///c:/Documents%20and%20Settings/Brent/My%20Documents/Stanf...2 of 13 1/12/2006 4:13 PMIf you would like to ask a question or help others please visit the course newsgroup which is named: su.class.hrp223 If you do not know how to subscribe to a newsgroup click here.ReadingsThe Little SAS Book Third Edition and Walker's Common Statistical Methods for Clinical Research SecondEdition. The third edition (blue) Little SAS Book (TLSB) at SAS: http://www.sas.com/apps/pubscat/bookdetails.jsp?catid=1&pc=59216 Walkers book at SAS: http://www.sas.com/apps/pubscat/bookdetails.jsp?catid=1&pc=58086GradingGrades will be based on five homework problem sets. If you take the course for 3 units you must pass atleast four of the five homework assignments and you must not violate the virus policy below. If you take thecourse for 4 units you must pass all five assignments. There will be a couple of quick assignments that willnot directly affect grades.Turning in Homework and VirusesAll assignments will be submitted via email to balise at stanford dot edu. Any student that sends me a virus(or any other malicious code) will fail the course. There will be no exceptions made. Therefore, you arestrongly advised to download the latest version of the Norton Anti-Virus definitions and check your files priorto submitting each and every homework. If you have any questions about how to update your virusdefinitions, ask!Late policyEach of the assignments will be due at the beginning of class on the day specified.That said, there are unforeseen emergencies (illness, bike accidents, disk crashes, network troubles,childbirth, etc.). Instead of having to ask for special allowances on an individual basis, I give each of you theprivilege of granting yourself a small extension in case of crisis. You will have two “late days” which youmay use to extend the due dates of any assignments without penalty. To avoid any ambiguity, there are sevendays in a week and each day ends at 5:00 PM. Thus, if your assignment was due on Wednesday but turned inthe following Monday before 5:00, that assignment would be five days late. After the grace period is up eachassignment is down weighted 20% per day. In all cases, assignments will not be accepted more than onecalendar week after the original assignment due date.Computer PlatformsThe programs that you turn in must run on Windows SAS 9.1.3. I can provide good support for Windows,fair support for UNIX and no support for Mac. For returning students, material in orange is totallyHRP 223 Data Management and Statistical Programming file:///c:/Documents%20and%20Settings/Brent/My%20Documents/Stanf...3 of 13 1/12/2006 4:13 PMnew or was not covered in 2004/2005. Topics listed with a letter and number are optional. Core Lecture Material0 - Computing at Stanford and introduction to SASadministriviasoftware useful at Stanfordsecuritysoftware for data management analysisSAS the calculatorPowerpoint slides are here. Some notes on using UNIX/Cygwin are here. Assignment 0 is here. 1 - How data analysis works & how SAS worksgetting data into SASlibraries - libnamevariableshow the data step workssubsetting into new data setsdecision operatorssubmitting codebugscommon tasks and procedurescomments on commentswhen things go wrongPowerpoint slides are here. Example Code: Teletubbies2.sas is here. TLSB 1.1-1.4, paragraph 1 1.5, 1.6-1.9, 1.11 SAS procedure syntax keyboard macros for 2005 are here. Assignment 1 is here.2 - Data step programmingmore about libraries - proc datasets & proc contentsimporting - proc importimporting excel - TypeGuessRows, libname blah excel 'C:\blah.xls' mixed=yes, proc import with mixed= YESimporting and exporting the hard way - infile DSD DLMHRP 223 Data Management and Statistical Programming file:///c:/Documents%20and%20Settings/Brent/My%20Documents/Stanf...4 of 13 1/12/2006 4:13 PMdealing with tabs - DLM=‘09’Xcreating new data setsthe PDVworking with variables - if-else select-when-otherwiseproblem values - put blah=, put "ERROR-" blah=, put "WARNING:" blah=, put "NOTE:" blah=,putloglength of variables - lengthusing functionsuseful functions by category - cat, catt, cats, catxdata line and set line options - where keep drop firstobs obsworking with substrings - index, like, =:, =*formats - proc format value library= fmtlib, options fmtsearch = (work blah theLib)format ranges - low high <-<combined text and number format - proc format; picture blahdealing with missing formats - options nofmterrPowerpoint slides are here. Code from lecture is here. and more code is here. Sample Excel file FixedWeights.xls is here and RawWeights.xls is here. Example program of using the data step is here. Assignment 2 is here. Assignment 3 is hereSolution to Assignments 2 and 3 is here. TLSB 2.1-2.2, 2.3-2.8 read but don't stress on the details, 2.9-2.11, 2.17-2.18, 2.20-2.21, 3.1-3.4, 3.6, 3.11,6.9, 6.11, 6.12, 6.14Homework 1 is here and my solution is here.2b - SAS / Access (a few notes)log into a database - user= password=dealing with variables and tables names that are invalid in SAS - perserve_col_name preserve_tab_namePowerpoint slides are here.2c - Writing text


View Full Document

Stanford HRP 223 - Syllabus

Download Syllabus
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Syllabus and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Syllabus 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?