CSUN PSY 427 - Reliability - D1891734

Home> Schools> California State University, Northridge> Psychology (PSY) > PSY 427> Reliability

DOC PREVIEW

CSUN PSY 427 - Reliability

School name California State University, Northridge

Course Psy 427- Intro Psy Testing

Pages 53

This preview shows page 1-2-3-4-24-25-26-50-51-52-53 out of 53 pages.

Save

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Premium Document

Do you want full access? Go Premium and unlock all 53 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Unformatted text preview:

Classical Test Theory and ReliabilityBasics of Classical Test TheoryClassical Test TheorySlide 4Slide 5Slide 6Slide 7Slide 8True ScoresDomain Sampling TheorySlide 11Slide 12Slide 13Classical Test Theory ReliabilityCTT: Reliability IndexSlide 16CTT: Test-Retest ReliabilitySlide 18Slide 19Slide 20Slide 21CTT: Parallel Forms ReliabilitySlide 23Slide 24CTT: Split Half ReliabilitySlide 26Spearman Brown FormulaSlide 28Slide 29Slide 30Detour 1: Variance Sum LawSlide 32Slide 33Slide 34CTT: Internal Consistency ReliabilitySlide 36Slide 37Slide 38Slide 39Slide 40Slide 41Detour 2: Dichotomous ItemsSlide 43CTT: Reliability of ObservationsSlide 45Slide 46Standard Error of MeasurementSlide 48CTT: The Prophecy FormulaSlide 50CTT: AttenuationSlide 52Slide 53Cal State NorthridgePsy 427Andrew Ainsworth, PhDBasics of Classical Test TheoryTheory and AssumptionsTypes of ReliabilityExampleClassical Test TheoryClassical Test Theory (CTT) – often called the “true score model”Called classic relative to Item Response Theory (IRT) which is a more modern approachCTT describes a set of psychometric procedures used to test items and scales reliability, difficulty, discrimination, etc.Classical Test TheoryCTT analyses are the easiest and most widely used form of analyses. The statistics can be computed by readily available statistical packages (or even by hand)CTT Analyses are performed on the test as a whole rather than on the item and although item statistics can be generated, they apply only to that group of students on that collection of itemsClassical Test TheoryAssumes that every person has a true score on an item or a scale if we can only measure it directly without errorCTT analyses assumes that a person’s test score is comprised of their “true” score plus some measurement error. This is the common true score modelX T E= +Classical Test TheoryBased on the expected values of each component for each person we can see thatE and X are random variables, t is constantHowever this is theoretical and not done at the individual level.( )( ) ( ) ( ) 0i ii i ii i i i i iX tE X tX t X t t tee e e== -- = - = - =Classical Test TheoryIf we assume that people are randomly selected then t becomes a random variable as well and we get:Therefore, in CTT we assume that the error :Is normally distributedUncorrelated with true scoreHas a mean of ZeroX T E= +TX=T+E measWithout s measWi th sTrue ScoresMeasurement error around a T can be large or smallT1T2T3Domain Sampling TheoryAnother Central Component of CTTAnother way of thinking about populations and samplesDomain - Population or universe of all possible items measuring a single concept or trait (theoretically infinite)Test – a sample of items from that universeDomain Sampling TheoryA person’s true score would be obtained by having them respond to all items in the “universe” of itemsWe only see responses to the sample of items on the testSo, reliability is the proportion of variance in the “universe” explained by the test varianceDomain Sampling TheoryA universe is made up of a (possibly infinitely) large number of itemsSo, as tests get longer they represent the domain better, therefore longer tests should have higher reliabilityAlso, if we take multiple random samples from the population we can have a distribution of sample scores that represent the populationDomain Sampling TheoryEach random sample from the universe would be “randomly parallel” to each otherUnbiased estimate of reliability = correlation between test and true score = average correlation between the test and all other randomly parallel tests1 1t jr r=1tr1 jrClassical Test Theory ReliabilityReliability is theoretically the correlation between a test-score and the true score, squaredEssentially the proportion of X that is TThis can’t be measured directly so we use other methods to estimate2 222 2 2T TXTX T Es srs s s= =+CTT: Reliability IndexReliability can be viewed as a measure of consistency or how well as test “holds together”Reliability is measured on a scale of 0-1. The greater the number the higher the reliability.CTT: Reliability IndexThe approach to estimating reliability depends on Estimation of “true” scoreSource of measurement errorTypes of reliabilityTest-retestParallel FormsSplit-halfInternal ConsistencyCTT: Test-Retest ReliabilityEvaluates the error associated with administering a test at two different times.Time Sampling ErrorHow-To:Give test at Time 1Give SAME TEST at Time 2Calculate r for the two scores• Easy to do; one test does it all.CTT: Test-Retest ReliabilityAssume 2 administrations X1 and X2The correlation between the 2 administrations is the reliability1 2( ) ( )i iX Xe e=1 22 2i iE Es s=1 21 21 222X XTX X XTX X Xssr rs s s\ = = =CTT: Test-Retest ReliabilitySources of errorrandom fluctuations in performanceuncontrolled testing conditions○extreme changes in weather○sudden noises / chronic noise○other distractionsinternal factors○illness, fatigue, emotional strain, worry ○recent experiencesCTT: Test-Retest ReliabilityGenerally used to evaluate constant traits.Intelligence, personalityNot appropriate for qualities that change rapidly over time.Mood, hungerProblem: Carryover EffectsExposure to the test at time #1 influences scores on the test at time #2Only a problem when the effects are random.If everybody goes up 5pts, you still have the same variabilityCTT: Test-Retest ReliabilityPractice effectsType of carryover effectSome skills improve with practice○Manual dexterity, ingenuity or creativityPractice effects may not benefit everybody in the same way.Carryover & Practice effects more of a problem with short inter-test intervals (ITI).But, longer ITI’s have other problemsdevelopmental change, maturation, exposure to historical eventsCTT: Parallel Forms ReliabilityEvaluates the error associated with selecting a particular set of items.Item Sampling ErrorHow To:Develop a large pool of items (i.e. Domain) of varying difficulty.Choose equal distributions of difficult / easy items to produce multiple forms of the same test.Give both forms close in time.Calculate r for the two administrations.CTT: Parallel Forms ReliabilityAlso Known As:Alternative Forms or Equivalent FormsCan

View Full Document