Linkage and associationSlide 2Slide 3Slide 4Alternative approachSlide 6DZ pairsHow to do this in mx?Slide 9Slide 10Slide 11SummaryAssociationSlide 14Allelic AssociationBiometrical modelBasic premise of assoc. for qualitative traitThe equivalent for a quantitative trait - run a regressionPractical – Find a gene for sensation seeking:Slide 20Slide 21Gene X is the gene for sensation seeking!What if there is true association?Calculate:False positives and false negativesHow to avoid spurious association?Fulker (1999) between/within association modelSlide 28Slide 29bij as Family ControlBTW – this is on top of a linkage modelSo…Combined Linkage & association Implemented in QTDT (Abecasis et al., 2000) and Mx (Posthuma et al., 2004)Implementation in Mx link_assoc.mxSticking it together…W = K+L ;Slide 37Slide 38Slide 39Slide 40Sticking it together using partSlide 42Slide 43Slide 44Slide 45Slide 46Linkage and associationSarah MedlandGenotypic similarity between relativesIBS Alleles shared Identical By State “look the same”, may have thesame DNA sequence but they are not necessarily derived from a known common ancestor - focus for associationIBD Alleles shared Identical By Descent are a copy of the same ancestor allele- focus for linkageM1Q1M2Q2M3Q3M3Q4M1Q1M3Q3M1Q1M3Q4M1Q1M2Q2M3Q3M3Q4IBSIBD21In linkage analysis we will be estimating an additional variance component QFor each locus under analysis the coefficient of sharing for this parameter will vary for each pair of siblingsThe coefficient will be the probability that the pair of siblings have both inherited the same alleles from a common ancestor ˆpQ A C EPTwin1E C A QPTwin2MZ=1.0 DZ=0.5MZ & DZ = 1.01 1 1 11 1 1 1q a c ee c a qˆpAlternative approach is a summary statistic ConvenientLoss of information .5 can mean p.ibd0=0 p.ibd1=1 p.ibd2=0 or p.ibd0=0 p.ibd1=.6 p.ibd2=.2Use all the informationˆpˆpAlternative approachModel each of the possible outcomes IBD0 IBD1 IBD2Weight each of the models by the probability that it is the correct modelThe pairwise likelihood is equal to the sum of likelihood for each model multiplied by the probability it is the correct modelThe combined likelihood is equal to the sum of all the pairwise likelihoodsDZ pairsT2QEQ Fq qf eT11 111Ee1Ff1.511m3 m4T2QEQ Fq qf eT11 111Ee1Ff111m5 m6T2QEQ Fq qf eT11 111Ee1Ff1111m1 m2* pIBD2 +* pIBD1 +* pIBD0How to do this in mx?Script link_mix.mxG2 DZ TWINS Data NInput=124 NModel=3 Missing =-99.00 Rectangular File=example3.dat Labels ….Select pheno1 pheno2 z0_20 z1_20 z2_20 age1 sex1 age2 sex2;Definition_variables z0_20 z1_20 z2_20 age1 sex1 age2 sex2;Tells Mx we will be using 3 different means and variance modelsTells Mx that these variables will be used as covariates – the values for these variables will be updated for each case during the optimization – the mxo will show the values for the final caseHow to do this in mx?Begin Matrices; X Lower nvar nvar = X1 Z Lower nvar nvar = Z1 D Lower nvar nvar = D1 B full 3 1 ! will contain IBD probabilities (from Genehunter) def var… Matrix H 0.5Specify B z0_20 z1_20 z2_20 ! put ibd probabilities in BThis script runs an AE model D is the QTL VC path coefficentWe are placing the prob. Of being IBD 0 1 & 2 in the B matrixHow to do this in mx? Begin Algebra; T = X*X'+Z*Z'+D*D' ; ! total variance U = H@X*X' ; ! IBD 0 cov (=non-qtl cov) K = U + H@D*D' ; ! IBD 1 cov W = U + D*D' ; ! IBD 2 cov A = T|U_ U|T_ ! IBD 0 matrix T|K_ K|T_ ! IBD 1 matrix T|W_ W|T ; ! IBD 2 matrixPre-computing the total variance and the covariance for the diff. IBD groupsStacking the pre-computed covariance matricesHow to do this in mx? Means G+O*R'| G+S*R'_G+O*R'| G+S*R'_G+O*R'| G+S*R'; Covariance A ; Weights B ;The means matrix contains corrections for age and sex – it is repeated 3 times and vertically stacked Tells Mx to weight each of the means and var/cov matrices by the IBD prob. which we placed in the B matrixSummaryWeighted likelihood approach more powerful than pi-hatQuickly becomes unfeasible3 models for sibship size 227 models for sibship size 4Q: How many models for sibship size 3?For larger sib-ships/arbitrary pedigrees pi-hat approach is method of choiceAssociationIntroductionAssociationSimplest design possibleCorrelate phenotype with genotypeCandidate genes for specific diseasescommon practice in medicine/geneticsGenome-wide association with millions available SNPs, can search whole genome exhaustivelyAllelic AssociationchromosomeSNPs trait variantGenetic variation yields phenotypic variationMore copies of ‘B’ allele More copies of ‘b’ allele2abb BBBbdmidpointGenotype Genetic ValueBBBbbbad-aVa (QTL) = 2pqa2 (no dominance)Biometrical modelBasic premise of assoc. for qualitative traitChose a phenotype & a candidate gene(s)Collect 2 groups - cases and controlsUnrelated individualsMatched for relevant covariatesGenotype your individuals for your gene(s)Count the % of cases & controls with each genotype Run a chi-square testYi = + Xi + eiwhereYi = trait value for individual iXi = 1 if allele individual i has allele ‘A’0 otherwisei.e., test of mean differences between ‘A’ and ‘not-A’ individuals00.20.40.60.811.2XYThe equivalent for a quantitative trait- run a regressionPlay with Association.xlsPractical – Find a gene for sensation seeking:Two populations (A & B) of 100 individuals in which sensation seeking was measuredIn population A, gene X (alleles 1 & 2) does not influence sensation seekingIn population B, gene X (alleles 1 & 2) does not influence sensation seekingMean sensation seeking score of population A is 90Mean sensation seeking score of population B is 110Frequencies of allele 1 & 2 in population A are .1 & .9Frequencies of allele 1 & 2 in population B are .5 & .5Sensation seeking score is the same across genotypes, within each population. Population B scores higher than population A Differences in genotypic frequencies.01 .18 .81 .25 .50 .25Genotypic freq.Suppose we are unaware of these two populations and have measured 200 individuals and typed gene X The mean sensation seeking score of this mixed population is 100What are our observed genotypic frequencies and means? Calculating genotypic frequencies in the mixed populationGenotype 11:1 individual
View Full Document