Lecture 03 - Haplotype Frequencies And Linkage Disequilibrium

Transcription

Haplotype Frequencies andLinkage DisequilibriumBiostatistics 666

Last LecturenGenotype FrequenciesAllele FrequenciesPhenotypes and PenetrancesnHardy-Weinberg EquilibriumnnnSimple demonstration

Exercise:NOD2 and Bowel DiseasenLeu1007fsnnnnPenetrancennnFrame shift mutation at position 1007Frequency of about 5%Disrupts geneGenotype / P(Crohn’s G) 0.1%-/ 0.2%Calculate frequency of -/- genotype inpopulation and among patients -/3%

Haplotype or Gamete FrequenciesBbAp ABp AbpAapaBpabpapBpb

Typical Genotype DatannnnTwo alleles for eachindividualObservationCTGChromosome origin foreach allele is unknownMany haplotype pairsmight be compatible withobserved dataWill ignore this complexityfor now GCAMarker1Marker2Marker3Possible StatesCTGGCACCGGTACCAGTGCTAGCG

Linkage EquilibriumnnIn a large random mating populationhaplotype frequencies converge to asimple function of allele frequencies(The reverse is always true!)

Linkage Equilibriump AB p A pBp Ab p A pb p A - pBpaB pa pB - p A pBpab pa pb - pA- pB

A new mutation Before MutationAGCGAfter MutationAGCGCCMutation

For a new mutation nOne haplotype frequency is zeronLinkage disequilibrium does not holdnnIn contrast, linkage disequilibriumHow is linkage equilibrium reached?

RecombinationABefore RecombinationGCGCCAAfter RecombinationGCGCCAGRecombinant Haplotype

Equilibrium or Disequilibrium?nnWe will present simple argument forwhy linkage equilibrium holds Balance of factorsnnnnGenetic drift (a function of population size)Random matingDistance between markers

Linkage Disequilibriump AB ¹ p A pBp Ab ¹ p A pb p A - pBpaB ¹ pa pB - p A pBpab ¹ pa pb - pA- pB

Disequilibrium Coefficient DABDAB p AB - p A pBp AB p A pB DABp Ab p A pb - DABpaB pa pB - DABpab pa pb DAB

DAB is hard to interpretnSign is arbitrary nnWe could set A, B to be the common alleleand a, b to be the rare alleleThe range of DAB depends on allelefrequenciesnHard to compare between markers

Range of DABnMust be greater thannnMust be smaller thannnMax(-pApB,-papb)Min(pA - pApB, pB - pApB)These constraints ensure that nohaplotypes have negative frequencies

Alternative MeasuresnThe most popular measuresnnnD’ and D’ D²Other common measuresnnChi-squaredP-value

D²2DABD² p A (1 - p A ) pB (1 - pB )c² 2nnRanges between 0 and 1nnn1 when the two markers provide identical information0 when they are in perfect equilibriumExpected value is 1/2n

D’DABìïï max( p p , p p )A Ba bD ' AB íDABïïî min( p A pb , pa pB )nDAB 0DAB 0Ranges between –1 and 1nnMore likely to take extreme values when allelefrequencies are small 1 implies at least one of the observed haplotypeswas not observed

Raw D’ data from Chr221.000.80D'0.600.400.200.000200400600Physical Distance (kb)8001000

Raw D² data from Chr221.000.80r20.600.400.200.000200400600Physical Distance (kb)8001000

Why Equilibrium is Reached nnEventually, random mating andrecombination should ensure thatmutations spread from originalhaplotype to all haplotypes in thepopulation Simple argument:nAssume fixed allele frequencies over time

Recombination FrequencynRecombination frequency 0.50nnFor loci on the same chromosomeObserved recombination refers to an oddnumber of crossovers

Recombination Rate (q)nnnProbability of an odd number of crossoversbetween two lociProportion of time alleles from two differentgrand-parents occur in the same gameteIncreases with physical (base-pair) distance,but rate of increase varies across genome

Without RecombinationAaBp A pB DABbp A pb - DABpa pB - DABpBpa pb DABpbpApaHaplotype Frequencies Remain Stable Over TimeP 1- q

With RecombinationAaBp A pBbp A pbp A pbpBpa pbpbpApaHaplotype Frequencies Are Function of Allele FrequenciesP q

Overall ChangeAaBp A pB (1 - q ) DABbp A pb - (1 - q ) DABp A pb - (1 - q ) DABpBpa pb (1 - q ) DABpbDisequilibrium Decreases pApa

PredictionsnDisequilibrium will decay eachgenerationnnAfter t generations nnIn a large populationDtAB (1-q)D0ABA better model should allow for changesin allele frequencies over time

ulation APopulation B

StratificationnDue to non-random matingnnAllele frequencies drift apart in each groupnnEg. Mating based on proximity or cultureEg. Allele frequency differences at many genesbetween African-Americans and CaucasiansDisease prevalences may also differnEg. Glaucoma has prevalence of 2% in elderlyCaucasians, but 8% in African-Americans

Microsoft PowerPoint - Lecture 03 - Haplotype Frequencies and Linkage Disequilibrium Created Date: 1/17/2003 2:55:09 PM .