QTL mapping using selective DNA pooling data.

Livestock Library/Manakin Repository

Show simple item record

dc.contributor Dekkers, JCM
dc.contributor Wang, J
dc.contributor Fernando, R
dc.date.accessioned 2012-01-25T11:48:34Z
dc.date.available 2012-01-25T11:48:34Z
dc.date.issued 1999
dc.identifier.citation Proc. Adv. Anim. Breed.Gen. (1999) 13:
dc.identifier.uri http://livestocklibrary.com.au/handle/1234/5378
dc.description.abstract Proc. Assoc. Advmt. Anim. Breed. Genet. Vol13 QTL MAPPING USING SELECTIVE DNA POOLING DATA J. C. M. Dekkers, J. Wang, and R. Fernando Department of Animal Science, Iowa State University, Ames, IA, 500 11, USA SUMMARY A method to map QTL based on selective DNA pooling in half-sib family designs was developed. QTL position was estimated from marker frequency differences between progeny with high and low phenotypes and did not depend on differences in phenotypic means. The QTL effect was estimated by relating differences in phenotypic means to differences in QTL frequencies, which were estimated from QTL position and marker frequencies. Methods were first developed for a single family with two markers and then extended to multiple families and multiple markers. Simulation of a single half-sib family of 2,000 individuals with two markers showed that close to unbiased results were obtained with high power. Biases increased when measurement errors on marker frequencies increased and the QTL effect was small. Biases were minimal when multiple families and multiple markers were used. Keywords: QTL mapping, genetic markers, selective DNA pooling INTRODUCTION Darvasi and Soller (1992) developed methods to identify markers linked to QTL based on differences in marker allele frequencies between individuals with high and low phenotypes. Darvasi and Soller (1994) showed that DNA pooling can be used to determine the required marker frequencies. Selective DNA pooling allows for a significant reduction in genotypings and was recently used to detect markers associated with milk protein content in dairy cattle by Lipkin et al (1998). Methods of Darvasi and Soller (1992, 1994) are, however, based on single marker analyses and do not allow separation of QTL position and effect. The objective of this study was, therefore, to develop methods to map QTL using selective DNA pooling data on flanking markers. Methods are initially derived for a single half-sib family with two markers and then extended to multiple families and markers. MATERIALS AND METHODS Single half-sib family. Consider an additive trait affected by a QTL and polygenes. The QTL is bracketed by two markers, M and N, with recombination rates r,, r,, and 8 between M-Q, N-Q, and M-N. Fractions s of half-sib progeny with the highest and lowest phenotypes are identified. The common sire is heterozygous for all loci (MQN/mqn) and marker haplotypes are known. Let fiM, $,, filM,and f,,, be the observed frequencies of M and N in the lower and upper tails, respectively, and PW P,,, PIIW and P,, the frequencies of progeny that received M and N from the sire. The latter can be obtained by adjusting observed marker frequencies by estimates of frequencies among dams (h): P1,\1 2f,`V- h,. = Let p,,, denote the (unobserved) upper tail frequency of progeny that received Q from the sire. Given p,,, expected frequencies of progeny in the upper tail that received M and N from the sire are: E(P,,,)= ( 1-r,b,,o + r, P,,, = r, + (1 -+I P,,~ E(P,,)= (1 -rz)Pl,u+ r2 P,,, = r2 + (1-2rJ P,,, 223 [`I PI Proc. Assoc. Advmt. Anim. Breed. Genet. Vol13 Similar equations can be derived for the lower tail by replacing U with L. Expected differences in paternal marker allele frequencies between the upper and lower tail are: P'M_ z (1-2r,)(pUQ-PJ = ( 1:2rJo(p,,- 1) P,, p, and, assuming no interference, pUN- = (1-28)(p,dp,J/(l-2r,) = (l-2e)(2p,dl)/(l-2r,) Solving equations [3] and [4] for r, and requiring r, to be less than _ results in: r, = --- V(l -2e)(p,,- p,)/(P,,P,) [31 [4] [51 Note that a solution to [5] does not exist if (p,,-p,,) and (p,,-p,,) differ in signs. Estimates of puQand pzQare obtained by substituting [5] in [3] and solving for puy: P,, = _+_(P,,-P,,)/ Cl-2rJ = _+-V( l-2e)(p,,- P,,)( pUNpLN) or puo = _-_(P~~-PJ (I -2rJ = _--V( I -2exp,,- P,,)( P,,- P,,) [oal Wbl Once estimates of QTL frequencies are obtained, the QTL substitution effect _ can be estimated by relating QTL frequency differences to differences in trait means using methods developed by Darvasi and Soller (1994) for marker-associations. Let _-L,_', and _ denote the mean phenotype of progeny in the lower tail, the upper tail, and all progeny, respectively. Considering progeny in the upper and lower tail, the mean phenotype of progeny that received allele Q from the sire, deviated from _, is: [71 PUL,P [P,&'-P) + P&W)l~@u*+ Pw> = Similarly, the mean phenotype of progeny that received allele q from the sire is: [gl CL = Kl-P,)(P,-ci) + (l-P;oxcL~-cL)1~(2-Pu9-P~~) vL,q Correcting for differences inmean phenotype due to factors other than the QTL under study based on selection intensity i,, a can be estimated as (Darvasi and Soller 1994): a = (p,,,-p,,,)/i,* [9] Multiple markers and mnftlple half-sib families. Methods described above can be extended to multiple markers and multiple families. Using [ 11, the observed frequency for marker k in tail j from sire i can be represented by the .following mixed model: pijk=rk + (I-2&)P, `+ &j& qjk + [lOI where aljk is a binomial sampling effect and, eijkis a random measurement error in marker frequencies from DNA pooling. The model for a vector of frequencies on linked markers for sire i in tail j is: I+= r + (1 -2r)Pua + q + erj IllI where Var(e$=Iu,' and Var(uij) is a matrix of covariances between frequencies of marker-QTL recombinants. For a given position of the QTL, Var(n$ is known and has diagonal elements (lr&/n (n is the number of progeny in the sample) and off-diagonals equal to (l-rk)rk(l-2&k~)/n for marker- QTL orders k'-k-QTL, with f&k'the (known) recombination rate between markers k and k', and off-diagonals equal to zero for marker-QTL orders k'-QTL-k. Similar sets of equations can be set up for other tails and families. Ignoring relationships, observations are independent among tails and families. 224 Proc. Assoc. Advmt. Anim. Breed. Genet. Vol I3 For a given position of the QTL, the vector of recombination rates between markers and the QTL, r, is known and, hence, model [ 1 l] can be fitted to observed marker frequencies using Henderson's mixed model equations, with residual variance, ue2, estimated iteratively. Similar to regression interval mapping, the model is fitted with the QTL at positions 1 CM apart on the chromosome. The position with the lowest residual sum of squares gives the estimate of QTL position. Estimates of QTL frequencies p,,, at this position can be used to estimate QTL substitution effects for each sire using [9]. RESULTS Methods were validated by stochastic simulation. A random measurement error was added to simulated marker frequencies. Results for a single family are in Table 1. Only replicates for which the estimate of rl existed and fell between -0.5 and +OS were considered. In addition, the estimate did not always place the QTL inside the marker bracket. In these cases, the QTL was positioned at the nearest marker, which will be referred to as restricting rl to the marker bracket. When the sire's marker alleles were absent among dams and measurement error was zero, estimates of QTL position and effect were nearly unbiased when QTL position was restricted to the bracket. Measurement errors caused a bias in position toward the center and increased the standard deviation of estimates. A substantial percentage of replicates had estimates that were outside the bracket or invalid (Tables l), which increased when power decreased. Measurement errors caused a slight bias in estimates of the QTL effect. When the sire's marker alleles were also present among dams and observed frequencies were adjusted based on an estimate of marker frequencies among dams (true frequencies were used here), biases in estimates of QTL effect and position increased slightly (Table 1). Initial results for mixed model analysis of a design with 7 families and 6 markers are in Table 2 for a QTL that explains 50 % (a=.50crP) or 12.5 % (a=.25o,) of genetic variance. Marker-QTL linkage phase differed by sire. Also, 50 % of sires were homozygous at the QTL and did, therefore, not contribute information. QTL position was estimated with little, bias (Table 2) but the standard deviation of estimates was substantial. Estimates of QTL frequencies were unbiased within QTL genotype. In ongoing research, these estimates will be used to estimate sire QTL substitution effects. DISCUSSION An important result from this study is that the estimate of QTL position from selective genotyping depends entirely on differences in marker frequencies in selected groups (equation [5]) and not on differences in phenotypic means. With a single family and only two markers, the ability to map the QTL is limited, with estimates being invalid or outside the bracket for many replicates. These problems can be overcome with the mixed model analysis with multiple families and markers. Estimates of frequencies of marker alleles from DNA pools are subject- to measurement errors (Khatib et al. 1994). Additional errors are introduced by adjustments for marker frequencies among dams (Lipkin et al. 1998). These errors are expected to be independent between markers and, therefore, their impact can be reduced by considering markers that are external to the flanking markers when mapping a QTL within a given marker bracket. This is capitalized on in the mixed model by modeling covariances between linked markers. 225 Proc. Assoc. Advmt. Anim. Breed. Genet. Voll3 Table 1. Means and standard deviations (in brackets) based on 5,000 replicates of estimates of QTL position and effect from selective DNA pooling data. Measurement error variance=.00278' Measurement error variance=0 Dam tieqr=O r, r, % a (x100) restricted to bracket' outside4 (% invalid5) (x100) 4.7 4.9 8.8 49.3 3.8 5.4 24.0 25.3 ( ( ( ( ( ( ( ( 3.5) 3.0) 0.0) 6.3) 7.8) 4.7) 0.5) 6.0) Dam freq>O Dam freq=O 0=0.2 r,=0.05 a=O.SOa, 4. I ( 6.7) 3.8 ( 9.8) 5.3 ( 4.4) 6.1 ( 5.7) 22.3 ( 0.1) 29.5 ( 0.7) 49.9 ( 7.9) 50.5 (10.0) 3.2 6.7 38.8 26.1 0=0.2 r,=0.05 a=0.25, (12.9) 4.2 (16.0) ( 6.5) 7.9 ( 7.5) ( 4.0) 50.2 ( 9.5) ( 7.6) 26.7 ( 9.9) Dam freqa 3.4 6.3 33.5 51.0 (11.3) ( 6.1) ( 1.4) (11.3) ' 4.7 16.8) 8.3 ( 7.8) 54.9 (13.2) 27.3 (10.9) ri (x100) r, restricted to bracket % outside4 ('% invalid') a (x100) ' Based on Lipkin et al (I 998) for microsatellite markers. r Frequency of the she's marker alleles among dams, with adjustment based on true dam frequencies. ' Estimates of position outside the bracket were test&ted to the nearest marker 4 The percentage of replicates for which the QTL was mapped outside the marker bracket. ' invalid refers to the percentage of replicates for which the estimate of QTL position was invalid. Table 2. Average estimates over 1,000 replicates of QTL position and frequencies based on a mixed model analysis of selective DNA pealing data. QTL effect (a) 0.50 up 0.25 op QTL position (CM) (True position = 46) Average St.dev. 45.67 6.98 45.87 16.10 Difference Qq sires in QTL allele frequencies qQ sires between upper and lower tail QQ or qq sires True 0.44 0.23 Estimate 0.44 0.23 True -0.44 -0.22 Estimate -0.44 -0.23 True 0.00 0.00 Estimate 0.00 0.00 This is in. contrast to QTL mapping based on individual genotypes (e.g. Haley and Knott, 1992) where, assuming no genotyping errors and no interference, external markers provide no additional information to map a QTL in an interval. Including external markers as covariates can, however, reduce the impact of QTL outside the interval (Jansen 1993). QTL mapping based on flanking markers in a half-sib design depends on knowing marker linkage phases in the common parent. With selective DNA pooling, frequencies at linked markers in pools selected based on phenotype give insufficient power to derive sire marker haplotypes; therefore, sire marker haplotypes must be obtained from, other means. Finally, hypothesis testing and power to detect QTL based on selective DNA pooling will be addressed in subsequent research. REFERENCES Darvasi, Dar&, Haley, Jansen, Khatib, Lipkin, A. and Soiler, M. (1992) Z%eor. AppZ. Gent-t. @5:353 A. and Soiler, M. (1994) Generics 138: 1365 C. S. and Knott, S. A. (1992) Heredity 69:3 15 R. C. (1993) Genetics 135:205 H., Darvasi, A., Plotsky,Y. and Soiler, M. (1994) PCR Methods Appl. 4: 13 E., Mosig, M. O., Darvasi, A., Ezra, E., Shalom, A., Friedmann, A. and Soiler, M. (1998) Genetics 149: 1557 226
dc.publisher AAABG
dc.source.uri http://www.aaabg.org/livestocklibrary/1999/AB99049.pdf
dc.title QTL mapping using selective DNA pooling data.
dc.type Research
dc.identifier.volume 13


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Livestock Library


Advanced Search

Browse

My Account