QTL detection using multiple regression on average transmission probabilities.

Livestock Library/Manakin Repository

Show simple item record

dc.contributor Henshall, JM
dc.contributor Goddard, ME
dc.date.accessioned 2012-01-25T11:48:35Z
dc.date.available 2012-01-25T11:48:35Z
dc.date.issued 1999
dc.identifier.citation Proc. Adv. Anim. Breed.Gen. (1999) 13:
dc.identifier.uri http://livestocklibrary.com.au/handle/1234/5395
dc.description.abstract Proc. Assoc. Advmt. Anim. Breed. Genet. Vol13 QTL DETECTION USING MULTIPLE REGRESSION PROBABILITIES ON AVERAGE TRANSMISSION . J. M. Henshall' and M. E. Goddard' ' Animal Genetics and Breeding Unit*, University of New England, Armidale, NSW 235 1 * Institute of Land and Food Resources, University of Melbourne, Parkvilie, Vic 3052 SUMMARY A method of applying multiple regression to marker data is described. Phenotype is regressed on QTL transmission probability for regions of the genome. The QTL transmission probabilities combine the information from multiple linked markers, and are likely to be more uniform in the proportion of informative offspring than single markers. Keywords: QTL, multiple regression INTRODUCTION Experiments to map quantitative trait loci (QTL) commonly compare the genotype at many marker loci with a smaller number of traits. QTL are mapped to positions between markers using maximum likelihood (Lander and Botstein 1989) or regression (Haley and Knott 1992) methods. Not uncommonly there are chromosomal regions in which QTL are suspected to map and the aim of the experiment is to confirm the existence of the QTL and their segregation in the current families. Such experiments use a number of markers from the relevant region of the chromosome but, unless the number of animals used is very large, they have little power to estimate the position of the QTL amongst the markers. Analysis methods which incorporate the information from multiple markers to estimate QTL effect and significance, without inferring location relative to the markers, may be ideal for such experiments. A multiple regression of phenotype on marker genotype has been considered for QTL, and has some advantages over single marker or marker bracket methods. The method is quick, multiple unlinked QTL are accounted for, problems with multiple testing are reduced, and computer software is readily available. However, a major disadvantage with the multiple regression approach to mapping QTL is the requirement that the independent variable, marker genotype, be available for all records included in the analysis. For marker genotype to be available for all markers for all animals, a marker must be fully informative for all animals. While this is theoretically possible for designed experiments, it does impose severe restrictions on which markers can be used. One method of avoiding the requirement that all animals are fully informative for all markers is to regress on a function of marker genotype, such as the probability of a particular marker allele being inherited. However, markers are unlikely to be equally informative, so markers may be identified as significant or not significant based on the proportion of informative animals. * AGBU is a joint institute of NSW Agriculture and The University of New England 219 Proc. Assoc. Advmt. Anim. Breed. Genet. Vol13 In this article, a method of combining information from a number of linked markers is described, to produce a QTL transmission probability (QTP). The multiple regression of phenotype on QTP is proposed as a method that retains the advantages of multiple regression, while minimising the disadvantages caused by uninformative markers. METHOD With interval mapping, for each animal, the observed marker transmission, assumed marker and QTL locations and the mapping function determine a QTL transmission probability at points (loci) on the chromosome. An alternative approach is to use a single QTL transmission probability for a region of the chromosome containing a number of linked markers. Such a QTP is easily obtained by integrating the mapping function over the region, and dividing by the map length. With Haldane's mapping function (Haldane 1919) this integral is best expressed in a mixture of units of map distance (6) and recombination rate (r). Let a sire have marker genotype AaBb for two linked markers, where A and B are on one chromosome of the homologous pair, and a and b on the other. Offspring can inherit AB, Ab, aB, or ab. If a QTL is located between the markers, then let q be the probability that the QTL allele on the AB chromosome was inherited by an offspring. Then, with Haldane's mapping function, and assuming that all locations within the bracket are equally likely for the QTL, and that r>O, (qlAB)=~+ l 2 2(1(q[ab)=~2 (Ylm=(w)=; r)d l 2(1 -r)d Additiona! probabilities are required for the situation where one of the markers is uninformative. For chromosome regions containing more than two markers, the transmission probabilities for marker brackets can be averaged, weighted by the map lengths of the brackets. Given a QTP for a region, maximum likelihood or regression approaches can be used to estimate QTL effects and significance levels. If markers are available for multiple regions, on more than one chromosome, then all markers can be combined in a single analysis by performing multiple regression on the QTP. As each QTP incorporates information from multiple markers, missing genotypes and unbalanced marker information should be less of a problem than with ordinary multiple regression. Test Data To demonstrate the method, a single sire, halfsib QTL detection experiment was simulated. 60 markers on 20 chromosomes were simulated for 100 progeny of the sire. A record was simulated for each halfsib with phenotypic variance of 1.0, of which approximately 20 % was 220 Proc. Assoc. Advmt. Anim. Breed. Genet. Vol13 Each of the chromosomes contained one QTL, due to QTL for which the sire was heterozygous. with the variance due to the QTL on the first chromosome 0.1, the second 0.05, the third 0.025 and so on, down to the tenth chromosome with QTL variance 0.00019. For chromosomes 11 to 20 the QTL variance was zero. All QTL-marker pairs were simulated with a recombination rate of 0.05. A third marker was located with a recombination rate of 0.1 from one of the two flanking markers. All markers were informative half the time, but within markers, alleles were not equally informative. The marker allele linked to the positive (negative) QTL effect was simulated to be informative for 50 % (50 %), 90 % (10 Oh) and 30 % (70 Oh) of offspring. The QTL was located between the first two markers on even numbered chromosomes, and the last two markers on odd numbered chromosomes. Two analysis methods were applied to the simulated data. For both analyses, SAS procedure REG (SAS 1990) was used, with the stepwise option selected. In the first analysis, the phenotype was regressed on QTP, with QTP obtained as described above. Some examples of the QTP are displayed in Table 1. In the second analysis, the phenotype was regressed on marker genotype, with marker genotype coded -1 or +l for the two sire marker alleles, and 0 where the marker was uninformative. From the output from SAS procedure REG, the most significant marker on each chromosome was identified. Both analyses were performed on 200 simulated data sets. Table 1. Examples of observed marker genotype and estimated QTL transmission probability (QTP). For each animal, the genotype at three linked markers A, B, and C is shown with a dash where the marker is uninformative Animal Markers 1 2 3 4 5 6 8 -__ C QTP 0.5000 0.9033 0.9919 0.05 19 0.0270 0.0967 0.9730 A-C -bab---c -BC RESULTS AND DISCUSSION A summary of the results obtained appears in Table 2. When a nominal ~~0.05 significance level was used in the regression on markers, apparently significant (ie false positive) QTL were detected on chromosomes 11 ~ 20 in much more the 5 % of the simulations. When a nominal p<O.O05 significance level was used the actual type 1 error rate was close to 5 % (on a chromosome-wide basis). In the regression on QTP, use of a nominal ~~0.05 significance level resulted in approximately 5 % false positive QTL detected on chromosomes I l-20. When using the more stringent significance level in the marker regression, the power to detect QTL on chromosomes 1-4 was less than the power of the regression on QTP. Neither method had a high power to detect the small QTL on chromosomes 5-10. The estimated size of the QTL effect, in the replicates in which a significant effect was observed, overestimated the true effect, especially in the method using regression on the markers. 221 Proc. Assoc. Advmt. Anim. Breed. Genet, Vol13 From these results it appears that a stepwise multiple regression of phenotype on QTP may be useful in the detection of QTL in livestock. A logical extension to the method is to perform interval mapping on one region, using the results from the multiple regression to adjust for QTL effects in other regions. The method would then be similar to those of Zeng (1994) and Jansen (1994), but should be less afficted by unequal proportions of informative animals for each marker. Table 2. Results from stepwise regression analysis of 60 markers on 20 chromosomes. Phenotype was regressed on QTP, and on single markers with the most significant marker on each chromosome stored (Markers). Shown are the percenrtage of replieates that were significant at the 0.05 or 0.005 level, and the mean estimated effect (a) for significant replicates Chr0m QTP %sig(O.O5) a 96.0 0.469 78.5 0.363 49.5 0.327 27.5 0.318 16.5 0.294 17.5 0.249 11.5 0.182 8.5 0.155 6.5 0.107 7.0 0.234 6.0 0.097 3.5 0.048 7.0 -0.006 6.5 -0.112 5.5 -0.089 6.0 -0.135 8.0 0.004 8.0 -0.038 5.5 0.039 7.5 0.017 lwarkexs a 0.569 0.460 0.435 0.328 0.317 0.273 0.099 0.064 0.068 0.061 0.065 0.026 0.015 -0.027 -0.020 -0.028 0.009 -0.007 0.034 -0.058 -0some Effect 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 0.4472 0.3162 0.2236 0.1581 0.1118 0.0791 0.0559 0.0395 0.0280 0.0198 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 %sig(O.O5) 95.0 84.5 65.5 46.5 36.0 38.5 38.5 34.0 33.0 31.0 24.5 18.0 33.5 32.5 30.0 25.0 28.0 28.5 26.5 30.5 o/psig(O.OOS) a 71.5 0.638 0.525 52.0 0.502 36.5 0.382 22.5 17.5 0.422 12.5 0.300 14.0 0.104 0.084 16.0 13.0 0.153 0.04 1 10.5 8.0 0.05 1 0.084 8.5 -0.022 16.5 -0.148 10.0 12.0 -0.093 -0.039 11.0 7.0 0.066 0.033 12.5 8.0 0.315 14.5 0.001 REFERENCES Haldane, J. B. S. (1919) J. Genet. 8:299 Haley, C. S. and Knott, S. A. (1992) Hereh'ty 69:3 15 Jansen, R. C. (1994) Genetics 138:871 Lander, E. S. and Botstein, D. (1989) Genetics 121: 185 SAS (1990) 'SAS/STAT User's Guide, Version 6', 4' ed. SAS Institute, Cary Zeng, Z. (1994) Genetics 136: 1457 222
dc.publisher AAABG
dc.source.uri http://www.aaabg.org/livestocklibrary/1999/AB99048.pdf
dc.title QTL detection using multiple regression on average transmission probabilities.
dc.type Research
dc.identifier.volume 13


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Livestock Library


Advanced Search

Browse

My Account