Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

The analysis of patterns of segregating (i.e. polymorphic) sites in aligned sequences is routine in population genetics. Quantities of interest include the total number of segregating sites and the number of sites with mutations of different frequencies, the so-called site frequency spectrum. For neutrally evolving sequences, some classical results are available, including the expected value and variance of the spectrum in the Kingman coalescent model without recombination as calculated by Fu (1995). In this work, we use similar techniques to compute the third moments of the frequencies of three linked sites. Based on these results, we derive analytical results for the bias of Tajima's D and other neutrality tests. As a corollary, we obtain the second moments of the frequencies of two linked mutations conditional on the presence of a third mutation with a certain frequency. These moments can be used for the normalisation of new neutrality tests relying on these spectra.

Original publication

DOI

10.1016/j.tpb.2017.12.002

Type

Journal

Theoretical population biology

Publication Date

03/2018

Volume

120

Pages

16 - 28

Addresses

Institut für Genetik, Universität zu Köln, 50674 Köln, Germany. Electronic address: alexander.klassmann@uni-koeln.de.

Keywords

Animals, Humans, Genetics, Population, Mutation, Polymorphism, Single Nucleotide, Models, Genetic, Computer Simulation