核心概念
The author argues that current methods for population genetics inference may not accurately account for multiple merger events in genealogy, leading to limitations in understanding past demography and selection. By developing novel approaches like SMβC and GNNcoal, the author aims to improve accuracy in inferring complex demographic scenarios and selection effects.
要約
The content discusses the challenges in inferring past demography and selection from genome data due to multiple merger events in genealogy. The development of two new approaches, SMβC and GNNcoal, is presented as a solution to overcome these challenges. These methods are tested on simulated data under different scenarios to evaluate their accuracy in recovering population size variations and α parameters under the β-coalescent model. Results show that GNNcoal outperforms SMβC in most cases, providing a promising alternative for accurate population genetics inference.
Key points:
- Standard Wright-Fisher model assumptions may not apply to species with skewed offspring distribution or strong selection events.
- Current methods lack accuracy in detecting multiple merger events without accounting for complex demographic scenarios or recombination.
- Two novel approaches, SMβC and GNNcoal, are developed to address these limitations.
- Simulated data tests demonstrate the effectiveness of these methods in inferring past demographic history and selection effects.
- GNNcoal shows superior performance compared to SMβC, especially when analyzing larger sample sizes.
統計
The probability for a parent to have 10 or more offspring is ≈ 10−8 (Kingman coalescent).
Mutation and recombination rate set to 10^-8 per generation per bp.
Population size calculations for the Beta coalescent yield N = 106 individuals.
引用
"We tackle these limitations by developing two novel and different approaches to infer multiple merger events from sequence data or the ancestral recombination graph (ARG): a sequentially Markovian coalescent (SMβC) and a graph neural network (GNNcoal)."
"Our findings stress the aptitude of neural networks to leverage information from the ARG for inference but also the urgent need for more accurate ARG inference approaches."