CSIRO's Data61, Hobart, Tasmania, Australia.
CSIRO's Oceans and Atmospheres, Hobart, Tasmania, Australia.
PLoS One. 2021 Nov 4;16(11):e0259113. doi: 10.1371/journal.pone.0259113. eCollection 2021.
In population genetics, the amount of information for an analytical task is governed by the number of individuals sampled and the amount of genetic information measured on each of those individuals. In this work, we assessed the numbers of individual yellowfin tuna (Thunnus albacares) and genetic markers required for ocean-basin scale inferences. We assessed this for three distinct data analysis tasks that are often employed: testing for differences between genetic profiles; stock delineation, and; assignment of individuals to stocks. For all analytical tasks, we used real (not simulated) data from four sampling locations that span the tropical Pacific Ocean. Whilst spatially separated, the genetic differences between the sampling sites were not substantial, a maximum of approximately Fst = 0.02, which is quite typical of large pelagic fish. We repeatedly sub-sampled the data, mimicking a new survey, and performed the analyses. False positive rates were also assessed by re-sampling and randomly assigning fish to groups. Varying the sample sizes indicated that some analytical tasks, namely profile testing, required relatively few individuals per sampling location (n ≳ 10) and single nucleotide polymorphisms (SNPs, m ≳ 256). Stock delineation required more individuals per sampling location (n ≳ 25). Assignment of fish to sampling locations required substantially more individuals, more in fact than we had available (n > 50), although this sample size could be reduced to n ≳ 30 when individual fish were assumed to belong to one of the groups sampled. With these results, designers of molecular ecological surveys for yellowfin tuna, and users of information from them, can assess whether the information content is adequate for the required inferential task.
在群体遗传学中,分析任务所需的信息量由采样个体的数量和每个个体所测量的遗传信息量决定。在这项工作中,我们评估了进行海洋盆地规模推断所需的黄鳍金枪鱼(Thunnus albacares)个体数量和遗传标记数量。我们评估了三种常用于分析的不同任务:测试遗传谱之间的差异;种群划分;以及个体归属到种群。对于所有分析任务,我们使用了跨越热带太平洋的四个采样地点的真实(非模拟)数据。虽然这些采样点在空间上是分开的,但遗传差异并不显著,最大约为 Fst = 0.02,这在大型远洋鱼类中是相当典型的。我们反复对数据进行了子采样,模拟了一项新的调查,并进行了分析。通过重新采样和随机分配鱼类到不同的组,我们还评估了错误阳性率。改变样本大小表明,一些分析任务,即谱测试,每个采样点所需的个体数量相对较少(n ≳ 10),单核苷酸多态性(SNPs,m ≳ 256)。种群划分每个采样点需要更多的个体(n ≳ 25)。将鱼类分配到采样地点需要更多的个体,实际上比我们拥有的还要多(n > 50),尽管当假设个体属于采样的一个群体时,可以将样本大小减少到 n ≳ 30。有了这些结果,黄鳍金枪鱼分子生态调查的设计者和信息使用者可以评估信息含量是否足以满足所需的推断任务。