收藏切换
Development of SNP parentage assignment techniques in the yellowfin seabream Acanthopagrus latus
收藏切换
PDF
Hongbo Zhao1, 2, 3, Liangmin Huang1, 2, Jing Zhang1, 2, Songyuan You1, 2, Qingmin Zeng4, Xiande Liu1, 2, *
Acta Oceanologica Sinica | 2024, 43(2) : 151 - 155
Less
收藏切换
Acta Oceanologica Sinica | 2024, 43(2): 151-155
Marine Biology
Development of SNP parentage assignment techniques in the yellowfin seabream Acanthopagrus latus
Full
Hongbo Zhao1, 2, 3, Liangmin Huang1, 2, Jing Zhang1, 2, Songyuan You1, 2, Qingmin Zeng4, Xiande Liu1, 2, *
Affiliations
  • 1 Fisheries College of Jimei University, Xiamen 361021, China
  • 2 Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Xiamen 361021, China
  • 3 Bureau of Agriculture and Rural Affairs of Wudang District, Guiyang City, Guizhou Province, Guiyang 550018, China
  • 4 Fisheries Research Institute of Fujian, Xiamen 361000, China
Published: 2024-02-25 doi: 10.1007/s13131-023-2221-7
Outline
收藏切换

Acanthopagrus latus is an essential aquaculture species on the south coast of China. However, there is a lack of systematic breeding of A. latus, which considerably limits the sustainable development of A. latus. As a result, genetic improvements are urgently needed to breed new strains of A. latus with rapid growth and strong resistance to disease. During selective breeding, it is necessary to estimate the genetic parameters of the target trait, which in turn depends on an accurate disentangled pedigree for the selective population. Therefore, it is necessary to establish the parentage assignment technique for A. latus. In this study, 95 individuals selected from their parents and their 14 families were used as experimental material. SNPs were developed by genome re-sequencing, and highly polymorphic SNPs were screened on the basis of optimized filtering parameters. A total of 14 392 738 SNPs were discovered and 205 SNPs were selected for parentage assignment using the CERVUS software. In the model where the gender of the parents is known, the assignment success rate is 98.61% for the male parent, 97.22% for the female parent, and 95.83% for the parent pair. In the model where the gender of the parents is unknown, the assignment success rate is 100% for a single parent and 90.28% for the parent pair. The results of this study were expected to serve as a reference for the breeding of new varieties of A.latus.

Acanthopagrus latus  /  parentage assignment  /  SNP  /  Genome re-sequencing
Hongbo Zhao, Liangmin Huang, Jing Zhang, Songyuan You, Qingmin Zeng, Xiande Liu. Development of SNP parentage assignment techniques in the yellowfin seabream Acanthopagrus latus[J]. Acta Oceanologica Sinica, 2024 , 43 (2) : 151 -155 . DOI: 10.1007/s13131-023-2221-7
The yellowfin seabream Acanthopagrus latus is a member of the genus Acanthopagrus in the family Sparidae, in the order Perciformes. Acanthopagrus latus is being cultivated on a large scale in the coastal areas of the Fujian and Guangdong Provinces of China, bringing considerable economic benefits to local villagers every year (Zheng et al., 2023). However, there is a lack of systematic breeding of A. latus, which considerably limits sustainable development of A. latus (Zhu et al., 2020; Zheng et al., 2023).
Selective breeding has been shown to be an effective method of breeding improved varieties. However, Gjedrem et al. (2012) estimated that only 10% of aquaculture animals in the world were based on genetically improved populations. An influential reason is the difficulty of obtaining pedigree information for aquatic animals when formulating breeding programs. For example, newly hatched fry are only a few centimetres in size, making it difficult to perform regular physical tagging. Pedigree information is an essential basis for the formulation of breeding programs, evaluation of population genetic parameters, and breeding value (Vandeputte and Haffray, 2014). However, a complete and accurate record of pedigree information has not been established for most Chinese aquaculture varieties. Incorrect pedigree information will reduce the accuracy of genetic evaluation, affect the progress of breeding original varieties, and cause economic losses to the aquaculture industry (Houston et al., 2020). In view of this, it is of great importance to carry out studies on the analysis of the pedigree of aquatic animals.
In the field of parentage assignment, microsatellites have been the dominant approach for some time. High heterozygosity and abundant polymorphism allow it to assign offspring to the correct parents and establish accurate pedigree relationships. In the early 1990s, microsatellite research entered a golden period of development (Flanagan and Jones, 2019). Scientific researchers have successively developed microsatellite markers for a large number of species, and parentage assignment techniques based on microsatellite markers have gradually matured.
With the development of high-throughput technologies and advances in SNP typing methods, population geneticists and breeders have shifted their goals to more potential SNP markers. Compared to microsatellites, SNP has some advantages, including its suitability for high-throughput genotyping, low genotyping error rate, and the ability to merge and standardize SNP data from different sources (Yue and Xia, 2014). Therefore, more and more aquatic animals estimate pedigree information using SNP markers. For example, in Cyprinus carpio, Xu et al. (2017) developed a 48-SNP array based on the Fluidigm genotyping platform, and successfully assigned 94% of the offspring to the most-likely parental pair, which was consistent with actual pedigree records. To evaluate the highly efficient alternative of the SNP-haplotypes approach for microsatellite loci in inferring parentage, García-Fernández et al. (2018) screened 164 SNPs from gilthead sea bream Sparus aurata using direct sequencing as the final panel and genotyped 58 individuals using iPLEX Gold MassARRAY technology. The results showed that SNP-haplotypes resolved the parentage assignment with an accuracy of more than 99%, resulting in 10.5% and 3.5% better accuracy than the set of 58 SNPs and 9 microsatellite loci, respectively. Zhao et al. (2018) generated 58 450 genome-wide SNPs for Florida bass (Micropterus floridanus) using a cost-effective genotyping-by-sequencing method and a total of 58 SNPs were shown to assign parents to offspring with 100% accuracy, irrespective of sex and with the presence of full-sib relationships. ​In addition, the accuracy and assignment power of the SNP panel are better than those of the ten microsatellites genotyped on the same parent and progeny. In Atlantic salmon (Salmo salar), a total of 86 468 SNPs were identified from RAD sequencing libraries. Following the application of quality control filters and stringent selection criteria, a panel comprising 94 SNPs balanced across the salmon genome were identified, providing 100% assignment accuracy in known pedigrees (Holman et al., 2017). Hauser et al. (2011) evaluated the assignment power of 80 SNPs (H e=0.30, 80 independent alleles) and 11 microsatellites (H e=0.85, 192 independent alleles) in a wild population of about 400 sockeye salmon (Oncorhynchus nerka). The results indicate that assignment success is higher for SNPs than for microsatellites, especially for the parent pair, regardless of the method used. Liu et al. (2016) developed a panel of 95 SNPs for parentage assignment in rainbow trout (Oncorhynchus mykiss), and was used to genotype 499 fish with known pedigrees. After analysis, when all 95 SNPs were used, the parentage assignment matches perfectly with the known pedigree. For the data, no parentage assignment using SNP markers was reported in A. latus.
In this experiment, 95 individuals of A. latus, selected from parents and their families, were used as experimental material. SNP markers with high polymorphisms were developed by genome re-sequencing. The parentage assignment technology of A. latus was established based on SNP markers. The results of this study are expected to serve as a reference for the breeding of new varieties and for the evaluation of the effects of A. latus enhancement and release.
In September 2020, 14 families of A. latus were built at Dajing Farm in Longhai City, Fujian Province, China. 95 individuals selected from their parents and their families were used as experimental material (Table 1). The pedigree of each family was recorded.
Genomic DNA was extracted from the muscle or fin of A. latus using the Genome DNA Extraction Kit (Vazyme, China) according to the manufacturer’s instructions. DNA quality was detected using the Agilent 5400 fragment analysis system.
The fAcaLat1.1 (GenBank assembly access: GCA_904848185.1) was used as the reference genome of A. latus in this study. The total length of the assembly sequence is 685 Mb, including 215 contigs and 14.88 Mb for contig N50. FastQC was used to assess the basic quality of the sequenced reads and to determine whether the primer adapters were contaminated. Burrows-Wheeler-Alignment Tool - maximal exact matches (BWA-MEM) was used to compare the reads of each sample with the reference genome. SAMtools was used to filter the unpaired reads in the output file and to avoid invalid tags in further analysis (Li and Durbin, 2009). To obtain high-quality BAM files, all duplicate reads were labeled with Picard software. The GATK 4.0 genome analysis toolkit (Genome Analysis Toolkit) was used for mutation, detection a VCF file was obtained containing all the information from the mutation site.
To obtain a highly polymorphic SNP, the filter parameters were set as follows. (1) It must be a biallelic site. (2) The max-missing ratio is less than 20%. (3) The minor allele frequency (MAF) is greater than 0.15. (4) The genotype quality is greater than 30. (5) SNPs without polymorphisms in the parent pool are filtered out. (6) SNPs with no polymorphisms in the offspring pool are filtered out. (7) We filter out anomalous loci that do not obey Mendel’s law of heredity. (8) Sites that are not on the 24 chromosomes of A. latus are filtered out. (9) Sites that do not conform to the Hardy-Weinberg equilibrium are filtered out. Additionally, the aforementioned genome was used for the VCF annotation after QC filtering in an off-line mode to explore the SNP distribution.
In order to find the best SNP combination for parentage assignment in A. latus, i.e., the success rate of parentage assignment is more than 90%, and the number of required SNP was as small as possible, 334 polymorphism SNPs were classified from high to low according to the expected heterozygosity (H e), and divided into five groups. The corresponding SNP number of five groups was 46 (H e > 0.4), 87 (H e > 0.35), 146 (H e > 0.3), 205 (H e > 0.25), and 334 (H e > 0.2). Parentage assignment was analyzed using CERVUS 3.0.3 software (Kalinowski et al., 2007). The observed heterozygosity, expected heterozygosity and polymorphism information content of the optimal SNP combination were calculated at 95% confidence. The parameters for parentage assignment were set as follows: 10 000 simulated offspring, 99% candidate parent detection rate, 84.36% mean percentage of loci typed, 10% typing error rate, and confidence levels set to 80% and 95%.
The results of the parentage assignment based on SNP markers from the CERVUS software were compared with real pedigree records to determine the success rate of the parentage assignment.
All qualified samples were sequenced using the dual-terminal 150-bp mode in the Illumina NovaSeq sequencing platform. Altogether, 423.8 Gb of clean data were obtained, and 4.461 Gb of clean data were generated on average for each sample. The Q30 of all samples exceeded 90.25% and the GC content ranged from 42.21% to 43.22% with normal distribution.
The clean data was compared with the reference genome using the BWA software. The size of the reference genome is 685 144 197 bp. The mapping rate ranged from 97.75% to 99.41% for all samples. The average coverage depth of the reference genome (excluding the N region) was 4.24−5.67×, and coverage with 1× (at least one base coverage) is more than 96.25%. The mapping results are normal and can be used for subsequent mutation detection and correlation analysis.
The SNPs were mined from the 95 A. latus resequencing data using the Joint Calling software. Collectively, 14 392 738 SNPs were obtained (Table 2). SNPs are mainly distributed in Intronic, Intergenic, and Others, with 7 161 298 (49.76%), 3 836 766 (26.66%) and 2 040 243 (14.18%), respectively. The remaining SNPs are located in the exonic, splicing, upstream, downstream, and upstream/downstream regions.
​A total of 334 SNPs were obtained after optimizing the filter parameters. Observed heterozygosity, expected heterozygosity, and polymorphism information content were calculated using CERVUS3.0.3. The results showed that, with 95% confidence, the average observed heterozygosity for 334 SNPs was 0.56, the average expected heterozygosity was 0.39, and the polymorphism information content was 0.31. These SNPs can be used in the analysis of parentage assignment.
In the model where the gender of the parent is known, the identification results for the 72 offspring are shown in Fig. 1. When there were 46 SNPs, the identification success rate was over 65% for the male parent, 47.2% for the female parent and 37.5% for the parent pair. As the number of SNPs increased, the identification success rate also increased. When the number of SNPs was 146, the identification success rate was more than 90% for male, female and parent pair. When the number of SNPs reached 205, the identification success rate exceeded 95% for the male parent, female parent and the parent pair. When the number of SNPs was increased to 334, the identification success rate was 100% for all of them.
In the model where the gender of the parents is unknown, only the identification success rates of single and parent pair are calculated. Due to the lack of gender information for male parent and female parent, the candidate parents provided by CERVUS can be either male parent or female parent. The results of the parentage assignment for 46, 87, 146, 205 and 334 SNPs are shown in Fig. 2. The success rate of parentage assignment gradually increased with increasing SNP number at 95% confidence level. When the number of SNPs reached 87, the identification success rate for a single parent was more than 90%, and a minimum of 205 SNPs were required to achieve the same level of success for the parent pair.
The results of the parentage assignment for the five sets of SNPs are shown in Table 3. Both models showed a positive correlation between the identification success rate and the number of SNPs used. The success rate of identification using the same number of SNPs in the model with unknown gender was lower compared to the model with known parent gender, suggesting that the absence of information about the gender of the parents negatively affects the identification of the parents and offspring.
In general, 205 SNP markers were ideal for the identification of parent-offspring of A. latus. On the one hand, the number of SNP required was relatively minor and, on the other hand, the identification success rate was strong enough (more than 90%).
In this study, 205 polymorphic SNP markers were ultimately selected as the optimal combination for the assignment of A. latus parentage. The results of parentage assignment under both models showed that the identification success rate increases with the number of SNP markers, confirming that the number of SNPs can compensate for the deficiency in the polymorphic information content of the SNPs. Under the model where the gender of the parents is known, the difficulty of identifying a parent pair is highest, followed by the female parent and then the male parent. Under the model where the gender of the parents is unknown, the difficulty of identifying a parent pair is also higher than that of a single parent. Compared to two models, the success rate for single parent identification is higher in the model where the gender of the parents is unknown than in the model where the gender of the parents is known. This may be due to the increased probability of matching the correct parent due to the absence of gender differentiation under the model where the gender of the parents is unknown (Slate et al., 2000; Kalinowski et al., 2007).
Based on data simulation, Anderson and Garza (2006) predicted that 60-100 SNPs would be sufficient for parentage assignment. This conjecture has also been confirmed in several species. For example, Hauser et al. (2011) successfully carried out parentage assignment in wild sockeye salmon population using 80 SNPs. In Oncorhynchus mykiss, 95 SNPs can be used to complete parentage assignment (Abadía-Cardoso et al., 2013; Steele et al., 2013). Tokarska et al. (2009) demonstrated that 50-60 SNPs with the highest heterozygosity were sufficient for parent-offspring identification of the bison. In pigs, 60 SNPs or 80 SNPs were able to match the correct male parent from a large number of candidate boars with 100% accuracy (Harlizius et al., 2011). Similarly, Weinman et al. (2015) found that about 60 SNPs with an average heterozygosity of 0.41 could be used for phylo-genetic identification in bird populations. In this study, 205 SNPs were needed to achieve a success rate of identification of more than 95% when the gender of both parents is known.
In general, the success rate of parentage assignment is influenced by numerous aspects, such as the test population, the number of SNP markers, the content of polymorphic information, the accuracy of the genotyping and the number of parents (Marshall et al., 1998; Walling et al., 2010; Vandeputte and Haffray, 2014; Yue and Xia, 2014). In this study, families were constructed using the same batch of fish, for which parent-parent relationships are close and therefore difficult to identify. In addition, many of the families were half-siblings, and it was more difficult to identify the parents in these families than in full-sibling families. For the above two reasons, the number of SNPs used in this study is relatively large.
In this study, genome-wide resequencing was used to develop polymorphic SNP markers, and the parentage assignment technique for A. latus is well established based on the constructed family material. The results showed that using 205 SNPs markers, in the model where the gender of the parents was known, the assignment success rate for the male parent, the female parent and the parent pair was greater than 95%; in the model where the gender of the parents was unknown, the assignment success rate was 100% for single parents and 90.28% for the parent pair.
  • Fujian Province science and technology plan project under contract No.2023N0011.
Abadía-Cardoso A, Anderson E C, Pearse D E, et al. 2013. Large-scale parentage analysis reveals reproductive patterns and heritability of spawn timing in a hatchery population of steelhead (Oncorhynchus mykiss). Molecular Ecology, 22(18): 4733–4746, doi: 10.1111/mec.12426
Anderson E C, Garza J C. 2006. The power of single-nucleotide polymorphisms for large-scale parentage inference. Genetics, 172(4): 2567–2582, doi: 10.1534/genetics.105.048074
Flanagan S P, Jones A G. 2019. The future of parentage analysis: From microsatellites to SNPs and beyond. Molecular Ecology, 28(3): 544–567, doi: 10.1111/mec.14988
García-Fernández C, Sánchez J A, Blanco G. 2018. SNP-haplotypes: An accurate approach for parentage and relatedness inference in gilthead sea bream (Sparus aurata). Aquaculture, 495: 582–591, doi: 10.1016/j.aquaculture.2018.06.019
Gjedrem T, Robinson N, Rye M. 2012. The importance of selective breeding in aquaculture to meet future demands for animal protein: a review. Aquaculture, 350–353: 117–129, doi: 10.1016/j.aquaculture.2012.04.008
Harlizius B, Lopes M S, Duijvesteijn N, et al. 2011. A single nucleotide polymorphism set for paternal identification to reduce the costs of trait recording in commercial pig breeding. Journal of Animal Science, 89(6): 1661–1668, doi: 10.2527/jas.2010-3347
Hauser L, Baird M, Hilborn R, et al. 2011. An empirical comparison of SNPs and microsatellites for parentage and kinship assignment in a wild sockeye salmon (Oncorhynchus nerka) population. Molecular Ecology Resources, 11(S1): 150–161, doi: 10.1111/j.1755-0998.2010.02961.x
Holman L E, De La Serrana D G, Onoufriou A, et al. 2017. A workflow used to design low density SNP panels for parentage assignment and traceability in aquaculture species and its validation in Atlantic salmon. Aquaculture, 476: 59–64, doi: 10.1016/j.aquaculture.2017.04.001
Houston R D, Bean T P, Macqueen D J, et al. 2020. Harnessing genomics to fast-track genetic improvement in aquaculture. Nature Reviews Genetics, 21(7): 389–409, doi: 10.1038/s41576-020-0227-y
Kalinowski S T, Taper M L, Marshall T C. 2007. Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment. Molecular Ecology, 16(5): 1099–1106, doi: 10.1111/j.1365-294X.2007.03089.x
Li Heng, Durbin R. 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics, 25(14): 1754–1760, doi: 10.1093/bioinformatics/btp324
Liu Sixin, Palti Y, Gao Guangtu, et al. 2016. Development and validation of a SNP panel for parentage assignment in rainbow trout. Aquaculture, 452: 178–182, doi: 10.1016/j.aquaculture.2015.11.001
Marshall T C, Slate J, Kruuk L E B, et al. 1998. Statistical confidence for likelihood-based paternity inference in natural populations. Molecular Ecology, 7(5): 639–655, doi: 10.1046/j.1365-294x.1998.00374.x
Slate J, Marshall T, Pemberton J. 2000. A retrospective assessment of the accuracy of the paternity inference program CERVUS. Molecular Ecology, 9(6): 801–808, doi: 10.1046/j.1365-294x.2000.00930.x
Steele C A, Anderson E C, Ackerman M W, et al. 2013. A validation of parentage-based tagging using hatchery steelhead in the Snake River basin. Canadian Journal of Fisheries and Aquatic Sciences, 70(7): 1046–1054, doi: 10.1139/cjfas-2012-0451
Tokarska M, Marshall T, Kowalczyk R, et al. 2009. Effectiveness of microsatellite and SNP markers for parentage and identity analysis in species with low genetic diversity: the case of European bison. Heredity, 103(4): 326–332, doi: 10.1038/hdy.2009.73
Vandeputte M, Haffray P. 2014. Parentage assignment with genomic markers: a major advance for understanding and exploiting genetic variation of quantitative traits in farmed aquatic animals. Frontiers in Genetics, 5: 432, doi: 10.3389/fgene.2014.00432
Walling C A, Pemberton J M, Hadfield J D, et al. 2010. Comparing parentage inference software: reanalysis of a red deer pedigree. Molecular Ecology, 19(9): 1914–1928, doi: 10.1111/j.1365-294X.2010.04604.x
Weinman L R, Solomon J W, Rubenstein D R. 2015. A comparison of single nucleotide polymorphism and microsatellite markers for analysis of parentage and kinship in a cooperatively breeding bird. Molecular Ecology Resources, 15(3): 502–511, doi: 10.1111/1755-0998.12330
Xu Jian, Feng Jingyan, Peng Wenzhu, et al. 2017. Development and evaluation of a high-throughput single nucleotide polymorphism multiplex assay for assigning pedigrees in common carp. Aquaculture Research, 48(4): 1866–1876, doi: 10.1111/are.13024
Yue Genhua, Xia Junhong. 2014. Practical considerations of molecular parentage analysis in fish. Journal of the World Aquaculture Society, 45(2): 89–103, doi: 10.1111/jwas.12107
Zhao Honggang, Li Chao, Hargrove J S, et al. 2018. SNP marker panels for parentage assignment and traceability in the Florida bass (Micropterus floridanus). Aquaculture, 485: 30–38, doi: 10.1016/j.aquaculture.2017.11.014
Zheng Guobin, Zhao Hongbo, Huang Liangmin, et al. 2023. Discovery and verification of SNP in Acanthopagrus latus. Journal of Tropical Oceanography (in Chinese), 42(2): 78–86, doi: 10.11978/2022108
Zhu Kecheng, Song Ling, Liu Baosuo, et al. 2020. Establishment of parentage determination in yellowfin seabream (Acanthopagrus latus). Journal of Fisheries of China (in Chinese), 44(3): 351–357, doi: 10.11964/jfc.20181011480
Year 2024 volume 43 Issue 2
PDF
54
29
Cite this Article
BibTeX
Article Info
doi: 10.1007/s13131-023-2221-7
  • Receive Date:2023-03-06
  • Online Date:2025-11-17
  • Published:2024-02-25
Article Data
Affiliations
History
  • Received:2023-03-06
  • Accepted:2023-05-31
Funding
Fujian Province science and technology plan project under contract No.2023N0011.
Affiliations
    1 Fisheries College of Jimei University, Xiamen 361021, China
    2 Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Xiamen 361021, China
    3 Bureau of Agriculture and Rural Affairs of Wudang District, Guiyang City, Guizhou Province, Guiyang 550018, China
    4 Fisheries Research Institute of Fujian, Xiamen 361000, China

Corresponding:

References
Share
https://castjournals.cast.org.cn/joweb/aos/EN/10.1007/s13131-023-2221-7
Share to
QR

Scan QR to access full text

Cite this article
BibTeX
Citations
表12种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏
  • BibTeX
  • EndNote
  • RefWorks
  • TxT