Exploring the Structure of Haplotype Blocks and Genetic Diversity in Chinese Indigenous Pig Populations for Conservation Purpose

Abstract

Chinese indigenous pigs in the Taihu Lake region are well known for their high fecundity and other excellent characteristics. To better understand the characteristics of these breeds in this area as well as to provide the government and breeders the molecular basis for formulating a reasonable conservation policy, we explored the structure of haplotype blocks and genetic diversity of the 7 populations which is relevant for the management and conservation of these important genetic resources using next-generation sequencing data. In this study, a total of 131 300 single-nucleotide polymorphisms with minor allele frequencies ⩾0.05 were obtained for further analysis. In general, there are similar within-breed genetic diversities (He, Ho, P_n, A_r) among these 7 pig populations in the Taihu Lake region. Average values for the inbreeding coefficients estimates in the 7 populations are 0.110 (F1), 0.056 (F2), and 0.078 (F3). All the breeds have seen a continuous decline in Ne estimates over time with FJ and SW populations having a very similar curve. Moreover, the Ne of SMS pig breeds were smaller than other Chinese pig breeds, indicating that SMS pig breeds underwent stronger selection pressure than other Chinese pig breeds. The average genetic distances among the 7 populations in the Taihu Lake region were 0.235 (MMS), 0.240 (SMS), 0.269 (EH), 0.248 (MI), 0.221 (FJ), 0.254 (JX), and 0.212 (SW). A summary of the number of haplotype blocks and haplotype diversity was also presented. This study provide a deep understanding of the current situation of conservation in this region, thereby uncovering the pertinent insight to better formulate more reasonable preservation policies for the government departments and breeding planners to follow-up.

Keywords

Chinese indigenous pigs haplotype blocks genetic diversity conservation

Introduction

The developing countries which are characterized with production environments that are low to medium input and high stress harbors most of the world’s breeds and each of these breeds are expected to have adapted to their specific environment. This expectation is strongly supported by empirical evidence implying that the genetic basis of population differentiation will be nonadditive for fitness traits and each breed will have different adaptive gene complexes. The Chinese indigenous pigs in the Taihu Lake region are well known for their high fecundity and enjoy the reputation of “national treasure.” These domestic pigs around the Taihu Lake also have many other excellent features such as disease resistance, resistant to rough feeding, and excellent meat quality. The first survey of pig resources in China regarded these animals as a single breed termed Taihu pigs.¹ However, since 1974, Taihu pigs have been divided into 7 breeds including Erhualian (EH), Meishan (MS), Fengjing (FJ), Jiaxing Black (JX), Mi (MI), Shawutou (SW), and Hengjing pig breed which is now extinct.According to the second survey of China’s swine breeds in 2011, these pigs were divided to 6 breeds (MS, FJ, JX, MI, SW and EH) with the Meishan breed further subdivided into 2 subpopulations which are called the Middle Meishan (MMS) and the Small Meishan (SMS).^2,3

From a global perspective, conservation is not only about endangered breeds but also about those that are not being used efficiently. A small proportion of breeds (mainly in the developed world) are involved in planned genetic improvement programs. For other breeds, and particularly in the developing world, there is an urgent need to develop breeding programs to improve their production and productivity. In recent years, the state and government have attached great importance to the protection of local genetic resources. The main objective is to conserve the indigenous breeds with the aim of minimizing the loss of among breed diversity which includes breeding programs that ensure efficient utilization and conserving those at risk. Therefore, the program urges farms to establish an efficient strategy to maintain genetic diversity and avoid inbreeding within these local breeds, which requires an in-depth investigation of population structure and phylogenetic relationships of these breeds. Genetic diversity contributes to the prioritization process as a tool (revealing migration routes, refuges, and historical responses) and as an object of conservation concern.⁴

We can consider the implications of the relationship between genetic diversity and conservation at many levels: genes, individuals, populations, varieties, subspecies, species, genera, and so on. Genetic diversity provides a retrospective view of evolutionary lineages of taxa, a snapshot of the current genetic structure within and among populations, and a glimpse ahead to the future evolutionary potential of populations and species. Genetic diversity is also one of 3 forms of biodiversity recognized by the International Union for Conservation of Nature (IUCN) as deserving conservation, along with species and ecosystem diversity. Genetic diversity has been defined as the variety of alleles and genotypes present in a population and this is reflected in morphological, physiological, and behavioral differences between individuals and populations.⁵ From a functional point of view, genetic diversity can be classified as neutral, deleterious, or adaptive.⁶ Since the beginning of the 1990s, the development of appropriate tools has resulted in a leading role for molecular markers in the characterization of genetic diversity. At this level, genetic diversity is usually measured by the frequencies of genotypes and alleles, the proportion of polymorphic loci, the observed and expected heterozygosity, or the allelic diversity.⁷

Recent studies have extensively evaluated genetic variation and population structure of Chinese pig breeds using not only high-density single-nucleotide polymorphism (SNP) markers^8,9 but also whole-genome sequencing data.¹⁰ These investigations focused primarily on genetic diversity and detected the signatures of positive selection between Chinese and Western pig breeds. Wang et al¹¹ studied genetic diversity and population structure of Chinese indigenous pig breeds in the Taihu region to provide basis for the division of breeds. Chen et al¹² reported on genetic diversity and population structure in Chinese indigenous pig breeds in Zhejiang Province. Also, many studies have extensively analyzed linkage disequilibrium (LD) features and haplotype blocks in livestock species, especially in pigs.¹³ The genome structure especially LD and haplotype blocks can provide fundamental information on the genome organization of these pig breeds and gives us a reference for formulating a new conservation strategy. However, to our knowledge, research on the genetic properties and relationships, especially for the structure of the haplotype block of the 7 indigenous pig populations in the Taihu Lake region from the perspective of conservation, is lacking. Therefore, in this study, we investigated the genetic diversity and haplotype blocks of the 7 Chinese indigenous populations in the Taihu Lake region to reveal current status of conservation and relationships of these pig populations. By studying the current indigenous pig breeds in the Taihu Lake region, we can provide a theoretical basis for our next step to better protect their genetic diversity and formulate conservation policies.

Materials and Methods

Ethics statement

All experimental procedures were approved by the Institutional Animal Care and Use Committee of Shanghai Jiao Tong University, and all methods involved pigs were in accordance with the agreement of Institutional Animal Care and Use Committee of Shanghai Jiao Tong University (contract no. 2011-0033).

Population and sequencing data

A total of 445 pigs (75 Small Meishan, 97 Medium Meishan, 36 Mizhu, 42 Erhualian, 91 Jiaxing Black, 72 Shawutou, and 32 Fengjing) from the 6 Chinese indigenous breeds in the Taihu Lake region were selected, including 252 pigs (69 Small Meishan, 50 Medium Meishan, 36 Mizhu, 31 Erhualian, 29 Jiaxing Black, 21 Shawutou, and 16 Fengjing) from Wang et al.¹¹ All DNA samples were genotyped according to the GGRS protocol¹⁴ (http://klab.sjtu.edu.cn/GGRS/). Briefly, high-molecular-weight genomic DNA samples were extracted from ear tissue, digested with AvaII and then ligated with a unique adapter barcode. Next, the samples were pooled and enriched to construct a sequencing library. Finally, the sequence libraries (fragments ranging from 300 to 400 bp [base pairs], including the adapter barcode sequence) were sequenced on an Illumina HiSeq2000 (the sequencing process is given in detail by the manufacturer, Illumina) instrument with a paired-end (2 × 100 bp) pattern. The SNPs were identified and genotyped using SAMtools,¹⁵ and these variants were retained for further analysis according to the following criteria: (1) SNP test scores are greater than or equal to 20 (ie, the accuracy of more than 99%), (2) the calling rates of SNP are greater than or equal to 90%, (3) the minor allele frequency (MAF) was greater than or equal to 5%, and (4) the detected SNP is the only one that appears on a fixed chromosome. Before these genotyped SNPs are phased by FASTPHASE¹⁶ for further analysis, the missing genotypes were imputed using iBLUP¹⁷ with the command line “perl iBLUP.pl genotype.vcf 445 89 0.1,” in which 445 is the total number of samples, 89 is the minimum detected number of samples, and 0.1 is the LD threshold. iBLUP is a genotype imputation method that imputes missing genotypes using identity-by-descent and LD information.¹⁷ A total of 131 300 SNPs with MAFs ⩾0.05 were obtained in our study.

Genetic diversity within populations

The allelic richness (A_r), proportion of polymorphic markers (P_n), expected heterozygosity (He) and observed heterozygosity (Ho) were used to investigate the genome-wide genetic variability within these 7 populations. A_r was calculated using ADZE v1.0.¹⁸ P_n, He, and Ho were calculated using PLINK v1.07.¹⁹

Marker-based inbreeding coefficients were estimated using the GCTA software.²⁰ Three different metrics were obtained using the -ibc option of the program: based on (1) the variance of the additive genotype (F1), (2) the excess of homozygosity (F2), and (3) the correlation between uniting gametes (F3).²⁰

The historical effective population size (Ne) was estimated by the software of SNeP v1.1,²¹ which can estimate Ne trends across generations using multilocus SNP data. This approach estimates historical effective population size based on the relationship between LD, Ne, and recombination rate, as well as corrects for sample size simutaneously:

N_{T (t)} = {(4 f (c_{t}))}^{- 1} (E {[r_{a d j}^{2} | c_{t}]}^{- 1} - α)

where $N_{T (t)}$ is the effective population size t generations ago calculated as $t = {(2 f (c_{t}))}^{- 1}$ , $c_{t}$ is the recombination rate for a specific physical distance between SNPs which was estimated by the Haldane mapping function in this study, $r_{a d j}^{2}$ is the LD value corrected for sample size, and $α$ is a correction for the occurence of mutations.

Genetic relationship among populations

Genetic distance

To estimate the genetic distances among populations, all 131 300 SNPs were used to calculate the average proportion of alleles shared, $D st$ , using PLINK v1.07.¹⁹ The definition of $D st$ is as follows:

D st = \frac{I B S_{2} + 0.5 * I B S_{1}}{N}

where IBS1 and IBS2 are the numbers of loci that share 1 or 2 alleles at 1 locus. The genetic distance (D) between all pairwise combinations of individuals was calculated as follows: $1 - D st$ . Neighbor-joining (NJ)²² trees were constructed using MEGA v6.0²³ based on the matrix of D. MEGA is a popular software to infer phylogenetic histories and conduct molecular evolutionary analysis.

Haplotype construction and haplotype diversity

A haplotype is a contraction of the phrase haploid genotype and is a stretch of DNA that is inherited as a unit. In diploid genomes, haplotypes are a set of closely linked nucleotides present on a chromosome that are inherited together. Thus, haplotypes are stretches of DNA in LD that are not broken up by recombination.

Haplotype blocks are estimated following the default procedure in HAPLOVIEW (v4.1). HAPLOVIEW (v4.1) was also used to define the haplotype blocks present in the genome. The method followed for block definition was previously described by Gabriel et al.²⁴ Haplotype diversity is defined as $1 - \sum f_{i}^{2}$ where $f_{i}$ is the frequency of the ith haplotype.

Data availability

All the SNP data we used were uploaded to our Web site (https://jbox.sjtu.edu.cn/l/XH2s6V). Supplemental File S1 contains all the figures of the NJ tree within the 7 pig populations, respectively. Supplemental File S2 contains the results of the haplotype blocks of the other 6 populations (MMS, MI, EH, FJ, SW, and JX).The authors affirm that all data necessary for confirming the conclusions of the article are present within the article, figures, and tables.

Results

Genetic diversity within populations

He Ho Fis P_n A_r

The within-breed genetic diversity of the 7 pig populations is presented in Table 1. Among the 7 populations, FJ had the largest He and Ho, whereas the value for Ho in the EH population was the lowest. Overall, the values of Ho were always greater than the values of He among the 7 populations and the Fis values were all negative which indicates excess of heterozygosity. The P_n values were all the same (0.999) in these 7 indigenous pig populations, which means that these populations have the similar proportions of polymorphic markers and also suggests that these populations were regarded as a single breed termed Taihu pigs. As for genetic diversity measured by allelic richness, the 7 populations had similar values of 1.99, almost closer to 2 indicating that the 7 populations have higher allelic richness.

Table 1.

Sample sizes and genetic diversities of the 7 pig populations in the Taihu Lake region.

Population	Sample size	Indices of genetic diversity					Nsnp
Population	Sample size	He	Ho	Fis	P_n	A_r	Nsnp
MMS	97	0.323	0.381	−0.180	0.999	1.999	100 599
SMS	75	0.342	0.410	−0.199	0.999	1.999	92 557
FJ	32	0.359	0.486	−0.354	0.999	1.994	83 012
EH	42	0.340	0.352	−0.035	0.999	1.997	98 822
JX	91	0.338	0.373	−0.104	0.999	1.999	89 486
MI	36	0.338	0.389	−0.151	0.999	1.994	98 897
SW	72	0.346	0.468	−0.353	0.999	1.999	100 654

He, expected heterozygosity; Ho, observed heterozygosity; Fis, fixation index; A_r, allelic richness; P_n, proportion of SNPs that displayed polymorphisms; Nsnp, number of SNPs of the 7 pig populations.

Inbreeding coefficient

Average values for the positive estimates in the 7 populations were 0.110 (F1), 0.056 (F2), and 0.078 (F3) (Table 2). Although the estimates from various approaches were different, however, they all showed similar results. SMS, SW, and MMS had the lowest inbreeding degree compared with MI, FJ, and EH. Overall, the average inbreeding coefficient for all populations in the Taihu Lake region was 0.081, which implies that there is an acceptable conservation effect for the Chinese indigenous pigs in the Taihu Lake region. The correlations of the inbreeding coefficients estimated by the 3 methods were all higher with values of 0.75, 0.95, and 0.88, respectively (Figure 1).

Table 2.

The inbreeding coefficients in the 7 populations.

Inbreeding coefficient	Breed							Average
Inbreeding coefficient	MMS	SMS	EH	MI	FJ	JX	SW	Average
F1	0.042	0.033	0.225	0.155	0.134	0.115	0.068	0.110
F2	0.050	0.045	0.112	0.018	0.030	0.130	0.004	0.056
F3	0.040	0.035	0.168	0.074	0.078	0.120	0.032	0.078
Average	0.044	0.034	0.168	0.082	0.081	0.122	0.035	0.081

Figure 1.

The correlation of the inbreeding coefficient estimated by the 3 methods.

Effective population size

The tendency of effective population size (Ne) of each pig breed along the generations is shown in Figure 2. The past Ne was reflected by LD over shorter recombinational distances and the longer distances provided recent ancestry.²⁵ In general, all the breeds have seen a continuous decline on Ne estimates over time and FJ and SW populations have very similar curves and results. Also, the Ne of SMS pig breeds are smaller than other Chinese pig breeds, indicating that SMS pig breeds might undergo stronger selection pressure than other Chinese pig breeds.

Figure 2.

The tendency of Ne of the 7 indigenous pig populations.

Genetic relationship among populations

Genetic distance

The average genetic distances among the 7 populations in the Taihu Lake region were 0.235 (MMS), 0.240 (SMS), 0.269 (EH), 0.248 (MI), 0.221 (FJ), 0.254 (JX), and 0.212 (SW). All these 7 populations had similar genetic distance within their own groups and it is obvious that SW has the nearest genetic distance of the 7 populations, which is consistent with our previous results in the inbreeding coefficient.

The NJ trees were also constructed using genome-wide genotypes for more intuitive expression of genetic distances within the 7 pig populations (Supplemental Files S1 and S2).

The analysis of the haplotype blocks and haplotype diversity

A summary of the number of haplotype and haplotype diversity is presented in Table 3. We also give a summary of the distribution, size, number, and SNPs involved in the haplotype blocks per chromosome of SMS population in Table 4 and the statistics of the haplotype blocks of the other 6 populations are presented in Supplemental File S3. From Table 3, it can be seen that there are more haplotypes in MS (MMS, SMS) population compared with the other populations. Also, there are no significant differences among the haplotype diversities and the average of haplotype diversity was about 0.418, which means that there is a similar conservation status among these 7 populations in the Taihu Lake region.

Table 3.

The number of haplotype and haplotype diversity among the 7 populations.

Breed	MMS	SMS	SW	MI	JX	FJ	EH
No. of haplotype	14 960	14 076	14 168	9830	13 766	8286	11 046
Haplotype frequency	0.22	0.21	0.21	0.21	0.22	0.21	0.21
Haplotype diversity	0.393	0.414	0.420	0.431	0.403	0.445	0.419

Table 4.

The block structure per chromosome in MMS.

Chromosome	No. of blocks	Total block length, kb	Min. block length, kb	Max. block length, kb	No. of SNPs in blocks	% of SNPs in blocks
1	1342	929.27	0.02	125.52	3167	8.94
2	1308	458.01	0.02	38.35	3044	8.60
3	1049	343.48	0.02	25.67	2510	7.09
4	812	577.58	0.02	169.62	1963	5.54
5	676	410.00	0.02	92.42	1612	4.55
6	1180	318.13	0.02	84.24	2773	7.83
7	910	355.29	0.02	70.60	2130	6.01
8	534	286.03	0.02	36.76	1257	3.55
9	1019	629.90	0.02	94.28	2421	6.84
10	570	331.91	0.02	82.20	1393	3.93
11	514	125.35	0.02	33.28	1187	3.51
12	582	135.32	0.02	18.19	1381	3.90
13	752	740.26	0.02	194.80	1788	5.05
14	1087	587.20	0.02	72.69	2554	7.21
15	721	325.50	0.02	23.98	1721	4.86
16	451	166.09	0.02	26.49	1074	3.03
17	571	212.85	0.02	25.61	1340	3.78
18	446	409.14	0.02	97.39	1074	3.03
X	436	510.04	0.02	179.24	1030	2.91
All	14 961	7851.36	0.02	194.80	35 419	5.27

In the MMS population, a total of 14 960 haplotype blocks spanning 7851 kb of the genome were detected (Table 4). The average block size was 0.52 kb, ranging from 0.02 to 194.80 kb (chr13, 166282257-166477058, 4 SNPs). In total, 35 491 SNPs (35.28% of all SNPs used in MMS) formed blocks with a range of 2 to 9 SNPs per block. The autosomes showing the longest and shortest haplotypic structures in the genome were chr1 with 1342 blocks spanning 929.27 kb and chr18 with 446 blocks covering 409.14 kb. One of the important reasons for this is that it can be related to the length of chromosomes in pigs. Also, chromosome 1 had the highest density (8.94%) of SNP in haplotype blocks, whereas the lowest density (2.91%) was observed in chromosome X.

We also made a statistics of common haplotypes across populations. As shown in Table 5, most common haplotypes (283) were found to occur between MMS and SMS and the least common haplotypes (63) were between FJ and EH. The numbers of common haplotypes had no significant differences for every 2 populations. However, the population-specific haplotypes can better represent unique characters in one population and thus indicates that we are supposed to pay more attention to the population-specific haplotypes in future conservation programs.

Table 5.

The statistics of common haplotypes across populations.

	MMS	SMS	FJ	JX	SW	EH	MI
MMS	14 960	283	105	132	139	108	121
SMS	283	14 076	188	249	278	212	197
FJ	105	188	8286	103	104	63	93
JX	132	249	103	13 766	152	92	112
SW	139	278	104	152	14 168	101	130
EH	108	212	63	92	101	11 046	79
MI	121	197	93	112	130	79	9830

Discussion

There are many ways to measure the genetic variation and the loss of its diversity. With the application of molecular marker technology to the study of livestock and poultry diversity, it is very important for a specific conservation population and population genetic diversity indicators to measure the genetic variation and conservation of genetic diversity. The sensitivity of different genetic diversity indicators is different. Hence, the actual situation is crucial to be used as selection criteria for genetic diversity indicators.

Heterozygosity is one of the major genetic variations in natural populations. It is often one of the first “parameters” that one presents in a data set. It can tell us a great deal about the structure and even history of a population. High heterozygosity means lots of genetic variability, and low heterozygosity means little genetic variability. Often, we will compare the observed level of heterozygosity with what we expect under Hardy-Weinberg equilibrium. If the observed heterozygosity is lower than expected, we tend to attribute the discrepancy to forces such as inbreeding. Allelic richness is the number of alleles per locus rarefied to match the number of observations in the population with the lowest sample size.²⁶ This measure obviously depends on sample size, and to compare samples of different sizes, the number of alleles per locus is often replaced by allelic richness. Allelic richness provides complementary information to gene diversity (expected heterozygosity). Situations can be given of populations with the same heterozygosity but different allelic richness and vice versa. However, the consequences of these different population compositions can be different in terms of potentiality of the population for adaptation and evolution. Allelic richness and gene diversity can also behave differently in terms of genetic differentiation between subpopulations in the context of a subdivided population. El Mousadik and Petit²⁶ proposed a coefficient of allelic richness differentiation (ρST) and found that this parameter gives higher values than gene diversity differentiation in an analysis of allozymes in argan trees. Also, a locus could be defined as monomorphic if the most common allele frequency is 100%, 99%, or 95% of all sampled alleles. As loss of rare alleles is expected to be one of the most immediate results of reduced population size, either the 100% or 99% criterion may be better estimates in endangered species. This may appear a straightforward measure but different studies vary in what criteria are used for scoring a locus as polymorphic.

Another aspect of interest while studying a population under selection pressure is to study the level of inbreeding. Traditional estimation of the inbreeding coefficient based on pedigree data²⁷ is dependent on the completeness and accuracy of the available pedigree records. Currently, using the information provided by molecular markers (genome-wide SNP chip panels), we can estimate this coefficient with or without pedigree information. Several methods have been described for this purpose.^20,28,29 Individuals with the same inbreeding coefficient could be classified as inbred when they were sampled from a population with few inbred individuals and as outbred when they were sampled from a population where inbreeding was more frequent. Effective population size (Ne) is another important population genetic parameter which can describe the amount of genetic drift in populations. It has been subject to much research to estimate Ne over the past 80 years. The methods to estimate Ne from LD were developed about 40 years ago. However, only the most recent advances in DNA technology have made the calculation of Ne depending on large amounts of genetic marker data available.²¹

Genetic distance is a measure of the genetic divergence between species or between populations within a species, whether the distance measures time from common ancestor or degree of differentiation.³⁰ Populations with many similar alleles have small genetic distances. This indicates that they are closely related and have a recent common ancestor. Genetic distance is useful for reconstructing the history of populations. Genetic distance is also used for understanding the origin of biodiversity. For example, the genetic distances between different breeds of domesticated animals are often investigated to determine which breeds should be protected to maintain genetic diversity.³¹

Substantial evidence has already accumulated that the genome can be parsed into haplotype blocks of variable length. Haplotype blocks, together with the corresponding tag SNPs and common haplotypes determined by haplotype block–partitioning algorithms, can be used in genome-wide association studies, as well as in the fine-scale mapping of complex disease genes. Understanding the patterns of haplotype blocks is useful to develop appropriate management and conservation programs, to maintain overall genetic diversity and avoid inbreeding. Allelic diversity is an alternative criterion to measure genetic diversity, and some authors^32,33 consider that this parameter is the most relevant in conservation programs, as high number of alleles imply a source of single-locus variation for important traits such as the major histocompatibility complex, which is responsible for the recognition of pathogens. It is also important from a long-term perspective because the limit of selection response is determined by the initial number of alleles³⁴ and it is more sensitive to bottlenecks than expected heterozygosity.

The protection and preservation of the current breeds or populations depend on the phenotype, origin, and distribution and lack of molecular genetic basis. In this study, we can essentially understand their genetic differences using genome-wide high-density markers to investigate haplotype structures and genetic diversity in domestic animal populations. Through the calculation of A_r and P_n, we confirmed that the genetic diversity of the indigenous pig breeds in China is at a relatively high level. This may be due to the lower selective pressure and the higher genetic diversity of their ancestors. The haplotype structure and genetic diversity of the indigenous pig breeds in the Taihu Lake region were evaluated using 131 300 SNPs that were relatively evenly distributed in the genome. This will further deepen our understanding of the characteristics of Chinese indigenous pig germplasm resources and provide a molecular basis for the subsequent development of conservation and policy formulation.

As a result of environmental conditions and breed selection procedures used across decades, a number of indigenous pig breeds have been developed over time in the Taihu Lake region. Our results depict the haplotype blocks and genetic diversity of the 7 populations pig populations in the Taihu Lake region, which is relevant for the management and conservation of these important genetic resources in indigenous pig breeds.

Conclusions

In this study, we analyzed the haplotype structure and genetic diversity of the 7 local pig populations in the Taihu Lake region to better achieve the utilization and protection of their genetic resources. It is proved that the genetic diversity of the 7 Chinese indigenous pig populations in the Taihu Lake region is at a high level as a whole, but it is still necessary to further improve the conservation effect. Furthermore, we provided some derivations and perspectives on conservation strategies in a subdivided meta-population and discussed how to contribute to the sustainable livestock systems in highly variable and challenging environments. In brief, we conducted a comprehensive survey of the nucleotide variability of the 7 Chinese indigenous pig populations in the Taihu Lake region on a genome-wide scale and believe that the findings presented will lay a good foundation for the development of a national plan for the conservation and utilization of these pig populations.

Supplemental Material

Supplemental_Material – Supplemental material for Exploring the Structure of Haplotype Blocks and Genetic Diversity in Chinese Indigenous Pig Populations for Conservation Purpose

Supplemental material, Supplemental_Material for Exploring the Structure of Haplotype Blocks and Genetic Diversity in Chinese Indigenous Pig Populations for Conservation Purpose by Qing-bo Zhao, Hao Sun, Zhe Zhang, Zhong Xu, Babatunde Shittu Olasege, Pei-pei Ma, Xiang-zhe Zhang, Qi-shan Wang and Yu-chun Pan in Evolutionary Bioinformatics

Footnotes

Acknowledgements

The authors also greatly appreciate the 2 anonymous reviewers for their diligent work and associate editor, whose comments and suggestions gave a great contribution to the improvement of the manuscript.

Funding:

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported by the National Natural Science Foundation of China (grant nos 31772552, U1402266, 31672386, 31472069).

Declaration of Conflicting Interests:

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Author Contributions

YP and QW designed and supervised the study, whereas QZ analyzed the data and wrote the manuscript. HS and ZX collected the samples and all authors read and edited the manuscript.

Supplemental Material

Supplemental material for this article is available online.

References

Zhang

Chinese Pig Breed Records. Shanghai, China: Shanghai Science and Technology Press; 1986.

Zhang

Chinese Taihu Pig. Shanghai, China: Shanghai Scientific and Technical Publishers; 1991.

Commission of Animal Genetic Resources. Animal Genetic Resources in China Pigs. Beijing, China: Chinese Agriculture Press; 2001.

Hoban

Integrative conservation genetics: prioritizing populations using climate predictions, adaptive potential and habitat connectivity. Molec Ecol Resources. 2018;18:14–17.

Frankham

Briscoe

Ballou

JD.

Introduction to Conservation Genetics. Cambridge, UK; New York, NY: Cambridge University Press; 2002.

Hedrick

PW.

Conservation genetics: where are we now?

TREE. 2001;16:629–636.

Toro

Caballero

Characterization and conservation of genetic diversity in subdivided populations. Philos Trans R Soc Lond B Biol Sci. 2005;360:1367–1378.

Huang

Ren

Genetic diversity, linkage disequilibrium and selection signatures in Chinese and Western pigs revealed by genome-wide SNP markers. PLoS ONE. 2013;8:e56001.

Yang

Xie

Chen

Ren

Population history and genomic signatures for high-altitude adaptation in Tibetan pigs. BMC Genomics. 2014;15:834.

10.

Fang

Yang

et al . Adaptation and possible ancient interspecies introgression in pigs identified by whole-genome sequencing. Nat Genet. 2015;47:217–225.

11.

Wang

Chen

Yang

et al . Genetic diversity and population structure of six Chinese indigenous pig breeds in the Taihu Lake region revealed by sequencing data. Animal Genet. 2015;46:697–701.

12.

Chen

Peng

Xiao

et al . The genetic diversity and population structures of indigenous pig breeds in Zhejiang Province revealed by GGRS sequencing. Animal Genet. 2018;49:36–42.

13.

Uimari

Tapio

Extent of linkage disequilibrium and effective population size in Finnish Landrace and Finnish Yorkshire pig breeds. J Animal Sci. 2011;89:609–614.

14.

Chen

Yang

et al . Genotyping by genome reducing and sequencing for outbred animals. PLoS ONE. 2013;8:e67500.

15.

Handsaker

Wysoker

et al . The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079.

16.

Scheet

Stephens

A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am J Human Genet. 2006;78:629–644.

17.

Yang

Wang

Chen

et al . A new genotype imputation method with tolerance to high missing rate and rare variants. PLoS ONE. 2014;9:e101025.

18.

Szpiech

Jakobsson

Rosenberg

NA.

ADZE: a rarefaction approach for counting alleles private to combinations of populations. Bioinformatics. 2008;24:2498–2504.

19.

Purcell

Neale

Todd-Brown

et al . PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Human Genet. 2007;81:559–575.

20.

Yang

Lee

Goddard

Visscher

PM.

GCTA: a tool for genome-wide complex trait analysis. Am J Human Genet. 2011;88:76-82.

21.

Barbato

Orozco-terWengel

Tapio

Bruford

MW.

SNeP: a tool to estimate trends in recent effective population size trajectories using genome-wide SNP data. Front Genet. 2015;6:109.

22.

Thomasset

Hodkinson

Restoux

Frascaria-Lacoste

Douglas

Fernandez-Manjarres

JF.

Thank you for not flowering: conservation genetics and gene flow analysis of native and non-native populations of Fraxinus (Oleaceae) in Ireland. Heredity. 2014;112:596–606.

23.

Tamura

Stecher

Peterson

Filipski

Kumar

MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Molec Biol Evol. 2013;30:2725–2729.

24.

Gabriel

Schaffner

Nguyen

et al . The structure of haplotype blocks in the human genome. Science. 2002;296:2225–2229.

25.

Hayes

Visscher

McPartlan

Goddard

ME.

Novel multilocus measure of linkage disequilibrium to estimate past effective population size. Genome Res. 2003;13:635–643.

26.

El Mousadik

Petit

. High level of genetic differentiation for allelic richness among populations of the argan tree [Argania spinosa (L.) Skeels] endemic to Morocco. Theor Appl Genet. 1996;92:832-839.

27.

Wright

Coefficients of inbreeding and relationship. Am Naturalist. 1922;56:330–338.

28.

Stranden

Tiirikka

Sevon-Aimonen

Kantanen

A comparison of approaches to estimate the inbreeding coefficient and pairwise relatedness using genomic and pedigree data in a sheep population. PLoS ONE. 2011;6:e26256.

29.

Powell

Visscher

Goddard

ME.

Reconciling the analysis of IBD and IBS in complex trait studies. Nat Rev Genet. 2010;11:800–805.

30.

Nei

Molecular Evolutionary Genetics. New York, NY: Columbia University Press; 1987.

31.

Ruane

A critical review of the value of genetic distance studies in conservation of animal genetic resources. J Animal Breed Genet. 1999;116:317–323.

32.

Petit

El Mousadik

Pons

Identifying populations for conservation on the basis of genetic markers. Conservat Biol. 1998;12:844–855.

33.

Barker

JSF

. Conservation and management of genetic diversity: a domestic animal perspective. Canadian J Forest Res. 2001;31:588–595.

34.

Hill

Rasbash

Models of long term artificial selection in finite population. Genet Res. 1986;48:41–50.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.63 MB

Exploring the Structure of Haplotype Blocks and Genetic Diversity in Chinese Indigenous Pig Populations for Conservation Purpose

Abstract

Keywords

Introduction

Materials and Methods

Ethics statement

Population and sequencing data

Genetic diversity within populations

Genetic relationship among populations

Genetic distance

Haplotype construction and haplotype diversity

Data availability

Results

Genetic diversity within populations

He Ho Fis Pn Ar

Inbreeding coefficient

Effective population size

Genetic relationship among populations

Genetic distance

The analysis of the haplotype blocks and haplotype diversity

Discussion

Conclusions

Supplemental Material

Supplemental_Material – Supplemental material for Exploring the Structure of Haplotype Blocks and Genetic Diversity in Chinese Indigenous Pig Populations for Conservation Purpose

Footnotes

Acknowledgements

Funding:

Declaration of Conflicting Interests:

Author Contributions

Supplemental Material

References

Supplementary Material

He Ho Fis P_n A_r