Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genome-wide association study of salt tolerance at the seed germination stage in lettuce

  • Modan K. Das ,

    Roles Data curation, Formal analysis, Methodology, Writing – original draft, Writing – review & editing

    modan.das@usda.gov

    Affiliation USDA-Agricultural Research Service, Sam Farr United States Crop Improvement and Protection Research Center, Salinas, CA, United States of America

  • Sunchung Park,

    Roles Data curation, Investigation, Methodology, Writing – review & editing

    Current address: USDA-Agricultural Research Service, Sustainable Perennial Crops Lab, Beltsville, MD, United States of America

    Affiliation USDA-Agricultural Research Service, Sam Farr United States Crop Improvement and Protection Research Center, Salinas, CA, United States of America

  • Neil D. Adhikari,

    Roles Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Writing – review & editing

    Current address: California Department of Public Health, Sacramento, CA, United States of America

    Affiliation USDA-Agricultural Research Service, Sam Farr United States Crop Improvement and Protection Research Center, Salinas, CA, United States of America

  • Beiquan Mou

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing – review & editing

    Affiliation USDA-Agricultural Research Service, Sam Farr United States Crop Improvement and Protection Research Center, Salinas, CA, United States of America

Abstract

Developing lettuce varieties with salt tolerance at the seed germination stage is essential since lettuce seeds are planted half an inch deep in soil where salt levels are often highest in the salinity-affected growing regions. Greater knowledge of genetics and genomics of salt tolerance in lettuce will facilitate breeding of improved lettuce varieties with salt tolerance. Accordingly, we conducted a genome-wide association study (GWAS) in lettuce to identify marker-trait association for salt tolerance at the seed germination stage. The study involved 445 diverse lettuce accessions and 56,820 single nucleotide polymorphism (SNP) markers obtained through genotype-by-sequencing technology using lettuce reference genome version v8. GWAS using two single-locus and three multi-locus models for germination rate (GR) under salinity stress, 5 days post seeding (GR5d_S) and a salinity susceptibility index (SSI) based on GR under salinity stress and control conditions, 5 days post seeding (SSI_GR5d) revealed 10 significant SNPs on lettuce chromosomes 2, 4, and 7. The 10 SNPs were associated with five novel QTLs for salt tolerance in lettuce, explaining phenotyping variations of 5.85%, 4.38%, 4.26%, 3.77%, and 1.80%, indicating the quantitative nature of these two salt tolerance-related traits. Using the basic local alignment search tool (BLAST) within 100 Kb upstream and downstream of each of the 10 SNPs, we identified 25 salt tolerance-related putative candidate genes including four genes encoding for major transcription factors. The 10 significant salt tolerance-related SNPs and the 25 candidate genes identified in the current study will be a valuable resource for molecular marker development and marker-assisted selection for breeding lettuce varieties with improved salt tolerance at the seed germination stage.

Introduction

Soil salinity is one of the most important abiotic stresses limiting crop production worldwide. Soil salinization can arise from natural causes, or from human activities. Natural causes include rock weathering, high temperatures, low rainfall, and closeness to salt water. Climate change further exacerbates this issue, with rising temperatures promoting increased transpiration from plants and evaporation from soil, leaving even more salt behind in soil. On the other hand, human-induced salinity results from deforestation and agricultural practices such as irrigation and chemical fertilization. Food and Agriculture Organization (FAO) recently reported that more than 833 million hectares of subsoil and 424 million hectares of topsoil are salt-affected globally (https://www.fao.org/soils-portal/data-hub/soil-maps-and-databases/global-map-of-salt-affected-soils/en/).). Therefore, many farmers worldwide grow crops in salt-affected soils. The most economic and environmentally friendly solution for managing soil salinity problem is through development of salt-tolerant crop cultivars.

Lettuce (Lactuca sativa L.) is one of the most important leafy vegetable crops globally and the U.S. is the second largest (by weight) lettuce producer in the world behind China. In 2022, lettuce accounted for nearly one-fifth of the $21.8 billion earned by the U.S. growers from sales of vegetables and melons, including romaine, iceberg, and leaf type lettuce sales of $1.54 billion, $1.33 billion, and $1.25 billion, respectively (https://www.ers.usda.gov/data-products/chart-gallery/gallery/chart-detail/?chartId=106516). California grows more than 70% of the U.S. lettuce, with the main producing region located in the Salinas Valley [1]. Commercial lettuce production in the irrigated valleys of the southwest, is often located on soils that are somewhat saline or at least subject to salinization and soil salinity in the Salinas Valley is becoming a growing concern especially due to its proximity to the ocean and resulting seawater intrusion [2, 3].

Lettuce is a self-pollinated annual crop species with a diploid chromosome number of 2n = 18 and a total genome size of 2.6 Gb. The wild species of lettuce, Lactuca serriola is known to be one of or the only direct ancestor(s) of modern-day lettuce [48]. Genetic diversity in shape, size, and color among lettuce cultivars exists and based on leaf shape, size and texture, head formation and stem type, it is generally classified into six horticultural types: crisphead, butterhead, romaine, leaf, stem, and Latin [8].

Germination and establishment have been identified as very important stages in crop growth and productivity, however, most studies on the effects of salt stress on plant growth have focused on stages beyond early seedling growth [911]. Salt stress inhibits seed germination by reducing water uptake (osmotic stress) and creating ionic imbalance within the seed (ionic stress) [11]. However, a recent study reported that lettuce seed germination is inhibited by salt-induced osmotic stress rather than ion toxicity, since seeds could germinate fully after salt solutions were removed [12]. A previous study reported that 60 mM NaCl salt stress delayed and reduced germination of lettuce seed [13]. A study on the effect of salinity (0–120 mM NaCl) and temperature (20–30°C) on germination of ‘Phoenix’ lettuce seed, reported that increasing levels of both factors inhibited germination and reduced fresh weight of young seedlings [10]. The interaction between salinity and temperature had a significant inhibitory effect on germination and while lettuce seed germination at lower salt levels (20–40 mM NaCl) was similar to the control (water) at 20°C, it was considerably reduced at 30°C [10].

The sensitivity of lettuce to salinity causes reduction in yield and affects grower profits. Lettuce seeds are planted half an inch deep in soil where salt levels are often highest in salinity-affected growing regions. Therefore, it is imperative to breed lettuce varieties that can germinate and establish in salt-affected soil. Knowledge of the genetic diversity and genomics of salt tolerance and identification of genomic regions associated with salt tolerance at the seed germination stage will greatly facilitate breeding salt-tolerant lettuce varieties. In a previous study, four lettuce varieties were tested for their germination ability under salt stress at 0, 50, 100, and 150 mM NaCl, where the varieties Vista and Verte were highly sensitive, and Romaine and Augusta were less sensitive to higher levels of salt stress, demonstrating variability for salt tolerance among the four varieties [14]. In another study, significant variation in seed germination among 27 lettuce varieties was observed under two salt stress conditions (150 and 200 mM NaCl) [12]. Substantial variation in salt tolerance among 178 lettuce cultivars and germplasm accessions from four major horticultural types (butterhead, crisphead, romaine, and leaf) and wild type was reported from a greenhouse study [15]. Reports on molecular markers associated with salinity tolerance in lettuce are limited and to our knowledge, there has been no marker-trait association study on salt tolerance at the seed germination stage in lettuce. An earlier study, using AFLP and EST markers and a recombinant inbred line (RIL) population derived from a bi-parental cross between L. sativa and L. serriola identified nine quantitative trait locus (QTLs) including three major QTLs associated with salt-induced changes in root system architecture and ion accumulation at the seedling stage [16]. In a separate study, using an F2 population derived from L. sativa x L. serriola and 384 SNP markers, four QTLs on chromosome 7 and two QTLs on chromosome 9 for salt tolerance (ion content traits) were identified at seedling stage (35 days old) in lettuce [17]. It should be noted here that for both the above-mentioned studies, seeds were first germinated in non-salt-stress condition followed by salt-stress at the seedling stage [16, 17].

GWAS serves as a model for detecting associations between genotypic and phenotypic values. GWAS has emerged as a powerful tool to resolve complex trait variation at the sequence level using historical and evolutionary recombination events at the population level [18, 19]. GWAS is made possible by the existence of linkage disequilibrium, as it detects and locates QTL based on the strength of correlation between mapped genetic markers and the traits under investigation [20]. GWAS relies on linkage disequilibrium that is initially present in a population, and over several generations, in an unstructured population, only correlations between markers closely linked to QTL remains [20]. However, in the presence of population structure, family structure and cryptic relatedness, marker-trait associations detected by GWAS can be spurious [2022]. Therefore, to avoid spurious associations, it is important to correct for population structure, family structure and cryptic relatedness in GWAS. Currently, GWAS models are available such as MLM that uses both fixed and random effects and corrects for population structure, family structure and cryptic relatedness [23, 24]. In fact, mixed models are ideal for GWAS, as they can be applied without explicit identification of relatedness within the samples [22]. MLM is a single-locus model that tests one marker at a time, iteratively for every marker in a dataset, while multi-locus models consider the information of all loci simultaneously [25]. When multiple loci contribute to the phenotypic variation of a trait in an additive fashion, single-locus models are not adequate in finding marker-trait associations [26]. To overcome this shortcoming, multi-locus models such as Multiple Locus Mixed Linear Model (MLMM) [27], Fixed and random model Circulating Probability Unification (FarmCPU) [25], and Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK) [28] among others have been developed in recent years.

To our knowledge, there is no report of GWAS on salt tolerance in lettuce. GWAS has been applied to identify marker-trait associations for other traits in lettuce [2933]. However, GWAS on salt tolerance at the seed germination stage has been reported in other crops such as rice, mungbean, flax and Brassica napus [3437]. Using high density SNPs and 478 accessions, 11 SNP loci associated with salinity susceptibility index of seed germination-related traits under 60 mM NaCl salt stress condition were identified in rice [34]. Using GWAS, SNP loci associated with seed germination percent under 50 mM NaCl salt stress were identified on mungbean chromosomes 7 and 9 [35]. GWAS was performed with 674,074 SNPs and 200 diverse flax accessions, and many SNPs located on all 15 flax chromosomes associated with salt tolerance at the seed germination stage were identified [36]. In another study, GWAS on salt tolerance at the seed germination stage identified 31 salt-stress related QTLs in B. napus [37].

In an earlier GWAS on bolting in lettuce that included 400 accessions common to the current study, genetic diversity and population structure of the GWAS panel was reported [31]. The objective of the present study was to perform a genome-wide association analysis in lettuce to identify SNP loci significantly associated with salt tolerance at the seed germination stage, which could potentially be used for marker development for breeding improvement of lettuce for salinity tolerance. However, since 45 accessions in the current study were different from the accessions used in the earlier study [31], we also studied population structure and linkage disequilibrium and briefly discuss them in this study.

Materials and methods

Plant materials

A set of 445 diverse lettuce accessions (Lactuca spp.), mainly consisting of cultivated types with a few wild and primitive types, was selected from the germplasm collection at the USDA-ARS, Salinas, CA for the current study. In an earlier study, a total of 400 accessions common to this diversity panel was used for a genetic diversity and genome-wide association analysis for bolting in lettuce by our group [31]. Based on horticultural types, the 445 accessions in the current study included 117 butterhead, 106 romaine, 100 leaf, 86 crisphead, eight stem, seven wild type (L. serriola), five primitive, four transitional, three Batavia and one of each of Latin, lobbed, and L. perennis type (S1 Table).

Phenotyping and phenotypic data analyses

Phenotypic data were collected through replicated seed germination experiments using petri dishes and filter papers in a germination chamber (Percival, model GR41L; Perry, IA). The experiment involved 150 seeds per accession and was evaluated under two treatment conditions: salt-stress and control. Salt-stress condition was 100 mM NaCl solution prepared in deionized water, while the control condition used deionized water only. A completely randomized experimental design with three replications for both salt-stress and control conditions was employed. Twenty-five seeds for each replication were placed in a 100 mm X 20 mm petri dish containing a single layer filter paper, and then 4.5 mL of NaCl solution and 4.5 mL of deionized water were added, respectively, to the petri dishes with salt-stress and control conditions. A temperature of 21°C and continuous light conditions were maintained in the germination chamber during seed germination. Seed germination was recorded 2 days and 5 days post seeding. A seed was considered germinated when its radicle length reached ≥ 5 mm and cotyledon leaves were open.

GR and SSI were calculated from the seed germination data. GR was expressed as the germination percentage, calculated as GR = Nt/N0 X 100, where Nt is the number of seeds germinated on a particular day and N0 is the total number of seeds. SSI was calculated following Fischer and Maurer [38] as SSI = (1 − Ys/Yp)/D, where Ys = mean performance of a genotype under salt stress; Yp = mean performance of the same genotype without salt stress; D (stress intensity) = 1 − (mean Ys of all genotypes/mean Yp of all genotypes).

Statistical analyses included analysis of variance (ANOVA) and Pearson’s correlation coefficient. Using the mean square values from ANOVA, broad-sense heritability (H2) on entry-mean basis for GR under salt stress condition and SSI were estimated after Fehr [39] as H2 = σ2g /(σ2g + σ2e/r), where σ2g = genetic variance, σ2e = error variance, r = number of replication, and (σ2g + σ2e/r) = phenotypic variance. GR and SSI data were normalized using the rank-based inverse normal transformation [40] for ANOVA and heritability estimates. This transformation method was chosen, because the more commonly used data normalization methods such as log, square root, Box-Cox transformation did not adequately normalize our data set.

SNP genotyping

Genomic DNA extraction and SNP genotyping were conducted by Data2Bio (Ames, IA) from leaf tissue of 528 lettuce accessions. Genotyping was carried out using a modified tGBS® Genotyping and Sequencing technology with the restriction enzyme BSP1682I [41]. Samples were sequenced using an Illumina HISeq X instrument, and the reads were aligned to the Lactuca sativa v8 reference genome (Reyes-Chin-Wo et al. [42]; genome assembly: Lsat_Salinas_v8) after debarcoding and trimming of reads. Data2Bio filtered the SNP sites that meet the tGBS® genotyping criteria resulting in 539,061 SNPs. The initial filtering criteria included: a minimum call rate ≥ 20%, allele number = 2, number of genotypes ≥ 2, minor allele frequency (MAF) ≥ 1%, and heterozygosity rate range = 0%–(2 x Frequency of allele1 x Frequency of allele2 + 20%). The missing rate of SNPs in each sample was plotted against the fraction of SNPs that were heterozygous and 37 samples were removed due to high missing rate, leaving 491 samples. Due to insufficient seed quantities, only 445 accessions were used in the present study from this set of 491 samples. To retain high-quality SNPs for marker-trait association analyses, SNPs were further filtered with a minimum call rate ≥ 50% and MAF ≥ 5%, resulting in a total of 56,820 SNPs. All downstream analyses were performed using 56,820 SNPs and 445 samples. The missing values were imputed using Beagle version 5.4 software [43] with 3 Burnin iteration and 12 Phasing iteration. This software uses an algorithm that implements haplotype phase interphase and provided a high accuracy imputation with an error rate of < 0.01.

Analysis of linkage disequilibrium

LD (as measured by r2) between pairs of SNPs was computed by TASSEL software version 5.2.87 [44] with 50 sliding windows. LD and LD decay graphs were created using the R program, based on implementation in Remington et al. [45] by plotting LD (r2) values against the corresponding genetic distances (bp). Genome-wide LD and LD decay were computed for the set including all accessions (445 accessions = cultivated + wild type + non-cultivated) and for the set with only cultivated accessions (417) from the GWAS panel. Chromosome-wise LD gives associations between alleles on the same chromosome. Therefore, LD and LD decay were also computed individually for each of the nine chromosomes across all (445) accessions. The LD decay distance in base pairs was determined when LD declined to half of the maximum r2 value.

Analysis of population structure

In GWAS, population structure can lead to spurious association if not properly corrected [21]. To address this, we employed principal component analysis (PCA) for detection and correction of population structure using the TASSEL software. PCA is an effective approach for population structure study, and it is computationally tractable when GWAS data set is large [46, 47], such as the 56,820 SNPs in our data set. For visualization of population structure, PC1 and PC2 were plotted using the R package ggplot2 [48]. A total of 424 accessions including the five cultivated horticultural types butterhead, crisphead, leaf, romaine, and stem and the wild type, L. serriola were used for the PCA. Other types of accessions with limited numbers (1 to 4) were excluded from the analysis.

Genome-wide association study

Both single-locus and multi-locus models were employed for GWAS using the genotypic data consisting of 56,820 high-quality SNPs and phenotypic data for GR5d_S and SSI_GR5d from the 445 lettuce accessions using the software packages GAPIT version 3 [49] and TASSEL. Phenotypic data was normalized using the rank-based inverse normal transformation, which is widely used for GWAS [40, 50, 51]. The models: General Linear Model (GLM) [46], MLM, MLMM, FarmCPU, and BLINK were implemented using GAPIT. The models GLM and MLM were also implemented using TASSEL. GLM and BLINK use fixed-effect, FarmCPU is a hybrid model that uses both the fixed-effect and the random effect, while MLM and MLMM use a fixed and random effects mixed model. GLM and MLM are single-locus models while FarmCPU, MLMM and BLINK are multi-locus models. To determine how many PCs were needed to correct population structure, three PCs were used in a Bayesian information criterion (BIC)-based model selection procedure implemented in GAPIT. The result indicated that no principal components (PCs) were required to correct population structure in the GWAS models. Accordingly, all GWAS models in GAPIT were run without any PCs. However, the two models in TASSEL were run using the default settings that uses 5 PCs. Manhattan plot was used for visualizing and identifying significant SNPs above the threshold level, while Q-Q plot was used to evaluate model fitness. GAPIT created Manhattan and Q-Q plots were used for the models implemented by this software, while for the models implemented in TASSEL, the Manhattan and Q-Q plots were created from the TASSEL association analysis output files using the R package “qqman” [52]. Bonferroni corrected P value threshold (0.05/56820 SNPs; -log10(P) = 6.06) was used for identifying significant SNPs in the marker-trait association. Further, a significant SNP was considered stable if it was detected by at least two models or software packages or was identified for both the traits, since the two traits are essentially the same.

Favorable allele effect

To determine which allele of each of the significant SNPs contributed favorably to salt tolerance at the seed germination stage, the average trait value of the two types of homozygous genotypes (homozygous for one or the other allele) at a SNP locus was compared using Kruskal-Wallis test [53]. Heterozygous genotypes were excluded in this calculation to avoid complications arising from any dominance or heterotic effect.

Identification of putative candidate genes

We used the National Center for Biotechnology Information (NCBI) BLAST to identify putative candidate genes. Considering the presence of highly associated SNPs within the LD decay physical distance in our study and with an objective of detecting the closest genes to the highly associated SNPs, following the recommendations in the article by Alqudah et al. [54], we searched the putative candidate genes within a genomic region spanning 100 Kb upstream and downstream of each significant SNP locus. Both the v8 and v11 (Reyes-Chin-Wo et al. [42]; genome assembly: Last_Salinas_v11) versions of the lettuce reference genomes were used for the candidate gene search.

Results

Variability and correlation

Mean, standard error (S.E.), and range for the germination related traits measured under control (water) and salt stress conditions are given in Table 1. Mean GR in water, 2 days post seeding (GR2d_W) and 5 days post seeding (GR5d_W) were 97% and 99%, respectively, while the ranges were 21% to 100% and 84% to 100%, respectively. Mean GR under salt stress condition, 2 days post seeding (GR2d_S) and 5 days post seeding (GR5d_S) were 62% and 77%, respectively, while the range was 0% to 100% for both traits. Mean and ranges for the SSI, 2 days post seeding (SSI_GR2d) were 1.02 and -0.18 to 2.81 while those for the SSI, 5 days post seeding (SSI_GR5d) were 1.0 and -0.26 to 4.38, respectively. For SSI, the lower the value the better the salinity tolerance for that genotype, since SSI is a susceptibility index. The SSI has been used to evaluate genotype performance under stress and non-stress conditions by several other studies [34, 5557].

thumbnail
Table 1. Mean, standard error (S.E.), and range for seed germination related traits of 445 lettuce accessions evaluated in germination chamber under control (water) and salt stress (100 mM NaCl) conditions.

https://doi.org/10.1371/journal.pone.0308818.t001

Mean and standard errors for GR5d_S and SSI_GR5d of the four major horticultural types (butterhead, crisphead, leaf and romaine) are presented by bar graphs (Fig 1). Under salt stress condition, the romaine type had the highest mean GR5d_S of 85%, while the butterhead type had the lowest mean GR5d_S of 72%. The leaf and the crisphead types had mean GR5d_S of 81% and 76%, respectively. Similarly, the mean SSI_GR5d was lowest (0.65) for the romaine type and highest (1.2) for the butterhead type, indicating the highest salinity tolerance at the seed germination stage for the romaine type. ANOVA indicated significant differences (P < 0.001) among the 445 lettuce accessions for GR2d_S, GR5d_S, SSI_GR2d, and SSI_GR5d (S2 Table). Genetic variance ranged from 0.695 for GR5d_S to 0.799 for SSI_GR2d. Broad-sense heritabilities were 0.92 and 0.93 for the traits SSI_GR5d and SSI_GR2d, respectively, and 0.95 for both GR5d_S and GR2d_S (S2 Table).

thumbnail
Fig 1. Bar graphs of the means for the two traits for the four major horticultural types of lettuce.

A) Mean germination rate under salt stress, 5 days post seeding (GR5d_S), B) Mean salinity susceptibility index, 5 days post seeding (SSI_GR5d). Error bars indicate the standard errors of the means.

https://doi.org/10.1371/journal.pone.0308818.g001

Correlation coefficients among the seed germination-related traits evaluated under control and salinity stress conditions are given in Table 2. All 15 correlations among the traits were significant at P < 0.01. The coefficients/values were higher for some correlations than for others. For instance, the correlation coefficient between GR2d_W and GR5d_W was 0.705, while it was 0.869 between GR2d_S and GR5d_S. Correlation coefficients between GR in water and GR under salt stress were lower (r = 0.195 to r = 0.319). This discrepancy can be attributed to the fact that most accessions germinated well under control (water) condition, while germination rates varied highly under salt stress conditions due to the differences in salt tolerance among the accessions. This can be observed from the lower mean value and wider range for germination rates under salt stress than under control condition (Table 1). The correlation coefficient between GR5d_S and SSI_GR5d was r = -0.999 (R2 = 0.998), while that of between GR2d_S and SSI_GR2d was r = -0.998 (R2 = 0.996). These later two correlation coefficients were high but negative since salinity tolerance implies higher germination rate but lower salinity susceptibility index.

thumbnail
Table 2. Correlation coefficients among several seed germination related traits evaluated under control (water) and salt stress (100 mM NaCl) conditions in lettuce.

https://doi.org/10.1371/journal.pone.0308818.t002

Population structure

We employed PCA to study population structure using 424 accessions including the five cultivated horticultural types and the wild type, L. serriola lettuce (Fig 2). The ellipses around the clusters in the scatterplot represented a 95% confidence level for a bi-variate t-distribution. PC1 and PC2 accounted for 11.1% and 5.4% of the total variation, respectively, collectively accounting for 16.5% of the total variation. The accessions clustered predominantly based on their horticultural types. The cluster of the leaf type accessions showed complete overlap with the L. serriola type cluster and nearly complete overlap with the stem type. The leaf type also exhibited considerable overlap with the romaine type and some overlap with the butterhead type. Although the number of accessions from L. serriola and stem types were smaller compared to the other four horticultural types in this study, the clustering indicated a genetic closeness of the leaf type to the L. serriola and the stem types. There was no overlap between the butterhead and romaine types, however, some genetic closeness was apparent among the leaf, romaine and butterhead types (Fig 2). In contrast, the crisphead type did not overlap with any other types, indicating that it was relatively genetically distant from the other horticultural types of lettuce. It has been reported that romaine lettuce developed from the wild lettuce while leaf, butterhead, and Batavia-crisphead types originated from the romaine type and the stem type was later derived from the leaf type [6]. Even later, the iceberg type of crisphead lettuce was developed from the Batavia type in the U.S. in the 1940s [8]. Overall, the PCA plot indicated presence of population stratification among the 424 lettuce accessions in our GWAS panel.

thumbnail
Fig 2. PCA plot of the five horticultural types and the wild-type lettuce in the GWAS panel.

Butterhead (red, round), crisphead (green, triangle), leaf (green, square), romaine (blue, plus), L. serriola (blue, square cross), stem (blue, star). Ellipses indicate 95% confidence level for a bi-variate t-distribution.

https://doi.org/10.1371/journal.pone.0308818.g002

Genome-wide SNP distribution

The total number of genome-wide SNPs for this study was 56,820 high-quality SNPs with a filtering criteria of minimum call rate ≥ 50% and MAF ≥ 5%. These SNPs were identified using lettuce reference genome version v8. The size of this reference genome is 2.37 Gb. Thus, the average genome-wide SNP coverage was 24 SNPs/Mb. Chromosome-wise distributions of the SNPs on nine lettuce chromosomes are given in Table 3 and Fig 3. Chromosome size varied from 193.11 Mb (chromosome 6) to 399.64 Mb (chromosome 5). The number of SNPs on individual chromosomes ranged from 4,189 on chromosome 6 to 10,112 on chromosome 4. SNP density on individual chromosomes ranged from 21.43/Mb on chromosome 5 to 26.78/Mb on chromosome 4.

thumbnail
Fig 3. SNP density plot on 9 lettuce chromosomes showing number of SNPs within 1 Mb window size.

The x-axis represents chromosome length in Mb. Plotted by https://www.bioinformatics.com.cn/en, a free online platform for data analysis and visualization.

https://doi.org/10.1371/journal.pone.0308818.g003

thumbnail
Table 3. Chromosome-wise SNP distribution on nine lettuce chromosomes in reference genome v8.

https://doi.org/10.1371/journal.pone.0308818.t003

Linkage disequilibrium and LD decay

A total of 342,637 pairwise LD values were found to be significant (P ≤0.05) based on the genome-wide LD for the set of 445 accessions with an average LD of 0.407, while 327,466 pairwise LD values were significant (P ≤0.05) for the set of 417 cultivated accessions with an average LD of 0.435 (Table 4). The genome-wide LD for the set of 445 accessions decayed to half of its maximum value at a distance of 290.8 Kb, while the genome-wide LD for the set of 417 cultivated accessions decayed to half at 351.8 Kb (Fig 4). This difference can be attributed to the presence of seven wild type (L. serriola) and 21 non-cultivated lettuce in the set of 445 accessions. The pairwise LD comparison for individual chromosomes showed that chromosome 4 had the maximum number of 72,572 significant (P ≤0.05) LD values, with an average LD of 0.423, whereas chromosome 6 had the minimum number of 11,816 significant (P ≤0.05) LD values with an average LD of 0.372 (Table 4). Chromosome 6 had the fastest LD decay to half of its maximum at 89.3 Kb, while chromosome 4 had the slowest LD decay at 403.7 Kb (S1 Fig).

thumbnail
Fig 4. Linkage disequilibrium (LD) and LD decay graphs, based on the whole genome of lettuce.

A) For the 445 accessions including cultivated, wild, and non-cultivated types, and B) for the 417 cultivated accessions of lettuce. LD decayed to half of its maximum at 290818 bp for the 445 accessions, and at 351755 bp for the 417 accessions.

https://doi.org/10.1371/journal.pone.0308818.g004

thumbnail
Table 4. Chromosome-wise (445 accessions) and genome-wide linkage disequilibrium (LD) and LD decay to half of its maximum in the lettuce GWAS panel.

https://doi.org/10.1371/journal.pone.0308818.t004

Genome-wide association study

Using the 56,820 high-quality SNPs, GWAS was performed on the two traits (GR5d_S and SSI_GR5d) related to salt tolerance at the seed germination stage using five GWAS models and two software packages: GAPIT (GLM, MLM, MLMM, FarmCPU, BLINK) and TASSEL (GLM, MLM). Phenotypic data was normalized using the rank-based inverse normal transformation. Distribution of the residuals from GWAS using MLM in TASSEL are presented in S2 and S3 Figs, respectively, for the two traits GR5d_S and SSI_GR5d. Although the lettuce accessions used in the present study indicated presence of population structure (Fig 2), the BIC-based model selection procedure in GAPIT indicated that no PCs were required for correction of population structure in the GWAS models for both traits (S3 Table). However, the BIC values for 0, 1, 2 and 3 PCs were not very different from each other, therefore, we conducted GWAS using GAPIT with 0, 1, 2, and 3 PCs in the GWAS models for comparison of the results (S4 and S5 Tables). Comparison of the results for both traits indicated that identified SNPs were more consistent across models when no PCs were provided in the GWAS models. Therefore, we used the results from the analyses using no PCs in the GWAS models in GAPIT (S4 and S5 Tables). We also conducted GWAS using the GLM and MLM models in TASSEL with the default parameters that uses 5 PCs in the GWAS model, and the results are presented in S6 and S7 Tables, respectively, for the traits GR5d_S and SSI_GR5d.

The Manhattan plots and the Q-Q plots from GWAS for all five models using GAPIT are given respectively, in the Figs 5 and 6 for the trait GR5d_S and in the Figs 7 and 8 for the trait SSI_GR5d, while the Manhattan plots and the Q-Q plots for the models GLM and MLM using TASSEL are given in the Figs 9 and 10, respectively, for the traits GR5d_S and SSI_GR5d. For the trait GR5d_S, when GAPIT was used, GLM identified 122 significant SNPs spanning over all nine lettuce chromosomes, MLM identified six significant SNPs on chromosome 7, MLMM identified one SNP on chromosome 7, FarmCPU identified a total of seven significant SNPs on chromosomes 2, 3, 4, 7, and 8, while BLINK identified one SNP on chromosome 2, three SNPs on chromosome 4, and one SNP on chromosome 7 (S4 Table, Fig 5). For GR5d_S, when TASSEL was used, GLM identified 94 significant SNPs on chromosomes 3 through 9, while MLM identified six SNPs on chromosome 7 (S6 Table, Fig 9). It should be noted here that although no PCs were used for MLM in GAPIT and 5 PCs were used for MLM in TASSEL, the same six significant SNPs were identified by both software packages (S4 and S6 Tables). For the trait SSI_GR5d, when GAPIT was used, GLM identified 134 significant SNPs spanning over all nine lettuce chromosomes, MLM identified six significant SNPs on chromosome 7, MLMM identified one SNP on chromosome 7, FarmCPU identified a total of five significant SNPs on chromosomes 2, 4, 7, and 8, while BLINK identified the same four SNPs as for the trait GR5d_S, on chromosomes 2 and 4 and a different SNP on chromosome 7 (S5 Table, Fig 7). For SSI_GR5d, when using TASSEL, GLM identified 72 SNPs on chromosomes 3 through chromosome 9, while MLM identified the same six SNPs on chromosome 7 as for GR5d_S (S7 Table, Fig 10). For GLM, Q-Q plots from both GAPIT and TASSEL, for both the traits showed that the line for the observed P-values deviated strongly from the line for the expected P-values (the diagonal line), indicating that the confounding effects of population structure, family structure and cryptic relatedness were not adequately corrected by the GLM (Figs 6, 810). Thus, although the GLM identified over 100 significant SNPs for both the traits, many of these SNPs might be spurious. On the other hand, the Q-Q plots for the rest of the models for both the traits showed a substantial reduction of the deviation of the observed P-values, with only minor deviations towards the tail, suggesting that the models MLM, MLMM, FarmCPU and BLINK effectively corrected for the confounding effects and well-suited to our dataset. It should be noted here that the BIC-based model selection results suggested that no PC input was required for correcting population structure in the GWAS models in GAPIT. Thus, although the PCA in our study detected population structure in our GWAS panel, the population structure might not be strong and the GWAS models were able to effectively remove the confounding effects through the kinship matrix. Therefore, we identified significant SNPs based on the results from the four models MLM, MLMM, FarmCPU and BLINK. SNP significance was based on the Bonferroni correction threshold at α = 0.05, -log10(P) = 6.06. A significant SNP was considered stable if it was detected by at least two models or software packages or it was identified for both the traits, since the two traits were essentially the same. Based on this criteria, 10 stable SNPs were identified for both the traits (Table 5).

thumbnail
Fig 5. Manhattan plots of the GWAS results for the trait germination under salt stress, 5 days post seeding (GR5d_S) using GAPIT.

The genome-wide association study (GWAS) results from the GAPIT software package involved 445 lettuce accessions and 56,820 SNPs. The five plots from top to bottom are based on the GWAS models: general linear model (GLM), mixed linear model (MLM), Multiple Locus Mixed Linear Model (MLMM), Fixed and random model Circulating Probability Unification (FarmCPU), and Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK). The X-axis shows the genomic positions of the SNPs and the Y-axis shows the negative log base 10 of the P-values. Each of the nine lettuce chromosomes are represented with different colors. The horizontal line represents the Bonferroni correction threshold for significant marker-trait association.

https://doi.org/10.1371/journal.pone.0308818.g005

thumbnail
Fig 6.

Quantile-quantile (Q-Q) plots of the GWAS results for the trait germination under salt stress, 5 days post seeding (GR5d_S) using GAPIT. The genome-wide association study (GWAS) results from the GAPIT software package involved 445 lettuce accessions and 56,820 SNPs. The Q-Q plots are for the five GWAS models: general linear model (GLM), mixed linear model (MLM), Multiple Locus Mixed Linear Model (MLMM), Fixed and random model Circulating Probability Unification (FarmCPU), and Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK). The Y-axis and X-axis represent, respectively, the observed and the expected, negative log base 10 of the P-values. The dotted lines represent the 95% confidence interval for the Q-Q plot under the null hypothesis of no association between the SNP and the trait.

https://doi.org/10.1371/journal.pone.0308818.g006

thumbnail
Fig 7.

Manhattan plots of the GWAS results for the trait salinity susceptibility index, 5 days post seeding (SSI_GR5d) using GAPIT. The genome-wide association study (GWAS) results from the GAPIT software package involved 445 lettuce accessions and 56,820 SNPs. The five plots from top to bottom are based on the GWAS models: general linear model (GLM), mixed linear model (MLM), Multiple Locus Mixed Linear Model (MLMM), Fixed and random model Circulating Probability Unification (FarmCPU), and Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK). The X-axis shows the genomic positions of the SNPs and the Y-axis shows the negative log base 10 of the P-values. Each of the nine lettuce chromosomes are represented with different colors. The horizontal line represents the Bonferroni correction threshold for significant marker-trait association.

https://doi.org/10.1371/journal.pone.0308818.g007

thumbnail
Fig 8.

Quantile-quantile (Q-Q) plots of the GWAS results for the trait salinity susceptibility index, 5 days post seeding (SSI_GR5d) using GAPIT. The genome-wide association study (GWAS) results from the GAPIT software package involved 445 lettuce accessions and 56,820 SNPs. The Q-Q plots are for the five GWAS models: general linear model (GLM), mixed linear model (MLM), Multiple Locus Mixed Linear Model (MLMM), Fixed and random model Circulating Probability Unification (FarmCPU), and Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK). The Y-axis and X-axis represent, respectively, the observed and the expected, negative log base 10 of the P-values. The dotted lines represent the 95% confidence interval for the Q-Q plot under the null hypothesis of no association between the SNP and the trait.

https://doi.org/10.1371/journal.pone.0308818.g008

thumbnail
Fig 9. Manhattan and quantile-quantile (Q-Q) plots of the GWAS results for the trait germination under salt stress, 5 days post seeding (GR5d_S) using TASSEL.

The genome-wide association study (GWAS) results from the TASSEL software package involved 445 lettuce accessions and 56,820 SNPs. Panel A) Based on the general linear model (GLM), Panel B) Based on the mixed linear model (MLM). The upper and the lower horizontal lines in the Manhattan plots represent the Bonferroni and the Benjamini-Hochberg correction thresholds for significant marker-trait associations, respectively. For the Q-Q plots, the Y-axis and X-axis represent, respectively, the observed and the expected, negative log base 10 of the P-values.

https://doi.org/10.1371/journal.pone.0308818.g009

thumbnail
Fig 10. Manhattan and quantile-quantile (Q-Q) plots of the GWAS results for the trait salinity susceptibility index, 5 days post seeding (SSI_GR5d) using TASSEL.

The genome-wide association study (GWAS) results from the TASSEL software package involved 445 lettuce accessions and 56,820 SNPs. Panel A) Based on the general linear model (GLM), Panel B) Based on the mixed linear model (MLM). The upper and the lower horizontal lines in the Manhattan plots represent the Bonferroni and the Benjamini-Hochberg correction thresholds for significant marker-trait associations, respectively. For the Q-Q plots, the Y-axis and X-axis represent, respectively, the observed and the expected, negative log base 10 of the P-values.

https://doi.org/10.1371/journal.pone.0308818.g010

thumbnail
Table 5. List of the ten stable significant SNPs associated with the two traits GR5d_S and SSI_GR5d related to salt tolerance at the seed germination stage in lettuce identified in the present study.

https://doi.org/10.1371/journal.pone.0308818.t005

Based on the SNP positions, we designated the 10 stable SNPs as Lsv8Lg2-109644094, Lsv8Lg4-38095762, Lsv8Lg4-286540234, Lsv8Lg4-374691029, Lsv8Lg7-50257083, Lsv8Lg7-50257096, Lsv8Lg7-50257104, Lsv8Lg7-50552408, Lsv8Lg7-51509072, and Lsv8Lg7-51616430, where Ls = Lactuca sativa, v8 = version 8 of the reference genome, Lg = chromosome, followed by chromosome number and SNP location. Although both BLINK and FarmCPU identified the four SNPs, Lsv8Lg2-109644094, Lsv8Lg4-38095762, Lsv8Lg4-286540234, and Lsv8Lg4-374691029 as significant SNPs, the percent phenotypic variation explained (PVE) by these SNPs in FarmCPU were of magnitude zero to very small, except for Lsv8Lg4-374691029, where the PVE were 8% and 11% for the traits GR5d_S and SSI_GR5d, respectively (S4 and S5 Tables). Thus, a SNP with a very small PVE% was not included in deciding the stable SNPs, despite its significance. The six significant SNPs identified by the single-locus model MLM were located within ~ 1.36 Mb on chromosome 7. However, each of the three multi-locus models MLMM, FarmCPU and BLINK identified only one of the six SNPs. The MLMM identified the SNP Lsv8Lg7-51509072 for both the traits, while FarmCPU and BLINK both identified the SNP Lsv8Lg7-50257083 for the trait GR5d_S and the SNP Lsv8Lg7-50257096 for the trait SSI_GR5d. Thus, most likely these six SNPs indicated the same QTL. Considering the six SNPs on chromosome 7 indicating the same QTL, the 10 stable SNPs were associated with five QTLs in the present study (Table 6). Estimates of the PVE% by different models and software packages varied. For example, the average PVE% for the two traits for the SNP Lsv8Lg4-374691029 was 10% and 2%, reported respectively, by FarmCPU and BLINK. For the MLM model, TASSEL consistently reported a 6% to 7% average PVE for each of the six SNPs on chromosome 7, while GAPIT reported a 0% to 3% PVE for these SNPs (S4S7 Tables). Since the PVE% estimated by the BLINK and the MLMM models were consistent for the two traits, we reported the average PVE% of the two traits for each of the five QTLs estimated by these two models (Table 6). The average PVE ranged from 2% for the QTL qSTgs4.3 to 6% for the QTL qSTgs7.1, indicating the quantitative nature of these two salt tolerance traits GR5d_S and SSI_GR5d.

thumbnail
Table 6. List of the five QTLs associated with the two traits GR5d_S and SSI_GR5d related to salt tolerance at the seed germination stage in lettuce identified in the present study.

https://doi.org/10.1371/journal.pone.0308818.t006

Mapping SNPs from lettuce reference genome version v8 to reference genome version v11

Variant calls for genotyping for the current project were performed using the latest lettuce reference genome version, v8, available at that time. However, the most recent lettuce reference genome version, v11, was recently submitted to NCBI while we were preparing this manuscript. Total genome size of the reference genomes v8 and v11 are 2.4 Gb and 2.6 Gb, respectively. The 10 significant and stable SNPs identified in the reference genome v8 in the present study were mapped to reference genome v11 (Table 7) and were confirmed by alignment of 2 Kb (±1 Kb) sequence length flanking each SNP site in v8 with the corresponding SNP site in v11. Three of the six SNP loci 50257083, 50257096 and 50257104 on chromosome 7 were within 21 base pairs in the reference genome v8, and the corresponding SNP loci 52319139, 52319152, and 52319160 in the reference genome v11 were also within 21 base pairs. However, there were differences in distances between reference genome v8 and v11 for the rest of the SNP loci. These disparities arise because the v11 reference genome is larger, incorporating additional nucleotides between these SNP loci (Table 7).

thumbnail
Table 7. Mapping of the ten stable SNPs on lettuce chromosomes 2, 4 and 7 from reference genome v8 to v11.

https://doi.org/10.1371/journal.pone.0308818.t007

Favorable allele effect

The allele effect of each of the stable SNPs was determined for the two traits GR5d_S and SSI_GR5d by comparing the average trait values of the two types of homozygous genotypes (homozygous for one allele or the other) at the SNP locus and the significance of the difference was determined by the Kruskal-Wallis test. Genotypes homozygous for minor allele and major allele were designated as minor and major genotypes, respectively (Table 8). The SNP Lsv8Lg2-109644094 had 401 major genotypes, 44 heterozygotes, but no minor genotype. The SNP Lsv8Lg4-38095762 had a minor allele frequency of 0.08, and the average GR5d_S for the minor and major genotypes were 95% and 75%, respectively, and the average GR5d_S of the minor genotype was significantly (P <0.001) higher (by 20%) than that of the major genotype based on Kruskal-Wallis test (Table 8). Thus, the minor allele of this SNP was favorable. Similarly, the average SSI_GR5d for the minor and major genotypes of this SNP were 0.19 and 1.06, respectively, and the average SSI_GR5d for the minor genotype was significantly (P <0.001) lower (by 0.87) than that of the major genotype (lower value is better for SSI). Based on similar comparisons, the minor alleles were favorable for each of the six SNPs on chromosome 7, while the major alleles were favorable for the SNPs Lsv8Lg4-286540234 and Lsv8Lg4-374691029 (Table 8).

thumbnail
Table 8. Minor/major allele contributions to the two salt-tolerance related traits GR5d_S and SSI_GR5d in lettuce.

https://doi.org/10.1371/journal.pone.0308818.t008

Crossing over and recombination of SNP alleles

The six significant SNPs on chromosome 7 were located within a 1.36 Mb region in the reference genome v8. The LD (r2 and D´) among these six SNPs are presented in S8 Table. Crossing over among these six SNPs were also manually evaluated (S9 Table). However, due to heterozygosity, crossing over could not be evaluated for few accessions. The three SNP loci 50257083, 50257096 and 50257104 were within a 21 base pair length in the sequence, and both the r2 and the D´ values of the three LDs among these three SNPs were 1, indicating the SNP pairs are in complete LD. However, one crossover event between SNP loci 50257096 and 50257104 was identified from the accession BM_GBS_25, where SNP locus 50257104 was heterozygous (S9 Table). Out of the 445 accessions in our study, 19 accessions were at heterozygous state for the SNP loci 50257083, 50257096 and 50257104, thus it was not possible to determine if there was any crossing over among the three SNPs for these 19 accessions. This finding indicated that these three SNPs were predominantly inherited as a haplotype, and the allelic effect of each of these SNPs was the same (Table 8). The distance between the SNP loci 50257104 and 50552408 was ~295.3 Kb and four crossovers were observed between these two SNP loci. The distance between SNP loci 50552408 and 51509072 was ~956.7 Kb and eight crossovers were observed between these two SNP loci. There was one crossover between the SNP loci 51509072 and 51616430 and the distance was ~ 107.4 Kb between these two loci. Since there were 445 accessions in the present study, the crossover rate was 2 to 4-fold greater than average crossover rate among the SNP loci 50257104, 50552408, 51509072 and 51616430. Although the D´ values for few of these SNP pairs were 1, the r2 values were less than 1, indicating occurrence of some recombination events among these SNP loci (S8 Table). The genotypes at the six SNP loci of the seven wild type (L. serriola) accessions in the present study are given in S10 Table. Five of the seven accessions had genotype ‘CGCCGT’, while the genotypes of the other two were ‘TATCGT’ and ‘YRYCGT’, where the alleles for the two heterozygous loci were Y = C/T and R = A/G. Out of the 445 accessions in the present study, only two other accessions, a primitive type, and a leaf type, had the genotype ‘CGCCGT’. Other than the seven wild type accessions, there were 21 non-cultivated type and 417 cultivated type of accessions and 262 of these accessions had the genotype ‘CGCAAC’ (major alleles) and 138 accessions had the genotype ‘TATCGT’ (minor alleles) for these six SNP loci (S10 Table). Although the number of L. serriola accessions was limited in this study, based on the available information, it appears that the alternate alleles at all six SNP loci may have evolved from L. serriola and have become common alleles in the cultivated types of lettuce (S9 and S10 Tables). Analyses of the recombination patterns among these six SNPs indicated that the minor/major alleles were in coupling phase linkage (S9 Table).

Identification of putative candidate genes

A total of 25 salt tolerance-related and several other putative/predicted candidate genes were identified through NCBI BLAST search within 100 Kb upstream and downstream of each of the 10 significant SNPs identified in the present study. A list of these 25 putative/predicted salt tolerance-related genes along with brief description of their function/expression is presented in Table 9. The 25 genes were located over all lettuce chromosomes except chromosome 1, with seven genes on chromosome 7, and one to three genes on the rest of the chromosomes, while chromosome location of two of these genes were unknown. Nine of the genes had 100 percent sequence identity within the BLAST search, while the rest of the genes displayed sequence identity ranging from 83 to 98 percent. Four of the identified candidate genes encode the four transcription factors bHLH041, enhanced ethylene response protein 5 (EER5), MYB36, and bZIP from four major transcription factor families in plants.

thumbnail
Table 9. List of 25 salt tolerance-related candidate genes and their functions/expressions.

https://doi.org/10.1371/journal.pone.0308818.t009

Distances of these salt-tolerance related genes from the identified SNPs are presented in Table 10. Two of these genes: the L. sativa cytokinin dehydrogenase 3 gene and the transcription factor EER5 encoding gene were located in the 1.36 Mb region on chromosome 7, harboring the six closely linked SNPs associated with the QTL qSTgs7.1. The L. sativa putative transcription factor bHLH041encoding gene was located 58.4 Kb downstream of the SNP Lsv8Lg7-50257083, while the L. sativa 1-aminocyclopropane carboxylate oxidase 1 gene was located 77.1 Kb upstream of the SNP Lsv8Lg7-51616430. Three other candidate genes were located within <100 Kb from three of the identified SNPs (Table 10).

thumbnail
Table 10. Candidate gene locations relative to identified SNPs.

https://doi.org/10.1371/journal.pone.0308818.t010

Discussion

Breeding lettuce varieties that can germinate and establish in salt-affected soil is essential since lettuce seeds are planted half an inch deep in soil where salinity levels are often the highest in salinity-affected growing regions. In recent years, GWAS has become a very useful tool for identifying markers associated with traits that can be effectively used by plant breeders for developing new and improved crop varieties. In the present research, we conducted a GWAS to identify SNP markers associated with salt tolerance at the seed germination stage in lettuce. A soil salinity level of 100 mM NaCl is about as high as most crop plants tolerate without significant reduction in growth and yield [98] and an earlier study on lettuce seed germination in our laboratory (unpublished data) with 70, 80, and 100 mM NaCl stress conditions indicated highest genetic variability among the tested entries with 100 mM NaCl stress. Therefore, in the present study, we used a salt stress condition of 100 mM NaCl. Through this research, we successfully identified 10 highly significant SNPs associated with five QTLs for salt tolerance at the seed germination stage. GWAS has been successfully applied for discovering marker-trait associations for several traits in lettuce [29, 3033]. However, to our knowledge, this is the first marker-trait association study of any kind on salt tolerance at the seed germination stage in lettuce and is the first report on genome-wide association study of salt tolerance in lettuce. GWAS for salt tolerance at the seed germination stage has been reported in rice, mungbean, flax and Brassica napus [3437].

Some of the most important considerations in GWAS are: genetic diversity within the GWAS panel, sample size, trait diversity, linkage disequilibrium, and population structure [20, 99, 100]. To ensure sufficient genetic diversity in a GWAS panel, mixed populations containing subpopulations of different genetic backgrounds may be suitable, as a simulation study demonstrated that GWAS’ power with a mixed population was generally higher than that of with a separate population [99]. That diverse populations add power to GWAS results has also been shown in GWAS with human populations, where diverse populations facilitated the discovery of novel type 2 diabetes aetiological factors owing to their divergent allele frequency across populations [101]. An earlier study with 441 accessions from the same GWAS panel and with 400 common accessions as the present study found the GWAS panel to be diverse [31]. In that study, PCA clustered the four major horticultural types: butterhead, crisphead, leaf and romaine into separate clusters with some overlap of the butterhead and romaine types with leaf type [31]. Since 45 accessions in the present study were different from the earlier study [31], we also conducted PCA to assess the genetic diversity and population structure of the 445 accessions in the current study. We included the four major horticultural types as well as the stem and wild types of lettuce accessions in this PCA. The PCA revealed clustering of the lettuce accessions according to their horticultural types (Fig 2). Thus, our genetic diversity analysis through PCA confirmed the result of the earlier study [31] that our GWAS panel was diverse. This diversity among groups or subpopulations in our GWAS panel likely contributed to the identification of the 10 significant SNPs in our study.

In human populations, where hundreds of thousands to over a million SNPs are commonly used for GWAS, sample sizes typically exceed 1,000 [102], while in plant populations generally a lower number of SNPs are used, and many successful GWAS studies in plants have been conducted with a sample size of a few hundred [103]. Our GWAS panel consisted of 445 diverse lettuce accessions which is a higher number of accessions than commonly used for GWAS in plants [103]. In the present study, descriptive statistics and ANOVA indicated substantial phenotypic variation for the seed germination-related traits under salt stress (Table 1 and S2 Table). Broad-sense heritability estimates of these traits were high, indicating that most of the phenotypic variation was due to genetic factors (S2 Table).

LD is a measure of the correlation between alleles at different loci. Rapid decay of LD is favorable for association testing of candidate genes that are located near mapped QTLs and have functional relevance to trait variation [45]. In maize, LD generally decayed rapidly with distance (r2 <0.1 within 1.5 Kb), however, rates of decline were highly variable among genes [45]. In general, LD decay is slower in self-pollinated species like lettuce as compared to cross-pollinated species, where recombination rate is much higher [104]. The LD decay in our GWAS panel was comparable to other self-pollinated species and was suitable for GWAS. For example, LD decayed to half of its maximum at approximately 450–500 Kb and 500–550 Kb in desi and kabuli chickpeas, respectively [105]. LD decayed to half of its maximum at ~75 Kb and ~150 Kb, in wild and cultivated soybeans, respectively (106). In a separate study, LD decayed to half of its maximum at 60 Kb and 100 Kb, respectively, in wild and cultivated types of mungbeans [107]. The LD decay examples from soybean and mungbean above showed that the LD decays of the wild types were faster compared to the cultivated type [106, 107]. These results are in congruence with the result in the present study as we observed that the LD decay to half of its maximum for the 445 lettuce accessions (at 290.8 Kb) including 417 cultivated, seven wild, and 21 non-cultivated types was faster than that of the 417 cultivated type accessions (at 351.8 Kb). It should be noted here that the seven wild type accessions in our study were too few to conduct LD analysis and compute LD decay. In a previous study that included 400 common accessions to the present study, LD decay of the four major cultivated horticultural types were computed separately, resulting in LD decay to half of its maximum at 241 Kb, 349 Kb, 388 Kb and 746 Kb, respectively, for leaf, romaine, butterhead and crisphead types [31].

The 10 significant SNPs identified in the present study were the same for both the salt tolerance-related traits GR5_S and SSI_GR5d (Table 5). The two traits were essentially the same, however, the formula for computing SSI compares seed germination rate under salinity stress to germination rate under non-salinity or control (water) condition, thus it reflects the true genetic variability among the lettuce accessions under salt-stress and avoids the confounding effects of loss of seed viability due to seed age, dormancy, or seed being affected by disease. In the absence of such confounding effects, the results from SSI_GR5d and GR5d_S would be the same or similar. However, we used both the traits to compare the results, and because the results from GR5d_S (percent germination) is more interpretable and comprehensible than SSI_GR5d (an index), especially when comparing favorable allele effects using the significant SNPs and phenotypic data.

An earlier study [16], using a RIL population derived from a bi-parental cross, identified 9 QTLs for salt tolerance at the seedling stage in lettuce including a major QTL on chromosome 2 and one major and one minor QTLs located on chromosome 7. The QTL on chromosome 2 (reference genome, v8), mapped to ~ 23.2 Mb upstream of the SNP Lsv8Lg2-109644094/QTL qSTgs2.1, that we identified on chromosome 2. The major QTL on chromosome 7 mapped to ~ 33 Mb upstream of the end SNP Lsv8Lg7-51616430, while the minor QTL mapped to ~ 9.1 Mb downstream of the lead SNP Lsv8Lg7-50257083 of the QTL qSTgs7.1 in our study. Thus, the three QTLs were different from the QTLs identified in the present study. The locations of the four seedling-stage salt tolerance-related QTLs on lettuce chromosome 7, identified in a separate study [17] were also different as compared to the locations of the five QTLs in the present study. Thus, the five QTLs identified in our study are novel QTLs for salt tolerance in lettuce.

Minor alleles were favorable for all six significant SNP loci on chromosome 7 and for one SNP on chromosome 4, while major alleles were favorable for two SNPs on chromosome 4, for both the traits GR5d_S and SSI_GR5d. This favorable allele information will be useful for breeding lettuce for salinity tolerance. Favorable effect of minor allele for salt tolerance related SNPs/haplotype at the seed germination stage were reported in rice [34]. Evaluation of crossing over and recombination among the six SNP loci in the present study revealed that the three SNPs, Lsv8Lg7-50257083, Lsv8Lg7-50257096 and Lsv8Lg7-50257104 predominantly inherited as a haplotype, whereas there were 2- to 4-fold greater than average crossing over among the four SNPs, Lsv8Lg7-50257104, Lsv8Lg7-50552408, Lsv8Lg7-51509072, and Lsv8Lg7-51616430. Greater than average recombination may arise for the benefit of creating genetic diversity through genomic rearrangement along with selection for maintaining favorable gene recombination [108]. Evaluation of the recombination patterns among these six SNPs also indicated that the minor/major alleles in these SNP loci were in coupling phase linkage (S9 Table).

Employing five models across two software packages (GAPIT and TASSEL), the present study demonstrated the advantages of using multiple models and software for GWAS in identifying marker-trait associations. This approach strengthens confidence in the results, aligning with the GAPIT user manual, which recommends validating findings with other software like TASSEL. In the present study, we identified 25 candidate genes with known salt tolerance-related functions including four genes encoding transcription factors from four major transcription factor families in plants. Extensive literature search on the function and expression analysis of these genes revealed their important roles in salt tolerance and potential involvement in other abiotic stress tolerance such as drought and cold tolerance (Table 9). These findings highlight their potential for further validation and application in marker-assisted selection for salt tolerance in lettuce.

Conclusion

Soil salinity is one of the most important abiotic stress factors limiting crop production globally. For successful germination and establishment in the salinity-affected growing region it is important to breed lettuce varieties with salt tolerance at the seed germination stage. To our knowledge, this is the first study on GWAS to identify genomic regions associated with salt tolerance in lettuce and the first marker-trait association study of any kind for salt tolerance at the seed germination stage in lettuce. Ten significant marker-trait associations representing five novel QTLs for salt tolerance at the seed germination stage were identified in the current GWAS. Favorable alleles for these SNPs were also identified. Based on these SNP locations, twenty-five salt-tolerance related candidate genes including four genes encoding for transcription factors from four major transcription factor families in plants were identified. Collectively the findings in the current study will help develop molecular markers related to salt tolerance at the seed germination stage and thereby help breeding improvement of lettuce with enhanced salt tolerance at the seed germination stage.

Supporting information

S1 File. Phenotypic data used in this study.

https://doi.org/10.1371/journal.pone.0308818.s001

(XLSX)

S1 Fig. Chromosome-wise linkage disequilibrium (LD) and LD decay for the nine lettuce chromosomes.

LD and LD decay involving 56,820 SNPs and 445 lettuce accessions. LD decayed to half of its maximum at 261664 bp, 199542 bp, 349541 bp, 403689 bp, 259715 bp, 89382 bp, 232815 bp, 332532 bp, and 328898 bp for the chromosomes 1, 2, 3, 4, 5, 6, 7, 8 and 9, respectively.

https://doi.org/10.1371/journal.pone.0308818.s002

(DOCX)

S2 Fig. Distribution of residuals for the trait germination rate under salt stress, 5 days post seeding (GR5d_S).

Residuals distribution for the trait GR5d_S from GWAS using mixed linear model (MLM) in the TASSEL software package.

https://doi.org/10.1371/journal.pone.0308818.s003

(TIF)

S3 Fig. Distribution of residuals for the trait salinity susceptibility index, 5 days post seeding (SSI_GR5d).

Residuals distribution for the trait SSI_GR5d from GWAS using mixed linear model (MLM) in the TASSEL software package.

https://doi.org/10.1371/journal.pone.0308818.s004

(TIF)

S1 Table. List of the 445 Lactuca accessions in the GWAS panel.

https://doi.org/10.1371/journal.pone.0308818.s005

(XLSX)

S2 Table. Analysis of variance (mean squares), genetic and phenotypic variance, and broad-sense heritability.

https://doi.org/10.1371/journal.pone.0308818.s006

(DOCX)

S3 Table. Bayesian information criterion (BIC)-based model selection results.

https://doi.org/10.1371/journal.pone.0308818.s007

(DOCX)

S4 Table. Comparison of results with 0, 1, 2 and 3 PCs in the GWAS models for the trait GR5d_S.

https://doi.org/10.1371/journal.pone.0308818.s008

(XLSX)

S5 Table. Comparison of results with 0, 1, 2 and 3 PCs in the GWAS models for the trait SSI_GR5d.

https://doi.org/10.1371/journal.pone.0308818.s009

(XLSX)

S6 Table. Significant SNPs identified for the trait GR5d_S by the TASSEL software.

https://doi.org/10.1371/journal.pone.0308818.s010

(XLSX)

S7 Table. Significant SNPs identified for the trait SSI_GR5d by the TASSEL software.

https://doi.org/10.1371/journal.pone.0308818.s011

(XLSX)

S8 Table. Linkage disequilibrium (r2), and D’ between pairs of the six significant SNPs on chromosome 7.

https://doi.org/10.1371/journal.pone.0308818.s012

(XLSX)

S9 Table. Evaluation of crossing over/recombination and coupling phase linkage.

https://doi.org/10.1371/journal.pone.0308818.s013

(XLSX)

S10 Table. Seven L. serriola accession genotypes at the six significant SNP loci on chromosome 7.

https://doi.org/10.1371/journal.pone.0308818.s014

(XLSX)

Acknowledgments

We gratefully acknowledge the technical assistance of Phi Diep, Jorge Escutia, and Carlos Saavedra.

Disclaimer: The USDA is an equal opportunity provider and employer. Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture.

References

  1. 1. Simko I, Hayes RJ, Mou B, McCreight JD. Lettuce and Spinach. In: Smith S, Diers B, Specht J, Carver B, editors. Yield gains in major US field crops. Am Soc Agron Inc., Crop Sci Soc Am Inc., Soil Sci Soc Am Inc. 2014. p. 53–86. https://doi.org/10.2135/cssaspecpub33.c4.
  2. 2. Ayers AD, Wadleigh CH, Bernstein L. Salt tolerance of six varieties of lettuce. Proc Am Soc Hort Sci. 1951;57: 237–41.
  3. 3. Reinelt P. Seawater intrusion policy analysis with a numerical spatially heterogeneous dynamic optimization model. Water Resour Res. 2005;41: W05006. https://doi.org/10.1029/2004WR003111.
  4. 4. Lindqvist K. On the origin of cultivated lettuce. Hereditas. 1960;46: 319–50.
  5. 5. de Vries IM. Crossing experiments of lettuce cultivars and species (Lactuca sect. Lactuca, Compositae). Plant Syst Evol. 1990;171: 233–48.
  6. 6. de Vries IM. Origin and domestication of Lactuca sativa L. Genet Resour CropEvol. 1997;44: 165–74.
  7. 7. Kesseli R, Ochoa O, Michelmore R. Variation at RFLP loci in Lactuca spp. and origin of cultivated lettuce (L. sativa), Genome 1991;34: 430–6.
  8. 8. Mou B. Lettuce. In: Prohers J, Nuez F, editors. Handbook of Plant Breeding, Vegetables I, Asteraceae, Brassicaceae, Chenopodicaceae, and Cucurbitaceae. Springer: New York: Springer; 2008. p. 75–116.
  9. 9. Munns R, Tester M. Mechanisms of salinity tolerance. Annu Rev Plant Biol. 2008;59: 651–81. pmid:18444910
  10. 10. Odegbaro OA, Smith OE. Effects of kinetin, salt concentration, and temperature on germination and early seedling growth of Lactuca sativa L. J Am Soc Hort Sci.1969;94: 167–70.
  11. 11. de la Reguera E, Veatch J, Gedan K, Tully KL. The effects of saltwater intrusion on germination success of standard and alternative crops. Env Exp Bot. 2020;180: 104254. https://doi.org/10.1016/j.envexpbot.2020.104254.
  12. 12. Baz H, Creech M, Chen J, Gong H, Bradford K, Huo H. Water-soluble carbon nanoparticles improve seed germination and post-germination growth of lettuce under salinity stress. Agronomy 2020;10(8): 1192. https://doi.org/10.3390/agronomy10081192.
  13. 13. Kaya C, Higgs D, Sakar E. Response of two leafy vegetables grown at high salinity to supplementary potassium and phosphorous during different growth stages. J Plant Nutr. 2002;25: 2663–2676.
  14. 14. Nasri N, Kaddour R, Rabhi M, Plassard C, Lachaal M. Effect of salinity on germination, phytase activity and phytate content in lettuce seedling. Acta Physiol Plant. 2011;33: 935–942. https://doi.org/10.1007/s11738-010-0625-4.
  15. 15. Xu C, Mou B. Evaluation of lettuce genotypes for salinity tolerance. HortSci. 2015;50: 1441–46.
  16. 16. Wei Z, Julkowska MM, Laloe J-O, Hartman Y, de Boer G-J, Michelmore RW, et al. A mixed-model QTL analysis for salt tolerance in seedlings of crop-wild hybrids of lettuce. Mol Breed. 2014;34: 1389–1400. https://doi.org/10.1007/s11032-014-0123-2.
  17. 17. Uwimana B, Smulders MJM, Hooftman DAP, Danny AP, Hartman Y, van Tienderen PH, et al. Hybridization between crops and wild relatives: the contribution of cultivated lettuce to the vigour of crop-wild hybrids under drought, salinity and nutrient deficiency conditions. Theor Appl Genet. 2012;125: 1097–1111. pmid:22660630
  18. 18. Nordborg M, Tavare S. Linkage disequilibrium: what history has to tell us. Trends Genet. 2002;18(2): 83–90. pmid:11818140
  19. 19. Ikegawa S. A short history of the genome-wide association study: where we were and where we are going. Genomics Inform. 2012;10(4): 220–25. pmid:23346033
  20. 20. Mackay I, Powell W. Methods for linkage disequilibrium mapping in crops. Trends Plant Sci. 2007;12(2): 57–63. pmid:17224302
  21. 21. Tian C, Gregerson PK, Seldin MF. Accounting for ancestry: population substructure and genome-wide association studies. Hum Mol Genet. 2008;17: R143–R150. pmid:18852203
  22. 22. Sul JH, Martin LS, Eskin E. Population structure in genetic studies: Confounding factors and mixed models. PLoS Genet. 2018;14(12): e1007309. pmid:30589851
  23. 23. Yu J, Pressoir G, Briggs WH, Bi IV, Yamasaki M, Doebley JF, et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006;38: 203–08. pmid:16380716
  24. 24. Sul JH, Eskin E. Mixed models can correct for population structure for genomic regions under selection. Nat Rev Genet. 2013;14: 300. pmid:23438871
  25. 25. Liu X, Huang M, Fan B, Buckler ES, Zhang Z. Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLoS Genet. 2016;12: e1005767. pmid:26828793
  26. 26. Rakitsch B, Lippert C, Stegle O, Borgwardt K. A Lasso multi-marker mixed model for association mapping with population structure correction. Bioinformatics. 2013;29(2): 206–14. pmid:23175758
  27. 27. Segura V, Vilhjalmsson BJ, Platt A, Korte A, Seren U, Long Q, et al. An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations. Nat Genet. 2012;44: 825–30. pmid:22706313
  28. 28. Huang M, Liu X, Zhou Y, Summers RM, Zhang Z. BLINK: A package for the next level of genome-wide association studies with both individuals and markers in the millions. Gigascience. 2019;8(2): giy154. pmid:30535326
  29. 29. Kwon S, Simko I, Hellier B, Mou B, Hu J. Genome-wide association of 10 horticultural traits with expressed sequence tag-derived SNP markers in a collection of lettuce lines. Crop J. 2013;1(1): 25–33. https://doi.org/10.1016/j.cj.2013.07.014.
  30. 30. Kandel JS, Peng H, Hayes RJ, Mou B, Simko I. Genome-wide association mapping reveals loci for shelf life and developmental rate of lettuce. Theor Appl Genet. 2020;133: 1947–66. pmid:32123958
  31. 31. Park S, Kumar P, Shi A, Mou B. Population genetics and genome-wide association studies provide insights into the influence of selective breeding on genetic variation in lettuce. Plant Genome. 2021;14: e20086. pmid:33629537
  32. 32. Simko I, Peng H, Kandel JS, Zhao R. Genome-wide association mapping reveals genomic regions frequently associated with lettuce field resistance to downy mildew. Theor Appl Genet. 2022;135: 2009–24. pmid:35419653
  33. 33. Simko I, Kandel JS, Peng H, Zhao R, Subbarao KV. Genetic determinants of lettuce resistance to drop caused by Sclerotinia minor identified through genome-wide association mapping frequently co-locate with loci regulating anthocyanin content. Theor Appl Genet. 2023;136: 80. https://doi.org/10.1007/s00122-023-04421-y.
  34. 34. Shi Y, Gao L, Wu Z, Zhang X, Wang M, Zhang C, et al. Genome-wide association study of salt tolerance at the seed germination stage in rice. BMC Plant Biol. 2017;17: 92. pmid:28558653
  35. 35. Breria CM, Hsieh C-H, Yen T-B, Jo-Yi Yen J-Y, Noble TJ, et al. A SNP-based genome-wide association study to mine genetic loci associated to salinity tolerance in mungbean (Vigna radiata L.). Genes. 2020;11(7): 759. https://doi.org/10.3390/genes11070759.
  36. 36. Li X, Guo D, Xue M, Li G, Yan Q, Jiang H, et al. Genome-Wide Association Study of Salt Tolerance at the Seed Germination Stage in Flax (Linum usitatissimum L.). Genes. 2022;13(3): 486. https://doi.org/10.3390/genes13030486.
  37. 37. Zhang G, Zhou J, Peng Y, Tan Z, Li L, Yu L, et al. Genome-wide association studies of salt tolerance at seed germination and seedling stages in Brassica napus. Front Plant Sci. 2022;2: 772708. https://doi.org/10.3389/fpls.2021.772708.
  38. 38. Fischer RA, Maurer R. Drought resistance in spring wheat cultivars. I. Grain yield response. Aust J Agric Res. 1978;29: 897–912.
  39. 39. Fehr WR. Principles of cultivar development. vol. 1. Theory and technique. New York: Macmillan Publishing Company; 1987.
  40. 40. McCaw ZR, Lane JM, Saxena R, Redline S, Lin X. Operating characteristics of the rank-based inverse normal transformation for quantitative trait analysis in genome-wide association studies. Biometrics. 2020;76: 1262–72. pmid:31883270
  41. 41. Ott A, Liu S, Schnable JC, Yeh CT, Wang KS, Schnable PS. tGBS® genotyping-by-sequencing enables reliable genotyping of heterozygous loci. Nucleic Acids Res. 2017;45(21): e178. pmid:29036322
  42. 42. Reyes-Chin-Wo S, Wang Z, Yang X, Kozik A, Arikit S, Song C, et al. Genome assembly with in vitro proximity ligation data and whole-genome triplication in lettuce. Nat Commun. 2017; 8: 14953. pmid:28401891
  43. 43. Browning BL, Zhou Y, Browning SR. A one-penny imputed genome from next-generation reference panels. Am J Hum Genet. 2018;103: 338–48. pmid:30100085
  44. 44. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23: 2633–35. pmid:17586829
  45. 45. Remington DL, Thornsberry JM, Matsuoka Y, Wilson LM, Whitt SR, Doebley J, et al. Structure of linkage disequilibrium and phenotypic associations in the maize genome. PNAS. 2001;98: 11479–84. pmid:11562485
  46. 46. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shidick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006;38: 904–9. pmid:16862161
  47. 47. Price AL, Zaitlen NA, Reich D, Patterson N. New approaches to population stratification in genome-wide association studies. Nat Rev genet. 2010;11(7): 459–63. pmid:20548291
  48. 48. Wickham H. ggplot2: Elegant graphics for data analysis. New York: Springer-Verlag; 2016.
  49. 49. Wang J, Zhang Z. GAPIT Version 3: Boosting Power and Accuracy for Genomic Association and Prediction. Genom Proteom Bioinf. 2021;19: 1–12. pmid:34492338
  50. 50. Scuteri A, Sanna S, Chen WM, Uda M, Albai G, Strait J, et al. Genome-wide association scan shows genetic variants in the fto gene are associated with obesity-related traits. PLOS Genetics. 2007;3: e115. pmid:17658951
  51. 51. Barber MJ, Mangravite LM, Hyde CL, Chasman DI, Smith JD, McCarty CA, et al. Genome-wide association of lipid-lowering response to statins in combined study populations. PLOS One. 2010;5: e9763. pmid:20339536
  52. 52. Turner SD. qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots. J Open Sour Soft. 2018;3(25): 731.
  53. 53. Kruskal WH, Wallis WA. Use of ranks in one-criterion variance analysis. J Amer Stat Assoc. 1952;47: 583–621.
  54. 54. Alqudah AM, Sallam A, Baenziger PS, Börner A. GWAS: Fast-forwarding gene identification and characterization in temperate Cereals: lessons from Barley—A review. J Adv Res. 2019;22: 119–135. pmid:31956447
  55. 55. Kumar V, Singh A, Mithra SV, Krishnamurthy SL, Parida SK, Jain S, et al. Genome-wide association mapping of salinity tolerance in rice (Oryza sativa). DNA Res. 2015; 22(2): 133–45. pmid:25627243
  56. 56. Thiry AA, Chavez Dulanto PN, Reynolds MP, Davies WJ. How can we improve crop genotypes to increase stress resilience and productivity in a future climate? A new crop screening method based on productivity and resistance to abiotic stress. J Exp Bot. 2016;67(19): 5593–603. pmid:27677299
  57. 57. Tiwari S, Sl K, V. Kumar B. Singh A.R. Rao A. Mithra Sv, et al. Mapping QTLs for salt tolerance in Rice (Oryza sativa L.) by bulked Segregant analysis of recombinant inbred lines using 50K SNP Chip. PLoS One. 2016;11(4): e0153610. pmid:27077373
  58. 58. Delormel TY, Boudsocq M. Properties and functions of calcium-dependent protein kinases and their relatives in Arabidopsis thaliana. New Phytologist. 2019;224(2): 585–604. https://doi.org/10.1111/nph.16088.
  59. 59. Asano T, Hayashi N, Kobayashi M, Aoki N, Akio M, Mitsuhara I, et al. A rice calcium-dependent protein kinase OsCPK12 oppositely modulates salt-stress tolerance and blast disease resistance. Plant J. 2012;69: 26–36. pmid:21883553
  60. 60. Gao W, Xu F-C, Guo D-D, Zhao J-R, Liu J, Guo Y-W, et al. Calcium-dependent protein kinases in cotton: insights into early plant responses to salt stress. BMC Plant Biol. 2018;18: 15. pmid:29343239
  61. 61. Lin F, Li S, Wang K, Tian H, Gao J, Zhao Q, et al. A leucine-rich repeat receptor-like kinase, OsSTLK, modulates salt tolerance in rice. Plant Sci. 2020;296: 110465. pmid:32540023
  62. 62. Jorge JG, Villalobos-López MA, Chavarría-Alvarado KL, Ríos-Meléndez S, López-Meyer M, Arroyo-Becerra A. Genome-wide transcriptional changes triggered by water deficit on a drought tolerant common bean cultivar. BMC Plant Biol. 2020;20: 525. pmid:33203368
  63. 63. Jones AM, Xuan Y, Xu M, Wang R-S, Ho C-S, Lalonde S, et al. Border control–a membrane-linked interactome of Arabidopsis. Science. 2014;344(6185): 711–6. https://doi.org/10.1126/science.1251358.
  64. 64. Xu X, Zhang L, Zhao W, Fu L, Han Y, et al. Genome-wide analysis of the serine carboxypeptidase-like protein family in Triticum aestivum reveals TaSCPL184-6D is involved in abiotic stress response. BMC Genomics. 2021; 22: 350.
  65. 65. Ji W, Zhu Y, Li Y, Yang L, Zhao X, Cai H, et al. Over-expression of a glutathione S- transferase gene, GsGST, from wild soybean (Glycine soja) enhances drought and salt tolerance in transgenic tobacco. Biotechnol Lett. 2010;32: 1173–1179.
  66. 66. Qi YC, Liu WQ, Qiu LY, Zhang SM, Ma L, Zhang H. Overexpression of glutathione S-transferase gene increases salt tolerance of Arabidopsis. Russian J Plant Physiol. 2010;57: 233–40.
  67. 67. Ruiz JM, Blumwald E. Salinity-induced glutathione synthesis in Brassica napus. Planta. 2002;214: 965–69. pmid:11941474
  68. 68. Mittova V, Theodoulou FL, Kiddle G, Gomez L, Volokita M, Tal M, et al. Coordinate induction of glutathione biosynthesis and glutathione-metabolizing enzymes is correlated with salt tolerance in tomato. FEBS Lett. 2003;554: 417–21. pmid:14623104
  69. 69. Song JB, Wang YX, Li HB, Li BW, Zhou SZ, Gao S, et al. F-box family genes as key elements in response to salt, heavy metal, and drought stresses in Medicago truncatula. Funct Integr Genomics. 2015;15: 495–507. pmid:25877816
  70. 70. Jia Q, Xiao ZX, Wong FL, Sun S, Liang KJ, Lam HM. Genome-wide analysis of soybean F-box gene family in response to salt stress. Int J Mol Sci. 2017;18: 818. https://doi.org/10.3390/ijms18040818.
  71. 71. Missaoui K, Gonzalez-Klein Z, Pazos-Castro D, Hernandez-Ramirez G, Garrido-Arandia M, Brini F, et al. Plant non-specific lipid transfer proteins: An overview. Plant Physiol Biochem. 2022;171: 115–27. pmid:34992048
  72. 72. Hairat S, Baranwal VK, Khurana P. Identification of Triticum aestivum nsLTPs and functional validation of two members in development and stress mitigation roles. Plant Physiol. Biochem. 2018;130: 418–30. https://doi.org/10.1016/j.plaphy.2018.07.030.
  73. 73. Edqvist J, Blomqvist K, Jeroen Nieuwland J, Salminen TA. Plant lipid transfer proteins: are we finally closing in on the roles of these enigmatic proteins? J Lipid Res. 2018;59: 1374–1382. pmid:29555656
  74. 74. Xia-Yu G, Meng Z, Ming-Dong Z, Ji-Rui L, Zhong-Wei W, Jian-Wu L, et al. Comparative Transcriptomic Analysis of the Super Hybrid Rice Chaoyouqianhao under Salt Stress. BMC Plant Biol. 2022;22: 233. pmid:35525915
  75. 75. Aghamdi BA, Bafeel SO, Edris S, Atef A, Al-Matary M, Bahieldin A. Molecular mechanisms underlying salt stress tolerance in jojoba Simmondsia chinensis. Biosci. Biotech Res Asia. 2021;18(1). https://doi.org/10.13005/bbra/2895.
  76. 76. Kirk P, Amsbury S, German L, Gaudioso-Pedraza R, Benitez-Alfonso Y. A comparative met proteomic pipeline for the identification of plasmodesmata proteins and regulatory conditions in diverse plant species. BMC Biol. 2022;20: 128. https://doi.org/10.1186/s12915-022-01331-1.
  77. 77. Liu H, Tang X, Zhang N, Li S, Huaijun SiRole H. Role of bZIP transcription factors in plant salt stress. Int J Mol Sci. 2023;24: 7893. pmid:37175598
  78. 78. Zhang X, Zhao J, Wu X, Hu G, Fan S, Ma Q. Evolutionary relationships and divergence of KNOTTED1-like family genes involved in salt tolerance and development in cotton (Gossypium hirsutum L.). Front Plant Sci. 2021;12: 774161. https://doi.org/10.3389/fpls.2021.774161.
  79. 79. Nishiyama R, Watanabe Y, Fujita Y, Le DT, Kojima M, Werner T, et al. Analysis of cytokinin mutants and regulation of cytokinin metabolic genes reveals important regulatory roles of cytokinins in drought, salt and abscisic acid responses, and abscisic acid biosynthesis. Plant Cell. 2011;23: 2169–83. pmid:21719693
  80. 80. Du Y, Zhang Z, Gu Y, Li W, Wang W, Yuan X, et al. Genome-wide identification of the soybean cytokinin oxidase/dehydrogenase gene family and its diverse roles in response to multiple abiotic stress. Front Plant Sci. 2023;14: 1163219. pmid:37139113
  81. 81. Christians MJ, Robles LM, Zeller SM, Larsen PB. The eer5 mutation, which affects a novel proteasome-related subunit, indicates a prominent role for the COP9 signalosome in resetting the ethylene-signaling pathway in Arabidopsis. Plant J. 2008;66: 467–477. pmid:18429939
  82. 82. Lei G, Shen M, Li ZG, Zhang B, Duan KX, Wang N, et al. EIN2 regulates salt stress response and interacts with a MA3 domain-containing protein ECIP1 in Arabidopsis. Plant Cell Environ. 2011; 34(10): 1678–92. pmid:21631530
  83. 83. Binder BM. Ethylene signaling in plants. J Biol Chem. 2020;295: 7710–25. pmid:32332098
  84. 84. Sugimoto M, Houjyo Y, Maekawa M, Terada R. Rice salt-tolerance enhancement by expression of 1-aminocyclopropane-1-carboxylic acid oxidase gene from salt tolerant barley. J Plant Biochem Physiol. 2018;6: 4. https://doi.org/10.4172/2329-9029.1000226.
  85. 85. Liu D, Li YY, Zhou ZC, Xiang X, Liu X, Wang J, et al. Tobacco transcription factor bHLH123 improves salt tolerance by activating NADPH oxidase NtRbohE expression. Plant Physiol. 2021;186:1706–20. pmid:33871656
  86. 86. Yu L, Fan J, Xu C. Peroxisomal fatty acid β-oxidation negatively impacts plant survival under salt stress. Plant Signal Behav. 2019;14(2): 1561121. pmid:30618323
  87. 87. Sukumaran S, Lethin J, Liu X, Pelc J, Zeng P, Hassan S. Genome-wide analysis of MYB transcription factors in the wheat genome and their roles in salt stress response. Cells 2023;12(10): 1431. pmid:37408265
  88. 88. Laluk K, Abuqamar S, Mengiste T. The Arabidopsis mitochondria-localized pentatricopeptide repeat protein PGN functions in defense against necrotrophic fungi and abiotic stress tolerance. Plant Physiol. 2011;156: 2053–68. pmid:21653783
  89. 89. Zsigmond L, Szepesi A, Tari I, Rigo G, Kiraly A. Overexpression of the mitochondrial PPR40 gene improves salt tolerance in Arabidopsis. Plant Sci. 2012;182: 87–93. pmid:22118619
  90. 90. Moin M, Saha A, Bakshi A, Madhav MS, Kirti PB. Constitutive expression of Ribosomal Protein L6 modulates salt tolerance in rice transgenic plants. Gene. 2021;789: 145670. pmid:33892070
  91. 91. Shiraku ML, Magwanga RO, Cai X, Kirungu JN, Xu Y, Mehari TG, et al. Knockdown of 60S ribosomal protein L14-2 reveals their potential regulatory roles to enhance drought and salt tolerance in cotton. J Cotton Res. 2021;4: 27. https://doi.org/10.1186/s42397-021-00102-7.
  92. 92. Liaqat A, Alfatih A, Jan SU, Sun L, Zhao P, Xiang C. Transcription elongation factor AtSPT4-2 positively modulates salt tolerance in Arabidopsis thaliana. BMC Plant Biol. 2023;23: 49. https://doi.org/10.1186/s12870-023-04060-x.
  93. 93. Xiong G, Cheng K, Pauly M. Xylan O-acetylation impacts xylem development and enzymatic recalcitrance as indicated by the Arabidopsis mutant tbl29. Mol Plant. 2013;6: 1373–75. pmid:23340742
  94. 94. Lunin VV, Wang H-T, Bharadwaj VS, Alahuhta M, Pena MJ, Yang J-Y et al. Molecular mechanism of polysaccharide acetylation by the Arabidopsis xylem O-acetyltransferase XOAT1. Plant Cell. 2020;32(7): 2367–2382. https://doi.org/10.1105/tpc.20.00028.
  95. 95. Homma Y, Hiragi S, Fukuda M. Rab family of small GTPases: an updated view on their regulation and functions. FEBS J. 2021 Jan; 288(1): 36–55. pmid:32542850
  96. 96. Zhang Y, Tian H, Chen D, Zhang H, Sun M, Chen S, et al. Cysteine-rich receptor-like protein kinase: emerging regulators of plant stress responses. Trends Plant Sci. 2023;28: 7. https://doi.org/10.1016/j.tplants.2023.03.028. pmid:36328871
  97. 97. Huang Y, Jiao Y, Xie N, Guo Y, Zhang F, Xiang Z, et al. OsNCED5, a 9-cis- epoxycarotenoid dioxygenase gene, regulates salt and water stress tolerance and leaf senescence in rice. Plant Sci. 2019;287: 110188. pmid:31481229
  98. 98. Munns R, James RA, Läuchli A. Approaches to increasing the salt tolerance of wheat and other cereals. J Exp Bot. 2006;57(5): 1025–43. pmid:16510517
  99. 99. Hamazaki K, Kajiya-Kanegae H, Yamasaki M, Ebana K, Yabe S, Nakagawa H, et al. Choosing the optimal population for a genome-wide association study: A simulation of whole-genome sequences from rice. Plant Genome. 2020;13: e20005. pmid:33016626
  100. 100. Flint-Garcia SA, Thuillet A-C, Yu J, Pressoir G, Romero SM, Mitchel SE, et al. Maize association population: a high-resolution platform for quantitative trait locus dissection. Plant J. 2005;44(6): 1054–64. pmid:16359397
  101. 101. Barros I. The importance of increasing population diversity in genetic studies of type 2 diabetes and related glycaemic traits. Diabetologia. 2021;64(12): 2653–64. pmid:34595549
  102. 102. Hong EP, Park JW. Sample Size and Statistical Power Calculation in Genetic Association Studies. Genomics Inf. 2012;10(2): 117–22. pmid:23105939
  103. 103. Korte A, Farlow A. The advantages and limitations of trait analysis with GWAS: a review. Plant Methods. 2013;9(1): 29. pmid:23876160
  104. 104. Flint-Garcia SA, Thornsberry JM, Buckler ES. (2003). Structure of linkage disequilibrium in plants. Annu Rev Plant Biol. 2003;54: 357–74. https://doi.org/10.1146/annurev.arplant.
  105. 105. Bajaj D, Das S, Badoni S, Kumar V, Singh M, Bansal KC, et al. Genome-wide high- throughput SNP discovery and genotyping for understanding natural (functional) allelic diversity and domestication patterns in wild chickpea. Sci Rep. 2015;5: 12468. pmid:26208313
  106. 106. Lam HM, Xu X, Liu X, Chen W, Yang G, Wong FL, et al. Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection. Nat. Genet. 2010;42: 1053–59. pmid:21076406
  107. 107. Noble TJ, Tao Y, Mace ES, Williams B, Jordan DR, Douglas CA, et al. Characterization of linkage disequilibrium and population structure in a mungbean diversity panel. Front Plant Sci. 2018;8: 2102. pmid:29375590
  108. 108. Lichten M, Goldman ASH. Meiotic Recombination Hotspots. Ann Rev Genet. 1995;29: 423–44. pmid:8825482