Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genetic diversity and population structure of Ethiopian Sweet Sorghum [Sorghum bicolor (L.) Moench] germplasms using SSR markers

  • Tefera Habtegiorgis Teklewold ,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Writing – original draft

    tefera.ethio36@yahoo.com

    Affiliation Department of Microbial, Cellular and Molecular Biology, College of Natural and Computational Sciences, Addis Ababa University, Addis Ababa, Ethiopia

  • Tsegaye Getahun Bogale,

    Roles Funding acquisition, Resources, Supervision, Validation, Writing – review & editing

    Affiliation Institute of Biotechnology, Addis Ababa University, Addis Ababa, Ethiopia

  • Demsachew Guadie Tseganeh,

    Roles Funding acquisition, Resources, Supervision, Validation, Writing – review & editing

    Affiliation Institute of Biotechnology, Addis Ababa University, Addis Ababa, Ethiopia

  • Muluken Enyew Birara,

    Roles Resources, Supervision, Visualization, Writing – review & editing

    Affiliations Institute of Biotechnology, Addis Ababa University, Addis Ababa, Ethiopia, School of Biological Sciences, Washington State University, Pullman, Washington, United States of America

  • Tileye Feyissa Senbeta

    Roles Funding acquisition, Project administration, Resources, Supervision, Validation, Writing – review & editing

    Affiliations Department of Microbial, Cellular and Molecular Biology, College of Natural and Computational Sciences, Addis Ababa University, Addis Ababa, Ethiopia, Institute of Biotechnology, Addis Ababa University, Addis Ababa, Ethiopia

Abstract

Sweet sorghum is a cereal crop in the grass family belonging to the genus Sorghum bicolor L. Moench. It is known in its sugary juice that is accumulated in its stalk and efficient C4 photosynthetic pathway. Only few molecular genetic diversity studies of Ethiopian sweet sorghum have been carried out. Understanding the genetic diversity of plants is the basis for genetic improvement, effective conservation and efficient utilization of genetic resources. Therefore, the objective of this study was to evaluate the genetic diversity and population structure of Ethiopian sweet sorghum genotypes collected from major growing areas of Ethiopia. In the present study, thirteen SSR markers produced a total of 136 alleles across all the 91 sweet sorghum accessions with an average of 10.46 alleles per marker. The major allele frequency per marker ranged from 0.16 to 0.41 with an average of 0.25. The number of alleles per marker ranged from 6 to 15. The mean PIC value was 0.80. The pair-wise genetic differentiation among the five studied sweet sorghum populations ranged from 0.07 to 0.19. The highest Fst (0.19) and the lowest Fst (0.07) population differentiation were observed between sweet sorghum population of South Wollo and Oromia Liyu Zone, and North Shewa and East Gojam sweet sorghum populations, respectively. The analysis of gene flow across populations showed that the highest gene flow was recorded between North Shewa and East Gojam (2.879), whereas the least gene flow was observed between South Wollo and Oromia Liyu Zone (0.618). The analysis of molecular variance revealed that 16% variation was observed among populations and 84% variation has been observed within populations. Meanwhile, the STRUCTURE and UPGMA methods of clustering suggested that the sampled sweet sorghum populations were clustered into two main groups (K =  2). This comprehensive study of genetic diversity and population structure of sweet sorghum (Sorghum bicolor) in Ethiopia suggests that future sweet sorghum improvement and utilization strategies should take the magnitude and pattern of genetic diversity into consideration.

Introduction

Sweet sorghum is a cereal crop which is classified under the grass family and the genus Sorghum bicolor L. Moench. It is known in its sugary juice which is accumulated in its stalk and efficient C4 photosynthetic pathway [1,2]. Many researchers often considered it as “the camel crop” because of its drought tolerance along with the properties of high water and nutrient use efficiency [13].

The report of the previous studies indicated that, sweet sorghum is originated from North East Africa particularly from Ethiopia and Sudan [46]. Through time, it spreads to other African regions, Southern Asia, Europe, Australia and the United States [1]. Historical evidences show that sweet sorghum has been found in Ethiopia as early as 200 AD. Its stem has been used to chew as a snack and the grains were used for making local beer by the local people [79]. Even though sweet sorghum is considered as a multipurpose crop by many scientists, currently it is widely used as a syrup production and forage around the world. [1] Stated that an average yield of 1900 L of syrup per hectare can be achieved.

The main difference between sweet sorghum and grain sorghum is that; sweet sorghum has juice rich sugary stalks that grows rapidly with higher biomass [10]. As it is suggested by many scholars, Shorghum bicolor (L.) Moench grouped under Andropogoneae tribe, subgroup panicoideae of the grass family, Poaceae [11,12]. Furthermore the genus Sorghum is classified into five subgenera; these are: Sorghum, Stipo sorghum, Chaetosorghum, Heterosorghum, and Parasorghum [12]. The subgenus Sorghum contains three species including S. bicolor, S. propinquum, and S. halepense [12]. In addition, Sorghum bicolor has three subspecies including Sorghum bicolor, Sorghum bicolor drummondii, and Sorghum bicolor verticilliforum [1114].

Based on morphological features such as grain shape, glume, and panicle, cultivated varieties of Sorghum bicolor have been classified into five basic races including bicolor, guinea, caudatum, kafir, and durra [12,13]. Sweet sorghum and grain sorghum have different class of race. Most of the grain sorghum varieties belong to the races caudatum, kafir, and durra, whereas sweet sorghum and forage sorghum varieties are mainly grouped in the race bicolor [12,15,16]. On the contrary, later studies showed that sweet sorghum has a polyphyletic origin and therefore, apart from race bicolor, may have parentage from other previously mentioned races as well [12,17]. Other studies suggested that in the continent of Africa particularly in Ethiopia where most of the wild germplasm has originated, there are intermediate varieties of sorghum bicolor [12,18].

Understanding the genetic variation of plants has great importance in crop improvement program [19]. Genetic improvement of a crop is mainly dependent on the strength of genetic diversity within the crop species. Furthermore, understanding the molecular basis of the essential biological phenomena in plants is important for the effective conservation, management, and efficient utilization of plant genetic resources.

Molecular markers are nucleotide sequence found in the genome of living organisms and their variations between accessions enable to identify the pattern of inheritance [20]. Molecular markers help breeders to select suitable genotypes for crossing in breeding programs [19]. Evaluation of genetic diversity can be studied using different ways, among the most efficient of which is molecular markers [20]. Simple sequence repeats (SSR) markers are characterized by their tandem repeats which range one to six nucleotides. They are distributed throughout the genome of an organism, and have multiple alleles that often have the same loci between related species [1921]. They have been widely used in different genetic studies such as diversity analysis, quantitative trait locus mapping, and genotype identification [19].

Even though the Ethiopian Biodiversity Institute contains large number of sorghum germplasms in its gene bank, most of the collections are not well characterized. In addition sweet sorghum germplasms are not maintained separately from grain sorghum germplasms [22]. Furthermore, only a few molecular studies of Ethiopian sweet sorghum have been carried out to show the extent of genetic variation. However, these studies did not include germplasms of sweet sorghum from all geographical range of Ethiopia. Therefore, the objective of this study was to evaluate the genetic diversity and population structure of Ethiopian sweet sorghum genotypes collected from major growing areas of Ethiopia particularly those areas which were not covered in the previous studies.

Materials and methods

Plant materials

In this study, a total of ninety one sweet sorghum accessions were used. From these accessions, twenty nine of them were obtained from Ethiopian Biodiversity Institute (EBI) and sixty two of them were collected from major sweet sorghum growing areas of Amhara and Tigray regions of Ethiopia (Fig 1). Those sweet sorghum accessions that were obtained from Ethiopian Biodiversity Institute (EBI) were a mixture of grain sorghum; therefore, purification and isolation have been done by cultivating them in the farm during 2022 growing season. While they reach at maturity stage (3 to 4 months), they possessed distinctive morphological features that make them different from grain sorghum. The color of the midrib of their leaf and the juice accumulated in their stalk were used to differentiate them from grain sorghum. The remaining sixty two sweet sorghum accessions were collected directly from farmer’s sweet sorghum farms during 2022 harvesting season. Ethiopian farmers cultivate a mixture of sweet sorghum and grain sorghum on the same plot of land at the same time. They use the sweet sorghum juicy stalk as a chewing snack and the grain sorghum for making foods such as local pancake called ‘Injera’ and local beverage. Farmers from the selected areas of Amhara and Tigray regions, were the sources for the collection of these sixty two sweet sorghum accessions. Based on their collection sites, they were categorized into five populations: North Shewa, South Tigray, South Wollo, East Gojam and Oromia Liyu Zone.

thumbnail
Fig 1. Collection sites of the studied sweet sorghum accessions.

https://doi.org/10.1371/journal.pone.0316549.g001

DNA extraction

All the ninety one sweet sorghum accessions were grown in a pot at the greenhouse of Institute of Biotechnology, Addis Ababa University. Healthy young leaves were collected from two-week-old plants of each accession and kept in plastic bag containing silica gel. Genomic DNA was extracted from young leaves using modified CTAB plant DNA extraction protocol [23]. DNA quality was checked by loading 2µl of DNA mixed with 2µl of loading dye with 4µl of ethidium bromide on a 1% agarose gel and separated at 100 V for 45 minutes. The extracted genomic DNA was quantified using a Nano-Drop spectrophotometer (ND-2000) and stored at − 20°C until used for further analyses.

PCR and gel electrophoresis

Twenty SSR primer-pairs developed by [24] were used for initial screening, amplification, polymorphism and specificity to target loci. It could have been more appropriate to use large number of SSR markers in this study; however due to limitation of resources such as chemicals and reagents for running polymerase chain reaction and gel electrophoresis, limited number of markers were used. According to the manufacturer’s instruction, the primers were dissolved using nuclease-free water to a final concentration of 100 mol/µl and further diluted in 1 to 10 ratio. Out of the twenty screened primer pairs, thirteen of them amplified the genomic DNA and showed polymorphism across tested sweet sorghum accessions (Table 1). DNA amplification was performed in 11μl reaction volume containing 4μl of 2X Taq plus Master Mix (containing Taq DNA polymerase, dNTPs, MgCl2, PCR buffer, PCR reaction enhancer, stabilizer and blue tracer dye) (Sangon Biotech), 1μl of 100 ng/µl template DNA, 0.5μl forward and 0.5µl reverse primers, and 5µl of nuclease free water. The PCR program for the primer pairs consisted of 4 min at 94°C for the initial denaturation followed by 35 cycles of 94°C for 30 second, primer annealing at 52-60°C for 30 seconds, extension cycle at 72°C for 1min followed by final extension at 72°C for 10min and then hold at 4°C. The PCR amplified products were separated by loading 10µl of each of the PCR product on 2% agarose gel electrophoresis which is mixed with 4 µl of ethidium bromide and run using 1 × TBE buffer at 100 V for 2hours. The product was finally visualized under gel documentation system (BioDoc-ItTM imaging system) and subsequently photographed. A 100 bp size molecular marker (DNA ladder) was used to estimate the size of the amplified products.

thumbnail
Table 1. List of SSR markers used to amplify the genomic DNA of sweet sorghum.

https://doi.org/10.1371/journal.pone.0316549.t001

Data analysis

After PCR amplification and gel electrophoresis have been conducted, the bands were photographed by BioDoc-ItTM imaging system (Figs 25). All of the bands of the studied sweet sorghum accessions under different SSR markers were scored using Image Lab™ Touch software version 2.3. All the molecular analysis were carried out based on the molecular weight of each bands. Parameters used to characterize and analyze the SSR markers such as major allele frequency (MAF), the number of allele (Na), gene diversity (GD) and Polymorphic information contents (PIC) were analyzed using Power marker version 3.25 software [25]. On the other hand, number of different alleles, effective number of alleles, Shannon’s information index, number of private alleles, expected heterozygosity, observed heterozygosity, fixation index, pairwise population genetic distance, genetic differentiation, Principal Coordinate Analysis (PCOA) and molecular variance (AMOVA) were analyzed using GenAlEx version 6.503 software [26]. The two integrated, STRUCTURE version 2.3.4 and Structure harvester software were used to estimate population structure. The structure 2.3.4 software was used to estimate the optimum number of clusters (K) of the studied sweet sorghum accessions. The software uses Bayesian algorithm and admixture model based on Monte Carlo Markov Chain algorithm [27]. In the sweet sorghum population structure estimation, the admixture model used the allele frequency correlation value. The number of ancestral sweet sorghum populations was calculated for K ranging from 1 to 8 with 8 runs for each K value. For each run, a burn-in period of 30000 iterations was used. In addition, MCMC (Markov Chain Monte Carlo) replications of 15000 was performed. The structure calculation results were submitted to the online tool Structure Harvester [28] to determine the number of clusters in the group [29]. Furthermore, cluster analysis and phylogenetic tree construction were carried out by the combination of MEGA version 11.0.13 [30] and power marker version 3.25 [25] software.

thumbnail
Fig 2. The band pattern of some of the sweet sorghum accessions amplified by primer Xisep0310.

https://doi.org/10.1371/journal.pone.0316549.g002

thumbnail
Fig 3. The band pattern of some of the sweet sorghum accessions amplified by primer Xtxp12.

https://doi.org/10.1371/journal.pone.0316549.g003

thumbnail
Fig 4. The band pattern of some of the sweet sorghum accessions amplified by primer Xtxp3.

https://doi.org/10.1371/journal.pone.0316549.g004

thumbnail
Fig 5. The band pattern of some of the sweet sorghum accessions amplified by primer Xtxp015.

https://doi.org/10.1371/journal.pone.0316549.g005

Results

Polymorphism and allelic diversity

Thirteen SSR markers produced a total of 136 alleles across all 91 sweet sorghum accessions with an average of 10.46 alleles per marker. The major allele frequency per marker ranged from 0.16 (Xtxp012) to 0.41 (Xtxp114), with an average of 0.25. The number of alleles per marker ranged from 6 (Xtxp114) to 15 (Xtxp12) (Table 2). All of the SSR markers were highly polymorphic. The PIC values for the SSR markers ranged from 0.68 to 0.88 with an average of 0.80 (Table 2). The highest PIC value of 0.88 was obtained for marker Xtxp012, whereas the lowest PIC value was recorded for marker Xtxp114. The overall heterozygosity (Ho) over loci varied from 0.00 (Xcup53, Xtxp015, Xtxp114 and SbAGE01) to 0.26 (Xtxp12) with an average of 0.12. The gene diversity index ranged from 0.72 (Xtxp114) to 0.89 (Xtxp012).

thumbnail
Table 2. Patterns of alleles amplified by 13 SSR markers.

https://doi.org/10.1371/journal.pone.0316549.t002

The total expected heterozygosity (Ht) recorded for all 13 SSR markers in the present study ranged from 0.71 to 0.88 with a mean of 0.82 (Table 3). The mean value of expected heterozygosity (He) revealed by each SSR markers ranged from 0.54 (Xtxp114) to 0.79 (Xtxp012) (Table 3). On the other hand, the mean value of the observed heterozygosity (Ho) ranged from 0.00 (Xcup53, Xtxp015, Xtxp114, SbAGE01) to 0.35 (Xisep0310 and Xisep0310) (Table 3). The overall mean of expected heterozygosity (He) and observed heterozygosity (Ho) of the 13 SSR markers were 0.66 and 0.14, respectively (Table 3).

thumbnail
Table 3. F-statistics and estimates of gene flow (Nm) over all populations for each markers.

https://doi.org/10.1371/journal.pone.0316549.t003

Genetic diversity within and among populations

Among the studied sweet sorghum populations, sweet sorghum population of North Shewa revealed high mean value in terms of number of different alleles (8.46), number of effective alleles (5.15), expected heterozygosity (0.79), Shannon’s Information Index (1.79) and unbiased expected heterozygosity (0.80) (Table 4). On the contrary, sweet sorghum populations of South Wollo showed the least number of different alleles (3.462). In terms of other genetic diversity parameters, Oromia Liyu Zone revealed the least number of effective alleles (2.500), expected heterozygosity (0.541), observed heterozygosity (0.006), Shannon’s Information Index (0.994), unbiased heterozygosity (0.564) and fixation index (0.989) (Table 4). North Shewa population revealed the highest mean value of private alleles (1.54), whereas the least number of private alleles were recorded in the sweet sorghum population of Oromia Liyu Zone and East Gojam (0.77). The analysis of the number of different alleles with a frequency>= 5%, revealed that relatively high mean value (5.69) was scored for North Shewa population and the least mean value was scored in South Wollo population (3.462). On the other hand, the number of locally common alleles with frequency>= 5% showed that North Shewa population has revealed the highest mean value (2.00), whereas South Wollo population has shown the least mean value (0.46) (Table 4).

Genetic differentiation, genetic distance and gene flow

The pair-wise genetic differentiation among the populations ranged from 0.071 to 0.196 (Table 5). The highest population differentiation was observed between South Wollo and Oromia Liyu Zone (Fst =  0.196) followed by East Gojam and Oromia Liyu Zone (Fst =  0.186). The lowest population differentiation (Fst =  0.071) was observed between North Shewa and East Gojam (Table 5). The highest Nei’s genetic distance was observed between East Gojam and Oromia Liyu Zone (1.201) followed by East Gojam and South Tigray (1.080). The lowest Nei’s genetic distance (0.543) was observed between East Gojam and North Shewa (Table 5). The highest gene flow across populations was recorded between North Shewa and East Gojam (2.879), whereas the least gene flow was observed between South Wollo and Oromia Liyu Zone (0.618) (Table 6).

thumbnail
Table 5. Pairwise population genetic differentiation analysis measured by Fst (below the diagonal) and Nei Genetic Distance (above the diagonal) among different geographic regions.

https://doi.org/10.1371/journal.pone.0316549.t005

thumbnail
Table 6. Pairwise population gene flow (Nm) Values (below diagonal).

https://doi.org/10.1371/journal.pone.0316549.t006

Analysis of molecular variance

Analysis of molecular variance (AMOVA) revealed that majority of the variance was restricted within populations (84%), whereas 16% variance was partitioned among populations (Table 7).

thumbnail
Table 7. Analysis of molecular variance (AMOVA) among and within sweet sorghum populations based on genetic distance matrix.

https://doi.org/10.1371/journal.pone.0316549.t007

Cluster analysis

The phylogenetic analysis revealed clustering of sweet sorghum accessions is mainly based on their geographical origin with a few exceptions (Fig 6). The 91 accessions grouped into two major clusters. The first cluster (C1) consist of accessions from South Wollo and South Tigray Zones. The second cluster (C2) consisted of accessions collected from the North Shewa, East Gojam, and Oromia liyu Zone (Fig 3). The phylogenetic tree also showed that the two major clusters further grouped into seven sub clusters (SC1 to SC7). Three sub clusters and forty accessions were included in the first cluster. The remaining four sub clusters with fifty one accessions were included in the second cluster. Some accessions collected from North Shewa were clustered along with accessions collected from South Wollo.

thumbnail
Fig 6. Unweighted pair-group method with arithmetic mean cluster analysis showing the genetic relationship among the 91 sweet sorghum accessions using 13 SSR markers.

https://doi.org/10.1371/journal.pone.0316549.g006

Principal coordinates analysis

The Principal coordinate analysis (PCoA) result revealed 31.26% of variations explained by the first three axes. The first, second and third principal coordinate axes showed 12.55%, 10.38% and 8.33%, respectively (Table 8). Although the groups were not completely separated in the two-dimensional plot, the distribution of accessions on the plot indicated the presence of genetic diversity (Fig 7). Most of the accessions collected from North Shewa zone were clustered on the lower left and right sides of the coordinate graph. In addition, some sweet sorghum accessions are concentrated on the upper right side of the coordinate graph along with the accessions collected from Oromia Liyu Zone. On the other hand, the accessions of South Tigray were clustered on the upper left and most of the accessions of Oromia Liyu Zone were concentrated on the upper right of the axis. All accessions collected from North Shewa, South Wollo and East Gojam zones aggregated together on the lower left and right sides of the coordinate graph and there is no clear separation among them.

thumbnail
Table 8. Percentage of variation explained by the first 3 axes.

https://doi.org/10.1371/journal.pone.0316549.t008

thumbnail
Fig 7. Principal coordinates analysis of 91 sweet sorghum accessions using 13 SSR markers.

https://doi.org/10.1371/journal.pone.0316549.g007

Population structure

In this study the population structure results for the sweet sorghum populations obtained using STRUCTURE software showed that the maximum value for ‘K’ is 2 (Fig 8). The population structure analysis indicated that the 91 accessions were optimally divided into 2 groups. The genetic structure of sweet sorghum confirmed that the pattern of population structure is closely related to the geographical distribution of accessions. Accessions from adjacent area were grouped together (Fig 9). Although there was some degree of genetic mixing (gene exchange) between groups, there was still clear differentiation. The results of PCoA and unweighted pair-group method with arithmetic mean (UPGMA) cluster analysis further supported those from STRUCTURE software. Individuals from the adjacent area tended to cluster together.

thumbnail
Fig 8. Bayesian model-based estimation of population structure (K=  2) of the 91 sweet sorghum accessions.

https://doi.org/10.1371/journal.pone.0316549.g008

thumbnail
Fig 9. Bar Plot showing the Population structure of 91 sweet sorghum accessions using 13 SSR molecular markers with (K=  2).

https://doi.org/10.1371/journal.pone.0316549.g009

Discussion

Polymorphism of SSR markers and allelic patterns

Allele frequency is the concept used to quantify genetic variation. In this study, thirteen SSR markers produced a total of 136 alleles across all 91 sweet sorghum accessions with an average of 10.46 alleles per marker. In a similar study, [22] obtained a total of 159 alleles using 13 markers in 202 sweet sorghum accessions. In addition, [31] in his study, reported a total of 123 alleles with an average of 10.25 alleles per marker and the major allele frequency per marker ranged from 0.159 (Xtxp012) to 0.41 (Xtxp114), with an average of 0.25. The study conducted by [32] reported the average major allele frequency of 0.5, which is higher than 0.25. In the present study higher numbers of alleles were recorded in comparison with the previous studies. This difference could be due to the types of markers, the size of the population, and the number of germplasms. This variation of alleles and genotypes present in the populations can be a basis for selection and breeding improvement programs.

Polymorphic information content (PIC) is the ability of a marker to establish polymorphism in the population based on the number of alleles discovered and on their distribution frequency [33]. Having a high PIC value for a marker is one of the most important indicators that the marker can be used successfully in the evaluation of genetic variation [34]. The study conducted on Ethiopian sweet sorghum by [22], the PIC of SSR markers values ranged from 0.37 to 0.85. Another study conducted on sorghum by [31], the range of PIC value was 0.39 to 0.85 using 12 SSRs across 160 accessions. The study of [19] also recorded a PIC value of 0.22 to 0.75 with a mean value of 0.52. Furthermore, the study conducted by [32] reported an average PIC value of 0.67. In this study relatively higher PIC values were observed compared to the previous studies. The PIC values for the 13 SSR primers ranged from 0.68 to 0.88 with an average value of 0.80. According to [35,36], markers with a PIC value > 0.5 are considered to be highly informative, 0.25 to 0.5 moderately informative and less than 0.25 slightly informative. In fact, PIC is determined by heterozygosity and number of alleles [37]. In this study the mean PIC value of 0.80 confirmed that the markers used were highly informative. This implies that the SSR markers used in this study are effective in showing the diversity. Therefore, these SSR markers can be a choice for future genetic characterization and diversity studies.

The average number of private alleles observed in this study ranged from 0.08 (East Gojam and Oromia Liyu Zone) to 1.538 (North Shewa). According to [33], expected hetrozyousity (He) can be defined as the likelihood that an individual in the population is heterozygous for the locus. Observed heterozygosity (Ho) is the fraction of heterozygous genes in a population. The total expected heterozygosity recorded for all 13 SSR markers in the present study ranged from 0.71 (Xtxp114) to 0.88 (Xtxp012) with a mean of 0.82. In this study, the observed heterozygosity (Ho) ranged from 0.00 (Xcup53, Xtxp015, Xtxp114 and SbAGE01) to 0.35 (Xisep0310) with a mean of 0.14. This result is higher than the He value reported by [31] which was 0.66 and that reported by [22], in which the expected heterozygosity (He) ranged from 0.42 to 0.85 with a mean of 0.72 and the overall observed heterozygosity (Ho) ranged from 0.04 to 0.16 with an average of 0.09. This difference could be because of the difference in SSR markers used, the size of the studied population and the number of accession used in the study. Among the studied sweet sorghum populations, North Shewa population revealed high mean value of number of different alleles (8.46), number of effective alleles (5.15), expected heterozygosity (0.79), observed heterozygosity (0.14), unbiased expected heterozygosity (0.80) and Shannon’s Information Index (1.79) and fixation index (0.83).

The gene diversity (GD) of a locus, also known as its expected heterozygosity (He) describes the expected proportion of heterozygous genotypes under Hardy-Weinberg Equilibrium [38]. The high levels of heterozygosity that was recorded in this study implies the presence of substantial genetic diversity that breeders can explore. Allelic patterns across these five populations of sweet sorghum showed the presence of private alleles and implication for breeding is that the private alleles are an indication of new genes that could be crucial in desired or novel trait. On the contrary, sweet sorghum populations of Oromia Liyu Zone revealed the least mean value of number of effective alleles (2.50), expected heterozygosity (0.54), Shannon’s Information Index (0.99), observed heterozygosity (0.01), unbiased expected heterozygosity (0.56), and fixation index (0.99). All populations exhibited unique private alleles. The highest number of private alleles (1.54) were exhibited by North Shewa populations and the least number of private alleles (0.08) was displayed by East Gojam and Oromia Liyu Zone populations. The private allelic patterns across the sweet sorghum populations suggest the presence of new genes that may be contributing to desired traits.

Genetic differentiation and population structure

Genetic differentiation (Fst) measures the amount of genetic variance that can be explained by population structure based on Wright’s F-statistics. [39] Suggested that genetic differentiation among populations can be categorized as (Fst < 0.05, low; 0.05_Fst < 0.15, medium; Fst > 0.15, high). In the present study, 0.19 average value of Fst was recorded. The highest population differentiation was observed between South Wollo and Oromia Liyu Zone populations (Fst =  0.19), followed by East Gojam and Oromia Liyu Zone (Fst =  0.18). The lowest population differentiation (Fst =  0.07) was observed between North Shewa and East Gojam populations. In this study, the pair wise Fst value varied from 0.07 to 0.19. This analysis showed that there was high level of genetic differentiation between each pair of populations. Since all the results of Fst so far observed in this study were very low and approximated to 0, it showed that they are not completely isolated one another and there is sharing of genetic material through breeding activity among the studied sweet sorghum population.

The pairwise Nei’s genetic distance analysis showed that the lowest (0.54) and the highest (1.20) Nei’s genetic distance were recorded among the studied populations. The highest Nei’s genetic distance was observed between East Gojam and Oromia Liyu Zone (1.20) populations followed by East Gojam and South Tigray (1.08). The lowest Nei’s genetic distance (0.54) was observed between East Gojam and North Shewa. East Gojam population showed the highest (1.20) pairwise Nei’s genetic distance with South Tigray and Oromia Liyu Zone populations and is the most genetically distinct population. This can be partly explained by the fact that the location of East Gojam is geographically the most distant to Oromia Liyu Zone and South Tigray. This may restricted the exchange of seeds among farmers, hence, these populations may serve as potential sources of new genetic variation of important traits that can be used in sweet sorghum breeding programs.

According to [39], the gene flow (Nm) is divided into three grades: high (≥1.0), medium (0.250–0.99) and low (0.0–0.249). When Nm >  1 existed, there is certain gene flow between populations, in the present study, 1.15 average value of Nm was recorded. This result suggest that there was high gene flow among the studied sweet sorghum populations. The analysis of gene flow showed the highest gene flow was recorded between North Shewa and East Gojam (2.879) populations whereas the least gene flow was observed between South Wollo and Oromia Liyu Zone (0.618) populations.

AMOVA revealed that the variance within populations is higher than that of the variance observed among populations. The majority of the variance was within populations (84%), whereas 16% variance was among populations. This significant differences between accessions might be due to high sexual recombination or the mutation of a number of repeats of a given genotype. Additionally, this high genetic variation could be due to the natural selection process. On the contrary, the relatively low variation observed among population could be the result of gene flow through exchange of seeds via breeding programs or market. Since the variation present among population is lower than within population, it may implies the influence of climatic factor is low in the expression of the genetic diversity of accessions. Similar study conducted by [40], revealed that 68.1% of variation was recorded within population and 31.9% was restricted among the studied sorghum populations. The study conducted on grain sorghum by [41] revealed that 93.26% of the total genetic variation within populations and 6.74% among populations. Another study conducted by [31] on grain sorghum using 12 SSR markers showed that 54.44% of the variation was observed within populations and 32.76% was among populations. Furthermore, the study conducted by [42] on 200 accessions of grain sorghum using 39 SSR markers reported that 99.62% of the variation was accounted to within sweet sorghum populations and 0.38% to the total variability observed was among the studied sweet sorghum populations.

Genetic differentiations and relationships observed among sweet sorghum accessions using distance methods, pairwise differentiation, PCoA, UPGMA analysis and Bayesian model-based cluster analysis confirmed the differentiation of accessions based on their geographical location although there are considerable intermixes. As the cluster analysis of this study revealed, most of the sweet sorghum accessions collected from the same geographical areas were clustered together. The unweighted pair-group method with arithmetic mean (UPGMA) grouped the 91 sweet sorghum accessions into two broad classes. This clustering of sweet sorghum accessions showed that there was significant relationship between geographic origin and genetic similarity, although there is considerable intermixes. On the other hand, some sweet sorghum accessions that were collected from areas that are not adjacent to each other clustered under the same group. This clustering suggests that there were some level of gene flow and exchange of seeds among sweet sorghum growing areas in Ethiopia through market or breeding programs. This pattern of genetic relationships where accessions from the same or adjacent zones were genetically related could be attributed to existence of variety exchange patterns of such accessions among farmers.

PCoA is a multivariate dataset that provides the ability to find and archive key patterns in multiple loci and multiple samples [43]. PCoA can be utilized to derive a 2 or 3-dimensional scatter plot of individuals, such that the geometrical distances among individuals in the plot reflect the genetic distances among them with minimal distortion. Aggregations of individuals in such a plot will reveal sets of genetically similar individuals [4345]. PCoA results revealed similarities of accessions from North Shewa, South Wollo and East Gojam. On the other hand accessions of South Tigray and Oromia Liyu Zone showed similarity. This similarity could be because of gene flow among farmers. As it is shown by the PCoA analysis, the North Shewa populations shared with other populations. On the contrary, South Tigray population was uniquely isolated from the other populations indicating absence of exchange of genetic material. The overall distribution of the accessions on the diagram showed that there is a gene flow process among sweet sorghum accessions especially in areas that are geographically adjacent to each other. The distribution also shows that genetic diversity is quite strong among sweet sorghum populations collected from different areas of Ethiopia. According to [40,46], several factors could have contributed to the detected patterns of genetic diversity and observed structure. These include environmental, biological, cultural, and socioeconomic factors that play a role in farmer’s decisions to choose or keep a particular sweet sorghum cultivars at any given time.

Conclusions

Unlocking the genetic diversity of germplasm is the first step for improvement and development of superior cultivars for different breeding purpose. The present study reports the molecular genetic diversity studies and population structure analysis of Ethiopian sweet sorghum accessions collected from different parts of Ethiopia using SSR markers. The SSR markers employed in this study exhibited a high degree of polymorphism, effectively revealing the genetic differences among and within the sweet sorghum populations. The high informative degree of these SSR markers make them valuable candidates for future genetics studies. The results of this study showed a higher level of genetic diversity within the studied sweet sorghum populations than between sweet sorghum populations. This indicates the presence of gene flow among the sweet sorghum accessions sharing close geographical distance and similar ecology. This high level of genetic variability will have a great role for future genetic improvement through breeding. It also contributes a lot for conservation of the sweet sorghum accessions in their native habitat.

The present study identified the sweet sorghum accessions collected from different agroecologies of Ethiopia were classified into two main groups. Although there is considerable intermixes, this clustering of sweet sorghum accessions showed significant relationship between geographic origin and genetic similarity. Additionaly, the study identified the role of gene flow and geographical isolation on genetic variation exhibited within and among sweet sorghum populations.The high level of gene flow observed in the present study had a role to bring a new allele within each sweet sorghum population. In addition this high level of gene flow resulted to increase the genetic diversity within the sweet sorghum population and reduce the diversity exihibited among populations. The private alleles observed among the studied accessions provides more evidence for novel alleles that can be used for further studies. These alleles uniquely found in a specific population, can be used as a source of desirable traits. The genetic variations observed so far in these accessions can be a basis to work on sugar content, drought resistance and other important agronomic traits. Since Ethiopia is the center of origin and diversity for sweet sorghum, more accessions should be collected and studied from all agro-ecologies. In addition, large number of markers should be employed to show the extent of molecular diversity in a better way.

Supporting information

S1 Data. Recorded molecular size of 91 sweet sorghum accessions that are clustered into 5 populations amplified using 13 SSR markers.

https://doi.org/10.1371/journal.pone.0316549.s001

(XLSX)

References

  1. 1. Gnansounou E, Dauriat A, Wyman CE. Refining sweet sorghum to ethanol and sugar: economic trade-offs in the context of North China. Bioresour Technol. 2005;96(9):985–1002. pmid:15668196
  2. 2. Mokariya LK, Malam KV. Sweet sorghum: A smart, multipurpose crop. Agriculture and Environment. 2020;1(1).
  3. 3. Woods J. Integrating Sweet sorghum and sugarcane for bioenergy: Modelling the potential for electricity and ethanol production in SE Zimbabwe, Ph.D. Thesis, Kings College, London. 2000.
  4. 4. Vavilov NI. The origin, variation, immunity and breeding of cultivated plants. Chronica Botanica. 1951;13(1):1–366.
  5. 5. Monteiro JST, Havrland B, Ivanova T. Sweet sorghum (Sorghum bicolor (L.) Moench) bioenergy value – importance for Portugal. Agricultura Tropica Et Subtropica. 2012;45(1):12–9.
  6. 6. Mamo M, Worede F, Bantayehu M. Evaluation of sorghum (Sorghum bicolour (L.) Moench) genotypes for Striga resistance and yield and yield related traits. BSJ Agri. 2020;3(2):85–95.
  7. 7. Dajue, Li. Developing sweet sorghum to meet the challenge of problems on Food, Energy and Environment in 21st century”. Proceedings of first international sweet sorghum conference, Beijing, China. 1997.
  8. 8. Makanda I, Derera J, Tongoona P, Sibiya J. Development of sorghum for bioenergy: a view from the stakeholders and priorities for breeding dual purpose varieties. African Journal of Agricultural Research. 2011;6:4477–86.
  9. 9. Gerrano AS, Labuschagne MT, van Biljon A, Shargie NG. Genetic variability among sorghum accessions for seed starch and stalk total sugar content. Sci agric (Piracicaba, Braz). 2014;71(6):472–9.
  10. 10. Rao P, Srinivasa PJVNS, Umakanth AV, Reddy Belum VS. Sweet sorghum (Sorghum bicolor (L.) Moench) - A new generation water use efficient bioenergy crop. Indian Journal of Dryland Agriculture Research and Development. 2011;26(1):65–71.
  11. 11. Rao PS, Kumar CG, Reddy BVS. Sweet sorghum: from theory to practice. In: Rao PS, Kumar CG, editors. Characterization of improved sweet sorghum cultivars. Berlin: Springer. 2012; 2(25):1–15.
  12. 12. Mathur S, Umakanth AV, Tonapi VA, Sharma R, Sharma MK. Sweet sorghum as biofuel feedstock: recent advances and available resources. Biotechnol Biofuels. 2017;10:146. pmid:28603553
  13. 13. De Wet JMJ. Systematics and evolution of Sorghum sect. sorghum (Gramineae). American Journal of Botany. 1978;65(4):477–84.
  14. 14. Wiersema JH, Dahlberg J. The nomenclature of Sorghum bicolor (L.) Moench (Gramineae). TAXON. 2007;56(3):941–6.
  15. 15. Harlan JR, de Wet JMJ. A Simplified Classification of Cultivated Sorghum1. Crop Science. 1972;12(2):172–6.
  16. 16. Whitfield MB, Chinn MS, Veal MW. Processing of materials derived from sweet sorghum for biobased products. Industrial Crops and Products. 2012;37(1):362–75.
  17. 17. Ritter KB, McIntyre CL, Godwin ID, Jordan DR, Chapman SC. An assessment of the genetic relationship between sweet and grain sorghums, within Sorghum bicolor ssp. bicolor (L.) Moench, using AFLP markers. Euphytica. 2007;157(1–2):161–76.
  18. 18. Mekbib F. Farmers’ breeding of sorghum in the center of diversity, Ethiopia: I. Socio-ecotype differentiation, varietal mixture and selection efficiency. Maydica. 2009;54:25–37.
  19. 19. Lekgari A, Dweikat I. Assessment of Genetic Variability of 142 Sweet Sorghum Germplasm of Diverse Origin with Molecular and Morphological Markers. OJE. 2014;04(07):371–93.
  20. 20. Schulman AH. Molecular markers to assess genetic diversity. Euphytica. 2006;158(3):313–21.
  21. 21. Brown SM, Hopkins MS, Mitchell SE, Senior ML, Wang TY, Duncan RR, et al. Multiple methods for the identification of polymorphic simple sequence repeats (SSRs) in sorghum [Sorghum bicolor (L.) Moench]. Theor Appl Genet. 1996;93(1–2):190–8. pmid:24162217
  22. 22. Disasa T, Feyissa T, Admassu B, Paliwal R, De Villiers SM, Odeny DA. Molecular evaluation of Ethiopian sweet sorghum germplasm and their contribution to regional breeding programs. Aust J Crop Sci. 2016;10(04):520–7.
  23. 23. Doyle JJ. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemistry. 1987;19(11):1155–7.
  24. 24. Billot C, Rivallan R, Sall MN, Fonceka D, Deu M, Glaszmann J-C, et al. A reference microsatellite kit to assess for genetic diversity of Sorghum bicolor (Poaceae). Am J Bot. 2012;99(6):e245-50. pmid:22645098
  25. 25. Liu K, Muse SV. PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics. 2005;21(9):2128–9. pmid:15705655
  26. 26. Peakall R, Smouse PE. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research--an update. Bioinformatics. 2012;28(19):2537–9. pmid:22820204
  27. 27. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155(2):945–59. pmid:10835412
  28. 28. Earl DA, vonHoldt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conservation Genet Resour. 2011;4(2):359–61.
  29. 29. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14(8):2611–20. pmid:15969739
  30. 30. Tamura K, Stecher G, Kumar S. MEGA11: Molecular evolutionary genetics analysis version 11. Molecular Biology and Evolution. 2021;38:3022–7.
  31. 31. Asfaw A. Analysis of in situ diversity and population structure in Ethiopian cultivated Sorghum bicolor (L.) landraces using phenotypic traits and SSR markers. Springer Plus. 2014;3(212):1–14.
  32. 32. Weerasooriya DK, Maulana FR, Bandara AY, Alemu T, Amsalu A, Girma M, et al. Genetic diversity and population structure among sorghum (Sorghum bicolor, L.) germplasm collections from Western Ethiopia. African Journal of Biotechnology. 2016;15(23):1147–58.
  33. 33. Chesnokov YV, Artemyeva AM. Evaluation of the measure of polymorphism information of genetic diversity. Agricultural Biology. 2015;50:571–8.
  34. 34. Dutta SK, Singh S, Chatterjee D, Boopathi T, Singh A, Saha S. Morphological and genetic diversity of pole type common bean (Phaseolus vulgaris L.) landraces of Mizoram (India). Indian Journal of Biotechnology. 2016;15:550–9.
  35. 35. Botstein D, White RL, Skolnick M, Davis RW. Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet. 1980;32(3):314–31. pmid:6247908
  36. 36. Nei M. Analysis of gene diversity in subdivided populations. Proc Natl Acad Sci U S A. 1973;70(12):3321–3. pmid:4519626
  37. 37. Aljumaah RS, Musthafa MM, Al-Shaikh MA, Badri OM, Hussein MF. Genetic diversity of Ardi goats based on microsatellite analysis. African Journal of Biotechnology. 2012;11(100):16539–45.
  38. 38. Nei M. Genetic distance between populations. The American Naturalist. 1972;106(2):283–92.
  39. 39. Wright S. Evolution and the genetics of populations. In: Variability within and Among Natural Populations, University of Chicago Press, Chicago.1978;4.
  40. 40. Salih SA, Labuschange LT, Mohammed AH. Assessment of genetic diversity of Sorghum [Sorghum bicolor (L.) Moench] germplasm in East and Central Africa. World J Biol Biotechnol. 2016;1(3):115.
  41. 41. Nemera B, Kebede M, Enyew M, Feyissa T. Genetic diversity and population structure of sorghum [Sorghum bicolor (L.) Moench] in Ethiopia as revealed by microsatellite markers. Acta Agriculturae Scandinavica, Section B — Soil & Plant Science. 2022;72(1):873–84.
  42. 42. Alemu T, Tesfaye T, Asfaw A, Hussein M, Dan K. Genetic diversity among Ethiopian sorghum [Sorghum bicolor (L.) Moench] gene bank accessions as revealed by SSR markers. Afr J Biotechnol. 2020;19(2):84–91.
  43. 43. Mohammadi SA, Prasanna BM. Genetic Diversity Analysis-Statistical Tools and Considerations. Crop Science. 2003; 43:1235–1248.
  44. 44. Karp A, Kresovich S, Bhat KV, Ayad WG, Hodgkin T. Molecular tools in plant genetic resources conservation: A guide to the technologies. IPGRI, Rome.1997.
  45. 45. Warburton M, Crossa J. Data analysis in the CIMMYT Applied Biotechnology Center for Fingerprinting and Genetic Diversity Studies. CIMMYT, Mexico. 2000.
  46. 46. Hobbs RJ, Huenneke LF. Disturbance, Diversity, and Invasion: Implications for Conservation. Conservation Biology. 1992;6(3):324–37.