A genome‑wide approach to the systematic and comprehensive analysis of LIM gene family in sorghum (Sorghum bicolor L.)

Article information

Genomics Inform. 2023;21.e36
Publication date (electronic) : 2023 September 27
doi : https://doi.org/10.5808/gi.23007
1Department of Genetic Engineering and Biotechnology, Faculty of Biological Science and Technology, Jashore University of Science and Technology, Jashore 7408, Bangladesh
2Department of Genetic Engineering and Biotechnology, Faculty of Biological Sciences, University of Rajshahi, Rajshahi 6205, Bangladesh
*Corresponding author: E-mail: rauf.gebt@yahoo.com
Received 2023 February 2; Revised 2023 June 23; Accepted 2023 August 9.

Abstract

The LIM domain-containing proteins are dominantly found in plants and play a significant role in various biological processes such as gene transcription as well as actin cytoskeletal organization. Nevertheless, genome-wide identification as well as functional analysis of the LIM gene family have not yet been reported in the economically important plant sorghum (Sorghum bicolor L.). Therefore, we conducted an in silico identification and characterization of LIM genes in S. bicolor genome using integrated bioinformatics approaches. Based on phylogenetic tree analysis and conserved domain, we identified five LIM genes in S. bicolor (SbLIM) genome corresponding to Arabidopsis LIM (AtLIM) genes. The conserved domain, motif as well as gene structure analyses of the SbLIM gene family showed the similarity within the SbLIM and AtLIM members. The gene ontology (GO) enrichment study revealed that the candidate LIM genes are directly involved in cytoskeletal organization and various other important biological as well as molecular pathways. Some important families of regulating transcription factors such as ERF, MYB, WRKY, NAC, bZIP, C2H2, Dof, and G2-like were detected by analyzing their interaction network with identified SbLIM genes. The cis-acting regulatory elements related to predicted SbLIM genes were identified as responsive to light, hormones, stress, and other functions. The present study will provide valuable useful information about LIM genes in sorghum which would pave the way for the future study of functional pathways of candidate SbLIM genes as well as their regulatory factors in wet-lab experiments.

Introduction

LIM proteins belong to the family of cysteine-rich proteins (CRPS), which are found in almost all eukaryotic organisms [1]. The name LIM denotes the acronyms of three LIM domain-containing proteins which are LIN11, ISL1, as well as MEC3. ISL1 was first identified in rats, while LIN11 and MEC3 were observed in Caenorhabditis elegans [2]. During transcription, LIM domains are actively associated with the process of DNA binding. A minimum of one LIM domain, which is responsible for mediating protein interactions, is present in LIM proteins found in eukaryotic organisms. There have been observations that LIM proteins sometimes include either homologous domains or protein kinase domains. LIM proteins serve two important functions: (1) as transcription factors (TFs), LIM proteins in plants are responsible for the regulation of the gene expression associated with a pathway of phenylpropanoid biosynthesis [3]; and (2) actin-binding proteins, regulating the structure of the actin cytoskeleton. Both of these functions are important for plant survival [4]. The LIM is a zinc coordination domain that contains over 50 amino acids and is abundant in cysteine and histidine [5]. The sequence of the conserved LIM domain is [C-X2-C-X17-H-X2-C]-X2-[C-X2-C-X17-CX2-H] which is shared by many species [6].

It has also been discovered that the LIM protein family has a function in cell migration in animals, which suggests that it may have a role in both the development of tumors and metastasis [7]. Among plants, LIM proteins were detected for the first time in sunflowers, where it was shown as exclusively produced in the plant's pollen [8]. Some varieties of plants have been discovered to have LIM proteins. Some of these plants include Arabidopsis [9], tobacco [10], tomato [11], cotton [12], etc. CRP and DAR are two separate subfamilies of LIM proteins. Similar to the CRP (cysteine-rich protein) family in animals, the CRP family has two conserved LIM domains [13]. The C-terminus of LIM proteins that are members of the CRP subfamily is short, and there is a total absence of glycine-rich region after the LIM domain. Based on the patterns of their expression, CRP proteins in plants are classified into two categories: WLIMs (WLIM1 as well as WLIM2), which are found in a wide range of plant tissues; PLIMs (PLIM1 as well as PLIM2) are expressed significantly in pollen tubes [13]. The plant CRP proteins are divided into four subclasses: αLIM1, βLIM1, γLIM2, as well as δLIM2 based on the similarities in their amino acid sequences. The αLIM1 represents PLIM1, WLIM1, as well as FLIM. While γLIM2 and δLIM2 include WLIM2 as well as PLIM2 respectively. On the other hand, βLIM1 is a recently found LIM protein that contributes to the formation of a novel cluster in the phylogenetic tree whose precise function is not yet understood [13]. The LIM subfamily DAR/DA1 proteins are distinguished from one another by their presence of a conserved single LIM domain, single or multiple ubiquitin interaction motif domains, as well as conserved C-terminus. Based upon similarities in their amino acid sequences, DAR/DA1 may be categorized into two classes: class I as well as class II. Only terrestrial plants contain DAR/DA1 proteins [14].

There was an observation of interaction between LIM proteins and actin proteins, and also the participation of LIM in the pathway of phenylpropane secondary metabolism, playing an important function in regulating the development of plant cells as well as stress tolerance [15]. For example, the tobacco (Nicotiana tabacum) NtLIM1 protein initiates a crucial gene PAL (phenylalanine ammonia-lyase) related to the metabolism of lignin after binding with the PAL box in the promoter [16]. Besides, not only GhWLIM1a overexpression of upland cotton enhanced the cinnamyl coenzyme A reductase (CCR), 4-coumarate: coenzyme A ligase as well as cinnamyl alcohol dehydrogenase, but also altered the fibroblasts secondary cell walls constitution [17]. Furthermore, overexpression of PtaGLIMa may alter poplar xylem development as well as lignin contents [18]. LIM domain proteins are localized in the nucleus, cytoplasm, and cytoskeletal which indicates their involvement in communication between nucleus, and cytoplasm [19]. LIM protein’s nuclear localization is critical for regulating tissue-specific genes as well as differentiation in the cell. For example, in tobacco, LIM1 proteins localized in the nucleus initiate gene expression of glucuronidase reporters [20]. In addition, six LIM domain-containing proteins are found in the actin cytoskeleton in Arabidopsis thaliana as well as may bind to actin protein which is controlled by Ca2+ binding activity and pH [21]. Recent research has revealed that expression in the LIM gene changes by responding to biotic as well as abiotic stresses like salt, drought, as well as various hormones [14]. Studies have found that in plants, the cytoskeleton performs a very important function in various stress responses [22-24]. The cytoskeleton is associated with various cell life activities, as well as cytoskeleton elements recombination can be a mechanism of abiotic stress tolerance. LIM genes have been found to carry out a valuable function in responding to resisting abiotic stress in plants [14]. As a result, the family of the LIM gene is a very important and interesting topic of research in plant science [25,26].

Sorghum (Sorghum bicolor L.) is a C4 type most common perennial cereal crop belonging to the Gramineae family grown all over the world [27]. It is extensively used in the food, feed, brewing, and biofuel industry [28] due to its abundance in vitamins, proteins, polyphenols, and resistant starch [29,30]. It occupied the fifth position all over the world regarding both area cultivated and production among grain crops [31]. Sorghum has evolved into many different varieties as a result of long-term environmental adaptation, but still, there is a big impact of some severe abiotic and biotic stresses on the plant's growth as well as development. For instance, S. bicolor plants exhibit decreased weight of single-grain and floret fertility in high temperatures, which lowers yield [32,33]; low temperatures weaken this crop's potential for growth, and frost generally causes significant damage to plants [34]. Although sorghum has a strong root system which helps it endure drought to some extent [35,36], long-term severe drought impacts negatively growth as well as yield [36]. Pests, illnesses, weeds as well as other biotic factors all significantly reduce yields during the production of sorghum [35]. Reduction in sorghum production due to these abiotic, biotic factors makes it necessary to develop a sorghum variety resistant to these factors. As LIM genes are found to be associated with various stress responses in plants [14], it will have significant research and economic value to identify and characterize LIM genes in sorghum.

Whole genome sequencing of S. bicolor in 2009 allowed us to subsequently investigate, clone, and validate the LIM genes involved in stress resistance [37]. The genome of S. bicolor is 750 Mb long, with approximately 30,000 genes, which is about 75% higher compared to rice [38]. Previous studies identified six LIM proteins in Arabidopsis [13], 10 in foxtail millet [39], six in rice [13], 22 of brassica [40], 14 in pear [41] as well as 12 in poplar [42]. It has been found that a family of LIM proteins may carry out a significant role in various important biological functions in plants, however, still, there are no genome-wide studies on LIM genes in sorghum to date. As a result, we identified and in silico characterized the LIM gene family in sorghum genome. The goal of our present study was to accumulate valuable information to further study the plants LIM genes as well as a solid ground for future research of LIM genes in sorghum.

Methods

Genome-wide identification of LIM genes in sorghum

The whole genome and protein sequences of S. bicolor were retrieved from a database called Phytozome v13.0 (http://www.phytozome.net) (Supplementary Figs. 18). For retrieving the all-LIM genes in S. bicolor, all the published sequences as well as annotation information of the LIM gene family of Arabidopsis were obtained from the TAIR database (http://www.arabidopsis.org/). The domains of the LIM protein family were collected from the Pfam database (http://pfam.janelia.org/) using Hidden Markov Model and subsequently, the typical LIM domains were utilized to find out LIM genes in sorghum using a program named BlastP. At last, the probable candidate LIM gene sequence of S. bicolor was obtained through the databases Pfam (http://pfam.xfam.org/family), NCBI-CDD (https://www.ncbi.nlm.nih.gov/cdd/) as well as SMART (http://smart.embl-heidelberg.de/) for prediction of conserved domains of the protein and utilized for subsequent analysis.

The gene length, open reading frame, primary transcript, and chromosomal location of the predicted genes were obtained from the genome database of S. bicolor stored in the Phytozome database. In addition, the basic physiochemical properties of LIM proteins of sorghum such as the number of amino acids, molecular weight, values of isoelectric points (pI) as well as grand average hydrophilicity (GRAVY) values of identified LIM proteins were deeply studied by an online tool ExPASy (https://web.expasy.org/protparam/).

Phylogenetic analysis and multiple sequence alignment

A phylogenetic tree was built using candidate LIM protein sequences of sorghum and previously identified LIM protein sequences of 10 different plant species, including Arabidopsis (Arabidopsis thaliana L.), tobacco (Nicotiana tabacum L.), medicago (Medicago truncatula L.), soybean (Glycine max L.), tomato (Solanum lycopersicum L.), quinoa (Chenopodium quinoa L.), maize (Zea mays L.), rice (Oryza sativa L.), and wheat (Triticum aestivum L.). The MEGA 11.0 software was used to import all the LIM protein sequences, and the Clustal W method with default parameters as well as bootstrap value 1,000 was utilized to perform multiple sequence alignment (MSA). A phylogenetic tree was built based on the Neighbor-joining method [43] using the Clustal Omega approach (https://www.ebi.ac.uk/Tools/msa/clustalo/) [44]. Additionally, thousands of bootstrap replicates were used to find out the evolutionary relationships, and evolutionary distances were predicted by using the Equal Input method [45]. The MSA was performed against sequences of LIM proteins of sorghum as well as Arabidopsis using Clustal X (version 2.1) [46], Clustal W method [47], and MEGA 11.0 software [48].

Analysis of conserved functional domains and motifs

The conserved functional domains of predicted LIM genes in sorghum and previously identified LIM genes of Arabidopsis were investigated using the online databases Pfam, NCBI-CDD, and SMART. Additionally, conserved structural motifs in proteins of sorghum and Arabidopsis were predicted using an online bioinformatics approach called Multiple Expectation Maximization for Motif Elicitation (MEME-Suite v5.3.3) [49]. This MEME analysis was performed by utilizing some specific parameters such as (1) maximum motif number of 20 as well as (2) optimum motif length of ≥6 and ≤50.

Analysis of gene structure and chromosomal localization

The structure (exon-intron configuration) of LIM genes in sorghum and Arabidopsis was studied by using the online tool Gene Structure Display Server (GSDS2.0: https://gsds.cbi.pku.edu.cn) [50]. Besides, chromosomal localizations of the candidate LIM genes of sorghum were predicted using an online bioinformatics tool called MapGene2Chromosome V2 (http://mg2c.iask.in/mg2c_v2.0/) [51].

Subcellular localization and gene ontology analysis

The subcellular localizations of the candidate LIM proteins were predicted into different cell organelles by the online tool plant subcellular localization integrative predictor (PSI) (http://bis.zju.edu.cn/psi/) [52]. Gene ontology (GO) enrichment analysis was performed for predicting the functions of identified LIM genes in sorghum utilizing the online tool Plant Transcription Factor Database (PlantTFDB, http://planttfdb.cbi.pku.edu.cn//) [53].

Analysis of regulatory network between TFs and LIM genes in S. bicolor

The online tool PlantTFDB 4.0 (http://planttfdb.cbi.pku.edu.cn//) [53] was used to predict various important TFs related to the candidate LIM genes. In addition, the interaction network between the predicted LIM genes and TFs was built and displayed by another online tool called Cytoscape 3.9.1 [54].

Analysis of cis-acting regulatory elements of promoter

The cis-acting regulatory elements (CAREs) related to different factors responsive were predicted by analyzing 2.0 kb upstream sequences from the start codon of the candidate LIM genes using a web-based prediction tool called Signal Scan search program of PlantCARE database (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) [55].

Results and Discussion

Identification as well as physiochemical properties of LIM genes in sorghum

Plants encountered a variety of biotic, abiotic stresses at the time of growth and development. At these adverse conditions, various TFs carry out vital roles in functioning. Several biotic, abiotic stresses trigger different TFs of LIM genes. In this study, five LIM genes were detected in sorghum by analyzing the whole genome sequence of sorghum compared with LIM genes of Arabidopsis (Table 1). We named the genes SbLIM1-SbLIM5 based on the position of each candidate LIM gene in the sorghum chromosome. The length of SbLIM genes varies from 1,488 bp to 8,120 bp. The range of coding sequence length of SbLIMs is 585 bp to 4,428 bp. LIM proteins amino acids number ranges from 195 to 1,476 and their molecular weights vary between 21.615 kDa and 158.98 kDa (Table 1). Theoretical pI (isoelectric point) values of SbLIM proteins vary from 4.63 to 8.98 and the GRAVY was found below 0, which suggests that all SbLIM proteins are hydrophilic. Previous studies identified six LIM genes in both Arabidopsis and rice [13], 12 in poplar [42], 10 in foxtail millet [39], 14 in pear [41] as well as 22 in brassica [40] which suggests that LIM gene’s numbers vary among different plant species.

The physicochemical properties of Sorghum bicolor LIM gene family members

Phylogenetic tree analysis

A complete understanding of a species history of evolution is the primary step for understanding the gene’s evolutionary relationships, which may help study their functions as well as evolution. In a phylogenetic tree, the closer the genes in a cluster are, the more alike the function of genes [56]. To find out the relationships of sorghum’s LIM genes with other plant species, we built a phylogenetic tree by using LIM proteins of sorghum, Arabidopsis, rice, wheat, corn, tobacco, tomato, Medicago, alfalfa, soybean, and quinoa. According to this phylogenetic tree sorghum LIM genes were classified into four individual subfamilies, named αLIM, βLIM, γLIM, and δLIM (Fig. 1). Among these families, αLIM includes one (SbLIM4) protein, while βLIM, δLIM, and γLIM include one (SbLIM5), two (SbLIM2 and SbLIM3), and one (SbLIM1) protein, respectively. The phylogenetic tree revealed that sorghum’s SbLIM1, SbLIM2, and SbLIM4 genes are closely related to Zea mays ZmWLIM2, ZmPLIM2b, and ZmWLIM1 genes respectively, while SbLIM3 and SbLIM5 showed a close relationship with rice OsPLIM2b & ZmPLIM2a and OsLIM respectively. These outcomes suggest that sorghum, corn, and rice LIM genes may remain vastly conserved during evolution, which can be passed through the studied species and they may provide similar types of functions.

Fig. 1.

Phylogenetic tree displaying the evolutionary relationships of sorghum LIM proteins to that of Arabidopsis, wheat, rice, tobacco, corn, tomato, soybean, Medicago, quinoa, and alfalfa. The name of each LIM protein starts with a species acronym: At, Arabidopsis thaliana; MS, Medicago sativa L; Mt, Medicago truncatula; Nt, Nicotiana tabacum; Sl, Solanum lycopersicum; Gm, Glycine max; Cq, Chenopodium quinoa; Os, Oryza sativa; Zm, Zea mays; Ta, Triticum aestivum.

MSA analysis

MSA of predicted sorghum LIM proteins with previously identified Arabidopsis LIM proteins showed that they both contain two LIM domains (LIM1 and LIM2) which are separated by an inter LIM region (Fig. 2). Individual amino acids including most conserved cysteine-histidine residues, were found conserved in LIM domain as well as other regions of the sequence. Immediate flanking regions of LIM1, as well as LIM2 domains, also exhibited sequence similarity. For example, the N-terminus and around 14 amino acid residues of the C-terminal of LIM2 are found well conserved. In addition, Inter LIM region exhibits a similarity of around 15 residues of the C-terminal of LIM1 and 12 residues within the N-terminal of LIM2. Therefore, this study suggests that LIM proteins of sorghum may have similar kinds of functionality to LIM proteins of Arabidopsis.

Fig. 2.

Multiple sequence alignment analysis of LIM gene’s amino acid sequences of sorghum and Arabidopsis by using Clustal X (version 2.1) and Clustal W program in MEGA 11.0.

Analysis of conserved domain of SbLIM proteins

To deeply analyze the functional domain of five SbLIM proteins, we utilized the protein family database (Pfam, https://pfam.xfam.org/) to get conserved domains of LIM proteins in sorghum. We also analyzed the conserved domain of six LIM proteins of Arabidopsis to compare with the SbLIMs domain. The analysis revealed that all identified SbLIM proteins possess two LIM domains spaced by a small amino acid sequence called inter LIM region which was almost similar to all AtLIM proteins (Fig. 3). Therefore, this investigation suggests that LIM proteins of sorghum will give functionality similar to those of LIM proteins of Arabidopsis.

Fig. 3.

The conserved domains of the predicted SbLIM proteins were drawn by Pfam, SMART, and NCBI-CDD database information.

Motif analysis of SbLIM genes

The Multiple Expectation Maximization for Motif Elicitation (MEME) was used to predict motifs for each member of the SbLIM protein families against 20 motifs of Arabidopsis. The number of motifs in five SbLIM proteins (SbLIM1-SbLIM5) is 7, 6, 7, 7, and 18 respectively. On the other hand, motifs number in AtLIM proteins varies from 6 to 10. SbLIM5 contains the highest number of motifs. SbLIM2 was found identical to Arabidopsis AtWLIM1, AtWLIM2a, and AtPLIM2c respectively in terms of motifs and motifs number (6 motifs) which suggests that they will give similar types of functions (Fig. 4). In addition, SbLIM1, SbLIM3, and SbLIM4 may have similar functionality like Arabidopsis AtPLIM2a and AtPLIM2b as they are very close in terms of motif number and configuration. Among 20 motifs, 5 motifs were found highly conserved as they were observed in all LIM proteins of Arabidopsis and sorghum. Collectively, motif analysis suggests that SbLIM proteins may have similar functions to LIM proteins of Arabidopsis.

Fig. 4.

The conserved motifs of the predicted SbLIM protein families are drawn using MEME-suite (a maximum of 20 motifs are analyzed). (A) Different colors indicated individual motifs identified in each SbLIM protein domain. (B) The sequence motifs logos are present in Sorghum bicolor SbLIM proteins.

Gene structure analysis

The structure of a gene defines its function. For example, by changing the rate of transcription, nuclear output as well as stability of transcription, introns may promote transcription level [57]. For observing the gene structure of predicted SbLIM genes, their exon-intron structure was studied using GSDS 2.0 with the A. thaliana respective LIM gene family. The exon-intron structure of predicted SbLIM genes exhibited a significant amount of conservation with AtLIM genes of Arabidopsis as expected (Fig. 5). Gene structure analysis revealed that all SbLIM genes contain five exons and four introns which is similar to those of Arabidopsis AtPLIM2b, AtWLIM1, and AtWLIM2a. While, the number of introns in AtPLIM2a, AtPLIM2c, and AtWLIM2b are 2, 3, and 5, respectively. In addition, exon-intron length and their relative positions were also found quite similar in all SbLIM and AtLIM genes except for the SbLIM05 gene. It was also observed that genes within a cluster in the phylogenetic tree exhibit a considerable amount of similarity in exon-intron configuration, which supports phylogenetic tree construction and also confirmed conserved evolutionary characteristics of the LIM gene family. Most of the plants LIM genes contain 4–5 introns, as well as exon numbers 1, 2, and 4 of LIM genes remain highly conserved [13] which is consistent with the present study. In this study, the exon-intron configuration of SbLIM genes was found a significant amount of similarity to Arabidopsis AtLIM genes which suggests that our candidate SbLIM genes may give similar functions to those of Arabidopsis LIM genes.

Fig. 5.

Gene structure of the predicted SbLIM genes in Sorghum bicolor with Arabidopsis by using Gene Structure Display Server (GSDS 2.0, http://gsds.cbi.pku.edu.cn/index.php).

Analysis of chromosomal localization

To find out the chromosomal localizations of predicted five SbLIM genes a detailed chromosomal map was constructed. The chromosomal mapping revealed that SbLIM genes were unevenly distributed among five different chromosomes out of 20 chromosomes of sorghum (Fig. 6). The number of chromosomes corresponds to SbLIM1, SbLIM2, SbLIM3, SbLIM4, and SbLIM5 genes were observed are Chr01, Chr04, Chr06, Chr08, and Chr010, respectively. As they are located in six independent chromosomes, therefore there is no chance for them to be linked and will independently segregate at the time of cell division based on Mendelian genetics [58]. There is still a probability of happening recombination events among these genes. There is a sequence similarity present in different SbLIM genes indicating that the LIM gene has been recombined or duplicated at the time of the evolution of sorghum. During the evolution of plants, gene or chromosomal duplication phenomenon plays a very important role in multiplying family members [59]. All the predicted SbLIM genes were only observed in chromosome numbers Chr01, Chr04, Chr06, Chr08, and Chr10 which may suggest that generation of the sorghum LIM gene family has happened after occurring events of chromosomal duplication.

Fig. 6.

The chromosomal localization of the predicted SbLIM genes. The scale to indicate the chromosomal length is provided on the left.

Analysis of subcellular localization

The subcellular localization study exhibited that the identified SbLIM proteins are mainly localized in two subcellular organelles: nucleus and peroxisome (Fig. 7). Notably, all SbLIM proteins were predominantly found in the nucleus. Additionally, SbLIM2, SbLIM3, and SbLIM4 proteins were also observed in the peroxisome. Biological processes and functions in plants are highly dependent on the subcellular locations of specific proteins [60,61]. LIM proteins located in the nucleus are involved in transcriptional regulation of developmental gene expression [16,17,26,62]. Therefore, it can be hypothesized that predicted SbLIM proteins can have significant roles in regulating gene expression of developmental genes too. LIM protein’s subcellular localization in different organelles suggests that they may give different functionality [1] which suggests that SbLIM2, SbLIM3, and SbLIM4 proteins may provide similar kinds of functions due to their localization in the peroxisome.

Fig. 7.

Subcellular localization analysis for the SbLIM proteins. The predicted SbLIM proteins were analyzed in nuclear, mitochondrial, cytoplasmic, chloroplast, cytoskeletal, peroxisomal, plasma membrane, extracellular, golgi, lysosomal, endoplasmic reticulum, and vacuole.

GO analysis of SbLIM genes in sorghum

To predict functions of identified SbLIM genes, GO analysis was used in our study. The analysis predicted a total of 24 GO terms in all SbLIM genes (Fig. 8A, Supplementary Table 1). We sorted the GO terms into two classes based on their function. One class is called biological process (P) as well as the other one is called molecular function (F). The biological process was found more abundant between these two classes which include 14 GO terms (GO:0071840, GO:0016043, GO:0006996, GO:0044085, GO:0043933, GO:1902589, GO:0022607, GO:0071822. GO:0007010, GO:0030029, GO:0030036, GO:0007015, GO:0051017, and GO:0061572) (Fig. 8B). While the class of molecular function includes 10 GO functions (GO:0043167, GO:0044877, GO:0043169, GO:0046872, GO:0008092, GO:0032403, GO:0003779, GO:0008270, and GO:0051015) (Fig. 8C). Out of 24 GO terms, five terms (GO:0043167, GO:0043169, GO:0046872, GO:0046914, and GO:0008270) of the F category were found in all SbLIM genes. While the other 19 GO functions were observed only in three SbLIM genes (SbLIM01, SbLIM02, and SbLIM04) (Supplementary Table 1).

Fig. 8.

(A) The heatmap for the predicted Gene Ontology (GO) terms corresponding to the predicted SbLIM genes is represented for biological process (B), molecular functions (C), and their association to the SbLIM genes whether they are present or absent.

We also performed a p-value analysis of identified GO functions. The results revealed that different GO terms have different p-values. GO:0043167 has a maximum p-value of 0.00523, next GO:0071840 (p = 0.00329) and GO:0016043 (p = 0.00214), then GO:0006996 (p = 0.00037), GO:0043169 (p = 0.00026), etc. (Supplementary Table 1). On the other hand, the p-value of the rest of the GO terms is quite similar. Besides, analysis of p-values against F and P function categories was done individually. We observed that among the F function category, GO:0043167 has the highest p-value of 0.00523, then GO:0043169 (p = 0.00026), next GO:0046872 (p = 0.00025), etc. While among GO terms of P function categories, GO:0071840 was found with the highest p-value of 0.00329, subsequently, GO:0016043 (p = 0.00214), GO:0006996 (p = 0.00037), etc. (Supplementary Table 1). Therefore, this investigation predicts that SbLIM proteins of sorghum may have functions in organ development, cytoskeletal formation, and binding activities with various partners. Thus, after binding with partners like DNA, proteins, and ions SbLIM proteins may regulate the expression of target genes. These results showed a significant amount of consistency with earlier studies related to the LIM protein family in plants [15].

Analysis of regulatory relationships between TFs and LIM genes in sorghum

TFs carry out a crucial role in a variety of biological functions in living systems, particularly in plants. For example, plant’s responses to various biotic, and abiotic stresses, metabolism, growth, and development as well as defense against different microbial infections [63-67]. In plants, TFs work as a key regulator a molecular switch for various functional genes which express under specific conditions like stresses, growth, as well as development. There is a variety of TFs such as ERF, MYB, Dof (DNA-binding one finger), bZIP, CBF/DREB1, NAC, HSF, AP2/EREBP, WRKY, MIKC_MADS, TGA6, and BOS1 families which present in plants as well as play important functions in response to various biotic abiotic stresses as well as developmental conditions [65,68,69].

In this study, out of 334 TFs, a total of 224 unique TFs were detected which regulate the identified SbLIM genes (Fig. 9A, Supplementary Table 2). These detected TFs were divided into 34 groups based on TF families. The analyzed network revealed that different families of TFs showed different structures and connected to the candidate SbLIM genes. Such as TFs of the ERF family are dominantly linked to the SbLIM05 gene. Besides, the rest of the four SbLIM genes were also found to be controlled by the ERF family; all of them were also linked to SbLIM05. Additionally, ERF, MYB, WRKY, NAC, bZIP, C2H2, Dof, and G2-like families included a total of 136 TFs which may have an important role in controlling SbLIM genes because they are the major eight families containing 33, 20, 13, 15, 9, 18, 9, and 19 TFs respectively which accounted 60.71% of total 224 identified TFs (Fig. 9B, Supplementary Table 3).

Fig. 9.

(A) The regulatory network among the transcription factors (TFs) and the predicted SbLIM genes. Each node of the network was colored differently based on SbLIM genes and TFs. SbLIM genes were represented by orange color. Different node symbols were used for different families of TFs. TFs were configured at the hub node level using black. (B) The map represents the related number of TFs with the predicted SbLIM LIM genes. (C) SbLIM gene-mediated sub-network for ERF, MYB, WRKY, NAC, bZIP, C2H2, Dof, and G2-like TFs families which is expressed as Heatmap.

Further, studied the sub-network relationships among TFs as well as candidate SbLIM genes (Fig. 9C, Supplementary Table 4). The results showed that all major TFs families (ERF, MYB, WRKY, NAC, bZIP, C2H2, Dof, and G2-like) are connected to all SbLIM genes in sorghum except for Dof, NAC, and WRKY which are not linked to SbLIM2, SbLIM4, and SbLIM3 and SbLIM4 respectively. In addition, by performing node degree analysis we identified 14 hub TFs which were found to have interactions with all the predicted SbLIM genes. Among 14 hubs TFs, 12 belong to the ERF family, and the rest two TFs belong to the C2H2 family.

The ERF (ethylene response factor) is involved in ethylene (ET) signaling as well as the response pathway in plants which has a characteristic single AP2 domain [70]. ERF also responds to plant hormones with enhanced survival in stressful conditions. Such as some AP2/ERF families show response to plant growth regulators (PGRs) abscisic acid (ABA) as well as ET for stimulating ABA, ET dependent, or independent stress-responsive (SR) related genes [71]. A previous experimental study showed that an ERF (SlERF5/ERF5) helps to enhance adaptation to salt and drought tolerance in tomato [72]. MYB TFs were seen also in high numbers in plants. In Arabidopsis, MYB TFs comprised around 9% of total TFs [73]. This TF family is also connected to various important biological processes in plants, for example, defense as well as stress responses, circadian rhythm, cell fate as well as the identity, floral and seed development, and primary as well as secondary metabolism regulation [73,74]. The C2H2 TFs family encodes proteins that carry out various important functions in growth, development, as well as biotic stress resistance in plants [75]. There has been a considerable amount of evidence showing that the family of bZIP TFs plays key roles in various biological processes in plants, including embryogenesis, seed maturation, organ differentiation, flower as well as vascular development [76]. Growing evidence has also revealed that bZIP TFs have an important part in regulating plant’s response to biotic as well as abiotic stresses [76,77]. The Dof is a plant-specific gene family of TFs seen in green algae to higher plants which have characteristics of bifunctional binding with DNA as well as proteins to regulate the transcriptional machinery of plant cells [78,79]. Dof is also associated with gene regulation relating to seed maturation as well as germination, plant-hormone as well as light-dependent regulation, and plant’s resistance to biotic as well as abiotic stresses [78-80].

The patterns of expression of the WRKY TF family are related to the defense mechanism against necrotrophic pathogens, and biotrophic pathogens as well as also act as anti-microbial defense [81-83]. Besides, the expression of LIM gene families can be regulated by MYB, NAC, and WRKY TFs families at the time of different adverse conditions, as well as they, have a direct or indirect link to plant development, and stress response [82,84-86]. Regulation of defensive gene expression is carried out through interactive actions of Calmodulin with the various specific TFs for example MYB, NAC, and WRKY [87,88].

The G2-like (GOLDEN2-LIKE) proteins are members of the GARP (Golden2, ARR-B as well as Psr1) domain superfamily of TFs [89,90]. The G2-like TF family plays significant roles in forming and developing chloroplasts [91-95] as well as has been related to various defense mechanisms in organisms, including biotic, and abiotic stresses [96-100]. MIKC_MADS TFs family carry out a great role in the development of flowers as well as fruit [101]. The basic helix-loop-helix TF family carries out significant roles in plant’s stress tolerance, besides their great activities in reproduction, for example in the development of flower as well as fruit [102] and secondary metabolites biosynthesis, e.g., anthocyanin [103]. The analysis of the regulatory network precisely revealed that the predicted SbLIM genes and their associated TFs in S. sorghum will show a broad range of expression patterns which may be obtained by a thorough study of these genes in the future.

CAREs analysis of LIM genes in sorghum

The CAREs are composed of non-coding short DNA sequences ranging from 5 to 20 bp in length which are normally present in the promoter regions of a gene [104,105]. They contain binding sites for TFs as well as other various regulatory elements for triggering the transcription of genes [105]. In plants, CAREs provide defensive functions against various biotic, and abiotic stresses [104] and play an important role in developmental as well as physiological activities by regulating gene expression [105]. The CAREs analysis was performed to search for functional variations of motifs relating to the promoter region of the proposed LIM genes in S. bicolor. The database PlantCARE provided the required information about the motifs as well as their functions related to the genes. This analysis identified a total of 47 functional motifs relating to light-responsive (LR), hormone-responsive (HR), SR, and other functions (OT) (Fig. 10, Supplementary Table 5). We found that the maximum number (21 motifs) of motifs were LR, vastly located in the promoter region of all SbLIM genes. Among LR motifs AE-box, G-box, GATA-motif, GT1-motif, I-box, Sp1, and TCT-motif were common in most of the SbLIM genes in sorghum. Photosynthesis highly relies on the response of light which typically occurs in the leaves of every plant. It is also a crucial physiological factor in sorghum like other plants which is ultimately related to various aspects of increasing the quality and productivity of grain [106]. An enhanced photosynthesis rate can use solar radiation appropriately resulting in reduced flowering time as signals of flowering are generated in leaves [107]. Therefore, our identified LR-associated CAREs are supposed to have direct connections to increased photosynthesis rates in sorghum leaves.

Fig. 10.

The cis-acting regulatory elements (CAREs) in the upstream promoter region of predicted SbLIM genes, respectively. The deep color represents the presence of that element with the corresponding genes. LR, light-responsive; HR, hormone-responsive; SR, stress-responsive; OT, other function.

PGRs play important regulatory roles coordinately or individually in plant growth as well as different development actions [108,109]. PGRs have a significant biological role in the germination of seeds and growth, development as well as metabolic activities in plants [104,110,111]. Eleven CAREs related to significant plants HR were recognized in this study. Among the HR motifs, abscisic acid responsiveness, CGTCA-motif (MeJA-responsiveness), GC-motif (associated in anoxic specific inducibility), O2-site (zein metabolism regulation), TCA-element (salicylic acid responsiveness), TGA-element (auxin responsiveness) and TGACG-motif (MeJA-responsiveness) were shared by most of the SbLIM genes. HR motifs prediction suggests that they have very significant biological functions in sorghum. MBS (MYB binding site) related to drought inducibility [112], TC-rich repeats associated with defense as well as stress responsiveness [113], and long terminal repeat elements involved in response to low-temperature were commonly observed as SR CAREs among all predicted SbLIM genes of S. bicolor.

There were also 12 other important CAREs found and designated as other functions (OT). A-box, ARE (essential for the anaerobic induction), CAAT-box (a common element in promoter as well as enhancer regions), CAT-box (involved in meristem expression), CCAAT-box (MYBHv1 binding site), GCN4_motif (associated with endosperm expression), TATA-box (core promoter element about –30 of transcription start), and WUN-motif (wound-responsive element) were considered as other CAREs shared by the maximum number of predicted SbLIM genes of sorghum which was consistent with the previous studies [40,114,115]. In addition, 35 CAREs with unknown functions were also identified in our study (Supplementary Table 5). Collectively, CAREs distributed among putative LIM gene family in sorghum will give important information about their roles in growth, development as well as defense against various biotic and abiotic stresses.

In the present study, in total five LIM genes were identified in the sorghum genome. Moreover, in silico characterization was performed using the integrated bioinformatics approaches. Phylogenetic tree analysis demonstrated that identified SbLIM genes were classified into four individual subfamilies, named αLIM, βLIM, δLIM, and γLIM. Configuration of the conserved domain, motifs as well as gene structures exhibited the highest similarity compared with the Arabidopsis LIM gene family. Besides, analysis of the GO of predicted SbLIM genes revealed that they are mainly involved in actin cytoskeletal organization and metal ion binding. In this study, we discovered a network of regulatory relationships between TFs as well as predicted LIM genes. Possible TFs and CAREs associated with plant growth, development, and gene regulation related to SbLIM genes were recognized. Therefore, these findings will provide a solid ground for further functional study of LIM genes in S. bicolor for clarifying their regulatory functions in plant growth, and development as well as biotic and abiotic stress resistance.

Notes

Authors’ Contribution

Conceptualization: MARS, SMR. Data curation: MARS, MSI, FTZ. Formal analysis: MARS, MSI. Methodology: MARS, MSI, FTZ. Writing – original draft: MARS, SS, FTZ. Writing – review & editing: MARS, SS, MSI, FTZ, SMR.

Conflicts of Interest

No potential conflict of interest relevant to this article was reported.

Acknowledgements

The authors are very grateful to the Department of Genetic Engineering and Biotechnology, Faculty of Biological Science and Technology, Jashore University of Science and Technology, Jashore 7408, Bangladesh for providing the opportunity to conduct this research. The authors acknowledge and appreciate the reviewers and the members of the editorial panel for their valuable comments and critical suggestions for improving the quality of this manuscript. The authors wish to thank Mr. Abdul Wahid Dippro, Assistant Professor, Department of English, Faculty of Arts and Social Science, Jashore University of Science and Technology, Jashore 7408, Bangladesh for extensively editing the manuscript to avoid grammatical errors.

Supplementary Materials

Supplementary data can be found with this article online at http://www.genominfo.org.

Supplementary Table. 1.

The details GO analysis of the identified five Sorghum bicolor genes was performed using the online tool Plant Transcription Factor Database (PlantTFDB, http://planttfdb.cbi.pku.edu.cn/).

gi-23007-Supplementary-Table-1.pdf
Supplementary Table 2.

Identified in a total of 334 transcription factors associated with the regulation of identified five SbLIM genes in Sorghum bicolor genome

gi-23007-Supplementary-Table-2.pdf
Supplementary Table 3.

Identified 34 transcriptions family and 136 associated with the regulation of identified five SbLIM genes in Sorghum bicolor genome

gi-23007-Supplementary-Table-3.pdf
Supplementary Table 4.

Identified 136 unique transcription factors (TFs) associated with the regulation of identified five SbLIM genes in Sorghum bicolor genome

gi-23007-Supplementary-Table-4.pdf
Supplementary Table 5.

The predicted cis-acting regulatory elements of the upstream promoter region (2.0 kb genomic sequences) of LIM gene families in Sorghum bicolor

gi-23007-Supplementary-Table-5.pdf
Supplementary Fig. 1.

Full-length genomic sequences of LIM gene families of Arabidopsis thaliana (Doc).

gi-23007-Supplementary-Fig-1.pdf
Supplementary Fig. 2.

Full-length coding sequences of LIM gene families of Arabidopsis thaliana (Doc).

gi-23007-Supplementary-Fig-2.pdf
Supplementary Fig. 3.

Full-length peptide sequences of LIM gene families of Arabidopsis thaliana (Doc).

gi-23007-Supplementary-Fig-3.pdf
Supplementary Fig. 4.

Full-length genomic sequences of LIM gene families of Arabidopsis thaliana with 2,000 bp upstream region (Doc).

gi-23007-Supplementary-Fig-4.pdf
Supplementary Fig. 5.

Full-length genomic sequences of LIM gene families of Sorghum bicolor (Doc).

gi-23007-Supplementary-Fig-5.pdf
Supplementary Fig. 6.

Full-length coding sequences of LIM gene families of Sorghum bicolor (Doc).

gi-23007-Supplementary-Fig-6.pdf
Supplementary Fig. 7.

Full-length peptide sequences of LIM gene families of Sorghum bicolor (Doc).

gi-23007-Supplementary-Fig-7.pdf
Supplementary Fig. 8.

Full-length genomic sequences of LIM gene families of Sorghum bicolor with 2,000 bp upstream region (Doc).

gi-23007-Supplementary-Fig-8.pdf

References

1. Eliasson A, Gass N, Mundel C, Baltz R, Krauter R, Evrard JL, et al. Molecular and expression analysis of a LIM protein gene family from flowering plants. Mol Gen Genet 2000;264:257–267.
2. Way JC, Chalfie M. mec-3, a homeobox-containing gene that specifies differentiation of the touch receptor neurons in C. elegans. Cell 1988;54:5–16.
3. Kawaoka A, Ebinuma H. Transcriptional control of lignin biosynthesis by tobacco LIM protein. Phytochemistry 2001;57:1149–1157.
4. Maul RS, Song Y, Amann KJ, Gerbin SC, Pollard TD, Chang DD. EPLIN regulates actin dynamics by cross-linking and stabilizing filaments. J Cell Biol 2003;160:399–407.
5. Dawid IB, Breen JJ, Toyama R. LIM domains: multiple roles as adapters and functional modifiers in protein interactions. Trends Genet 1998;14:156–162.
6. Dawid IB, Toyama R, Taira M. LIM domain proteins. C R Acad Sci III 1995;318:295–306.
7. Labalette C, Nouet Y, Levillayer F, Colnot S, Chen J, Claude V, et al. Deficiency of the LIM-only protein FHL2 reduces intestinal tumorigenesis in Apc mutant mice. PLoS One 2010;5e10371.
8. Baltz R, Evrard JL, Domon C, Steinmetz A. A LIM motif is present in a pollen-specific protein. Plant Cell 1992;4:1465–1466.
9. Papuga J, Thomas C, Dieterle M, Moreau F, Steinmetz A. Arabidopsis LIM domain proteins involved in actin bundling exhibit different modes of regulation. FEBS J 2009;276:245.
10. Thomas C, Hoffmann C, Dieterle M, Van Troys M, Ampe C, Steinmetz A. Tobacco WLIM1 is a novel F-actin binding protein involved in actin cytoskeleton remodeling. Plant Cell 2006;18:2194–2206.
11. Khatun K, Robin AH, Park JI, Ahmed NU, Kim CK, Lim KB, et al. Genome-wide identification, characterization and expression profiling of LIM family genes in Solanum lycopersicum L. Plant Physiol Biochem 2016;108:177–190.
12. Li Y, Jiang J, Li L, Wang XL, Wang NN, Li DD, et al. A cotton LIM domain-containing protein (GhWLIM5) is involved in bundling actin filaments. Plant Physiol Biochem 2013;66:34–40.
13. Arnaud D, Dejardin A, Leple JC, Lesage-Descauses MC, Pilate G. Genome-wide analysis of LIM gene family in Populus trichocarpa, Arabidopsis thaliana, and Oryza sativa. DNA Res 2007;14:103–116.
14. Zhao M, He L, Gu Y, Wang Y, Chen Q, He C. Genome-wide analyses of a plant-specific LIM-domain gene family implicate its evolutionary role in plant diversification. Genome Biol Evol 2014;6:1000–1012.
15. Srivastava V, Verma PK. The plant LIM proteins: unlocking the hidden attractions. Planta 2017;246:365–375.
16. Kawaoka A, Kaothien P, Yoshida K, Endo S, Yamada K, Ebinuma H. Functional analysis of tobacco LIM protein Ntlim1 involved in lignin biosynthesis. Plant J 2000;22:289–301.
17. Han LB, Li YB, Wang HY, Wu XM, Li CL, Luo M, et al. The dual functions of WLIM1a in cell elongation and secondary wall formation in developing cotton fibers. Plant Cell 2013;25:4421–4438.
18. Yang S, Zhu G, Wang C, Chen L, Song Y, Wang J. Regulation of secondary xylem formation in young hybrid poplars by modifying the expression levels of the PtaGLIMa gene. Mol Breed 2017;37:124.
19. Sala S, Ampe C. An emerging link between LIM domain proteins and nuclear receptors. Cell Mol Life Sci 2018;75:1959–1971.
20. Zheng Q, Zhao Y. The diverse biofunctions of LIM domain proteins: determined by subcellular localization and protein-protein interaction. Biol Cell 2007;99:489–502.
21. Papuga J, Hoffmann C, Dieterle M, Moes D, Moreau F, Tholl S, et al. Arabidopsis LIM proteins: a family of actin bundlers with distinct expression patterns and modes of regulation. Plant Cell 2010;22:3034–3052.
22. Wang C, Zhang LJ, Huang RD. Cytoskeleton and plant salt stress tolerance. Plant Signal Behav 2011;6:29–31.
23. Wang Y, Liu GJ, Yan XF, Wei ZG, Xu ZR. MeJA-inducible expression of the heterologous JAZ2 promoter from Arabidopsis in Populus trichocarpa protoplasts. J Plant Dis Prot 2011;118:69–74.
24. Wasteneys GO, Yang Z. The cytoskeleton becomes multidisciplinary. Plant Physiol 2004;136:3853–3854.
25. Baltz R, Schmit AC, Kohnen M, Hentges F, Steinmetz A. Differential localization of the LIM domain protein PLIM-1 in microspores and mature pollen grains from sunflower. Sex Plant Reprod 1999;12:60–65.
26. Moes D, Gatti S, Hoffmann C, Dieterle M, Moreau F, Neumann K, et al. A LIM domain protein from tobacco involved in actin-bundling and histone gene transcription. Mol Plant 2013;6:483–502.
27. Chen F, Hu Y, Vannozzi A, Wu K, Cai H, Qin Y, et al. The WRKY transcription factor family in model plants and crops. Crit Rev Plant Sci 2017;36:311–335.
28. Zhao ZY, Che P, Glassman K, Albertsen M. Nutritionally enhanced sorghum for the arid and semiarid tropical areas of Africa. Methods Mol Biol 2019;1931:197–207.
29. Pelpolage SW, Han K, Koaze H, Hamamoto T, Hoshizawa M, Fukushima M. Influence of enzyme-resistant fraction of sorghum (Sorghum bicolor L.) flour on gut microflora composition, short chain fatty acid production and toxic substance metabolism. J Food Nutr Res 2019;58:135–145.
30. Xiong Y, Zhang P, Warner RD, Fang Z. Sorghum grain: from genotype, nutrition, and phenolic profile to its health benefits and food applications. Compr Rev Food Sci Food Saf 2019;18:2025–2046.
31. USDA. Quick Stats. Washington, DC: United States Department of Agriculture, Natioanl Agricultural Statistics Service; 2019. Accessed 2023 Feb 2. Available from: https://quickstats.nass.usda.gov/.
32. Han Y, Song L, Liu S, Zou N, Li Y, Qin Y, et al. Simultaneous determination of 124 pesticide residues in Chinese liquor and liquor-making raw materials (sorghum and rice hull) by rapid Multi-plug Filtration Cleanup and gas chromatography-tandem mass spectrometry. Food Chem 2018;241:258–267.
33. Prasad PV, Djanaguiraman M, Perumal R, Ciampitti IA. Impact of high temperature stress on floret fertility and individual grain weight of grain sorghum: sensitive stages and thresholds for temperature and duration. Front Plant Sci 2015;6:820.
34. Prasad PV, Pisipati SR, Mutava RN, Tuinstra MR. Sensitivity of grain sorghum to high temperature stress during reproductive development. Crop Sci 2008;48:1911–1917.
35. Rooney WL. Sorghum improvement: integrating traditional and new technology to produce improved genotypes. Adv Agron 2004;83:37–109.
36. Tsuji W, Ali ME, Inanaga S, Sygimoto Y. Growth and gas exchange of three sorghum cultivars under drought stress. Biol Plant 2003;46:583–587.
37. Li H, Payne WA, Michels GJ, Rush CM. Reducing plant abiotic and biotic stress: drought and attacks of greenbugs, corn leaf aphids and virus disease in dryland sorghum. Environ Exp Bot 2008;63:305–316.
38. Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, et al. The Sorghum bicolor genome and the diversification of grasses. Nature 2009;457:551–556.
39. Yang R, Chen M, Sun JC, Yu Y, Min DH, Chen J, et al. Genome-wide analysis of LIM family genes in foxtail millet (Setaria italica L.) and characterization of the role of SiWLIM2b in drought tolerance. Int J Mol Sci 2019;20:1303.
40. Park JI, Ahmed NU, Jung HJ, Arasan SK, Chung MY, Cho YG, et al. Identification and characterization of LIM gene family in Brassica rapa. BMC Genomics 2014;15:641.
41. Cheng X, Li G, Muhammad A, Zhang J, Jiang T, Jin Q, et al. Molecular identification, phylogenomic characterization and expression patterns analysis of the LIM (LIN-11, Isl1 and MEC-3 domains) gene family in pear (Pyrus bretschneideri) reveal its potential role in lignin metabolism. Gene 2019;686:237–249.
42. Arnaud D, Dejardin A, Leple JC, Lesage-Descauses MC, Boizot N, Villar M, et al. Expression analysis of LIM gene family in poplar, toward an updated phylogenetic classification. BMC Res Notes 2012;5:102.
43. Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol 1987;4:406–425.
44. Sievers F, Higgins DG. Clustal Omega for making accurate alignments of many protein sequences. Protein Sci 2018;27:135–145.
45. Tajima F, Nei M. Estimation of evolutionary distance between nucleotide sequences. Mol Biol Evol 1984;1:269–285.
46. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 1997;25:4876–4882.
47. Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 1994;22:4673–4680.
48. Tamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis version 11. Mol Biol Evol 2021;38:3022–3027.
49. Bailey TL, Johnson J, Grant CE, Noble WS. The MEME suite. Nucleic Acids Res 2015;43:W39–W49.
50. Hu B, Jin J, Guo AY, Zhang H, Luo J, Gao G. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics 2015;31:1296–1297.
51. Chao JT, Kong YZ, Wang Q, Sun YH, Gong DP, Lv J, et al. MapGene2Chrom, a tool to draw gene physical map based on Perl and SVG languages. Yi Chuan 2015;37:91–97.
52. Liu L, Zhang Z, Mei Q, Chen M. PSI: a comprehensive and integrative approach for accurate plant subcellular localization prediction. PLoS One 2013;8e75826.
53. Jin J, Tian F, Yang DC, Meng YQ, Kong L, Luo J, et al. PlantTFDB 4. 0: toward a central hub for transcription factors and regulatory interactions in plants. Nucleic Acids Res 2017;45:D1040–D1045.
54. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 2003;13:2498–2504.
55. Lescot M, Dehais P, Thijs G, Marchal K, Moreau Y, Van de Peer Y, et al. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res 2002;30:325–327.
56. Shen XX, Salichos L, Rokas A. A genome-scale investigation of how sequence, function, and tree-based gene properties influence phylogenetic inference. Genome Biol Evol 2016;8:2565–2580.
57. Shaul O. How introns enhance gene expression. Int J Biochem Cell Biol 2017;91:145–155.
58. Castle WE. Mendel's Law of Heredity. Science 1903;18:396–406.
59. Magadum S, Banerjee U, Murugan P, Gangapur D, Ravikesavan R. Gene duplication as a major force in evolution. J Genet 2013;92:155–161.
60. Ehrlich JS, Hansen MD, Nelson WJ. Spatio-temporal regulation of Rac1 localization and lamellipodia dynamics during epithelial cell-cell adhesion. Dev Cell 2002;3:259–270.
61. Glory E, Murphy RF. Automated subcellular location determination and high-throughput microscopy. Dev Cell 2007;12:7–16.
62. Thaler JP, Lee SK, Jurata LW, Gill GN, Pfaff SL. LIM factor Lhx3 contributes to the specification of motor neuron and interneuron identity through cell-type-specific protein-protein interactions. Cell 2002;110:237–249.
63. Khan SA, Li MZ, Wang SM, Yin HJ. Revisiting the role of plant transcription factors in the battle against abiotic stress. Int J Mol Sci 2018;19:1634.
64. Latchman DS. Transcription factors: an overview. Int J Biochem Cell Biol 1997;29:1305–1312.
65. Lutova LA, Dodueva IE, Lebedeva MA, Tvorogova VE. Transcription factors in developmental genetics and the evolution of higher plants. Genetika 2015;51:539–557.
66. Sasaki K. Utilization of transcription factors for controlling floral morphogenesis in horticultural plants. Breed Sci 2018;68:88–98.
67. Shu Y, Liu Y, Zhang J, Song L, Guo C. Genome-wide analysis of the AP2/ERF superfamily genes and their responses to abiotic stress in Medicago truncatula. Front Plant Sci 2015;6:1247.
68. Mengiste T, Chen X, Salmeron J, Dietrich R. The BOTRYTIS SUSCEPTIBLE1 gene encodes an R2R3MYB transcription factor protein that is required for biotic and abiotic stress responses in Arabidopsis. Plant Cell 2003;15:2551–2565.
69. Meshi T, Iwabuchi M. Plant transcription factors. Plant Cell Physiol 1995;36:1405–1420.
70. Muller M, Munne-Bosch S. Ethylene response factors: a key regulatory hub in hormone and stress signaling. Plant Physiol 2015;169:32–41.
71. Xie Z, Nolan TM, Jiang H, Yin Y. AP2/ERF transcription factor regulatory networks in hormone and abiotic stress responses in Arabidopsis. Front Plant Sci 2019;10:228.
72. Pan Y, Seymour GB, Lu C, Hu Z, Chen X, Chen G. An ethylene response factor (ERF5) promoting adaptation to drought and salt tolerance in tomato. Plant Cell Rep 2012;31:349–360.
73. Cao Y, Li K, Li Y, Zhao X, Wang L. MYB transcription factors as regulators of secondary metabolism in plants. Biology (Basel) 2020;9:61.
74. Ramya M, Kwon OK, An HR, Park PM, Baek YS, Park PH. Floral scent: regulation and role of MYB transcription factors. Phytochem Lett 2017;19:114–120.
75. Cao H, Huang P, Zhang L, Shi Y, Sun D, Yan Y, et al. Characterization of 47 Cys2 -His2 zinc finger proteins required for the development and pathogenicity of the rice blast fungus Magnaporthe oryzae. New Phytol 2016;211:1035–1051.
76. Wei K, Chen J, Wang Y, Chen Y, Chen S, Lin Y, et al. Genome-wide analysis of bZIP-encoding genes in maize. DNA Res 2012;19:463–476.
77. Jakoby M, Weisshaar B, Droge-Laser W, Vicente-Carbajosa J, Tiedemann J, Kroj T, et al. bZIP transcription factors in Arabidopsis. Trends Plant Sci 2002;7:106–111.
78. Gupta S, Malviya N, Kushwaha H, Nasim J, Bisht NC, Singh VK, et al. Insights into structural and functional diversity of Dof (DNA binding with one finger) transcription factor. Planta 2015;241:549–562.
79. Noguero M, Atif RM, Ochatt S, Thompson RD. The role of the DNA-binding One Zinc Finger (DOF) transcription factor family in plants. Plant Sci 2013;209:32–45.
80. Azam SM, Liu Y, Rahman ZU, Ali H, Yan C, Wang L, et al. Identification, characterization and expression profiles of Dof transcription factors in pineapple (Ananas comosus L). Trop Plant Biol 2018;11:49–64.
81. Mzid R, Marchive C, Blancard D, Deluc L, Barrieu F, Corio-Costet MF, et al. Overexpression of VvWRKY2 in tobacco enhances broad resistance to necrotrophic fungal pathogens. Physiol Plant 2007;131:434–447.
82. Oh SK, Baek KH, Park JM, Yi SY, Yu SH, Kamoun S, et al. Capsicum annuum WRKY protein CaWRKY1 is a negative regulator of pathogen defense. New Phytol 2008;177:977–989.
83. Shim JS, Jung C, Lee S, Min K, Lee YW, Choi Y, et al. AtMYB44 regulates WRKY70 expression and modulates antagonistic interaction between salicylic acid and jasmonic acid signaling. Plant J 2013;73:483–495.
84. Guo H, Wang Y, Wang L, Hu P, Wang Y, Jia Y, et al. Expression of the MYB transcription factor gene BplMYB46 affects abiotic stress tolerance and secondary cell wall deposition in Betula platyphylla. Plant Biotechnol J 2017;15:107–121.
85. Mare C, Mazzucotelli E, Crosatti C, Francia E, Stanca AM, Cattivelli L. Hv-WRKY38: a new transcription factor involved in cold- and drought-response in barley. Plant Mol Biol 2004;55:399–416.
86. Prakash V, Chakraborty S. Identification of transcription factor binding sites on promoter of RNA dependent RNA polymerases (RDRs) and interacting partners of RDR proteins through in silico analysis. Physiol Mol Biol Plants 2019;25:1055–1071.
87. Ranty B, Aldon D, Galaud JP. Plant calmodulins and calmodulin-related proteins: multifaceted relays to decode calcium signals. Plant Signal Behav 2006;1:96–104.
88. Reddy AS, Ali GS, Celesnik H, Day IS. Coping with stresses: roles of calcium- and calcium/calmodulin-regulated gene expression. Plant Cell 2011;23:2010–2032.
89. Riechmann JL, Heard J, Martin G, Reuber L, Jiang C, Keddie J, et al. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes. Science 2000;290:2105–2110.
90. Xiao Y, You S, Kong W, Tang Q, Bai W, Cai Y, et al. A GARP transcription factor anther dehiscence defected 1 (OsADD1) regulates rice anther dehiscence. Plant Mol Biol 2019;101:403–414.
91. Fitter DW, Martin DJ, Copley MJ, Scotland RW, Langdale JA. GLK gene pairs regulate chloroplast development in diverse plant species. Plant J 2002;31:713–727.
92. Jarvis P, Lopez-Juez E. Biogenesis and homeostasis of chloroplasts and other plastids. Nat Rev Mol Cell Biol 2013;14:787–802.
93. Powell AL, Nguyen CV, Hill T, Cheng KL, Figueroa-Balderas R, Aktas H, et al. Uniform ripening encodes a Golden 2-like transcription factor regulating tomato fruit chloroplast development. Science 2012;336:1711–1715.
94. Rossini L, Cribb L, Martin DJ, Langdale JA. The maize golden2 gene defines a novel class of transcriptional regulators in plants. Plant Cell 2001;13:1231–1244.
95. Waters MT, Moylan EC, Langdale JA. GLK transcription factors regulate chloroplast development in a cell-autonomous manner. Plant J 2008;56:432–444.
96. Han XY, Li PX, Zou LJ, Tan WR, Zheng T, Zhang DW, et al. GOLDEN2-LIKE transcription factors coordinate the tolerance to Cucumber mosaic virus in Arabidopsis. Biochem Biophys Res Commun 2016;477:626–632.
97. Murmu J, Wilton M, Allard G, Pandeya R, Desveaux D, Singh J, et al. Arabidopsis GOLDEN2-LIKE (GLK) transcription factors activate jasmonic acid (JA)-dependent disease susceptibility to the biotrophic pathogen Hyaloperonospora arabidopsidis, as well as JA-independent plant immunity against the necrotrophic pathogen Botrytis cinerea. Mol Plant Pathol 2014;15:174–184.
98. Nagatoshi Y, Mitsuda N, Hayashi M, Inoue S, Okuma E, Kubo A, et al. GOLDEN 2-LIKE transcription factors for chloroplast development affect ozone tolerance through the regulation of stomatal movement. Proc Natl Acad Sci U S A 2016;113:4218–4223.
99. Savitch LV, Subramaniam R, Allard GC, Singh J. The GLK1 'regulon' encodes disease defense related proteins and confers resistance to Fusarium graminearum in Arabidopsis. Biochem Biophys Res Commun 2007;359:234–238.
100. Schreiber KJ, Nasmith CG, Allard G, Singh J, Subramaniam R, Desveaux D. Found in translation: high-throughput chemical screening in Arabidopsis thaliana identifies small molecules that reduce Fusarium head blight disease in wheat. Mol Plant Microbe Interact 2011;24:640–648.
101. Ng M, Yanofsky MF. Function and evolution of the plant MADS-box gene family. Nat Rev Genet 2001;2:186–195.
102. Sun W, Jin X, Ma Z, Chen H, Liu M. Basic helix-loop-helix (bHLH) gene family in Tartary buckwheat (Fagopyrum tataricum): genome-wide identification, phylogeny, evolutionary expansion and expression analyses. Int J Biol Macromol 2020;155:1478–1490.
103. Zhao R, Song X, Yang N, Chen L, Xiang L, Liu XQ, et al. Expression of the subgroup IIIf bHLH transcription factor CpbHLH1 from Chimonanthus praecox (L.) in transgenic model plants inhibits anthocyanin accumulation. Plant Cell Rep 2020;39:891–907.
104. Kaur A, Pati PK, Pati AM, Nagpal AK. In-silico analysis of cis-acting regulatory elements of pathogenesis-related proteins of Arabidopsis thaliana and Oryza sativa. PLoS One 2017;12e0184523.
105. Wittkopp PJ, Kalay G. Cis-regulatory elements: molecular mechanisms and evolutionary processes underlying divergence. Nat Rev Genet 2011;13:59–69.
106. Hidayati N, Anas I. Photosynthesis and transpiration rates of rice cultivated under the system of rice intensification and the effects on growth and yield. Hayati J Biosci 2016;23:67–72.
107. Lee HW, Cho C, Kim J. Lateral organ boundaries domain16 and 18 act downstream of the AUXIN1 and LIKE-AUXIN3 auxin influx carriers to control lateral root development in Arabidopsis. Plant Physiol 2015;168:1792–1806.
108. Gray WM. Hormonal regulation of plant growth and development. PLoS Biol 2004;2:E311.
109. Ogawa M, Hanada A, Yamauchi Y, Kuwahara A, Kamiya Y, Yamaguchi S. Gibberellin biosynthesis and response during Arabidopsis seed germination. Plant Cell 2003;15:1591–1604.
110. Shariatipour N, Heidari B. Investigation of drought and salinity tolerance related genes and their regulatory mechanisms in Arabidopsis (Arabidopsis thaliana). Open Bioinform J 2018;11:12–28.
111. Zhou Y, Hu L, Wu H, Jiang L, Liu S. Genome-wide identification and transcriptional expression analysis of cucumber superoxide dismutase (SOD) family in response to various abiotic stresses. Int J Genomics 2017;2017:7243973.
112. Chen W, Provart NJ, Glazebrook J, Katagiri F, Chang HS, Eulgem T, et al. Expression profile matrix of Arabidopsis transcription factor genes suggests their putative functions in response to environmental stresses. Plant Cell 2002;14:559–574.
113. Arias JA, Dixon RA, Lamb CJ. Dissection of the functional architecture of a plant defense gene promoter using a homologous in vitro transcription initiation system. Plant Cell 1993;5:485–496.
114. Li C, Sun Y, Chang Q, Guo C, Bai Y. Bioinformatics and expression pattern analysis of LIM family in wheat. Preprint at: https://doi.org/10.21203/rs.3.rs-1615320/v1 (2022).
115. Zhu X, Wang B, Wang X, Zhang C, Wei X. Genome-wide identification, characterization and expression analysis of the LIM transcription factor family in quinoa. Physiol Mol Biol Plants 2021;27:787–800.

Article information Continued

Fig. 1.

Phylogenetic tree displaying the evolutionary relationships of sorghum LIM proteins to that of Arabidopsis, wheat, rice, tobacco, corn, tomato, soybean, Medicago, quinoa, and alfalfa. The name of each LIM protein starts with a species acronym: At, Arabidopsis thaliana; MS, Medicago sativa L; Mt, Medicago truncatula; Nt, Nicotiana tabacum; Sl, Solanum lycopersicum; Gm, Glycine max; Cq, Chenopodium quinoa; Os, Oryza sativa; Zm, Zea mays; Ta, Triticum aestivum.

Fig. 2.

Multiple sequence alignment analysis of LIM gene’s amino acid sequences of sorghum and Arabidopsis by using Clustal X (version 2.1) and Clustal W program in MEGA 11.0.

Fig. 3.

The conserved domains of the predicted SbLIM proteins were drawn by Pfam, SMART, and NCBI-CDD database information.

Fig. 4.

The conserved motifs of the predicted SbLIM protein families are drawn using MEME-suite (a maximum of 20 motifs are analyzed). (A) Different colors indicated individual motifs identified in each SbLIM protein domain. (B) The sequence motifs logos are present in Sorghum bicolor SbLIM proteins.

Fig. 6.

The chromosomal localization of the predicted SbLIM genes. The scale to indicate the chromosomal length is provided on the left.

Fig. 7.

Subcellular localization analysis for the SbLIM proteins. The predicted SbLIM proteins were analyzed in nuclear, mitochondrial, cytoplasmic, chloroplast, cytoskeletal, peroxisomal, plasma membrane, extracellular, golgi, lysosomal, endoplasmic reticulum, and vacuole.

Fig. 8.

(A) The heatmap for the predicted Gene Ontology (GO) terms corresponding to the predicted SbLIM genes is represented for biological process (B), molecular functions (C), and their association to the SbLIM genes whether they are present or absent.

Fig. 9.

(A) The regulatory network among the transcription factors (TFs) and the predicted SbLIM genes. Each node of the network was colored differently based on SbLIM genes and TFs. SbLIM genes were represented by orange color. Different node symbols were used for different families of TFs. TFs were configured at the hub node level using black. (B) The map represents the related number of TFs with the predicted SbLIM LIM genes. (C) SbLIM gene-mediated sub-network for ERF, MYB, WRKY, NAC, bZIP, C2H2, Dof, and G2-like TFs families which is expressed as Heatmap.

Fig. 10.

The cis-acting regulatory elements (CAREs) in the upstream promoter region of predicted SbLIM genes, respectively. The deep color represents the presence of that element with the corresponding genes. LR, light-responsive; HR, hormone-responsive; SR, stress-responsive; OT, other function.

Table 1.

The physicochemical properties of Sorghum bicolor LIM gene family members

Sl No. Gene Accession No. Chromosomal location ORF (bp) Gene length (bp) Protein
Molecular weight (kD) Protein length (aa) pI GRAVY
1 SbLIM1 Sobic.001G426200.1 Chr01:70577409..70579551 594 2,142 21.802 198 8.98 –0.554
2 SbLIM2 Sobic.004G303700.1 Chr04:64293407..64295083 603 1,676 21.615 201 8.38 –0.406
3 SbLIM3 Sobic.006G159000.1 Chr06:51718518..51720006 612 1,488 22.297 204 7.48 –0.421
4 SbLIM4 Sobic.008G110200.1 Chr08:51315200..51317365 585 2,165 21.889 195 8.63 –0.629
5 SbLIM5 Sobic.010G097600.1 Chr10:8834292..8842412 4,428 8,120 158.98 1,476 4.63 –0.976

ORF, open reading frame; GRAVY, grand average hydrophilicity.