Abstract
Commelina benghalensis L. 1753, a member of the Commelinaceae family, holds significant medicinal and culinary value. This study represents the first documentation of the sequencing and assembly of the entire plastome of C. benghalensis. The genome spans a total length of 160,663 bp, exhibiting a conventional quadripartite architecture that comprises a large single-copy (LSC) region (87,750 bp), a small single-copy (SSC) region (18,417 bp), and two inverted repeats (IR) regions (both 27,248 bp). In its entirety, the C. benghalensis plastome encompasses 129 genes (with 108 being unique), incorporating 77 individual protein-coding genes, 37 unique tRNA genes, and four unique rRNA genes. Phylogenetic analysis revealed a close resemblance between C. benghalensis and C. communis. The sequencing of this plastome stands to expedite the development of molecular markers and significantly contribute to genetic assays involving this distinctive plant.
Introduction
Commelina, the largest genus in the Commelinaceae family, comprises around 170 species known as dayflowers due to their short-lived blooms (Pellegrini and Forzza Citation2017). These perennial or annual herbaceous plants exhibit zygomorphic flowers and distinctive spathes encircling their flower stalks. Typically displaying blue or purple flowers with three unequal petals, they also feature sword-shaped or ovate leaves, spike-like inflorescences, and capsule fruits. Certain species like Commelina diffusa Burm.f. 1768 and Commelina benghalensis L. 1753 are utilized for medicinal and edible purposes (Wang et al. Citation2023).
Commelina benghalensis, a widely distributed folkloric medicinal plant in Asia, Africa, and South America, has long been recognized for its therapeutic properties. This plant is a rich source of diverse bioactive compounds, including alkaloids, volatile oils, wax, vitamin C, and high levels of lutein and β-carotene (Kansagara and Pandya Citation2019). Pharmacologically, C. benghalensis exhibits a broad spectrum of activities, including laxative, anti-inflammatory, antimicrobial, anticancer, sedative, analgesic, hepatoprotective, antidepressant, antiviral, antioxidant, antidiarrheal, demulcent, emollient, diuretic, and febrifuge properties. In traditional medicine, it has been used to treat a variety of ailments such as pain, constipation, headache, leprosy, fever, snake bites, jaundice, psychosis, epilepsy, nose blockage, insanity, and exophthalmia (Pranabesh et al. Citation2019). In Chinese traditional medicine, C. benghalensis is valued for its diuretic, febrifuge, laxative, and anti-inflammatory effects (Hasan et al. Citation2009). While numerous studies have focused on the pharmacological activities, compound isolation, and toxicity of this plant, there is still a limited amount of molecular research conducted on it.
To date, the plastome of C. communis has been reported (Cui and Liang Citation2019) in the Commelina genus (MK863371). Another version of the plastome of C. communis (MW617984) and the plastome of C. caroliniana has also been deposited in the GenBank (OR936140). Understanding genetic diversity is crucial for Commelina conservation and elucidating its evolution. For example, genetic analysis was conducted on C. communis f. ciliata using microsatellite markers to understand its genetic diversity and population structure (Katsuhara et al. Citation2019). Additionally, the Angiosperms353 probe set was employed to capture target sequences from the nucleus, plastids, and publicly available transcriptomes and complete plastomes, resulting in the extraction of additional sequences. Three large datasets were generated to analyze the phylogenetic relationships within the order Commelinales (Zuntini et al. Citation2021). To contribute more genetic data and assess the phylogenetic position of C. benghalensis within the Commelina genus, we sequenced and characterized the complete plastome of C. benghalensis, aiming to support further evolutionary research.
Materials
The C. benghalensis samples () were collected from Peony District, Heze City, Shandong Province, China (35° 16′ 23.62′’ N, 115° 27′ 36.01′’ E). The specimen was deposited at Heze University Herbarium (contact person: Hongqin Li, [email protected]) under specimen number HZ20220806. The fresh leaves were used to extract total genomic DNA by using the plant genomic DNA kit (Tiangen Biotech, Beijing, China)
Methods
Total genomic DNA extracts were fragmented into about 300 bp short-insert fragments for library construction and were sequenced 2 × 150 bp paired-end reads on Illumina NovaSeq 6000 technology platforms at Wuhan Benagen Technology Company Limited (Wuhan, China). The filtering of raw reads was performed using Trimmomatic 0.35 (Bolger et al. Citation2014), e.g. removing adapters and low-quality bases. Then, about 34 GB of clean reads were assembled by using GetOrganelle v1.7.1 (Jin et al. Citation2020). The finished plastome was annotated by using CPGAVAS2 (Shi et al. Citation2019), and then manually adjusted by Apollo (Pontius Citation2018). The sequencing depth of the genome was calculated by BWA (Li and Durbin Citation2009) coupled with samtools (Li et al. Citation2009). Finally, the annotated plastome was submitted to GenBank using Bankit (The accession number is OQ2656460). The CPGview (http://www.1kmpg.cn/Cpgview/) was used to illustrate the circular genome map of the new plastome (Liu et al. Citation2023).
To determine the phylogenetic relationships of C. benghalensis, 30 of the Commelinaceae species plastome with the highest similarity to that of C. benghalensis were selected based on blast results of whole plastome sequences in the GenBank. Two outgroups of Acorus calamus (AJ879453) and Acorus tatarinowii (MN536753) were downloaded from GenBank. The 23 entire plastome sequences were aligned using the MAFFT (version 7) software with default parameters (Katoh and Standley Citation2016). Then, a maximum-likelihood (ML) phylogenetic tree was constructed based on the Best-fit model of GTR + F + I + R4 according to BIC by IQ-TREE v2.0 (Nguyen et al. Citation2015) with 1000 bootstrap replicates.
Results
The plastome of the C. benghalensis deciphered in this study is a circular DNA molecule with a total length of 160,663 bp. The reliability of genome assembly was strongly supported by the results of the mapping experiment (Figure S1 A, B). The maximum sequencing depth, average sequencing depth, and minimum sequencing depth were 7,992×, 2789.5× and 52×. The comparison of reads mapped to genomes at the loci with minimum sequencing depth is shown in Figure S1 B. The genome has a conservative quadripartite structure, including a large-single copy (LSC) region, a small-single copy (SSC) region, and a pair of inverted repeats (IR) regions, with a length of 87,750 bp, 18,417 bp, and 27,248 bp, respectively. The GC content of the whole genome is 35.83%, which is lower than that of IR regions (42.24%) and higher than that of the LSC (33.13%) and SSC regions (29.77%) (Table S1). The plastome contains 129 genes (108 unique genes), including 77 distinct proteins, 27 distinct tRNAs, and four distinct rRNA genes (, Table S2). Eight protein-coding genes (atpF, ndhA, ndhB, petB, petD, rpl16, rpl2, rpoC1) contain one intron, and three genes (clpP, rps12, ycf3) contain two introns (Table S2). The genome contains 11 unique cis-splicing genes (atpF, clpP, ndhA, ndhB (×2), petB, petD, rpl2 (×2), rpl16, rpoC2, rpoC1, ycf3) (Figure S2 A) and one unique trans-splicing gene rps12 (Figure S2 B). Five unique tRNA genes (trnA-UGC (×2), trnI-GAU (×2), trnK-UUU, trnL-UAA, trnS-CGA) contain one intron.
In order to analyze the phylogenetic position of C. benghalensis in the Commelinaceae, we reconstructed the phylogenetic tree of the Commelinaceae species with the whole plastome sequences. Phylogenetic analysis shows that Commelina species form a monophyletic clade with a bootstrap value of 100. Commelina benghalensis is placed at the base of the Commelina genus ().
Discussion and conclusion
We present the first comprehensive plastome analysis of C. benghalensis. The plastome exhibits a conservative quadripartite structure spanning 160,663 base pairs and encodes 129 genes. Phylogenetic analysis reveals all Commelina species formed a monophyletic clade, supported by a robust bootstrap value of 100.
The length of this plastome exceeds that of C. communis by 999 base pairs. However, it contains eight fewer genes than the C. communis plastome (Cui and Liang Citation2019). The evolutionary tree reconstructed using complete plastomes for the species of Commelinaceae is topologically consistent with the tree reconstructed from shared protein-coding gene sequences (Jung et al. Citation2021). Due to its position at the base of the Commelina genus evolutionary tree, C. benghalensis may be one of the oldest species in the Commelina genus. Plastomes with high conservation have been widely used in species identification and phylogenetic analysis (Szymon et al. Citation2016). Molecular markers developed from plastomes are potentially valuable for studying the intra- and inter-species genetic structure. Establishing a phylogenetic tree based on plastomes will enhance our understanding of the genetic structure and evolution in Commelina species.
In the phylogenetic analysis, Tradescantia pallida, T. ohiensis and T. virginiana do not constitute a monophyletic group. A prevalent issue in phylogenomic data is the inconsistency between gene trees and species trees, which obstructs our efforts to accurately reconstruct and interpret the tree of life (Steenwyk et al. Citation2023). This incongruence can arise from various biological factors such as incomplete lineage sorting, horizontal gene transfer, hybridization, introgression, recombination, and convergent molecular evolution, resulting in gene phylogenies that diverge from the species tree. Additionally, analytical factors like stochastic, systematic, and treatment errors can also contribute to this inconsistency (Steenwyk et al. Citation2023). To elucidate the phylogenetic relationship of Commelinaceae members, future research should prioritize the exploration of additional plastomes within this family.
Ethics statement
The Commelina benghalensis specimen is not designated as an endangered species. It requires no specific permissions or licenses. In this study, the collection of C. benghalensis leaves was conducted following the guidelines provided by Heze University.
Author contributions
The manuscript includes the contributions of all authors. LiQiang Wang conceived and designed this study; Shuming Zhang collected the sample and extracted the total DNA of the species; Hongqin Li identified the species; Liqiang Wang assembled and annotated the plastome. Hongqin Li and Shu Wang analyzed the structure of the plastome and drafted the manuscript. All authors approved the publication of the version and agreed to be accountable for all aspects of the work.
Supplemental Material
Download MS Word (1,005.3 KB)Disclosure statement
The authors declare no competing interests in the preparation or execution of this study. Dr. Liqiang Wang is the author of this manuscript and also serves as the Associate-Editor of Mitochondrional DNA Part B journal. Dr. Liqiang Wang declares no conflicts of interest regarding the research findings in this study.
Data availability statement
The plastome sequence has been deposited in GenBank (https://www.ncbi.nlm.nih.gov/genbank/) with the accession number OQ354383 (https://www.ncbi.nlm.nih.gov/nuccore/OQ354383). The associated BioProject, Bio-Sample, and SRA numbers are PRJNA928567, SAMN36852649 and SRR23251264. (https://www.ncbi.nlm.nih.gov/sra/?term=SRR12620715).
Additional information
Funding
References
- Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30(15):2114–2120. doi:10.1093/bioinformatics/btu170.
- Cui Y, Liang R. 2019. The chloroplast genome sequence of Commelina communis (Commelinaceae). Mitochondrial DNA B Resour. 4(2):2631–2632. doi:10.1080/23802359.2019.1642153.
- Gao J, Zhang M, Guan S, Chen Y, Liu A, Yan Y, Wang N, Zhang G. 2020. The complete chloroplast genome of Tradescantia pallida (Rose) D.R.Hunt. Mitochondrial DNA B Resour. 5(3):2932–2933. doi:10.1080/23802359.2020.1787262.
- Goremykin VV, Holland B, Hirsch-Ernst KI, Hellwig FH. 2005. Analysis of Acorus calamus chloroplast genome and its phylogenetic implications. Mol Biol Evol. 22(9):1813–1822. doi:10.1093/molbev/msi173.
- Gu Y, Ma Q. 2021. The complete chloroplast genome of Pollia japonica (Commelinaceae) from Southeast China. Mitochondrial DNA B Resour. 6(4):1486–1487. doi:10.1080/23802359.2021.1911717.
- Hasan S, Hossain M, Akter R, Jamila M, Mazumder MEH, Rahman S. 2009. Sedative and anxiolytic effects of different fractions of the Commelina benghalensis Linn. Drug Discov Thers. 3(5):221–227.
- Jin JJ, Yu WB, Yang JB, Song Y, dePamphilis CW, Yi TS, Li DZ. 2020. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21(1):241. doi:10.1186/s13059-020-02154-5.
- Jung J, Kim C, Kim JH. 2021. Insights into phylogenetic relationships and genome evolution of subfamily Commelinoideae (Commelinaceae Mirb.) inferred from complete chloroplast genomes. BMC Genomics. 22(1):231. doi:10.1186/s12864-021-07541-1.
- Kansagara PA, Pandya DJ. 2019. A complete review on medicinally active herbal weed: commelina benghalensis L. (Commelinaceae). J Pharm Sci Res. 11(4):1165–1171.
- Katoh K, Standley DM. 2016. A simple method to control over-alignment in the MAFFT multiple sequence alignment program. Bioinformatics. 32(13):1933–1942. doi:10.1093/bioinformatics/btw108.
- Katsuhara KR, Nakahama N, Komura T, Kato M, Miyazaki Y, Isagi Y, Ito M, Ushimaru A. 2019. Development of microsatellite markers for the annual andromonoecious herb Commelina communis f. ciliata (Commelinaceae). Genes Genet Syst. 94(3):133–138. doi:10.1266/ggs.18-00058.
- Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 25(14):1754–1760. doi:10.1093/bioinformatics/btp324.
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. 2009. The sequence alignment/map format and SAMtools. Bioinformatics. 25(16):2078–2079. doi:10.1093/bioinformatics/btp352.
- Liu S, Ni Y, Li J, Zhang X, Yang H, Chen H, Liu C. 2023. CPGView: a package for visualizing detailed chloroplast genome structures. Mol Ecol Resour. 23(3):694–704. doi:10.1111/1755-0998.13729.
- Liu Y, Liu GC, Xu Y. 2021. Characterization of the complete chloroplast genome of the perennial plant Tradescantia ohiensis Raf. (Commelinales: commelinaceae). Mitochondrial DNA B Resour. 6(12):3506–3507. doi:10.1080/23802359.2021.2005477.
- Ma L, Jiang SZ, Lian H, Xiong YF, Liu ZJ, Chen SP. 2020. The complete chloroplast genome sequence of Acorus tatarinowii (Araceae) from Fujian, China. Mitochondrial DNA B Resour. 5(3):3159–3160. doi:10.1080/23802359.2020.1806133.
- Nguyen LT, Schmidt HA, Haeseler AV, Minh BQ. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 32(1):268–274. doi:10.1093/molbev/msu300.
- Pellegrini M, Forzza RC. 2017. Synopsis of Commelina L. (Commelinaceae) in the state of Rio de Janeiro, reveals a new white-flowered species endemic to Brazil. PhytoKeys. 5(78):59–81. doi:10.3897/phytokeys.78.11932.
- Pontius E. 2018. Apollo. Ann Emerg Med. 72(5):616. doi:10.1016/j.annemergmed.2018.06.016.
- Pranabesh G, Alolika D, Maitrayee B, Swagata B, Labani H, Sudip KN. 2019. Phytomorphological, chemical and pharmacological discussions about Commelina benghalensis Linn. (Commelinaceae): a review. Pharma Innov J. 8(6):12–18.
- Saha PS, Jha S. 2019. A molecular phylogeny of the genus Drimia (Asparagaceae: Scilloideae: Urgineeae) in India inferred from non-coding chloroplast and nuclear ribosomal DNA sequences. Sci Rep. 9(1):7563. doi:10.1038/s41598-019-43968-z.
- Shi L, Chen H, Jiang M, Wang L, Wu X, Huang L, Liu C. 2019. CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic Acids Res. 47(W1):W65–W73. doi:10.1093/nar/gkz345.
- Steenwyk JL, Li YN, Zhou XF, Shen XX, Rokas A. 2023. Incongruence in the phylogenomics era. Nat Rev Genet. 24(12):834–850. doi:10.1038/s41576-023-00620-x.
- Szymon AO, Ewelina Ł, Tomasz K, Tomasz S. 2016. Chloroplasts: state of research and practical applications of plastome sequencing. Planta. 244:517–527. doi:10.1007/s00425-016-2551-1.
- Wang LY, Zhao WY, Chen ZX, Huang WC, Ding MY, Luo JC, Liao WB, Guo W, Fan Q. 2023. Commelina danxiaensis (Commelinaceae), a new species from Guangdong, China. PhytoKeys. 218:117–126. doi:10.3897/phytokeys.218.91199.
- Zuntini AR, Frankel LP, Pokorny L, Forest F, Baker WJ. 2021. A comprehensive phylogenomic study of the monocot order Commelinales, with a new classification of Commelinaceae. Am J Bot. 108(7):1066–1086. doi:10.1002/ajb2.1698.