Skip to main content

Comparative analysis of the whole mitochondrial genomes of four species in sect. Chrysantha (Camellia L.), endemic taxa in China

Abstract

Background

The sect. Chrysantha Chang of plants with yellow flowers of Camellia species as the “Queen of the Tea Family”, most of these species are narrowly distributed endemics of China and are currently listed Grde-II in National Key Protected Wild Plant of China. They are commercially important plants with horticultural medicinal and scientific research value. However, the study of the sect. Chrysantha species genetics are still in its infancy, to date, the mitochondrial genome in sect. Chrysantha has been still unexplored.

Results

In this study, we provide a comprehensive assembly and annotation of the mitochondrial genomes for four species within the sect. Chrysantha. The results showed that the mitochondrial genomes were composed of closed-loop DNA molecules with sizes ranging from 850,836 bp (C. nitidissima) to 1,098,121 bp (C. tianeensis) with GC content of 45.71–45.78% and contained 48–58 genes, including 28–37 protein-coding genes, 17–20 tRNA genes and 2 rRNA genes. We also examined codon usage, sequence repeats, RNA editing and selective pressure in the four species. Then, we performed a comprehensive comparison of their basic structures, GC contents, codon preferences, repetitive sequences, RNA editing sites, Ka/Ks ratios, haplotypes, and RNA editing sites. The results showed that these plants differ little in gene type and number. C. nitidissima has the greatest variety of genes, while C. tianeensis has the greatest loss of genes. The Ka/Ks values of the atp6 gene in all four plants were greater than 1, indicating positive selection. And the codons ending in A and T were highly used. In addition, the RNA editing sites differed greatly in number, type, location, and efficiency. Twelve, six, five, and twelve horizontal gene transfer (HGT) fragments were found in C. tianeensis, Camellia huana, Camellia liberofilamenta, and C. nitidissima, respectively. The phylogenetic tree clusters the four species of sect. Chrysantha plants into one group, and C. huana and C. liberofilamenta have closer affinities.

Conclusions

In this study, the mitochondrial genomes of four sect. Chrysantha plants were assembled and annotated, and these results contribute to the development of new genetic markers, DNA barcode databases, genetic improvement and breeding, and provide important references for scientific research, population genetics, and kinship identification of sect. Chrysantha plants.

Peer Review reports

Introduction

The sect. Chrysantha Chang plants are a special group of the genus Camellia in the Theaceae family and are known for their golden yellow petals [1]. Li Shizhen’s book “Compendium of Materia Medica” states the following: “Camellia production in the south, late winter flowering, red petals and yellow stamens, there are also yellow ones.” [2]. Camellia petelotii plants are distributed only in China and Vietnam; 42 species and 5 varieties of C. petelotii are known, and there are approximately 20 species in China, of which 18 are endemic. In China, C. petelotii plants are concentrated in southwestern Guangxi, with only a few species distributed in Guizhou and Yunnan [3]. In recent years, as people’s understanding of plants of the sect. Chrysantha has deepened, its multifaceted value has gradually been explored and recognized. Its flowers, fruits, and leaves contain a variety of natural nutrients, such as tea polysaccharides, tea polyphenols, total saponins, total flavonoids, tea pigments, caffeine, and many other microelements and macronutrients that are beneficial to the human body. These components give golden flower tea antioxidant, lipid-regulating, sugar-lowering, anti-allergic, anti-skin photoaging, antibacterial, and other pharmacological activities [4,5,6,7,8]. Its seeds are also used to extract oil and as a raw material for lubricants or other solvents. The wood is hard and dense, making it an excellent material for handicrafts [9]. C. petelotii, an endangered species, has a history spanning hundreds of millions of years. Through the study of C. petelotii, scientists can gain an in-depth understanding of its unique biological properties and genetic mechanisms, providing important references for research in the fields of botany and ecology. Moreover, the identification of the special components in C. petelotii will provide new ideas and methods for drug development. In conclusion, the plants of the sect. Chrysantha are of great significance in terms of ornamental value, medicinal value, economic value, and scientific research value.

The wide distribution of plant species in the sect. Chrysantha, their high morphological variability, and their susceptibility to environmental influences have made the classification of their species more difficult. To solve the problems of species identification and understand interspecific relationships in sect. Chrysantha, most previous studies focused on leaf characteristic, sporology, and karyotype analyses [10,11,12]. However, the morphological characteristics of plants are greatly affected by changes in the environment, and under the same conditions, the floral traits within the sect. Chrysantha plants vary, and natural hybridization can occur [12]. It is difficult to correctly identify species within the sect. Chrysantha based on morphological characteristics. With the development of molecular biotechnology, many scholars have used molecular marker technology and DNA sequencing methods to study the classification of species [13, 14]. However, the different molecular markers have limitations, often lead to different conclusions, and have not been able to resolve long-standing evolutionary controversies. Fang Wei et al. [15] applied DNA fragments to examine the phylogeny and affinities of the genus Camellia, and their results suggested that the sect. Chrysantha may be a complex lineage group. Liufu Yongqing [16] constructed a phylogenetic tree of 33 species of C. petelotii using chloroplast small single copy (SSC) sequences, and his results clarified the phylogenetic positions of the species and possible interspecific hybridization. However, the strong conservation of SSC regions in the chloroplast genome may not provide sufficient information about variation in interspecific or low-level taxonomic unit studies, thus affecting the accuracy of phylogenetic tree construction. Yao Hanya et al. [17] studied 22 species of C. petelotii using DNA barcoding technology. The results showed that rbcL, matK, psbA-trnH, trnL-trnF, and combined sequences could effectively distinguish different species of C. petelotii, and thus, they could be used as barcodes for species identification of C. petelotii. Zhang Xiaoyu [18] conducted a comparative analysis of 23 species of C. petelotii plants by means of chloroplast whole genomes, and the results of this study resolved the taxonomic status of some of the species. However, due to factors such as the fewer number of valid information loci, some of the evolutionary controversies still need to be resolved via further investigation. Inconsistent results between analyses of the chloroplast and nuclear genomes due to hybridization necessitated further exploration of the evolution of sect. Chrysantha species using a wider range of data. Currently, the mitochondrial genomes of sect. Chrysantha plants are rarely reported. Comparative analyses of their whole mitochondrial genomes have not yet been performed.

Plant mitochondria play a crucial role in energy conversion, fatty acid synthesis, amino acid metabolism, and stress response [19]. Together, these effects contribute to the adaptive evolution of plants, enabling them to better adapt to changing environmental conditions. The size of mitochondrial genomes in angiosperms is usually between 100 kb and 10 Mb. Plant mitochondrial genomes are structurally complex, consisting mostly of noncoding DNA sequences containing many short repetitive sequences and homologous sequences, accounting for 2–60% of the total genome length [20]. However, a complete set of tRNA genes sufficient to read all codons is generally lacking in plant mitochondrial genomes, requiring input from the nucleus or cytoplasm. The GC content of the mitochondrial genomes of different plants is closely related to plant evolution and stability. For plants, maintaining the relative stability of the mitochondrial genome is essential for maintaining normal mitochondrial function and adaptation to the environment [21]. Therefore, studying the GC content of plant mitochondrial genomes can provide valuable clues for understanding the evolutionary history and ecological adaptations of plants. The study of plant mitochondrial RNA editing sites has revealed widespread RNA editing in plant mitochondrial genomes. These editing sites are numerous and unevenly distributed across plants and genes, but the vast majority are located in coding regions and are essential for normal protein function [22]. RNA editing regulates gene expression by altering transcript sequences and may play an important role in plant mitochondrial function.

In this study, we assemble the whole mitochondrial genomes of three sect. Chrysantha plants. The mitochondrial genomes of the sect. Chrysantha plants reported in the present study were obtained from the National Center for Biotechnology Information (NCBI) database, and comparative analyses of the basic structure, GC content, codon preference, repetitive sequences, RNA editing sites, Ka/Ks ratio, Haplotypes, and RNA editing sites were carried out for the four Aurelia group plants. Moreover, covariance analyses revealed rearrangements during the evolutionary process of four sect. Chrysantha plants, and high-level homologous sequence transfer between mitochondria and chloroplasts was analyzed. Finally, we constructed a phylogenetic tree of sect. Chrysantha plants based on mitochondrial protein-coding genes. Our study of the mitochondrial genome of plants in the sect. Chrysantha not only enriches the genetic database of this taxon but also provides valuable insights into its speciation, kinship identification, and molecular phylogenetic studies.

Materials and methods

Material collection, DNA extraction and sequencing

The plant samples of Camellia huana T. L. Ming & W. J. Zhang, Camellia liberofilamenta Chang & C. H. Yang, and Camellia nitidissima C. W. Chi collected in this study were from Tianba Village, Luodian County, Luxiong Village, Ceheng County and Ceheng State Forestry Farm, Ceheng County, respectively. The specimens were preserved in the Tree Herbarium of the College of Forestry, Guizhou University (GZAC), under the specimen numbers of LZ-20,221,102, LZ-20,231,102, and LZ-20,230,404, respectively. The botanical specimens were identified by Li Zhi. The selected plants were healthy and disease-free. Young, healthy leaves were collected and stored at -80 °C. The experimental materials were placed in a mortar and pestle, frozen with liquid nitrogen, and then quickly ground to powder form. Genomic DNA was extracted from the leaves using a modified CTAB method [23]. DNA integrity was detected by 1% agarose gel electrophoresis, and DNA purity and concentration were checked by a Nanodrop instrument (Thermo Fisher Scientific, Waltham, MA, USA, 2000c UV Vis). Qualified samples were sent to Shanghai Lingen Biotechnology (http://www.biozeron.com/) for Illumina sequencing (Illumina NovaSeq 6000) and Nanopore sequencing (Oxford Nanopore Technologies).

Genome assembly and annotation

First, Illumina sequencing data were assembled using GetOrganelle v1.7.1 [24]. Then, the second-generation assembled sequences were compared to Nanopore third-generation data using BWA v0.7.17 [25] to extract the third-generation data of the target samples, and the extracted third-generation data were mixed and assembled with the second-generation data using SPAdes v3.14.1 software; the sequences with a sufficiently high depth of coverage and a longer assembly length were selected as the candidate sequences, and the sequences were compared to the NT library to confirm the mitochondrial scaffold sequences and link the sequences according to overlap. Clean reads were compared to the mitochondrial genome sequence, and bases were corrected using Pilon v1.23. The final mitochondrial genome sequence was obtained by determining the start position and orientation of the mitochondrial scaffold from the reference genome (Camellia tianeensis; GenBank accession numbers: PP727208).

Mitochondrial genes were annotated using the online GeSeq tool (https://chlorobox.mpimp-golm.mpg.de/geseq) [26] with default parameters to predict protein-coding genes, transfer RNA (tRNA) genes, and ribosomal RNA (rRNA) genes. The location of each coding gene was determined by a BLAST search of the reference mitochondrial gene (C. tianeensis) [27]. Genes with start/end codons and intron/exon boundaries were manually corrected in SnapGene Viewer. Genome mapping was performed using the OGDRAW tool [28].

Mitochondrial genome-shared gene analysis

Mitochondrial genome coding sequence (CDS) sequences were extracted from four plants of the sect. Chrysantha. Homologous sequences were extracted by BLAST software and compared using MAFFT v7.149 software. Phylogenetic trees were constructed for 10 shared genes in mitochondria using MEGA software. Haplotype analysis was performed using DnaSP v5 [29], and mapping was performed with PopART v1.7 [30]. The nonsynonymous/synonymous mutation ratio (Ka/Ks) of the 10 shared genes in the four sect. Chrysantha plants were calculated using Ka/Ks-Calculator v3.0 software [31] and finally plotted using Origin 2022 software.

Preferential use of codons

CDSs larger than 300 bp were selected for analysis according to the selection criteria for codon-preferred sequences [32]. The screened mitochondrial CDSs were analyzed using Codon W software to obtain the effective number of codon (ENC) and relative synonymous codon usage (RSCU) data for each CDS [33]. Parameters such as the GC content of the 1st nucleobase (GC1), 2nd nucleobase (GC2), and 3rd nucleobase (GC3) of the codon, the average GC content of the 1st nucleobase and 2nd nucleobase (GC12), the average GC content of the three nucleobases (GCall), and the contents of A, T, C, and G on the 3rd nucleobase (A3, T3, C3, and G3) were analyzed using CUSP online software [34, 35].

A neutral plot was drawn with mitochondrial GC12 as the vertical coordinate and the GC3 position as the horizontal coordinate. If there is a significant correlation between GC12 and GC3, the slope of the regression curve is close to 1, indicating that codon preference is strongly influenced by base mutations, and vice versa, indicating that it is strongly influenced by natural selection [36]. The ENC plot was drawn with GC3 as the horizontal coordinate and ENC as the vertical coordinate. An ENC of 20 indicates that only one codon per base is used, which is an extreme preference; an ENC of 61 indicates that the codons are biased toward random use, and there is no usage preference. Using G3/(G3 + C3) as the horizontal coordinate and A3/(A3 + T3) as the vertical coordinate, a gene codon bias analysis plot (PR2-plot) was generated. The center point of the PR2 plot represents the codon state when there is no bias in use, i.e., A = U and C = G, and the vector distances of the remaining points from the center point represent the degree and direction of the base bias of the codons of the respective genes [37].

Repetitive sequences and prediction of RNA editing sites

The mitochondrial genome simple repeat sequences (SSRs) of four plants in sect. Chrysantha were identified through the MISA website (https://webblast.ipk-gatersle-ben.de/misa/) [38] with the following parameters: the number of simple repeats was set to 10 for single nucleotides, 5 for dinucleotides, the number of trinucleotide simple repeats was set to 4, and the number of tetranucleotide, pentanucleotide and hexanucleotide simple repeats was set to 3. Forward, reverse, complementary, and palindromic repeats within the genome were identified by the website REPuter (https://bibiserv.cebitec.uni-bielefeld.de/reputer) [39] with parameters set to minimum length = 30 bp and Hamming distance = 3. The sequences of protein-coding genes from the mitochondrial genomes of the four sect. Chrysantha plants were predicted for RNA editing sites using Prep-Mt [40]. Statistical analysis and graphing of the data were performed in Excel 2010.

Identification of homologous fragments and covariance analysis

Organellar genome fragments were exchanged using BLASTN software to compare the mitochondrial and chloroplast genomes of the corresponding species with an E value of le-5, setting the minimum comparison length to 50 bp, and mapping the covariance of the chloroplast and mitochondrial genomes using Circos software [41]. The four mitochondrial genomes were compared using MUMmer v3.23 (http://mummer.sourceforge.net/) software, which identified large-scale covariance relationships between the genomes. Interregion comparison was then performed using LASTZ v1.03.54 to confirm the local positional alignment relationships. Mapping was performed using Mauve software.

Phylogenetic tree

To investigate the phylogenetic relationships of mitochondria in four species of the sect. Chrysantha of plants, 24 mitochondrial genome sequences (seven from the genus Camellia) were downloaded from the NCBI database, with Ginkgo biloba serving as the outgroup. Their protein-coding genes were extracted for phylogenetic tree construction, and sequence alignment was performed using MAFFT v7.5 software. Phylosuite software [42] was used to construct the phylogenetic tree by Bayesian inference (BI), and iqtree maximum likelihood (ML) was used to construct phylogenetic trees using MFP as the model. The resulting files were uploaded to the Interactive Tree of Life (iTOL) website [43] to construct the phylogenetic tree.

Results

Characterization of the mitochondrial genomes of the four sect. Chrysantha plants

The mitochondrial DNA compositions of four plants (Camellia tianeensis, Camellia huana, Camellia liberofilamenta and Camellia nitidissima) were statistically analyzed. The results showed that the mitochondrial genomes of the four sect. Chrysantha plants were composed of closed-loop DNA molecules with sizes ranging from 850,836 bp (C. nitidissima) to 1,098,121 bp (C. tianeensis) (Fig. 1). The A, T, C and G contents of the constituent amino acids were 27.1–27.25%, 27.02–27.19%, 22.84–22.91%, and 22.85–22.94%, respectively. The GC content was highly conserved during evolution, with values in the range 45.71–45.78%. CDSs, tRNAs, and rRNAs accounted for 2.49–3.46%, 0.22–0.28% and 0.47–0.71% of the total mitochondrial genome, respectively (Table 1). The number of encoded proteins was consistent and included complex I-complex V, cytochrome c synthase subunits, ribosomal proteins, maturation enzyme proteins, and transit membrane proteins, ranging in number from 48 to 58, while specific gene duplications were observed in different species. Multiple copies were observed to varying degrees in all four plants. A total of nine annotated genes were found to contain introns, of which cox2, rpl2, trnA-UGC, trnI-GAU and ccmFc contained only one intron, and ccmFc appeared to be intronic in all four sect. Chrysantha plants (Table S1). Overall, the mitochondrial genomes of the four plant species varied only slightly in terms of gene type or number. By comparing the PCGs of the mitochondrial genomes of the four individual species, gene loss and gene duplication were found for some genes. For example, rpl1 was present only in C. nitidissima, while it was absent in the other three species of sect. Chrysantha. The rpl2 gene was present only in C. nitidissima and C. huana and was lost in C. liberofilamenta and C. tianeensis. The gene variety was most complete in C. nitidissima, while the gene loss was most severe in C. tianeensis (Table S1).

Table 1 Characterization of the mitochondrial genomes of four species of sect. Chrysantha plants
Fig. 1
figure 1

Mitochondrial genome mapping of four sect. Chrysantha plants. (A: C. tianeensis; B: C. huana; C: C. liberofilamenta; D: C. nitidissima)

Comparative analysis of shared genes

To better understand the mitochondrial genomes of the four sect. Chrysantha species, we identified 10 shared genes (atp1, atp6, cob, cox2, ccmFc, nad2, nad5, rpl10, rps1, and sdh3) through analysis of their protein-coding genes and constructed a phylogenetic tree by using these shared genes (Fig. 2). The results indicated that C. huana and C. liberofilamenta are very closely related to each other. Moreover, to gain a deeper understanding of their genetic variation, we analyzed the mutation sites of 10 shared genes in the four plant species and constructed a haplotype network map (Fig. 3). C. nitidissima showed a typical haplotype construction, with C. tianeensis and C. liberofilamenta having the same composition of these 10 shared genes. In addition, C. huana shared these 10 genes with three other plants. The Ka/Ks values were calculated for 10 shared genes from the mitochondrial genomes of the four sect. Chrysantha plants to clarify the evolutionary selection of the mitochondrial genomes under environmental stress (Fig. 4). The Ka/Ks values of the atp6 gene were all greater than 1, indicating positive selection.

Fig. 2
figure 2

A phylogenetic tree was constructed based on 10 shared genes in four sect. Chrysantha plants

Fig. 3
figure 3

Haplotype network mapping of four sect. Chrysantha plants constructed on the basis of 10 shared genes

Fig. 4
figure 4

Ka/Ks ratios of 10 shared genes in the mitochondrial genomes of four sect. Chrysantha plants (using C. huana as a reference)

Codon preference analysis

A total of 26 mitochondrial genome coding sequences (CDSs) were screened for C. tianeensis, 26 for C. liberofilamenta, 27 for C. huana and 27 for C. nitidissima. Among the mitochondrial genome codons, GCall, GC1, GC2, GC3 and GC12 had percentages of 42.22-42.51%, 47.38-48.01%, 42.73-42.87%, 36.33-36.83% and 45.06-45.37%, respectively, and the ENC was 50.22, 49.74, 49.42, and 49.99, respectively, with a weak genomic codon preference (Fig. 5A, Table S2). The frequency distributions of the effective codon ratios and plots of ENC versus GC3 (ENC plot) for the mitochondrial genomes of the four Chrysantha plants showed (Fig. 5B) that the mitochondrial genome ENC mostly deviated from the standard curves, suggesting that their codon preferences were more susceptible to base mutations. Neutral plotting analysis (Fig. 5C) revealed that there was no significant correlation between GC12 and GC3 in any of the four plants, suggesting that the third position of the mitochondrial genome codon in plants of the sect. Chrysantha has a different pattern of base mutation from the first and second bases and that codon preference over base mutation is more strongly influenced by natural selection. The base bias at position 3 of the codons of each gene showed (Fig. 5D) that the genes were unevenly distributed in four different regions, with most being located in the lower and right halves of the plane view. This indicates that the mitochondrial genomic codon of plants in sect. Chrysantha occurs more frequently at position 3 for T bases than for A bases and more frequently for G bases than for C bases. Thus, genomic codon preference is affected not only by mutation but also by natural selection.

Fig. 5
figure 5

Codon preference analysis of the mitochondrial genomes of four sect. Chrysantha plants. (A: Distribution of mean values of different GC contents; B: ENC plot; C: codon neutrality plot; D: PR2 plot; E: relative synonymous codon usage)

Analysis of the relative usage of synonymous codons (RSCU) of mitochondrial genes showed (Fig. 5E, Table S3) that there were 29 mitochondrial gene high-frequency codons (RSCU greater than 1), including 11 ending in A, 15 ending in T, 2 ending in C, and 1 ending in G. There were 2 codons with RSCU equal to 1: ATG and TGG. There were 31 low-frequency codons (RSCU less than 1) in the gene, 3 for A, 16 for C, and 12 for G. This finding suggests that the most highly used codons in the mitochondrial genomes of the four plant species are those ending in A and T.

Repeat sequence analysis and RNA editing sites

Simple repetitive sequences have the characteristics of high reproducibility, codominant inheritance, uniparental inheritance, and strong relative conservatism, which make them highly efficient molecular markers and most suitable for species identification and evaluation of genetic variation at the population and individual levels. Comparative SSR analyses of the mitochondrial genomes of the four sect. Chrysantha plants sequenced in this study revealed a total of 1099 SSR loci. These included 123 mononucleotides, 314 dinucleotides, 146 trinucleotides, 434 tetranucleotides, 72 pentanucleotides, and 10 hexanucleotides. The number of SSRs in different species ranged from 233 to 306, with the highest number of simple repetitive sequences in C. tianeensis and the lowest number of simple repetitive sequences in C. huana (Fig. 6A). Among all the types, tetranucleotide repeats were the most abundant, ranging in number from 91 to 124 and accounting for 39% of all SSRs. This was followed by dinucleotides, with an average of 78 per species for a total of 29%, trinucleotides (Tri-) at 13%, mononucleotides (Mono-) at 11%, pentanucleotides at 7%, and hexanucleotides at 1% (Fig. 6B). There are two types of single nucleotide repeats, of which A/T was the most numerous. There are three types of dinucleotide repeats. There are seven types of trinucleotide repeats. Among the 25 types of tetranucleotide repeats, the ACAT/ATGT and ACGG/CCGT types were unique to C. huana, the ACAG/CTGT type was unique to C. nitidissima, and the ATCC/ATGG type was absent from C. liberofilamenta. Among the 15 types of pentanucleotide repeats, the AAGGT/ACCTT and ACTAT/AGTAT types were unique to C. huana, and the AAAGG/CCTTTT, AAATT/AATTT, AACTT/AAGTT, and AAGGC/CCTTG types were unique to C. nitidissima. There were five types of hexanucleotide repeats, all of which were present in C. huana (Fig. 6C).

Fig. 6
figure 6

Repeat sequence analysis of the mitochondrial genomes of the four sect. Chrysantha plants. (A: Number of six SSR types; B: proportion of six SSR types; C: number of SSR repetitive sequence types; D: number of four dispersed repetitive sequence types; E: length of four long repetitive sequence types)

Long repetitive sequences can be classified according to length into 9 types (30–34, 35–39, 40–44, 45–49, 50–54, 55–59, 60–64, 65–69, and ≥ 70 bp) and according other features into four types (forward repetitive sequence (F), reverse repetitive sequence (R), complementary repetitive sequence (C), and palindromic sequence (P)). Analysis of the long repetitive sequences of the complete mitochondrial genomes of the four species revealed numbers ranging from 1000 to 2010 for each species, with F and P sequences exhibiting the highest percentage, followed by R and C. The long repetitive sequences of the complete mitochondrial genomes of the four species were analyzed in the same way. The fewest types of duplicates were found in C. huana, and no type C duplicates were found in C. tianeensis or C. huana (Fig. 6D). In all four species, 30–34 bp repeats occurred most frequently, followed by 35–39 bp repeats, and 60–64 bp repeats occurred least frequently. Interestingly, repeat sequences greater than 70 bp in length were present in all four species. Analysis of the sequence lengths of the repetitive elements of different repeat types revealed an inverse relationship between 30 and 69 bp repeats among all four types of repeats detected, accompanied by a gradual decrease in the number of repetitive elements as the sequence length increased. The sequence lengths of the repeats in most species repeats were between 30 and 34 bp (Fig. 6E).

RNA editing sites were predicted in the four plants by Prep-Mt. A total of 24 genes were found to undergo RNA editing, with a total of 1,587 editing sites, all of which were C-to-U transitions. Among them, no editing occurred in the rpl2 genes of C. tianeensis and C. liberofilamenta, and no editing occurred in the rps7 gene of C. huana (Fig. 7A). A total of 549 (35%) site edits occurred at the first base of the codon, 1,038 (65%) site edits occurred at the second base of the codon, and none occurred at the third base of the codon (Fig. 7B, C). Amino acid changes occurred at all sites, with the main patterns of change being A(Ala)→V(Val), H(His)→Y(Tyr), L(Leu)→F(Phe), P(Pro)→F, P→L, P→S(Ser), Q(Gln)→*(Stop codon), R(Arg)→*, R→C(Cys), R→W(Trp), S→F, S→L, T(Thr)→I(Ile), and T→M(Met). Of these, the highest frequency of changes occurred at P→L, followed by S→L, and the least at Q→* (Fig. 7D). Before RNA editing, of the 1,587 edited sites, 61% were hydrophilic amino acids, whereas after editing, 13% were hydrophilic amino acids; the percentage of hydrophobic amino acids increased from 39 to 87% (Fig. 7E, F). This resulted in a shift from hydrophilic to hydrophobic amino acids, which led to an increase in the hydrophobicity of the protein. Many of the amino acid changes triggered by RNA editing introduced more hydrophobic amino acids into the protein structure, thereby altering the hydrophilic nature of the protein, and may play a key role in maintaining the regulation of mitochondrial gene expression.

Fig. 7
figure 7

Prediction of RNA editing sites in four plant species based on protein-coding genes. (A: number of RNA editing sites in each gene; B: ratio of codons in different positions; C: number of codons in different positions; D: type of RNA editing site; E: ratio of hydrophilicity to hydrophobicity of amino acids before RNA editing; F: ratio of hydrophilicity to hydrophobicity of amino acids after RNA editing)

Horizontal gene transfer and covariance in the organellar genome

The mitochondrial genomes of the four sect. Chrysantha plants in this study were approximately 4.6–7.0 times longer than their chloroplast genomes (Table S4). However, the protein-coding gene sequences in their mitochondrial genomes accounted for only 2.5–3.5% of their full length, most of which consisted of noncoding regions, and the function of this portion remains unknown. Twelve, six, five, and twelve gene exchange fragments were found in C. tianeensis, C. huana, C. liberofilamenta, and C. nitidissima, respectively, and their total lengths were 42,875 bp, 6,469 bp, 10,377 bp, and 11,269 bp, respectively (Fig. 8D, Table S4). In C. tianeensis, all 12 migrating fragments exceeded 100 bp, 6 fragments were in the range of 100–500 bp, 4 fragments were in the range of 500–1000 bp, and 2 fragments were more than 1 kb in length. There were 9 integrative genes on these fragments, which were CDSs (5), tRNAs (3), and rRNAs (1). There were three fragment sequences of 100–500 bp in C. huana, and the remaining three were above 1 kb in size. There were six genes, comprising three each of CDSs and rRNAs (Fig. 8A). The C. liberofilamenta fragments were all more than 1 kb in length and were integrated into the genes rpl2, trnA-UGC, and 23 S (Fig. 8B). There were 7 transfer fragments of 100–500 bp in C. nitidissima, and the remaining 5 were more than 1 kb in size and were integrated into 11 genes, i.e., 8 and 3 each of CDSs and rRNAs, respectively (Fig. 8C). Numerous chloroplast protein-coding genes migrated from chloroplasts to mitochondria, most of which lost their integrity during evolution, and partial sequences of these genes, e.g., nad1, ccmC, and rrn18, are found in the mitochondrial genome. Transferred tRNA genes are much more conserved, suggesting that they have indispensable roles in mitochondria.

Fig. 8
figure 8

Analysis of homologous fragments in different organelles of four sect. Chrysantha plants. The green arcs represent chloroplast DNA, and the yellow arcs represent mitochondrial DNA. (A: C. huana; B: C. liberofilamenta; C: C. nitidissima; D: number of horizontal gene transfers in different species)

The results of the rearrangement and collinearity of the mitochondrial genomes of the four sect. Chrysantha plants showed that the four plants split into numerous localized colinear blocks. The size of these local colinear blocks and their relative positions varied greatly (Fig. 9). Their genes were arranged in an inconsistent order, with poor covariance and internal structural inversions or gene rearrangements.

Fig. 9
figure 9

Mitochondrial genome covariance mapping in four species of sect. Chrysantha plants

Phylogenetic tree

In this study, the mitochondrial genome sequence was used for the first time for the evolutionary analysis of plants in the sect. Chrysantha, and a phylogenetic tree was constructed by Bayesian inference and maximum likelihood methods by combining 28 species from 12 families. The results show (Fig. 10) clear clustering of plants from each of the different families, with the four species of the sect. Chrysantha clustered on the same branch and the 12 species of Theaceae clustered into one group. The four species of sect. Chrysantha plants are more closely related, with C. liberofilamenta and C. huana closer together and then clustered with C. tianeensis and C. nitidissima, suggesting that they have closer affinities. In this study, the whole mitochondrial genome sequence was applied to the evolutionary analyses of sect. Chrysantha plants, which will provide new ideas for future genetic relationship studies of sect. Chrysantha plants.

Fig. 10
figure 10

A phylogenetic tree based on protein-coding genes constructed from the mitochondrial genomes of 28 plant species

Discussion

This study is the first to comparatively analyze the mitochondrial genomes of four plants from the sect. Chrysantha. The mitochondrial genomes of all four plants were ring shaped, which is consistent with the fact that most of the mitochondrial genomes were reported to be circular in structure [44]. The mitochondrial genome size ranged from 850,836 bp to 1,098,121 bp. The GC content was evolutionarily conserved, with values ranging from 45.71 to 45.78%, and was higher than that of Sunflower (45.22%), Angelica dahurica (45.06%), and Prunella vulgaris (43.92%); these are high levels among higher plants [45,46,47]. The protein-coding region occupies only approximately 2.5–3.5% of the full length, and the noncoding region occupies approximately 90%. The relatively similar functional classification of protein-coding genes in the mitochondrial genomes suggested that the number of mitochondrial genomes of closely related species is essentially the same and that the protein-coding gene sequences are highly conserved [48, 49]. The coding regions of the genome are more conserved than the noncoding regions, and the noncoding regions are also a major source of mitochondrial genomic variation [50]. The mitochondrial genome intergenic region is composed mainly of repetitive sequences, chloroplast genome homologous sequences, and nuclear genome homologous sequences. Repetitive sequences containing tandem, short and long repeats, are widely present in the mitochondrial genome [51] and are essential for intermolecular recombination of the mitochondrial genome. They are usually considered the main cause of mitochondrial genome differences in plants [52]. Most protein-coding genes begin with a typical ATN codon [53]. Some genes contain one or more introns, and these introns may play an important role in the regulation of gene expression [54]. At the same time, we annotated two identical rRNA genes (rrnL and rrnS) in the mitochondrial genomes of the four species, as is the case for most land plants [44]. The occurrence of partial deletions of ribosomal protein genes in the four species is consistent with the tendency of these genes to be readily lost during angiosperm evolution [55]. Therefore, the results of this study deepen the understanding of the genome structure of sect. Chrysantha plants and provide a reference for the discovery of their new functional genes and genetic improvement.

The Ka/Ks ratio is important for evaluating the effects of plants on environmental stresses during evolution and can reveal the effects of genetic changes on the phenotypes of different individual seed plants in terms of the evolutionary selection of genes [31]. We screened atp6 as a positively selected gene from 10 shared protein-coding sequences of four mitochondrial sect. Chrysantha plants. Their high ratios suggest that they play an important role in species adaptation to selection. From the distribution of haplotypes and the haplotype network structure, it was found that haplotype H1 was distributed in all 10 shared genes of C. tianeensis and occupied a dominant position, which indicated that H was an obvious shared haplotype and that it was presumed to be an ancient ancestral haplotype. This analysis is of great scientific value for understanding the molecular evolutionary pathways, the conservation and innovation of gene functions, and the genetic basis of adaptive differences between species in plants of the sect. Chrysantha.

The numbers of codons in the mitochondrial genomes of the four species of plants in sect. Chrysantha were 42.22–42.51%, GCall, GC3, and 36.33–36.83%, with most values below 50%, indicating that base 3 of the mitochondrial gene codons of the four species was predominantly A/T, which is consistent with the mitochondrial codon preferences of Ganoderma and Setaria italica [56, 57]. This shows that the plant organelle genome evolutionary trends were roughly the same. The RSCU is an important indicator for evaluating the codon usage pattern of biological organelle genomes. In this study, there were 29 high-frequency codons, 26 of which ended in A/T; this finding is consistent with the codon usage pattern of the mitochondrial genome of Oenanthe javanica [58]. The results of neutral mapping analysis, ENC plot analysis, and PR2 plot analysis all showed that mitochondrial genome codons are strongly influenced by natural selection, which is consistent with the findings for Hemerocallis citrina codons [59], unlike those for Masteraria chamomilla codons, which are influenced mainly by base mutations [60], suggesting that diverse factors influence species codon preferences. This convergence phenomenon may have originated from the adaptive selection of the four sect. Chrysantha plant species by similar environmental factors during long-term natural evolution. This preliminary study revealed that the main reason for the formation of codon preferences in the mitochondrial genomes of the four species is natural selection, which will be highly important for exploring the molecular properties and genetic diversity of sect. Chrysantha, determining the evolutionary pressure of genes, and performing molecular breeding.

From an evolutionary point of view, differences in repetitive sequences between species are the result of natural selection [61], and the more concentrated distribution of plants in the sect. Chrysantha and the study of the characteristics of the repetitive sequences within their genomes are of great significance to the evolution of these species. In this study, it was found that the AAGGT/ACCTTT and ACTAT/AGTAT types were unique to C. huana, and the AAAGG/CCTTTT, AAATT/AATTT, AACTT/AAGTT, and AAGGC/CCTTG types were unique to C. nitidissima, which is helpful for tracing the roots of interspecific crosses in the sect. Chrysantha. Mitochondrial genomic SSRs are short (1–6 bp) repetitive unit sequences in the mitochondrial genome that are shorter in length and more limited in number and distribution than SSRs in the nuclear genome [62, 63]. The types of repetitive SSR units in the mitochondrial genome of the four sect. Chrysantha plants were more consistent, indicating that the mitochondrial genome sequences of the sect. Chrysantha of plants are relatively conserved, which is similar to the results of a previous study [64]. In this study, we chose to analyze the differences among the four plant species of the sect. Chrysantha under the same setup conditions, and the results can reflect the differences between the species more realistically; moreover, how these simple repeat sequences actually affect the evolution of the species still needs to be further investigated.

The phenomenon of RNA editing is prevalent in the organelle genomes of higher plants and plays an important role in organelle development, morphogenesis, and stress response in higher plants [65]. In this study, we found that RNA editing sites in the mitochondrial genomes of four plant species were prone to causing changes in the bases at the first and second positions in the corresponding amino acids, with a greater frequency of editing occurring at the second position, which is consistent with other plant mitochondrial RNA editing features [44]. A total of 24 genes were detected with editing at locus 1038, and all were typical C-to-U transitions, similar to previous studies [66]. During RNA editing, 14 types of amino acid changes can occur: A→V, H→Y, L→F, P→F, P→L, P→S, Q→*, R→*, R→C, R→W, S→F, S→L, T→I, and T→M. Most of the amino acids are converted to hydrophobic amino acids, and the hydrophobicity of the edited proteins increases [67]. Hydrophilic amino acids are distributed on the surface of protein molecules, while hydrophobic amino acids are predominantly distributed in the interior, and the correlation of hydrophilic and hydrophobic amino acids can often be used to determine the general trend of protein folding [68]. An increase in hydrophobic amino acids during RNA editing may increase protein stability [65]. The identification of these RNA editing sites provides essential clues for future studies on the evolution and neo-codon prediction of gene function and can help us better understand gene expression in the mitochondrial genome of plants.

The genomes of plant mitochondria vary greatly in size and structure, which is mainly due to the large number of repetitive sequences contained therein on the one hand and, on the other hand, due to frequent horizontal migration [44]. The plant mitochondrial genome often exchanges genetic material with the nucleus and the chloroplast genome, and some genes in chloroplasts are often integrated into the mitochondrial genome. DNA migration is common in plants, and the mitochondrial genome contains many sequences that have migrated from the chloroplast genome [69]. It varies from species to species during autophagy, gametogenesis, and fertilization [70] and plays an important role in the evolution of plant mitochondrial genomes. DNA from chloroplasts can be found in virtually every plant whose mitochondrial genome has been sequenced to date. Typically, this portion of the migrating integrated sequence accounts for 1–12% of the total length of the mitochondrial genome [71]. In the mitochondrial genomes of the four sect. Chrysantha plants, the full length of the sequences from chloroplasts ranged from 6,469 bp–11,269 bp, which accounted for 2.5–3.5% of the total length. Except tRNAs, the vast majority of sequences in the mitochondrial genome that have migrated from the chloroplast genome are now considered “dead on arrival” [72]. The mitochondrial genome has been shown to express chloroplast-derived tRNA genes [73], and the expression of these genes is associated with the nuclear genome-encoded rpoTm polymerase [74]. This feature is also specific to the mitochondrial genomes of higher plants, as no similar phenomenon has been found to exist in some of the lower plants for which organelle genomes have been determined. It is unlikely that sequences found within the chloroplast genome that come from the mitochondrial genome exist in higher plants [75]. However, evidence of the migration of the mitochondrial genome to the chloroplast genome has been analyzed in studies of the mitochondrial genome in Vitis vinifera and Daucus carota by researchers [76, 77]. In contrast to the migration of chloroplast sequences to the mitochondrial genome, the migration of mitochondrial sequences to the chloroplast genome is not present in all species, and the lengths of the migrating sequences found thus far are very limited. Our results did not reveal sequence migration of mitochondria towards the chloroplast genome.

Plant mitochondrial genomes are characterized by structural rearrangements, large numbers of genes lost or gained, the transfer of mitochondrial genes to nuclear genes, and very low nucleic acid mutation rates [78], providing unique information for phylogenetic analyses [79] and analyses of homology among plant mitochondrial genes that can reveal the relationships and evolutionary history among different species [80]. Evolutionary analyses and comparisons revealed that the mitochondrial genomes of the four sect. Chrysantha species underwent frequent genetic recombination events during the evolutionary process. These colinear regions and inverted structures not only reveal the conserved patterns of the mitochondrial genomes but also reflect the evolutionary relationships among species and the evolutionary history of sect. Chrysantha plants, as well as revealing the affinities and evolutionary histories of the species, which provides a new perspective for exploring their genetic bases and evolutionary histories.

A large amount of data demonstrates that the chloroplast genome sequence, with its moderate rate of base variation and other characteristics, serves as an ideal material for constructing phylogenetic trees [81]. However, there are relatively few phylogenetic studies on mitochondrial genome construction. In this first application of whole mitochondrial genome sequences to the evolutionary analyses of sect. Chrysantha plants, the mitochondrial genomes of four sect. Chrysantha plants that have been sequenced and the published mitochondrial genome sequences of seven other families were selected for phylogenetic relationship analysis. The results showed that the classification of each family was still clear, and the four species of sect. Chrysantha were the closest relatives, indicating that the mitochondrial genome can be used to analyze the phylogenetic relationships of sect. Chrysantha plants. Our present study on the mitochondrial genome represents only the tip of the iceberg, and the mitochondrial genomes of the sect. Chrysantha of plants need to be further investigated before accurate conclusions can be drawn.

Conclusion

In this study, we successfully assembled three high-quality mitochondrial genomes from plants of the sect. Chrysantha and, by using the published C. tianeensis sequence, a comprehensive comparative analysis of the mitochondrial genomes of the four species was carried out in terms of their compositional structure, genotypes, haplotypes, codon preferences, and RNA editing sites, thus revealing the structure of the mitochondrial genomes of sect. Chrysantha and their compositions. A comparison of PCGs of the mitochondrial genomes of four individual species revealed the presence of gene loss and gene duplication in some genes. In addition, RNA editing site and phylogenetic tree analyses revealed a shift from hydrophilic to hydrophobic amino acids, leading to an increase in protein hydrophobicity. Twelve species of Theaceae were clustered into one group. C. huana and C. liberofilamenta clustered together, and then C. tianeensis and C. nitidissima clustered together as a group, indicating that they have a closer relationship. This study provides more comprehensive genetic information for the genomes of C. tianeensis, C. huana, C. liberofilamenta, and C. nitidissima, which is important for revealing the function of the mitochondrial genome and for studying the genetic characteristics, origin evolution, conservation, and utilization, as well as the taxonomic status, of plants in the sect. Chrysantha.

Data availability

The mitochondrial genomes of Camellia nitidissima, Camellia liberofilamenta and Camellia huana in this study have been uploaded to NCBI. Their Gene Bank accession numbers are PP975885-PP975887.

References

  1. Zhang HD, Ren SX. Flora of China, Beijing: Science Press. 1998; 49 (3): 101–112.

  2. Li SZ. Compendium of Materia Medica. Beijing: People’s Medical Publishing House; 1999. pp. 1922–3.

    Google Scholar 

  3. Liang SY. The world list of Camellia. Guangxi Forestry Sci. 2007;36(4):221–3. https://doiorg.publicaciones.saludcastillayleon.es/10.19692/j.cnki.gfs.2007.04.013.

    Article  Google Scholar 

  4. Chen Y, Gong SX, Xu X, Xu J, Zhang TJ, Hou XT, Deng JG. Research progress on chemical constituents and pharmacological studies of Camellia Nitidissima. Drug Evaluation Res. 2022;45(03):575–82. https://doiorg.publicaciones.saludcastillayleon.es/10.7501/j.issn.1674-6374.2022.03.025.

    Article  Google Scholar 

  5. Xia X, Huang JJ, Wang ZP, Wang Q, Pan WG. Hypoglycaemic effect and acute toxicity study of Camellia chrysantha leaves. Lishizhen Med Materia Med Res. 2013;24(5):1281–2. https://doiorg.publicaciones.saludcastillayleon.es/10.3969/j.issn.1008-0805.2013.05.117.

    Article  Google Scholar 

  6. Wang ZL, Guo YJ, Zhu YY, Chen L, Wu T, Liu DH, Huang BS, Du HZ. Active fractions of Camellia Nitidissima inhibit non-small cell lung cancer via suppressing epidermal growth factor receptor. China J Chin Materia Med. 2021;46(20):5362–71. https://doiorg.publicaciones.saludcastillayleon.es/10.19540/j.cnki.cjcmm.20210628.701.

    Article  CAS  Google Scholar 

  7. He DY, Li XY, Sai X, Wang LL, Li SY, Xu YP. CamNitidissimaissima C.W. Chi: a review of botany, chemistry, and pharmacology. Phytochem Rev. 2018;17:327–49. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s11101-017-9537-x.

    Article  CAS  Google Scholar 

  8. Wang ZN, Sun B, Yang R, Jia AQ. Flavonoids and other phenolics from Camellia Nitidissima Chi flowers. Nat Prod Res. 2021;37(2):180–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/14786419.2021.1960326.

    Article  PubMed  CAS  Google Scholar 

  9. Qin HZ, Zou R, Chai SF, Tang JM, Wei X. Study on seed germination and seedling photosynthetic characteristics of a rare and endangered plant Camellia indochinensis. Seed. 2022;41(08):27–34. https://doiorg.publicaciones.saludcastillayleon.es/10.16590/j.cnki.1001-4705.2022.08.027.

    Article  Google Scholar 

  10. Shi SY, Wu WF, Cui J, Zhang YY, Li ZH, Wang Y. Pollen morphology and taxonomic significance of ten species of sect. Chrysantha Guihaia. 2022;42(01):68–77. https://doiorg.publicaciones.saludcastillayleon.es/10.11931/guihaia.gxzw202008024.

    Article  CAS  Google Scholar 

  11. Li H, Qin XM, Journal of Guangxi Normal University (Natural Science Edition). Cluster analysis of karyotype resembl ancenear coefficent of sect. Chrysantha Chang,. 1997; 15 (2): 77–81, https://doiorg.publicaciones.saludcastillayleon.es/10.16088/j.issn.1001-6600.1997.02.017

  12. Wang Y, La YF, Dai YQ, Huang XN, Shi SY. Observation and comparison on flowering phenology and floral morphological characteristics of nine species of yellow camellia plants in Nanning City. J Plant Resour Environ. 2020;29(3):43–9. https://doiorg.publicaciones.saludcastillayleon.es/10.3969/j.issn.1674-7895.2020.03.06.

    Article  CAS  Google Scholar 

  13. Lu JS, Huang ZW, Li XM, Sun MY, Yu HJ, Zhou JY, Li CN, Bu ZY. Genetic Diversity Analysis of Camellia sect. Chrysantha Chang Germplasm resources by using iPBS molecular markers technology. Southwest China J Agricultural Sci. 2020;33(8):1638–44. https://doiorg.publicaciones.saludcastillayleon.es/10.16213/j.cnki.scjas.2020.8.006.

    Article  Google Scholar 

  14. Vijayan K, Zhang WJ, Tsou CH. Molecular taxonomy of Camellia (Theaceae) inferred from nrITS sequences. Am J Bot. 2009;96(7):1348–60. https://doiorg.publicaciones.saludcastillayleon.es/10.3732/ajb.0800205.

    Article  PubMed  CAS  Google Scholar 

  15. Fang W, Yang JB, Yang SX, Li DZ. Phylogeny of Camellia sects. Longipedicellata, Chrysantha and Longissima. Plant Divers. 2010;32(1):1–13. https://doiorg.publicaciones.saludcastillayleon.es/10.3724/SP.J.1143.2010.09120. (Theaceae) Based on Sequence Data of Four Chloroplast DNA Loci.

  16. Liufu YQ. Phylogenetic Studies of Yellow Camellias Based on Chloroplast Small Single Copy (SSC) Sequence, Guangxi normal University. 2015.

  17. Yao HY, Li YY, Bai YY, Gao H, Yan GY, Xie YJ. Ldentification of 22 kinds of Camellia Nitidissima Germplasm resources by DNA Barcoding Technology. Seed. 2023;42(10):144–53. https://doiorg.publicaciones.saludcastillayleon.es/10.16590/j.cnki.1001-4705.2023.10.144.

    Article  CAS  Google Scholar 

  18. Zhang XY. Study on the evolution of chloroplast genome of Sect. Chrysantha Chang, Shanxi Univisity. 2024.

  19. Ishihara N. Dynamics of mitochondrial double membranes, genomes, and functions. BBA Bioenerg. 2018;1859:e4. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.bbabio.2018.09.012.

    Article  Google Scholar 

  20. Dong SS, Chen L, Liu YL, Wang YL, Zhang SZ, Yang LL, Lang XA, Zhang SZ. The draft mitochondrial genome of Magnolia biondii and mitochondrial phylogenomics of angiosperms. PLoS ONE. 2020;15(4):e0231020. https://doiorg.publicaciones.saludcastillayleon.es/10.1371/journal.pone.0231020.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  21. Singh R, Ming R, Yu QY. Comparative analysis of GC content variations in Plant genomes. Trop Plant Biol. 2016;9:136–49. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s12042-016-9165-4.

    Article  CAS  Google Scholar 

  22. Cummings MP, Myers DS. Simple statistical models predict C-to-U edited sites in plant mitochondrial RNA. BMC Bioinformatics. 2004;5(132). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/1471-2105-5-132.

  23. Pahlich E, Gerliz C. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemistry. 1980;19(1):11–3. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/0031-9422(80)85004-7.

    Article  CAS  Google Scholar 

  24. Dierckxsens N, Mardulyn P, Smits G. Nucleic Acids Res. 2017;45(4):e18. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/nar/gkw955. NOVOPlasty: de novo assembly of organelle genomes from whole genome data.

  25. Cuong PQ, Binh KD, Tran NT. A high-performance FPGA-based BWA-MEM DNA sequence alignment, Concurrency and Computation. Practire Rxperience. 2019;33(2):e5328. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/cpe.5328.

    Article  Google Scholar 

  26. Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, Greiner S. GeSeq versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017;45(1):W6–11. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/nar/gkx391.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  27. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biology. 1990;215(03):403–10. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/S0022-2836(05)80360-2.

    Article  CAS  Google Scholar 

  28. Greiner S, Lehwark P, Bock R. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 2019;47:W59–64. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/nar/gkz238.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  29. Librado P., Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25(11):1451–2. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/bioinformatics/btp187.

    Article  PubMed  CAS  Google Scholar 

  30. Jessica WL, David B. Popart: full-feature software for haplotype network construction, methods in Ecology and Evolution.2015; 6 (9): 1110–6, https://doiorg.publicaciones.saludcastillayleon.es/10.1111/2041-210X.12410

  31. Zhang Z. KaKs_Calculator 3.0: calculating selective pressure on Coding and non-coding sequences. Genom Proteom Bioinform. 2022;20(3):536–40. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.gpb.2021.12.002.

    Article  CAS  Google Scholar 

  32. Darracq A, Varre JS, Drouard LM, Courseaux A, Castric V, Laprade PS, Oztas S, Lenoble P, Vacherie B, Barbe V, Touzet P. Structural and content diversity of mitochondrial genome in beet: acomparative genomic analysis. Genome Biol Evol. 2011;3:723–36. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/gbe/evr042.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  33. He B, Dong H, Jiang C, Cao FL, Tao ST, Xu LA. Analysis of codon usage patterns in Ginkgo biloba reveals codon usage tendency from A/U-ending to G/C-ending. Sci Rep. 2016;6(1):35927. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/srep35927.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  34. Ran ZH, Li Z, Xiao X, An MT, Yan C. Complete chloroplast genomes of 13 species of sect. Tuberculata Chang (Camellia L.): genomic features, comparative analysis, and phylogenetic relationships. BMC Genomics. 2024;25:108. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12864-024-09982-w.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  35. Gu L, Su T, An MT, Hu GX. The complete chloroplast genome of the vulnerable Oreocharis esquirolii (Gesneriaceae): structural features, comparative and phylogenetic analysis. Plants. 2020;9(12):1692. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/plants9121692.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  36. Xiao X, Zhao ZH, Lu JT, Yang GY, Li Z. Analysis on Chloroplast Genome characteristics and Codon usage biases of Camellia rubituberculata. Seed. 2002;41(12):19–26. https://doiorg.publicaciones.saludcastillayleon.es/10.16590/j.cnki.1001-4705.2022.12.019.

    Article  CAS  Google Scholar 

  37. Wright F. The effective number of codons used in a gene. Gene. 1990;87(1):23–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/0378-1119(90)90491-9.

    Article  PubMed  CAS  Google Scholar 

  38. Beier S, Thiel T, Munch T, Scholz U, Mascher M. MISA-web: a web server for microsatellite prediction. Bioinformatics. 2017;33(16):2583–5. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/bioinformatics/btx198.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  39. Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001;29(22):4633–42. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/nar/29.22.4633.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  40. Mower JP. The PREP suite: predictive RNA editors for plant mitochondrial genes, chloroplast genes and user-defined alignments. Nucleic Acids Res. 2009;37:W253–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/nar/gkp337.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  41. Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19:1639–45. https://doiorg.publicaciones.saludcastillayleon.es/10.1101/gr.092759.109.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  42. Zhang D, Gao FL, Jakovlić I, Zou H, Zhang J, Li WX, Wang GT. Mol Ecol Resour. 2019;20(1):348–55. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/1755-0998.13096. PhyloSuite: An integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies.

  43. Letunic I, Bork P. Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49(W1):293–6. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/nar/gkab301.

    Article  CAS  Google Scholar 

  44. Chen Y, He XX, Priyadarshani SN, Ye Y, Shi C, Ye KZ, Zhao Q, Luo ZQ, Deng F, Cao L, Zheng P, Mohammad A, Qin Y. Assembly and comparative analysis of the complete mitochondrial genome of Suaeda Glauca. BMC Genomics. 2021;22(1):15–25. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12864-021-07490-9.

    Article  CAS  Google Scholar 

  45. Makarenko MS, Omelchenko DO, Usatov AV, Gavrilova VA. The insights into mitochondrial genomes of Sunflowers. Plants. 2021;10:1774. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/plants10091774.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  46. Li YY, Liu YY, Zeng X, Wu P, Li QM, Guo SX, Hao ZG. Complete mitochondrial genome of Angelica Dahurica and its implications on evolutionary analysis of complex mitochondrial genome architecture in Apiaceae, Front. Plant Sci. 2024;15:1367299. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fpls.2024.1367299.

    Article  Google Scholar 

  47. Sun Z, Wu Y, Fan P, Guo D, Zhang S, Song C. Assembly and analysis of the mitochondrial genome of Prunella vulgaris, Front. Plant Sci. 2023;14:1237822. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fpls.2023.1237822.

    Article  Google Scholar 

  48. Clifton SW, Minx P, Fauron CM, Gibson M, Allen JM, Sun H, Thompson M, Barbazuk WB, Kanuganti S. Sequence and comparative analysis of the maize NB mitochondrial genome. Plant Physiol. 2004;136(3):3486–503. https://doiorg.publicaciones.saludcastillayleon.es/10.1104/pp.104.044602.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  49. Ogihara Y, Yamazaki Y, Murai K, Kanno A, Terachi T, Shiina T, Miyashita N, Nasuda S, Nakamura C, Mori N. Structural dynamics of cereal mitochondrial genomes as revealed by complete nucleotide sequencing of the wheat mitochondrial genome. Nucleic Acids Res. 2005;33(19):6235–50. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/nar/gki925.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  50. Christensen AC. Plant mitochondrial genome evolution can be explained by dna repair mechanisms. Genome Biol Evol. 2013;19(6):79–86. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/gbe/evt069.

    Article  CAS  Google Scholar 

  51. Guo WH, Zhu A, Fan WS, Mower JP. Complete mitochondrial genomes from the ferns Ophioglossum californicum and Psilotum nudum are highly repetitive with the largest organellar introns. New Phytol. 2016;213(9):18–31. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/nph.14135.

    Article  CAS  Google Scholar 

  52. Dong SS, Zhao CX, Chen F, Li YH, Zhang SZ, Wu H, Zhang LS, Liu Y. The complete mitochondrial genome of the early flowering plant Nymphaea colorata is highly repetitive with low recombination. BMC Genomics. 2018;19(1):1–12. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12864-018-4991-4.

    Article  CAS  Google Scholar 

  53. Lin GM, Xiang P, Sampurna BP, Hsiao CD. Genome skimming yields the complete mitogenome of Chromodoris Annae (Mollusca: Chromodorididae). Mitochondrial DNA Part B. 2017;2(2):09–10. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/23802359.2017.1372715.

    Article  Google Scholar 

  54. Liao XF, Zhao YH, Kong XJ, Khan A, Zhou B, Liu DM, Kashif MH, Chen P, Wang H, Zhou RY. Complete sequence of kenaf (Hibiscus cannabinus) mitochondrial genome and comparative analysis with the mitochondrial genomes of other plants. Sci Rep. 2018;8(1):79. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41598-018-30297-w.

    Article  CAS  Google Scholar 

  55. Adams KL, Qiu YL, Stoutemyer M, Palmer JD. Punctuated evolution of mitochondrial gene content: high and variable rates of mitochondrial gene loss and transfer to the nucleus during angiosperm evolution. Proc Natl Acad Sci USA. 2002;99(15):9905–12. https://doiorg.publicaciones.saludcastillayleon.es/10.1073/pnas.042694899.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  56. Wu P, Xiao W, Luo Y, Xiong Z, Chen X, He J, Sha A, Gui M, Li Q. Comprehensive analysis of codon bias in 13 Ganoderma mitochondrial genomes, Front. Microbiol. 2023;14:1170790. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fmicb.2023.1170790.

    Article  Google Scholar 

  57. Zhang J, Liu G, Wei J. Assembly and comparative analysis of the first complete mitochondrial genome of Setaria italica. Planta. 2024;260(23):1–23. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00425-024-04386-2.

    Article  PubMed  CAS  Google Scholar 

  58. Li X, Han Q, Li M, Luo Q, Zhu S, Zheng Y, Tan G. Complete mitochondrial genome sequence, characteristics, and phylogenetic analysis of Oenanthe javanica. Agronomy. 2023;13:2103. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/agronomy13082103.

    Article  CAS  Google Scholar 

  59. Zhang K, Wang Y, Zhang Y, Shan XF. Codon usage characterization and phylogenetic analysis of the mitochondrial genome in Hemerocallis citrina. BMC Genom Data. 2024;25(6). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12863-024-01191-4.

  60. Ruzicka J, Novak J. Mitochondrial genome variation between different accessions of Matricaria chamomilla L. (Asteraceae) based on SNP mutation analysis. Genet Resour Crop Evol. 2020;67:853–64. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s10722-020-00881-z.

    Article  CAS  Google Scholar 

  61. Li XY, Wang XY, Journal of North China University of Science and Technology(Natural Science Edition). Effects of repetitive sequences to evolution of Plant Genome size,. 2021; 43 (4): 98–107, https://doiorg.publicaciones.saludcastillayleon.es/10.3969/j.issn.2095-2716.2021.04.015

  62. Zhao CX, Zhu RL, Liu Y. Simple sequence repeats in bryophyte mitochondrial genomes. Mitochondrial DNA Part A. 2016;27(1):191–7. https://doiorg.publicaciones.saludcastillayleon.es/10.3109/19401736.2014.880889.

    Article  CAS  Google Scholar 

  63. Shen JS, Li XQ, Li MZ, Cheng HF, Huang XL, Jin SH. Characterization, comparative phylogenetic, and gene transfer analyses of organelle genomes of Rhododendron × pulchrum. Front Plant Sci. 2022;13:969765. https://doiorg.publicaciones.saludcastillayleon.es/10.3389/fpls.2022.969765.

    Article  PubMed  PubMed Central  Google Scholar 

  64. Bai Y, Ye L, Kang Y, Wang H. Genome Survey and SSR Analysis of Camellia Nitidissima Chi (Theaceae). Genet Res. 2022;8:5417970. https://doiorg.publicaciones.saludcastillayleon.es/10.1155/2022/5417970.

    Article  CAS  Google Scholar 

  65. Bi CW, Paterson AH, Wang XL, Wu YQ, Wu DY, Qu YS, Jiang AN, Ye QL, Ye N. Analysis of the complete mitochondrial genome sequence of the diploid cotton gossypium raimondii by comparative genomics approaches. Biomed Res Int. 2016;5040598. https://doiorg.publicaciones.saludcastillayleon.es/10.1155/2016/5040598.

  66. Michael U, Joachim RM, Petra B, Brennicke A. The mitochondrial genome of Arabidopsis thaliana contains 57 genes in 366,924 nucleotides. Nat Genet. 1997;15(1):57–61. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/ng0197-57.

    Article  Google Scholar 

  67. Bentolila S, Alfonso AA, Hanson MR. A pentatricopeptide repeat-containing gene restores fertility to cytoplasmic male-sterile plants. Proc Natl Acad Sci USA. 2002;99(16):10887–92. https://doiorg.publicaciones.saludcastillayleon.es/10.1073/pnas.102301599.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  68. Hirokazu H. The complete nucleotide sequence and RNA editing content of the mitochondrial genome of rapeseed (Brassica napus L.): comparative analysis of the mitochondrial genomes of rapeseed and Arabidopsis thaliana. Nucleic Acids Res. 2003;20:5907. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/nar/gkg795.

    Article  Google Scholar 

  69. Chang SX, Wang YK, Lu JJ, Gai JY, Li JJ, Chu P, Guan RZ, Zhao TJ. The mitochondrial genome of soybean reveals complex genome structures and gene evolution at intercellular and phylogenetic levels. PLoS ONE. 2013;8(5):66. https://doiorg.publicaciones.saludcastillayleon.es/10.1371/journal.pone.0056502.

    Article  CAS  Google Scholar 

  70. Huang CY, Ayliffe MA, Timmis JN. Direct measurement of the transfer rate of chloroplast DNA into the nucleus. Nature. 2003;422(6927):69–78. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/nature01435.

    Article  CAS  Google Scholar 

  71. Mower JP, Sloan DB, Alverson AJ. Plant mitochondrial genome diversity: the Genomics Revolution. Plant Genome Divers. 2012;123–44. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/978-3-7091-1130-7_9.

  72. Joyce PB, Gray MW. Chloroplast-like transfer RNA genes expressed in wheat mitochondria. Nucleic Acids Rescarch. 1989;17(14):5461–76. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/nar/17.14.5461.

    Article  CAS  Google Scholar 

  73. Miyata S, Nakazono M, Hirai A. Transcription of plastid-derived tRNA genes in rice mitochondria. Curr Genet. 1998;34(3):216–20. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s002940050389.

    Article  PubMed  CAS  Google Scholar 

  74. Allison LA, Simon LD, Maliga P. Deletion of rpoB reveals a second distinct transcription system in plastids of higher plants. EMBO Jourmal. 1996;15(11):2802–9.

    Article  CAS  Google Scholar 

  75. Leister D. Origin, evolution and genetic effects of nuclear insertions of organelle DNA. Trends Genet. 2005;21(12):655–63. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.tig.2005.09.004.

    Article  PubMed  CAS  Google Scholar 

  76. Iorizzo M, Senalik D, Szklarczyk M, Grzebelus D, Spooner D, Simon P. De novo assembly of the carrot mitochondrial genome using next generation sequencing of whole genomic DNA provides first evidence of DNA transfer into an angiosperm plastid genome. BMC Plant Biol. 2012;12:61. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/1471-2229-12-61.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  77. Goremykin VV, Salamini F, Velasco R, Viola R. Mitochondrial DNA of Vitis vinifera and the issue of rampant horizontal gene transfer. Mol Biol Evol. 2009;26(1):99–110. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/molbev/msn226.

    Article  PubMed  CAS  Google Scholar 

  78. Duminil J, Besnard G. Utility of the mitochondrial genome in plant taxonomic studies. Methods Mol Biol. 2021;2222:107–18. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/978-1-0716-0997-2_6.

    Article  PubMed  CAS  Google Scholar 

  79. Christensen AC. Plant mitochondria are a riddle wrapped in a mystery inside an Enigma. J Mol Evol. 2021;89(3):151–6. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00239-020-09980-y.

    Article  PubMed  CAS  Google Scholar 

  80. Kozik A, Rowan BA, Lavelle D, Berke L, Schranz ME, Michelmore RW, Christensen AC. The alternative reality of plant mitochondrial DNA: one ring does not rule them all. PLoS Genet. 2019;15(8):e1008373. https://doiorg.publicaciones.saludcastillayleon.es/10.1371/journal.pgen.1008373.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  81. Liu BW, Ge JH, Zhu H, Li SY, Dong XQ, Liu GX, Hu ZY. Complete chloroplast genome of the green alga Chaetophora lobata (Chlorophyceae, Chlorophyta): morphological features and phylogenetic and comparative analysis. Eur Joumal Phycol. 2021;18(6):1–12. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/09670262.2021.1923809.

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We thank the editor and the anonymous reviewers for their careful comments and suggestions on the manuscript. We also thank the Jiangxi Provincial Key Laboratory of Improved Variety Breeding and Efficient Utilization of Native Tree Species (2024SSY04093) for providing the platform conditions and financial support.

Funding

This study was supported by the Guizhou Provincial Basic Research Program (Natural Science) 2022 (072) and the National Natural Science Foundation of China (31960043, 32400179).

Author information

Authors and Affiliations

Authors

Contributions

Z.L. conceived the study. Z.L. and Z.H.R. were responsible for analyzing and writing this manuscript; X.X. and M.T.A. were responsible for collecting and identifying the material; C.Y. and J.X. helped with the analysis of the data; and M.T. and Z.L. revised the manuscript. All authors reviewed and approved the final manuscript.

Corresponding authors

Correspondence to Ming Tang or Mingtai An.

Ethics declarations

Ethics approval and consent to participate

All materials used in this study comply with international and national legal standards. The collected species material does not pose a threat to other species, and the collection of the species is recognized by the relevant authorities.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1: Table S1 Classification of mitochondrial genes in four sect. Chrysantha plants.

12870_2024_5673_MOESM2_ESM.xlsx

Supplementary Material 2: Table S2 GC content and effective codon number of mitochondrial genome codons in four sect. Chrysantha plants.

12870_2024_5673_MOESM3_ESM.xls

Supplementary Material 3: Table S3 RSCU values of mitochondrial genome codons in four species of sect. Chrysantha plants.

Supplementary Material 4: Table S4 Mitochondrial genome horizontal gene transfer in four sect. Chrysantha plants.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Z., Ran, Z., Xiao, X. et al. Comparative analysis of the whole mitochondrial genomes of four species in sect. Chrysantha (Camellia L.), endemic taxa in China. BMC Plant Biol 24, 955 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12870-024-05673-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12870-024-05673-6

Keywords