
Genome Resource of Colletotrichum spaethianum, the Causal Agent of Leaf Anthracnose in Polygonatum falcatum
- Yuniar Devi Utami
- Kei Hiruma †
- Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Tokyo 153-8902 Japan
Colletotrichum spaethianum is a fungal phytopathogen that causes leaf anthracnose and is phylogenetically related to the root endophyte C. tofieldiae within the spaethianum species complex of Colletotrichum spp. However, because this fungus has been little studied, its pathogenesis and host interactions remain elusive. Here, for the first time, we characterized a draft genome sequence of C. spaethianum MAFF 239500, which causes leaf anthracnose in Polygonatum falcatum A. Gray in Japan. This study provides insight into the genomic potential of the species and serves as a resource for better understanding the diverging traits between related phytopathogenic and endophytic fungi.
The ascomycetes of the genus Colletotrichum are known as phytopathogens that cause anthracnose diseases in various economically important plants worldwide (Dean et al. 2012). C. spaethianum has been identified as a cause of this disease on plants from the genus Polygonatum, such as P. falcatum (also called narukoyuri, a popular market flower in Japan), P. odoratum (an ornamental flowering plant), and P. cyrtonema (a traditional Chinese herb) (Liu et al. 2020; Ma et al. 2021; Tomioka et al. 2008). For example, C. spaethianum MAFF 239500 (originally identified as C. dematium) causes chlorotic to brown spots on P. falcatum leaves at the initial infection stage, followed by severe lesions that eventually lead to leaf blight and defoliation (Sato et al. 2015; Tomioka et al. 2008). Interestingly, C. spaethianum is phylogenetically close to C. tofieldiae, a beneficial endophyte that promotes the growth of the model plant Arabidopsis thaliana by transferring phosphorus to the host under phosphate-limiting conditions (Hiruma et al. 2016). These fungal species, along with other phytopathogenic species such as C. incanum and C. liriopes, form the spaethianum species complex within the phylogenetic tree of the genus Colletotrichum (Talhinhas and Baroncelli 2021). However, the molecular bases underlying the lifestyle differences among these closely related species remain unknown. As a first step toward elucidating these mechanisms, we present the first draft genome and annotation of C. spaethianum, which causes disease on P. falcatum.
Fungal colonies of C. spaethianum MAFF 239500 were isolated from diseased leaves of narukoyuri (P. falcatum A. Gray) grown in open fields in Kagawa Prefecture, Japan (Tomioka et al. 2008). Mycelia for DNA extraction were obtained by culturing agar fragments containing fungal colonies in liquid Mathur's medium on a rotary shaker at 100 rpm and 25°C for 3 days. Genomic DNA was subsequently extracted by the cetyltrimethylammonium bromide method as described by Damm et al. (2008). Equimolar amounts of the DNA samples were sent for long-read sequencing on a PacBio RSII system at Macrogen Corp., Japan. The DNA samples were also used to identify the ITS, ACT, CHS-1, TUB2, HIS3, and GAPDH genes by PCR for phylogenetic assessment, as described by Cannon et al. (2012). The phylogenetic position of MAFF 239500 as C. spaethianum within the spaethianum species complex was confirmed (Supplementary Fig. S1) by maximum-parsimony analysis using PAUP v.1.3.3 with a heuristic search option (Swofford 2003).
Genome assembly and annotation statistics are summarized in Table 1. Briefly, pre-assembly filtering and de novo assembly of the generated reads were performed using Falcon v.2.1.4 (Chin et al. 2016), resulting in a total sequence of 50.9 Mbp in 84 scaffolds. Draft-genome completeness was measured using BUSCO v.5.1.2 (Simão et al. 2015) with the glomerellales_odb10 lineage dataset, giving 92.6% genome completeness. Repetitive regions, detected using Dfam TETools containing RepeatMasker v.4.1.2-p1 with fungi as the species option (Storer et al. 2021), made up 1.2% of the genome. Sequence identity to phylogenetically close species was explored by whole-genome alignment in Mauve using the MCM algorithm (Darling et al. 2004). The longest locally colinear block formed between C. spaethianum and C. tofieldiae (0861 strain, GenBank accession: GCA_001625265.1) was 1.26 Mb with 71.2% identity; with C. liriopes (A2 strain, GenBank accession: GCA_015832465.1), it was 0.92 Mb with 72% identity; and with C. incanum (MAFF238704, GenBank accession: GCA_001625285.1), it was 0.72 Mb with 66.4% identity.
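Summary figures such as total assembly length and scaffold count can be recomputed directly from the deposited FASTA file. The following is a minimal Python sketch of that calculation, not the procedure used in this study: the function names are hypothetical, the toy sequences are invented for illustration, and N50 is included only as a commonly reported statistic.

```python
from io import StringIO


def read_fasta_lengths(handle):
    """Return the length of every sequence in a FASTA text stream."""
    lengths = []
    current = None  # length of the record being read, None before the first header
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if current is not None:
                lengths.append(current)
            current = 0
        elif current is not None:
            current += len(line)
    if current is not None:
        lengths.append(current)
    return lengths


def n50(lengths):
    """Length of the sequence at which the cumulative sum, taken from the
    longest sequence downward, first reaches half of the total length."""
    half = sum(lengths) / 2
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half:
            return length
    return 0


def assembly_summary(handle):
    """Collect basic assembly statistics from a FASTA text stream."""
    lengths = read_fasta_lengths(handle)
    return {
        "sequences": len(lengths),
        "total_bp": sum(lengths),
        "longest_bp": max(lengths, default=0),
        "n50_bp": n50(lengths),
    }


# Toy input (invented): four sequences of 8, 6, 4, and 2 bp.
toy = StringIO(">s1\nACGT\nACGT\n>s2\nACGTAC\n>s3\nACGT\n>s4\nAC\n")
summary = assembly_summary(toy)
print(summary)
```

To apply it to a real assembly, pass an open file handle, for example `assembly_summary(open(path))`, where `path` points to a local copy of the contigs.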
Table 1. Summary of genome sequencing, assembly, and annotation statistics

Protein and coding sequence (CDS) data of C. tofieldiae (Hacquard et al. 2016) were used as training data for SNAP v.2.31.8 (Korf 2004). The resulting gene training model from SNAP was used for gene prediction with MAKER v.2.31.8 (Cantarel et al. 2008), resulting in 12,842 CDS. Subsequent functional annotation was performed using Protein BLAST+ v.2.6.0 (Camacho et al. 2009) against the UniProt Swiss-Prot database (June 2018 version). Transfer RNA (tRNA) and ribosomal RNA (rRNA) were detected and annotated using tRNAscan-SE v.2.0.7 (Lowe and Chan 2016) and barrnap v.0.9 (Seemann 2018), resulting in 355 tRNAs and 65 rRNAs, respectively. The secretome was predicted as described by Crestana et al. (2021) using SignalP v.5.0 (Almagro Armenteros et al. 2019), TMHMM v.2.0 (Krogh et al. 2001), and ProSite PS-Scan against motif PS00014 (de Castro et al. 2006). Among 1,245 candidate secreted proteins, 428 candidate effector proteins were detected using EffectorP v.3.0 (Sperschneider and Dodds 2022). Additionally, 629 carbohydrate-active enzymes (CAZymes) were identified with the hmmscan function of HMMER v.3.1b2 (Eddy 2011) using dbCAN HMMdb release 9.0 (Zhang et al. 2018) as the database. Of these CAZymes, 127 genes are categorized under the ligninolytic and lytic polysaccharide mono-oxygenases family, 15 genes under the noncatalytic carbohydrate-binding modules family, 54 genes under the carbohydrate esterases family, 308 genes under the glycoside hydrolases family, 91 genes under the glycosyltransferases family, and 34 genes under the polysaccharide lyases family. We also detected 42 gene clusters as candidate secondary metabolite clusters using antiSMASH v.6.0.1 (Blin et al. 2021). The numbers of these effectors, CAZymes, and secondary metabolite clusters are comparable with those of previously sequenced species in the spaethianum species complex that have higher genome completeness (Supplementary Table S1).
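The counts reported above can be cross-checked with simple arithmetic. The short Python sketch below uses only the numbers stated in this paragraph; the two-to-three-letter class labels are the standard CAZy abbreviations, and the variable and function names are chosen here for illustration.

```python
# CAZyme counts per class, as reported in the text.
cazyme_counts = {
    "AA": 127,   # auxiliary activities (ligninolytic enzymes and LPMOs)
    "CBM": 15,   # noncatalytic carbohydrate-binding modules
    "CE": 54,    # carbohydrate esterases
    "GH": 308,   # glycoside hydrolases
    "GT": 91,    # glycosyltransferases
    "PL": 34,    # polysaccharide lyases
}
reported_cazymes = 629

# Secretome and effector counts, as reported in the text.
candidate_secreted = 1245
candidate_effectors = 428


def percent(part, whole):
    """Share of `part` in `whole`, as a percentage rounded to one decimal."""
    return round(100 * part / whole, 1)


cazyme_total = sum(cazyme_counts.values())
class_shares = {name: percent(n, cazyme_total) for name, n in cazyme_counts.items()}
effector_share = percent(candidate_effectors, candidate_secreted)

print("CAZyme classes sum to reported total:", cazyme_total == reported_cazymes)
print("Share of each CAZyme class (%):", class_shares)
print("Effector candidates among secreted proteins (%):", effector_share)
```

The six class counts add up to the reported 629 CAZymes, with glycoside hydrolases as the largest class, and roughly one third of the candidate secreted proteins are predicted effectors.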
To the best of our knowledge, this is the first draft genome sequence of C. spaethianum, a pathogenic fungus causing leaf anthracnose in P. falcatum. We expect this draft genome to provide valuable information for future comparative studies of the spaethianum species complex, which includes both pathogenic and endophytic Colletotrichum spp., and to broaden our understanding of molecular interactions between plants and fungi.
Data and Material Availability
The assembled contigs and annotations generated in this study were deposited in the DNA Data Bank of Japan under BioProject PRJDB11870 and BioSample SAMD00334500, with contig accession numbers BQXU01000001 to BQXU01000084. The fungal isolate is available from NARO Genebank, Japan, as isolate MAFF 239500.
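For scripted retrieval, the accession range given above can be expanded into an explicit list. This is a small Python sketch: the function name and its parameters are hypothetical, and it assumes the accessions are numbered consecutively with a four-letter prefix, a two-digit version, and a six-digit serial, as the two end points suggest.

```python
def contig_accessions(prefix="BQXU", version="01", first=1, last=84, width=6):
    """Expand an accession range into a list of accession strings.

    Assumes consecutive numbering between `first` and `last` (inclusive).
    """
    return [f"{prefix}{version}{i:0{width}d}" for i in range(first, last + 1)]


accessions = contig_accessions()
print(len(accessions), accessions[0], accessions[-1])
```

The list has 84 entries, one per contig accession in the stated range, which matches the 84 scaffolds reported for the assembly.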
Note Added in Final Publication
Changes were made to this article just prior to final publication to reflect new information. Previous information for C. spaethianum reported here had been switched with that of a closely related Colletotrichum species (C. liriopes). The whole-genome sequencing was carried out at the same time with the same procedure.
The author(s) declare no conflict of interest.
Literature Cited
- Almagro Armenteros et al. 2019. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat. Biotechnol. 37:420-423. https://doi.org/10.1038/s41587-019-0036-z
- Blin et al. 2021. antiSMASH 6.0: Improving cluster detection and comparison capabilities. Nucleic Acids Res. 49:W29-W35. https://doi.org/10.1093/nar/gkab335
- Camacho et al. 2009. BLAST+: Architecture and applications. BMC Bioinf. 10:421. https://doi.org/10.1186/1471-2105-10-421
- Cannon et al. 2012. Colletotrichum – current status and future directions. Stud. Mycol. 73:181-213. https://doi.org/10.3114/sim0014
- Cantarel et al. 2008. MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 18:188-196. https://doi.org/10.1101/gr.6743907
- Chin et al. 2016. Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods 13:1050-1054. https://doi.org/10.1038/nmeth.4035
- Crestana et al. 2021. Complete chromosome-scale genome sequence resource for Sporisorium panici-leucophaei, the causal agent of sourgrass smut disease. Mol. Plant-Microbe Interact. 34:448-452. https://doi.org/10.1094/MPMI-08-20-0218-A
- Damm et al. 2008. Novel Phaeoacremonium species associated with necrotic wood of Prunus trees. Persoonia 20:87-102. https://doi.org/10.3767/003158508X324227
- Darling et al. 2004. Mauve: Multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 14:1394-1403.
- Dean et al. 2012. The top 10 fungal pathogens in molecular plant pathology. Mol. Plant Pathol. 13:414-430. https://doi.org/10.1111/j.1364-3703.2011.00783.x
- de Castro et al. 2006. ScanProsite: Detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins. Nucleic Acids Res. 34:W362-W365.
- Eddy 2011. Accelerated profile HMM searches. PLoS Comput. Biol. 7:e1002195. https://doi.org/10.1371/journal.pcbi.1002195
- Hacquard et al. 2016. Survival trade-offs in plant roots during colonization by closely related beneficial and pathogenic fungi. Nat. Commun. 7:11362.
- Hiruma et al. 2016. Root endophyte Colletotrichum tofieldiae confers plant fitness benefits that are phosphate status dependent. Cell 165:464-474. https://doi.org/10.1016/j.cell.2016.02.028
- Korf 2004. Gene finding in novel genomes. BMC Bioinf. 5:59. https://doi.org/10.1186/1471-2105-5-59
- Krogh et al. 2001. Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. J. Mol. Biol. 305:567-580. https://doi.org/10.1006/jmbi.2000.4315
- Liu et al. 2020. Leaf spot of Polygonatum odoratum caused by Colletotrichum spaethianum. J. Gen. Plant Pathol. 86:157-161. https://doi.org/10.1007/s10327-019-00903-4
- Lowe and Chan 2016. tRNAscan-SE On-line: Integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 44:W54-W57. https://doi.org/10.1093/nar/gkw413
- Ma et al. 2021. Colletotrichum spaethianum causing anthracnose on Polygonatum cyrtonema Hua in Anhui Province, China. Plant Dis. 105:509. https://doi.org/10.1094/PDIS-04-20-0778-PDN
- Sato et al. 2015. Anthracnose fungi with curved conidia, Colletotrichum spp. belonging to ribosomal groups 9-13, and their host ranges in Japan. JARQ 49:351-362. https://doi.org/10.6090/jarq.49.351
- Seemann 2018. barrnap 0.9: Rapid ribosomal RNA prediction. https://github.com/tseemann/barrnap
- Simão et al. 2015. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210-3212. https://doi.org/10.1093/bioinformatics/btv351
- Sperschneider and Dodds 2022. EffectorP 3.0: Prediction of apoplastic and cytoplasmic effectors in fungi and oomycetes. Mol. Plant-Microbe Interact. 35:146-156. https://doi.org/10.1094/MPMI-08-21-0201-R
- Storer et al. 2021. The Dfam community resource of transposable element families, sequence models, and genome annotations. Mob. DNA 12:2. https://doi.org/10.1186/s13100-020-00230-y
- Swofford 2003. PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. Sinauer Associates, Sunderland, MA, U.S.A. https://doi.org/10.1111/j.0014-3820.2002.tb00191.x
- Talhinhas and Baroncelli 2021. Colletotrichum species and complexes: Geographic distribution, host range and conservation status. Fungal Divers. 110:109-198. https://doi.org/10.1007/s13225-021-00491-9
- Tomioka et al. 2008. Anthracnose of Polygonatum falcatum caused by Colletotrichum dematium. J. Gen. Plant Pathol. 74:402-404. https://doi.org/10.1007/s10327-008-0112-6
- Zhang et al. 2018. dbCAN2: A meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 46:W95-W101.
Funding: This work was funded by the Japan Society for the Promotion of Science KAKENHI (grants JP20H02986 and JP21H05150) and Japan Science and Technology Agency (grant JPMJFR200A).
Copyright © 2022 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.