
A High-Quality Genome Resource for the Oak Wilt Pathogen Bretziella fagacearum
- Karandeep Chahal1
- Mohit Mahey1
- Carmen Medina-Mora1
- Steven Ahrendt2
- Robert Riley2
- Anna Lipzen2
- Juying Yan2
- Emily Savage2
- Maxim Koriabine2
- Vivian Ng2
- Igor V. Grigoriev2 3
- Thomas C. Harrington4
- Eric L. Patterson1
- Timothy D. Miles1 †
- Monique L. Sakalidis1 5 †
- 1Department of Plant, Soil, and Microbial Sciences. Michigan State University, East Lansing, MI 48824, U.S.A.
- 2United States Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, U.S.A.
- 3Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA 94720, U.S.A.
- 4Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA 50011, U.S.A.
- 5Department of Primary Industries and Regional Development, Perth, WA 6151, Australia
Abstract
Bretziella fagacearum is a destructive vascular wilt fungal pathogen affecting oaks in the United States and Canada. The epidemiology of oak wilt varies across different geographical locations, indicating the need to investigate the population dynamics of B. fagacearum to discern potential differences in its genotypes using genomic tools. A good-quality genome of B. fagacearum is crucial as a reference for population genetics studies. Here, we report a high-quality genome of B. fagacearum isolate C519. The genome assembly consists of nine scaffolds, corresponding to the nine chromosomes, totaling 27,072,536 bp with a GC content of 47.29%. It is predicted to encode 7,554 proteins, which are annotated using RNA sequencing data from the same isolate. The circular mitochondrial genome consists of a chromosome of 174,403 bp with a GC content of 28.59% and contains 54 open reading frames, including 14 core genes, 28 tRNAs, 4 rRNAs, and 8 hypothetical proteins. The reference genome can enhance the understanding of molecular epidemiology and biology of B. fagacearum, aiding in identifying genetic variations and pathogen–host interactions and developing diagnostic tools and disease management strategies.
The author(s) have dedicated the work to the public domain under the Creative Commons CC0 “No Rights Reserved” license by waiving all of his or her rights to the work worldwide under copyright law, including all related and neighboring rights, to the extent allowed by law, 2025.
Genome Announcement
Oak wilt, caused by the fungal pathogen Bretziella fagacearum (Bretz) Z.W. de Beer, Marinc., T.A. Duong & M.J. Wingf., (formerly Ceratocystis fagacearum [Bretz]), is a significant disease affecting oaks (Quercus spp.) in the United States (de Beer et al. 2017; Henry 1944). B. fagacearum spreads through belowground networks of grafted roots among neighboring trees, thereby forming centers or pockets of diseased host species (Blaedow and Juzwik 2010; Chahal et al. 2024). The pathogen is also transmitted aboveground by sap or nitidulid beetles (Coleoptera: Nitidulid) and oak bark beetles (Pityophthorus spp.) (Gibbs and French 1980; Rexrode and Jones 1970). B. fagacearum infections can disrupt both urban and natural forest ecosystems if not detected and managed in a timely manner. Currently, oak wilt has been confirmed in 24 states in the United States, and widespread epidemics are reported in the Great Lakes region and Texas (Appel 1986; Chahal et al. 2019b, 2021; Juzwik et al. 2011; Rexrode and Lincoln 1965). Oak wilt continues to spread, expanding into new geographical locations where it has not been previously reported (Gauthier et al. 2023; McLaughlin et al. 2022). Detection of oak wilt in Ontario, Canada, in June 2023 marked the first identification of B. fagacearum outside the United States (Canadian Food Inspection Agency 2023).
The epidemiology of oak wilt varies between the Great Lakes region and Texas disease epidemics. In the Great Lakes region, red oaks (section Lobatae) are prevalent and highly susceptible compared with white oaks (section Quercus) that exhibit moderate to high levels of resistance (Appel 1995; Juzwik et al. 2011). In the Midwestern United States, B. fagacearum infections have also been reported on orchard-grown chestnuts (Bretz and Long 1950; Chahal et al. 2024). In Texas, the widespread host Texas live oak (Quercus fusiformis, section Virentes) shows intermediate susceptibility compared with red and white oaks (Appel 1986, 1995). However, clonal propagation of Texas live oak through sprouts originating from a shared root system facilitates the spread of B. fagacearum, leading to expanding pockets of mortality (Appel 1995). The outcomes of oak wilt occurrences observed in Texas live oak populations have not been observed in any other host or state within its distribution range.
The sporulation structures of B. fagacearum, known as mycelial mats, predominantly occur on red oaks, rarely on white oaks and have never been reported on Texas live oak (Appel 1995; Juzwik et al. 2011). In the Great Lakes region, oak wilt incidence is high due to the homogenous stands of highly susceptible red oaks, prevalent fungal sporulation, sandy soils that favor root grafting, and the presence of active insect vectors (Juzwik et al. 2011). In the Great Lakes region, insect-vector activity and mycelial mat formation peak during the growing season, particularly from April to June (Chahal et al. 2021; Curl 1955; Juzwik et al. 2011; Morris et al. 2024). No mycelial mat formation has been reported in Michigan from mid-November through mid-April (Chahal et al. 2021), whereas in Texas, insect vectors remain active year-round, coinciding with the continuous formation of fungal mycelial mats (Appel 1995). In Texas, B. fagacearum infection centers typically span several hectares and expand at rates up to 50 m/year (Appel 2008). This contrasts with the slower growth and smaller sizes of foci in deciduous red oaks in the Great Lakes region, expanding at rates of 12 m/year in Michigan and 1.9 to 7.6 m/year in Minnesota (Bruhn et al. 1991; Gearman and Blinnikov 2019). Human-mediated activities, such as poorly timed pruning of oaks and the long-distance transmission of infected and mycelial mat-bearing firewood, contribute to vector-mediated infections across the distribution range of oak wilt (Gearman and Blinnikov 2019). Factors contributing to differences in oak wilt epidemiology include the land cover and distribution of different host species, climate, soil type, human population density (Gearman and Blinnikov 2019), and potentially the genotypes of B. fagacearum. The diversity of B. fagacearum genotypes within geographically different populations may contribute to observed variations in disease incidence and epidemiology (Chahal 2024).
The aim of this study was to generate a high-quality genome sequence of B. fagacearum and annotate it using RNA sequencing data from the same isolate, providing a reference genome for future studies. B. fagacearum isolate C519 was isolated from infected Q. rubra in Sherburne County, MN, in 1992. The isolate was confirmed as B. fagacearum using real-time PCR (Bourgault et al. 2022) and by PCR amplifying and sequencing the 60S, LSU, and MCM7 gene regions (de Beer et al. 2017). Mycelium was grown on potato dextrose broth and harvested following the established protocol (Chahal et al. 2019a, 2022). Genomic DNA was extracted from harvested mycelium using a phenol-chloroform protocol described in Parada-Rojas and Quesada-Ocampo (2018); 5 µg of genomic DNA was sheared to >10 kb using Covaris g-Tubes. The sheared DNA was treated with exonuclease to remove single-stranded ends and a DNA damage repair mix, followed by end repair and ligation of blunt adapters using the SMRTbell Template Prep Kit 1.0 (Pacific Biosciences). The library was purified with AMPure PB beads. PacBio sequencing primer was annealed to the SMRTbell template library, and sequencing polymerase was bound using the Sequel II Binding Kit 1.0. The prepared SMRTbell template libraries were sequenced on a Pacific Biosciences Sequel II sequencer using 8M v1 SMRT cells and version 1.0 sequencing chemistry with 1 × 900 min sequencing movie run times. Sequence data were processed with the JGI QC pipeline to remove artifacts. The genome was assembled with Falcon version pb-assembly = 0.0.2|falcon-kit = 1.2.3|pypeflow = 2.1.0 (Chin et al. 2016), improved with FinisherSC version 2.1 (Lam et al. 2015), and polished with Arrow version SMRT Link v7.0.1.66975. Contigs shorter than 1,000 bp were excluded, and those identified as contaminants were removed. Contaminants were detected by performing basic local alignment search tool (BLAST) searches of the contigs separately against bacterial/viral and fungal UniProt accessions. Contigs with a higher number of bacterial/viral hits in length compared with fungal hits were discarded. To better assemble B. fagacearum, which had substantially higher coverage than the contaminant, preassembled reads (preads) were subsampled using BBTools v38.79 (reformat.sh samplerate = 0.3). The subsampled reads were assembled with Flye v2.6 (flye –pacbio-corr -g 40m –asm-coverage 50) and polished with GCpp (–algorithm arrow, SMRT Link v8.0.0.80529). The genome features are presented in Circos plots (Krzywinski et al. 2009) that were created using SAMtools (Danecek et al. 2021), bedtools (Quinlan and Hall 2010) was used for data extraction, and bedtops (Neph et al. 2012) was used for converting between formats (Fig. 1). The mitochondrial genome was assembled separately from the Falcon preads using the joint genome institute tool, assemblemito.py (https://code.jgi.doe.gov/gaa/jgi-fungal-asm/-/tree/dev/bin). The preads were filtered and polished with Arrow (SMRT Link v7.0.1.66975) (Fig. 2).

FIGURE 1 Nuclear genome of Bretziella fagacearum. The Circos plot represents the following from the outer to inner track: Chromosome length (Mbp), labeled 1 through 9, with “T” representing telomere; GC-content distribution, with high GC (blue) and low GC (yellow) regions; gene density, represented as the number of genes per 10 kb of genomic region (0 to 0.006); and gene positions across the entire genome.

FIGURE 2 Mitochondrial genome of Bretziella fagacearum. A, The Circos plot represents the following from the outer to inner track: Chromosome length (Kbp), labeled as 1; GC-content distribution, with high GC (blue) and low GC (yellow) regions; gene density, represented as the number of genes per 1 Kbp of genomic region (0 to 0.005); and gene positions across the genome. B, Genome images are produced using Circos from annotated genomes. The black line represents scaffold(s), with gene models displayed clockwise along the outside. The inner graph shows average GC% across the length of the assembly, ranging from 0 to 100% inside to outside: values below 25% are in blue, while values above 25% are in green. Interior links between scaffold sections denote regions of 100% identity. “LG” denotes LAGLIDADG.
For transcriptome analysis, total RNA from harvested mycelium of B. fagacearum C519 was extracted using the EZNA Fungal RNA Mini Kit (Omega Bio-tek Inc., Norcross, GA). Plate-based RNA sample preparation was performed on the PerkinElmer Sciclone NGS robotic liquid handling system using Illumina's TruSeq Stranded mRNA HT sample prep kit, following the protocol outlined by Illumina for poly-A selection of mRNA. Total RNA starting material was 1 µg/sample, and eight cycles of PCR were used for library amplification. The prepared library was quantified using KAPA Biosystem's next-generation sequencing library qPCR kit (Roche) and run on a Roche Light Cycler 480 real-time PCR instrument. The quantified libraries were multiplexed and prepared for sequencing on the Illumina NovaSeq 6000 platform using NovaSeq XP v1.5 reagent kits and an S4 flow cell, following a 2 × 150-bp indexed run recipe. After sequencing, read artifacts (k-mer = 25 bp, 1 mismatch) were detected with BBDuk (https://sourceforge.net/projects/bbmap/). Detected artifacts were trimmed from the 3′ end of reads. General quality trimming was performed with the Phred trimming method set at Q6. Reads shorter than 25 bp or one-third of the original read length were removed, as well as RNA spike-in reads, PhiX reads, and reads containing any ambiguous characters (Ns). Filtered transcriptome sequencing (RNA-Seq) reads were assembled using Trinity v2.8.5 (Grabherr et al. 2011). RNA-Seq capture was performed by aligning a 1% subsample of RNA reads to the final DNA assembly using BBTools v38.67. The genome was annotated using the JGI Annotation pipeline (Grigoriev et al. 2014). Gene clusters of interest were searched in the clustering table available at the JGI MycoCosm portal (Grigoriev et al. 2014). Clusters were computed following the TRIBE-Markov cluster (MCL) clustering method (Enright et al. 2002) from all-versus-all BLAST analysis of the proteins in the set of organisms included in a clustering run.
Following joint genome institute genome annotation pipeline, we predicted 7,445 full length transcripts, with an average transcript length of 1, 722 bp and an average gene length of 1,928 bp. The assembled genome consisted of nine scaffolds or pseudo-chromosomes, with N50 of 3 Mbp and L50 value of 4.39 Mbp. Except for scaffold 3, all scaffolds have telomere-to-telomere assembly, starting with repeats of “ACCCTA” and ending in complementary repeats of “TAGGGT” (Fig. 1) (Diotti et al. 2021). The benchmarking universal single-copy orthologs (BUSCO) (Manni et al. 2021) analysis of the whole genome with the Ascomycota fungal group revealed a completeness of 96.7%, with 1,650 complete BUSCOs found out of 1,706 total BUSCO groups searched. Of this, 1,649 were found to be complete and single copy, one was complete and had a duplicated BUSCO, while 52 were missing, and four fragmented BUSCO were reported. The BUSCO analysis of the transcriptome revealed a completeness of 97.9%, consisting of 1,669 complete BUSCO out of a total of 1,706. Among these, there are 1,668 complete single-copy BUSCO, one complete duplicated BUSCO, and four fragmented BUSCO, with 33 BUSCO reported as missing. The mitochondrial genome was also sequenced, yielding an assembly length of 1.7 Mbp with a single scaffold. A total of 54 open reading frames were annotated, including 14 core genes, 28 tRNAs, 4 rRNAs, and 8 hypothetical proteins.
Although a draft genome of B. fagacearum is available from Wingfield et al. (2016), this genome assembly is significantly improved (Table 1). Our genome assembly is gapless, with contigs and scaffolds being synonymous and extending telomere-to-telomere in all cases except for scaffold 3, in contrast to the 2,949 contigs and 1,257 scaffolds in Wingfield et al. (2016). Moreover, this assembly includes 270 Kbp more sequence than the previous genome.
TABLE 1 Parameters of available genomes of Bretziella fagacearum

This publicly available genome resource can serve as a reference for future studies, including population and comparative genomics, to explore molecular mechanisms underlying the epidemiology and biology of B. fagacearum. Insights gained from this genome will enhance the understanding of genetic diversity, evolutionary history, and pathogenicity, aiding in the development of effective disease management strategies.
The author(s) declare no conflict of interest.
Literature Cited
- 1986. Recognition of oak wilt in live oak. J. Arboric. 12:213-218. https://doi.org/10.48044/jauf.1986.047 Google Scholar
- 1995. The oak wilt enigma: Perspectives from the Texas epidemic. Annu. Rev. Phytopathol. 33:103-118. https://doi.org/10.1146/annurev.py.33.090195.000535 CrossrefGoogle Scholar
- 2008. Oak Wilt Biology, Impact, and Host Pathogen Relationships: A Texas Perspective. Paper presented at the 2nd National Oak Wilt Symposium, Austin, TX, U.S.A. Google Scholar
- 2010. Spatial and temporal distribution of Ceratocystis fagacearum in roots and root grafts of oak wilt affected red oaks. Arboric. Urban For. 36:28-34. https://doi.org/10.48044/jauf.2010.004 CrossrefGoogle Scholar
- 2022. Benchmarking a fast and simple on-site detection assay for the oak wilt pathogen Bretziella fagacearum. Front. For. Glob. Change 5:1068135. https://doi.org/10.3389/ffgc.2022.1068135 CrossrefGoogle Scholar
- 1950. Oak wilt fungus isolated from Chinese chestnut. Plant Dis. Rep. 34:291. Google Scholar
- 1991. Probit analysis of oak wilt transmission through root grafts in red oak stands. For. Sci. 37:28-44. https://doi.org/10.1093/forestscience/37.1.28 Google Scholar
Canadian Food Inspection Agency . 2023. Oak wilt. https://inspection.canada.ca/plant-health/invasive-species/plant-diseases/oak-wilt/eng/1325624048625/1325624535106 (accessed May 2024). Google Scholar- 2019a. Assessment of alternative candidate subcortical insect vectors from walnut crowns in habitats quarantined for thousand cankers disease. Environ. Entomol. 48:882-893. https://doi.org/10.1093/ee/nvz064 CrossrefGoogle Scholar
- 2022. Differential virulence among Geosmithia morbida isolates collected across the United States occurrence range of thousand cankers disease. Front. For. Glob. Change 5:726388. https://doi.org/10.3389/ffgc.2022.726388 CrossrefGoogle Scholar
- 2021. Sporulation timing of the invasive oak wilt fungus, Bretziella fagacearum in Michigan. (Abstr.) Phytopathology 111:S2.51. https://doi.org/10.1094/PHYTO-111-10-S2.1 Google Scholar
- 2019b. Biology, epidemiology and detection of oak wilt in Michigan. (Abstr.) Phytopathology 109:S2.33. https://doi.org/10.1094/PHYTO-109-10-S2.1 Google Scholar
- 2024. Epidemiology, Biology, Molecular Detection, and Population Structure of the Oak Wilt Fungus, Bretziella fagacearum. Doctoral dissertation, Michigan State University, East Lansing, MI, U.S.A. Google Scholar
- 2024. First report of Bretziella fagacearum infecting chestnut in Michigan. Plant Dis. 108:1397. https://doi.org/10.1094/PDIS-10-23-2267-PDN LinkGoogle Scholar
- 2016. Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods 13:1050-1054. https://doi.org/10.1038/nmeth.4035 CrossrefGoogle Scholar
- 1955. Natural availability of oak wilt inocula. Ill. Nat. Hist. Surv. Bull. 26:277-323. https://doi.org/10.21900/j.inhs.v26.178 Google Scholar
- 2021. Twelve years of SAMtools and BCFtools. GigaScience 10:giab008. https://doi.org/10.1093/gigascience/giab008 CrossrefGoogle Scholar
- 2017. Bretziella, a new genus to accommodate the oak wilt fungus, Ceratocystis fagacearum (Microascales, Ascomycota). MycoKeys 27:1-19. https://doi.org/10.3897/mycokeys.27.20657 CrossrefGoogle Scholar
- 2021. Telomeric and sub-telomeric structure and implications in fungal opportunistic pathogens. Microorganisms 9:1405. https://doi.org/10.3390/microorganisms9071405 CrossrefGoogle Scholar
- 2002. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 30:1575-1584. https://doi.org/10.1093/nar/30.7.1575 CrossrefGoogle Scholar
- 2023. Biosurveillance of oak wilt disease in Canadian areas at risk. Can. J. Plant Pathol. 46:27-38. https://doi.org/10.1080/07060661.2023.2261890 CrossrefGoogle Scholar
- 2019. Mapping the potential distribution of oak wilt (Bretziella fagacearum) in east central and southeast Minnesota using Maxent. J. For. 117:579-591. https://doi.org/10.1093/jofore/fvz053 CrossrefGoogle Scholar
- 1980. The Transmission of Oak Wilt. USDA Forest Service Research Paper NC-185. North Central Forest Experiment Station, St. Paul, MN, U.S.A. https://doi.org/10.2737/NC-RP-185 CrossrefGoogle Scholar
- 2011. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat. Biotechnol. 29:644-652. https://doi.org/10.1038/nbt.1883 CrossrefGoogle Scholar
- 2014. MycoCosm portal: Gearing up for 1000 fungal genomes. Nucleic Acids Res. 42:D699-D704. https://doi.org/10.1093/nar/gkt1183 CrossrefGoogle Scholar
- 1944. Chalara quercina n. sp., the cause of oak wilt. Phytopathology 34:631-635. Google Scholar
- 2011. Challenges and successes in managing oak wilt in the United States. Plant Dis. 95:888-900. https://doi.org/10.1094/PDIS-12-10-0944 LinkGoogle Scholar
- 2009. Circos: An information aesthetic for comparative genomics. Genome Res. 19:1639-1645. https://doi.org/10.1101/gr.092759.109 CrossrefGoogle Scholar
- 2015. FinisherSC: A repeat-aware tool for upgrading de novo assembly using long reads. Bioinformatics 31:3207-3209. https://doi.org/10.1093/bioinformatics/btv280 CrossrefGoogle Scholar
- 2021. BUSCO update: Novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol. 38:4647-4654. https://doi.org/10.1093/molbev/msab199 CrossrefGoogle Scholar
- 2022. Early detection of the oak wilt fungus (Bretziella fagacearum) using trapped nitidulid beetle vectors. For. Pathol. 52:e12767. https://doi.org/10.1111/efp.12767 CrossrefGoogle Scholar
- 2024. Seasonal activity and phoresy rates of nitidulid beetles (Coleoptera: Nitidulidae) captured in stands with oak wilt infections in northern Michigan, USA. Environ. Entomol. nvae101. https://doi.org/10.1093/ee/nvae101 CrossrefGoogle Scholar
- 2012. BEDOPS: High-performance genomic feature operations. Bioinformatics 28:1919-1920. https://doi.org/10.1093/bioinformatics/bts277 CrossrefGoogle Scholar
- 2018. Analysis of microsatellites from transcriptome sequences of Phytophthora capsici and applications for population studies. Sci. Rep. 8:5194. https://doi.org/10.1038/s41598-018-23438-8 CrossrefGoogle Scholar
- 2010. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 26:841-842. https://doi.org/10.1093/bioinformatics/btq033 CrossrefGoogle Scholar
- 1965. Distribution of oak wilt. Plant Dis. Rep. 49:1007-1010. Google Scholar
- 1970. Oak bark beetles–important vectors of oak wilt. J. For. 68:294-297. https://doi.org/10.1093/jof/68.5.294 Google Scholar
- 2016. Draft genome sequences for Ceratocystis fagacearum, C. harringtonii, Grosmannia penicillata, and Huntiella bhutanensis. IMA Fungus 7:317-323. https://doi.org/10.5598/imafungus.2016.07.02.11 CrossrefGoogle Scholar
Data availability: The genome and transcriptome sequence of C519 was deposited in the National Center for Biotechnology Information (NCBI) GenBank under the BioProject accession numbers PRJNA677221 and PRJNA677222 and BioSample accession numbers SAMN16773066 and SAMN16774004. The data can be accessed at https://mycocosm.jgi.doe.gov/Brefa1/Brefa1.home.html.
Funding: The work conducted by the United States Department of Energy (DOE) Joint Genome Institute (https://ror.org/04xm1d337), a DOE Office of Science User Facility, is supported by the Office of Science of the United States DOE operated under contract number DE-AC02-05CH11231. This research was funded in part by the Michigan Invasive Species Grant Program and Project GREEEN.
The author(s) declare no conflict of interest.