BacteriologyFree Access icon

Detection of a Single-Copy Plasmid, pXFSL21, in Xylella fastidiosa Strain Stag’s Leap with Two Toxin-Antitoxin Systems Using Next-Generation Sequencing

    Affiliations
    Authors and Affiliations
    • Christopher Van Horn
    • Fengnian Wu
    • Zheng Zheng
    • Zehan Dai
    • Jianchi Chen
    1. First and fifth authors: United States Department of Agriculture–Agricultural Research Service San Joaquin Valley Agricultural Sciences Center, Parlier, CA 93648, USA; and second, third, and fourth authors: Department of Plant Pathology, South China Agricultural University, Guangzhou, Guangdong, China.

    Abstract

    Plasmids are important genetic elements contributing to bacterial evolution and environmental adaptation. Xylella fastidiosa is a nutritionally fastidious Gram-negative bacterium causing economically devastating diseases such as Pierce’s disease (PD) of grapevine. In this study, the plasmid status of a highly virulent PD strain, Stag’s Leap, originally isolated from Napa Valley, CA, was studied using sequencing and bioinformatics tools. DNA samples extracted from a pure culture in periwinkle wilt medium (in vitro DNA) and a PD-symptomatic grapevine artificially inoculated in the greenhouse (in planta DNA) were subject to next-generation sequencing (NGS) analyses (Illumina MiSeq or HiSeq). Sequence analyses and polymerase chain reaction experiments revealed the presence of a circular plasmid, pXFSL21, of 21,665 bp. This plasmid existed as a single copy per bacterial genome under both in vitro and in planta conditions. Two toxin-antitoxin (T-A) systems (ydcD-ydcE and higB-higA) were detected in pXFSL21, a possible mechanism for the long-term survival of this single-copy plasmid in the bacterial population. BLAST searches against the GenBank database (version 222) detected homologs of the two T-A systems in chromosomes or plasmids of some X. fastidiosa strains. However, double T-A systems were found only in pXFSL21. pXFSL21 was not found in other known PD strains and, therefore, could serve as a molecular marker for strain Stag’s Leap monitoring and tracking. The NGS-based technique outlined in this article provides an effective tool for identifying single- or low-copy-number plasmids in fastidious prokaryotes.

    Plasmids are self-replicating, extrachromosomal genetic elements responsible for horizontal gene transfer that contribute to bacterial evolution and environmental adaptation, including virulence (Coplin 1989). High-copy-number plasmids are well known for biotechnological manipulations such as DNA fragment cloning. However, many bacterial plasmids such as those associated with environmental adaptation could be at a low copy number or even single copy. Low-copy plasmids are advantageous for survival by lowering host metabolic burdens (Sternberg and Austin 1981). Virulence plasmids are often large (>20 kbp), because they carry genes for functions of replication, dissemination, and maintenance (RDM) in addition to those conferring advantageous phenotypes (CAPs) (Sengupta and Austin 2011).

    Xylella fastidiosa is a Gram-negative, xylem-limited, bacterial pathogen causing diseases in many economically important plants; for example, Pierce’s disease (PD) of grapevine in California and the southeastern United States (Hopkins 1989), citrus variegated chlorosis (CVC) in Brazil (Chang et al. 1993), and olive quick decline syndrome (OQDS) in Italy (Giampetruzzi et al. 2015; Saponari et al. 2013). Due to its nutritional fastidiousness, research on X. fastidiosa has been highly challenging. A 1.3-kbp plasmid was observed in an X. fastidiosa periwinkle wilt (PW) strain (Chen et al. 1992) and sequenced by Sanger’s method (Pooler and Hartung 1997). Concurrent with a significant increase in X. fastidiosa research since the 2000s due to outbreaks of PD, CVC, and OQDS, in total, 46 plasmids ranging from 1.3 to 64 kbp in size have been identified, annotated, and deposited in the GenBank database (version 222) (Supplementary Table S1). Most of the plasmid discoveries were derived from whole-genome sequencing efforts, particularly the use of next-generation sequencing (NGS) technology. Thirty-three of the X. fastidiosa plasmids are >20 kbp in size. In all cases, copy numbers of the sequenced plasmids were not estimated, with the exception of the PW plasmid, estimated to be at least 60 copies per genome (Chen et al. 1992).

    Although in low copy number, large plasmids have intricate mechanisms for their maintenance in bacterial populations. One mechanism is the toxin-antitoxin (T-A) system (Gerdes et al. 1986; Hayes 2003). In the T-A system, a plasmid encodes a stable toxin and a labile antitoxin. Daughter cells without the plasmid will be killed by the toxin because the unstable antitoxin is degraded. Such a phenomenon is also called postsegregational killing (Gerdes et al. 1986). The presence of T-A systems in X. fastidiosa were detected through genome sequence analyses (Pandey and Gerdes 2005), and functions of the plasmid-borne T-A system (pemI/pemK) have been confirmed in vitro (Lee et al. 2014; Stenger et al. 2010). T-A systems have also been found in bacterial chromosomes and received extensive research in light of mechanisms of virulence and programing cell death (Burbank and Stenger 2016; Pandey and Gerdes 2005).

    The Stag’s Leap strain of X. fastidiosa subsp. fastidiosa was originally isolated from Napa Valley in California (Buzkan et al. 2005) and has been commonly used as a model for PD research because of its high virulence (Krivanek and Walker 2005; Krugner et al. 2014; Wallis et al. 2013). We are interested in further studying the biology of this model strain using NGS technology and bioinformatics tools. Toward this end, a draft whole-genome sequence of the Stag’s Leap strain was obtained (Chen et al. 2016). Based on available genomic sequence data, this study evaluated the plasmid status of this strain through both in vitro and in planta experiments and analyses. A single-copy plasmid, pXFSL21, with two T-A systems was identified. Implications of using an NGS approach to study plasmids in X. fastidiosa and other fastidious prokaryotes are also discussed.

    MATERIALS AND METHODS

    Bacterial strains and DNA samples.

    X. fastidiosa strain Stag’s Leap was maintained on PW medium (Davis et al. 1981) at 28°C, with regular transfer every 20 to 30 days since 2010. Bacterial DNA was obtained from both in vitro and in planta sources. For the in vitro sample, cells were collected from culture (14 days) and DNA was extracted according to a standard procedure (Maniatis et al. 1982). For in planta samples, a grapevine (Vitis vinifera), cultivar Thompson Seedless, grown in the greenhouse for 4 months, was inoculated twice on the stem using the pinprick method with 10-µl drops of X. fastidiosa cell suspension using a 20-guage syringe needle (Hopkins 2001). Twelve weeks after inoculation, PD symptoms on leaves (marginal necrosis) were observed and the most severe symptoms were near the site of inoculation. Leaf petiole samples were collected and macerated with an automated tissue homogenizer (Geno/Grinder; SPEX SamplePrep, Metuchen, NJ). DNA was extracted using the NucleoSpin Plant II kit (Macherey-Nagel, Düren, Germany), quantified and used for polymerase chain reaction (PCR) or sequencing experiments. The presence of X. fastidiosa in the sample was confirmed by standard PCR with primer sets Teme150fc-Teme454rg and Dixon454fc-Dixon1261rg (Chen et al. 2005). Additionally, primer set Teme150fc-Teme454rg was used for quantitative real-time PCR (qPCR) with SYBR green.

    DNA sequencing and whole-genome assembly.

    NGS experiments were performed through a commercial service. A brief description on sequencing the Stag’s Leap genome from in vitro culture DNA with the Illumina MiSeq 2 × 300 platform has been published (Chen et al. 2016). The in planta sample (mixture of both X. fastidiosa and grapevine DNA) was sequenced by Illumina HiSeq 2 × 100 (Illumina, Inc., San Diego, CA). With sequence reads from either MiSeq (in vitro) or HiSeq (in planta), both referenced and de novo assemblies were performed using CLC Genomics Workbench software (v. 10.1.1). For reference assembling of HiSeq reads (in planta), default parameters were used. The published Stag’s Leap genome sequence (NZ_LSMJ00000000.1) (Chen et al. 2016) was used as a reference. For de novo assembling, parameters were as follows; word size = 64, bubble size = 200, minimum contig length = 250, mismatch cost = 2, insertion cost = 3, deletion cost = 3, length fraction = 0.6, and similarity fraction = 0.9.

    Plasmid detection.

    Two approaches were used for plasmid screening in the draft Stag’s Leap genome sequence (NZ_LSMJ00000000.1). (i) One approach identified contigs with high similarity to the known X. fastidiosa plasmids. The 46 X. fastidiosa plasmid sequences in GenBank were downloaded. The 15 contigs in the draft Stag’s Leap genome were used as queries to BLAST search against the 46 X. fastidiosa plasmids (word size = 7, e-value = 1e−10). Contigs with high bit score were selected as candidate plasmids. (ii) Another approach analyzed contigs for typical plasmid structures. Annotation of the 15 contigs in NZ_LSMJ00000000.1 through the RAST server (http://rast.nmpdr.org/) (Aziz et al. 2008) were examined for the simultaneous presence of genes associated with both RDM and CAP (Coplin 1989).

    Plasmid sequence enclosure, annotation, and protein three-dimensional structure analyses.

    Contig 15 of NZ_LSMJ00000000.1 was identified as a plasmid candidate. Two approaches were used to prove circularity. (i) One approach involved in silico read mapping, as previously described (Zheng et al. 2018). Briefly, a 5,000-bp section (A) at the 5′ end of contig 15 was cut and attached to the 3′ end of the remaining region (B) to make sequence B-A. As a control, a 5,000-bp sequence randomly selected from nucleotide position 564,797 to 569,797 bp of contig 1 (designated as C) was cut and attached to region B to replace A and make sequence B-C. Both sequences B-A and B-C were used as the references for mapping with both MiSeq and HiSeq reads and check for smooth read transition (Supplementary Fig. S1). (ii) Another approach involved PCR and sequencing. Primers were designed from both the 5′ and 3′ ends of contig 15 with outward directionality (primer sets 21250F-137R and 21124F-617R) (Supplementary Table S3). Primer3Plus (Untergasser et al. 2007) was used to assist primer designs. Amplicons of standard PCR were further sequenced through the Sanger method on an Applied Biosystems 3130xl Genetic Analyzer. To search for chromosomal integration, contig 15 was used as a query to BLAST search against the other 14 contigs in NZ_LSMJ00000000.1 for highly similar regions. CLC Genomics software was used to view gene orientation and annotation and to generate the genetic map.

    Protein structure analysis of selected genes was initially predicted in the Phyre server (http://www.sbg.bio.ic.ac.uk/∼phyre2/html/page.cgi?id=index) using a profile-profile alignment algorithm (Kelley et al. 2015). Final three-dimensional (3D) structure was optimized by using PyMOL Molecular Graphics System, version 2.0 (Schrödinger LLC).

    Copy number estimation.

    Two methods were used to estimate plasmid copy number. (i) The first method was comparison of average nucleotide coverages (ANCs) between plasmid candidate and chromosomal contig sequences, as well as selected plasmid and chromosomal genes, through read mapping using MiSeq and HiSeq data. The ANC ratio of plasmid/chromosomal sequence were calculated. (ii) The second method was comparison of cycle threshold (Ct) values between plasmid-specific gene primer sets and chromosome-specific gene primer sets. Primer sets were designed to generate 200-bp amplicons. The single-copy glyceraldehyde 3-phosphate dehydrogenase (GAPDH) gene was used as a reference to estimate the copy number of the target plasmid gene by the ratio of (Etarget)^Ct(target)/(Ereference)^Ct(reference) (Pfaffl 2001) based on six replications.

    PCR procedures and statistical analyses.

    Standard PCR was performed according to a previous report (Chen et al. 2005). Briefly, PCR (20 μl) was carried out using the Takara Taq (Hot Start Version) kit (Takara Bio Inc., Otsu, Japan). The reaction mixture contained 10 mM Tris-HCl, pH 8.3; 50 mM KCl; 1.5 mM MgCl2; 100 μM each dNTP; 0.5 μM each primer; 1 unit of Taq DNA polymerase; and 1 μl of cell suspension. Amplification was conducted in a thermocycler (Model PTC-100; MJ Research, Waltham, MA) with an initial denature at 98°C for 5 min followed by 30 cycles consisting of denaturing at 96°C for 30 s, annealing at 55°C for 15 s, and extension at 72°C for 60 s. The amplification products were then stored at 4°C. The amplified DNAs were resolved through 1.5% agarose gel electrophoresis and visualized after staining.

    For qPCR, the DNA sample was diluted to 2 ng/µl in pure water. Using Fast SYBR Green Master Mix (Thermo Fisher Scientific, Waltham, MA), qPCR reactions were set up containing 2 µl of DNA template, 2× Fast SYBR Green Master Mix (containing AmpliTaq Fast DNA polymerase, dNTPs, SYBR Green I dye, ROX dye passive reference, MgCl2, and stabilizers), and 0.25 µM each of forward and reverse primers for a final reaction volume of 20 µl.

    Homologs of T-A system genes.

    Each T-A system gene sequence was used as query to BLAST search against the GenBank nr and whole-genome shotgun sequence database (version 222) that included 46 plasmids and 41 whole-genome sequences of X. fastidiosa (Supplementary Table S2) with word size = 11 and expected threshold = 10. Homologs were evaluated by query coverage and sequence identity. Similarly, related non-X. fastidiosa bacteria were identified.

    RESULTS

    DNA sequencing and whole-genome assembly.

    Selected metrics of Illumina MiSeq and HiSeq sequencing and the derived assemblies are shown in Table 1. MiSeq generated 6 million 301-bp reads with a total of nearly 2 giga bp. HiSeq generated >300 million 101-bp reads with a total of nearly 32 giga bp. However, the HiSeq data were a mixture of sequences of both grape and X. fastidiosa. The Xylella DNA was estimated to be approximately 2% [(2,316,903 × 508 ANC)/31,935,159,902] (Tables 1 and 2). Metrics of X. fastidiosa-only were shown under reference mapping, with 493 contigs (2,272,127 bp) from MiSeq and 235 contigs (2,316,903 bp) from HiSeq covering the draft Stag’s Leap genome sequence (NZ_LSMJ00000000.1). Adding all de novo contigs with ≥1,000 bp, the draft genome size from in vitro DNA (2,170,878 bp in 77 contigs) and the draft genome size from in planta DNA (2,297,788 bp in 104 contigs) were comparably similar but not identical, likely due to variations in assembling processes.

    TABLE 1. Comparison of selected metrics of MiSeq (in vitro) and HiSeq (in planta) experiments and genome assembly of Xylella fastidiosa strain Stag’s Leap

    TABLE 2. Comparison of average nucleotide coverages (ANCs) on mapping the draft genome sequence of Xylella fastidiosa strain Stag’s Leap (NZ_LSMJ00000000.1) between MiSeq (in vitro) and HiSeq (in planta) dataa

    Plasmid detection.

    Contig 15 of NZ_LSMJ00000000.1 (Table 2) was identified to be a plasmid because (i) according to annotation, the contig with 27 open reading frames (ORFs) contained both RDM genes (14 ORFs: ORF1, 5, 8, 9, and 18 to 27) and CAP genes (2 ORFs: ORF13 and 14) (Table 3); (ii) contig 15 had high BLAST bit scores (>8,000) to four X. fastidiosa plasmids (pXF-P4.OLS0478, pXF-P4.OLS0479, pXF-De Donno, and unnamed from CoDiRO); (iii) MiSeq read mapping showed continuity in the connection of 5′ and 3′ ends; and (iv) PCR primer sets 21250F-137R and 21124F-617R (Fig. 1B) amplified the expected DNA bands (Fig. 1C). The enclosure was further confirmed by Sanger sequencing of the amplicons. Therefore, contig 15 was designated as pXFSL21 (Fig. 1B). pXFSL21 derived from two de novo contigs of MiSeq data (in vitro DNA) or four de novo contigs of HiSeq data (in planta DNA) (Table 1). The sequence of pXFSL21 has been deposited in GenBank under the accession number MG879030.

    TABLE 3. General information of pXFSL21 from Xylella fastidiosa strain Stag’s Leap

    Fig. 1.

    Fig. 1. Schematic descriptions of pXFSL21. A, Chromosomal integration of dnaG. Annotation of selected genes: gdhA = NAD-specific glutamate dehydrogenase, nadK = NAD kinase, Nt5 = 5′-nucleotidase, hyp = hypothetical protein, romA = outer membrane protein, and exo1 = exodeoxyribonuclease I. B, Circular pXFSL21 with annotated genes and open reading frames. The two toxin-antitoxin systems (ydcD-ydcE and higB-higA) are highlighted. Solid single-line arrows are relative positions of primers used for polymerase chain reaction (PCR) enclosure experiments. C, Agarose gel electrophoresis of PCR amplicons from enclosure primer sets (21250F-137R and 21124F-617R) from in vitro and in planta experiments.

    Download as PowerPoint

    Among the 27 ORFs, 22 had predicted gene functions, including 3 DNA primase, 2 resolvase/integrase, 8 in the trb conjugative transfer family, 4 that formed two pairs of T-A systems (ydcD-ydcE, and higB-higA), and 4 in a multidrug efflux pump system involving acriflavine resistance (Table 3). The 3D structures of the two T-A systems proteins are shown in Figure 2A to D.

    Fig. 2.

    Fig. 2. Predicted three-dimensional structure, by Phyre2, of proteins encoded by A, YdcD; B, YdcE; C, HigB; and D, HigA. N = N terminal and C = C terminal. The helix-turn-helix motif in higA is the DNA binding site.

    Download as PowerPoint

    Copy number estimation.

    ANCs of both MiSeq and HiSeq data to the 15 contigs of the draft Stag’s Leap genome sequences were listed in Table 2. Note that contig 15 was renamed as pXFSL21 (GC% = 50.1). Contig 12 had a size of 21,851 (GC% = 50.4), similar to that of pXFSL21. ORF analyses of contig 12 did not reveal typical RDM and CAP genes. Instead, four tRNA genes (tRNA-arg, tRNA-pro, tRNA-his, and tRNA-lys), typically chromosome borne, were found. Therefore, contig 12 was used as a chromosome reference to pXFSL21 for ANC ratio calculations. For in vitro experiments, ANC-pXFSL21/ANC-contig 12 was 1.09 (441/403) (Table 2). For in planta experiments, ANC-pXFSL21/ANC-contig was 0.98 (481/492) (Table 2). The near one-to-one ANC ratios suggested that pXFSL21 is single copy per genome of X. fastidiosa under both in vitro and in planta conditions.

    Results of ANC analyses at a gene level are shown in Table 4. Using the double-copied chromosomal gene of 16S ribosomal RNA (rRNA) (rrs) as a reference, all selected pXFSL21 genes assembled from MiSeq (in vitro DNA) data had ANC ratios to rrs of approximately 0.5 (0.37 to 0.59), with the exception of dnaG1. BLAST search with pXFSL21 as query detected an identical copy of dnaG1 in contig 2 (Fig. 1A). Therefore, this gene is double copied, one in the chromosome and the other in pXFSL21 (Fig. 1B).

    TABLE 4. Average nucleotide coverage (ANC) analyses of selected genes from pXFSL21 and Xylella fastidiosa strain Stag’s Leap chromosomea

    In the HiSeq (in planta DNA) data mapping, ANC of rrs was extraordinarily high (55,770×) (Table 4). This was not surprising due to the conserved nature of 16S rRNA sequences between the plant chloroplast and bacteria. The double-copied dnaG1 gene sequence was then used as reference for the ANC ratio calculation. All selected pXFSL21 genes showed the ratio of approximately 0.5 (0.44 to 0.56) (Table 4). As a control, all selected single-copy chromosomal genes also showed a ratio of approximately 0.5 (0.36 to 0.56).

    PCR experiments comparing plasmid genes tetR and acrB to the singled-copied chromosomal GAPDH gene showed copy numbers of 1.16 and 0.84, respectively, with the in vitro DNA, and 1.13 and 1.16, respectively, with the in planta DNA (Table 5). These results further supported the conclusions from in silico analyses that pXFSL21 was a single-copy plasmid in strain Stag’s Leap.

    TABLE 5. pXFSL21 copy-number (CN) analysis by quantitative polymerase chain reactiona

    Homologs of pXFSL21 T-A systems.

    Homologs of ydcD-ydcE were detected in 26 X. fastidiosa GenBank sequence records (10 plasmids and 16 whole-genome sequences) (Table 6). Based on sequence similarity, a gene in plasmid PP1 of Pseudomonas syringae was highly related to ydcD, and a gene in plasmid pP2 of Xanthomonas citri was highly similar to ydcE. The predicted 3D structure of ydcE protein, or YdcE, was not similar to the representative structure of the YdcE/MazF/PemK family of endoribonuclease functions (Pellegrini et al. 2005). Homologs of higB-higA were detected in only two Xylella fastidiosa strains, Ann-1 and CO33 (Table 7). The closest bacterium with the higB-higA homolog was Sphingobium cloacae with its plasmid pSCLO_4. The predicted 3D structures of both HigB and HigA were very similar to the those of a Proteus vulgaris strain (Schureck et al. 2014).

    TABLE 6. Homologs of ydcD-ydcE of pXFSL21 in Xylella fastidiosa and representative bacteria in GenBank database (version 222)a

    TABLE 7. Homologs of higA-higB of pXFSL21 in Xylella fastidiosa and representative bacteria in GenBank database (version 222)a

    DISCUSSION

    Traditionally, plasmids are identified by their unique physical features (compared with chromosomal DNA) such as supercoiled conformation and small size through appropriate experimental procedures such as agarose gel electrophoresis. Copy number could be estimated based on intensity of stained DNA bands (Chen et al. 1992; Projan et al. 1983). However, visualization of low-copy-number plasmids in agarose gels is difficult if the plasmid quantity is not significantly enriched. Adding to the problem in this study is the slow-growth nature of X. fastidiosa in vitro. Preparation of large quantities of DNA from slow-growing bacteria is challenging. The Southern blot procedure could increase detection sensitivity (Chen et al. 1992); however, preknowledge of plasmid sequence or DNA is required, which was not available for the strain Stag’s Leap.

    In this study, we successfully applied NGS technology to detect a single-copy plasmid in strain Stag’s Leap. The key to success was the high-throughput capacity of NGS. From a microliter level of DNA sample, NGS (HiSeq or MiSeq) generated a high volume of short-sequence reads that evenly or almost evenly covered the whole X. fastidiosa genome over hundreds of times (Table 2). The short sequence reads were assembled to form long sequence contigs. Candidate plasmid contigs were predicted based on bioinformatics information such as genes involving RDM and CAP. Plasmid circularity was evaluated through read mapping, and confirmed by PCR-Sanger sequencing. Plasmid copy number was readily estimated through ANC analyses and supported by qPCR experiments.

    Another significant advantage of the NGS approach is that pXFSL21 was detected and characterized not only from pure in vitro X. fastidiosa cultures but also from infected in planta grapevine tissue. This approach is particularly useful for fastidious prokaryote research where pure culture is difficult, or even not possible, to obtain. Indeed, the NGS approach is playing a major role in detection and characterization of circular plasmids or prophages in a nonculturable bacterium, ‘Candidatus Liberibacter asiaticus’, associated with citrus huanglongbing (Zheng et al. 2018). Interestingly, the culturable property of the X. fastidiosa allowed us to validate both in vitro and in planta methods simultaneously (Tables 2 and 4).

    Growth of X. fastidiosa in grapevine is a natural system in which to study pathogen–host interactions of PD. NGS technology provided an opportunity to study the behavior of the X. fastidiosa genome directly in a plant host. pXFSL21 (contig 15) was clearly single copy under both in vitro and in planta conditions (Tables 2 and 4). However, this was not the case for contig 6 (Table 2). The 19,107-bp contig 6 doubled its copy number under in planta condition (ANC = 940) compared with that under in vitro condition (ANC = 490). Annotation showed that contig 6 contained 24 ORFs that encoded exclusively phage or hypothetical proteins, including a phage lysozyme. Detailed analyses on contig 6 are outside the scope of this study and deserve future attention. Yet analyses of pXFSL21 and contig 6 have demonstrated the potential research power of an NGS approach.

    T-A systems are mechanisms well known for their role in maintaining low-copy-number plasmids in a bacterial population (Dmowski and Jagura-Burdzy 2013; Unterholzner et al. 2013; Yang and Walsh 2017). A single T-A system in a X. fastidiosa DNA was described previously (Lee et al. 2014; Stenger et al. 2010). This study detected two T-A system in pXFSL21. The Stag’s Leap culture hosting pXFSL21 in this study has been subcultured in PW medium numerous times since 2010. It is apparent that pXFSL21 was consistently maintained or survived through all the subculturing as well as in planta manipulations. For this reason, we assumed that the double T-A systems could have provided a double dosage effect that strengthened the survivability of pXFSL21. However, further experimental validations are needed to prove this theory.

    The T-A systems of ycdD-ycdE and higB-higA have been described in multiple bacterial systems (Budde et al. 2006; Pellegrini et al. 2005; Schureck et al. 2014). Both ydcD-ydcE and higB-higA are type II T-A systems (i.e., the toxin is neutralized by binding of the partner antitoxin protein). The proteinaceous nature of the type II system makes it a feasible target for artificial activation of the toxin for pathogen control. Identification of compounds that disrupt T-A complexes would directly activate the toxin, leading to bacterial cell killing (Unterholzner et al. 2013). The two toxin genes are relatively small: ydcE is 315 bp coding for a 104-amino-acid (aa) protein (Fig. 2B) and higB is 279 bp coding for a 92-aa protein (Fig. 2C). The identification of these T-A system genes on a unique plasmid creates the opportunity to clone and express these toxin genes in planta to potentially control PD. The knowledge obtained through NGS along with bioinformatics analyses provides the necessary sequence and genomic structural information to generate targeted experiments on the biology and gene function of an organism.

    Due to application of NGS, the GenBank database has accumulated >40 X. fastidiosa plasmid sequences and >40 of the bacterial whole-genome sequences. The large sequence collections provided a convenient and efficient means for X. fastidiosa characterization population wide. For example, the ydcD-ydcE system was found in X. fastidiosa strains from different geographical origins and formed two groups: one with ydcD almost identical (100% length coverage, 98 to 100% identity) to that in pXFSL21, and the other much different ydcD (48% length coverage, 70 to 76% identity) (Table 6). In contrast, higB-higA was limited only to strain Ann-1 and CO33. The two T-A systems could be either plasmid- or chromosome-borne, as in other bacteria (Pandey and Gerdes 2005). Many non-Xylella sequences similar to the four T-A system genes were found. Phylogenetic trees constructed from these sequences were not informative. Instead, the most similar bacteria were listed for phylogenetic reference. Along the line of plasmid characterization, we were not able to identify ori of pXFSL21 with high confidence.

    Furthermore, among the five GenBank documented PD strains (i.e., Temecula1, M23, GB514, ATCC 35879, and Stag’s Leap), pXFSL21 was only found in the Stag’s Leap strain. Given the genetic content of this plasmid, particularly the CAP genes acrA and acrB, predicted to be involved in resistance to toxic compounds, pXFSL21 may be a promising lead for targeted experiments into the function of plasmid located genes and their effect on the biology of X. fastidiosa. This plasmid may play a role in the prevalence or virulence specific to the Stag’s Leap strain. However, direct experimental evidence is currently lacking. Nevertheless, pXFSL21 could serve as a molecular marker, or an initial point, to study the uniqueness or its role in virulence of the Stag’s Leap strain.

    ACKNOWLEDGMENTS

    We thank G. Phillips, R. Huerta, and S. Vargas for technical assistance. The mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture. The USDA is an equal opportunity provider and employer.

    LITERATURE CITED