Genomic Analysis of the Carrot Bacterial Blight Pathogen <i>Xanthomonas hortorum</i> pv. <i>carotae</i> in Korea

Mi-Hyun Lee; Sung-Jun Hong; Dong Suk Park; Hyeonheui Ham; Hyun Gi Kong

doi:10.5423/PPJ.NT.11.2022.0149

Plant Pathol J > Volume 39(4); 2023 > Article

Lee, Hong, Park, Ham, and Kong: Genomic Analysis of the Carrot Bacterial Blight Pathogen Xanthomonas hortorum pv. carotae in Korea

Note

The Plant Pathology Journal 2023;39(4):409-416.

Published online: August 01, 2023

DOI: https://doi.org/10.5423/PPJ.NT.11.2022.0149

Genomic Analysis of the Carrot Bacterial Blight Pathogen Xanthomonas hortorum pv. carotae in Korea

Mi-Hyun Lee¹, Sung-Jun Hong², Dong Suk Park¹, Hyeonheui Ham¹, Hyun Gi Kong^1,^3,^*

¹Crop Protection Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju 54875, Korea

²Organic Agricultural Division, National Institute of Agricultural Sciences, Rural Development Administration, Wanju 54875, Korea

³Department of Plant Medicine, College of Agriculture, Life & Environment Sciences, Chungbuk National University, Cheongju 28644, Korea

^*Corresponding author. Phone) +82-43-261-2554, FAX) +82-43-271-4414, E-mail) khgidea@chungbuk.ac.kr

Handling Editor: Young-Su Seo

(Received on November 16, 2022; Revised on June 2, 2023; Accepted on June 2, 2023)

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Bacterial leaf blight of carrots caused by Xanthomonas hortorum pv. carotae (Xhc) is an important worldwide seed-borne disease. In 2012 and 2013, symptoms similar to bacterial leaf blight were found in carrot farms in Jeju Island, Korea. The phenotypic characteristics of the Korean isolation strains were similar to the type strain of Xhc. Pathogenicity showed symptoms on the 14th day after inoculation on carrot plants. Identification by genetic method was multi-position sequencing of the isolated strain JJ2001 was performed using four genes (danK, gyrB, fyuA, and rpoD). The isolated strain was confirmed to be most similar to Xhc M081. Furthermore, in order to analyze the genetic characteristics of the isolated strain, whole genome analysis was performed through the next-generation sequencing method. The draft genome size of JJ2001 is 5,443,372 bp, which contains 63.57% of G + C and has 4,547 open reading frames. Specifically, the classification of pathovar can be confirmed to be similar to that of the host lineage. Plant pathogenic factors and determinants of the majority of the secretion system are conserved in strain JJ2001. This genetic information enables detailed comparative analysis in the pathovar stage of pathogenic bacteria. Furthermore, these findings provide basic data for the distribution and diagnosis of Xanthomonas hortorum pv. carotae, a major plant pathogen that infects carrots in Korea.

Keywords: multilocus sequence analysis, whole genome analysis, Xanthomonas hortorum pv. carotae

Bacterial leaf blight caused by the plant pathogenic bacterium Xanthomonas hortorum pv. carotae (Xhc) was first described in carrots (Daucus carota L. subsp. sativus Hoffm.) in 1934 (Kendric, 1934). Bacterial blight caused by Xhc has become established globally, and is frequently observed in carrot crops in Europe, North America, and Asia. In Korea, carrot bacterial blight caused by Xhc has been designated and managed as a plant quarantine pathogen since 1996. The first Korean outbreak of Xhc was observed in a carrot field on Jeju Island in 2012, with a subsequent outbreak seen in two Jeju carrot fields in 2013 (du Toit et al., 2014; Myung et al. 2014; Pruvost et al., 2010).

Much of the global carrot seed supply is supplied by New York State in the United States. The US carrot seed supply produces 60% of seeds used to produce carrot root crops in Korea and up to 40% of the world’s supply of carrot seed. Harvested plant seeds can be a source of Xhc transmission, and infected seeds can act as a distribution mechanism by which Xhc can spread to new countries (du Toit et al., 2005). Symptoms of bacterial blight in carrot include small, irregular yellow lesions on the leaves, stems, and petioles that may appear as water-soaked necrotic lesions (Gilbertson, 2002). X. hortorum is one of several plant pathogenic Xanthomonas species and requires phylogenetic identification in addition to symptomatic characterization. Several studies have sequenced the 16S rRNA and 16S-23S internal transcribed spacer regions in the Xanthomonas genus (Adriko et al., 2014; Hauben et al., 1997; Maes, 1993), and phylogenetic classification of the Xanthomonas genus can also be achieved using multilocus sequence analysis (MLSA) (Parkinson et al., 2007; Young et al., 2008). X. hortorum was defined using DNA hybridization studies within the Xanthomonas genus (Vauterin et al., 1995). Subsequent phylogenetic and multi-position sequencing studies using gyrB or four housekeeping genes confirmed that X. hortorum grouped with X. cynarae and X. gardneri to form diverse clades (Young et al., 2008). However, more accurate chromosome mapping is needed for precise pathogen diagnosis and to facilitate Xhc characterization and evolutionary analysis. Currently, three Xhc assemblies are available in GenBank, one with a chromosome assembly and two with unassembled contigs (National Center for Biotechnology Information [NCBI] Xanthomonas hortorum pv. carotae database). However, a complete genome assembly with improved sequence quality and fewer contigs is essential to fully understand the species-specific Xhc genome and virulence diversity among highly conserved Xhc strains. In this study, we report a complete genome sequence of Xhc from Korea with high resolution. There have been no further reports of Xhc in Korea to date; however, to prepare for the risk of domestic outbreaks, a disease survey was conducted in 159 farms in eight cities and counties in the main carrot producing regions in 2020.

Xanthomonas strains were isolated from plants with bacterial leaf blight lesions from carrot farms in Korea (Fig. 1A). The sample surveyed a total of 159 farms in 8 cities and 5 provinces (Supplementary Table 1). Collected carrot leaves with bacterial blight symptoms were first surface-sterilized with 70% ethanol, and then disease-lesion border regions were excised, cut into small pieces, and immersed in 500 μl sterile water for 30 min. Bacteria were isolated by cultivating the immersion solution on yeast extract-dextrose-CaCO (YDC) medium agar at 27°C for 4 days.

Genomic DNA was extracted from isolated bacterial cultures using a Bacterial Genomic DNA Isolation kit (NORGEN Biotek, Thorold, Canada). Extracted DNA was polymerase chain reaction (PCR)-amplified using GoTaq Flexi4 DNA Polymerase (Promega, Madison, WI, USA). PCR amplification of Xanthomonas chaperone protein (dnaK), DNA gyrase subunit B (gryB), TonB dependent receptor (fyuA), and RNA polymerase sigma factor (rpoD) genes was performed as described previously (Young et al., 2008) (Supplementary Table 2). PCR conditions were as follows: 95°C for 5 min, 35 cycles of 95°C for 30 s, 60°C for 60 s, and 72°C for 1 min, followed by 72°C for 10 min. Sequences of the dnaK, gryB, fyuA, and ropD genes were collected from 30 Xanthomonas strains from NCBI and PAMDB (http://genome.ppws.vt.edu/cgi-bin/MLST/home.pl) and from Xhc strains isolated from Jeju Island in Korea. The four gene sequences were linked to create a base sequence of 4,620 bp and aligned using BioEdit 7.2.5. A phylogenetic tree was created for the aligned nucleotide sequences by repeating the bootstrap number of 1,000 using the maximum likelihood method using the MEGA 7 program. Isolated bacteria was found that exhibited high similarity to the previously sequenced Xhc M081 strain (GenBank accession no. AEEU01000001) (Fig. 1B).

For pathogenicity analysis, carrot seeds were planted in topsoil and cultivated at 25°C with 60% relative humidity until four or more true leaves had developed. Seedlings were then inoculated with 1 μl bacterial culture (O.D.600 = 0.5) and lightly wounded by stabbing the back of the leaf with a sterile 1 ml injection needle. Inoculated plants were incubated in a transparent plastic box for 2 days, and then were removed and cultivated at 60% humidity. Symptom development was observed for 30 days. The isolates, Xhc JJ2001, produced infection and necrosis in inoculated carrot leaves (Fig. 1C).

The isolates, Xhc JJ2001, was confirmed as pathogenic and was used for sequencing. Genomic DNA was extracted from the JJ2001 strain using an MGTM genomic DNA purification kit (Epicenter, Madison, WI, USA). High-quality, high-molecular-weight genomic DNA (8 μg) was used to prepare a 20 kb SMRTbell template for PacBioRSII sequencing. Genomic DNA size was determined using a Bioanalyzer 2100 (Agilent, Santa Clara, CA, USA). A final 10 μl library was prepared using a PacBio DNA Template Prep Kit 1.0. SMRTbell templates were annealed using the PacBio DNA/Polymerase Binding Kit P6. For sequencing, PacBio DNA Sequencing Kit 4.0 and eight SMRT cells were used, and each SMRT cell was captured using a PacBio RS-II (Pacific Biosciences, Menlo Park, CA, USA) sequencing platform from Macrogen (Seoul, Korea). Subreads generated by PacBio RS-II were assembled using the Hierarchical Genome Assembly Process (HGAP) (Chin et al., 2013) with default options. The chromosomal and plasmid assembly of the Xhc JJ2001 PacBio sequence was honed with additional Illumina short read sequencing.

For Illumina sequencing, genomic DNA integrity was confirmed by agarose gel electrophoresis and DNA was quantified using Quant-IT PicoGreen (Invitrogen, Carlsbad, CA, USA). Sequencing libraries were prepared with a TruSeq DNA Nano Sample Preparation Kit (Illumina, Inc., San Diego, CA, USA). Purified libraries were quantified using qPCR according to the manufacturer’s qPCR Quantification Protocol Guide (KAPA Library Quantification Kit for Illumina Sequencing Platform) and validated using an Agilent Technologies 2200 TapeStation. Libraries were then sequenced using HiSeq (Illumina, Inc.). Illumina raw data was processed to remove adapters and filtered by quality for error correction. Trimmomatic was used for adapter trimming. For error correction, reads with a base quality of at least 90% Q20 or higher were used. Genome assembly was modified using high-quality HiseqXten reads from Pilon v1.21 (Walker et al., 2014).

After assembly, Illumina reads were used to improve genomic sequence accuracy with Pilon. Subreads were mapped to assembled contigs to generate consensus sequences at depth-range. By adjusting the contigs, more accurate nucleotide genome sequences could be obtained and adapted for different analysis protocols (Bioto lnc., Daejeon, Korea). Genome annotation was performed using a prokaryotic genome annotation pipeline (Tatusova et al. 2016), including transfer RNAs and ribosomal RNAs. Prokka v1.13 (Seemann, 2014) was used for gene prediction and default annotations with the following options: --compliant, --rnamer, and --addgenes. For further annotation, predicted protein sets were identified with InterProScan v5.30-69.0 (Jones et al., 2014) and psiblast v2.4.0 (Camacho et al., 2009) using EggNOG DB v4.5 (Huerta Cepas et al., 2016). Circos v0.69.3 (Krzywinski et al., 2009) was used to generate a circular map representing each contig.

The draft genome sequence of Xhc JJ2001 had a total length of 5,458,083 bp and an N50 value of 5,443,372 bp. Contig 1 was 5,443,372 bp long and had a G + C content of 63.63 mol%. Contig 2 was 14,711 bp long and had a G + C content of 62.66 mol% (Table 1, Fig. 2A). The genome contained 4,571 coding sequence, six rRNA gene operons, and 54 tRNA genes. Genomic features are shown in Fig. 2. Using the EggNOG v. 4.5 database, 4,391 genes were classified into clusters of orthologous genes (COGs) functional groups. The most abundant COGs category was “Function unknown” (S; 1,193 genes), followed by “Replication, recombination and repair” (L; 346 genes), “Amino acid transport and metabolism” (E; 230 genes), “Cell wall, membrane biosynthesis” (M; 226 genes), “Inorganic ion transport and metabolism” (P; 215 genes), “Carbohydrate transport and metabolism” (G; 212 genes), “Signal transduction mechanisms” (T; 209 genes), and “Transcription” (K; 207 genes) noted as major categories (Fig. 2B). To evaluate similarities between the isolated pathogenic Xhc JJ2001 genome and genomes of other Xanthomonas species, the assembled M081 Xhc genome sequence was downloaded from the NCBI database, as well as sequences from the NCBI Xanthomonas RefSeq database (1,660 sequences), and species were compared by calculation of distance values (Ondov et al., 2016). The gene sequence-based species identification was performed using protein sequence analysis with the BLAST v2.9.0 program (Zhang et al., 2000) against the NCBI NR bacteria database (275,186,762 sequences), with blast (e-value ≤ 1e-10, best-match) used to estimate species via organism information for matching genes.

Comparative genome analysis was performed based on Mash-distance using genome data from Xhc JJ2001 and other Xanthomonas genome data from NCBI. Mash-distance was the lowest for Xhc CFBP7900 (GenBank accession no. NZ_CAJDKC010000000) 0.00892796, and second for Xhc M081, 0.0090861. Together with the results of the pathogenicity analysis, these results confirmed that Xhc JJ2001 was a X. hortorum pv. carotae (Supplementary Table 3).

In addition, 7 X. hortorum pathovar were selected and NCBI genome data (GCF_000505565.1, GCF_028580375.1, GCF_028370135.1, GCF_014338485.1, GCF_021352955.1, GCF_001908755.1, and GCF_021352995.1) was received, the genome was compared and analyzed. In the case of the isolated strain, JJ2001, it was found to have the most similar genome to Xhc (GCF_00505565.1), just like the MLSA results. In addition, in the case of each Pathovar, it was found to form a group similarly according to the host plant family (Fig. 3). Overall, there was no significant difference in gene size and G/C content between isolates and other pathovars (Table 2). Furthermore, through Mauve analysis, differences in pathogenic gene islands for each pathovar of X. hortoum were compared and analyzed. In the case of 7 pathovars, there was a difference in gene direction, but the distribution of pathogenic genes was the same (Fig. 4). The pathogenic gene island of JJ2001 was divided into two clusters, hrcT, hrpB7, hrcN, hrpB, hrpB4, hrcJ, jrpB2, and hrpB1 share a promoter, and hrcU and hrcV form another group. In the case of JJ2001, it had 7 avirulence protein encoding genes and 17 genes related to the type secretion system in the genome including the above genes (Supplementary Table 4). The complete genome sequence of X. hotorum JJ2001 has been deposited in NCBI GenBank assembly accession number CP101417 (https://www.ncbi.nlm.nih.gov/nuccore/CP101417.1/).

In conclusion, this study, hybrid assembly of long and contiguous Pacbio reads with short Illumina reads obtained from the novel JJ2001 Xhc strain generated a complete, high-resolution genome. Additional complete genomic assembly of X. hortorum pathovar could facilitate the exhaustive investigation of nucleotide and structural variations across Xanthomonas strains with complex taxonomic systems, which will aid in understanding the Xhc pathogen and provide effective and sustainable management strategies.

Notes

Conflicts of Interest

No potential conflicts of interest relevant to this article was reported.

Acknowledgments

This work was carried out with the support of the Cooperative Research Program for Agriculture Science and Technology Development (Project No. PJ01624101), Rural Development Administration, Republic of Korea. This work was supported by the research grant of Chungbuk National University in 2022.

Electronic Supplementary Material

Supplementary materials are available at The Plant Pathology Journal website (http://www.ppjonline.org/).

PPJ-NT-11-2022-0149-Supplementary-Table-1-3.pdf

PPJ-NT-11-2022-0149-Supplementary-Table-4.pdf

Fig. 1

Isolation and pathogenicity assay of Xanthomonas strains from carrot farmers in Jeju Island, Korea. (A) Bacterial blight symptoms in the foreground and carrot plant leaves of the sampled farmhouse. (B) Maximum likelihood algorithm for the dnaK, gryB, fyuA, ropD genes sequence showing the taxonomic positions of the isolates. The number of nodes represents a bootstrap value (>70%) and was calculated using maximum likelihood probabilities based on 1,000 replicates. (C) Symptoms on carrot leaves through plant inoculation of isolated X. hortorum pv. carotae (Xhc) JJ2001. Control is sterilized water treatment.

Fig. 2

Graphical circular map of the Xanthomonas hortorum pv. carotae genome and EggNOG annotations. (A) Outer circles represent genes on the sense and antisense strands (blue), and RNA genes (light green, tRNA; red, rRNA) are shown within the outer circles. Inner circles indicate the GC skew, with GC content shown in green and purple. Using the formula G-C/G + C, positive values indicate G dominance and negative values indicate C dominance. Thus, exterior green peaks indicate regions of higher G content and interior purple peaks indicate regions with higher C content. (B) EggNOG annotations, where the x-axis represents EggNOG categories and the y-axis represents the number of genes.

Fig. 3

Phylogenomic distribution of Xanthomonas hortorum pathovars. Genome sequences of 8 X. hortorum strains were compared using the MEGA tool with a maximum likelihood, 1,000 replication of bootstrap method. Each pathovars host plant group was painted with the assigned color.

Fig. 4

Comparison of Xanthomonas hortorum JJ2001 genome sequences with the genome of other Xanthomonas hortorum pathovar. (A) Mauve progressive alignment JJ2001, GCF_000505565.1 (pv. carotae), GCF_028580375.1 (pv. hederae), GCF_028370135.1 (pv. pelargonii), GCF_014338485.1 (pv. vitians), GCF_021352955.1 (pv. taraxaci), GCF_001908755.1 (pv. gardneri), and GCF_021352995.1 (pv. cynarae) genomes. (B) Physical map of Xanthomonas hortorum pathogenicity island in JJ2001 strains.

Table 1

Genome features of Xanthomonas hortorum pv. carotae strain JJ2001

Feature	Strain JJ2001
Size (bp)	5,458,083
No. of contigs	2
GC content (%)	63.63
Predicted CDS number	4,571
Ribosomal RNA number	6
Transfer RNA number	54
N50 (bp)^a	5,443,372
Genome repeat length (bp)	234,869
BUSCOs (%)^b	95.95

CDS, coding sequence.

^a N50: 50% of all bases come from contigs longer than this value.

^b BUSCO analysis was performed based on evolutionarily informed expectations of gene content from near-universal single-copy orthologs.

Table 2

Genomic features of four Xanthomonas hortorum pathovar

Feature	X. hortorum

	pv. carotae	pv. gardneri	pv. vitians	pv. taraxaci	pv. cynarae	pv. pelargonii	pv. hederae	JJ2001
Size (bp)	5,052,399	5,416,201	5,270,560	4,959,233	5,407,398	5,456,221	5,724,902	5,443,372
GC content (%)	63.84	63.49	63.64	63.86	63.40	63.58	63.57	63.63
Total genes	4,349	4,693	4,572	4,439	4,983	4,634	5,004	4,608
CDSs	4,104	4,444	4,295	4,039	4,473	4,424	4,643	4,547
rRNAs	3	6	6	3	3	6	3	6
tRNAs	50	55	56	51	52	54	51	54
Other RNAs^a	39	41	40	52	36	41	39	1
Pseudo genes	153	147	175	294	419	109	268	-
Accession no.	GCF_000	GCF_001	GCF_014	GCF_021	GCF_021	GCF_028	GCF_028	-
	505565.1	908755.1	338485.1	352955.1	352995.1	370135.1	580375.1

CDS, coding sequence.

^a Other RNAs includes non-coding RNA.

References

Adriko, J., Mbega, E. R., Mortensen, C. N., Wulff, E. G., Tushemereirwe, W. K., Kubiriba, J. and Lund, O. S. 2014. Improved PCR for identification of members of the genus Xanthomonas. Eur. J. Plant Pathol 138:293-306.

Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K. and Madden, T. L. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421.

Chin, C.-S., Alexander, D. H., Marks, P., Klammer, A. A., Drake, J., Heiner, C., Clum, A., Copeland, A., Huddleston, J., Eichler, E. E., Turner, S. W. and Korlach, J. 2013. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods 10:563-569.

du Toit, L. J., Crowe, F. J., Derie, M. L., Simmons, R. B. and Pelter, G. Q. 2005. Bacterial blight in carrot seed crops in the Pacific Northwest. Plant Dis 89:896-907.

du Toit, L. J., Derie, M. L., Christianson, C. E., Hoagland, L. and Simon, P. 2014. First report of bacterial blight of carrot in Indiana caused by Xanthomonas hortorum pv. carotae. Plant Dis 98:685.

Gilbertson, R. L. 2002. Bacterial leaf blight of carrot. In: Compendium of umbelliferous crop diseases, eds. by R. M. Davis and R. N. Raid, pp. 11-12. American Phytopathological Society, St Paul, MN, USA.

Hauben, L., Vauterin, L., Swings, J. and Moore, E. R. 1997. Comparison of 16S ribosomal DNA sequences of all Xanthomonas species. Int. J. Syst. Bacteriol 47:328-335.

Huerta-Cepas, J., Szklarczyk, D., Forslund, K., Cook, H., Heller, D., Walter, M. C., Rattei, T., Mende, D. R., Sunagawa, S., Kuhn, M., Jensen, L. J., von Mering, C. and Bork, P. 2016. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res 44:D286-D293.

Jones, P., Binns, D., Chang, H.-Y., Fraser, M., Li, W., McAnulla, C., McWilliam, H., Maslen, J., Mitchell, A., Nuka, G., Pesseat, S., Quinn, A. F., Sangrador-Vegas, A., Scheremetjew, M., Yong, S.-Y., Lopez, R. and Hunter, S. 2014. InterProScan 5: genome-scale protein function classification. Bioinformatics 30:1236-1240.

Krzywinski, M., Schein, J., Birol, I., Connors, J., Gascoyne, R., Horsman, D., Jones, S. J. and Marra, M. A. 2009. Circos: an information aesthetic for comparative genomics. Genome Res 19:1639-1645.

Maes, M. 1993. Fast classification of plant-associated bacteria in the Xanthomonas genus. FEMS Microbiol Lett 113:161-165.

Myung, I.-S., Yoon, M.-J., Lee, J.-Y., Kim, G.-D., Lee, M.-H., Hwang, E.-Y. and Shim, H. S. 2014. First report of bacterial leaf blight of carrot caused by Xanthomonas hortorum pv. carotae in Korea. Plant Dis 98:275.

Ondov, B. D., Treangen, T. J., Melsted, P., Mallonee, A. B., Bergman, N. H., Koren, S. and Phillippy, A. M. 2016. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol 17:132.

Parkinson, N., Aritua, V., Heeney, J., Cowie, C., Bew, J. and Stead, D. 2007. Phylogenetic analysis of Xanthomonas species by comparison of partial gyrase B gene sequences. Int. J. Syst. Evol. Microbiol 57:2881-2887.

Pruvost, O., Boyer, C., Robène-Soustrade, I., Jouen, E., Saison, A., Hostachy, B. and Benimadhu, S. 2010. First report of Xanthomonas hortorum pv. carotae causing bacterial leaf blight of carrot in Mauritius. Plant Dis 94:1069.

Seemann, T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068-2069.

Tatusova, T., DiCuccio, M., Badretdin, A., Chetvernin, V., Nawrocki, E. P., Zaslavsky, L., Lomsadze, A., Pruitt, K. D., Borodovsky, M. and Ostell, J. 2016. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res 44:6614-6624.

Vauterin, L., Hoste, B., Kersters, K. and Swings, J. 1995. Reclassification of Xanthomonas. Int. J. Syst. Evol. Microbiol 45:472-489.

Walker, B. J., Abeel, T., Shea, T., Priest, M., Abouelliel, A., Sakthikumar, S., Cuomo, C. A., Zeng, Q., Wortman, J., Young, S. K. and Earl, A. M. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9:e112963.

Young, J. M., Park, D.-C., Shearman, H. M. and Fargier, E. 2008. A multilocus sequence analysis of the genus Xanthomonas. Syst. Appl. Microbiol 31:366-377.

Zhang, Z., Schwartz, S., Wagner, L. and Miller, W. 2000. A greedy algorithm for aligning DNA sequences. J. Comput. Biol 7:203-214.