Pseudarthrobacter phenanthrenivorans strain MHSD1 is a bacterial endophyte isolated from sterilized leaves of Pellaea calomelanos, a medicinal plant capable of growing in arid environments. Here, we report the draft genome sequence and annotation of this bacterial endophyte. The draft genome sequence of P. phenanthrenivorans strain MHSD1 has 4 450 468 bp with a G + C content of 65.30%. The National Center for Biotechnology Information Prokaryotic Genome Annotation Pipeline identified a total of 4004 protein-coding genes, 56 genes coding for RNAs, and 82 pseudogenes. Biosynthesis pathways for various phytohormones such as auxin, salicylic acid, ethylene, cytokinin, jasmonic acid, abscisic acid, and gibberellins were identified. Putative genes involved in various characteristics of bacterial endophyte lifestyle such as transport, motility, adhesion, membrane proteins, secretion and delivery systems, plant cell wall modification, and detoxification were identified. Phylogenomic analysis showed P. phenanthrenivorans strain MHSD1 to be a subspecies of P. phenanthrenivorans Sphe3.
Endophytes are microorganisms, often bacteria or fungi, that are associated with plant tissues without causing any harm.1 These microorganisms can spend part or all of their life cycle within their plant hosts.2 In their relations with plants, they display various interactions that involve mutualism and antagonism but rarely parasitism.3 All plants are probably associated with endophytes.2,4 Endophytes promote plant growth by enhancing plant’s uptake of nutrients such as nitrogen, phosphate, and potassium; also, they biologically control plant pathogens as well as the production of secondary metabolites with pharmaceutical or biotechnological interest, and phytostimulation through the production of phytohormones5-8
Endophytes produce bioactive secondary metabolites; moreover, endophytes associated with medicinal plants are known to produce similar secondary metabolites as their plant host, with increased therapeutic potential.9,10 Thus, endophytes are alternate sources of bioactive secondary metabolites, as it is better to scale up the microbial fermentation process to increase the production of biologically active compounds, than use high amounts of plant materials, which can result in deforestation, decreased biodiversity, and conservation.9,11 As such, the prospects of isolating and identifying new endophyte species from plants can be beneficial. In a recent study, we isolated and identified bacterial endophytes associated with Pellaea calomelanos,12 a medicinal plant capable of growing in arid conditions.
Pellaea calomelanos is a fern that belongs to Pteridaceae family.13 The plant has healing properties for ailments such as asthma, head colds, coughs, and chest colds.14,15 One of the isolated bacterial endophytes was identified as Arthrobacter sp. MSHD1 using 16S ribosomal RNA (rRNA) gene sequence and biochemical characterization.12 The whole genome of this strain has been sequenced and the sequence data submitted to National Center for Biotechnology Information (NCBI). The draft genome sequence is described here.
Materials and Methods
Genomic DNA isolation, library preparation, and sequencing
Total genomic DNA was extracted from glycerol stock cultures, maintained on nutrient agar at 30°C for 48 hours using the Nucleospin Microbial DNA extraction kit as per the manufacturer’s protocol. The concentration and quality of isolated DNA were determined using the NanoDrop ND-2000 UV-Vis spectrophotometer. The DNA was sent to a commercial service provider, Agricultural Research Council, Onderstepoort, South Africa, for sequencing with Illumina MiSeq platform. Briefly, the library was prepared using NEBNextUltra II DNA kit following the manufacturer’s protocol with a paired-end sequencing strategy (300 bp insert size) using Illumina MiSeq instrument v3.
Pre-processing, genome assembly, and annotation
All pre-annotation analyses were performed on Galaxy web platform available at https://usegalaxy.org.16 FastQC v 0.69 was used to assess the quality of the raw reads.17 Using default parameters, the sequence reads were de novo assembled using Unicycler v 0.4.1.118 and assessed with Quast v 4.6.3.19 The draft genome sequence was submitted to NCBI and annotated using Prokaryotic Genome Annotation Pipeline (PGAP)20 and Rapid Annotations using Subsystems Technology (RAST) server.21-23
Bioinformatics
The phylogenomic analysis was undertaken with the Type Strain Genome Server (TYGS) available at https://tygs.dsmz.de/24 and OrthoANI (Orthologous Average Nucleotide Identity) with the Orthologous Average Nucleotide Identity Tool (OAT) software.25 The genomic islands (GI) were identified by screening the PGAP annotation file generated from NCBI on the IslandViewer 4 Web site (http://www.pathogenomics.sfu.ca/islandviewer/).26 The Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) were predicted by CRISPRCas finder software.27-29 The RAST server was used to annotate and classify predicted genes according to function.21-23 The shared and unique genes of MHSD1 were analyzed by comparing it with 1 bacterial endophyte genome (Arthrobacter sp. PAMC 25486) as well as 3 closely related genomes (Pseudarthrobacter chlorophenolicus A6, P. phenanthrenivorans Sphe3, P. sulfonivorans Ar51) using EDGAR 2.0.30 The genome was masked for repeats using RepeatMasker.31
Accession of the genome sequence
The data from this Whole Genome Shotgun project have been deposited at DDBJ/ENA/GenBank with BioProject number PRJNA549841 and BioSample number SAMN12098155 under the accession VHJD00000000. The version described here is VHJD01000000.
Interpretation of Data Set
The draft genome of strain MHSD1 had 56 contigs with a total length of 4 450 468 bp, G + C content of 65.30%, and an N50 value of 363 437 bp. Using PGAP, the predicted number of genes was 4142, of which 4004 of them were protein-coding genes (CDSs), 56 were RNAs, 82 were pseudogenes, and 3 were non-coding RNAs (ncRNAs). The predicted RNA coding genes include 50 transfer RNAs (tRNAs) and 3 rRNAs (5S, 16S, and 23S). A total of 8566 bp comprising 0.19% were masked. The genome features are in Table 1. The TYGS whole genome–based taxonomic analysis showed that MHSD1 was closely related to Pseudarthrobacter phenanthrenivorans strain Sphe3 as shown in Figures 1 and 2 (Supplementary Data). MHSD1 formed a sister clade with Sphe3 in both phylogenetic trees.
Genome features of Pseudarthrobacter phenanthrenivorans strain MHSD1.
Attribute
Value
Genomic size (bp)
4,450,468
GC content
65.30%
Total number of genes
4142
Protein coding genes
4004
Number of RNAs
56
rRNA genes
3
tRNA gene
50
Protein coding genes with function prediction
3553
CRISPR repeats
1
Abbreviations: CRISPR, Clustered Regularly Interspaced Short Palindromic Repeats; rRNA, ribosomal RNA; tRNA, transfer RNA.
MHSD1 had a digital DNA-DNA hybridization (dDDH) of 85.3% and G + C% content difference of 0.03 with P. phenanthrenivorans Sphe3 (Table 1, Supplementary Data). The observed dDDH of 85.3% was greater than the species boundary value of dDDH >70% for delineating bacterial species as closely related species32; in addition, this value exceeded the dDDH >79%-80% for delineating subspecies.24 Based on the TYGS phylogenomic classification, MHSD1 is delineated as a subspecies of P. phenanthrenivorans Sphe3; initial identification of MSHD1 using the 16S rRNA gene was Arthrobacter sp. strain MSHD1. MHSD1 showed lower OrthoANI values (<70%) (Figure 3, Supplementary Data) than the species boundary of >95%-96%.25 Although the OrthoANI values were lower, the delineation of MHSD1 as a subspecies of P. phenanthrenivorans Sphe3 was based on the dDDH value in the TYGS because it was an enhanced method for species delineation and facilitates classification and identification of species as well as subspecies by comparison with published and described type strains.24
Pseudarthrobacter phenanthrenivorans strain MHSD1 shared 682 common genes with other closely related Pseudarthrobacter species, and only 14 genes were shared with bacterial endophyte Arthrobacter sp. PAMC 25486 (Figure 1). A total of 1765 genes were common among MHSD1 and all the selected comparison species (Figure 1). The 14 exclusive common genes between MHSD1 and PAMC 25486 (results not shown) encode transport proteins and transcriptional regulators, which are essential bacterial endophyte genes that have been previously identified in other bacterial endophyte species.33,34Pseudarthrobacter phenanthrenivorans strain MHSD1 had 367 unique genes, whereas P. phenanthrenivorans strain Sphe3 had 231 unique genes; this can be attributed to MHSD1 having a larger genome length than Sphe3.
Venn diagram of shared and unique genes of Pseudarthrobacter phenanthrenivorans MHSD1 and selected comparison species. 1: Arthrobacter sp. PAMC 25486; 2: Pseudarthrobacter chlorophenolicus A6; 3: P phenanthrenivorans Sphe3; 4: P. phenanthrenivorans MHSD1; 5: Pseudarthrobacter sulfonivorans Ar51.
Pseudarthrobacter phenanthrenivorans strain MHSD1 was found to consist of several sets of genes acquired through horizontal gene transfer. As such, 24 GI (Figure 2) were identified in P. phenanthrenivorans strain MHSD1 genome when aligned to reference genome P. phenanthrenivorans Sphe3.35 The details of the genes clustered on the genomic islands are shown in Table 2 (Supplementary Data). We identified only one CRISPR system with 1 spacer and 11 repeats (Table 2). Functional classification of the genes in P. phenanthrenivorans strain MHSD1 based on RAST annotation (Figure 3) showed that most of the predicted genes are involved in carbohydrate metabolism, which is consistent with the bacterial endophyte lifestyle within which acquisition and mobilization of nutrients such as phosphate, nitrogen, and iron are important for symbiotic plant-bacteria interaction.36
Genetic islands of Pseudarthrobacter phenanthrenivorans MHSD1 aligned with reference genome of P phenanthrenivorans Sphe3. A total of 24 genetic islands were predicted using IslandViewer 4. The green outer circle represents the scale line of the genome in Mbps, and the obtained genomic islands are represented by the following colors: IslandPath-DIMOB (blue), SIGI-HMM (orange), and integrated detection (red). Gray lines indicate contig boundaries.
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) sequences present within Pseudarthrobacter phenanthrenivorans strain MHSD1 identified using CRISPRCasFinder.
Functional classification of predicted genes of Pseudarthrobacter phenanthrenivorans MHSD1 genome based on RAST annotation server. RAST indicates Rapid Annotations using Subsystems Technology.
In this study, several putative genes involved in bacterial endophyte behavior or lifestyle were predicted and compared with bacterial endophyte Enterobacter sp. 638 as well as nonendophyte P. phenanthrenivorans Sphe3 (Table 3, Supplementary Data). Genes putatively involved in transport, motility, adhesion, membrane proteins, secretion and delivery systems, plant cell wall modification, detoxification, substrate utilization, stress protection, and transcriptional regulators were identified. In addition, genes important in bacterial endophyte life style, such as those involved in nitrogen fixation and siderophore production, were identified in MHSD1. Although most of the genes present in bacterial endophytes, Enterobacter sp. 638 and P. phenanthrenivorans MHSD1, were also present in P. phenanthrenivorans Sphe3, distinctness of the bacterial endophytes was due to the presence of genes encoding transport proteins and transcriptional regulators important in endophytic behavior or lifestyle, which were not present in the latter. More work is currently underway to describe MHSD1 as a subspecies of P. phenanthrenivorans Sphe3.
Biosynthesis pathways of various phytohormones of P. phenanthrenivorans MHSD1 consist of various important plant hormones such as auxin, salicylic acid, ethylene, cytokinin, jasmonic acid, abscisic acid, and gibberellins (Figure 4, Supplementary Data). The phytohormones are essential for the development and growth of plants through various mechanisms such as cell elongation, division and differentiation, access to nutrients, stress tolerance, and defense against phytopathogens.37-39
Supplemental Material
supplementary_data_final_EB_xyz30819af645758_(1) – Supplemental material for Draft Genome Sequence of Pseudarthrobacter phenanthrenivorans Strain MHSD1, a Bacterial Endophyte Isolated From the Medicinal Plant Pellaea calomelanos
Supplemental material, supplementary_data_final_EB_xyz30819af645758_(1) for Draft Genome Sequence of Pseudarthrobacter phenanthrenivorans Strain MHSD1, a Bacterial Endophyte Isolated From the Medicinal Plant Pellaea calomelanos by Khuthadzo Tshishonga and Mahloro Hope Serepa-Dlamini in Evolutionary Bioinformatics
Footnotes
Funding:
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Research Foundation of South Africa (Thuthuka Grant No. TTK170405225920).
Declaration of conflicting interests:
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Author Contributions
Work was planned by MHS-D and executed by KT.
ORCID iD
Mahloro Hope Serepa-Dlamini
Supplemental Material
Supplemental material for this article is available online.
References
1.
SinghRDubeyAK.Endophytic actinomycetes as emerging source for therapeutic compounds. Indo Global J Pharm Sci. 2015;5:106-116.
2.
GoudaSDasGSenSKShinHPatraJK.Endophytes: a treasure house of bioactive compounds of medicinal importance. Front Microbiol. 2016;7:1538. doi:10.3389/fmicb.2016.01538.
3.
NairDNPadmavathyS.Impact of endophytic microorganisms on plants, environment and humans. Sci World J. 2014;2014:250693.
4.
DudejaSSGiriR.Beneficial properties, colonisation, establishment and molecular diversity of endophytic bacteria in legumes and non-legumes. Afr J Microbiol Res. 2014;8:1562-1572. doi:10.5897/AJMR2013.6541.
5.
HardoimPRvan OverbeekLSBergG, et al. The hidden world within plants: ecological and evolutionary considerations for defining functioning of microbial endophytes. Microbiol Mol Biol Rev. 2015;79:293-320. doi:10.1128/MMBR.00050-14.
6.
MaYRajkumarMZhangCFreitasH.Beneficial role of bacterial endophytes in heavy metal phytoremediation. J Environ Manage. 2016;174:14-25. doi:10.1016/j.jenvman.2016.02.047.
7.
VejanPAbdullahRKhadiranTIsmailSNasrulhaq BoyceA.Role of plant growth promoting rhizobacteria in agricultural sustainability—a review. Molecules. 2016;21:573. doi:10.3390/molecules21050573.
8.
SharmaIPChandraSKumarNChandraD. PGPR: heart of soil and their role in soil fertility. In: MeenaSishraPKBishtJKPattanayakA eds. Agriculturally Important Microbes for Sustainable Agriculture, Vol. 1. Singapore: Springer; 2017:51-67. doi:10.1007/978-981.
9.
StrobelGA.Endophytes as sources of bioactive products. Microbes Infect. 2003;5:535-544.
10.
SubbulakshmiGKThalavaipandianABagyalakshmiRVRajendranA.Bioactive endophytic fungal isolates of Biota orientalis (L) Endl., Pinus excelsa wall and Thuja occidentalis L. Int J Adv Life Sci. 2012;4:9-15.
11.
AlvinAMillerKINeilanBA.Exploring the potential of endophytes from medicinal plants as sources of antimycobacterial compounds. Microbiol Res. 2014;169:483-495.
12.
MahlanguSGSerepa-DlaminiMH.First report of bacterial endophytes from the leaves of Pellaea calomelanos in South Africa. S Afr J Sci. 2018;114:55-63.
13.
SchuettplezESchneiderHHuietLWindhamMDPryerKM.A molecular phylogeny of the fern family Pteridaceae: assessing overall relationships and the affinities of previously unsampled genera. Molec Phylogenet Evol. 2007;44:1172-1185.
14.
HutchingsAScottAHLewisGCunninghamAB.Zulu Medicinal Plants: An Inventory. Pietermaritzburg, South Africa: University of Natal Press; 1996.
AfganEBakerDvan den BeekM, et al. The galaxy platform for accessible, reproducible and collaborative biomedical analyses. Nucleic Acids Res. 2016;44:3-10.
WickRRJuddLMGorrieCLHoltKE.Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol. 2017;13:e1005595. doi:10.1371/journal.pcbi.1005595.
19.
GurevichASavelievVVyahhiNTeslerG.QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29:1072-1075.
AzizRKBartelsDBestAA, et al. The RAST server: rapid annotations using subsystems technology. BMC Genomics. 2010;9:75.
22.
OverbeekROlsonRPuschGD, et al. The SEED and the rapid annotation of microbial genomes using subsystems technology (RAST). Nucleic Acids Res. 2014;42:D206-D214.
23.
BrettinTDavisJJDiszT, et al. RASTtk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci Rep. 2015;5:8365.
24.
Meier-KolthoffJPGokerM.TYGS is an automated high-throughput platform for state-of-the-art genome-based taxonomy. Nat Commun. 2019;10:2182.
25.
LeeIKimYOParkSCChunJ.OrthoANI: an improved algorithm and software for calculating average nucleotide identity. Int J Syst Evol Microbiol. 2015;66:1100-1103.
26.
BertelliCLairdMRWilliamsKP, et al. IslandViewer 4: expanded prediction of genomic islands for larger-scale datasets. Nucleic Acids Res. 2017;45:W30-W35.
27.
GrissaIVergnaudGPourcelC.CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 2007;35:W52-W57.
28.
AbbySSNeronBMenagerHTouchonMRochaEP.MacSyFinder: a program to mine genomes for molecular systems with an application to CRISPR-cas systems. PLoS ONE. 2014;9:e110726. doi:10.1371/journal.pone.0110726.
29.
CouvinDBernheimAToffano-NiocheC, et al. CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for cas proteins. Nucleic Acids Res. 2018;46:W246-W251.
30.
BlomJKreisJSpanigS, et al. EDGAR 2.0: an enhanced software platform for comparative gene content analyses. Nucleic Acids Res. 2016;44:W22-W28. doi:10.1093/nar/gkw255.
AuchAFKlenkHPGokerM.Standard operating procedure for calculating genome-to-genome distances based on high-scoring segment pairs. Stand Genomic Sci. 2010;2:142-148.
33.
TaghaviSvan der LelieDHoffmanA, et al. Genome sequence of the plant growth promoting endophytic bacterium Enterobacter sp. 638. PLoS Genet. 2010;6:e1000943.
34.
AliDDuanJCharlesTCGlickBR.A bioinformatics approach to the determination of genes involved in endophytic behaviour in Burkholderia spp. J Theor Biol. 2014;343:193-198. doi:10.1016/j.jtbi.2013.10.007.
35.
KallimanisALabuttiKMLapidusA, et al. Complete genome sequence of Arthrobacter phenanthrenivorans type strain (Sphe3). Stand Genomic Sci. 2011;4:123-130.
36.
PinskiABetekhtinAHupert-KocurekKMurLAJHasterokR.Defining the genetic basis of plant–endophytic bacteria interactions. Int J Mol Sci. 2018;20:1947. doi:10.3390/ijms20081947.
37.
JasimBJimthaJCJyothisMRadhakrishnanE.Plant growth promoting potential of endophytic bacteria isolated from Piper nigrum. J Plant Growth Regul. 2013;71:1-11.
38.
TaghaviSGarafolaCMonchyS, et al. Genome survey and characterization of endophytic bacteria exhibiting a beneficial effect on growth and development of poplar trees. Appl Environ Microbiol. 2009;75:748-757.
39.
HayatSAhmadA.Salicylic Acid—A Plant Hormone. Dordrecht, The Netherlands: Springer; 2007.
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.