Evaluation of Primers Targeting the Diazotroph Functional ...

15
ORIGINAL RESEARCH published: 30 April 2018 doi: 10.3389/fmicb.2018.00703 Edited by: Marcus A. Horn, Leibniz University of Hanover, Germany Reviewed by: Oliver Schmidt, University of Bayreuth, Germany Petri Penttinen, Zhejiang Agriculture and Forestry University, China *Correspondence: Dagmar Woebken [email protected] Present address: Roey Angel, Soil and Water Research Infrastructure, Biology Centre CAS, ˇ Ceské Bud ˇ ejovice, Czechia Specialty section: This article was submitted to Terrestrial Microbiology, a section of the journal Frontiers in Microbiology Received: 28 January 2018 Accepted: 27 March 2018 Published: 30 April 2018 Citation: Angel R, Nepel M, Panhölzl C, Schmidt H, Herbold CW, Eichorst SA and Woebken D (2018) Evaluation of Primers Targeting the Diazotroph Functional Gene and Development of NifMAP – A Bioinformatics Pipeline for Analyzing nifH Amplicon Data. Front. Microbiol. 9:703. doi: 10.3389/fmicb.2018.00703 Evaluation of Primers Targeting the Diazotroph Functional Gene and Development of NifMAP – A Bioinformatics Pipeline for Analyzing nifH Amplicon Data Roey Angel , Maximilian Nepel, Christopher Panhölzl, Hannes Schmidt, Craig W. Herbold, Stephanie A. Eichorst and Dagmar Woebken* Division of Microbial Ecology, Department of Microbiology and Ecosystem Science, Research Network ‘Chemistry meets Microbiology’, University of Vienna, Vienna, Austria Diazotrophic microorganisms introduce biologically available nitrogen (N) to the global N cycle through the activity of the nitrogenase enzyme. The genetically conserved dinitrogenase reductase (nifH) gene is phylogenetically distributed across four clusters (I–IV) and is widely used as a marker gene for N 2 fixation, permitting investigators to study the genetic diversity of diazotrophs in nature and target potential participants in N 2 fixation. To date there have been limited, standardized pipelines for analyzing the nifH functional gene, which is in stark contrast to the 16S rRNA gene. Here we present a bioinformatics pipeline for processing nifH amplicon datasets – NifMAP (“NifH MiSeq Illumina Amplicon Analysis Pipeline”), which as a novel aspect uses Hidden-Markov Models to filter out homologous genes to nifH. By using this pipeline, we evaluated the broadly inclusive primer pairs (Ueda19F–R6, IGK3–DVV, and F2–R6) that target the nifH gene. To evaluate any systematic biases, the nifH gene was amplified with the aforementioned primer pairs in a diverse collection of environmental samples (soils, rhizosphere and roots samples, biological soil crusts and estuarine samples), in addition to a nifH mock community consisting of six phylogenetically diverse members. We noted that all primer pairs co-amplified nifH homologs to varying degrees; up to 90% of the amplicons were nifH homologs with IGK3–DVV in some samples (rhizosphere and roots from tall oat-grass). In regards to specificity, we observed some degree of bias across the primer pairs. For example, primer pair F2–R6 discriminated against cyanobacteria (amongst others), yet captured many sequences from subclusters IIIE and IIIL-N. These aforementioned subclusters were largely missing by the primer pair IGK3–DVV, which also tended to discriminate against Alphaproteobacteria, but amplified sequences within clusters IIIC (affiliated with Clostridia) and clusters IVB and IVC. Primer pair Ueda19F–R6 exhibited the least bias and successfully captured diazotrophs in cluster I and subclusters IIIE, IIIL, IIIM, and IIIN, but tended to discriminate against Firmicutes and subcluster IIIC. Taken together, our newly established bioinformatics pipeline, NifMAP, along with our systematic evaluations of nifH primer pairs permit more robust, high-throughput investigations of diazotrophs in diverse environments. Keywords: nitrogen fixation, primer evaluation, nifH gene, Illumina amplicon sequencing, NifMAP Frontiers in Microbiology | www.frontiersin.org 1 April 2018 | Volume 9 | Article 703

Transcript of Evaluation of Primers Targeting the Diazotroph Functional ...

fmicb-09-00703 April 27, 2018 Time: 16:27 # 1

ORIGINAL RESEARCHpublished: 30 April 2018

doi: 10.3389/fmicb.2018.00703

Edited by:Marcus A. Horn,

Leibniz University of Hanover,Germany

Reviewed by:Oliver Schmidt,

University of Bayreuth, GermanyPetri Penttinen,

Zhejiang Agriculture and ForestryUniversity, China

*Correspondence:Dagmar Woebken

[email protected]

†Present address:Roey Angel,

Soil and Water ResearchInfrastructure, Biology Centre CAS,

Ceské Budejovice, Czechia

Specialty section:This article was submitted to

Terrestrial Microbiology,a section of the journal

Frontiers in Microbiology

Received: 28 January 2018Accepted: 27 March 2018

Published: 30 April 2018

Citation:Angel R, Nepel M, Panhölzl C,

Schmidt H, Herbold CW, Eichorst SAand Woebken D (2018) Evaluation

of Primers Targeting the DiazotrophFunctional Gene and Development

of NifMAP – A Bioinformatics Pipelinefor Analyzing nifH Amplicon Data.

Front. Microbiol. 9:703.doi: 10.3389/fmicb.2018.00703

Evaluation of Primers Targeting theDiazotroph Functional Gene andDevelopment of NifMAP – ABioinformatics Pipeline for AnalyzingnifH Amplicon DataRoey Angel†, Maximilian Nepel, Christopher Panhölzl, Hannes Schmidt,Craig W. Herbold, Stephanie A. Eichorst and Dagmar Woebken*

Division of Microbial Ecology, Department of Microbiology and Ecosystem Science, Research Network ‘Chemistry meetsMicrobiology’, University of Vienna, Vienna, Austria

Diazotrophic microorganisms introduce biologically available nitrogen (N) to the globalN cycle through the activity of the nitrogenase enzyme. The genetically conserveddinitrogenase reductase (nifH) gene is phylogenetically distributed across four clusters(I–IV) and is widely used as a marker gene for N2 fixation, permitting investigators tostudy the genetic diversity of diazotrophs in nature and target potential participants inN2 fixation. To date there have been limited, standardized pipelines for analyzing thenifH functional gene, which is in stark contrast to the 16S rRNA gene. Here we presenta bioinformatics pipeline for processing nifH amplicon datasets – NifMAP (“NifH MiSeqIllumina Amplicon Analysis Pipeline”), which as a novel aspect uses Hidden-MarkovModels to filter out homologous genes to nifH. By using this pipeline, we evaluatedthe broadly inclusive primer pairs (Ueda19F–R6, IGK3–DVV, and F2–R6) that targetthe nifH gene. To evaluate any systematic biases, the nifH gene was amplified withthe aforementioned primer pairs in a diverse collection of environmental samples (soils,rhizosphere and roots samples, biological soil crusts and estuarine samples), in additionto a nifH mock community consisting of six phylogenetically diverse members. Wenoted that all primer pairs co-amplified nifH homologs to varying degrees; up to 90%of the amplicons were nifH homologs with IGK3–DVV in some samples (rhizosphereand roots from tall oat-grass). In regards to specificity, we observed some degree ofbias across the primer pairs. For example, primer pair F2–R6 discriminated againstcyanobacteria (amongst others), yet captured many sequences from subclusters IIIEand IIIL-N. These aforementioned subclusters were largely missing by the primer pairIGK3–DVV, which also tended to discriminate against Alphaproteobacteria, but amplifiedsequences within clusters IIIC (affiliated with Clostridia) and clusters IVB and IVC. Primerpair Ueda19F–R6 exhibited the least bias and successfully captured diazotrophs incluster I and subclusters IIIE, IIIL, IIIM, and IIIN, but tended to discriminate againstFirmicutes and subcluster IIIC. Taken together, our newly established bioinformaticspipeline, NifMAP, along with our systematic evaluations of nifH primer pairs permit morerobust, high-throughput investigations of diazotrophs in diverse environments.

Keywords: nitrogen fixation, primer evaluation, nifH gene, Illumina amplicon sequencing, NifMAP

Frontiers in Microbiology | www.frontiersin.org 1 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 2

Angel et al. Evaluating nifH Primers and NifMAP

INTRODUCTION

Nitrogen gas (N2) fixing microorganisms (diazotrophs) are oneof the most ecologically important functional guilds on Earth,providing the primary natural source for nitrogen to ecosystemsthrough biological N2 fixation (BNF; Fowler et al., 2013).Isotopic evidence suggests that BNF has emerged as early asca. 3.2 Gyr ago (Stüeken et al., 2015). It is thought to haveevolved in an anaerobic archaeon and was later transferred toan aerobic bacterium (Boyd et al., 2015). Considering the timescale of the BNF evolution and the importance of N2 fixation,it is not surprising that the genetic potential to perform N2fixation (i.e., the nif genes) is found widely among differentbranches in the phylogenetic trees of archaea and bacteria. It isestimated that 6–15% of all sequenced microbial genomes harborthe minimum number of nif genes to provide them with thegenetic capacity to fix N2 (Dos Santos et al., 2012; Boyd et al.,2015), making diazotrophs one of the most diverse functionalguilds.

Among all known diazotrophs, N2 fixation is mediated bythe nitrogenase enzyme complex along with ca. 20 functionaland regulatory genes that are organized in several operonstermed together the Nif regulon (Rubio and Ludden, 2002).Of those genes, the nifH gene encoding for the dinitrogenasereductase is considered one of the most genetically conservedgenes in the regulon and has been traditionally used as a markergene for studying the genetic diversity of diazotrophs in nature(Zehr et al., 2003; Gaby and Buckley, 2011). In addition to thecanonical MoFe nitrogenase that is shared among all knowndiazotrophs, some diazotrophs possess in their genome one ofthe two alternative versions of nitrogenase, which use eitherVFe (Vnf ) or FeFe (Anf ) as metal cofactors (Raymond et al.,2004).

Based on phylogenetic analysis, the nifH genes typicallyform four clusters, termed clusters I–IV (Zehr et al., 2003;Raymond et al., 2004). Most nifH sequences fall into clusterI, which is composed almost entirely of sequences from thecanonical MoFe nitrogenase. This cluster typically consists ofsequences from Cyanobacteria, Frankia, Proteobacteria, andsome are affiliated with Clostridia, Bacilli, and Nitrospirae. Inaddition, cluster I contains some sequences of the alternativenitrogenase vnfH. Cluster II comprises nearly all sequences of thealternative vnfH nitrogenase as well as all the known sequencesof the second alternative nitrogenase, anfH. In addition, thenifH genes of some Archaea also fall into cluster II. ClusterIII consists mostly of nifH sequences of anaerobic bacteriaand archaea such as methanogens, spirochetes, sulfate reducers,non-sulfur purple bacteria, green sulfur bacteria, acetogens andClostridia. Cluster IV was considered until recently to onlycontain “uncharacterized” or “non-functional” nifH sequences,but in 2015 the first isolate of a diazotroph containing an activenitrogenase belonging to cluster IV was cultivated from a termitegut (Zheng et al., 2016). While it has been shown in several purecultures that most of the known cluster IV nitrogenases do notencode for a protein involved in N2 fixation (e.g., Staples et al.,2007), it is certainly possible that some of the environmentalsequences in this cluster do encode for active nitrogenases.

The dinitrogenase reductase gene is found in nearly everyenvironment studied so far (Gaby and Buckley, 2011), andenvironmental genetic surveys of the nifH gene have producedextensive datasets encompassing several tens of thousands ofunique sequences (Gaby and Buckley, 2014; Heller et al., 2014),and novel sequences continue to be discovered. Such PCR-basedsurveys continue to serve as an important tool for studying thediazotroph diversity and, if the nifH transcripts are targeted (viacDNA), can also reveal transcription patterns in the environment.Yet, the choice of primer pair can have a significant impacton the extent of diversity that is uncovered. Since the firstdegenerate primers were used to amplify environmental DNA(Kirshtein et al., 1991), several dozens of primer pairs targetingthe nifH gene have been designed to serve as group-specific orgeneral nifH primers. In 2012, Gaby and Buckley published anextensive review of all known primers for nifH, and evaluatedtheir performance in silico using a comprehensive databaseof nifH sequences (Gaby and Buckley, 2012). Through theiranalysis, several primer pairs were identified for their potentialto capture the largest diversity of nifH sequences and weretested for their ability to produce a PCR product from DNAfrom several diazotrophic strains and two soil samples (Gabyand Buckley, 2012). However, the extent to which these primersare able to capture nifH diversity in environmental samplesand their potential preferences of amplification have not beentested. Furthermore, the majority of the nifH sequences in publicdatabases have sequence data between positions 100 and 500bases (positions are relative to Azotobacter vinelandii), making itchallenging to perform in silico coverage estimates of primer pairsflanking this region.

Given the importance of exploring the diazotrophiccommunities in the environment and the need for bioinformaticspipelines for analysis of the nifH gene, we developed abioinformatics pipeline for processing nifH amplicon datasetsderived from the MiSeq Illumina sequencing platform – NifMAP(“NifH MiSeq Illumina Amplicon Analysis Pipeline”). Usingthis pipeline, our further goal of this study was to evaluate theperformance of general primer sets – selected based on theirhigh in silico coverage for amplifying nifH – via high-throughputsequencing of a mock community and across a diverse collectionof environmental samples. We discuss the performance ofthe tested primer pairs and provide a standard operatingprocedure for analyzing nifH genes using high-throughputsequencing.

MATERIALS AND METHODS

Primers Used in This StudyFour forward primers: Nh21F, Ueda19F, F2, and IGK3 andfour reverse primers: nifH1, nifH3, R6, and DVV were chosenfollowing the in silico analysis of Gaby and Buckley (2012)(Table 1). The forward primers Nh21F, Ueda19F, and F2 andthe reverse primers nifH1, nifH3, and R6 were tested in allnine combinations. In addition, the primer pair IGK3–DVV wastested, as it was suggested as the best performing primer pair inGaby and Buckley (2012).

Frontiers in Microbiology | www.frontiersin.org 2 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 3

Angel et al. Evaluating nifH Primers and NifMAP

Samples Used in This StudyThirteen different samples were used in this study: soil samplesfrom (a) an Austrian beech forest (Rasche et al., 2011) and (b) anAustrian meadow (Angeringer and Karrer, 2008); (c) rhizosphereand (d) root of Arrhenatherum elatius (tall oat-grass) from anAustrian grassland site (Pötsch et al., 2013); (e) rhizosphere and(f) root of Oryza sativa (wetland rice, grown in the greenhouseon paddy soil from Vercelli, Italy); biological soil crusts (BSCs)from (g) a coastal, sub-arctic crust, Sweden, (h) a temperate crust,Germany, (i) a high Alpine crust, Austria, (j) a semiarid crust,Spain (Büdel et al., 2014) and (k) an arid crust, Israel (Angel andConrad, 2009); and two estuarine samples from (l) the Great Beltthat is separating the North Sea and the Baltic Sea and (m) theRoskilde Fjord, Denmark (Bentzon-Tilia et al., 2015).

DNA ExtractionWith the exception of the water samples, DNA was extracted from0.4 g of soil (or 0.2–0.4 g root and rhizosphere samples) usingmodification of a standard bead-beating protocol in the presenceof a CTAB buffer and phenol, according to a previously publishedphenol/chloroform-based extraction protocol (Angel, 2012).Following extraction, samples were purified using OneStepTM

PCR Inhibitor Removal Kit (Zymo, Irvine, CA, United States).DNA from the water samples were also obtained with aphenol/chloroform-based protocol, as described in Boströmet al. (2004). DNA was extracted from biological replicates: theAustrian beech forest (n = 6); Austrian meadow soil (n = 6);BSCs (n = 2/type); root and rhizosphere samples (n = 3/type);and estuarine samples (n = 2).

PCR Amplification and SequencingFor an initial pre-screening of the different nifH-primercombinations, DNA from the beech forest and meadow soils wereused for PCR amplification of the nifH gene fragment, and thePCR products were evaluated using agarose gel electrophoresis.Amplifications were performed in 25 µl volume using thefollowing mixture: 2.5 µl of 10× DreamTaq Green Buffer, 2 mM

MgCl2, 0.2 mM of each nucleotide dNTP mixture, 0.08 µg µl−1

of BSA, 0.625 U of DreamTaq Green DNA Polymerase (all fromThermo Fisher Scientific, Waltham, MA, United States) and0.8 µM of each primer (Thermo Fischer Scientific, Waltham,MA, United States) and 1 µl of DNA template. The primerswere designed to include a universal 16 bp head sequence attheir 5′ end for subsequent barcoding (Herbold et al., 2015).The following program was used for amplification: 94◦C for5 min followed by 35 cycles of 94◦C for 30 s, 52◦C for 45 s,and 72◦C for 30 s, and a single step of final elongation at72◦C for 10 min. Sequencing of amplified nifH genes was doneusing multiplexed barcoded amplicon sequencing on an IlluminaMiSeq platform (Illumina, San Diego, CA, United States), asdescribed previously (Herbold et al., 2015). First-step PCRamplifications were performed in triplicates of 25 µl each, usingthe mixture and conditions described above, except that PCRswere done in 25 cycles. Following PCR amplification, sampleswere purified using ZR-96 DNA Clean-up KitTM (Zymo, Irvine,CA, United States) and 3 µl from the purified sample wasused for a second PCR reaction, which was 50 µl in volumeand contained the following mixture: 5 µl of 10× DreamTaqGreen Buffer, 2 mM MgCl2, 0.2 mM of each nucleotide dNTPmixture, 0.08 µg µl−1 of BSA, 1.25 U of DreamTaq Green DNAPolymerase (all from Thermo Fisher Scientific, Waltham, MA,United States) and 0.4 µM of a barcode primer, which alsocontained a universal 16 bp head sequence at the 5′ end (Herboldet al., 2015). The following program was used for amplification:94◦C for 5 min followed by 10 cycles of 94◦C for 30 s, 52◦C for45 s, and 72◦C for 45 s, and a single step of final elongation at72◦C for 10 min. Following this barcoding PCR step, the sampleswere purified using ZR-96 DNA Clean-up KitTM (Zymo, Irvine,CA, United States), quantified using Quant-iTTM PicoGreen R©

dsDNA Assay Kit (Thermo Fisher Scientific, Waltham, MA,United States) on a Tecan Safire plate reader (Tecan, Männedorf,Switzerland) and pooled in equimolar amounts of 20× 109 copiesper sample. Library preparation and sequencing services wereprovided by Microsynth (Balgach, Switzerland). The library wasprepared by adaptor ligation and PCR using the TruSeq Nano

TABLE 1 | Summary of the nifH primers used in this study.

Primer name Direction1 Sequence (5′ to 3′) Position2 Reference Coverage (%) with mismatches3

0 1 2

Nh21F F GCI WTY TAY GGN AAR GG 19–35 Deslippe et al., 2005 81% 94% 98%

Ueda19F F GCI WTY TAY GGI AAR GGI GG 19–38 Ueda et al., 1995 81% 94% 98%

F2 F TGY GAY CCI AAI GCI GA 115–131 Marusina et al., 2001 87% 91% 98%

IGK3 F GCI WTH TAY GGI AAR GGI GGI ATH GGI AA 19–47 Ando et al., 2005 83% 96% 98%

nifH1 R ADN GCC ATC ATY TCN CC 406–476 Zehr and McReynolds, 1989 85% 93% 95%

nifH3 R ATR TTR TTN GCN GCR TA 478–494 Zani et al., 2000 89% 93% 96%

R6 R GCC ATC ATY TCI CCI GA 457–473 Marusina et al., 2001 90% 93% 96%

DVV R ATI GCR AAI CCI CCR CAI ACI ACR TC 388–413 Ando et al., 2005 96% 98% 99%

1Either forward (F) or reverse (R).2Position is relative to A. vinelandii nifH (GenBank ACCN# M20568).3Matching of the primers to nifH sequences in the “nifH_2014April04.arb” database (Heller et al., 2014; 41,231 sequences in the database). The number of sequencesin the database that extended into each respective primer matching region are: Nh21F (2156); Ueda19F (2156); F2 (14,529); IGK3 (2156); nifH1 (9788); nifH3 (5523); R6(13,673); and DVV (30,142). The percentages refer to the fraction of these aforementioned sequences covered by the primers with either 0, 1, or 2 mismatches.

Frontiers in Microbiology | www.frontiersin.org 3 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 4

Angel et al. Evaluating nifH Primers and NifMAP

DNA Library Prep Kit (Illumina, Cat FC-121-4001) accordingto the TruSeq Nano protocol (Illumina, FC-121-4003), butexcluding the fragmentation step. Sequencing was performed ona MiSeq platform (Illumina, San Diego, CA, United States). TheMiSeq was run in the 2 × 300 cycle configuration using theMiSeq Reagent kit v3 (Illumina, San Diego, CA, United States).The raw sequence data were deposited into the NCBI ShortRead Archive under BioProject accession number PRJNA432667.

Construction of Hidden-Markov Modelsfor Filtering Non-nifH Reads and forAligning nifH SequencesNon-nifH reads were filtered out using four Hidden-MarkovModels (HMMs). One model (hmm_nuc_1160_nifH.hmm)was based on 1160 nucleotide sequences from thenifH_2014April04.arb database (Heller et al., 2014),with sequence data matching the amplification regionof Ueda19F–R6 primer pair (the longest ampliconin our test). Three other models (bchX.hmm, chlL-bchL.hmm, and Zehr_2014_1812genomes_nifH_AA.hmm)were based on amino acid sequences. The modelZehr_2014_1812genomes_nifH_AA.hmm was based on 1812aligned nifH amino acid sequences from sequenced genomes,obtained from the nifH_2014April04.arb database. The modelbchX.hmm included 79 amino acid sequences from the NCBIEntrez Protein Cluster chlorophyllide reductase iron proteinsubunit X (bchX, PCLA_924109; also useful in discriminatingagainst the chromosome partitioning protein (parA) sequences),and chlL-bchL.hmm included 250 amino acid sequences fromtwo light-independent protochlorophyllide reductase iron-sulfurATP-binding proteins (chlL, CHL00072 and bchL, PCLA_858313,PCLA_3385405). Because bchX and chlL-bchL sequences tend tobe more similar to nifH sequences than to each other, an attemptto generate a single HMM covering all three homologs butexcluding nifH was unsuccessful. The sequences for the HMMshmm_nuc_1160_nifH.hmm, bchX.hmm, and chlL-bchL.hmmwere aligned using MAFFT (V7.271; Katoh and Standley,2013), employing the L-INS-i algorithm, and then individualHMMs were built from each multiple sequence alignmentusing the hmmbuild in HMMER V3.1 (Mistry et al., 2013). Forconvenience, a single HMM database (nifH_chlL_bchX.hmm)was generated from the three amino-acid based HMMs usingcommand hmmpress in HMMER.

NifMAP – An Automated Pipeline forAnalyzing nifH Amplicon ReadsTo process the raw MiSeq amplicon reads, we devisedthe following pipeline (Figure 1): (1) Paired raw MiSeqreads were assembled into contigs using QIIME’sjoin_paired_ends.py (Caporaso et al., 2012). (2) The mergedcontigs were filtered against the nucleotide-based HMM(hmm_nuc_1160_nifH.hmm) using hmmsearch command inHMMER. All reads passing through this filtering step wereaccepted for the next step, irrespective of the model match score.(3) Sequences were chimera-checked and clustered using the

UPARSE pipeline (Edgar, 2013). First, contigs were dereplicatedwith the -derep_fullength command and singleton uniquesequences were removed. OTU centroids were then determinedwith the -cluster_otus command (using 3% radius). Abundancesof OTUs were determined by mapping the filtered contigs (priorto dereplication) to OTU centroids using the -usearch_globalcommand (at a 0.97% identity, hereafter referred to as OTU97).(4) Translation into amino acids and (potential) frameshiftcorrections were done using FrameBot (Wang et al., 2013)against the nifH protein reference set. (5) Homologous genesto the nifH gene (bchX, chlL, bchL, and parA) were filtered outagainst the HMM nifH_ChlL_bchX.hmm (see above) using thehmmscan command in HMMER. Only OTU representatives thatscored highest against the nifH model compared with the bchXand chlL-bchL models were retained. (6) OTU classification andphylogenetic placement: the remaining OTU representativeswere classified using BLASTP (Camacho et al., 2009) againstthe RefSeq database (Pruitt et al., 2005). In addition, OTUrepresentatives were aligned using hmmalign to the HMMZehr_2014_1812genomes_nifH_AA.hmm and assigned tophylogenetic clusters of nifH using Classification And RegressionTrees (CART; Frank et al., 2016). OTU representatives werealso placed on a base tree using the Evolutionary PlacementAlgorithm (EPA) implementation in RAxML (Stamatakis,2014). The base tree was generated as follows: (1) All entriescontaining amino acid sequence information for the nifH genein the nifH_2014April04.arb database were extracted. Theseincluded 1971 entries from genomes and 39,258 entries fromnon-genome sequencing sources. (2) All sequences shorterthan 133 AA were filtered out and the remaining sequenceswere dereplicated. (3) The remaining sequences were clusteredat a clustering threshold of 90% identity using CD-HIT (Fuet al., 2012). In addition, the sequences used for constructingthe bchX and chlL-bchL HMMs were clustered at a thresholdof 80% identity using CD-HIT and merged with the clusterednifH sequences. The combined dataset was then aligned usingMAFFT L-INS-i against the aligned collection of 1812 amino acidsequences of genomic origin used for constructing the HMMZehr_2014_1812genomes_nifH_AA.hmm, in order to maintainan alignment compatible with the nifH_2014April04.arbdatabase. Lastly, a bootstrapped maximum likelihood treebased on a CAT model was reconstructed using RAxML.For placing the OTU representatives on the base tree, thesequences were aligned using MAFFT against the alignmentused for constructing the base tree and then added to thebase tree using RAxML, employing the EPA. Steps 1 and3 of the pipeline are identical to the procedures describedin Herbold et al. (2015), while steps 2, 4, 5, and 6 are newfor this work. The HMMs, base tree and shell script forreproducing steps 2, 4, 5, and 6 are publicly available athttps://github.com/roey-angel/NifMAP.

Design and Testing of a nifH MockCommunityA nifH mock community was developed to estimate variationsin sequencing quality amongst runs and cross-contamination

Frontiers in Microbiology | www.frontiersin.org 4 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 5

Angel et al. Evaluating nifH Primers and NifMAP

FIGURE 1 | Schematic overview of NifMAP – NifH MiSeq Illumina AmpliconAnalysis Pipeline. Pipeline begins with raw MiSeq reads from desiredsequencing facility (depicted in gray). Steps shaded in blue represent standardprocessing steps, while steps shaded in green represent steps specific forprocessing nifH sequences, introduced in this work.

within a run (Bokulich et al., 2016). This mock community wascomprised of DNA from the following six diazotrophic species:Anabaena torulosa (Carm.) Lagerh. (Cyanobacteria) AlgaeCollection Vienna (ASW 01028), Desulfosporosinus acidophilusstrain SJ4T (DSM22704) (Firmicutes) (Alazard et al., 2010),Kosakonia sacchari (Gammaproteobacteria) (in house strain, 16SrRNA gene sequence 99.89% identical (1105 bp) to the 16SrRNA gene sequence of K. sacchari SP1T (Chen et al., 2014),Mesorhizobium loti strain R7A (Alphaproteobacteria) (Kellyet al., 2014), Nostoc microscopicum Vaucher (Cyanobacteria)Algae Collection Vienna (ASW 01020), and Telmatospirillumsiberiense 26-4b1T (Alphaproteobacteria) (Hausmann et al.,2018). DNA was extracted using Qiagen DNeasy Blood andTissue Kit (Qiagen) according to the manufacturer’s instructionsexcept that 200 µl of culture was first transferred to LysingMatrix A tube and cells were disrupted using bead beating(30 s at 4 m s−1; FastPrep-24, MP Biomedicals, Santa Ana,CA, United States) after adding PBS and AL buffers. DNAwas used for PCR amplification of the nifH gene usingprimers Ueda19F–R6 as described above. PCR products werecloned using TOPO TA Cloning Kit (Thermo Fisher Scientific,Waltham, MA, United States). One clone containing nifHgene fragment from each strain was used for colony PCRamplification using primers M13, flanking the insert region(Thermo Fisher Scientific, Waltham, MA, United States). PCRwas performed with the following mixture: 5 µl of 10×DreamTaq Green Buffer, 2 mM MgCl2, 0.2 mM of eachnucleotide dNTP mixture, 0.05 µg µl−1 of BSA, 1.25 U ofDreamTaq Green DNA Polymerase (all from Thermo FisherScientific, Waltham, MA, United States) and 1 µM of eachprimer and the following conditions for amplification: 94◦Cfor 5 min followed by 30 cycles of 94◦C for 60 s, 55◦Cfor 60 s, and 72◦C for 60 s, and a single step of finalelongation at 72◦C for 10 min. Following amplification andpurification with ZR-96 DNA Clean-up KitTM (Zymo, Irvine,CA, United States), PCR products were quantified usingQuant-iTTM PicoGreen R© dsDNA Assay Kit (Thermo FisherScientific, Waltham, MA, United States) on a Tecan Safireplate reader (Tecan, Männedorf, Switzerland) and pooled inequal and also tiered proportions. These mixtures were used asPCR templates for MiSeq sequencing. Each mock communitymixture was amplified and sequenced in triplicate using primersUeda19F–R6, IGK3–DVV, and F2–R6 as described above.The information for generating the mock community can be

found at the Mockrobiota repository1 under mock-27 andmock-28.

Richness and Diversity EstimatesRichness was estimated using number of observed OTUs, whilediversity was estimated using the inverse Simpson metric:

1/D = 1/

R∑i=1

P2i

where R represents the number of OTUs in a sample and Piis the proportional abundance of each OTU. For each sample,both richness and diversity were estimated using a bootstrappedmethod by iterative subsampling (1000 iterations) to minimumread-depth (after dropping the lowest 15th percentile samples).

Quantitative PCR AssaysQuantitative PCR (qPCR) reactions were performed on a C1000Touch thermocycler equipped with a CFX96 Real Time Systemand the data were processed using CFX Manager software (allfrom Bio-Rad, Hercules, CA, United States). For all reactionplates, serially diluted standards containing known quantities ofDNA copies of the target gene (ranging between 4.2 × 101 and4.2 × 107) were used for establishing quantitative calibrationcurves. The standard was generated using a cloned fragment ofthe nifH gene from Didymococcus colitermitum TAV2 (ATCCBAA-2264; Wertz et al., 2012). A SYBR R© Green I-based assayfor nifH was established as follows: each reaction was 20 µlin volume and contained 10 µl of SYBR Green JumpStart TaqReadyMix (Bio-Rad, Hercules, CA, United States), 3 mM MgCl2,0.4 ng µl−1 BSA (Thermo Fisher Scientific, Waltham, MA,United States), 1.4 µM of each primer, and 2 µl of template.The program used was: 95◦C for 5 min, followed by 45 cyclesof 95◦C for 30 s, 52◦C for 45 s for annealing, 72◦C for 30 s forextension, and 78◦C for 10 s for signal acquisition. The reliabilityof quantification was evaluated using a melting curve from 55to 95◦C. A correction for the nifH copy numbers was done foreach sample separately by excluding the fraction of non-nifHsequences (i.e., the proportion of reads that were detected as non-nifH by the pipeline) from the initial values obtained from qPCR.Data were evaluated using ANOVA on log-transformed data.

RESULTS AND DISCUSSION

NifMAP – NifH MiSeq Illumina AmpliconAnalysis PipelineAs the majority of amplicon based diversity surveys employ the16S rRNA gene as a phylogenetic marker, there are standardizedanalysis pipelines and procedures (e.g., Caporaso et al., 2012;Edgar, 2013; Kozich et al., 2013). This is in stark contrast toamplicon-based surveys targeting functional genes, such as nifH.Some of the challenges limiting the development of standardizedpipelines for functional genes include co-amplification of non-target genes due to gene homology (Holmes et al., 1995);

1http://github.com/caporaso-lab/mockrobiota/

Frontiers in Microbiology | www.frontiersin.org 5 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 6

Angel et al. Evaluating nifH Primers and NifMAP

TABLE 2 | Relative proportion of the expected and observed reads for an even and tiered nifH gene mock community for the different primer pairs.

Mock communitycomposition

Expected distribution of reads Observed distribution of reads

Ueda19F–R6 IGK3–DVV F2–R6

Members Even Tiered Even Tiered Even Tiered Even Tiered

(1) D. acidophilus strainSJ4T (Firmicutes,cluster IC)

16.7% 10% 33 ± 0.9% 23 ± 1.0% 48 ± 1.0% 31 ± 0.5% 0.5 ± 0.0% 0.0 ± 0.0%

(2) M. loti strain R7A(Alphaproteobacteria,cluster IJ)

16.7% 20% 16 ± 0.4% 21 ± 0.2% 18 ± 0.4% 25 ± 0.7% 90.7 ± 1.1% 93.1 ± 0.7%

(3) N. microscopicumVaucher ASW 0120(Cyanobacteria, cluster IB)

16.7% 40% 12 ± 0.4% 34 ± 0.4% 14 ± 0.4% 36 ± 0.8% 0.7 ± 0.2% 0.3 ± 0.1%

(4) A. torulosa (Carm.)Lagerh. ASW 01028(Cyanobacteria, cluster IB)

16.7% 5% 16 ± 0.9% 4 ± 0.4% 19 ± 1.4% 5 ± 0.8% 0.0 ± 0.0% 0.0 ± 0.0%

(5) K. sacchari(Gammaproteobacteria,cluster IG)

16.7% 20% 12 ± 0.2% 14 ± 0.5% 1 ± 0.1% 2 ± 0.3% 5.7 ± 0.3% 5.7 ± 0.5%

(6) T. siberiense strain26-4b1T(Alphaproteobacteria,cluster IJ)

16.7% 5% 11 ± 0.4% 4 ± 0.5% 1 ± 0.0% 0 ± 0.0% 0.9 ± 0.1% 0.2 ± 0.0%

amino acid-based analyses making sequencing error (generatinginsertion and deletion errors) more detrimental; and lackof classification methods or databases. Furthermore as everyfunctional gene is different, the methods oftentimes need to beadjusted in a gene-specific manner (Penton et al., 2013). Severalmethods have been proposed in the literature to tackle issues suchas alignment and filtering (Fish et al., 2013), correct conversion toamino acids (Wang et al., 2013) and classification (Dumont et al.,2014; Frank et al., 2016), yet complete pipelines for analyzingenvironmental functional genes are still lacking. Recently, Gabyet al. (2017) published a perl-based pipeline for OTU clusteringand inference of taxonomic affiliation through BLAST of nifHamplicon data; however, this pipeline did not employ the use ofHMMs to filter out homologous genes.

We developed a bioinformatics pipeline for processing nifHreads derived from the MiSeq Illumina sequencing platformcalled “NifH MiSeq Illumina Amplicon Analysis Pipeline” –NifMAP, which uses HMMs to filter out homologous genesand taxonomically classifies sequences using three differentapproaches. In the first step, the sequences are filteredagainst nucleic acids based HMMs to remove non-target, non-homologous sequences. Subsequently, sequences are clusteredbased on 97% identity to the centroid sequence, translatedinto amino acids, and corrected for possible frame shifts. TheOTU representatives are screened against specific HMMs inorder to filter out nifH-homolog sequences (namely bchX,chlL, bchL, or parA). The use of HMMs ensures a highlyspecific and sensitive method to detect and remove non-targetsequences that is orders of magnitude faster than filteringbased on taxonomic or phylogenetic affiliation. Finally, theremaining sequences are classified taxonomically to their closest

relatives and their phylogenetic cluster (Figure 1). We chosea combined classification approach using several independentmethods, providing information on both the closest taxonomicrelative of the queried sequence in addition to its placement ina phylogenetic cluster, because the two classification approachesmight occasionally disagree (Frank et al., 2016). Our pipeline wasevaluated with a nifH mock community and a diverse collectionof environmental samples (including roots and rhizospheresamples, BSCs and estuarine water samples).

Evaluating Coverage of nifH Primer PairCombinationsAcross numerous environments, the distribution of the nifH genehas been evaluated using various primer pair combinations (e.g.,Ueda et al., 1995; Farnelid et al., 2011; Collavino et al., 2014).Although there have been some in silico investigations evaluatingthe performance of these primer pairs along with suggestionsas to the best combination(s) (Gaby and Buckley, 2012), theseprimer pairs have yet to be thoroughly evaluated and tested inthe wet-lab, especially when considering samples from highlydiverse environments such as soils. To that end, we evaluatedthe performance of 10 different nifH primer combinations. Theprimer pairs consisted of all possible combinations of threeforward primers (Nh21F, Ueda19F, and F2) and three reverseprimers (nifH1, nifH3, and R6) (Table 1) as described previously(Gaby and Buckley, 2012), along with the combination IGK3and DVV. These primers were chosen due to their presumedhigh coverage (Table 1) and ability to generate amplicons inlengths that are suitable for MiSeq sequencing (200–500 bp).Extracted DNAs from beech forest and meadow soil were initially

Frontiers in Microbiology | www.frontiersin.org 6 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 7

Angel et al. Evaluating nifH Primers and NifMAP

Ueda 19F-R6 IGK3-DVV F2-R6100%

75%

50%

25%

0%100%

75%

50%

25%

0%

Prop

. of O

TUs

Prop

. of r

eads

nifHNon-nifH

a hb c d e f g mi j k l

soil root orrhizosphere

BSCs aquatic

a hb c d e f g mi j k l

soil root orrhizosphere

BSCs aquatic

a hb c d e f g mi j k l

soil root orrhizosphere

BSCs aquatic

(A)

(B)

FIGURE 2 | Performance of the nifH primer pairs in amplifying nifH and non-nifH (homologous) genes across environmental samples based on proportion of reads(A) and OTUs (B). The environmental samples include: (a) beech forest soil; (b) meadow soil; (c) rhizosphere and (d) root samples of Arrhenatherum elatius; (e)rhizosphere and (f) root samples of Oryza sativa; (g) coastal, sub-arctic biological soil crust (BSC); (h) temperate BSC; (i) high alpine BSC; (j) semiarid BSC; (k) aridBSC; estuarine samples from the (l) Great Belt and (m) Roskilde Fjord. More details on the samples can be found in the Section “Materials and Methods.” Reads orOTUs classified as homologous genes (such as bchX, chlL, bchL, and parA) are summarized as “non-nifH.”

used in a pilot study to evaluate these combinations. The initialcriterion was generation of a correctly sized PCR fragment.Only three primer pairs produced PCR fragments of the correctsize: F2–R6 (358 bp), Ueda19F–R6 (394 bp), and IGK3–DVV(454 bp) without unspecific bands (data not shown). Primer pairsF2–R6 and IGK3–DVV have been used in the past for analyzingdiazotrophic communities (Gaby and Buckley, 2012), while thiswas the first time that the Ueda19F–R6 combination was used.

Assessing nifH Primer Pair CoverageUsing a Mock CommunityIn order to assess potential primer biases of these three primercombinations, their performance was tested on constructednifH mock communities containing six phylogenetically diversemembers. Mock communities, i.e., a defined mixture of knownmicrobial strains, have become a standard tool for benchmarkingdifferent aspects of high-throughput sequencing techniques, forexample, base calling error, variation between indecent runs andcross contamination (Schloss et al., 2011; Bokulich et al., 2016;Singer et al., 2016). In addition, mock communities can be usedto assess amplification bias of different primers. Until now nearlyall established mock communities were designed for 16S rRNAgene surveys, while there is a growing need to develop mockcommunities for other (functional) genes.

Two mock communities were generated using six clonescontaining nifH gene fragments of diazotrophic cultures of evenand tiered proportions (Table 2). Prior to pooling the clones,each insert was confirmed to perfectly match the used primers

based on Sanger sequencing (data not shown). The nifH genesin each mock community were amplified and sequenced intriplicate using all three primers pairs, in order to assess theability of the primers to reconstruct the diazotrophic communitystructure. Across all three primer pairs, sequencing produced664–7883 reads per sample after quality filtering, which wereclassified into 6 (primers Ueda19F–R6, and F2–R6) and 7 (IGK3–DVV) OTUs. OTUs 1–6 were confirmed to match the mockcommunity members (Table 2), while OTU 7 was a contaminantidentified as a Geobacter sp. and comprised of a single readin two mock community replicates amplified using the primerpair IGK3–DVV (data not shown). Sequencing exhibited a highdegree of reproducibility among replicates as indicated by thelow standard errors (Table 2), but showed deviations fromthe expected read distribution in the even and tiered mockcommunities. Using primers F2–R6, over 90% of the reads wereaffiliated to M. loti and about 6% were affiliated to K. sacchari,while other members of the mock community (including twocyanobacteria) were nearly not detected. This is despite perfectin silico primer matching to the nifH sequences of the mockcommunity members.

In contrast, all mock community members could be recoveredusing primers Ueda19F–R6 and IGK3–DVV. The bias from theexpected distribution (even and tiered) was greater in the IGK3–DVV primers compared to the Ueda19F–R6 primers (Table 2).Specifically, D. acidophilus was consistently overrepresented inboth the even and tiered mock communities for both primerpairs, while K. sacchari and T. siberiense were underrepresented.D. acidophilus, K. sacchari, and T. siberiense were particularly

Frontiers in Microbiology | www.frontiersin.org 7 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 8

Angel et al. Evaluating nifH Primers and NifMAP

biased against using the primer pair IGK3–DVV. Biases inthe composition of microbial communities resulting fromdifferential amplification of templates by PCR especially whenusing degenerate primer pairs, are well documented throughhigh-throughput sequencing of 16S rRNA gene as well asfunctional gene mock communities (e.g., Caporaso et al.,2011; Schloss et al., 2011; Pinto and Raskin, 2012; Pelikanet al., 2016). As in 16S rRNA gene amplicon sequencing(Schloss et al., 2011), our results therefore stress the needfor including mock community samples that are specific tothe target gene of interest in sequencing experiments forassessing the extent of primer biases. Furthermore, when mockcommunities are sequenced along with environmental sampleson one sequencing run, it allows verifying the reproducibility ofdifferent runs.

Assessing nifH Primer Pair PerformanceUsing Environmental SamplesThe specificities of the primer pairs Ueda19F–R6, IGK3–DVV,and F2–R6 were further evaluated by amplicon sequencing ofenvironmental samples. In addition to the above-mentionedtemperate soil samples, we extended our investigation to a diversecollection of terrestrial samples (such as BSCs, rhizosphere androot samples) along with two estuarine water samples. A totalof 424,234 raw Illumina paired-end reads were generated acrossthese primer pairs: Ueda19F–R6 (156,476 reads), IGK3–DVV(202,300 reads), and F2–R6 (65,458 reads). The raw readswere processed through our automated pipeline NifMAP (seesection “Materials and Methods” and Figure 1). After contigassembly, the merged reads were filtered using the nucleicacids based HMM, which filtered out different proportionsof reads depending on the used primer combinations. Mostreads were removed in the F2–R6 primer pair (ca. 7.5%)followed by Ueda19F–R6 (ca. 2.0%) and IGK3–DVV (ca. 0.3%).Investigating these non-nifH sequences using BLASTN againstthe Nr database gave no significant matches. The remaining

reads were clustered into OTUs with the IGK3–DVV primerpair generating the most reads (89,453) that were clustered into884 OTU97, followed by Ueda19F–R6 (76,897 reads; 528 OTU97)and F2–R6 (45808 reads; 763 OTU97). OTU representativeswere converted to amino acid sequences, but in no case didwe observe that frame shift correction was needed. This isin line with previous reports showing a particularly low indelerror rate for Illumina MiSeq technology (Schirmer et al., 2015).Next, filtering of the OTU sequence representative via HMMsof homologous genes to nifH showed that all primer pairs andsamples produced reads, and consequently OTUs, which werenot classified as nifH, but rather as these homologous genes.Most of these homologs were classified as chlL-bchL, followedby bchX, and only a minority of sequences were classified asparA.

While the amplification of homologous genes was observedin all three primer pairs, striking differences amongst theprimer pairs were seen in the proportion of reads and OTUsclassified as homologs rather than as nifH (Figure 2). Acrossmany environmental samples, the IGK3–DVV combinationexhibited the highest proportion of non-specific productswith on average 48% non-nifH OTUs and up to 92% non-nifH reads (Figure 2, green bars). The primer pairs F2–R6and the new tested combination, Ueda19F–R6, amplified lessnon-nifH sequences across many of the investigated samples,with an average of 7.5 and 13% for primers Ueda19F–R6and F2–R6, respectively. Our analysis shows not only largedifferences in the extent of non-specific products amongstthe different primer pairs (P < 0.01), but also amongstenvironmental samples when the same primer pair was used(P = 0.02). The most striking difference was detectableamongst the samples of rice and tall oat-grass. Using primersIGK3–DVV, nearly all reads and OTUs in the rice sampleswere classified as nifH-related (Figure 2, samples “e, f ”),while in the tall oat-grass samples only around 10% ofthem were identified as nifH-related (Figure 2, samples “c,

FIGURE 3 | Taxonomic classification of nifH sequences based on a BLASTP search for the different primer pairs and environmental samples. Colors of the heatmapindicate the mean relative abundance of the taxonomic classes from each sample. The environmental samples include: (a) beech forest soil; (b) meadow soil; (c)rhizosphere and (d) root samples of Arrhenatherum elatius; (e) rhizosphere and (f) root samples of Oryza sativa; (g) coastal, sub-arctic biological soil crust (BSC); (h)temperate BSC; (i) high alpine BSC; (j) semiarid BSC; (k) arid BSC; estuarine samples from the (l) Great Belt and (m) Roskilde Fjord.

Frontiers in Microbiology | www.frontiersin.org 8 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 9

Angel et al. Evaluating nifH Primers and NifMAP

FIGURE 4 | Taxonomic classification of nifH sequences based on CART for the different primer pairs and environmental samples. Colors of the heatmap indicate themean relative abundance of the subclusters in each sample. The environmental samples include: (a) beech forest soil; (b) meadow soil; (c) rhizosphere and (d) rootsamples of Arrhenatherum elatius; (e) rhizosphere and (f) root samples of Oryza sativa; (g) coastal, sub-arctic biological soil crust (BSC); (h) temperate BSC; (i) highalpine BSC; (j) semiarid BSC; (k) arid BSC; estuarine samples from the (l) Great Belt and (m) Roskilde Fjord.

d”). Similar trends can be seen for the crust and watersamples, though to a lesser extent. The problem of co-amplification of non-target homologous or non-homologousgenes during PCR is well documented in the literature,particularly when amplifying functional genes (Holmes et al.,1995). Unfortunately, this phenomenon can erroneously inflatediversity estimates and population size when left unnoticed.Our analysis clearly shows that different primer pairs can havestriking differences in amplification specificities and that thisshould be taken into consideration when designing a geneticsurvey.

Comparing Alpha-Diversity MetricsAcross the Primer PairsWe calculated the number of observed OTUs (richness estimate)and the inverse Simpson metric (diversity estimate) from eachsample type and with the three different primer pairs. Onaverage, similar numbers of nifH OTUs per sample were detected

using each of the three primer pairs Ueda19F–R6 (49 ± 3);IGK3–DVV (39 ± 3); and F2–R6 (49 ± 4), while the inverseSimpson index was somewhat lower using primers Ueda19F–R6(9.5 ± 0.5) compared to the other two primer pairs, IGK3–DVV (14.0 ± 1.5) and F2–R6 (14.0 ± 1.8) (SupplementaryFigure S1) indicating no clear advantage of one primer pairin capturing higher richness. A general agreement amongst theprimer pairs was seen across the different sample types regardingrichness and diversity patters. For example, the beech forest andmeadow soils harbored richer and more diverse communitiesthan the crust and water samples. Nevertheless, there were somedifferences amongst the primer pairs with regards to richnessand diversity estimates. Most notably primer pair F2–R6 yieldedhigher richness and diversity estimates for rice rhizosphere androot samples (Supplementary Figure S1, samples “e, f ”), butlower for many of the crust samples (Supplementary Figure S1,samples “g–k”), compared to primer pairs IGK3–DVV andUeda19F–R6.

Frontiers in Microbiology | www.frontiersin.org 9 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 10

Angel et al. Evaluating nifH Primers and NifMAP

ClusterIII

bchX

, chl

L,bc

hLCluster I

ClusterIIpa

rA

Cluste

rIV

0.10.

20.30.

40.50.

60.70.

80.91.

0

PlrSpec2

WP 007206408-1

ArbSvalb

Ueda19FR6 OTU 257

DelProt3F2R6 OTU 319

F2R6 OTU 68

IGK3DVV OTU 605

F2R6 OTU 139

IGK3

DVV

OTU

61

UncB5916

Uni

Nit7

6

Ueda19FR6 OTU 236

PhsS

pec5

ChcSpec3

UncM2717

Ueda19FR6 OTU 42

F2R6

OTU

166

Ueda

19FR

6 O

TU 3

70

IGK3

DVV

OTU

137

8

IGK3DVV OTU 1068

F2R6 OTU 318

DsrAutot

Can

Met

h3

Ueda19FR6 OTU 8

Ueda19FR

6 OTU

398

F2R6 OTU 250

AnaCyli4

F2R

6 O

TU 2

26

F2R6 OTU 308

Ueda19FR6 OTU 87

F2R6 O

TU 93

Ueda19FR6 OTU 205

Ueda19FR6 OTU 509

Ued

a19F

R6

OTU

165

Ueda19FR6 OTU 351

Ueda19FR6 OTU 6

IGK3DVV OTU 135

Ueda19FR6 OTU 187

IGK3DVV OTU 384

Ueda19FR6 OTU 151

IGK3

DVV

OTU

787

IGK3DVV OTU 788

CloKlu17

IGK3

DVV

OTU

509

Ueda19FR6 OTU 184

F2R6 OTU 291

IGK3DVV OTU 1226

SelS

pe12

IGK3DVV OTU 99

F2R6 OTU 202

452

UTO

6R2

F

IGK3DVV OTU 1290

F2R6 OTU 111

Ueda19FR6 OTU 435

Unc1

0959

F2R6 O

TU 81

WP 015169021-1

IGK3D

VV OTU 36

1

IGK3DVV OTU 560

Ued

a19F

R6

OTU

524 523

UTO

6RF91ade

U

Rum

Flav

e

IGK3

DVV

OTU

39

UniNit7

3

UncMi753

F2R6 OTU 239

MetLacus

WP 007814214-1

Uni

Bac5

0

MhmTherm

IGK3DVV O

TU 784

UncN1551

Ueda19FR6 OTU 47

IGK3DVV OTU 168

Uni

Nit9

4

AcnLong4

F2R6 O

TU 329

IGK3DVV OTU 544

IGK3DVV OTU 512

UncM2055

Unc

N16

28

UncM

1464

Ueda19FR6 OTU 191

IGK3DVV OTU 677

643UT

O6

R2F

Ueda19FR6 OTU 313

IGK3DVV O

TU 1566

UncPr21

8

Ueda19FR6 OTU 15

Ueda1

9FR6 O

TU 203

UncPr

180

IGK3

DVV

OTU

228

IGK3DVV OTU 1083

UncNos29

IGK3DVV OTU 943

UniNi155IG

K3D

VV O

TU 1

075

F2R

6 O

TU 3

9

F2R6 O

TU 16

2

F2R6

OTU

62

UncM

1309

UniNi143

IGK3D

VV OTU

321

NosSpe22C

loC

itr2

CloUltu

n

Ueda19FR

6 OTU

247

Ueda1

9FR6 O

TU 38

2

425UT

OVV

D3KGI

F2R

6 O

TU 1

85

IGK3

DVV O

TU 1

499

SntW

olfe

Ueda19FR6 OTU 202

AnrColih

IGK3D

VV OTU

766

IGK3D

VV OTU

73

F2R6 OTU 288

IGK3DVV OTU 1247

F2R6 O

TU 31

Ueda

19FR

6 O

TU 1

63

Ueda19FR6 OTU 493

IGK3DVV OTU 233

Ueda19FR

6 OTU

525

Ueda19FR6 OTU 432

Unc

N16

02 IGK3D

VV OTU

27

UncN1648

WP 019012955-1

WP 017672669-1

IGK3DVV OTU 1314

WP 008296332-1

IGK3DVV OTU 15

IGK3D

VV OTU

996

F2R6 O

TU 84

Ueda19FR

6 OTU

292

IGK3DVV OTU 386

443

UTO

6R2

F

IGK3DVV OTU 41

UncN

1539

WP 003605362-1

F2R6 OTU 374

IGK3

DVV

OTU

756

Ueda

19FR

6 OTU

196

Ueda19FR6 OTU 73

MhbSpec3

PaeF

uji2

F2R6

OTU

41

IGK3D

VV OTU

77

Ueda19FR

6 OTU

307

F2R6 O

TU 260

IGK3

DVV

OTU

258

Ueda19FR6 OTU 62

IGK3DVV OTU 1404

CloC

ellu

MicMarit

Ueda19FR

6 OTU

234

AciFerro

IGK3D

VV OTU

563

YP 635866-1

F2R6 OTU 138

Rum

Albu

9

UncM

2070

DlpSulfe

IGK3D

VV OTU

1558

Ued

a19F

R6

OTU

224

RhsRubru

F2R6 OTU 47

813UT

OVV

D3KGI

Ueda19FR6 OTU 71Uncu1116

IGK3

DVV

OTU

45

ThnPoten

Ueda19FR6 OTU 272

Unc

uS59

3

IGK3

DVV

OTU

442

Ueda1

9FR6 O

TU 280

IGK3

DVV

OTU

508

IGK3D

VV OTU

869

CltN

itro

342UT

OVV

D3KGI

IGK3D

VV OTU

225

IGK3DVV OTU 273

IGK3DVV OTU 719

IGK3

DVV

OTU 1

243

UniNi160

IGK3DVV OTU 1548

IGK3DVV OTU 750

IGK3

DVV

OTU

577

Ueda19FR6 OTU 39

WP 012475625-1

F2R

6 O

TU 3

43

Ueda19FR6 OTU 463

UniNi125

WP 009021548-1

F2R6 O

TU 377

IGK3D

VV OTU

1153

UncB5

670

UniNi139

IGK3DVV O

TU 928

IGK3DVV OTU 287

Ued

a19F

R6

OTU

140

CloSpe17

Uni

Bac5

6

Ueda19FR6 OTU 300

MnbEve

st

Ueda19FR6 OTU 458

IGK3DVV OTU 1462

F2R6 OTU 174Ueda19FR6 OTU 85

Ued

a19F

R6

OTU

66

IGK3D

VV OTU

88

Ueda19FR6 OTU 90

Ueda19FR6 OTU 341

F2R6 OTU 16

Unc

B576

1

95UT

OV

VD3

KGI

WP 008931734-1

IGK3DVV O

TU 1200

UncMi90

6

Ueda19FR6 OTU 296

CntSpe25

Meg

Spec

6

F2R6 OTU 281

Ueda19FR6 OTU 242

IGK3D

VV OTU

1446

IGK3DVV OTU 1622

DsfPiezo

211UT

OVV

D3KGI

Dia

Spec

i

F2R6 O

TU 350

F2R6 O

TU 127

UncN1817

NodSpum3

Ueda19FR6 OTU 442

CanM

eth5

F2R6 O

TU 128

ClsB

acte

WP 007196589-1

IGK3D

VV OTU

960

F2R6 OTU 275

Ueda19FR6 OTU 229

MmsSpeci

UncN1699

EubSpec3

LynSpec3

59UT

OVV

D3KGI

F2R6 O

TU 181

CloLjun4

IGK3DVV O

TU 673

F2R6 OTU 148

IGK3DVV OTU 504

Ueda19FR6 OTU 335

Uncu1115

IGK3

DVV

OTU

162

6

DlmTied2

IGK3DVV OTU 456

WP 018411937-1

Ueda19FR

6 OTU

340

Ueda19FR6 OTU 366

F2R6 OTU 65

Ueda19FR6 OTU 221

Ueda19FR6 OTU 72

IGK3D

VV OTU

457

IGK3DVV OTU 330

F2R6 O

TU 38

1

Ueda19FR6 OTU 498

MhnMar13

Ueda19FR6 OTU 387

IGK3

DVV

OTU

152

9

AlcFaeca

CnbUCYNA

MhhPalu

s

Unc

1545

6

IGK3DVV OTU 460

IGK3DVV OTU 1288

F2R6 O

TU 142

Ueda19FR6 OTU 115

F2R6 OTU 286

Ueda19FR6 OTU 348

CorAkaji

IGK3DVV O

TU 1359

IGK3DVV O

TU 1281

MetAce11

IGK3DVV O

TU 810

Ueda19FR

6 OTU

420

F2R

6 O

TU 2

74

IGK3D

VV OTU

1017

IGK3DVV OTU 910

DsfFruct

F2R6 OTU 44

F2R6 O

TU 32

IGK3DVV OTU 617

OstCyan2

866

UTO

VV

D3K

GI

Ueda19FR6 OTU 405

IGK3DVV OTU 502

IGK3DVV OTU 396

Ueda19FR6 OTU 33

F2R6

OTU

220

Ueda19FR6 OTU 93

Mhh

Palu

3

IGK3

DVV

OTU

670

Ueda19FR6 OTU 388

451UT

O6

RF91adeU

IGK3DVV OTU 1263

IGK3DVV OTU 1277

WP 008028560-1

Uncu1028Ueda19FR6 OTU 125

Mtg

Form

i

IGK3D

VV OTU

270

WP 015281842-1

DesAceto

MmsSpe

c3

Unc11188

Ueda19FR6 OTU 288

F2R6 OTU 64

F2R6 O

TU 339

Rsb

Spec

5

IGK3DVV OTU 128

CloPas16

UncPlanc

IGK3DVV OTU 3

Unc10685

Ueda

19FR

6 O

TU 2

49

Dia

Succ

i

F2R6 OTU 71

IGK3DVV OTU 109

Unc

N18

14

Rhs

Rub

r4

TolSpec5

IGK3DVV OTU 700

HdrThermIG

K3DVV OTU 587

4621UT

OVV

D3K

GI

IGK3D

VV OTU

768

IGK3DVV OTU 20

Rum

Obeu3

072UTO6RF91adeU

IGK3D

VV OTU

1227

ThfMobil

F2R6 OTU 159

F2R6 O

TU 146

Uncu1123

IGK3

DVV

OTU

132

9

CloLjung

MtrRumi2

Ueda19FR6 OTU 248

Mes

Cic

15

CntSpec7

IGK3DVV OTU 1450

Ued

a19F

R6

OTU

28

PlrSpeci

Hydr

oDSM

FlaPlau2

WP 007067496-1

AlkO

rem

l

F2R6 OTU 313

IGK3DVV O

TU 85

F2R6 O

TU 294

IGK3DVV OTU 892

IGK3DVV OTU 1480

F2R6 O

TU 118

Ueda19FR

6 OTU

350

F2R6 OTU 356

IGK3DVV O

TU 1107

Ueda19FR

6 OTU

510

Ueda19FR6 OTU 508

Ueda19FR6 OTU 246

UniNi164

UncM

1688

MhnAeoli

Ued

a19F

R6

OTU

116

IGK3DVV OTU 1475

WP 002735653-1

DslOrie2

IGK3DVV OTU 757

IGK3

DVV

OTU

153

F2R6 OTU 130

IGK3DVV OTU 376

F2R6 O

TU 379

UncN1447

F2R6 OTU 228

IGK3DVV OTU 1058

IGK3DVV OTU 684

IGK3DVV OTU 997

IGK3DVV OTU 903

IGK3

DVV

OTU

814

Ueda19FR6 OTU 95

IGK3DVV OTU 65F2R6 OTU 321

UncM

2112

UncM19

35

RhbSpha3

EubC

ell2

WP 007165005-1

CloC

lar2

WP 006453494-1

F2R6 OTU 158

Ued

a19F

R6

OTU

138

YP 001382181-1

IGK3DVV OTU 1317

YP 001312276-1

UncM1726

Ueda19FR6 OTU 262

F2R6 OTU 133

F2R6 OTU 342AztVine6

Unc10897

IGK3DVV OTU 320

IGK3DVV OTU 1363

FusU

lcer

BraSp130

F2R6 OTU 36

IGK3DVV OTU 952

F2R6 OTU 157

HlhHalop

IGK3D

VV OTU

1286

IGK3D

VV OTU

34

F2R6 OTU 230

Ueda19FR6 OTU 164

Ued

a19F

R6

OTU

483

IGK3D

VV OTU

100

F2R6 OTU 222

RhsRubr5

Ueda19FR6 OTU 179

Unc

Arc7

7

F2R6

OTU

311

IGK3DVV O

TU 1379

Ueda19FR6 OTU 223

Ueda19FR

6 OTU

251

F2R6 OTU 176

Ueda19FR6 OTU 36

WP 017667962-1

Ueda19FR6 OTU 254

DstHafn8

Ueda19FR6 OTU 425

Ued

a19F

R6

OTU

289

F2R6 O

TU 20

IGK3DVV OTU 126

Ueda19FR6 OTU 128

AzoBra45

IGK3DVV OTU 197

F2R6 OTU 22F2

R6 OTU

200

UncM20

06

792UT

O6

RF91adeU

MrvForma

Unc17650

IGK3D

VV OTU

805

UncMi92

6

F2R6 OTU 97

IGK3DVV OTU 145

831UT

OVV

D3K

GI

Ueda19FR6 OTU 222

Ueda

19FR

6 O

TU 2

18

IGK3DVV OTU 308

121UT

OV

VD3

KGI

IGK3DVV OTU 79

NP 043285-1

IGK3DVV OTU 1150

IGK3D

VV OTU

29

Ueda19FR

6 OTU

418IG

K3D

VV O

TU 7

49

UncuS980

WP 017299753-1

IGK3DVV O

TU 1463

IGK3DVV OTU 819

IGK3DVV OTU 1196

MhnVann4

134UT

OV

VD3

KGI 08

3UT

O6

RF91

ade

U

MhpStadt

IGK3DVV OTU 1140

WP 011662788-1

Ueda19FR6 OTU 111

Unc10836

Ueda19FR

6 OTU

256

Ueda19FR6 OTU 142

SnrFumar

IGK3DVV OTU 264

UncN1542

WP 008188882-1

AgaAlbus

63epSol

C

WP 012282383-1

IGK3DVV OTU 889

WP 011956643-1

F2R6 OTU 19

F2R6 OTU 334

CopC

atus

MetMaz

e6

UncNi917U

niN

it71

MagMagn2

IGK3DVV OTU 7

IGK3D

VV OTU 11

23

YP 003795641-1

IGK3D

VV OTU

826

IGK3D

VV OTU

64Ueda19FR6 OTU 354

864

UTO

6RF

91ad

eU

Ueda19FR

6 OTU

63

Ueda19FR6 OTU 148

IGK3D

VV OTU

497

IGK3DVV OTU 962

IGK3DVV OTU 1008

Ueda

19FR

6 O

TU 9

AnaCylin

Rhi

Lusi

t

Ueda19FR6 OTU 299

Ueda19FR6 OTU 3

IGK3DVV OTU 1313

F2R6 OTU 147

IGK3DVV OTU 688

F2R6 OTU 129

Ueda19FR6 OTU 220

F2R6 O

TU 13

7

F2R6 O

TU 296

Ueda19FR6 OTU 269

F2R6 O

TU 40

011

UTO

6R2

F

DlbPropi

Ueda19FR6 OTU 456

F2R6 OTU 24

WP 010933803-1

F2R6 OTU 312

WP 019478635-1

UncM18

10

F2R6 OTU 267

IGK3D

VV OTU

724

EthH

arbi

IGK3DVV OTU 1425

F2R6 OTU 322

Ueda19FR6 OTU 10

F2R6 OTU 73

WP 011566493-1

DsfVulg2

F2R6 OTU 164PepSpec2

NosSpe21

UncM22

26

IGK3

DVV

OTU

102

7

FusVariu

F2R6 O

TU 232

PecAtros

IGK3D

VV OTU

932

IGK3

DVV

OTU

274

Ued

a19F

R6

OTU

485

MnhMah

ii

Ueda19FR6 OTU 193

Ueda19FR6 OTU 106

F2R6 OTU 224

IGK3

DVV

OTU

758

UncMi99

3

IGK3DVV OTU 513

Ueda19FR

6 OTU

167

Ueda19FR6 OTU 178

IGK3D

VV OTU

91

IGK3DVV OTU 963

IGK3DVV OTU 1583

IGK3D

VV OTU

344

ThdN

arug

ThhSibi2

Ued

a19F

R6

OTU

305

FraS

ymbi

IGK3DVV OTU 224

F2R6 O

TU 72

Ueda19FR6 OTU 114

F2R6 OTU 136

HerSero6

IGK3D

VV OTU

764

IGK3

DVV

OTU 1

422

Ueda19FR

6 OTU

21

Ueda19FR6 OTU 492

Clo

Sord

e

IGK3DVV OTU 1039

UniN

i116

F2R6 O

TU 261

UncN1204

IGK3

DVV

OTU

115

1

Ueda19FR6 OTU 414

MhtThe

rm

TreM

altoG

orPa

mel

FirB

act4

Unc

N18

20

UncM15

33

IGK3DVV OTU 755

IGK3DVV OTU 1538

UncPr22

5

IGK3DVV OTU 17

IGK3DVV OTU 1350

F2R6 O

TU 88

IGK3DVV O

TU 1159

IGK3D

VV OTU 44

6

Ueda19FR6 OTU 472

UncM17

94

MtdVulca

IGK3DVV OTU 888

IGK3

DVV

OTU

134

4

PepSpeci

IGK3DVV OTU 1539

956

UTO

VV

D3K

GI

IGK3DVV OTU 476

WP 012333500-1

873UT

OV

VD3

KGI

DsfAfri6

Ueda19FR6 OTU 317

Clo

Lent

o

DsrAuto3

IGK3DVV OTU 1560

IGK3DVV OTU 1327

F2R6

OTU 1

91

Ueda19FR6 OTU 119

UniNi151

Ueda19FR

6 OTU

497

26UT

OVV

D3KGI

SubSpeci

Ueda19FR6 OTU 258

IGK3

DVV

OTU

120

Rum

Obe

umRum

Alb1

1

IGK3DVV OTU 5

Ueda1

9FR6 O

TU 35

6

UncMi820

129

UTO

VVD3

KGI

UncArc57

IGK3

DVV

OTU

988

IGK3DVV O

TU 1522

F2R6 OTU 140

CprSpeci

IGK3DVV OTU 213

FirB

act6

AztVine9

IGK3D

VV OTU

592

F2R

6 O

TU 3

5

Ued

a19F

R6

OTU

491

AcaSpeci

CloTerm

4

WP 012180173-1

YP 209557-1

PyrP

isco

Ueda19FR6 OTU 346

F2R6 O

TU 58

UniNi134

IGK3

DVV

OTU

108

4

Ueda19FR6 OTU 453

Unc

N15

29

IGK3DVV OTU 1469

UncM

1932

DehEthen

WP 008932829-1

Unc11323

Uni

Ni1

40

PaeD

ur19

WP 007563652-1

Ueda19FR6 OTU 353

Ueda19FR

6 OTU

40

Ueda19FR

6 OTU

19

Ueda19FR6 OTU 329

UncMi821

IGK3D

VV OTU

796

WP 012971816-1

IGK3DVV OTU 1309

F2R6 O

TU 48

Ueda19FR6 OTU 77

IGK3D

VV OTU 85

2

UniNit6

9

75UT

O6

R2F

IGK3DVV OTU 541

IGK3D

VV OTU

1553

Ueda19FR6 OTU 306

UncM13

75

IGK3D

VV OTU

1464

F2R6 O

TU 310

BeggAlba

F2R6 OTU 315

F2R6 OTU 105

UncB6166

Ueda19FR6 OTU 250

F2R6

OTU

155

Ued

a19F

R6

OTU

447

IGK3DVV OTU 803

F2R6 OTU 15

MhmOkina

IGK3DVV OTU 282

WP 015954400-1

F2R6 OTU 29

IGK3DVV OTU 217

Ueda19FR6 OTU 172

Ueda19FR6 OTU 166

Ued

a19F

R6

OTU

160

Ued

a19F

R6

OTU

416

F2R6 OTU 63

Ueda19FR6 OTU 68

RhmVan

n3

1011UT

OVV

D3KGI

AzrSpec3

IGK3DVV OTU 179

F2R6 OTU 209

IGK3DVV O

TU 777

UncM19

64

F2R6 OTU 259

F2R6 O

TU 152

TreP

rim6

Ueda19FR6 OTU 13

Ueda19FR6 OTU 53

IGK3DVV OTU 255

NP 904248-1

WP 008383919-1

Ueda19FR6 OTU 79

WP 006043641-1

UniN

i199

IGK3DVV O

TU 1002

Ueda19FR6 OTU 443

NP 045884-1

MtnMarb2

PaeS

abi8

IGK3DVV OTU 1105

CldSacch

Ued

a19F

R6

OTU

302

Ueda19FR6 OTU 364

WP 009538683-1

Mns

Hun

ga

CanDesul

Ueda19FR6 OTU 454

IGK3D

VV OTU

464

UncNi868

F2R6 O

TU 121

F2R6

OTU

227

IGK3D

VV OTU

12

IGK3D

VV OTU

84

IGK3DVV OTU 177

IGK3DVV OTU 115

IGK3DVV OTU 14

Ueda19FR6 OTU 198

F2R6 OTU 298

TreP

ri15

Ueda19FR6 OTU 57

Ueda19FR6 OTU 499

IGK3D

VV OTU

1261

F2R6 OTU 125

YP 003795500-1

VeiR

atti

F2R6 OTU 28

Unc

1545

7

F2R6 OTU 17

IGK3DVV OTU 173

F2R6 OTU 258

Ued

a19F

R6

OTU

104

IGK3DVV OTU 1543

IGK3DVV O

TU 1207

F2R6 OTU 135

IGK3D

VV OTU

48

Meg

Geno

2

Ueda

19FR

6 OTU

243

UncArc83

Rhb

Cap

su

IGK3DVV OTU 377

Ueda19FR6 OTU 321

Rum

Spe1

0

SnbGlyc2

AnaSpe20

F2R6 OTU 206

GrdAichi

IGK3DVV OTU 234

IGK3DVV OTU 886

Ueda19FR6 OTU 407

Ueda19FR6 OTU 379

F2R6 O

TU 363

UniNi110

IGK3DVV OTU 114

IGK3DVV OTU 1276

ThgPhaeu

IGK3DVV OTU 616

IGK3D

VV OTU

1503

Ueda19FR6 OTU 301

F2R6 O

TU 385

3411

UTO

VV

D3K

GI

WP 015225141-1

UncB

5739

F2R6 OTU 276

MmsSpec2

Ueda19FR6 OTU 41

BraSpe53

Ueda

19FR

6 O

TU 9

4

EubNodat

GeoSpe

c9

TurS

angu

UncNi891

Ueda19FR6 OTU 513

IGK3D

VV OTU 68

3

IGK3DVV OTU 651

IGK3DVV OTU 1256

Ueda19FR

6 OTU

214

UncM15

70

MtfL

imi2

Mes

Spe8

7

IGK3

DVV O

TU 59

9

Ueda19FR

6 OTU

480

Ued

a19F

R6

OTU

505

Uni

Nit2

8

WP 011157104-1

UncuS936

IGK3DVV OTU 1158

Ueda19FR

6 OTU

188

UncM21

96

94caBin

U

F2R6 OTU 149

ThrYello

Ueda19FR6 OTU 96

IGK3DVV O

TU 1218

IGK3DVV OTU 367

IGK3DVV O

TU 922

IGK3

DVV

OTU

612

Ueda19FR6 OTU 29

WatSamp2

Ueda19FR6 OTU 195

WP 011142366-1

CloTerm2

IGK3

DVV

OTU

275

IGK3DVV OTU 316

F2R6 OTU 6

UncMi908

IGK3D

VV OTU

1274

3UT

O6

R2F

F2R6 O

TU 263

UncN

1528

DesG

ibso

IGK3DVV OTU 57

Ueda

19FR

6 O

TU 4

8

IGK3DVV OTU 526

Uncu1120

6241

UTO

VV

D3K

GI

IGK3

DVV

OTU

650

F2R6 O

TU 141

Ueda19FR6 OTU

428

CloS

pe19

BraSpe29

F2R6 OTU 328

IGK3DVV OTU 797PalPropi

IGK3D

VV OTU

1239

RotDento

Glu

Dia

z2

UniNitro

IGK3DVV OTU 50

IGK3

DVV

OTU

618

552

UTO

6R2

F

UncN

1540

IGK3DVV OTU 1302

IGK3D

VV OTU

285

Uni

Ni1

02

IGK3DVV OTU 1389

UniNit77

IGK3DVV OTU 848

Ueda19FR6 OTU 171

AnaSpe19

Ueda19FR

6 OTU

75

IGK3DVV OTU 130

IGK

3DV

V O

TU 5

40

Ueda19FR6 OTU 417

Ueda19FR6 OTU 207

551UT

OVV

D3KGI

Unc10634

Unc

N18

12

67UT

OV

VD3

KGI

F2R6

OTU

96

IGK3DVV OTU 486

Ueda19FR

6 OTU

126

MetBar11

IGK3

DVV

OTU

167

UncB5

673

F2R6 OTU 10

IGK3D

VV OTU

137

UncM

1284

Can

Met

h9

F2R6 O

TU 193

UncNi892

IGK3DVV OTU 1072

UncPr2

10

694

UTO

VV

D3K

GI

Unc

Mi8

39

Ueda19FR6 OTU 11

IGK3D

VV OTU

1010

IGK3D

VV OTU

1160

CloC

ell4

WP 011356980-1

IGK3D

VV OTU

976

CloC

arb2

Ueda19FR6 OTU 78

IGK3D

VV OTU

1555

Ueda19FR

6 OTU

255

IGK3DVV OTU 1636

WP 013066406-1

IGK3DVV OTU 107

IGK3DVV OTU 680

GeoSpeci

Mes

Sp13

7

IGK3DVV O

TU 1340

MhnVolta

Ueda19FR6 OTU 526

Mht

Con

c2

IGK3

DVV

OTU

101

5

Ueda19FR6 OTU 209

UncM14

36

IGK3DVV OTU 1081IGK3DVV OTU 1514

MhbSpec7

023

UTO

6R2

F

F2R6 OTU 218

F2R6 OTU 355

F2R6 OTU 161

31UT

O6

R2F

Ueda

19FR

6 O

TU 7

6

YP 002327547-1

F2R6 O

TU 126

WP 011814420-1

Unc13328

UncM19

47

Ueda19FR6 OTU 471

F2R6 OTU 42

225

UTO

6RF

91ad

eU

F2R6 O

TU 302

IGK3DVV OTU 203

Ueda19FR6 OTU 25

F2R6 OTU 351

Fran

Alni

Ued

a19F

R6

OTU

92

F2R6 OTU 11

Glu

Azot

o

WP 018043288-1

UniN

i135

IGK3DVV OTU 991

MhnVann2 F2R6 OTU 369

F2R6 O

TU 27

NosSpe26

WP 009758350-1

Ued

a19F

R6

OTU

426

UncMi740

Ueda19FR6 OTU 263

F2R6 OTU 151

YP 002520006-1

MitM

ulta

IGK3DVV OTU 1634

F2R6 OTU 271

Ued

a19F

R6

OTU

109

WP 012499918-1

DlcAceto

WP 007196609-1

UncM14

76

UncPr

215

F2R6 OTU 246

Ueda1

9FR6 O

TU 13

7

IGK3DVV OTU 675

IGK3DVV OTU 527

IGK3D

VV OTU 4

IGK3DVV OTU 854

F2R6 OTU 76

F2R6 OTU 9

Unc1

6739

Ued

a19F

R6

OTU

467

Ued

a19F

R6

OTU

12

Met

Bark

7

UncM20

63

UncB5685

IGK3D

VV OTU

187

IGK3DVV OTU 1229

Clo

Bart4

Ued

a19F

R6

OTU

275

WP 017292740-1

IGK3DVV OTU 144

IGK3DVV OTU 400

Ueda19FR6 OTU 135

WP 015684485-1

F2R6 OTU 120

Rum

Spe1

4

WP 009538668-1

Ued

a19F

R6

OTU

97

Ueda19FR6 OTU 169IGK3DVV O

TU 8

Ueda19FR6 OTU 35

IGK3DVV OTU 432

Ued

a19F

R6

OTU

265

IGK3DVV OTU 191

Ueda19FR6 OTU 107

EthHarb3

IGK3DVV OTU 101

IGK3DVV OTU 978

IGK3DVV OTU 1157F2R6 OTU 172

DslAcidi

IGK3

DVV

OTU

939

IGK3DVV OTU 1031

Zym

Mob

il

IGK3DVV OTU 1497

IGK3DVV OTU 1009

IGK3D

VV OTU

637

Uni

Nit9

5

F2R6 OTU 207

ThvSpeci

IGK3

DVV

OTU

681

Ueda19FR6 OTU 141

Cls

Gen

om

WP 013640272-1

Ueda19FR6 OTU 226

MnrForm2

IGK3DVV OTU 1611

CloAcid2

WP 011924728-1

Unc11029

F2R6

OTU

59

Ueda1

9FR6 O

TU 52

7

F2R6 O

TU 361

UncB5744

IGK3DVV OTU 767

F2R6 OTU 362

OpiBacte

F2R6 OTU 279

IGK3DVV OTU 795

F2R6 O

TU 204

Ueda

19FR

6 O

TU 8

1

F2R6 OTU 33

Ued

a19F

R6

OTU

475

Ueda19FR6 OTU 253

SelS

pe11

IGK3D

VV OTU

180

F2R6 O

TU 99

DerG

umm

o

MhtHaru

n

UncB6000

IGK3DVV O

TU 1451UncN1230

MtrSpeci

IGK3DVV O

TU 566

Ueda19FR6 OTU 391

IGK3DVV O

TU 385

ThbTher3F2

R6 O

TU 1

95

IGK3DVV OTU 44

F2R6 O

TU 132

DorL

ong5

MhbSpe

ci

MlsTinda

IGK3DVV O

TU 1292

IGK3DVV OTU 365

Ueda19FR6 OTU 162

IGK3

DVV

OTU

884

IGK3DVV O

TU 1385

Des

Kuzn

e

UncMi99

2

273UT

OVV

D3K

GI

IGK3DVV OTU 625

F2R6 OTU 233

IGK3DVV OTU 251

F2R6 OTU 156

FirB

acte

IGK3DVV OTU 249

IGK3DVV OTU 302

IGK3DVV OTU 915

RsbS

pec2

F2R6 OTU 264

Meg

Spec

5

Ueda19FR6 OTU 143

Ueda19FR6 OTU 58

Meg

Micr

2

NosSpe19

Ueda19FR6 OTU

404

UncM13

74

Ueda19FR6 OTU 502

Unc10729

ChhThal3

Ueda19FR6 OTU 60

F2R6 O

TU 26

2

Rhi

Sp15

3

Unc11155

Ueda19FR

6 OTU

5

IGK

3DV

V O

TU 1245

F2R

6 O

TU 1

43

F2R6 O

TU 12

Ued

a19F

R6

OTU

338

IGK3

DVV

OTU

873

IGK3DVV OTU 1332

FusPerio

Ueda19FR6 OTU 235

IGK3D

VV OTU

412IG

K3D

VV O

TU 6

47

Meg

Elsd

6

Unc10640

RhdPalu9

NodHarve

PaeGram

3

MccChtho

DsfF

ruc2

UniNit68

IGK3DVV OTU 1191

333UT

O6

RF91adeU

Ueda19FR

6 OTU

183

IGK3DVV OTU 1056

Ueda19FR

6 OTU

470

UncM15

07

MtdSpeci

IGK3D

VV OTU

769

IGK3

DVV

OTU

121

9

CloPas20

IGK3D

VV OTU 80

IGK3DVV O

TU 1188

IGK3DVV OTU 280

UniNi170

DetAlkal

Ueda19FR6 OTU 424

IGK3DVV OTU 747

Rhd

Palu

s

Ueda19FR6 OTU 215

Ueda

19FR

6 O

TU 3

75

IGK3DVVOTU 744

AcnL

ong3

WP 007234715-1

754

UTO

6RF

91ad

eU

IGK3DVV OTU 104

F2R6 OTU 182

ZymMobi8

FirBact3

F2R6 OTU 101

IGK3DVV OTU 479

IGK3DVV O

TU 1149

9151UT

OVV

D3KGI

MncSpeci

Ueda19FR6 OTU 259

CanAccum

UncB5694

IGK3DVV OTU 1438

UniNi138

IGK3DVV OTU 1178

Ueda

19FR

6 O

TU 2

41

Ueda19FR

6 OTU

376

IGK3DVV OTU 929

UncB5686

WP 018411953-1

Unc15863

UncM1678

F2R6 OTU 365

Ueda19FR6 OTU 308

IGK3

DVV

OTU

141

6

IGK

3DV

V O

TU 4

84

F2R6 O

TU 98

CloTerm

3

DsfAfri4

Ueda

19FR

6 O

TU 1

89

Mnp

Petr3

UncMeth

9

HelMode3

Ueda19FR6 OTU 460

IGK3DVV OTU 423

PlanKSU1

WP 008944415-1

Ueda19FR

6 OTU

501

Ueda19FR6 OTU 309

F2R6 O

TU 240

Ueda19FR6 OTU 419

F2R6 OTU 123

UncB5914

DorS

peci

AngG

emi3

Ueda19FR6 OTU 506

IGK3DVV OTU 735

IGK3DVV OTU 185

WP 009470405-1

IGK3DVV OTU 1326

IGK3DVV OTU 638

Ueda19FR6 OTU 177

DorS

pec2

IGK3DVV O

TU 995

MhbIvan2

PseSpe11

Rhi

Le10

3

Ueda19FR

6 OTU

437

IGK3DVV O

TU 226

IGK3DVV OTU 543Unc

M17

02

F2R6 OTU 325

YP 001019173-1

CntSpe12

UncM14

84

Rum

Spe1

1

WP 011242451-1

IGK3D

VV OTU

733

F2R6 O

TU 23

6

IGK3DVV OTU 1082

F2R6 OTU 358

F2R6 OTU 169

Ueda19FR6 OTU 415

IGK3DVV O

TU 653

MnmZhil

i

962UT

O6

R2F

WP 006724818-1

IGK3DVV O

TU 601

UncArc82

Ueda19FR6 OTU 23

WP 013417907-1

IGK3

DVV

OTU

741

IGK3D

VV OTU

693

Ueda19FR6 OTU 208

Ueda19FR6 OTU 372

F2R6 OTU 94

UncB5943

IGK3D

VV OTU

781

IGK3

DVV

OTU

252

IGK

3DV

V O

TU 2

30

Ueda19FR6 OTU 311

F2R6 OTU 324

F2R6 OTU 364

WP 011338126-1

F2R6 O

TU 248

GeoUran

i

NP 777474-1

741

UTO

VVD3

KGI

Ueda

19FR

6 OT

U 51

8

TerTurne

DsfSale2

IGK3

DVV

OTU

799

IGK3DVV OTU 535

ThbTher4

IGK3DVV OTU 861

IGK3DVV OTU 1468

MhbTher3

MtbTund3

MhvHoll

a

Ueda19FR

6 OTU

519

IGK3

DVV

OTU

137

6

WP 012591062-1

F2R6 OTU 372

Unc10766

Ueda19FR6 OTU 217

Ueda

19FR

6 O

TU 3

90

523UT

OVV

D3K

GI

UncN1145

AceW

ood2

IGK3DVV O

TU 406BrvMcbre

F2R6 OTU 144

CloClar3

IGK3DVV OTU 1511

YP 003795466-1

IGK3DVV OTU 578

IGK3DVV OTU 1392

EubH

all3

UncM17

52

Ueda19FR6 OTU 237

IGK3DVV OTU 242

WP 020503930-1

IGK3D

VV OTU

999YP 003058343-1

F2R6 OTU 305IGK3DVV OTU 161

Ueda19FR6 OTU 212

UncN1537

F2R6 O

TU 221

Ueda19FR6 OTU 386

Ueda19FR

6 OTU

328

SutW

adsw

IGK3DVV OTU 851

IGK3DVV OTU 736

F2R6

OTU

306

Mls

Psyc

2

F2R6 O

TU 345

IGK3DVV OTU 93

ThtViola

Ueda19FR6 OTU 61

IGK3DVV OTU 313

Ueda19FR6 OTU 395

WP 009857125-1

F2R6 OTU 272

IGK3DVV OTU 1587

Ueda19FR

6 OTU

123

IGK3DVV OTU 1513

TreAzot3

FibS

ucci

refn

Icl

M

IGK3DVV OTU 1093

F2R

6 O

TU 7

F2R6 OTU 4

IGK3DVV OTU 1502

IGK3DVV OTU 751

OscSpec4

IGK3DVV OTU 664

F2R6 O

TU 163

IGK3DVV O

TU 1639

HlsHydro

PhsS

pec6

IGK3D

VV OTU

1

IGK3D

VV OTU

702

IGK3DVV OTU 1367

PlsF

erm

e

IGK3DVV O

TU 536

UniNi166

Uncu

1141

F2R6 OTU 249

Ueda19FR6 OTU 459

IGK3DVV OTU 469

Ueda19FR6 OTU 64

712UT

O6

R2F

SnbGlyco

UniNitr9

865UT

OVV

D3KGI

Meg

Micr

o

Ued

a19F

R6

OTU

174

IGK3DVV OTU 1092

F2R6 OTU 297

F2R6 OTU 340

IGK3DVV OTU 595

IGK3

DVV

OTU

870

Ueda19FR6 OTU 20

PolNapht

WP 009857104-1

IGK3

DVV

OTU

208

Unc10834

PlsFerm

5

Ueda

19FR

6 O

TU 3

8

IGK3DVV OTU 786

PsrA

lac2

CloSpe21

Ueda19FR

6 OTU

134

IGK3D

VV OTU

603

DsbPostg

F2R6 O

TU 316

PldLute2

IGK3

DVV

OTU

146

5

Ued

a19F

R6

OTU

385

IGK3DVV OTU 1578

IGK3DVV OTU 286

Ued

a19F

R6

OTU

514

Unc10052

DelProte

IGK3DVV OTU 315

IGK3DVV OTU 438

Ued

a19F

R6

OTU

266

IGK3DVV OTU 743

455

UTO

VV

D3K

GI

Ueda19FR

6 OTU

399

IGK3DVV OTU 293

IGK3D

VV OTU

328

IGK3DVV O

TU 955

Ueda19FR6 OTU 30

Ueda19FR

6 OTU

65

AngG

emin

UncPr191

IGK3DVV OTU 222

HalSpec2

F2R6 O

TU 386

WP 010162315-1

5277

1cn

U

F2R6 OTU 30

Ueda19FR6 OTU 88

WP 011058180-1

IGK3DVV OTU 436

Unc10648

F2R6

OTU

277

IGK3DVV O

TU 542

GeoSulfu

WolS

ucci

PlsF

er34

HolFoeti

Unc

Mi8

37

Ueda19FR

6 OTU

45

Rum

Torq

u

IGK3DVV OTU 310

F2R6 OTU 198

Ueda19FR6 OTU 371

UncM

2038

EubInfir

UncM

1722

Unc1

3343

UncB5688

F2R6 OTU 347

F2R6

OTU

103

DstH

afni

IGK3DVV OTU 1508

Ueda19FR6 OTU 204

Ueda19FR6 OTU 127

Mnn

Tard

a

CloB

olt3

381

UTO

6R2

F

UncMi94

2

MntFervi

Ueda19FR6 OTU 37

Ued

a19F

R6

OTU

186

IGK3

DVV

OTU

137

0

TrePrim7

YP 003227064-1

UncB

5664

Ueda1

9FR6 O

TU 52

0

F2R6 O

TU 18

9

Ueda19FR6 OTU 462

F2R6 OTU 53

IGK3DVV OTU 879

Clo

Diff

6

F2R

6 O

TU 1

WP 009150444-1

Ued

a19F

R6

OTU

18

IGK3DVV OTU 229IG

K3DV

V OTU

139

8

Ueda19FR6 OTU 153

Ueda

19FR

6 O

TU 2

64

Ueda19FR6 OTU 448

Ueda19FR6 OTU 279

F2R6 OTU 113

ThpA

rch2

IGK3DVV OTU 731DlrBaars

RhvSpec3

CloLent7

F2R6 OTU 86

Ued

a19F

R6

OTU

310

Ueda19FR

6 OTU

34

F2R6 O

TU 95

F2R6 O

TU 46

F2R6 OTU 34

EubH

alli

F2R6 OTU 349

Ueda19FR6 OTU 158Unc

M1917

F2R6

OTU

89

PhsS

pec3

IGK3D

VV OTU

890

MnpPetr2

Ueda19FR6 OTU 99

IGK3D

VV OTU

175

F2R6 OTU 215

IGK3DVV OTU 1405

IGK3DVV O

TU 43

F2R6 OTU 131

IGK3DVV OTU 78

IGK3D

VV OTU

1431

UncArc55

Unc11086

F2R6 OTU 184

Ueda19FR6 OTU 303

F2R6

OTU 1

54

WP 008383972-1

Unc1

1056

CalSpec5

F2R6 O

TU 370

UncM

1656

F2R6 OTU 245

IGK3DVV OTU 1104

DesKuzn2

Rhb

Spha

6

LptSpec5

WP 008028505-1

Ueda19FR6 OTU 51

Ueda19FR6 OTU 402

IGK3D

VV OTU

346

Ueda19FR6 OTU 482

IGK3DVV OTU 1575

IGK3DVV OTU 1412

Ueda19FR6 OTU 231

Uncu1168

LptSpec7

WP 019506700-1

F2R6 OTU 384

WP 017667988-1

SpiSmara

NP 050935-1

F2R6 OTU 270

Ueda19FR

6 OTU

439

IGK3DVV OTU 1080

F2R6 OTU69

Ued

a19F

R6

OTU

332

F2R

6 O

TU 2

92

AztChroo

Unc17889

IGK3DVV OTU 160

Uncu1155

IGK3DVV OTU 1248

TrePri17

IGK3DVV OTU 188

Ueda19FR6 OTU 194

Met

Acet

9

Ueda19FR6 OTU 469

IGK3

DVV

OTU

439

6811UT

OVV

D3K

GI 1331

UTO

VVD3

KGI

BraSp338

IGK3DVV OTU 1060

Ueda19FR6 OTU 362

F2R6 OTU 299

Mag

Mag

n3

IGK3DVV OTU 1046

WP 006786587-1

UncuS821

YP 002905320-1

F2R6 O

TU 18

Ued

a19F

R6

OTU

260

IGK3DVV OTU 69

IGK3DVV OTU 794

IGK3DVV OTU 373

Rhd

Pal3

7

IGK3DVV OTU 1421

MhrKandl

IGK3DVV OTU 248

Uni

Bac5

5

IGK3DVV OTU 1399

Ueda19FR6 OTU 326

IGK3DVV OTU 394UncB6227

AnaVari7

F2R6

OTU

51

F2R6 O

TU 168

TriE

ryth

IGK3DVV OTU 205

CloSpe23

IGK3

DVV

OTU

196

IGK3DVV OTU 1628

F2R

6 O

TU 3

82

AnaSpe18

IGK3D

VV OTU

190F2R

6O

TU 241

Ued

a19F

R6

OTU

479

Ueda19FR

6 OTU

488

IGK3DVV OTU 1282

IGK3D

VV OTU

1269

UniNi157

IGK3D

VV OTU 45

4

IGK3

DVV

OTU

186

F2R6 OTU 115

IGK3D

VV OTU

158

Ueda

19FR

6 O

TU 1

92

YP 536868-1 Ueda19FR6 OTU 284

054UT

O6

RF91adeU

WP 009021555-1

WP 008064756-1

Ueda

19FR

6 O

TU 1

29

061

UTO

6R2

F

Ued

a19F

R6

OTU

411

F2R6 O

TU 18

6

Ueda19FR6 OTU 206

Ueda19FR6 OTU 320

TrePrim5

DsfGigas

Unc

N14

33

DscAlken

UniNi154

F2R6 O

TU 290

IGK3DVV OTU 66

IGK3D

VV OTU

24

F2R6 OTU 87IG

K3DVV O

TU 1019

IGK3DVV OTU 1584

WP 013066431-1

Ued

a19F

R6

OTU

393

DysGadei

F2R6 OTU 77

MnpPetro

OslTric4

IGK3

DVV

OTU

493

WP 002727467-1

Ueda19FR

6 OTU

144

IGK3DVV OTU 1458

IGK3DVV OTU 983

IGK3D

VV OTU

298

MnlPalu

d

Mog

Spec

2

MhhPalu4

IGK3DVV O

TU 416

IGK3DVV OTU 335

IGK3D

VV OTU

119

F2R6 OTU 252

DenA

ceti

Ueda19FR

6 OTU

155

IGK3D

VV OTU

75

CloL

jun3

IGK3DVV O

TU 1113

UncM2082

334

UTO

6RF

91ad

eU

Ueda19FR

6 OTU

43

IGK3DVV O

TU 613

F2R6 OTU 2

MhbSpec2

Ueda

19FR

6 OT

U 47

6

UncMi88

7

Ueda19FR

6 OTU

400

MtbTund2

Ueda19FR

6 OTU

2

DlfAceto

Ueda19FR6 OTU 473

IGK3DVV OTU 1430

IGK3DVV OTU 89

791

UTO

6RF

91ad

eU

UniNi158

MhbSpe

c8

592UT

O6

R2F

DstHafn5

YP 002600955-1

IGK3DVV O

TU 453

F2R6 OTU 257

771

UTO

6R2

F

953

UTO

6R2

F

IGK3DVV O

TU 1353

Ued

a19F

R6

OTU

69

IGK3DVV OTU 22

IGK3DVV OTU 804

Ueda19FR6 OTU 136

PaeSpe

35

Ueda19FR6 OTU 102

Unc10879

F2R6

OTU

309

IGK3

DVV

OTU

582

Ueda19FR6 OTU 50

CloSpe35

Ueda19FR6 OTU 452

655

UTO

VV

D3K

GI

IGK3DVV OTU 1110

RhdPal38

IGK3D

VV OTU

1474

IGK3DVV OTU 68

Mnp

Lim

ic

DsmBacu2

WP 007234731-1

F2R6 O

TU 242

F2R6 OTU 366

SncSpec4

IGK3

DVV

OTU

117

IGK3DVV OTU 626

Ueda19FR6 OTU 304

IGK3

DVV

OTU

159

8

IGK3DVV OTU 546

F2R6 O

TU 273

IGK3DVV OTU 1347

Ueda19FR6 OTU 464

355

UTO

VVD3

KGI

Ueda19FR6 OTU 339

Ueda19FR6 OTU 331

UncB6103

Ueda19FR6 OTU 232

F2R6 OTU 38

F2R6 OTU 74

F2R6 OTU 91

IGK3D

VV OTU 13

6

WP 006910747-1

PsrA

lact

TreB

ren2

F2R6 OTU 60

Ueda19FR6 OTU 330

IGK3DVV O

TU 970

Unc

u115

3

UncM19

25

IGK3DVV OTU 1077

Rhv

Spec

2

DsfFruc3

WP 009819271-1

EubCellu

Unc13265

WP 010236050-1

IGK3D

VV OTU

1125

IGK3D

VV OTU

545

IGK3DVV OTU 340

IGK3

DVV

OTU

917

F2R6 O

TU 18

7

Ueda19FR6 OTU 383

IGK3DVV O

TU 1580

F2R6 O

TU 20

8

TreBrenn

Ueda19FR6 OTU 110

Ueda19FR6 OTU 361

IGK3D

VV OTU

427

F2R6 O

TU 100

322UT

O6

R2F

MtdInfer

6851UT

OVV

D3KGI

IGK3DVV OTU 418

SelS

pec8

IGK3DVV OTU 713

Ued

a19F

R6

OTU

7

662

UTO

6R2

F

IGK3DVV OTU 1388

F2R6

OTU

170

F2R

6 O

TU 6

6

F2R6 OTU 150

IGK3DVV OTU 1165

F2R6 OTU 378

BraJap13

Ueda19FR6 OTU 74R

hsC

ente

F2R6 OTU 300

Uni

Bac6

2

Ueda

19FR

6 O

TU 4

46

IGK3DVV OTU 941

IGK3DVV O

TU 1038

Ueda19FR6 OTU 422

Unc

1550

1

Ueda19FR6 OTU 84

F2R6 O

TU 122

IGK3DVV OTU 573

IGK3DVV O

TU 1136

Ueda19FR6 OTU 245

IGK3DVV OTU 428

Rum

Spe1

3

NovSpeci

UncM1710

UncM1559

IGK3DVV OTU 1220

UncM13

26

F2R6 OTU 214

Ueda19FR6 OTU 168

IGK3

DVV

OTU

812

IGK3DVV OTU 1163

NatAz

oti

MtmMeth5

IGK3DVV O

TU 1037

Ueda19FR6 OTU 145

IGK3D

VV OTU 22

7

WP 017712249-1

Ueda19FR6 OTU 238

IGK3

DVV

OTU

602

F2R6 O

TU 353

IGK3DVV OTU 38

WP 011662765-1

F2R6 OTU 178

TlmAuens

Ueda19FR6 OTU 294

UncDesul

IGK3DVV OTU 97

RhmVann2

F2R6 OTU 119

TreAzot5

Clo

Ster

2

Ueda19FR6 OTU 293

IGK3DVV OTU 1541

Ueda19FR6 OTU 285

Ueda19FR

6 OTU

287

Uncu1184

F2R6 OTU 180

IGK3DVV OTU 600

Ueda19FR6 OTU 175

MhbIvano

IGK

3DV

V O

TU 1415

NosSpe23

NosSpec9

IGK3DVV OTU 1129

Ueda19FR6 OTU 271

F2R6 O

TU 36

8

UncM17

54

LptSpec9

IGK3DVV OTU 1600

WP 018406824-1

IGK3

DVV

OTU

26

PaeSab

i7

F2R6 OTU 124

Uncu1142

MtdJanna

IGK3

DVV

OTU

130

3

CntSpe19

UncB

6106

UncMi93

0

Rum

Albu

s

Ued

a19F

R6

OTU

344

F2R6

OTU

56

IGK3D

VV OTU

355IGK3DVV OTU 468

IGK3D

VV OTU

634

LchBact2

IGK3DVV O

TU 1473

IGK3D

VV OTU

1487

512

UTO

VVD3

KGI

UncM15

65

IGK3D

VV OTU

40

214UT

O6

RF91adeU

IGK3DVV OTU 1197

LepFerr3

Ueda1

9FR6 O

TU 35

2

F2R6 OTU 265

UncN

1162

IGK3DVV OTU 1234

IGK3

DVV

OTU

136

5

Ueda19FR6 OTU 438

IGK3DVV OTU 1258

F2R6 OTU 387

F2R

6 O

TU 2

13

WP 007039482-1

UniN

i132

ThpArcha

F2R6 OTU 67

904UT

O6

RF91adeU

IGK3DVV OTU 720

UniNi165

Ueda19FR

6 OTU

334

F2R6 OTU 196

CloSpe29

Ueda19FR6 OTU 345

UncM

1704

Ueda19FR6 OTU 22

F2R6 OTU 284

IGK3D

VV OTU

238

UncM19

49

Ueda19FR6 OTU 474

UncB6134

UncPr

208

IGK3DVV OTU 674

Clo

Spe3

1

IGK3DVV OTU 1304

LptBory3

WP 013418673-1

IGK3DVV O

TU 1266Unc

Mi964

F2R6 OTU 303

IGK3DVV OTU 1520

Ueda19FR6 OTU 161

UncB

5847

WP 019425988-1

WP 014414765-1

F2R6 O

TU 357

UniNi136

IGK3DVV OTU 1595

SinF

re47

IGK3DVV OTU 306

PlsFer33

IGK3DVV OTU 309

IGK3DVV OTU 1299

Unc10688

DssThiod

IGK3

DVV O

TU 85

3

PirStale

UncB

5668

F2R6 OTU 212

WP 012333477-1

IGK3D

VV OTU

254

Ueda19FR6 OTU 274

IGK3

DVV

OTU

800

F2R6 O

TU 211

F2R6 O

TU 192

Rhi

Le21

3IGK3D

VV OTU

31

Meg

Spec

9

Ueda19FR

6 OTU

146

F2R6 OTU 197

F2R

6 O

TU 3

60

RumSpec7

58UT

O6

R2F

Ued

a19F

R6

OTU

32

IGK3D

VV OTU

909

F2R6 OTU 235

WP 002725185-1

Ueda19FR6 OTU 147

PaeDur13

CloC

itro

IGK3DVV OTU 192

IGK3DVV OTU 388

Ued

a19F

R6

OTU

52

RhbCap

s7

GeoBemid

Ueda19FR6 OTU 327

IGK3DVV O

TU 1182

CloB

olte

Ueda

19FR

6 O

TU 2

7

IGK3DVV OTU 639

HolFoet2

VerBacte

Ueda19FR6 OTU 363

F2R6 OTU 175

IGK3D

VV OTU

1380

F2R6 OTU 283

F2R6 O

TU 179

IGK3DVV OTU 483

IGK3DVV OTU 818

WP 014778424-1

F2R

6 O

TU 2

1

Uni

Bac6

3

IGK3

DVV

OTU

129

6

Ueda19FR6 OTU 378

F2R6 OTU 307

MasLam34

IGK3DVV O

TU 1064

Uni

Nit8

3

Rhm

Vanni

Mnc

Bour

g

IGK3DVV OTU 1620

IGK3D

VV OTU

801

F2R6 OTU 112

IGK3D

VV OTU

334

IGK3DVV OTU 692

AnaVari2

IGK3DVV OTU 1562

WP 019016010-1

F2R6 O

TU 75

Mhc

Labr

e

F2R6 OTU 348

DsfSpec8

F2R6 OTU 352

UniNit74

IGK3DVV OTU 425

Ueda19FR6 OTU 173

281UT

O6

RF91adeU

DslYoung

IGK3D

VV OTU

1175

592

UTO

6RF

91ad

eU

Clo

Ther

m

IGK3DVV OTU 163

F2R6 OTU 354

IGK3D

VV OTU

1291

IGK3

DVV

OTU

489

WP 012591049-1

F2R6 O

TU 225

TreS

ocra

MhmTher2

Ueda

19FR

6 OT

U 52

1

Ueda19FR6 OTU 486

WP 006724838-1

Mhc

Labr

3

5611

Ncn

U

UniN

i114

9261

UTO

VVD3

KGI

IGK3

DVV

OTU

174

IGK3DVV OTU 533

F2R

6 O

TU 2

53

PreB

ryan

Ued

a19F

R6

OTU

227

Uni

Ni1

47

IGK3D

VV OTU

610

F2R6 O

TU 243

BacA

zoto

IGK3

DVV

OTU

445

IGK3D

VV OTU

1592

IGK3DVV OTU 969

IGK3DVV OTU 1108

F2R6 O

TU 54

UncB5752

IGK3DVV O

TU 1619

IGK3DVV OTU 1295

Ueda19FR6 OTU 1

Ueda19FR6 OTU 55

Ueda19FR

6 OTU

131

Ueda19FR6 OTU 449AlkM

etal

IGK3

DVV

OTU

164

F2R6 OTU 55

WP 011388378-1

WP 007164986-1

BrvLinen

IGK3DVV OTU 105

IGK3DVV OTU 1278

IGK3D

VV OTU

1625

IGK3DVV OTU 294

F2R6 OTU 205

ThxNivea

BraSpe14

F2R6 O

TU 134

Ueda19FR

6 OTU

318

IGK3

DVV

OTU

620

F2R6

OTU

282

NitFixin

IGK3DVV OTU 1515

Ueda19FR6 OTU 16

IGK3DVV OTU 775

PlbCarbi

UncMi94

8

F2R6 OTU 92

IGK3D

VV OTU

440

Ueda19FR6 OTU 316

Ueda19FR6 OTU 273

F2R6 O

TU 375

Ueda19FR6 OTU 397

Meg

Gen

om

IGK3DVV OTU 265

F2R6 OTU 199

Ueda19FR6 OTU 49

UncB5697

IGK3D

VV OTU

607

IGK3DVV O

TU 727

052

UTO

VVD3

KGI

WP 007067478-1 F2R6 OTU 323

Uncu1196

Ueda19FR6 OTU 349

IGK3DVV OTU 149

IGK3DVV OTU 1061

UniNi162

AnaSpe21

F2R6

OTU 8

UncM19

78

IGK3DVV OTU 475

F2R

6 O

TU 3

67

F2R6 O

TU 371

IGK3D

VV OTU

654

Ueda19FR6 OTU 481

Ueda19FR6 OTU 369

F2R6 O

TU 52

F2R6

OTU

49

WP 002720427-1

Ueda19FR6 OTU 24

PhsS

peci

6031

UTO

VV

D3K

GI

F2R6 O

TU 33

8

Unc

N16

40

Ued

a19F

R6

OTU

394

UncB5741

DsfSpe11

UniNi200

IGK3DVV OTU 159

F2R6 OTU 116

WP 011566468-1

Ueda19FR6 OTU 322

IGK3DVV OTU 60

Ueda19FR6 OTU 130

IGK3D

VV OTU

984

IGK3DVV OTU 1116

Ueda19FR6 OTU 408

Ueda19FR6 OTU 219

NosSpe24

IGK3

DVV

OTU

364

CalSpec6

Ueda19FR6 OTU 429

Ueda19FR6 OTU 157

TrhDrews

UncM18

32

948UT

OV

VD3K

GI

713UT

O6

R2F

NosSpe20

Ueda19FR

6 OTU

441

Ueda19FR6 OTU 512

IGK3DVV OTU 1305

CanMeth

a

Ueda19FR6 OTU 54

F2R6 OTU 301

IGK3DVV OTU 1397

MlsPsy

ch

Ueda19FR

6 OTU

516

WP 015684462-1

IGK3DVV O

TU 641

F2R6 OTU 104

Ueda19FR6 OTU 355

F2R6 O

TU 50

IGK3

DVV

OTU

102

2

UncM18

79

MtrSmith

Ueda19FR6 OTU 343

Ueda19FR6 OTU 523Sl

rKuji

e

IGK3

DVV

OTU

571

IGK3DVV OTU 210

IGK3

DVV

OTU

81

SlaE

xigu

Unc

B601

4

Ueda19FR

6 OTU

430

Ueda

19FR

6 O

TU 3

47

WP 011390728-1

IGK3D

VV OTU

499

Unc

M22

50

SngS

peci

Ueda19FR6 OTU 291

F2R6 O

TU 14

Clo

Spe3

3

IGK3D

VV OTU

1217

TetElong

Ueda19FR6 OTU 337

IGK3

DVV

OTU

119

9

WP 012567286-1

YP 636005-1F2R

6 OTU

285

BurPhym3

Ueda

19FR

6 O

TU 3

42

UniNi120

UncM1285

IGK3

DVV

OTU

136

6

SpiTherm

Ueda19FR6 OTU 515

WP 015107703-1

F2R6 OTU 336

8401UT

OVV

D3K

GI

IGK3D

VV OTU

885

IGK3DVV OTU 33

IGK3DVV OTU 1130

ArcS

peci

IGK3DVV OTU 1417IGK3DVV OTU 746

F2R

6 O

TU 1

06

Ueda19FR6 OTU 46

WP 007423561-1

TrePri14

IGK3DVV OTU 297

IGK3DVV OTU 989

UncPr2

26

VibNatr2

IGK3D

VV OTU

2

MnrIgne2

DesAcet3

F2R6 OTU 45

Ueda19FR6 OTU 124

FirB

act5

Ued

a19F

R6

OTU

89

IGK3DVV OTU 415

F2R6 OTU 333

F2R6 OTU 330

IGK3DVV OTU 1057

IGK3DVV OTU 371

UncB5947

F2R6 OTU 61

Ueda19FR6 OTU 156

Ueda19FR6 OTU 357

UncM19

87

MhtCon

ci

Unc1

3332

IGK3DVV OTU 240

Rum

Torq

2

Ueda1

9FR6 O

TU 44

4

Ueda19FR6 OTU 477

Ueda19FR6 OTU 410

F2R6 OTU 247

IGK3DVV OTU 1137

MnlP

alu2

YP 002600835-1

PhsS

pec4

Ueda19FR

6 OTU

31

Unc15860

Bctrm

S50

IGK3DVV OTU 642

F2R6

OTU

229

IGK3D

VV OTU

951

Ueda19FR6 OTU 240

F2R6 O

TU 32

7

Ued

a19F

R6

OTU

14

F2R6 OTU 188

Ueda19FR6 OTU 122

Unc

1117

7

IGK3DVV O

TU 165

IGK3DVV OTU 184

Ueda

19FR

6 OTU

252

F2R6 OTU 37

UncM22

53

SpoO

vata

Ueda19FR6 OTU 228

IGK3DVV OTU 195

701

UTO

6R2

F

Ueda19FR6 OTU 139

F2R6 O

TU 11

7

ThsIndic

UniNi159

YP 003097534-1

BraSp344

F2R6 OTU 70

UncNi890

F2R6

OTU

167

IGK3D

VV OTU

1065

NosSpe53WP 006615531-1

MtmMeth3

F2R6 OTU 108

091

UTO

6R2

FIGK3DVV OTU 1222

IGK3DVV OTU 1372

IGK3DVV OTU 124

F2R6 OTU 203

Ueda19FR6 OTU 496

WP 010236086-188

5UT

OV

VD3

KGI

102UT

O6

RF91adeU

F2R6 O

TU 278

UncN1652

IGK3DVV OTU 1308

RotMuci3

Clo

Acid

u

WP 015170753-1

Ueda19FR6 OTU 133TrePrim8

RsbS

pec3

Rhi

Mul

11

IGK3

DVV

OTU

958

IGK3DVV OTU 1181

UncN1646

CloPas24

BraJap30

MgnSpeci

F2R6 OTU 373

LepSpeci

IGK3DVV O

TU 1638

3941UT

OV

VD3

KGI

Ueda19FR6 OTU 358

IGK3DVV OTU 495

WP 011124698-1

Ueda19FR

6 OTU

211

UncB5693

RubB

enzo

2ccaSol

CWP 006787504-1

TreAzot6

IGK3

DVV

OTU

872

Ueda19FR6 OTU 210

EubLimos

MtnTher3

IGK3DVV OTU 1540

UncN1511

Ueda19FR6 OTU 4

MetBarke

UncPr214

Unc10808

UncM12

34

DstH

af23

Unc

1550

2

WP 010162294-1

Ueda19FR6 OTU 82

F2R

6 O

TU 7

8

WP 018406854-1

WP 008296315-1

Clo

Spe4

6

IGK3D

VV OTU

266

WP 020086068-1

DorF

orm

8

IGK3DVV O

TU 835

Unc13347

Ueda19FR

6 OTU

108

IGK3D

VV OTU

178

IGK3DVV OTU 70

Ueda19FR6 OTU 365

UncMi91

6

Ueda19FR6 OTU 324

Ueda19FR6 OTU 112

Ueda19FR6 OTU 406

TrmLitor

F2R6 O

TU 216

UncN1607

UncArc84

Ueda

19FR

6 O

TU 2

44

Clo

Sacc

h

Unc

N16

49

XenSpec2

FirB

act2

UncM1452

IGK3D

VV OTU 69

8

WP 009470388-1

F2R6 O

TU 219

IGK3DVV OTU 1224

UncM

et11

Ueda19FR6 OTU 451

F2R6 OTU 145

IGK3D

VV OTU

1349

XanAuto2

FraS

p101

Ueda19FR6 OTU 368

623UT

O6

R2F

NP 848120-2

LchB

acte AcvC

ellu

McrLute4

F2R6 O

TU 114

Ueda

19FR

6 O

TU 1

49

Ueda19FR6 OTU 100

F2R6

OTU

165

Uncu1118

WP 020086085-1

CloC

arbo

UniNi152

WP 019429094-1

IGK3DVV OTU 1481

Ueda

19FR

6 O

TU 2

00

IGK3

DVV

OTU

36

Dia

Invi

3

IGK3DVV OTU 672

F2R6 OTU 26

MhdBurt

o

F2R6 O

TU 337

OpiBact7

F2R6 OTU 383

IGK3

DVV

OTU

142

Ueda19FR6 OTU 150

IGK3

DVV

OTU

156

RhdPalu

3

632UT

OVV

D3K

GI

RsfCaste

F2R6 OTU 268

MhnAeol4

F2R6 OTU 82

F2R6 OTU 341

IGK3

DVV

OTU

598

ClrBacte

Ueda19FR6 OTU 230

085UT

OVV

D3KGI

IGK3DVV OTU 661

MncSpec2

IGK3D

VV OTU

882

WP 019903624-1

F2R6 OTU 331

924UT

OVV

D3KGI

F2R6 OTU 43

Rum

Spec

2

EntRadi2

Ueda19FR

6 OTU

384

IGK3DVV O

TU 1042

CloB

artl

MtdVillo

Ueda19FR

6 OTU

427

AceWood4

Ueda1

9FR6 O

TU 51

1

Ueda19FR6 OTU 487

CalSpe10

IGK3DVV OTU 520

IGK3D

VV OTU

268

UncM19

11

IGK3DVV O

TU 725

Unc

N15

12

DstHaf25

IGK3D

VV OTU

338

CloS

pe32

Ueda19FR6 OTU 319

F2R6 OTU 380

CloB

ot22

Ued

a19F

R6

OTU

298

F2R6 OTU 256

F2R6 OTU 80

Ueda19FR6 OTU 121

UncM14

51

NP 958366-1

CopC

atu2

UncPr2

11

Unc

N14

44

Ueda19FR6 OTU 80

RivSpeci

RhbSph

ae

UncB5676

MhnAeol3

IGK3DVV OTU 281

IGK3DVV OTU 1386

CloKlu11

F2R

6 O

TU 2

01

BurUnama F2

R6 O

TU 3

32

Ueda19FR6 OTU 170

Uncu1146

AzrSpec8

F2R6 OTU 79

IGK3D

VV OTU

71

F2R6 OTU 109

Ueda1

9FR6 O

TU 277

F2R6

OTU

83

IGK3DVV OTU 863

F2R6 OTU 90

Ueda19FR6 OTU 17

F2R6 OTU 23

47UT

OVV

D3K

GI

IGK3

DVV

OTU

141

4

IGK3DVV O

TU 1460

IGK3DVV OTU 413

IGK3

DVV O

TU 8

2

AnmSpec3

874

UTO

VVD3

KGI

CopC

omes

IGK3D

VV OTU

329

Ued

a19F

R6

OTU

152

IGK3DVV OTU 314

F2R6 OTU 210

IGK3DVV OTU 382

UncN1513

IGK3D

VV OTU

1232

UncM

1330

693UT

O6

RF91adeU

F2R6 OTU 231

DsfSpec3

F2R6 OTU 5

AnmSpeci

F2R6 OTU 314

UncM15

66

CloPapyr

F2R6 OTU 102

IGK3DVV OTU 1393

Ueda19FR6 OTU 403

UncNi866

F2R6 O

TU 234

Unc15841

IGK3DVV OTU 707

IGK3DVV OTU 1348

Uncu1188

UncM17

78

F2R6 O

TU 280

IGK3D

VV OTU

1609

UncB

6464

F2R6 OTU 25

Ueda19FR6 OTU 176Unc

Mi920

Unc

N15

19

5545

1cn

U

IGK3D

VV OTU 13

42

F2R6 OTU 388

DslSpeci

Ueda19FR6 OTU 86

Ueda19FR6 OTU 278

IGK3DVV OTU 351

F2R6 O

TU 171

F2R6 OTU 153

YP 003289259-1

IGK3DVV OTU 665

IGK3DVV OTU 1316

IGK3DVV OTU 1260

WP 007814253-1

SpiTher2

702UT

OV

VD3K

GI

F2R6 O

TU 17

3

UncMi94

3

IGK3D

VV OTU

211

IGK3DVV OTU 515

Ued

a19F

R6

OTU

44

IGK3

DVV

OTU

150

7

F2R6 OTU 335

Unc11052

UncNi894

782

UTO

6R2F

WP 020042510-1

Ueda19FR6 OTU 360

IGK3

DVV

OTU

122

IGK3DVV OTU 732

WP 012258413-1

LptSpec6

F2R6 OTU 304

Ueda19FR6 OTU 466

Mog

Spec

i

Ueda1

9FR6 O

TU 70

UniNi172

IGK3D

VV OTU

500

982

UTO

6R2

F

UncPr322

IlyPolyt

Ueda19FR6 OTU 118

UncB6136

Uni

Ni1

53

Ueda19FR6 OTU 59

732UT

O6

R2F

IGK3DVV OTU 1032

IGK3DVV OTU 993

F2R6 OTU 238

DorL

ongi

IGK3D

VV OTU

1074

F2R6 O

TU 244

Ueda19FR6 OTU 281

Ueda19FR6 OTU 261

Ueda

19FR

6 O

TU 4

55

IGK3D

VV OTU

537

392

UTO

6R2

F

AnaVari8Do

rFor

m5

IGK3

DVV

OTU

393

IGK3

DVV

OTU

905

IGK3

DVV

OTU

783

Ued

a19F

R6

OTU

26

WP 009625694-1

Rhi

Gal

eg

UncN1811

Ueda19FR6 OTU 216

DorF

orm

i

Unc10754

PaeD

ur16

F2R6 OTU 251

F2R6 OTU 194

WP 011157083-1

Cluster ICluster IICluster IIICluster IVbchX, chlL, bchLparA

Primer pairsUeda19F-R6IGK3-DVVF2-R6

nifH & non-nifH clusters

Relative abundance (log)

FIGURE 5 | Unrooted RAxML-EPA tree of nifH deduced amino acid sequences (with at least 133 positions). The nifH and non-nifH (homologous genes) clusters arerepresented in different colors. Representative sequences from the “nifH_2014April04.arb” database (Heller et al., 2014) are depicted in black. The phylogeneticplacements of OTU representatives from this study for the three primer pairs are depicted at the terminal nodes: sequences obtained with primer pair Ueda19F–R6are shown in red, IGK3–DVV-derived sequences in blue, and F2–R6-derived sequences in yellow. Outermost bars represent the relative abundance [log(x% + 1)] ofeach OTU.

Taxonomic Description of theDiazotrophic Community Using MultipleClassification Approaches Across thePrimer PairsThe classification of nifH genes is not an easy task due toa lack of taxonomic affiliation to functional gene sequences(Gaby and Buckley, 2011), poor reflection of the nifH phylogenyto the 16S rRNA gene phylogeny (Zehr et al., 2003), andinsufficient information in k-mer frequencies to differentiateamongst the subclusters (Frank et al., 2016) as was also observed

with pmoA sequences (Dumont et al., 2014). As such, wepropose that a combined classification approach, using severalindependent methods might be more beneficial for characterizingnifH sequence datasets. Several approaches have been describedin the literature for rapid, accurate and biologically meaningfulclassification of functional gene sequences in general and nifH inparticular, including nearest neighbor identification in pairwisealignment using FrameBot (Wang et al., 2013), lowest commonancestor parsing of BLAST bit scores using MEGAN (Husonet al., 2007; Dumont et al., 2014), or CART (Frank et al., 2016).Here, we used a combined approach in our pipeline and describe

Frontiers in Microbiology | www.frontiersin.org 10 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 11

Angel et al. Evaluating nifH Primers and NifMAP

the taxonomic affiliation of the sequences using closest relativevia BLASTP, nifH cluster and subcluster affiliation using CART(Frank et al., 2016), and the EPA (Berger et al., 2011) forphylogenetic tree generation.

A representative from each OTU was classified using thebest matching hit in BLASTP (Figure 3), CART (Figure 4),and the EPA for phylogeny (Figure 5). The BLASTP classifiessequences based on their closest relative. This classificationyielded an overall agreement amongst the three primerpairs, with mainly members of the Alphaproteobacteria,Betaproteobacteria, Deltaproteobacteria, Cyanobacteria, andFirmicutes populating the diazotrophs across the environments(Figure 3). However, the relative abundance of these taxonomicgroups varied amongst the investigated primer pairs. Forexample, Cyanobacteria were underrepresented by primer pairF2–R6 in comparison to the other two primer pairs (inconcordance with the performance of primers F2–R6 amplifyingthe mock community). Furthermore, the relative abundance ofAlphaproteobacteria derived in PCRs with primer pair IGK3–DVV was lower in some samples than in PCRs with the othertwo primer combinations, and Firmicutes were comparativelyunderrepresented by primer pair Ueda19F–R6 in some samples(Figure 3). Some taxa (such as the Green Sulfur Bacteria)were completely missing with the IGK3–DVV primer pair,while Euryarchaeota were missing in the Ueda19F–R6 primerpair and at low relative abundance with primer pair F2–R6(Figure 3).

The CART classification assigns sequences to four clusters ofnifH, termed clusters I–IV with subclusters (Zehr et al., 2003;Raymond et al., 2004). The majority of the sequences amongstall primer pairs were classified into cluster I based on theCART classification (Figure 4). This cluster is known to reflect16S rRNA-based phylogeny relatively well (Zehr et al., 2003)and in many environments is the largest and most ecologicallyimportant diazotroph cluster (Gaby and Buckley, 2011). Withincluster I, differences in the relative abundances of certain definedclusters can be observed, some of which are in agreement withthe BLASTP results. For instance, the relative abundance ofsequences in cluster IB (affiliated with Cyanobacteria) derivedfrom PCRs with primer pair F2–R6 was lower than from theother two primer combinations. Furthermore, less sequencesassigned to cluster IJ and IK (containing Alphaproteobacteria)were derived from primer pair IGK3–DVV than in the other twoprimer pairs (Figure 4). The same trend was observed when thesequences were analyzed via BLASTP.

As expected, only a minority of the sequences wereaffiliated with clusters II, III, and IV. All three primer pairsamplified sequences from the same subclusters in clusters II–IV, with some exceptions in clusters III and IV, indicatingpreferential amplifications of subclusters amongst these primersets (Figure 4). Most notably the IGK3–DVV primer pairamplified many sequences from clusters IIIC (affiliated withClostridia) as well as from clusters IVB and IVC. In contrast,primer pairs Ueda19F–R6 and F2–R6 amplified very fewsequences from clusters IIIC, IVB and IVC. A close examinationof the sequence region matching the R6 primer showed thatthe primer has indeed two mismatches at the 3′ end of several

a hb c d e f g mi j k lsoil root or

rhizosphereBSCs aquatic

Avg.

nifH

cop

ies n

g DN

A-1 +/

- stn

d. e

rror

104

102

100

IGK3-DVVF2-R6

Ueda 19F-R6

FIGURE 6 | Average nifH copies per ng DNA ± standard error acrossenvironmental samples based on quantitative PCR using the different primerpairs. The environmental samples include: (a) beech forest soil; (b) meadowsoil; (c) rhizosphere and (d) root samples of Arrhenatherum elatius; (e)rhizosphere and (f) root samples of Oryza sativa; (g) coastal, sub-arcticbiological soil crust (BSC); (h) temperate BSC; (i) high alpine BSC; (j) semiaridBSC; (k) arid BSC; estuarine samples from the (l) Great Belt and (m) RoskildeFjord. Numbers of copies were corrected to exclude non-nifH genes that wereco-amplified using information obtained from amplicon sequencing of thesamples with the specific primer pairs.

Clostridia sequences affiliated with cluster IIIC (with residues CTinstead of GC).

The phylogenetic placement of OTU representatives suggeststhat the sequences are in clusters I–IV (Figure 5). It showeda nearly identical coverage of clusters I and II by all threeprimer pairs, thus indicating that all three primer pairs coverequally well the diazotrophic diversity in these clusters. Althoughall three primer pairs captured sequences from cluster III,discrepancies within subclusters were noted, especially with theprimer pair IGK3–DVV as compared to F2–R6 and Ueda19F–R6. While primers IGK3–DVV captured many sequences insubcluster IIIC, primers F2–R6 and Ueda19F–R6 captured manyfrom subclusters IIIE, IIIL, IIIM, and IIIN, which were largelymissed out by primers IGK3–DVV. Cluster IV sequences werenearly only captured by primer pair IGK3–DVV. However, theecological relevance of cluster IV members is still debatable,as until recently it was considered to be populated only bygenes encoding for functions unrelated to N2 fixation (Raymondet al., 2004; Zheng et al., 2016). Moreover this cluster includesseveral nifH homologs, which might be more abundant than nifHin certain environments. Attempts to use primers that capturesequences from this cluster could inflate datasets with non-targetgenes.

Suitability of nifH Primers for qPCRAssaysThe primer pairs F2–R6, IGK3–DVV, and Ueda19F–R6 werealso evaluated for their suitability in qPCR assays. Sinceusing degenerate primers incurs a potential template-dependedquantification bias, we increased the primer concentration inthese reactions to 1.4 µM. This has been recently shown to

Frontiers in Microbiology | www.frontiersin.org 11 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 12

Angel et al. Evaluating nifH Primers and NifMAP

minimize the effect of this specific bias (Gaby and Buckley,2017) and has increased efficiency in our assays (data notshown). High assay efficiencies across all three primer pairs –Ueda19F–R6 (97.4%), IGK3–DVV (92.5%), and F2–R6 (90.5%) –were attained (Supplementary Figure S2), illustrating that theseprimer sets worked almost equally well and therefore can beused for reliable quantification of nifH genes and transcripts.In a proof-of-principle experiment, we quantified nifH genes inthe environmental samples used in this study across all threeprimer pairs. A caveat when using such quantitative assays isthat qPCR works under the underlying assumption that allquantified templates belong to the target gene. As we havedemonstrated above, this is not the case for many types ofsamples when using general nifH-targeting primers. Therefore,we corrected for the proportion of nifH sequences from thetotal reads based on the sequencing results (Figure 6). Amongstthese diverse sample types, the numbers of nifH copies perng DNA ranged between 1.2 × 101 and 1.8 × 105 with ageometric average of 1.3 × 103. Interestingly, no single primerpair gave consistently higher or lower estimations compared toother primer pairs (P = 0.5, ANOVA test), further reiteratingthat all three primer sets worked equally well. Yet, significantdifferences appeared to be sample-type dependent (P < 0.01,ANOVA test). The estimated nifH copies per ng DNA werecongruent within a given sample amongst the three primer pairs,considering the precision limitations of qPCR (two- to threefolddifference between samples; Hospodsky et al., 2010). However,some exceptions to that observation were notable. For instance,primer pairs F2–R6 and Ueda19F–R6 estimated similar copynumbers in estuarine water samples (Figure 6, samples “l, m”)and the arid crust sample (Figure 6, sample “k”), while primerpair IGK3–DVV estimated an order of magnitude more copies.Furthermore, primers Ueda19F–R6 and IGK3–DVV measuredsimilar copy numbers in the temperate and semiarid BSCs(Figure 6, samples “h, j”) while primers F2–R6 measured anorder of magnitude less copies. This latter observation could beexplained by the fact that primer pair F2–R6 has the tendencyto discriminate against cyanobacteria, which would result ina decreased copy number in these cyanobacteria-dominatedsamples.

CONCLUSION

We tested the nifH-targeting primer combinations Ueda19F–R6,IGK3–DVV, and F2–R6 for their performance in genetic surveysof diazotrophs, thereby elaborating on the in silico analysis ofGaby and Buckley (2012). All primer pairs had a propensity toco-amplify homologs of the nifH gene at varying proportions,which is common in many studies of functional gene diversity.Most severe, primer combination IGK3–DVV had the largesttendency to co-amplify these sequences (which are most closelyrelated to cluster IV), with some samples yielding up to ca.90% homolog sequences. Using a pipeline such as our newlyestablished NifMAP, that permits the detection of non-specificco-amplified reads via specifically designed HMMs, proved to bea useful strategy in filtering out such reads.

When analyzing the specificity of these primers throughthe use of a mock community along with a diverse collectionof environmental samples, we observed some degree of biasamongst the primer pairs. As such, care should be taken whenchoosing primer pairs for diazotrophic diversity investigations.More specifically, primer pair F2–R6 has a propensity todiscriminate against cyanobacteria (amongst others) in our mockcommunities and environmental samples, yet captured manysequences from subclusters IIIE, IIIL, IIIM, and IIIN. Theseaforementioned subclusters were largely missing by the primerpair IGK3–DVV, which also tended to discriminate againstthe Alphaproteobacteria, but amplified many sequences withinclusters IIIC (affiliated with Clostridia) and clusters IVB andIVC. Primer pair Ueda19F–R6 exhibited the least bias basedon our mock community analysis and successfully captureddiazotrophs in cluster I as well as in subclusters IIIE, IIIL,IIIM, and IIIN, but discriminated against Firmicutes (basedon BLASTP analysis) and subcluster IIIC. Thus, depending onthe investigated environmental sample and the aforementionedprimer performance, one should choose the appropriate primercombination. Apart from providing useful analysis protocolsand standards for the study of environmental diazotrophs, ourwork highlights some of the important pitfalls and caveats ofstudying nifH, and potentially other functional genes in theenvironment. Furthermore, these primer pairs can be usedin targeted functional investigations in tandem with activitymeasurements of N2 fixation (Angel et al., 2018) to betterunderstand the ecophysiology of diazotrophs in the environmentand could potentially be used to identify target groups fordownstream single-cell analyses (Eichorst et al., 2015; Woebkenet al., 2015). We hope this knowledge, along with our newlydevelop pipeline (NifMAP) will aide investigators to better detectand capture diazotrophs in their respective environment.

AUTHOR CONTRIBUTIONS

RA, SE, and DW designed the study and participated in writing.RA, CP, and HS extracted DNA and performed the PCRs.MN and HS constructed the mock community. RA, MN, andCH established and tested the analysis pipeline, NifMAP. RAanalyzed the data.

FUNDING

This work was supported by an Austrian Science Fund (FWF)project grant (P25700-B20 to DW and SE). SE was supportedby the Austrian Science Fund (FWF) project grant (P26392-B20 to DW and SE). CP and HS were supported by an ERCStarting grant (Grant Agreement No: 636928 to DW) fromthe European Research Council (ERC) under the EuropeanUnion’s Horizon 2020 Research and Innovation Program. HSwas supported by a Marie Curie Intra European Fellowshipwithin the 7th European Community Framework Program(Grant Agreement No: 628361). MN was funded by a fellowshipfrom the Austrian Academy of Sciences (ÖAW). RA was alsosupported by the Ministry of Education, Youth and Sports

Frontiers in Microbiology | www.frontiersin.org 12 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 13

Angel et al. Evaluating nifH Primers and NifMAP

of the Czech Republic – MEYS (projects LM2015075 andEF16_013/0001782).

ACKNOWLEDGMENTS

The authors are grateful to Lasse Riemann for providing estuarinesamples, Gerhard Karrer for granting access to the samplingsite at Lainzer Tiergarten, Osnat Gillor and Burkhard Büdel forassistance in obtaining biological soil crust samples, Erich M.Pötsch for his assistance with sampling Arrhenatherum elatiusplants at AREC Raumberg-Gumpenstein, Michael Schagerl forproviding a culture of A. torulosa and N. microscopicum,

Stefanie Wienkoop for providing a culture of M. loti, BelaHausmann for providing gDNA of Desulfosporosinus acidophilusand Telmatospirillum siberiense, David Seki for his assistance withthe nifH mock community and culture of K. sacchari, and FlorianHöggerl for his assistance in preparing samples for ampliconsequencing.

SUPPLEMENTARY MATERIAL

The Supplementary Material for this article can be foundonline at: https://www.frontiersin.org/articles/10.3389/fmicb.2018.00703/full#supplementary-material

REFERENCESAlazard, D., Joseph, M., Battaglia-Brunet, F., Cayol, J. L., and Ollivier, B.

(2010). Desulfosporosinus acidiphilus sp. nov.: a moderately acidophilicsulfate-reducing bacterium isolated from acid mining drainagesediments. Extremophiles 14, 305–312. doi: 10.1007/s00792-010-0309-4

Ando, S., Goto, M., Meunchang, S., Thongra-ar, P., Fujiwara, T., Hayashi, H., et al.(2005). Detection of nifH sequences in sugarcane (Saccharum officinarum l.)and pineapple (Ananas comosus [l.] merr.). Soil Sci. Plant Nutr. 51, 303–308.doi: 10.1111/j.1747-0765.2005.tb00034.x

Angel, R. (2012). Total nucleic acid extraction from soil. Protoc. Exch. doi: 10.1038/protex.2012.046

Angel, R., and Conrad, R. (2009). In situ measurement of methane fluxes andanalysis of transcribed particulate methane monooxygenase in desert soils.Environ. Microbiol. 11, 2598–2610. doi: 10.1111/j.1462-2920.2009.01984.x

Angel, R., Panhölzl, C., Gabriel, R., Herbold, C., Wanek, W., Richter, A., et al.(2018). Application of stable-isotope labelling techniques for the detectionof active diazotrophs. Environ. Microbiol. 20, 44–61. doi: 10.1111/1462-2920.13954

Angeringer, W., and Karrer, G. (2008). “Preserving species richness in haymeadows of Vienna’s nature conservation area Lainzer Tiergarten,” in Universityof Natural Resources and Applied Life Sciences, Vienna, Center for EnvironmentalStudies and Nature Conservation (Hrsg.), Preservation of Biocultural Diversity –a Global Issue. Book of Abstracts, ed. U. Pröbstl (Wien: BOKU), 9.

Bentzon-Tilia, M., Traving, S. J., Mantikci, M., Knudsen-Leerbeck, H., Hansen,J. L. S., Markager, S., et al. (2015). Significant N2 fixation by heterotrophs,photoheterotrophs and heterocystous cyanobacteria in two temperate estuaries.ISME J. 9, 273–285. doi: 10.1038/ismej.2014.119

Berger, S. A., Krompass, D., and Stamatakis, A. (2011). Performance, accuracy, andweb server for evolutionary placement of short sequence reads under maximumlikelihood. Syst. Biol. 60, 291–302. doi: 10.1093/sysbio/syr010

Bokulich, N. A., Rideout, J. R., Mercurio, W. G., Shiffer, A., Wolfe, B.,Maurice, C. F., et al. (2016). mockrobiota: a public resource for microbiomebioinformatics benchmarking. mSystems 1:e00062-16. doi: 10.1128/mSystems.00062-16

Boström, K. H., Simu, K., Hagström, A., and Riemann, L. (2004). Optimization ofDNA extraction for quantitative marine bacterioplankton community analysis.Limnol. Oceanogr. Methods 2, 365–373. doi: 10.4319/lom.2004.2.365

Boyd, E. S., Costas, A. M. G., Hamilton, T. L., Mus, F., and Peters, J. W. (2015).Evolution of molybdenum nitrogenase during the transition from anaerobic toaerobic metabolism. J. Bacteriol. 197, 1690–1699. doi: 10.1128/JB.02611-14

Büdel, B., Colesie, C., Green, T. G. A., Grube, M., Suau, R. L., and Loewen-Schneider, K. (2014). Improved appreciation of the functioning and importanceof biological soil crusts in Europe: the soil crust international project (SCIN).Biodivers. Conserv. 23, 1639–1658. doi: 10.1007/s10531-014-0645-2

Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., et al.(2009). BLAST+: architecture and applications. BMC Bioinformatics 10:421.doi: 10.1186/1471-2105-10-421

Caporaso, J. G., Lauber, C. L., Walters, W. A., Berg-Lyons, D., Huntley, J., Fierer, N.,et al. (2012). Ultra-high-throughput microbial community analysis on the

Illumina HiSeq and MiSeq platforms. ISME J. 6, 1621–1624. doi: 10.1038/ismej.2012.8

Caporaso, J. G., Lauber, C. L., Walters, W. A., Berg-Lyons, D., Lozupone, C. A.,Turnbaugh, P. J., et al. (2011). Global patterns of 16S rRNA diversity at adepth of millions of sequences per sample. Proc. Natl. Acad. Sci. U.S.A. 108,4516–4522. doi: 10.1073/pnas.1000080107

Chen, M., Zhu, B., Lin, L., Yang, L., Li, Y., and An, Q. (2014). Completegenome sequence of Kosakonia sacchari type strain SP1T. Stand. Genome Sci. 9,1311–1318. doi: 10.4056/sigs.5779977

Collavino, M. M., Tripp, H. J., Frank, I. E., Vidoz, M. L., Calderoli, P. A.,Donato, M., et al. (2014). nifH pyrosequencing reveals the potentialfor location-specific soil chemistry to influence N2-fixing communitydynamics. Environ. Microbiol. 16, 3211–3223. doi: 10.1111/1462-2920.12423

Deslippe, J. R., Egger, K. N., and Henry, G. H. R. (2005). Impacts of warming andfertilization on nitrogen-fixing microbial communities in the Canadian HighArctic. FEMS Microbiol. Ecol. 53, 41–50. doi: 10.1016/j.femsec.2004.12.002

Dos Santos, P. C., Fang, Z., Mason, S. W., Setubal, J. C., and Dixon, R. (2012).Distribution of nitrogen fixation and nitrogenase-like sequences amongstmicrobial genomes. BMC Genomics 13:162. doi: 10.1186/1471-2164-13-162

Dumont, M. G., Lüke, C., Deng, Y., and Frenzel, P. (2014). Classification ofpmoA amplicon pyrosequences using BLAST and the lowest common ancestormethod in MEGAN. Front. Microbiol. 5:34. doi: 10.3389/fmicb.2014.00034

Edgar, R. C. (2013). UPARSE: highly accurate OTU sequences from microbialamplicon reads. Nat. Methods 10, 996–998. doi: 10.1038/nmeth.2604

Eichorst, S. A., Strasser, F., Woyke, T., Schintlmeister, A., Wagner, M., andWoebken, D. (2015). Advancements in the application of NanoSIMS andRaman microspectroscopy to investigate the activity of microbial cells in soils.FEMS Microbiol. Ecol. 91:fiv106. doi: 10.1093/femsec/fiv106

Farnelid, H., Andersson, A. F., Bertilsson, S., Al-Soud, W. A., Hansen, L. H.,Sørensen, S., et al. (2011). Nitrogenase gene amplicons from global marinesurface waters are dominated by genes of non-cyanobacteria. PLoS One6:e19223. doi: 10.1371/journal.pone.0019223

Fish, J. A., Chai, B., Wang, Q., Sun, Y., Brown, C. T., Tiedje, J. M., et al. (2013).FunGene: the functional gene pipeline and repository. Front. Microbiol. 4:291.doi: 10.3389/fmicb.2013.00291

Fowler, D., Coyle, M., Skiba, U., Sutton, M. A., Cape, J. N., Reis, S., et al. (2013). Theglobal nitrogen cycle in the twenty-first century. Philos. Trans. R. Soc. B Biol. Sci.368:20130164.

Frank, I. E., Turk-Kubo, K. A., and Zehr, J. P. (2016). Rapid annotation of nifH genesequences using classification and regression trees facilitates environmentalfunctional gene analysis. Environ. Microbiol. Rep. 8, 905–916. doi: 10.1111/1758-2229.12455

Fu, L., Niu, B., Zhu, Z., Wu, S., and Li, W. (2012). CD-HIT: accelerated forclustering the next-generation sequencing data. Bioinformatics 28, 3150–3152.doi: 10.1093/bioinformatics/bts565

Gaby, J. C., and Buckley, D. H. (2011). A global census of nitrogenase diversity.Environ. Microbiol. 13, 1790–1799. doi: 10.1111/j.1462-2920.2011.02488.x

Gaby, J. C., and Buckley, D. H. (2012). A comprehensive evaluation of PCR primersto amplify the nifH gene of nitrogenase. PLoS One 7:e42149. doi: 10.1371/journal.pone.0042149

Frontiers in Microbiology | www.frontiersin.org 13 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 14

Angel et al. Evaluating nifH Primers and NifMAP

Gaby, J. C., and Buckley, D. H. (2014). A comprehensive alignednifH gene database: a multipurpose tool for studies of nitrogen-fixing bacteria. Database 2014:bau001. doi: 10.1093/database/bau001

Gaby, J. C., and Buckley, D. H. (2017). The use of degenerate primers in qPCRanalysis of functional genes can cause dramatic quantification bias as revealedby investigation of nifH primer performance. Microb. Ecol. 74, 701–708.doi: 10.1007/s00248-017-0968-0

Gaby, J. C., Rishishwar, L., Valderrama-Aguirre, L. C., Green, S. J., andValderrama-Aguirre, A. (2017). Diazotroph community characterizationvia a high-throughput nifH amplicon sequencing and analysispipeline. Appl. Environ. Microbiol. 84:e01512-17. doi: 10.1128/AEM.01512-17

Hausmann, B., Pjevac, P., Schreck, K., Herbold, C. W., Daims, H., Wagner, M.,et al. (2018). Draft genome sequence of Telmatospirillum siberiense 26-4b1T,an acidotolerant peatland alphaproteobacterium potentially involved insulphur cycling. Genome Announc. 6:e01524-17. doi: 10.1128/genomeA.01524-17

Heller, P., Tripp, H. J., Turk-Kubo, K., and Zehr, J. P. (2014). ARBitrator: asoftware pipeline for on-demand retrieval of auto-curated nifH sequences fromGenBank. Bioinformatics 30, 2883–2890. doi: 10.1093/bioinformatics/btu417

Herbold, C. W., Pelikan, C., Kuzyk, O., Hausmann, B., Angel, R., Berry, D., et al.(2015). A flexible and economical barcoding approach for highly multiplexedamplicon sequencing of diverse target genes. Front. Microbiol. 6:731.doi: 10.3389/fmicb.2015.00731

Holmes, A. J., Costello, A., Lidstrom, M. E., and Murrell, J. C. (1995). Evidencethat participate methane monooxygenase and ammonia monooxygenase maybe evolutionarily related. FEMS Microbiol. Lett. 132, 203–208. doi: 10.1111/j.1574-6968.1995.tb07834.x

Hospodsky, D., Yamamoto, N., and Peccia, J. (2010). Accuracy, precision,and method detection limits of quantitative PCR for airborne bacteriaand fungi. Appl. Environ. Microbiol. 76, 7004–7012. doi: 10.1128/AEM.01240-10

Huson, D. H., Auch, A. F., Qi, J., and Schuster, S. C. (2007). MEGAN analysis ofmetagenomic data. Genome Res. 17, 377–386. doi: 10.1101/gr.5969107

Katoh, K., and Standley, D. M. (2013). MAFFT multiple sequence alignmentsoftware version 7: improvements in performance and usability. Mol. Biol. Evol.30, 772–780. doi: 10.1093/molbev/mst010

Kelly, S., Sullivan, J., Ronson, C., Tian, R., Bräu, L., Munk, C., et al. (2014). Genomesequence of the Lotus spp. Microsymbiont Mesorhizobium loti strain R7A.Stand. Genomic Sci. 9:6. doi: 10.1186/1944-3277-9-6

Kirshtein, J. D., Paerl, H. W., and Zehr, J. (1991). Amplification,cloning, and sequencing of a nifH segment from aquaticmicroorganisms and natural communities. Appl. Environ. Microbiol. 57,2645–2650.

Kozich, J. J., Westcott, S. L., Baxter, N. T., Highlander, S. K., and Schloss, P. D.(2013). Development of a dual-index sequencing strategy and curation pipelinefor analyzing amplicon sequence data on the miseq Illumina sequencingplatform. Appl. Environ. Microbiol. 79, 5112–5120. doi: 10.1128/AEM.01043-13

Marusina, A. I., Boulygina, E. S., Kuznetsov, B. B., Tourova, T. P., Kravchenko,I. K., and Gal’chenko, V. F. (2001). A system of oligonucleotide primersfor the amplification of nifH genes of different taxonomic groupsof prokaryotes. Microbiology 70, 73–78. doi: 10.1023/A:1004849022417

Mistry, J., Finn, R. D., Eddy, S. R., Bateman, A., and Punta, M. (2013).Challenges in homology search: HMMER3 and convergent evolutionof coiled-coil regions. Nucleic Acids Res. 41:e121. doi: 10.1093/nar/gkt263

Pelikan, C., Herbold, C. W., Hausmann, B., Müller, A. L., Pester, M., and Loy, A.(2016). Diversity analysis of sulfite- and sulfate-reducing microorganisms bymultiplex dsrA and dsrB amplicon sequencing using new primers and mockcommunity-optimized bioinformatics. Environ. Microbiol. 18, 2994–3009.doi: 10.1111/1462-2920.13139

Penton, C. R., Johnson, T. A., Quensen, J. F. I., Iwai, S., Cole, J. R., and Tiedje, J. M.(2013). Functional genes to assess nitrogen cycling and aromatic hydrocarbondegradation: primers and processing matter. Front. Microbiol. 4:279.doi: 10.3389/fmicb.2013.00279

Pinto, A. J., and Raskin, L. (2012). PCR biases distort bacterial and archaealcommunity structure in pyrosequencing datasets. PLoS One 7:e43093.doi: 10.1371/journal.pone.0043093

Pötsch, E. M., Resch, R., and Buchgraber, K. (2013). Auswirkung der Düngungauf P-Gehaltswerte im boden und futter sowie P-bilanzen von grünlandflächen.Alpenländisches Exp. 18, 41–50.

Pruitt, K. D., Tatusova, T., and Maglott, D. R. (2005). NCBI ReferenceSequence (RefSeq): a curated non-redundant sequence database of genomes,transcripts and proteins. Nucleic Acids Res. 33, D501–D504. doi: 10.1093/nar/gki025

Rasche, F., Knapp, D., Kaiser, C., Koranda, M., Kitzler, B.,Zechmeister-Boltenstern, S., et al. (2011). Seasonality and resourceavailability control bacterial and archaeal communities in soils ofa temperate beech forest. ISME J. 5, 389–402. doi: 10.1038/ismej.2010.138

Raymond, J., Siefert, J. L., Staples, C. R., and Blankenship, R. E. (2004). The naturalhistory of nitrogen fixation. Mol. Biol. Evol. 21, 541–554. doi: 10.1093/molbev/msh047

Rubio, L. M., and Ludden, P. W. (2002). CHAPTER 4 - The Gene Products ofthe nif Regulon A2 - Leigh, G. Jeffery. Nitrogen Fixation at the Millennium.Amsterdam: Elsevier Science, 101–136. doi: 10.1016/B978-044450965-9/50004-5

Schirmer, M., Ijaz, U. Z., D’Amore, R., Hall, N., Sloan, W. T., and Quince, C.(2015). Insight into biases and sequencing errors for amplicon sequencing withthe Illumina MiSeq platform. Nucleic Acids Res. 43:e37. doi: 10.1093/nar/gku1341

Schloss, P. D., Gevers, D., and Westcott, S. L. (2011). Reducing the effects of PCRamplification and sequencing artifacts on 16S rRNA-based studies. PLoS One6:e27310. doi: 10.1371/journal.pone.0027310

Singer, E., Andreopoulos, B., Bowers, R. M., Lee, J., Deshpande, S.,Chiniquy, J., et al. (2016). Next generation sequencing data of a definedmicrobial mock community. Sci. Data 3:160081. doi: 10.1038/sdata.2016.81

Stamatakis, A. (2014). RAxML Version 8: a tool for phylogenetic analysis andpost-analysis of large phylogenies. Bioinformatics 30, 1312–1313. doi: 10.1093/bioinformatics/btu033

Staples, C. R., Lahiri, S., Raymond, J., Herbulis, L. V., Mukhophadhyay, B.,and Blankenship, R. E. (2007). Expression and association of group IVnitrogenase nifD and nifH homologs in the non-nitrogen-fixing archaeonMethanocaldococcus jannaschii. J. Bacteriol. 189, 7392–7398. doi: 10.1128/JB.00876-07

Stüeken, E. E., Buick, R., Guy, B. M., and Koehler, M. C. (2015).Isotopic evidence for biological nitrogen fixation by molybdenum-nitrogenase from 3.2 Gyr. Nature 520, 666–669. doi: 10.1038/nature14180

Ueda, T., Suga, Y., Yahiro, N., and Matsuguchi, T. (1995). Remarkable N2-fixingbacterial diversity detected in rice roots by molecular evolutionary analysis ofnifH gene sequences. J. Bacteriol. 177, 1414–1417. doi: 10.1128/jb.177.5.1414-1417.1995

Wang, Q., Quensen, J. F., Fish, J. A., Lee, T. K., Sun, Y., Tiedje, J. M., et al. (2013).Ecological patterns of nifH genes in four terrestrial climatic zones exploredwith targeted metagenomics using framebot, a new informatics tool. mBio4:e00592-13. doi: 10.1128/mBio.00592-13

Wertz, J., Kim, E., Breznak, J. A., Schmidt, T. M., and Rodrigues, J. L. M.(2012). Genomic and physiological characterization of the VerrucomicrobiaDiplosphaera colitermitum gen. nov., sp. nov., reveals microaerophily andnitrogen fixation genes. Appl. Environ. Microbiol. 78, 1544–1555. doi: 10.1128/AEM.06466-11

Woebken, D., Burow, L. C., Behnam, F., Mayali, X., Schintlmeister, A., andFleming, E. D. (2015). Revisiting N2 fixation in Guerrero Negro intertidalmicrobial mats with a functional single-cell approach. ISME J. 9, 485–496.doi: 10.1038/ismej.2014.144

Zani, S., Mellon, M. T., Collier, J. L., and Zehr, J. P. (2000). Expression ofnifH genes in natural microbial assemblages in Lake George, New York,detected by reverse transcriptase PCR. Appl. Environ. Microbiol. 66, 3119–3124.doi: 10.1128/AEM.66.7.3119-3124.2000

Zehr, J. P., Jenkins, B. D., Short, S. M., and Steward, G. F. (2003).Nitrogenase gene diversity and microbial community structure: a cross-system

Frontiers in Microbiology | www.frontiersin.org 14 April 2018 | Volume 9 | Article 703

fmicb-09-00703 April 27, 2018 Time: 16:27 # 15

Angel et al. Evaluating nifH Primers and NifMAP

comparison. Environ. Microbiol. 5, 539–554. doi: 10.1046/j.1462-2920.2003.00451.x

Zehr, J. P., and McReynolds, L. A. (1989). Use of degenerateoligonucleotides for amplification of the nifH gene from the marinecyanobacterium Trichodesmium thiebautii. Appl. Environ. Microbiol. 55,2522–2526.

Zheng, H., Dietrich, C., Radek, R., and Brune, A. (2016). Endomicrobiumproavitum, the first isolate of Endomicrobia class. nov. (phylumElusimicrobia) – an ultramicrobacterium with an unusual cell cycle thatfixes nitrogen with a Group IV nitrogenase. Environ. Microbiol. 18, 191–204.doi: 10.1111/1462-2920.12960

Conflict of Interest Statement: The authors declare that the research wasconducted in the absence of any commercial or financial relationships that couldbe construed as a potential conflict of interest.

Copyright © 2018 Angel, Nepel, Panhölzl, Schmidt, Herbold, Eichorst and Woebken.This is an open-access article distributed under the terms of the Creative CommonsAttribution License (CC BY). The use, distribution or reproduction in other forumsis permitted, provided the original author(s) and the copyright owner are creditedand that the original publication in this journal is cited, in accordance with acceptedacademic practice. No use, distribution or reproduction is permitted which does notcomply with these terms.

Frontiers in Microbiology | www.frontiersin.org 15 April 2018 | Volume 9 | Article 703