Characterizing the floral resources of a North American metropolis using a honey bee foraging assay
Abstract
Roughly a third of described insect species visit flowers, making the flower–insect interface one of the chief pillars of global biodiversity. Studying flower–insect relationships at the scale of communities and landscapes has been hindered, however, by the methodological challenges of quantifying landscape-scale floral resources. This challenge is especially acute in urban landscapes, where traditional floral surveying techniques are ill-suited to the unique constraints of built environments. To surmount these challenges, we devised a “honey bee foraging assay” approach to floral resource surveying, wherein continuous colony weight tracking and DNA metabarcoding of pollen samples are used to capture both the overall availability and taxonomic composition of floral resources. We deploy this methodology in the complex urban ecosystem of Philadelphia, Pennsylvania, USA. Our results reveal distinct seasonality of floral resource availability, with pulses of high availability in May, June, and September, and a period of prolonged scarcity in August. Pollen genus richness mirrored this pattern, with peak richness in May and June. The taxonomic composition of pollen samples varied seasonally, reflecting underlying floral phenology, with especially strong turnover between May and June samples and between August and September samples delineating well-defined spring, summer, and fall floral resource communities. Trait analysis also revealed seasonal structure, with spring samples characterized by trees and shrubs, summer samples including a stronger presence of herbaceous “weeds”, and fall samples dominated by woody vines. Native flora predominated in spring, giving way to a preponderance of exotic flora in summer and fall. At a basic level, this yields insight into the assembly of novel urban floral resource communities, showcasing, for example, the emergence of a woody vine-dominated fall flora. At an applied level, our data can inform urban land management, such as the design of ecologically functional ornamental plantings, while also providing practical guidance to beekeepers seeking to adapt their management activities to floral resource seasonality. Methodologically, our study demonstrates the potential of the honey bee foraging assay as a powerful technique for landscape-scale floral resource surveying, provided the inherent biases of honey bee foraging are accounted for in the interpretation of the results.
Introduction
It has been estimated that roughly a third of all described insect species are either directly or indirectly dependent on flowers for food (Wardhaugh 2015). This ecological centrality of flowers-as-food extends to systems in which historic floral communities have been dramatically altered, such as urban landscapes characterized by novel assemblies of native and exotic flora (Aronson 2014). Urban landscapes can host diverse communities of flowers and flower visitors, sometimes even functioning as refugia for rare species (Baldock 2015) [though see (Geslin et al. 2013)], but little is known about the overall composition of urban floral resources and how they fluctuate phenologically through the growing season.
Studying floral resources at the landscape scale is technically daunting in any context (Frankl et al. 2005), but the challenge becomes especially acute in urban landscapes where extreme landscape heterogeneity, limited land access, and the physical obstacles of the built environment render traditional approaches to floral surveying impracticable. Flower-visiting insects might be harnessed as efficient environmental samplers of landscape-scale floral resources, provided the spatial and taxonomic scope of their interaction with the landscape is sufficiently understood and the materials they collect can be analyzed informatively (Wood et al. 2018). While not an unbiased representation of local flora, such an approach would manifestly be relevant to the question of trophic function at the flower–insect interface.
The western honey bee (Apis mellifera L.) is arguably the organism best suited for such a sampling approach. As an extreme generalist, honey bees have a diet breadth that overlaps considerably with that of many other nectar- and pollen-feeding insects (Butz Huryn 1997). A honey bee colony surveys the landscape around its nest at a range routinely extending several kilometers, dynamically allocating foraging effort to the most rewarding patches (Visscher and Seeley 1982). Thus, the materials it collects are informatively biased toward the richest resources within the colony’s foraging range and compatible with honey bee behavior and morphology (excluding, for example, flowers requiring sonication to release pollen from anthers).
Here, we present a two-year study of the floral resource composition and dynamics of Philadelphia, Pennsylvania, USA. Founded in 1682, the city of Philadelphia straddles the interface of the Appalachian Piedmont and Atlantic Coastal Plain physiographic provinces (Fenneman and Johnson 1946) and hosts over 2500 plant species (Clemants and Moore 2003). Its patchwork heterogeneity of sociological composition is both a product and driver of concomitant ecological variation in abiotic substrate and biotic communities, earning Philadelphia the apt moniker “City of Neighborhoods”. To characterize the floral resources of this complex urban environment, we employ a “honey bee foraging assay” using a network of sentinel apiaries distributed throughout the city. Combining DNA metabarcoding of pollen samples with continuous colony weight monitoring, we characterize both the taxonomic composition and overall availability of floral resources in our study system, together with their temporal dynamics throughout the foraging season. We conclude by comparing Philadelphia’s floral resource landscape with patterns described in other study systems and discussing the implications of our data for understanding the ecological function of urban flora.
Methods
Sites and study years
We conducted fieldwork in 2017 and 2018. Our study sites included 13 apiaries in Philadelphia, Pennsylvania, USA, owned and managed by the Philadelphia Bee Company (Fig. 1). Twelve apiaries were used in each study year, with 11 shared across years, 1 used only in 2017 (RB), and 1 used only in 2018 (NE). Each of our research apiaries included three honey bee colonies designated as research colonies for our study, resulting in a total of 36 research colonies across 12 sites in each year. Some sites also included additional colonies not involved in our study. Research colonies were initiated in April–May of each study year from 4- or 5-frame nucleus colonies installed in 10-frame Langstroth hives. Nucleus colonies were purchased together each year from a single supplier (Swarmbustin’ Honey, West Grove, Pennsylvania, USA). During colony installation, each research hive was equipped with a Sundance I bottom-mounted pollen trap (Ross Rounds, Canandaigua, New York, USA) and a Broodminder hive scale (Broodminder, Stoughton, Wisconsin, USA).

Weight monitoring and pattern characterization
We set our Broodminder hive scales to record hourly weights, beginning at colony installation in the spring of each year and continuing through the end of our sampling season in the fall of each year. During preliminary data visualization, we identified two types of artifacts in our weight data: apparently random misreadings, resulting in transient (single reading) weight spikes (i.e., additive outliers), and persistent weight shifts (i.e., level-shift outliers) caused by colony manipulations involving the addition or removal of material from the hive. Because our colonies were part of a working apicultural business, it was not possible to avoid or standardize these management artifacts.
To address both types of artifacts, we first took the first-order difference of each colony weight time series, which turns level-shift outliers into additive outliers. We then applied a filter to each time series where data points (i.e., weight changes between consecutive readings) with an absolute value of 2.5 kg or more were identified as outliers and set to zero. The threshold of 2.5 kg was chosen because histograms of first-order differenced time series showed consistent discontinuities around ± 2.5 kg, such that the interval −2.5 to 2.5 contained a smooth curve of ostensibly “normal” readings, while readings outside this interval were rare and extreme. We then reintegrated each time series to reconstruct de-artifacted weight curves and selected only midnight readings for downstream analysis. For the purposes of our study, we were interested principally in the temporal patterns of resource availability arising from landscape-scale floral phenology, and how these patterns might vary site to site. To isolate these signals from the noise of colony-level differences in amplitude of weight variation (an expected result of differences in colony strength), we normalized (scaled and centered) the de-artifacted weight curves for each colony. This pipeline is available in Data S1 and depicted graphically in Fig. 2. Raw data are available in Data S2.

Hives subject to known anomalies were omitted from downstream analysis. In 2017, we omitted all hives from sites CH, EG, and SP due to late colony installation. We also omitted CC hive 3 and RB hive 2 due to scale malfunctioning. In 2018, we omitted SG hive 3 and WP hive 3 due to user error (failure to replace dead batteries), all NE hives due to premature death, and SP hive 3 due to premature death. This left a total of 25 colonies across 9 sites for 2017 and 30 colonies across 11 sites for 2018.
To characterize weight patterns, we used generalized additive modeling (GAM) implemented in R (R Core Team 2019) using package mgcv (Wood 2017) to relate colony weight to time and site. Our goals were (1) to identify major gain and loss motifs, (2) to determine whether there was significant site-to-site variation in weight pattern, and (3) to determine whether there was sufficient similarity across sites to justify a global weight model. Accordingly, we constructed three nested hierarchical GAMs, following Pedersen et al. (2019). Model 1 included only a global smooth of weight by time, without site-specific smooths. Model 2 included both a global smooth and site-specific smooths. Model 3 included only site-specific smooths, without a global smooth. These models are analogous to the G, GI, and I models, respectively, of Pedersen et al. (2019). Since any effect of site or hive on group means was removed by our normalization step, we did not include site or hive random effects. After optimizing the fit of each model, we used AIC model selection to identify the most appropriate model. GAM smooths were then first-order differenced to enable the visualization of detrended gain and loss periods. R code for our GAM analysis is available in Data S1.
Pollen sampling
Pollen samples were collected at roughly monthly intervals (hereafter “sampling periods”). In 2017, sampling occurred in late May/early June (hereafter “June sampling period”), late June/early July (hereafter “July sampling period”), early/mid-August, and mid-September, for a total of 4 sampling periods. In 2018, we expanded our sampling to include early May, early June, early July, early August, early September, and early October, for a total of 6 sampling intervals. Pollen traps were activated for roughly 3–7 days at the end of each monthly period. Exact sampling dates and durations varied from site to site due to travel time and weather constraints, and a detailed sampling schedule is available in Appendix S1. For the purposes of downstream analysis, sampling dates were binned discretely by monthly sampling period. Samples were collected by scooping the contents of the pollen traps into 50-mL conical tubes. Excess pollen was removed from the trap and poured into the inside of the top box of the hive to help the colonies recover from any pollen deficit caused by our sampling and to ensure that the pollen trap would be empty in preparation for the next sampling period.
Pollen sample preparation
Pollen processing began with the creation of pooled subsamples by combining 4/n g of pollen from each hive within each site-date, where n represents the number of hives sampled in a site-date. For most samples, n was equal to 3 hives, though there were some cases where fewer than 3 hives at a given site could be sampled, either due to colony death or pollen trap malfunctioning. Each 4 g subsample (hereafter “sample”) was then suspended in 40 mL of 70% EtOH and vortexed until fully dispersed. The resulting suspension was then centrifuged (Eppendorf 5810 R, Hamburg, Germany) at 3000G for 2 min and decanted. We then resuspended the pollen sediment in 10 mL of 100% EtOH and repeated the vortexing and centrifugation steps. Lastly, the sample was air-dried in an open conical tube under a fume hood for 24 h to allow the EtOH to evaporate, yielding the final granularized pollen sample.
From each granularized sample, we transferred a 10 mg aliquot to a 2-mL beadmill tube. To each tube, we added 1 mL of either Qiagen buffer AP1 (for 2017 samples) or ultra-pure water (for 2018 samples) and approximately 0.5 mL of 1.3-mm chrome steel beads (BioSpec, BioSpec, Bartlesville, Oklahoma, USA). Samples were then subjected to 4 × 1 min run on an Omni Bead Ruptor 24 Elite (Omni International, Kennesaw, Georgia, USA) set to a speed of 6 m/s. Samples were iced between runs to prevent overheating.
For our 2017 samples, we purified beadmill lysates using a Qiagen DNeasy Plant Minikit according to the manufacturer’s instructions. In 2018, we streamlined our workflow by using Phire Direct PCR reagents (Thermo Fisher, Waltham, Massachusetts, USA) that allowed us to avoid the purification step (hence the use of water rather than buffer AP1 in beadmilling). In both methods, a 1 μL aliquot of the final product (either purified DNA extract or raw lysate) was used as PCR template.
Library preparation and sequencing
Past work has shown that using multiple loci in pollen DNA metabarcoding improves the robustness of the method, particularly with respect to quantitative reliability (Richardson et al. 2015, 2019a). For this study, we selected the plastid intron trnL and the nuclear ribosomal spacer regions ITS1 and ITS2. Each of these loci has been used individually or in combination with other loci in past applications of pollen DNA metabarcoding (Keller et al. 2015, Kraaijeveld et al. 2015, Smart et al. 2017), though ours is the first study to employ this exact combination. We chose ITS1 and ITS2 because these markers are highly divergent and offer relatively high taxonomic resolution (Chen 2010, Wang et al. 2015); trnL is far less divergent but easily amplified (Taberlet et al. 2007), and there is some evidence that single-copy plastid markers might be more quantitatively reliable than the ITS regions (Richardson et al. 2015).
Our PCR methods follow the nested protocol of Richardson et al. (2019a), which aims to minimize PCR biases introduced by interactions between template and multiplex indices (Berry et al. 2011). Briefly, PCR1 amplifies barcode target regions (trnL, ITS1, and ITS2, in separate reactions), PCR2 adds Illumina read-priming oligonucleotides, and PCR3 adds dual multiplex indices (Kozich et al. 2013, Sickel et al. 2015). For the initial amplification of our target loci (PCR1), we used the trnL-c and trnL-h primers described by Taberlet et al. (2007) and the ITS1-u1, ITS1-u2, ITS2-u3, and ITS2-u4 primers described by Cheng et al. (2016). Following PCR3, we cleaned and normalized our libraries using a SequalPrep normalization kit (Thermo Fisher), pooling libraries within markers so that they could be sequenced equimolarly. In addition to running negative controls on all gels, we also propagated 3 technical replicate negative controls for each marker through the entire nested PCR protocol and sequenced these alongside our positive samples, enabling us to detect trace contamination that did not appear on gels. We also used a 10% unsaturated matrix of multiplex tags so that critical mistagging rates could be both mitigated and measured (Esling et al. 2015, Schnell et al. 2015). For a full description of primers, PCR conditions, and negative controls, see Appendices S2, S3, S4.
Prepared libraries were sequenced using 2 × 300 Illumina MiSeq kits, with 2017 and 2018 samples sequenced separately. Sequencing was performed at the Genomics Core Facility of Pennsylvania State University. All sequencing output has been archived on NCBI Sequence Read Archive under Bioproject PRJNA548320.
Bioinformatics
Reference sequences were downloaded from NCBI GenBank on 22 February 2019 using the eDirect API (Table 1). Downloaded reference sequences were then filtered at the genus level to include only genera documented as occurring in the states of Pennsylvania, New York, New Jersey, Delaware, or Maryland based on the USDA PLANTS database (USDA and NRCS 2020). We then used MetaCurator (Richardson et al. 2019b) to trim our reference sequences to the specific regions of our amplicons and taxonomically dereplicate identical sequences.
Locus | Query |
---|---|
trnL | esearch -db nucleotide -query "Viridiplantae [ORGN] AND trnL [ALL]" | efilter -query 100:2000[SLEN] |
ITS1 | esearch -db nucleotide -query "Viridiplantae [ORGN] AND (its1 OR internal transcribed spacer [ALL])" | efilter -query 100:2000[SLEN] |
ITS2 | esearch -db nucleotide -query "Viridiplantae [ORGN] AND (its2 OR internal transcribed spacer [ALL])" | efilter -query 100:2000[SLEN] |
MiSeq reads were first mate-paired using PEAR (Zhang et al. 2014) and converted from FASTQ to FASTA format using the FastX Toolkit (A. Gordon et al., unpublished). Mate-paired reads were then aligned to our curated reference sequences using the usearch_global algorithm in VSEARCH (Rognes et al. 2016), with a minimum query coverage of 80% and a minimum identity of 75%. Only the top hit for each query was retained for downstream analysis.
After alignments were made, we performed further processing in R (R Core Team 2019). First, alignments were subjected to more stringent identity thresholds: 97% for trnL and 95% for ITS1 and ITS2. Then, subject sequence accession numbers were joined with taxonomic lineages using the R package taxonomizr (Sherrill-Mix 2019). Using the grouping and summarizing utilities of the R package dplyr (Wickham et al. 2019), we tallied, for each marker, the number of reads by genus and sample (i.e., site-date) and expressed each genus tally as a proportion by dividing its read count by the total number of reads for the sample to which it belonged. Genera comprising less than 0.05% of the total read count for a given sample were discarded to avoid low-abundance false positives. We then used the table joining utilities of dplyr (Wickham et al. 2019) to merge the genus tallies across our three markers, and genera detected by only one of the three markers were discarded as potential false positives. Finally, we calculated the median proportional abundance for each genus across all markers by which it was detected and rescaled the median proportional composition of each sample so that it totaled to one. In comparisons with pollen microscopy, this approach has been shown to yield quantitatively informative results (Richardson et al. 2019a). R script for this workflow is available from the authors upon request.
Taxonomic analysis
Prior to taxonomic analysis, we excluded any samples in which one of the three libraries was represented by fewer than 1000 reads to avoid making inferences based on insufficient sequencing depth. In 2017, all 44 libraries were adequately sequenced. In 2018, we dropped 6 of 71 libraries (8%) due to under-sequencing, including June samples for sites CH, EG, and NE, September samples for FF and NE and the October sample for WP.
Using the proportional abundance data described above, we characterized the taxonomic composition of pollen samples across sites, sampling periods, and years using principal coordinates analysis (PCoA) based on the Jaccard dissimilarity metric and implemented with the R package vegan (Oksanen 2019). Proportional abundances were square-root-transformed prior to ordination, achieving the equivalent of a Hellinger transformation (Legendre and Gallagher 2001) on raw community data. Differences across years, sampling periods, and sites were evaluated by permutation tests using vegan’s adonis function (Oksanen 2019). We also analyzed sample genus richness as a function of sampling period. Differences in richness across periods were tested by ANOVA and pairwise differences were analyzed with Tukey’s post hoc test. R script for this workflow is available in Data S1.
Trait-based analysis
To explore the relationship between sampled relative abundance and genus traits, we classified each genus detected in our samples on the basis of growth habit (tree/shrub, vine, herb), life cycle (perennial, biennial, annual), and native status (native or exotic to contiguous United States). Trait data were obtained by downloading from the USDA PLANTS Database (USDA, NRCS 2020) all species records for Philadelphia, PA, along with annotations for “Growth Habit,” “Duration,” and “Native Status.” We opted for a narrower geographical scope for trait analysis than we did for reference sequence curation because the addition of congeners not likely to be present in our local study system would increase intrageneric trait heterogeneity, resulting in more ambiguous trait assignments. Genera found in our pollen data but absent from the USDA PLANTS Database Philadelphia records were manually added to the processed dataset by searching the genera in the USDA PLANTS Database without geographical constraints and selecting species with distributions including Pennsylvania, Delaware, New Jersey, Maryland, or New York. Using tidyverse (Wickham 2017) functions in R, we filtered the resulting species list by the genera detected in our pollen samples and summarized each trait for each genus by concatenating unique trait values. Where trait values were not uniform across species within a given genus, the trait was considered “undetermined”. For growth habit, we combined “Tree” and “Shrub” classes into “tree/shrub” and the “Forb/Herb” and “Graminoid” classes in to “herb”. Raw data, derived trait tables, and R script are available in Data S1 and S2.
Results
Weight dynamics
Weight dynamics were characterized by distinct gain and loss motifs that were largely consistent across years, with periods of strong weight gain in May and June, weak gain or weak loss in July, strong loss in August, weak to strong gain in early September, and strong loss from the second week of September to the end of data collection in October (Fig. 3). For both years of our study, HGAM analysis with AIC model selection supported the use of both global and site-specific smoothers (Model 2), indicating both a site effect and an archetypal pattern of weight dynamics for our study system (Table 2). Site-specific weight curves including integrated data (i.e., not subject to first-order differencing) can be generated using scripts and data in Data S1 and S2.

Model by year | df | AIC | ΔAIC |
---|---|---|---|
2017 | |||
Model 2 (global + site) | 89 | 4552 | – |
Model 3 (site) | 104 | 4569 | 17 |
Model 1 (global) | 14 | 5665 | 1113 |
2018 | |||
Model 2 (global + site) | 190 | 2045 | – |
Model 3 (site) | 245 | 2098 | 53 |
Model 1 (global) | 24 | 5992 | 3948 |
Pollen sample composition
We detected a total of 119 and 139 genera in 2017 and 2018, respectively (tabulated data available in Data S2). Pollen diversity (genus richness) differed significantly across sampling periods (2017: F = 6.5, P < 0.001; 2018: F = 16.3, P = 0.001), with June having significantly higher diversity (Tukey test, P < 0.05) than August and September in 2017 and May and June having significantly higher diversity (Tukey test, P < 0.05) than all other months in 2018 (Fig. 4). A summary of sequencing output by sample is available in Data S2.

PCoA revealed strong differentiation across sampling periods in overall pollen composition (Fig. 5). The greatest discontinuities were seen across the May–June and August–September intervals. Permutation testing indicated that site (F = 1.5, R2 = 0.09, P = 0.004), year (F = 5.3, R2 = 0.03, P = 0.001), and sampling period (F = 16.9, R2 = 0.43, P = 0.001) were all significant predictors of pollen sample composition, but sampling period explained by far the most variance.

May samples (collected only in 2018) (Fig. 6) were dominated by wind-pollinated tree genera, including willow (Salix), maple (Acer), oak (Quercus), ash (Fraxinus), and plane tree (Platanus). Apple (Malus) was also abundant. The ornamental shrub Christmas berry (Photinia villosa is the only member of its genus present in our study system) dominated the composition of a single site (CC).

Between May and June, a strong phenological shift is evident in the floral community, marked by a transient lull in weight gain (Fig. 3) and a near-total turnover of pollen composition apparent both in the raw data and in the wide discontinuity between May and June samples in PCoA ordination (Fig. 5). Following this shift, June pollen samples (Figs. 6, 7) were characterized by smaller trees and herbaceous flora associated with forest edge, mid-successional, or open landscapes, including hawthorn (Crataegus), honey locust (Gleditsia), sumac (Rhus), clover (Trifolium), and sweet clover (Melilotus), along with magnolia trees (Magnolia) that are common in our study system as ornamental plantings. False indigo (Amorpha), a shrub that grows abundantly along the banks of the Schuylkill River, was also abundant in some samples in 2017.

In July (Figs. 6, 7), both weight gain and genus richness were attenuated (Figs. 3, 4), and pollen composition became dominated by clover (Trifolium) and Virginia creeper (Parthenocissus), accompanied by a strong presence of sweetclover (Melilotus) in 2017 and ornamental crepe myrtle (Lagerstroemia) in 2018. Mullein (Verbascum) and magnolia (Magnolia) were also important pollen sources, particularly in 2017.
During the apparently sparse month of August (Figs. 3, 6, 7), the abundance of clover (Trifolium) pollen declined but sweetclover (Melilotus) remained abundant. Spikenard (most likely the invasive understory tree, Aralia elata) emerged as a major pollen source. Crepe myrtle (Lagerstroemia) continued to be prominent alongside another ornamental species, Japanese pagoda tree (Styphnolobium japonicum is the only member of its genus present in our study area), which is commonly planted as a street tree in our study system and has become naturalized in some forested areas. The August–September interval saw another strong discontinuity in pollen composition (Fig. 5).
In September (Figs. 6, 7), clematis (most likely sweet autumn clematis, Clematis ternifolia) and ivy (Hedera helix and/or H. hibernica) dominated pollen samples, accompanied in 2017 by a strong presence of elm (the Chinese elm, Ulmus parviflora, is the only member of its genus that flowers in September in our study system). In 2017, crepe myrtle (Lagerstroemia) remained a major resource. Boneset (Eupatorium) was found across most sites in both years but was especially abundant in 2017.
At the close of the foraging season in October (sampled only in 2018) (Fig. 7), pollen samples from all sites consisted mainly of ivy (H. helix and/or H. hibernica). Late-season asters were also common, though, including American asters (Symphyotrichum), snakeroot (Ageratina), Artemisia (likely A. vulgaris, common mugwort), and goldenrod (Solidago), and clematis remained important at some sites.
Trait analysis
Trait-based patterns were generally consistent across the two years of our study (Fig. 8). With respect to growth habit, we saw a predominance of trees and shrubs early in the season, giving way to herbs in mid-summer, then returning to trees and shrubs in August, and finally shifting to vines in September and October. In terms of life cycle, the contribution of perennials appears to have dwarfed that of biennials and annuals throughout the whole foraging season, with the caveat that some sampling periods had a large proportion of genera with undetermined life cycle due to trait non-uniformity among congeners. Native genera ostensibly comprised the majority of pollen in May and June while exotic genera predominated in July, August, September, and October. These patterns should be interpreted cautiously, though, since the proportion of undetermined genera was high for most sampling periods, and in some cases, abundant genera that could represent both native and exotic species (e.g., Aralia, Clematis, Phragmites) likely consisted mainly of their exotic constituents (e.g., A. elata, C. ternifolia, P. australis).

Discussion
Continuous colony weight monitoring revealed strong gain and loss motifs. We interpret these dynamics as a measure of the overall abundance of floral resources in our study area, with the caveat that colony weight is affected by overall health and honey bee life history in addition to resource availability. We mitigated these potentially confounding effects by omitting colonies with health anomalies, standardizing colony starting conditions, and analyzing detrended weight dynamics (i.e., differenced time series) rather than absolute weight. Under this interpretation, spring in our study system is a time of plenty, characterized by two pulses of resource availability, the first beginning in mid-May and the second in mid-June. Resource availability tapers off through July and reaches a minimum in August, a period during which all colonies in our study lost weight. After a brief pulse of resource availability in early September, the growing season is effectively over, with minimal resource availability from mid-September to the end of our measuring period in mid-October. These results—especially the occurrence of a late-summer/early-fall dearth of floral resources—are broadly consistent with temporal patterns of floral resource availability documented in England (Couvillon et al. 2014), France (Requier et al. 2015), Denmark (Lecocq et al. 2015), Germany (Danner et al. 2017), and the Midwestern United States (Wood et al. 2018). The notion of a “summer dearth” is also a ubiquitous anecdote among beekeepers in temperate ecoregions [e.g., Japan: (Hiratsuka 1920), Ireland (Morony 1906), the United States (Demuth 1918)]. That such similar results should be found in such geographically disparate study systems might be explained partly by the fact that some of the most important honey bee foraging resources are cosmopolitan synanthropes (e.g., Trifolium, Melilotus, Hedera), but the resource dynamics we observed seem to be characteristic of temperate floral communities even across systems with little overlap in major flora. That our Philadelphia study system comports with this pattern suggests that urban landscapes, despite the extreme modification of their floral communities, conform to archetypal patterns of floral phenology that are shaped more by climate than by the particulars of individual landscapes.
One of the most salient patterns in our data is the importance of woody perennials, including both trees/shrubs and woody vines like Hedera, Parthenocissus, and Clematis. These findings contribute to a growing consensus about the importance of woody plants as pollinator resources (Mach and Potter 2018, Donkersley 2019), perhaps especially in urban environments (Burgett 1978, Wattles 2017, Lau et al. 2019). In May, the most important resources included wind-pollinated trees like Salix, Acer, Quercus, Fraxinus, and Platanus, reinforcing Saunders’ (2018) call for increased attention to be given to wind-pollinated plants as resources for pollinators. Later in the season, ornamental trees and shrubs like Lagerstroemia and Styphnolobium, along with adventitious shrubs and woody vines like Aralia, Parthenocissus, Clematis, and Hedera, account for the majority of pollen foraging. Only in July did herbaceous plants consistently account for the majority of pollen foraging, and this was attributable mostly to a single dominant genus (Trifolium). These findings highlight the importance of accounting for trees and other woody plants in studies of plant–pollinator ecology, particularly in urban areas where these resources seem to be of prime importance, despite the technical challenges of canopy sampling that have prompted some authors to focus exclusively on low vegetation (Matteson et al. 2013, Lowenstein et al. 2019).
The strongest discontinuities in the composition of our pollen samples, particularly evident in clustering patterns seen in PCoA ordination (Fig. 5), occurred between the May and June sampling periods and between the August and September sampling periods. As observed by Robertson (Robertson 1895), the late-spring discontinuity reflects the closure of the tree canopy, which marks the end of flowering for most tree and understory species, with a corresponding shift of pollinator visitation to edge and open habitats. The late-summer discontinuity in our data probably reflects the effective (though not absolute) end of Trifolium and Melilotus bloom (which may also account for the observed weight loss of colonies in August) and the shift to the fall flora dominated by Clematis and Hedera.
An unexpected—and perhaps characteristically urban—pattern in our data is the overwhelming abundance of woody vines in summer and fall pollen samples, beginning in July with Parthenocissus and peaking in September and October with Clematis and Hedera. In our study region, Parthenocissus is represented by both native P. quinquefolia and exotic P. tricuspidata (USDA, NRCS 2020). The only species of Clematis abundant in our study system that flowers in the fall is exotic C. ternifolia (D. Sponsler, personal observation). Hedera is represented by H. helix and H. hibernica, both exotic (USDA, NRCS 2020). With the exception of the native P. quinquefolia, all these vines were introduced to North America as ornamentals and have since become widely naturalized in urban and suburban areas, where they thrive ubiquitously in both seminatural and developed areas. It is puzzling, in light of our findings, that neither Clematis nor Hedera is mentioned in classic surveys of North American pollinator-attractive flora, and Parthenocissus is deemed at best a secondary nectar source of regional importance (Lovell 1918, Pellett 1920, Lieux 1972, Ayers and Harman 1992). This raises the question of whether the dominance of these vines as summer and fall floral resources was simply overlooked in the past—reflecting the general neglect of urban landscapes in 20th-century literature (Pickett et al. 2016)—or whether it is the result of more recent invasive success. In either case, if these taxa dominate flower–insect interactions in other cities to the extent observed here, it would be a striking case of urban biotic homogenization (McKinney 2006) propagated from the level of community composition to that of interaction network structure. Indeed, at least in the case of Hedera, its comparable importance as a bee plant in our study system and in Brighton, UK (Garbuzov and Ratnieks 2014), together with its widespread distribution in North American cities and suburbs (Jones and Reichard 2009), suggests that its role as a homogenizer of urban ecosystems may already be well established. It should also be noted that Clematis and Hedera dominate the composition of our pollen samples at a time of year traditionally associated with the flowering of native asters (e.g., Ageratina, Cichorium, Eupatorium, Eutrochium, Solidago, Symphyotrichum) (Lovell 1918, Pellett 1920, Ayers and Harman 1992) and still dominated by native asters in rural areas (Sponsler et al. 2017, Wood et al. 2018). While detected in our samples, these native flora were dwarfed in abundance by Clematis and Hedera.
The special importance of the exotic vines in our data reflects a larger pattern in which exotic plants play a prominent role in the floral resource community of our study system. With the caveat that our genus trait analysis was hampered by trait non-uniformity in many genera, our data suggest a seasonal structure to the relative importance of native and exotic flora (Fig. 8), closely mirroring the findings of Wood et al. (2018) in Michigan, USA. Native flora predominate early in the season, corresponding to the flowering of native trees and shrubs. In July and August, a season of overall dearth in floral resources (Fig. 3), exotic plants predominate. This pattern could be interpreted not as a displacement of native floral resources but a compensation for a period of natural scarcity. The ornamental trees Lagerstroemia and Styphnolobium, absent from most previous surveys of North American floral resources (Lovell 1918, Pellett 1920, Ayers and Harman 1992; though see Lau et al. 2019), deserve special mention since they are not generally regarded as invasive. Moreover, the evident function of these ornamental trees as resources for flower-visiting insects suggests a caveat to Quigley’s (2011) dismissal of designed greenspaces as “Potemkin gardens” devoid of ecological function. While the annual ornamentals Quigley probably had in mind appear, indeed, to be of negligible importance as pollen resources (Fig. 8), their woody counterparts (e.g., Lagerstoemia, Styphnolobium, Ligustrum, Hydrangea, Photinia, Magnolia, Phellodendron) can be major pollen resources. Notably, while annuals were virtually absent from our pollen samples, there is some evidence that annual ornamentals can be significant late-season nectar sources (Erickson et al. 2019). As discussed above, however, the predominance of exotic flora—specifically Clematis and Hedera—continues through September and October, a period traditionally associated with a pulse of floral resources driven by native asters (e.g., Solidago, Symphyotrichum) (Lovell 1918, Pellett 1920, Ayers and Harman 1992). The eclipse of these flora by Clematis and Hedera can be interpreted as a consequence of urban environmental filtering (Aronson 2016), with asters being suppressed by urban disturbance regimes (e.g., mowing) and exotic woody vines being favored both by the ample substrate of the built environment and by the high propagule pressure stemming from human cultivation.
The relationship between colony weight dynamics and pollen sample composition requires careful interpretation. First, any relationship between our weight data and pollen data is partly obscured by the fact that our weight data are continuous but our pollen data represent only a several-day interval per sampling period. This is a limitation that could theoretically be overcome by higher sampling frequency. A more fundamental constraint on the joint interpretation of weight and pollen data is that honey bee colony weight is driven mainly by changes in stored honey (i.e., the product of nectar foraging), but pollen trap samples represent only pollen foraging. For taxa thought to be major sources of both pollen and nectar (e.g., Trifolium, Melilotus) (Ayers and Harman 1992), relative abundance in pollen samples may approximate simultaneous contribution to nectar foraging. Other taxa, such as the wind-pollinated trees that dominate our May samples, produce copious pollen but little or no nectar (Ayers and Harman 1992). Conversely, taxa such as knotweed (Reynoutria) are prolific nectar sources but not thought to contribute significantly to pollen foraging (Ayers and Harman 1992). Thus, we recommend the conservative approach of interpreting pollen samples as qualitative indicators of flora in bloom during contemporaneous patterns of weight dynamics, with further interpretation based on ancillary data or expert knowledge of the study system.
While the focus of our study is on the use of honey bees as a model generalist forager, our data have practical relevance for beekeepers in study region. Colony weight dynamics can inform hive inspection regimes and allow beekeepers to plan management interventions such as “supering” (the addition of boxes to accommodate colony growth), honey harvest, supplemental feeding, or colony splitting (Gary 1992). This of particular importance to urban beekeepers who often contend with difficult access to their apiaries (Alton and Ratnieks 2016) and must schedule their management judiciously. It is also important to note times of dearth so that inter-colony robbing can be mitigated, for example, through the use of entrance reducers and the minimizing of open-hive manipulations (Gary 1992). The clear demarcation of spring, summer, and fall floral communities in our data is also of apicultural significance. Traditionally, apiarists in our study region recognize spring and fall nectar flows, and honey is often marketed in terms of these seasonal varieties. Our data show a more nuanced, three-season pattern of nectar flows. Knowing this pattern would potentially allow beekeepers to harvest a third seasonal honey variety, a significant asset to urban beekeepers who lack access to single-species cropping needed to market floral varietal honeys.
Conclusions and future research
Frankl et al. (2005) pose the question of whether the landscape-scale quantification of floral resources is possible. While there may be multiple ways to answer this question in the affirmative, including the approach presented by its original authors (Frankl et al. 2005), we submit that our “honey bee foraging assay”—the combination of continuous weight monitoring and pollen metabarcoding—constitutes a landscape-scale quantification of floral resources. We acknowledge that honey bee foraging is not an unbiased sampling of floral resources, and we do not interpret our results as an objective census of the floral community. Nevertheless, an objective census of floral resources would arguably be less relevant to questions of ecological function than the “biased” sampling of a generalist florivore.
The most serious limitation of our approach is the imperfect representation of the total insect florivore community by honey bee proxy. Honey bees have the broadest described diet breadth of any single flower-visiting insect species, often visiting a hundred or more floral species in any given region, but their foraging activity in any particular time and place tends to be focused on relatively few species (Butz Huryn 1997), with a preference for those occurring at high abundance (Percival 1947, Leonhardt and Blüthgen 2012). Studies that focus on the dietary overlap between honey bees and other bee species vary widely in their findings, with percent overlap ranging from 19% (Leonhardt and Blüthgen 2012) to 97% (Paini and Roberts 2005) when comparing honey bees to individual non-Apis bee species and from 33% (de Menezes Pedro and De Camargo 1991) to 45% (Steffan-Dewenter and Tscharntke 2000) when comparing honey bees to an aggregated non-Apis bee community. We are unaware of any studies focused on comparing honey bee foraging to that of non-bee florivores, though such data could be extracted from comprehensive plant–pollinator network studies (e.g., Chacoff et al. 2012). These uncertainties notwithstanding, we maintain that the honey bee’s dietary generalism, long foraging range, and amenability to standardized sampling techniques warrant its preliminary use as a sampler of landscape-scale floral resources, and the results of the methods we employ can provide an informative baseline against which to compare the floral resource use by other florivores. In principle, though, the foraging assay approach to landscape-scale floral resource surveying could be implemented with model organisms other than the honey bee, and we urge the development of such methods. Bumble bees (Bombus spp.), in particular, are amenable to both pollen barcoding (Pornon 2016) and colony weight monitoring (Goulson et al. 2002), and their choice of floral resources would be expected to differ from that of honey bees, though with substantial overlap (Leonhardt and Blüthgen 2012).
The use of continuous colony weight monitoring to infer resource availability is a field in its infancy. While it is self-evident that changes in a colony’s weight reflect the net flux of material between colony and environment, there is no established proportionality between “true” resource availability (however one chooses to define this) and a given change of weight in a sentinel colony. With this limitation in mind, we are conservative in our interpretation of colony weight dynamics, focusing on the direction of weight change (i.e., positive or negative) rather than its absolute magnitude. We encourage future work aimed at clarifying the relationship between the linked dynamics of resource availability and colony weight. Furthermore, the biological interpretation of the resource fluctuations inferred from colony weight patterns must vary with the life history of the focal taxa in question. A perennial organism, like the honey bee colonies in our study, cannot “escape in time” from periods of scarcity. The same would be true of an annual organism with a long active season, such as a bumble bee colony. In contrast, organisms with relatively short active seasons—a category that includes the vast majority of flower-visiting insect species—depend only on the availability of resources during select periods. An analogous point can be made with respect to the spatial scale of an organism’s activity. A landscape that is, on the whole, resource poor from the perspective of a foraging honey bee colony, could contain a small patch of rich resources that could suffice for the provisioning of insects with small foraging ranges able to subsist within the resource patch.
Finally, it is not incidental that our methodology was implemented in a complex urban landscape. With respect to the ecology and conservation of plant–pollinator communities, the question is no longer whether cities can support such communities (Baldock 2015) but rather how they are structured by the heterogeneous habitats of which cities are composed (Baldock 2019). Our work begins to shed light on this question, revealing the emergence of weedy vines as dominant floral resources, the seasonal structuring of the relative importance of trait-based floral guilds (e.g., native vs. exotic, woody vs. herbaceous), and the dramatic temporal dynamics of overall floral resource availability. While we urge caution in extrapolating our findings beyond our Philadelphia study system, the general consistency of our findings with comparable datasets (Garbuzov and Ratnieks 2014, Couvillon et al. 2014, Requier et al. 2015, Danner et al. 2017, Wood et al. 2018, Lau et al. 2019) suggests that the patterns seen in our data are shaped by ecological and social drivers that are not unique to our locality. Future work should focus on elucidating these ecological drivers and formulating testable hypotheses regarding the relationships between landscape, floral resources, and florivores.
Acknowledgments
T. Jones, D. Brough, K. Oxman assisted and field and laboratory work. K. Frank, K. Wattles and members of the Philadelphia Beekeepers Guild and the Philadelphia Botanical Club provided informative conversation regarding local floral resources. M. McIntyre, Weavers Way Coop, Congregation Rodeph Shalom, Hotel Sofitel, Shane Confectionery, Greensgrow Farm, The Philadelphia Insectarium, the Philadelphia Business and Technology Center, Mt. Moriah Cemetery, Share Food Program, W. B. Saul high school, Paradiso Restaurant, and A. Pfeffer provided apiary locations for this study. This work was funded by a USDA-NIFA postdoctoral fellowship for D. Sponsler (grant 2017-07141), a Penn State Apes Valentes research award to D. Sponsler, and USDA–NIFA– SCRI grant 2016–51181–235399 to C. Grozinger.