Have a personal or library account? Click to login
iNaturalist Users Exhibit Distinct Spatiotemporal Sampling Preferences, with Implications for Biodiversity Science and Project Planning Cover

iNaturalist Users Exhibit Distinct Spatiotemporal Sampling Preferences, with Implications for Biodiversity Science and Project Planning

Open Access
|Jan 2026

Full Article

Introduction

Digital biodiversity-focused contributory science platforms—projects within the broader field of citizen science in which participants primarily contribute to data collection—contribute vast quantities of data, enabling novel research at broad spatial, temporal, and taxonomic scales. Such platforms encourage volunteer participants to share observations and engage with a larger community, incentivizing data collection by providing information about the natural world (Campbell et al. 2023). Initiatives vary in structure, ranging from projects with predefined protocols, target taxa, and specific site locations, to more-flexible, opportunistic models that allow contribution with fewer constraints. While flexible approaches broaden participation, they also shape the distribution of observations, which may consequently reflect both social as well as ecological processes (Carlen et al. 2024). Observers upload records in accessible areas, at convenient times, and of taxa of personal interest. Thus, observations often exhibit biases such as a disproportionate focus on developed areas (Di Cecco et al. 2021), protected land (Botts et al. 2011), higher income areas (Estien et al. 2024), and weekends (Courter et al. 2013). These biases are important for both researchers—who must define and account for them to ensure sound ecological inferences—and project planners, who can strategically work to increase data coverage and close gaps in underrepresented areas.

iNaturalist (www.inaturalist.org), one of the most widely used and flexible platforms, relies on opportunistic sampling, allowing participants to make observations anytime, anywhere, and of any taxon. Identification support, through computer vision and volunteer curation, enables participation independent of previous biological training or background. By allowing flexibility in both observation and identification, iNaturalist supports large-scale data collection and broad participation, amassing more than 150 million research-grade observations to date, along with a rapidly growing observer base of more than 3.5 million (iNaturalist 2025). Scientists leverage these records for a wide range of applications, unlocking insights into species interactions (Pernat et al. 2024), the timing of key biological events (Barve et al. 2020), range shifts (Fourcade 2016), phenotypic variation (Davis et al. 2022), and response to environmental change (Callaghan et al. 2020). Simultaneously, community-led data collection fosters increased participation in policy decisions (Callaghan et al. 2025; Overdevest et al. 2004), connection with the natural world (Pocock et al. 2023a), and trust between community members and scientists (Bedessem et al. 2021).

Where, when, and how often a participant makes observations are likely shaped by their motivations (e.g., conservation, personal species lists, learning, coursework), abilities (e.g., financial resources, travel, time), and background (e.g., where they live, education). Likely due to these diverse motivations and factors (Rotman et al. 2014; West et al. 2021), individual observers may exhibit vastly different sampling patterns, differentially contribute to biases, and fill distinct data gaps. While general biases in the structure of observations have been described (Courter et al. 2013; Estien et al. 2024; Di Cecco et al. 2021), much less is known about how these biases are produced by the underlying activity of different users. Understanding the processes that govern which data are generated where and by whom is imperative for encouraging increased engagement, identifying and compensating for underlying biases, and filling gaps in under-sampled areas.

A core aspect of user activity is participation level, which ranges from users who submit a handful of observations to those who contribute hundreds of thousands. As in many contributory science initiatives, a small group of highly-dedicated users produce the majority of observations and generate a large amount of data disproportionate to their number (Di Cecco et al. 2021; Rowley et al. 2019; Wood et al. 2011). While these users are credited with generating the majority of contributory science data, users who make lower-volume contributions represent a far larger number of individuals and collectively contribute considerable data (Boakes et al. 2016). Previous work has found that increased participation levels are associated with greater motivation to contribute to conservation and research (Larson et al. 2020; Lowe et al. 2025). The distinction between highly active and casual users is an important behavioral axis that differs based on motivations, abilities, and backgrounds. These differences provide a useful contrast to test how intensity of engagement shapes where and when biodiversity is recorded, with important implications for downstream data use.

Users also vary in where they make observations. While some primarily observe close to home, others contribute while traveling or even travel with the specific goal of observing. In Hawaii, a location with unique tourism dynamics, residents have very different observation patterns than do visitors: Residents tend to be short-term observers and make more observations in developed areas, non-protected areas, and near roads (Dimson and Gillespie 2023). While we expect these sampling dynamics likely differ across regions with different social and land contexts, these questions remain untested. This distinction between locals and travelers represents a second key behavioral axis, reflecting differences in familiarity with the environment (Moon et al. 2024) and the purposes of observation. These differences provide a useful contrast to test how familiarity with the local environment versus tourism-oriented sampling shapes where and when biodiversity is recorded, with important implications for downstream data use.

Here, we examine how spatial and temporal sampling patterns vary across iNaturalist users in the southeastern United States. As iNaturalist does not collect social information from participants, we infer insights into behavior and motivation from their contribution patterns. We categorized users along two dimensions: participation level (highly active versus casual) and traveler status (local versus traveler). We examined how these groups differentially sample spatially, including across land cover categories, urban and rural areas, protected land, urban parks, and low-income neighborhoods; and temporally, focusing on weekends versus weekdays (Supplemental file 1: Table 1). Directly linking user dimensions to sampling patterns and biases will help inform efforts to improve data collection coverage and better quantify the processes that underlie sampling.

Methods

Downloading and filtering observation data

Using custom code derived from the R package rinat (Barve and Hart 2022), we downloaded all verifiable iNaturalist observations (observed by December 31st, 2023 and uploaded by October 12th, 2024) in the focal region and removed those with obscured coordinates (Figure 1a). We focused on the southeastern USA, a biodiversity hotspot (Noss et al. 2015) that draws over 140 million tourists annually (Visit Florida Research 2025), contains areas with some of the nation’s highest poverty rates (Baker 2020), and has extensive private land ownership (Butler and Wear 2013). Core analyses were conducted in R version 4.2.3 (R Core Team 2023), and we used the following R packages: tidyverse (Wickham et al. 2019), sf (Pebesma and Bivand 2023), terra (Hijmans 2024), nnet (Venables and Ripley 2002), and data.table (Barrett et al. 2025).

cstp-11-1-868-g1.png
Figure 1

a)Map of southeastern USA focal region. b) The cumulative percent of all focal observations by percent of users contributing these observations. c) Proportion of users in each of the defined user groups, and proportion of observations made by each group.

Associating observations with land context

We associated each observation from the focal region with land context categories: land cover, protected areas, urban areas, urban parks, and low-income urban neighborhoods. We established land cover categories following methods described in Callaghan et al. (2019), aggregating land cover classes from the National Land Cover Database (USGS 2023) into urban green area, natural green area, agriculture, open-urban, low-intensity developed, and medium/high-intensity developed. Urban green area and natural green area were differentiated using the 2020 TIGER/Line Urban Areas boundaries (USCB 2020).

We designated land protection status using the “Fee” feature class of the Protected Areas Database (USGS 2022). Urban areas were designated using 2020 TIGER/Line Urban Areas boundaries (USCB 2020). We used the ParkServe dataset to designate which urban observations were made in parks versus neighborhoods (TPL 2024). Finally, we used low-income census tract polygons from the American Community Survey (USCB 2022) to specify urban observations made in low-income areas.

General sampling biases in the focal region

To quantify overall spatial sampling biases, we first defined the proportion of observations expected in each land class based on area. We calculated the proportion of land within the focal region (for land cover, protected areas, and urban areas) and within focal urban areas (for urban parks and low-income neighborhoods) that met each land context categorization. We then calculated the actual proportion of observations made within each land class and compared these values with the expected proportions.

To quantify weekend versus weekday bias, we calculated the expected proportion of observations on weekends versus weekdays and determined whether each observation in the focal region was made on a weekend or weekday using the wday() function in the R package lubridate (Grolemund and Wickham 2011). We calculated the proportion of observations made on weekends versus weekdays and compared these values with the expected proportions. Weekend versus weekday bias was included here because it can strongly impact phenological analyses, which are often conducted using contributory science data (Courter et al. 2013).

Assigning user categories

We compiled a list of users whose observations were included in the primary dataset. To assemble a complete dataset of global observations made by these users, we combined previously downloaded observations from Campbell et al. (2023) with additional data directly downloaded from iNaturalist, again using custom code based on the R package rinat (Barve and Hart 2022). We then counted each user’s total number of verifiable global observations as of October 12, 2024 and assigned each user a percentile relative to others. The top 5% of observers by number of observations were classified as highly active and all others were classified as casual. We determined traveler status by calculating the proportion of each user’s observations made in the southeastern USA relative to their global observation count. We classified users who had made half or more observations in the focal region as locals and those who made less than half as travelers.

This method relies on the assumption that observers make the majority of their observations in the immediate area in which they reside. Although there may be edge-cases in which observers live near the study region boundary or moved their primary residence during the study period, the large number of active observers included (283,392) ensures that such cases are unlikely to influence overall patterns. Misclassifications would be most likely to reduce the observed differences between groups, making our results somewhat conservative.

Assessing how participation level and traveler status shape sampling

Quality grade

We calculated the proportion of observations in each quality grade (casual, needs ID, research grade) for casual locals, casual travelers, highly active locals, and highly active travelers and compared these values with the expected proportions. We then conducted a multinomial logistic regression to estimate the likelihood of an observation belonging to one of three quality grades based on user participation group, traveler status, and their interaction. Our aim was to generate exploratory models that examined potentially interactive relationships between user activity patterns and quality grade, without explicit a priori hypotheses (Tredennick et al. 2021). We fit four models, the first including participation group, traveler status, and their interaction; the second including participation group and traveler status; the third including just participation group; and the fourth including just traveler status. We then compared models and selected the top model using the Akaike Information Criterion (AIC).

Land context

We calculated the proportion of observations made within each land context class for casual locals, casual travelers, highly active locals, and highly active travelers and compared these values with the expected proportions based on land area. We then fit a set of logistic regression models predicting the likelihood that an observation was made in a given land category with user participation group, traveler status, and their interactions as predictors. We fit four competing models for each spatial variable, and selected the top model using the exploratory modeling process above. We used a multinomial logistic regression model to predict land cover class, and binomial logistic regression models to predict protected land status, urban versus rural areas, urban parks versus urban neighborhoods, and low- or not low-income urban areas. Odds ratios reported in Results reflect estimates derived from models, holding one variable constant to isolate the effect of the other.

Weekends versus weekdays

We calculated the proportion of observations made on weekends versus weekdays for casual locals, casual travelers, highly active locals, and highly active travelers and compared these values with the expected proportions. We then fit a binomial logistic regression model, using the same exploratory model selection process described above, to test whether the likelihood of an observation occurring on a weekend or weekday is influenced by user participation group, traveler status, and/or their interaction.

Results

iNaturalist records

We downloaded a total of 8,897,124 observations. After removing records with obscured coordinates, 7,533,946 remained in the focal region, contributed by 283,392 unique users. We downloaded and compiled 58,435,335 observations made globally by these users prior to October 12, 2024.

General sampling biases in the focal region

We found that observations in five land cover classes were overrepresented in the focal region: low-intensity developed areas (20.7% observed, 4.6% expected), medium/high-intensity developed areas (16.4% observed, 2.3% expected), open urban areas (16.3% observed, 7.1% expected), and urban green spaces (9.2% observed, 2.2% expected) (Supplemental file 2: Table 2). Agricultural areas (5.5% observed, 20.2% expected) and natural green areas (31.9% observed, 63.6% expected) were underrepresented.

Protected land (39.4% observed, 14.4% expected) and urban areas (46.8% observed, 8.9% expected) were overrepresented, while unprotected land (60.6% observed, 85.6% expected) and rural areas (53.2% observed, 91.1% expected) were underrepresented. Within urban areas, urban parks (19.3% observed, 2.5% expected) were overrepresented, while urban neighborhood areas (80.7% observed, 97.5% expected) were underrepresented. Higher-income census tracts (68.3% observed, 67.3% expected) within urban areas were slightly overrepresented, and low-income census tracts (31.7% observed, 32.7% expected) slightly underrepresented. Weekends (35% observed, 28.6% expected) were overrepresented and weekdays (65% observed, 71.4% expected) underrepresented.

Assigning user categories

Of the 283,392 users, we defined the most active 5% as highly active and the remaining 95% as casual based on their number of global observations (Figure 1c). Highly active users made an average of 3,419 total observations globally (median: 1,223, min: 494, max: 276,674, sd: 7,855.2), while casual users made an average of 37 total observations globally (median: 9, min: 1, max: 493, sd: 73.5). Highly active users were responsible for 59.7% of observations in the focal region and casual users 40.3% (Figure 1b). 61.6% of users were categorized as locals and 38.4% as travelers. Locals made 62.9% of all observations in the focal region and travelers made 37.1%. At the intersection of participation level and traveler status, 61.1% users were classified as casual locals, 33.9% as casual travelers, 0.5% as highly active locals, and 4.5% as highly active travelers. Highly active locals made the most observations (31.9%), followed by casual locals (31%), highly active travelers (27.8%) and casual travelers (9.3%) (Figure 1c).

Assessing how participation level and traveler status shape sampling

Quality grade

The best-supported model of observation quality grade included participation group, traveler status, and their interaction as predictors (Supplemental file 3: Table 4). We found that highly active users were more likely to make observations that reach research grade, and casual users were more likely to make observations that remain as needs ID or are designated as casual grade (Figure 2; Supplemental file 3: Table 5). Travelers were more likely to make research-grade observations than residents, and less likely to make casual or needs ID observations.

cstp-11-1-868-g2.png
Figure 2

iNaturalist observation quality grade proportions made by each of the four defined user groups: casual locals, casual travelers, highly active locals, and highly active travelers.

Land cover

The best-supported land cover model contained participation group, traveler status, and their interaction as predictors (Supplemental file 3: Table 4, 7). Highly active users, regardless of traveler status, were less likely to sample in developed landscapes and more likely to sample in natural green areas compared with casual users (Figure 3a). Travelers, regardless of participation level, were more likely to sample in natural green areas and less likely to sample in areas with low development compared with locals. In developed areas, highly active travelers observed more than expected based on their participation and traveler status group effects alone. The category with the most variation in observation behavior was natural green areas, where highly active travelers observed the most and casual locals the least. With increasing human development, the difference between casual and highly active users generally widened, with these groups making similar proportions of observations in urban green and open urban areas, but casual users making higher proportions in developed areas (Figure 3a).

cstp-11-1-868-g3.png
Figure 3

Proportion of observations made by each of the four defined user groups across spatial and temporal variables. Grey bars represent expected proportion based on land area for spatial variables and day ratios for the temporal variable.

Protected land

The best-supported protected land model included participation group, traveler status, and their interaction as predictors (Supplemental file 3: Table 4). Highly active users had 2.24 times higher odds of observing on protected land than casual users (p < 0.0001), while travelers had 1.29 times higher odds compared with locals (p < 0.0001; Figure 3b; Supplemental file 3: Table 7). Highly active travelers were slightly less likely to observe in protected areas than expected from the combined effects of participation level and traveler status (p < 0.0001; Supplemental file 3: Table 7). All four user groups observed more than expected on protected land. Highly active travelers were the most biased toward protected land, observing 36.2% more than expected, followed by highly active locals (+30.3%), casual travelers (+17.4%), and casual locals (+11.7%) (Figure 4).

cstp-11-1-868-g4.png
Figure 4

Percent difference between expected and realized sampling across land cover, protected land, urban areas, urban parks, low-income urban areas, and weekends for each of the four defined user groups. Blue lines right of zero indicate overrepresentation and red lines left of zero indicate underrepresentation.

Urban areas

The best-supported urban areas model included participation group, traveler status, and their interaction as predictors (Supplemental file 3: Table 4). Casual users had 2.08 times higher odds of observing in urban areas than highly active users (p < 0.0001), and locals had 1.33 times higher odds than travelers (p < 0.0001; Figure 3c; Supplemental file 3: Table 7). Highly active travelers were slightly more likely to observe in urban areas than expected from the combined effects of being highly active and a traveler (p < 0.0001; Supplemental file 3: Table 7). All four user groups observed more than expected in urban areas. Casual locals were the most biased toward urban areas, observing 52.3% more than expected, followed by casual travelers (+42.6%), highly active locals (+31.6%), and highly active travelers (+27.6%) (Figure 4).

Urban parks and neighborhoods

The best-supported urban parks model included participation group and traveler status without their interaction as predictors (Supplemental file 3: Table 4). However, there was equivocal support for the second-ranked model, which included the interaction term and is presented in the supplemental materials (Supplemental file 3: Table 7). Highly active users had 2.02 times higher odds of observing in parks compared with casual users (p < 0.0001), and travelers had 1.1 times higher odds compared with locals (p < 0.0001; Figure 3d; Supplemental file 3: Table 7). All four user groups observed more than expected in urban parks relative to neighborhoods. Highly active travelers were the most biased toward parks, observing 20.3% more in these areas than expected, followed by highly active locals (+18.8%), casual travelers (+12.8%), and casual locals (+12.6%) (Figure 4).

Low-income areas

The best-supported income model included participation group, traveler status, and their interaction as predictors (Supplemental file 3: Table 4). Locals had 1.33 times higher odds of observing in low-income areas than travelers (p < 0.0001), and casual users had 1.16 times higher odds than highly active (p < 0.0001; Figure 3e; Supplemental file 3: Table 7). Highly active travelers were slightly more likely to observe in low-income areas than explained by the combined effects of being highly active and a traveler (p < 0.0001; Supplemental file 3: Table 7). One user group, casual locals, observed more than expected in low-income areas (+1%) (Figure 4). All other user groups—casual travelers, highly active locals, and highly active travelers—observed slightly less than expected in these areas. Highly active travelers under-sampled most in non-low-income areas, observing 3.5% less than expected, followed closely by casual travelers (–3%), and then highly active locals (–0.3%).

Weekends versus weekdays

The best-supported weekend versus weekday model included participation group, traveler status, and their interaction as predictors (Supplemental file 3: Table 4). Highly active users had 1.19 times higher odds of observing on weekends than casual users (p < 0.0001), and travelers had 1.05 times higher odds than locals (p < 0.0001; Figure 3f; Supplemental file 3: Table 7). There was a significant interaction, indicating the tendency to observe on weekends is weaker for traveling highly active users compared with traveling casual users (p < 0.0001; Supplemental file 3: Table 7). Of casual users, travelers were more likely to observe on weekends than locals, but in highly active users this trend flipped and locals were more likely to observe on weekends than travelers. All four user groups disproportionately observed more on weekend days than weekdays. Highly active locals were the most biased toward weekend days, observing 9.3% more than expected, followed by highly active travelers (+7.5%), casual travelers (+6.3%), and casual locals (+2.6%) (Figure 4).

Discussion

Spatial and temporal biases in data contributed by volunteer participants have been well-documented (Di Cecco et al. 2021; Ellis-Soto et al. 2023), but without clear connection to how different users may perpetuate or reduce those biases. We established two key ways to distinguish users, those who participate more (highly active) or less (casual), and those local to the region (local) versus those visiting (traveler). We then asked how different user groups observe biodiversity across land contexts and days of week. We explicitly chose these metrics because we know, based on previous work (Di Cecco et al. 2021; Dimson and Gillespie 2023), observation effort is biased across these variables, and we expect that user groups have different behaviors and preferences impacting how observations accrue. A key result we explore herein is that highly active users tend to seek out biodiversity-rich areas, while casual users primarily incorporate iNaturalist into their day-to-day. We also found that locals likely fill gaps in low-income urban areas that travelers often miss. We present evidence supporting these insights, with implications for contributory program planning and downstream data use. We note that without participant surveys, it is possible to only indirectly infer user motivations, preferences, and abilities from their observation patterns alone.

A main finding of this work is that observer participation level more strongly shaped how observations were structured across space and time than traveler status. Within participation groups, whether a user was a local or traveler was secondary in structuring observations, with travelers often showing similar directional tendencies to highly active users, and locals similar to casual users. However, observation patterns in urban neighborhood income categories were shaped more by traveler status than participation group.

Highly active users seek out biodiversity-rich areas, while casual users primarily incorporate sampling into daily life

We found that casual and highly active users have different observational habits and likelihoods of making observations that reach research grade. In particular, highly active observers generally go out of their way to sample in natural and protected areas, both within and outside cities, and are more likely to add observations on weekends than casual users. Such areas are likely to harbor higher, unique diversity given that developed areas tend to act as strong filters for such diversity (Aronson et al. 2014). To be clear, highly active users also contribute numerous records in developed areas and during the week, but what sets them apart from casual users is their high proportion of observations in biodiverse, natural areas.

The preference of highly active observers toward these natural locations, presumably often during leisure time, aligns with their increased motivation to contribute to conservation research (Larson et al. 2020; Lowe et al. 2025), and with previous work focused on other contributory science initiatives. For example, Rosenblatt et al. (2022) found that specialist birders, who have stronger skills and devote more time to birding, travel farther from home to observe. Similarly, Bowler et al. (2022) observed that participants with more experience were more likely to conduct planned searches, and that these searches were more likely to occur in natural and protected areas than urban areas. Our findings, in combination with these works, suggest highly active users make intentional and focused efforts to collect biodiversity data.

Highly active users fill critical gaps in areas broadly under-sampled on iNaturalist, such as rural areas and natural green spaces, but also perpetuate biases toward protected land and urban parks. Still, observations from these biodiverse areas are particularly valuable for research on rare or threatened species, range shifts, ecosystem response to environmental change, and urban-natural comparisons. Consistent with previous work (Dimson and Gillespie 2023), we found observations made by highly active users were more likely to reach research grade than those by casual users. This may be because highly active users include more detailed images or metadata with their observations, or might initially provide more specific identifications (e.g., genus rather than family or order). As the primary source of research-grade records, highly active users play a key role in generating high-quality biodiversity data for research. However, these users also tend to neglect non-park urban areas and unprotected rural lands, which are most likely privately owned. These areas may host lower overall biodiversity, reducing their sampling potential and attractiveness to observers, but they also might simply be places highly active users can’t access or don’t frequent.

Our results suggest that observers who use iNaturalist more casually tend to primarily incorporate sampling into their daily lives. These casual users sample predominantly in urban areas, strongly perpetuating biases toward developed land that are well-documented in community-collected biodiversity data (Di Cecco et al. 2021). These findings align with those in Dimson and Gillespie (2023), who found that short-term observers who were residents of Hawaii were the least likely to sample outside developed areas. This is likely because casual users sample primarily close to home or during their routines and are less likely to go on nature-specific excursions, a conclusion further strengthened with the result that these users, especially locals, are less biased toward weekends than highly active users, and are more likely to make casual observations that reflect cultivated organisms. We note, however, the lack of direct information about where observers reside, and that these conclusions are inferences based on patterns in their sampling behavior.

Although casual users perpetuate data biases toward developed areas, they critically fill gaps in areas less served by highly active users. Within cities, casual users sample proportionally more than highly active users in non-park neighborhood areas. As observations in urban parks are generally overrepresented on iNaturalist, these non-park observations are particularly valuable for sampling broadly across cities and can help researchers more effectively understand how biodiversity responds across the broadest urban gradients (Callaghan et al. 2020). Some of these non-park observations are cultivated plants on private property based on a spot-check of records to determine cultivation status of casual-grade records. While these may be considered less valuable, records of cultivated species can offer valuable insight into the interaction between cultivated and natural biodiversity, the resiliency of urban green areas to environmental change, and environmental equity across cities (Leong et al. 2018). Still, the primary goal of iNaturalist is to collect biodiversity information on wild organisms, and cultivated observations can sometimes pose challenges for ecological analyses, particularly when the “cultivated” tag is not properly selected (López-Guillén et al. 2024).

In addition to filling data gaps within urban environments, casual users fill gaps in non-protected rural areas. In the southeastern USA, the majority of forested land is privately owned (Butler and Wear 2013) and heavily underrepresented on iNaturalist. Contributory biodiversity monitoring is especially critical in these areas, as they are generally not covered by traditional ecological monitoring. Casual users, perhaps because they represent a broader pool of participants who live in, know about, or have access to these areas, are helping fill these data gaps.

Locals fill gaps in low-income urban areas regardless of participation group

Regardless of participation group, local iNaturalist users were more likely to observe in low-income urban areas than travelers. Casual locals observed in these areas slightly more than expected by land area, representing the only case in our analyses in which a user group actively sampled against the overall data bias (Figure 4).

It is well-documented that contributory science platforms are impacted by socioeconomics, with historically redlined and lower-income areas underrepresented and yielding a less complete picture of biodiversity (Ellis-Soto et al. 2023; Estien et al. 2024). We did not find large differences in representation across income classes, likely in part due to the broad classifications of low-income versus higher-income areas in our analyses, but more importantly because iNaturalist users local to the southeastern USA helped mitigate biases toward higher income-areas. Filling gaps in underrepresented areas not only provides valuable data for research but also strengthens the capacity of communities in these areas to advocate for better environmental protections (Ellis-Soto et al. 2023). Here, we focused broadly on locals across the southeastern USA, but our work provides preliminary evidence that better engaging residents may be key to improving data availability and knowledge. Additionally, while this was not our focus here, facilitating participation from locals might reduce not only spatial but also taxonomic biases because locals, particularly those who are highly active, may be more attuned to regional biodiversity (Dimson and Gillespie 2023).

Implications for contributory project planning

Our results emphasize that a broader pool of contributors, varying in location, preferences, motivations, and abilities, enhances the scope of data collected and supports iNaturalist’s mission to “connect people worldwide with nature while advancing biodiversity science” (iNaturalist 2025). While highly active users are often prioritized in contributory science projects, our findings highlight the importance of recruiting new participants and residents of under-sampled areas, regardless of whether they will eventually become highly active. Strategies focused solely on cultivating highly active users may risk overlooking other contributors who fill critical data gaps.

Our work offers insights into which user groups are likely, willing, or able to observe in certain environments, and can be used to target specific audiences depending on project goals. Previous research has demonstrated that contributory project participants can be influenced to observe in under-sampled areas or areas of conservation concern thorough behavioral nudging, an intervention with the goal of behavioral changes (Callaghan et al. 2023; Thompson et al. 2023). Our results can help augment these efforts by implementing targeted nudges based on behavioral attributes of participants (Hart et al. 2022). For example, casual users may be more likely to respond to nudges asking them to sample near their neighborhood, while highly active users may be more willing or able to travel to rural areas of conservation concern. It is unclear whether the effectiveness of behavioral nudging varies by user group, and this presents a valuable question to explore in future work.

Implications for the field of citizen science

Our results illustrate how considering multiple behavioral axes and their relation to each other can disentangle participation patterns when assessing how volunteers contribute to citizen science. We found differences in sampling patterns across both the participation level and traveler status behavioral axes, generally supporting the notion that motivations of participants are complex and multifaceted (Dowthwaite et al. 2025; Robinson et al. 2021; West et al. 2021). As mentioned above, because different groups have different motivations and contextual conditions, project design needs to consider how and when to appeal to the different groups (Bowser et al. 2014; Lee et al. 2018). As an example, different maps could be provided to the different user types, highlighting specific areas to sample (Skarlatidou et al. 2024). This user-focused design concept is relevant not only to biodiversity-focused contributory science programs, but also citizen science programs more broadly (Fogg-Rogers et al. 2024). Understanding how different user groups contribute, and what motivates or enables their participation, provides a pathway to refine both analyses and project design, forming an important component of future research. When conducting these multi-axes analyses, axes may vary based on the size, complexity, goals, and subject matter of the citizen science project, participant information available to the researcher, or specific hypotheses or questions held by the researcher or project organizers.

Implications for biodiversity research

Understanding that different kinds of observers are more or less likely to observe in various environments and at different times is key to understanding the processes which underlie opportunistic spatial and temporal data biases. Our work suggests that spatial and temporal biases(e.g., weekends versus weekdays) may be intimately linked, which is valuable to consider when accounting for oversampling in phenology analyses, an area of research rapidly expanding using opportunistic data (Li et al. 2021). Previous work has shown that different biodiversity trend analyses vary in robustness to variation in user sampling behavior (Pocock et al. 2023b).

These same user-structured spatial biases in sampling also likely impact region-specific taxonomic coverage. Previous literature suggests highly active and casual users also differ in their taxonomic preferences. Deitsch et al. (2024), for example, found less-experienced iNaturalist users were more likely to observe large, brightly colored, invasive spiders compared with more-active users. In sum, our work implies that taxonomic biases, in addition to temporal biases, may not be consistent across space, which is relevant for researchers aiming to draw comparisons across land contexts or gradients.

Generalizability and future directions

While our work encompasses more than seven million observations and a broad geographic area, our findings describe patterns and processes within the context of the southeastern USA. We expect many of our findings (i.e., highly active users preferentially targeting biodiverse and protected areas and casual users incorporating observations into daily routines) to be broadly transferable to other regions. This is because behaviors stem from general motivational and logistical constraints (e.g., time availability, travel capacity, proximity to home) that shape opportunistic biodiversity recording regardless of geography (Bowler et al. 2022). While the particular land-use configurations of the southeastern USA are unique, the underlying behavioral drivers are likely common across contexts. However, future studies, building upon the work of Dimson and Gillespie (2023) and our analyses, are still needed to explore whether user groups maintain distinct sampling patterns across different land, cultural, and socioeconomic contexts.

Although we think it likely that many of these conclusions are relevant to other contributory science platforms, previous work suggests differences in participant behavior, motivation, and preferences across platforms (Boakes et al. 2016; Crimmins et al. 2021). Further work to capture dynamics on other platforms would help extend these conclusions to the global citizen science field more broadly and clarify which are unique to iNaturalist.

Here, we focused on two dimensions of user behavior and broadly categorized users. While this was appropriate to answer our questions, future analyses could be extended to include additional and continuous behavioral axes (August et al. 2020). An additional next step to more clearly link motivation to sampling behavior is to survey participants and analyze how their motivations, abilities, and background predict sampling patterns. Finally, it is unknown whether users maintain their sampling preferences over time or shift their observation patterns as they transition to being more highly active. Our data summarizes over those differences, but individual user traces could be used to examine this question more directly.

Conclusions

Sampling patterns and biases in opportunistic biodiversity data reflect the varied preferences of iNaturalist users. We show that by classifying users based on simple behavioral characteristics, we can predict where and when they are likely to sample and better understand how biases accrue and critical data gaps are filled. These results highlight the importance of retaining highly dedicated users and recruiting new participants, especially residents of underrepresented areas. When researchers use these data to answer ecological questions, especially when interested in certain environments or comparisons across environments, it is useful to consider who might be making observations in different areas, their motivations or expertise, and how these might vary.

Data Accessibility Statement

iNaturalist records are archived on Zenodo (DOI:10.5281/zenodo.15278505) and associated code is available on GitHub (https://github.com/eringrady/iNaturalist-users-exhibit-distinct-spatiotemporal-sampling-preferences.git).

Supplementary Files

The Supplementary files for this article can be found as follows:

Supplemental File 1: Table 1

Key variables considered in this work, their source, and related references. DOI: https://doi.org/10.5334/cstp.868.s1

Supplemental File 2: Tables 2–3

Expected versus observed observation proportions. DOI: https://doi.org/10.5334/cstp.868.s2

Supplemental File 3: Tables 4–7

Model selection and results. DOI: https://doi.org/10.5334/cstp.868.s3

Acknowledgements

We extend our gratitude to iNaturalist users who contributed records in the southeastern USA, making this work relevant and possible. We thank Dr. David Blackburn and Dr. Vaughn Shirey for providing valuable feedback. We acknowledge UFIT Research Computing for providing resources and support.

Competing Interests

The authors have no competing interests to declare.

Author Contributions

ELG, RPG, CJC, and CTC developed the ideas; CJC downloaded and compiled the data; ELG analyzed the data; ELG led the writing of the manuscript. All authors contributed substantially to drafts and gave final approval.

DOI: https://doi.org/10.5334/cstp.868 | Journal eISSN: 2057-4991
Language: English
Submitted on: May 1, 2025
|
Accepted on: Jan 1, 2026
|
Published on: Jan 29, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Erin L. Grady, Caitlin J. Campbell, Corey T. Callaghan, Robert P. Guralnick, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.