Science Stories: Adventures in Bay-Delta Data

rss
  • July 19, 2024

By Pete Nelson

What’s this post about?

A Juvenile Production Estimate (JPE), as we discussed in Part 1 of this essay, is an estimate of the number and timing of outmigrating juvenile spring-run Chinook Salmon (“spring run”) as they enter the Sacramento-San Joaquin Delta. It is an important tool for protecting these fish because it helps water managers anticipate when these salmon may be at risk of becoming entrained in water diversions as well as serving as an important check on the status of this population. In this part, I’ll describe the cutting-edge genetic and modeling tools we’re using to distinguish spring run from the other Central Valley Chinook. This series will finish with a final installment full of the quantitative modeling we’re developing to pull in all the salmon and environmental data and actually produce a forecast of juvenile spring-run production.

Distinguishing spring-run Chinook from other salmon

Ronald Reagan once said, “A tree’s a tree: How many more do you need to look at?” I don’t know about Mr. Reagan, but most of us could probably tell that there’s a difference between a valley oak from a ponderosa pine (hint: one has pinecones and the other has acorns); however, even the best fish biologists can’t tell a spring-run from a fall-run Chinook based on looks alone.

Table 1. Central Valley Chinook salmon life cycle timing.

Table indicated what time of year you can generally find salmon of various life stages in the central valley of California.

Until recently, the most practical way to separate the four different runs (remember? there are fall run, late fall run, winter run, and, of course, spring run) was to consider the size of the fish and the day of the year it was observed. Each run spawns during a different time of the year, and if those developing eggs and fry grow at approximately the same rate, you might expect that spring run, with peak spawning in September and October, are going to be a little larger than fall run, which spawn about two months later (Table 1). Similarly, late fall- and winter-run fish, which begin their spawning even later are likely to be smaller still. As you peruse Table 1, you’ll see that the four runs spawn during different times of the year (light blue bars), but there’s a lot of overlap, particularly between spring run and fall run. When you factor in variations in water temperature and food availability across the watershed, which directly affect the duration of egg incubation and larval and juvenile growth, there’s even more potential for overlap in size between the runs. To make run identification more challenging still, the juveniles from different spawning locations are typically moving downstream during roughly the same timeframe (November through July), and juveniles from some runs (especially late fall run and spring run) over-summer where they can find cold enough water and don’t migrate down until the following migration season, but typically earlier in the season and at a much larger size. Nonetheless, observing outmigrating juveniles with the first rains in November, you’d probably expect the largest of these fish to be late fall-run and spring-run fish that delayed migrating to the ocean and spent their first summer in their natal streams, with successively smaller fish being young-of-the-year winter run, spring run, and fall run.

Four graphs showing the lengths and dates traditionally used to tell juvenile salmon runs apart, along with the genetic run assignments. Length at date is often wrong.

Figure 1. River length-at-date (LAD) run identification curves (colored panels) and the genetic identity of fish (panels A-D) for juvenile Chinook collected at Chipps Island ( from Johnson et al. 2017). Word is, this figure is based on older genetic results and a lot of the bigger fall-run here are probably spring-run or late fall-run (thanks Brett Harvey!).

Counting cards, counting salmon

Now take a look at Figure 1. The background of each panel, made up of curved shapes, is colored to indicate the predicted size ranges on any given day of the year for the four run-types: Fall run as orange, late fall run as green, winter run as blue, and spring run as purple. These colors are generated using a mathematical formula that relates a fish’s length-at-date or LAD to a particular run-type. For example, looking at the blue winter-run curves, you’ll see that they (the fish and the shapes) start small, early in the time period—not even 50 mm long in September—but growing to 200 mm as early as about April. Therefore, if the models are correct, we would expect a small, 50 mm fish in October to be the progeny of parents who returned to fresh water in the winter (winter run). Similarly, a 75 mm fish in October is likely to be a late fall-run fish, and a juvenile 80 mm or larger in October is expected to be a fall-run fish. That’s if the fish play by our rules! Now look at the black dots on these graphs: Each dot represents the size of an actual individual, outmigrating juvenile salmon that was genetically identified as belonging to one of the four run-types, and the dots don’t all stay in the colored curve where we’d expect based on length. Still need convincing that these models aren’t perfect? Figure 2 compares the percentage of outmigrants belonging to each run as determined by LAD versus genetic test, and the success of the LAD approach at predicting run type could only be described as abysmal. Bottom line: LAD models are a less than dependable option for identifying the run of a juvenile salmon. Still, LAD does provide the foundation for another important element of our JPE program: probabilistic length-at-date or PLAD models.

Bar graph showing the percentage of fish in different rivers that are assigned each run type based on genetics and length-at-date. The bars are similar for fall run and winter run, but not very similar for spring-run.

Figure 2. Percent of total juvenile Chinook salmon by field year assigned to each run (rows), based on length-at-date (LAD) criteria versus genetic analysis (from Brandes et al. 2021).

The PLAD approach also uses the size of a juvenile salmon and the date captured to assign run type, but, employing a similar approach to counting cards when trying to beat your nephew during a competitive game of “Hi-Lo”[1],; PLAD also uses probabilistic modeling to estimate the uncertainty associated with that run assignment. Figure 3 shows how two run types might be distinguished with varying degrees of confidence based on day of capture, size, and our history of genetically determined run types of that date and size combination. Here, the larger the fish, the more likely it is to be from that “blue” run type. Each contour line is defined by a probability measure. That measure of probability relies on our database of juvenile salmon captures where we’ve used genetic techniques to determine the run-type for juveniles sampled across a range of sizes, dates and locations of capture. We’re particularly careful to retain a tissue sample from fish with a low probability PLAD run assignment; later genetic analysis is expected to improve our model’s accuracy. As this database is expanded, our ability to include such factors as the tributary where the salmon was captured and environmental factors (e.g., flow and temperature) likely to affect growth rates will be improved. In other words, the PLAD model does not provide a run-type determination with complete certainty, but it should dramatically improve our ability to distinguish between these runs and it’s cheaper, quicker, and easier than doing the genetics on every single fish we sample.

Graph showing two overlapping sets of length-at-date lines with probabilities for each set of lines.

Figure 3. Conceptual depiction of probabilistic length-at-date (PLAD) juvenile salmon size ranges for two runs (from Nelson et al. 2023).

Genetics

Thus far, we’ve been blithely referring to the use of “genetics” for distinguishing amongst Chinook run types without a care in the world for the hard-working folks who extract the DNA from these fish and examine the sequence of molecules contained therein. So how do they do it? DNA sequences can be used for differentiating amongst groups at varying levels—blood found at a crime scene might be examined first at the species level: Is this blood from a human victim or is this merely evidence of poor kitchen practices where someone butchered that darned rooster who persisted in crowing at 4 AM every morning? DNA can be used to determine if this is poultry or human blood, and it can also be used to determine which human left that blood. As you might imagine, distinguishing between individual humans requires more detail (more genetic markers, more time and expense) than determining foul versus fowl. The laboratory techniques for making these determinations are continually being improved, and DWR is now using a method that allows field crews to take a tissue sample from a fish and right there, in as little as 30 minutes, learn whether that fish is a spring-run salmon or not. The fish is unharmed, the sampling is quick, and the process is relatively inexpensive.

Our team has several genetic tests that we employ, each giving results with varying levels of precision and each with different demands in terms of time, effort and cost. Our initial test uses a CRISPR-based technique known as SHERLOCK (Baerwald et al. in press). It’s fast and cheap—less than $2.26 per sample—and gives one of three results: Early/Early, Late/Late, and Early/Late. These three terms relate to Central Valley Chinook run timing and the knowledge that these behaviors have largely been attributed to a single genetic region. Like humans, each salmon has two copies of each gene; each juvenile salmon inherits either an Early or a Late gene from dad, and an Early or Late gene from mom. Thus, each juvenile salmon ends up with either two Early genes (Early/Early or “homozygous Early”) or two Late genes (Late/Late or “homozygous Late”), or the juvenile may end up with one of each gene (Early/Late—“heterozygous” as the geneticists would say). It turns out that both winter- and spring-run salmon are Early/Early, and Late/Late fish are either fall-run or late fall-run salmon. Additional testing can distinguish between winter and spring run. What about the Early/Late heterozygotes? Yet another genetic test may allow us to assign those fish to a specific run, but this test costs more and must be processed in the laboratory, which takes more time.

Not only are these genetic tests critical to the ostensibly simple counts (“three more spring run and one more winter run”) that go into calculating a JPE, but they also serve to improve our PLAD model, testing and refining our ability to predict how to make run assignments using only size and date. After all, the PLAD models are still the quickest and cheapest way to determine how many juvenile salmon are produced by each of the runs.

What next?

Maryam Mirzakhani, the amazing Iranian-born mathematician, said, “The beauty of mathematics only shows itself to more patient followers.” Similarly, our efforts to forecast the number of juvenile spring run each year requires a little patience...and requires what ecologists refer to as quantitative modeling. This is where we take the monitoring data, PLAD results and the output of genetic analyses, and join it with environmental information—principally water temperature and flow rates—to generate a final JPE number. But for this, you’ll have to wait for Part 3.

Further Reading

  1. Hi-Lo is a simple card game in which the dealer turns over the top card in a full deck and the player then guesses whether the next card in the deck will be higher or lower than the first card. Guess right and the player wins; guess wrong and the dealer wins (if the cards are equal, then neither player wins). The process continues as each card is revealed in succession. To give a very simple example, if you count the face cards (Jacks, Queens, Kings—12 in the deck, total), and the dealer has revealed 10, chances are very good that a Queen is going to be followed by a lower card.

Categories: General
  • April 26, 2024

Untangling the estuary’s food web with data analysis and synthesis

Blog by Rosemary Hartman. Paper and analysis by Tanya L Rogers, Samuel M Bashevkin , Christina E Burdi, Denise D Colombano, Peter N Dudley, Brian Mahardja, Lara Mitchell, Sarah Perry, and Parsa Saffarinia.

Did you know that one female threadfin shad can lay 5,000 to 20,000 eggs at once?! The California Department of Fish and Wildlife’s Fall Midwater Trawl, which surveys the Sacramento-San Joaquin Delta and upper San Francisco Estuary, caught 1,551 threadfin shad in 2022. If half of those were female, they could have produced three to fifteen MILLION baby fish. But in 2023 the Fall Midwater Trawl only caught 1,922 threadfin shad. What happened to all the baby fish?

Apart from the number of parents, fish populations (and other populations) are usually controlled by a combination of three ecological processes: 1) Individuals cannot find enough food or nutrients to survive and reproduce (what’s known as a ‘bottom-up’ process), 2) they are eaten by predators (what’s known as a ‘top-down’ process), or 3) they encounter unfavorable environmental conditions such as high water temperatures that lower survival. It is frequently very hard to tell which process is dominating, but understanding whether lack of food or too many predators is controlling a population can be very helpful in figuring out how to recover populations (Figure 1).

Diagram of a food web pyramid showing that predators exert top-down pressure on their prey, whereas food exerts bottom-up effects on their predators.

Figure 1. Fish populations depend on the number of fish you start with (parents), amount of food available, number of predators, and suitable environmental conditions such as temperature and salinity. Diagram by Rosemary Hartman, DWR.

To try and figure out which processes might be controlling threadfin shad (and other estuarine fishes), a team of scientists recently applied structural equation models to a long-term dataset of physical variables, phytoplankton, zooplankton, clams, and fishes. The resulting paper “Evaluating top-down, bottom-up, and environmental drivers of pelagic food web dynamics along an estuarine gradient” was recently published in the journal Ecology. It is a great example of how long-term monitoring data, data integration, cutting-edge statistics, and diverse teams can work together to provide new insights about our environment.

The team started by drawing out their hypothesized relationships between ecosystem components in a food-web diagram (Figure 2). They wanted to create a mathematical model that described the relationships between each component in their diagram, but could only do so if there were adequate data available. They compiled data from six different long-term monitoring programs and one model dataset for a combined package of forty years of data across the estuary. Unfortunately, some of the key variables in their conceptual model, such as specific types of phytoplankton, aquatic vegetation, and large, predatory fishes, haven’t been monitored as well as others, so couldn’t be used in the model, but what was available was impressive.

Diagram showing that predatory fish eat smaller fish, smaller fish eat zooplankton, zooplankton eat phytoplankton, and clams eat phytoplankton and zooplankton. Nutrients control amount of phytoplankton.

Figure 2. Conceptual model of the estuarine food web used as a basis for the NCEAS synthesis team's food web model. Diagram adapted by Rosemary Hartman, DWR, from Rogers et al, 2024. 

They applied a modeling technique that allowed them to quantify each of the connections in their food web diagram to determine whether changes in one population were due to top-down effects of their predators, bottom-up effects of their food supply, or environmental variables like temperature and flow, and this was not an easy task! As Tanya Rogers, one of the team members and co-lead author on the final paper said, “It was sometimes hard to find the right balance between detail and interpretability”. Really complicated models have lots of detail, but can be hard to understand, but if the model is too simple it doesn’t describe reality. They eventually settled on a model that was somewhere in between. They found that fish and zooplankton trends were more driven by food supply in freshwater areas of the estuary, but the same populations were more driven by predation in the brackish water areas of the estuary. Abiotic drivers (temperature, turbidity, and flow) were frequently important in all regions and at all levels of the food web and had similar or greater effects than food supply or predation.

This was a great example of using an integrated dataset to address big questions about the ecology of the estuary, but the coolest part of it may have been the team that put it together. This project was part of a synthesis program sponsored by the Delta Science Program and led by the National Center for Ecological Analysis and Synthesis (NCEAS). The Delta Science Program recruited a diverse team of researchers from State and federal resource agencies and local universities with complementary areas of expertise. The team then participated in training workshops run by NCEAS on open science practices, data synthesis, reproducible workflows, data publications, and statistical techniques. Once the team had these new skills, the NCEAS trainers helped them assemble the integrated datasets and analyses used for the project. This workshop resulted in a really cool paper, but more importantly the team gained the skills to do more research like this in the future. Plus, they had fun getting to work with other early career researchers from different organizations who bring different perspectives.

Most exciting of all, the team made several recommendations for future research, such as exploring nutrient dynamics in more detail, testing changes to the dynamics in different salinity zones, exploring different time scales, and filling monitoring gaps such as large predatory fishes. With the team’s new skills and the available data, there are a lot more possibilities to explore.

Further reading

Categories: General
  • November 22, 2023
Adult Chinook Salmon with a fin tag being released by a scientist into a river. Photo from CDFW.

By Peter Nelson

The Challenge

Spring-run Chinook salmon (“spring-run”) are listed as threatened under both the California Endangered Species Act and the Federal Endangered Species Act. Like most salmon, these fish are anadromous: The adults, having grown and matured in the ocean, return to their natal stream to spawn, and the juveniles, after rearing in freshwater, eventually migrate downstream to the ocean (see Figure 1). During that downstream migration, juvenile salmon are exposed to a gauntlet of threats, including warm water temperatures, predators of all sorts, and “taking the wrong turn” through water diversions and getting lost on their way to the ocean. Managing or reducing the risk posed by water diversions is a responsibility of the Department of Water Resources, and to do that water managers need to know the number and timing of those outmigrating juvenile spring-run as they enter the Delta. Coming up with an accurate prediction of this—what’s termed a Juvenile Production Estimate (PDF) or JPE—is not simple. This is the first of a two part piece about our efforts to develop a JPE, both what’s been accomplished and what’s planned, as well as a timeline.

Diagram of spring-run salmon life cycle showing adults migrating upstream into the mountains where they hang out in cold water pools below dams all summer before spawning. Juveniles then travel through the delta to the ocean to mature. - link opens in new window

Figure 1. Spring-run chinook have a complex life cycle. The adults migrate upstream in January through March, but instead of spawning right away like most salmon they hold in coldwater pools all summer and spawn in the fall. Diagram by Rosemary Hartman, Department of Water Resources. Click to enlarge.

The Approach

We know when to expect adult spring-run to return to their natal streams to spawn based on past experience: Humans, beginning with the indigenous peoples of the West Coast, have been observing these runs for generations, and we might reasonably expect that the numbers of returning adult salmon are a decent predictor of the juvenile fish those returning salmon will eventually produce. Observations by multiple teams of biologists of adult salmon throughout the Central Valley allow us to predict the likely numbers of juvenile spring-run expected to migrate downstream and enter the Delta on the way to the Pacific Ocean each year. “Hold on a minute,” you might say, “What about the water in those streams? If the creeks are low and the water is warm, surely those baby salmon won’t do as well as they might when conditions are good.” You’d be right! The number of reproducing salmon—the parents—isn’t a perfect predictor of the number of offspring: There are many environmental factors that affect juvenile production, but, based on past studies of salmon ecology, we can include factors like flow in our analysis of the likely number of juveniles that will be produced by the annual return of adult salmon (for example, see Michel 2019; Singer et al. 2020).

These estimates, however, are just that—we can’t know exactly how the varying amount of water will affect the survival of juvenile salmon as they grow and migrate, but we should get reasonably close, and we have another source of information to improve our estimates, the number of outmigrating juveniles that we observe directly as they swim towards the Delta: The streams where spring-run spawn regularly have rotary screw traps (Video) (RSTs, Figure 2) on them. These devices divert migrating juveniles into a holding pen where biologists count and measure them each day before releasing them back into the stream to continue their journey to the sea. Data from these RSTs give us another check on our estimates based on spawner production, and are themselves an alternative means for estimating spring -run juvenile production.

A rotary screw trap with a conical trap and surrounding deck deployed in a narrow channel with trees growing on the bank.

Figure 2. A rotary screw trap floating in the Yolo Bypass Toe Drain with its cone out of the water (not sampling). Photo courtesy of the Department of Water resources.

One last point: In order for water managers to use these predictions for how many (and when) spring-run are expected to reach the Delta, these estimates need to happen each year before spring-run are expected to enter the Delta when water managers need to make decisions about their operations. This is especially tricky for estimates that rely on that RST data because it only takes a few weeks for juvenile salmon to travel from the RSTs to the Delta. This means that the process of counting adult salmon and (especially) juvenile salmon in the RSTs, entering those data into a shared database, and crunching the numbers to produce a JPE must be fast, efficient and accurate.

Gathering Information

This is a collaborative, interagency effort, which we began by holding a broad-based, public workshop in September 2020 with the Department of Fish and Wildlife (see Nelson et al. 2022 for details) and writing a science plan (PDF) with our agency partners to determine what monitoring data were needed to develop a spring-run JPE. Estimating an annual spring-run JPE is complicated by (1) the broad geographic and geologic range of Central Valley streams that support spring-run, (2) the challenge of developing a holistic, coordinated multi-agency monitoring framework for generating quantitative estimates of juvenile spring-run across their range, (3) the variable life history displayed across the spring-run streams, and (4) the difficulty of distinguishing juvenile spring-run from other run types (fall run, late-fall run, and winter run) found in the same streams (we will talk more about distinguishing salmon run type in our next blog post).

Monitoring

Most of the monitoring in spring-run streams is conducted by the staff of several governmental agencies (e.g., Deer Creek), gathering data on the numbers and timing of returning adults and of migrating juveniles, and tracking the changes in these metrics from year-to-year, but monitoring historically was designed to focus on local management needs, employed multiple methods and focused on different life stages across the watershed. Some work has been done to integrate data on number of returning adults (CDFW's GrandTab dataset, which produced the graph of returning adults, Figure 3 below). However, a spring-run JPE will require more a coordinated approach with the means of combining data from more than 40 monitoring programs from eight regions, several governmental agencies, and nearly two dozen data stewards and managers, using diverse methods and having large discrepancies in monitoring histories. These are significant challenges, but they can be met as long as we’re aware of the limitations (see below).

Bar graph of returning adult spring-run chinook salmon in the JPE tributaries. Total escapement varies from over 20,000 to less than 1,000, with Butte Creek having the highest returns. - link opens in new window

Figure 3. Total escapement (number of returning adults) by tributary for 2000-2022. Click for an enlarged version broken out by tributary.

In addition to gathering data on the number and timing of returning adults and departing juveniles, we’ll also need data on year-to-year salmon spawning success and on the survival of those outmigrating juveniles as they move from higher elevation habitats through lower, slower and warmer tributaries, and as they migrate down the mainstem of the Sacramento River to finally reach the Delta (streams with major spring-run spawning are shown in Figure 4).

Environmental conditions too are crucial: Preeminent are the quantity of water in the system and water temperature; we know that these have strong effects on salmon survivorship and behavior. The number and location of predators also vary from year to year and can affect the number of juvenile spring-run reaching the Delta.

Map of the Sacramento Valley watershed highlighting Clear Creek, Butte Creek, Battle Creek, Deer Creek, and the Feather River, where spring-run spawn. - link opens in new window

Figure 4. Map of the Sacramento River watershed highlighting the rivers and streams where data is being collected for the spring-run JPE. Some spring-run also spawn in the San Joaquin watershed, but they have not been added to the spring-run JPE dataset yet. Click to enlarge.

Data Management

You may have heard the expression, “garbage in, garbage out”? Wherever the phrase originated, it certainly applies to ecology! Quality data and metadata (how, when, where, and by whom the data are collected) are critical to an accurate spring-run JPE and its application to salmon conservation and water management. DWR led the formation of a team to design a data management system. This team conducted extensive outreach to the various monitoring programs for the seven spring-run spawning streams identified as most important to the JPE.

This data management system is now a reality, and is designed to provide timely access to machine-readable monitoring data and metadata. To meet the annual deadlines for calculating a spring-run JPE, new RST data must be compatible across programs and reported rapidly. Building the initial dataset took over a year because of historical inconsistencies in data reporting across monitoring programs, but state and federal agencies are collaborating to make newly collected data compatible from the moment of data entry. Data from some monitoring programs are now acquired automatically from digital entry and uploads are occurring directly from the field daily; the rest of the monitoring programs will move to this “field-to-cloud” data entry system over the next several years, improving data quality and the greatly facilitating the ease of access. All historical RST data are now publicly available from the Environmental Data Initiative (use search term “JPE”), and new RST data will be added to this repository on a weekly basis. Indeed, one of the most exciting and novel aspects of the spring-run JPE effort is that it has unified much of the existing data reporting from multiple agencies monitoring along with new monitoring under a common goal and purpose.

The spring-run JPE data management program

  • has now standardized data collection methodologies, schemas, encodings, and processing protocols;
  • produces machine-readable data for all RST monitoring programs (adult data will follow soon);
  • uploads data in near real-time to a shared data management system; and
  • makes data publicly accessible in a simple format.

This system allows us to look at all the different data sources at once to learn new things! For example, if we plot the catch of salmon from the rotary screw traps at Mill Creek, the Feather River, Knights Landing, and Delta Entry from upstream to downstream (Figure 5) we see that the most upstream site (Mill Creek) catches salmon earlier than the downstream sites and catches a lot more of them. Moving downstream the catch gets smaller and smaller as juvenile salmon get lost, eaten, or die along the way. Mill Creek also has juvenile salmon leaving the stream as late as May or June, but very few of these fish make it all the way down to the Delta, indicating that later migrants might have a harder time surviving.

Ridgeline plot showing timing and number of juvenile outmigrants at Mill Creek, Feather River, Knights Landing, and Delta Entry. - link opens in new window

Figure 5. Plot of rotary screw trap catch over time for the spring of 2023 at several locations in the Central Valley. Click to enlarge.

In our next post on the spring-run JPE, we’ll describe the cutting-edge genetic tools we’re using to distinguish spring-run from the other Central Valley Chinook, the quantitative modeling we’re developing that pulls in all of the salmon and environmental data and actually produced a juvenile production estimate along with an indication of our confidence in that estimate, the peer-review process that will critique our program and recommend improvements, and where we expect to take this spring -run JPE program next.

Further Reading

Categories: General
  • June 28, 2023

Blog by Rosemary Hartman, Data collated by Nick Rasmussen

Our last post gave you an introduction to the water weeds of the Delta. Invasive submerged aquatic vegetation is taking over the waterways, making it difficult for boaters, fish, water project operations, and scientific researchers (Khanna et al. 2019). As we described in the blog “Getting into the weeds”, they are hard to control too. But how do we collect data on aquatic weeds and what do those data look like?

How do we collect data?

There are two main types of data that we can work with (see IEP Technical Report 92 (PDF) for details). The first, is collected with areal photographs, and is known as ‘remote sensing’ (Figure 1). A specialized camera (sensor) is mounted on a platform (a drone, airplane, or satellite), and it can collect a regular-old photograph, or a hyperspectral photograph that records a lot of wavelengths of light that our eyes can’t see. Because different types of plants reflect different spectra of light (they are different colors) these photographs can be used to map location and extent of vegetation (Figure 2).

Diagram of remote sensing process showing an airplane flying over the water. Lines representing light go from the sun to the ground and are reflected to the sensor on the plane.
Figure 1. Diagram showing how remote sensing works. Light from the sun is reflected by objects on the ground. Different wavelengths of light are reflected differently. The sensor (camera) on the platform (airplane, satellite, drone, etc) registers the different wavelengths and stores them as an image file. Later, experts can classify the images based on which wavelengths of light were reflected.

The UC Davis Center for Spatial Technologies and Remote Sensing (CSTARS) has been mapping vegetation in the Delta using hyperspectral imagery collected with airplanes for most of the last 20 years (like the map in Figure 2). They create maps every year and share their data online via the Knowledge Network for Biocomplexity.

False-color map of Suisun Marsh showing different colors for different types of vegetation.
Figure 2. Hyperspectral image map of Suisun Marsh collected by CSTARS. Colors are used to represent different vegetation types (they aren't really that color).

Hyperspectral imagery is very good for identifying floating aquatic vegetation and terrestrial vegetation, but the water makes it hard to identify submersed vegetation. We can map where submersed vegetation is, but not what kind of vegetation is there. To look at community composition, we need to actually get out in the field and check on the weeds directly. To survey submersed weeds, researchers use a thatch rake (Figure 3) – an evil looking tool with sharp tines on one end and a long handle.

Image of a rake with a long pole and very jagged blades on the end.
Figure 3. A thatch rake being used to sample aquatic vegetation. Photo from the Department of Water Resources.

To measure weeds, researchers either lower the rake into the water and twist it around to pick up the weeds, or they drag the rake behind the boat. When they pull the rake in, they rank the coverage of weeds on the rake head and identify the weeds to species. Several different groups have been collecting these data over the past twenty years– the State Parks Division of Boating and Waterways (DBW), UC Davis (including CSTARS), SePRO Corporation, and the Department of Water Resources. Dr. Nick Rasmussen (DWR) recently integrated all these datasets into one data publication available on the Environmental Data Initiative. He developed the data set because he was helping to write a series of reports about the environmental impact of drought and drought-related management actions. Those reports required rounding up aquatic vegetation data quickly, but at the time, virtually none of it was readily available. He wanted to fix that.

It ended up being a fun challenge for Nick because he got to learn how to create a fully reproducible process for integrating the dataset – including some hard decisions about how best to combine data that was collected in very different ways. He did all the cleaning and formatting in R scripts that are made available in a public GitHub repository.

What do the data look like?

Well, because these data were collected by different programs for different purposes, there are pretty big differences in number of samples and distribution of samples over the years (Figure 4). So it’s a little difficult to detect trends.

Bar graph showing lots of samples taken from 2007-2010, no samples taken 2011-2013, and a lower number of samples taken 2014-2021. The Franks Tract Survey only occurred from 2014-2021.
Figure 4. Graph of number of submerged vegetation rake samples per year in the integrated vegetation dataset.

However, when we look at the relative abundance of different species that have shown up in the rake sample data (Figure 5), we can really see an expansion in Potamogeton richardsonii (Richardson’s pondweed, black bars in figure 5) and Najas guadalupensis (southern naiad, brown bars in figure 5) after 2014. It’s not clear whether these species, both native to California, were recent invaders in the Delta or whether early surveys didn’t know how to tell the difference between them and similar looking species such as Potamogeton crispus (curlyleaf pondweed, red bars in Figure 5).

Bar plot showing relatively consistent communities of vegetation over time, except for increases in Potamogeton richardsonii and Najas guadalupensis after 2014. The enlarged image shows the stacked bar plots for each species.
Figure5. Graph of relative abundance of each species across all rake sample surveys by year. No sampling occurred 2011-2013. Click for a version that shows separate plots for each species.

To make the story a little more complicated, these species aren’t found evenly throughout the Delta. Both Potamogeton richardsonii and Najas guadalupensis are found chiefly in Franks Tract (Figure 6) and not in any of the other regions of the Delta. SePRO and DBW conduct extensive sampling every year within Franks Tract (see Caudill et al. 2019 (PDF)), but not as high an intensity in other areas, so high abundance there throws off the Delta-wide data if it is not weighted by location.

Map with pie charts in each region showing relative abundance of vegetation across all years. Most pie charts show mostly Egeria and Ceratophyllum, but Franks Tract also has Potamogeton richardsonii and Najas guadalupensis. The enlarged image shows the stacked plots for each species.
Figure 6. Average relative abundance of different types of submerged vegetation by region of the Delta for the entire dataset. Click for a version that shows separate plots for each species.

We can also look at the hyperspectral maps to give us a record of total coverage of submerged weeds by year (Figure 7). We can see that total coverage of weeds really increased between 2008 and 2017, then remained about the same from 2017 to 2022.

Bar graph showing coverage of vegetation across all Delta waterways. Coverage is about 15% from 2007-2020, then jumps to 18-25% between 2017 and 2022.
Figure 7. Graph of percent of waterways in the Delta covered with submerged vegetation by year. Triangles indicate missing years.

However, it is often more interesting to look at a smaller area of the Delta and see how vegetation shifts from year to year (Figure 8, 9). For example, distribution of weeds in Franks Tract – a large, open-water area of the Delta – changed dramatically when a barrier was installed in West False River in 2015 and 2021-2022. The open-water area in the middle of the tract filled in during 2015, but the area on the eastern side of the tract started to clear out during 2021-2022 (see Hartman et al, 2022 (PDF) for more information).

Map showing the location of the Delta in the central valley of California and Franks Tract in the center of the Delta.
Figure 8. Map showing the location of Franks Tract - a large, open-water area with extensive vegetation and the site of many vegetation surveys by DBW and SePRO. Maps of vegetation in Franks Tract through the years are in Figure 9, below.

Maps of Franks Tract for 2004-2008 and 2014-2022 showing shifts in vegetation over time.
Figure 9. Hyperspectral image of Franks Tract showing installations of a barrier in West False River during 2015, 2021, and 2022, and resulting changes in distribution of submerged aquatic vegetation. Click to enlarge.

Between all these surveys, we’ve collected a lot of data on weeds, but we haven’t done an extensive analysis. There are lots more questions waiting to be asked! How does the distribution of weeds change with floods and droughts? Which species of weeds grow in shallow versus deep water? Are any species expanding in range? Download the dataset yourself and take a look!

Further reading

Categories: General, Underappreciated data
  • June 15, 2023

By Shruti Khanna and Rosemary Hartman

CSTARS field crew member sampling submerged aquatic vegetation using a thatching rake in the southern edge of Little Frank's Tract. Azolla and water primrose in the background, Egeria densa on the rake.
Figure 1. The hard-working staff of the UC Davis Center for Spatial Technologies and Remote Sensing survey water weeds using large thatch rakes. Image credit: UC Davis.

If you’ve ever tried to take a boat across one of the large, flooded islands in the Delta, there is a good chance you’ve gotten your propeller snagged on water weeds. Submersed aquatic vegetation (which is the fancy, scientific term for ‘water weeds’) have been getting worse and worse in recent years, with invasive species taking over areas that were previously open water. Most of these weeds are introduced species from South America, but even some of our native species have been expanding rapidly. Weeds were the subject of several papers in the most recent State of Bay Delta Science, they were the subject of a recent paper synthesizing many years of herbicide data to look at weed control effectiveness at a landscape scale, and data on aquatic weeds in the Delta has been published in several different datasets. Classification maps are now available for all aquatic plants in the Delta for most of 2004-2022 (with 2023 in the works) and there is a new, integrated dataset made up of vegetation samples from four different programs. Together, these data and publications are increasing our understanding of where weeds are a problem and what to do about it.

The recent paper “Multi-year landscape-scale efficacy of fluridone treatment of invasive submerged aquatic vegetation in the Sacramento San Joaquin Delta”, published earlier this year in Biological Invasions, was an excellent example of how many datasets could be integrated and used to answer a major management question. Water weeds have been managed with herbicides by California State Parks Division of Boating and Waterways for years, but its always seemed like a Sisyphean affair – constant use of herbicides and mechanical removal while weeds just keep growing back (for more info, see recent special issue of the Journal of Aquatic Plant Management). This gave lead author Dr. Shruti Khanna the idea that someone should look at whether the treatments were working. So she enlisted the help of Dr. Jereme Gaeta – statistician extraordinaire, Dr. Edward Gross – an expert in hydrodynamic modeling, and Dr. Louise Conrad – who could provide both scientific and policy advice.

For Shruti, the best part was working with the other members of the team: “Honestly, what I enjoyed most was working with Jereme on this project. He brought methods of analysis to the table that I couldn't have dreamed of when I was doing my PhD. I just didn't have the know-how. So, it was a very complementary effort - my remote sensing and geospatial skills to extract spatial information and integrate it with diverse other datasets, his statistical skills to create a nuanced GAM model to explore the data, and Louise's expert knowledge on the treatment program, its history, and management perspective made the publication a lot stronger. It was Jereme's idea to add Ed's hydrodynamic modeling of water speed to our analysis. Incorporating speed gave a whole new perspective to our study and turned out to be critical to understanding why treatment is sometimes not effective."

They gathered data gathered in the field on when and where weeds were treated, annual maps of weed distribution collected with hyperspectral imagery, and predicted current speed based on hydrodynamic models. The final integrated dataset was made of five diverse datasets with field data tables, rasters, vectors, and model output. They then created a statistical model to see whether multiple years of herbicide treatment resulted in lower probability of weeds. They found that when high levels of herbicide were applied there was a lower probability of weeds being there, but only at low current speeds (See figure 2). If the water was moving quickly, all the herbicide would be washed away and there was no effect of treatment (See figure 3).

Graph showing how the probability of vegetation presence decreases with increasing current speed. However, effectiveness of fluridone treatment also decreases with increasing current speed. Figure 2. Graph of model results from Khanna et al. 2023 showing probability of vegetation presence versus current speed. At low current speeds, fluridone decreased the probability of vegetation being present. But at high current speeds, there was no effect of fluridone. Graph reproduced from Khanna et al. 2023, with permission of the authors.

In a complicated, tidal system like the Delta, it is extremely difficult to control weeds when the herbicides get washed away by the tides more often than not. New tools are currently under investigation by members of the Delta Region Areawide Aquatic Weed Project, including the Division of Boating and Waterways, US Department of Agriculture, UC Davis, and others to get a better handle on this sticky problem.

Conceptual diagram showing how fluridone pellets sit on the bottom of the water. At low current speeds, water sloshes back and forth by the fluridone stays in place. At high current speeds, the fluridone washes away. Figure 3. Diagram of what is going on at herbicide treatment spots in the Delta. Fluridone comes in pellets which slowly release into the water column. When current speeds are low, water ‘sloshes’ back and forth a bit, but fluridone concentration remains high and weeds are killed. When current speeds are high, all the fluridone washes away before it can be effective. Image credit: Shruti Khanna, CDFW.

References

Categories: General