|
|
||||||||
a Center for Nonlinear Studies, Los Alamos National Lab., Los Alamos, NM 87545
b Earth and Environmental Sciences Division, Los Alamos National Lab., Los Alamos, NM 87545
c Lincoln Environmental Research, Ruakura Research Center, Hamilton, New Zealand
d Civilian Nuclear Program Office, Los Alamos National Lab., Los Alamos, NM 87545
* Corresponding author (vrugt{at}lanl.gov).
All rights reserved. No part of this periodical may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher.
Received 23 April 2007.
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: CPU, central processing unit EnKF, ensemble Kalman filter GDPM, generalized dual-porosity model KF, Kalman filter LANL, Los Alamos National Laboratory MDA, Material Disposal Area MPI, message passing interface MVG, Mualem–van Genuchten PFBA, pentafluorobenzoate RTD, residence time distribution SLS, simple least squares SVE, soil vapor extraction VOC, volatile organic compound
| INTRODUCTION |
|---|
|
|
|---|
Among the first to suggest the application of computer models to estimate soil hydraulic parameters were Whisler and Whatson (1968), who reported on estimating the saturated hydraulic conductivity of a draining soil by matching observed and simulated drainage. This process of iteratively adjusting the model parameters so that the model approximates, as closely and consistently as possible, the observed response of the system under study during some historical period of time is called inverse modeling. Parameter estimation using inverse modeling accommodates more flexible experimental conditions than typically utilized in laboratory experiments and facilitates estimating values of the hydraulic properties that pertain to the scale of interest, and thus is useful for upscaling. When adopting an inverse modeling approach, however, the soil hydraulic properties can no longer be estimated by direct inversion using closed form analytical equations but are determined using an iterative solution, thereby placing a heavy demand on computational resources.
In the field of vadose zone hydrology, inverse modeling usually involves the estimation of the soil water retention and unsaturated soil hydraulic conductivity characteristics using repeated numerical solutions of the governing Richards equation:
![]() | [1] |
We will briefly review the current state of the art of inverse modeling for estimating unsaturated flow and transport processes. We will explain how the inverse method works, discuss the historical background that led to the current perspectives on inverse modeling, and review the solution algorithms used to solve the parameter estimation problem. We will then highlight and illustrate some of our recent work at Los Alamos on self-adaptive multimethod global optimization, parallel computing, and sequential data assimilation to improve efficiency of parameter estimation in complex flow and transport applications and help assess parameter and model output prediction uncertainty.
Next, we present and discuss three different case studies that illustrate these various summarized developments. The first case study considers the calibration of a fully integrated three-dimensional vapor extraction model using observed data from a volatile organic compound in the subsurface at the Material Disposal Area near the Los Alamos National Laboratory (LANL). The second case study considers a multicriteria calibration of the Mualem–van Genuchten (MVG) parameters in the HYDRUS-1D model using observed tensiometric data from the Spydia field site in New Zealand. The last case study describes the application of a combined parameter and state estimation method to the interpretation of a multitracer experiment conducted at the Yucca Mountain field site in Nevada. This method, entitled Simultaneous Optimization and Data Assimilation (SODA) improves the treatment of input, model structural, and output error by combining the strengths of stochastic global optimization and recursive state updating using an ensemble Kalman filter.
| Inverse Modeling: A Review |
|---|
|
|
|---|
im
nek (1999) and Hopmans et al. (2002). We suggest reading this material to get a more complete overview and background.
Mathematical Development
A schematic overview of the inverse modeling procedure appears in Fig. 1
. Consider a model
in which the discrete time evolution of the state vector
is described by
![]() | [2] |
represents the observed forcing (e.g., boundary conditions), β is the vector of parameter values, and t denotes time. In the current context,
represents Richards' equation augmented with parametric forms of the soil water retention and unsaturated hydraulic conductivity functions, whose exact shapes are defined by the values in β. Assume that realistic upper and lower bounds on each of the p model parameters, β = {β1, ..., βp} can be specified a priori, thereby defining the feasible space of solutions:
![]() | [3] |
p, the problem is said to be constrained.
|
={
1,...,
n}, where n denotes the total number of observations. After this, a simulation is performed with F given some initial guess for b (usually based on some prior information), and the vector of model predictions Y(β)={y1(β),...,yn(β)} is computed. These output predictions are directly related to the model state according to
![]() | [4] |
![]() | [5] |
The aim of parameter estimation or model calibration now becomes finding those values of β such that the measure E is in some sense forced to be as close to zero as possible. The formulation of a criterion that mathematically measures the size of E(β) is typically based on assumptions regarding the distribution of the measurement errors present in the observational data. The classical approach to estimating the parameters in Eq. [4] is to ignore input data uncertainty and to assume that the predictive model
is a correct representation of the underlying physical data-generating system. In line with classical statistical estimation theory, the residuals in Eq. [5] are then assumed to be mutually independent (uncorrelated) and normally distributed with a constant variance. Under these circumstances, the "best" parameter combination can be found by minimizing the following simple least square (SLS) objective function with respect to β:
![]() | [6] |
![]() | [7] |
T denotes the error deviation of the measurements, and Q( ) is the
2 cumulative distribution with (n – p) degrees of freedom. This adequacy test gives us a measure of how well the optimized model fits the observations relative to their measurement precision. Using the definition in Eq. [7], models are adequate if padeq is >0.5.
Historical Background
During the last few decades, a great deal of research has been devoted to exploring the applicability and suitability of the inverse approach for the identification of soil hydraulic parameters. Most of this work has focused on laboratory experiments using small soil cores with well-defined boundary conditions to fully understand the prospects and limitations of the method and facilitate easy benchmarking against hydraulic properties derived from static or steady-state methods on similar soil samples. For our discussion, we group the early work on the use of inverse methods in vadose zone hydrology into five different categories, which we will discuss here in sequential order:
im
nek and van Genuchten (1996, 1997), Inoue et al. (1998),
im
nek et al. (1998b), Romano and Santini (1999), Durner et al. (1999), Wildenschild et al. (2001), Si and Bodhinayake (2005), and Zeleke and Si (2005). Various investigations have demonstrated the usefulness of the evaporation method, multistep outflow (MSO), ponded infiltration, tension infiltrometer, multistep soil water extraction, cone penetrometer, and falling head infiltration method for inverse estimation of the hydraulic parameters. These methods typically enable collection of a sufficient experimental range of data to at least be able to estimate some of the hydraulic parameters with good accuracy.
(h) data (Eching and Hopmans, 1993). Additionally, in the case of outflow experiments, van Dam et al. (1994) has shown that more reliable parameter estimates can be obtained by incrementally increasing the pneumatic pressure in several steps instead of using a single pressure increment throughout the entire experiment. In Vrugt et al. (2001c, 2002), a global sensitivity analysis (PIMLI) was presented to show that the information content for the various soil hydraulic parameters was separated sequentially with time during a MSO experiment, and that this information could be used to help improve the identification of the global minimum in the search space.
im
nek et al., 2003, and the many references therein). While initial applications of inverse modeling have focused primarily on the estimation of the unsaturated hydraulic properties from small soil cores, recent progress is also reported in the description of water uptake by plant roots (Vrugt et al., 2001a,b; Hupet et al., 2002), estimation of soil thermal properties (Hopmans et al., 2001), measurement of water content using time domain reflectometry (Huisman et al., 2004; Heimovaara et al., 2004), prediction of pore geometry based on air permeability measurements (Unsal et al., 2005), simultaneous estimation of soil hydraulic and solute transport parameters (Inoue et al., 2000), optimal allocation of electrical resistivity tomography electrodes to maximize measurement quality (Furman et al., 2004), and automated water content reconstruction of zero-offset borehole ground penetrating radar data (Rucker and Ferre, 2005). Moreover, with the ever-increasing pace of computational power and the inability of small-core measurements to adequately characterize larger scale flow and transport properties, application scales of inverse modeling have significantly increased in recent years, from the soil core to the field plot and regional scale, to provide solutions to emerging environmental problems such as salinization and groundwater pollution (Neupaier et al., 2000; Vrugt et al., 2004; Schoups et al., 2005a,b).
Among the first to apply the inverse modeling approach to a field situation were Dane and Hruska (1983), who estimated soil physical parameters using transient drainage data. The application of inverse modeling to estimate soil hydraulic properties across spatial scales is very promising, yielding parameters that represent effective conceptual representations of spatially and temporally heterogeneous watershed properties at the scale of interest. This approach is particularly powerful because it overcomes many of the difficulties associated with conventional upscaling approaches. With few exceptions, however, larger scale models are typically based on complex multidimensional governing equations, requiring significant computational resources for simulation and efficient optimization algorithms for model calibration. Moreover, unlike in small-scale experiments, boundary and initial conditions at the larger spatial scale are not as well defined because direct measurement techniques are mostly not available. Furthermore, the observational data available to characterize large-scale vadose zone processes are sparse, both in space and time. This requires the use of inverse algorithms that are efficient and can derive meaningful uncertainty estimates on the model parameters and associated model predictions. Note that inverse modeling is a powerful method to derive effective values of the soil hydraulic parameters at various spatial scales and circumvents many of the problems associated with conventional upscaling methods. For an excellent review of upscaling approaches for hydraulic properties and soil water flow, see Vereecken et al. (2007).
Parameter Estimation
For linear models, simple analytical solutions exist that conveniently minimize Eq. [6] and derive the optimal value for β at relatively low computational cost. Unfortunately, for most applications in subsurface flow and transport modeling (and many other inverse problems in hydrology), the optimal value of β can no longer be estimated by direct inversion and needs to be estimated by an iterative process. During this process, the parameters are iteratively adjusted so that the FSLS objective function is minimized and the model approximates, as closely and consistently as possible, the observed response of the system under study. Hence, this procedure is a critical component of the inverse modeling process, as the final optimized hydraulic properties might be susceptible to significant error if a wrong search procedure is used.
Because of the subjectivity and time-consuming nature of manual trial-and-error parameter estimation, there has been a great deal of research into the development of automatic methods for model calibration. Automatic methods seek to take advantage of the speed and power of computers, while being objective and easier to implement than manual methods. These algorithms may be classified as local search methodologies, when seeking for systematic improvement of the objective function using an iterative search starting from a single arbitrary initial point in the parameter space, or as global search methods in which multiple concurrent searches from different starting points are conducted within the parameter space.
One of the simplest local-search optimization methods, which is commonly used in the field of soil hydrology, is a Gauss–Newton type of derivative-based search (Marquardt, 1963; Zijlstra and Dane, 1996):
![]() | [8] |
f(βk) and H(βk) denote the gradient and Hessian matrix, respectively, evaluated at β = βk. From an initial guess of the parameters β0, a sequence of parameter sets, {β1, β2, βk+1}, is generated that is intended to converge to the global minimum of E(β) in the parameter space. Doherty (2004) and Clausnitzer and Hopmans (1995) presented general-purpose optimization software that implement this Gauss–Newton or Levenberg–Marquardt (Marquardt, 1963) type of search strategy. The derivative-based search method defined in Eq. [8] will evolve toward the true optimum in the search space in situations where the objective function exhibits a convex response surface in the entire parameter domain. Unfortunately, numerous contributions to the hydrologic literature have demonstrated that the response surface seldom satisfies these conditions, but instead exhibits multiple optima in the parameter space with both small and large domains of attraction, discontinuous first derivatives, and curved multidimensional ridges. Local gradient-based search algorithms are not designed to handle these peculiarities, and therefore they often prematurely terminate their search with their final solution essentially being dependent on the starting point in the parameter space. Another emerging problem is that many of the hydraulic parameters typically demonstrate significant interaction because of an inability of the observed experimental data to properly constrain all of the calibration parameters. This further lowers the chance of finding a single unique solution with local search methodologies.
These considerations inspired researchers in many fields of study to develop robust global optimization methods that use multiple concurrent searches from different starting points to reduce the chance of getting stuck in a single basin of attraction. Global optimization methods that have been used for the estimation of the unsaturated soil hydraulic properties include the annealing–simplex method (Pan and Wu, 1998), genetic algorithms (Takeshita, 1999; Vrugt et al., 2001c), multilevel grid sampling strategies (Abbaspour et al., 2001; Lambot et al., 2002), ant-colony optimization (Abbaspour et al., 2001), and shuffled complex methods (Vrugt and Bouten, 2002; Vrugt et al., 2003c; Mertens et al., 2005, 2006).
| Recent Advances in Inverse Modeling |
|---|
|
|
|---|
Multimethod Global Optimization
The current generation of optimization algorithms typically implements a single operator for population evolution. Reliance on a single model of natural selection and adaptation presumes that a single method exists that can efficiently evolve a population of potential solutions through the parameter space and work well for a large range of problems. Existing theory and numerical benchmark experiments have demonstrated, however, that it is impossible to develop a single universal algorithm for population evolution that is always efficient for a diverse set of optimization problems (Wolpert and Macready, 1999). This is because the nature of the fitness landscape (objective function mapped out as function of β) often varies considerably between different optimization problems and often dynamically changes en route to the global optimal solution. It therefore seems productive to develop a search strategy that adaptively updates the way it generates offspring based on the local peculiarities of the response surface.
In light of these considerations, Vrugt and Robinson (2007a) and Vrugt et al. (2008) recently introduced a new concept of self-adaptive multimethod evolutionary search. This approach, entitled A MultiAlgorithm Genetically Adaptive Method (AMALGAM), runs a diverse set of optimization algorithms simultaneously for population evolution and adaptively favors individual algorithms that exhibit the highest reproductive success during the search. By adaptively changing preference to individual algorithms during the course of the optimization, AMALGAM has the ability to quickly adapt to the specific peculiarities and difficulties of the optimization problem at hand. Synthetic single- and multiobjective benchmark studies covering a diverse set of problem features, including multimodality, ruggedness, ill conditioning, nonseparability, interdependence (rotation), and high dimensionality, have demonstrated that AMALGAM significantly improves the efficiency of evolutionary search (Vrugt and Robinson, 2007a; Vrugt et al., 2008). An additional advantage of self-adaptive search is that the need for algorithmic parameter tuning is reduced, increasing the applicability to solving search and optimization problems in many different fields of study. An extensive algorithmic description and outline of AMALGAM, including comparison against other state-of-the-art single- and multiobjective optimization methods can be found in Vrugt and Robinson (2007a) and Vrugt et al. (2008). In our case studies, we illustrate the application of AMALGAM to inverse modeling of subsurface flow and transport properties. Before doing so, we first highlight recent advances in parallel computing to facilitate an efficient solution to the inverse problem for complex subsurface flow and transport models requiring significant computational time.
Parallel Computing Using Distributed Networks
The traditional implementation and application of many local and global optimization methods involves sequential execution of the algorithm using the computational power of a single central processing unit (CPU). Such an implementation works acceptably well for relatively simple optimization problems and those optimization problems with models that do not require much computational time to execute. For high-dimensional optimization problems involving complex spatially distributed models, such as are frequently used in the field of earth science, however, this sequential implementation needs to be revisited. Most computational time required for calibrating parameters in subsurface flow and transport models is spent running the model code and generating the desired output. Thus, there should be large computational efficiency gains from parallelizing the algorithm so that independent model simulations are run on different nodes in a distributed computer system.
Distributed computers have the potential to provide an enormous computational resource for solving complex environmental problems, and there is active research in this area to take better advantage of parallel computing resources. For example, in hydrology applications, parallel computing is being exploited to improve computational efficiency of individual, large-scale groundwater flow (Wu et al., 2002) and reactive transport (Hammond et al., 2005) models. Efforts to couple hydrologic models consisting of a network of individual submodels (groundwater, surface water, and atmospheric models) also are being designed in a way that submodels can be partitioned to different processors (Winter et al., 2004). Finally, parallel versions of model inversion and sensitivity analysis software such as PEST (Doherty, 2004) or the Shuffled Complex Evolution Metropolis (SCEM-UA) optimization algorithm (Vrugt et al., 2006b) have been developed.
The parallel implementation of AMALGAM is presented in Fig. 2 . In short, the master computer runs the algorithmic part of AMALGAM and generates an offspring population from the parent population using various genetic operators. This new population is distributed across a predefined number of computational nodes. These nodes (also referred to as slaves) execute the simulation model and compute the objective function of the points received. After this, the master computer collects the results and follows the various algorithmic steps to generate the next generation of points. This iterative process continues until convergence has been achieved. Various case studies presented in Vrugt et al. (2006b) have demonstrated that this setup results in an almost linear increase in speed for more complex simulation models, suggesting that the communication time between master and nodes is typically small compared with the time needed to run the model.
|
Bayesian and Multiobjective Inverse Modeling
Classical parameter optimization algorithms focus on the identification of a single best parameter combination, without recourse to uncertainty estimation. This seems unjustified given the presence of input (boundary conditions), output (calibration data), and model structural error (our model is only an approximation of reality) in most of our modeling efforts. Hence the assumptions underlying the classical approach to parameter estimation need to be revisited.
One response to directly confront the problem of overconditioning is to abandon the search for a single "best" parameter combination and adopt a Bayesian viewpoint, which allows the identification of a distribution of model parameters. The Bayesian approach treats the model parameters in Eq. [6] as probabilistic variables having a joint posterior probability density function, which summarizes our belief about the parameters β in light of the observed data
. An example of this approach is the SCEM-UA optimization algorithm for simultaneously estimating the traditional "best" parameter set and its underlying probability distribution within a single optimization run. Detailed investigations of the SCEM-UA-derived posterior mean, standard deviation, and Pearson correlation coefficients between the samples in the high probability density region of the parameter space facilitates the selection of an adequate model structure (Vrugt et al., 2003a), helps to assess how much complexity is warranted by the available calibration data (Vrugt et al., 2003b), and guides the development of optimal experimental design strategies (Vrugt et al., 2002). Applications of the SCEM-UA algorithms to inverse modeling of subsurface flow and transport properties are presented in Vrugt et al. (2003a, 2004), Vrugt and Dane (2005), and Schoups et al. (2005a,b).
Another response, highlighted here and illustrated in the second case study, is to pose the optimization problem in a multiobjective context (Gupta et al., 1998; Neuman, 1973). By simultaneously using a number of complementary criteria in the optimization procedure and analyzing the tradeoffs in the fitting of these criteria, the modeler is able to better understand the limitations of current model structures and gain insights into possible model improvements. As a commonplace illustration, consider the development of a personal investment strategy that simultaneously considers the objectives of high rate of return and low volatility. For this situation, there is no single optimal solution. Rather, there is a family of tradeoff solutions along a curve called the "Pareto-optimal front" in which improvement in one objective (say, high rate of return) comes only at the expense of a degradation of another objective (volatility). Development of robust algorithms to solve such optimization problems is a research direction that is currently attracting great interest.
The multiobjective AMALGAM method discussed above uses the Nondominated Sorted Genetic Algorithm (NSGA-II, Deb et al., 2002), Particle Swarm Optimizer (Kennedy et al., 2001), Adaptive Metropolis Search (Haario et al., 2001), and Different Evolution (Storn and Price, 1997) for population evolution and is significantly more robust and efficient than current Pareto optimization methods, with improvement approaching a factor of 10 for more complex, higher dimensional problems. We will illustrate the multiobjective version of AMALGAM in Case Study 2 below.
Improved Treatment of Uncertainty: Sequential Data Assimilation
Considerable progress has been made in the development and application of automated calibration methods for estimating the unknown parameters during inverse modeling. Nevertheless, major weaknesses of these calibration methods include their underlying treatment of the uncertainty as being primarily attributable to the model parameters, without explicit treatment of input, output, and model structural uncertainties. In subsurface flow and transport modeling, this assumption can be easily contested. Hence, uncertainties in the modeling procedure stem not only from uncertainties in the parameter estimates, but also from measurement errors associated with the system input and outputs, and from model structural errors arising from the aggregation of spatially distributed real-world processes into a relatively simple mathematical model. Not properly accounting for these errors results in error residuals that exhibit considerable variation in bias (nonstationarity), variance (heteroscedasticity), and correlation structures under different hydrologic conditions. If our goal is to derive meaningful parameter estimates that mimic the intrinsic properties of our underlying transport system, a more robust approach to the optimization problem is required. A few studies have discussed the treatment of input, state, and model structural uncertainties for subsurface flow and transport modeling (Valstar et al., 2004; McLaughlin and Townley, 1996; Katul et al., 1993; Parlange et al., 1993), but such approaches have not become common practice in current inverse modeling practices.
In the past few years, ensemble-forecasting techniques based on sequential data assimilation methods have become increasingly popular due to their potential ability to explicitly handle the various sources of uncertainty in geophysical modeling. Techniques based on the ensemble Kalman filter (EnKF, Evensen, 1994) have been suggested as having the power and flexibility required for data assimilation using nonlinear models. In particular, Vrugt et al. (2005a) recently presented the Simultaneous Optimization and Data Assimilation (SODA) method, which uses the EnKF to recursively update model states while estimating time-invariant values for the model parameters using the SCEM-UA (Vrugt et al., 2003d) optimization algorithm. A novel feature of SODA is its explicit treatment of errors due to parameter uncertainty, uncertainty in the initialization and propagation of state variables, model structural error, and output measurement errors. The development below closely follows that of Vrugt et al. (2005a), after which the application of this method to multitracer experiments is described.
To help facilitate the description of the classical Kalman filter (KF), we start by writing the model dynamics in Eq. [2] as a stochastic equation:
![]() | [9] |
t, called the measurement error:
![]() | [10] |
0 denotes the error deviation of the observations, and
t* denotes the true model states at time t. At each measurement time, when an output observation becomes available, the output forecast error zt is computed:
![]() | [11] |
tf, are updated using the standard KF analysis equation:
![]() | [12] |
tu is the updated or analyzed state and Kt denotes the Kalman gain. The size of the gain depends directly on the size of the measurement and model error.
The analyzed state then recursively feeds the next state propagation step in the model:
![]() | [13] |
The virtue of the KF method is that it offers a very general framework for segregating and quantifying the effects of input, output, and model structural error in flow and transport modeling. Specifically, uncertainty in the model formulation and observational data are specified through the stochastic forcing terms q and
, whereas errors in the input data are quantified by stochastically perturbing the elements of X.
The SODA method is an extension of traditional techniques in that it uses the EnKF to solve Eq. [9–13]![]()
![]()
![]()
. The EnKF uses a Monte Carlo method to generate an ensemble of model trajectories from which the time evolution of the probability density of the model states and related error covariances are estimated (Evensen, 1994). The EnKF avoids many of the problems associated with the traditional extended Kalman Filter (EKF) method. For example, there is no closure problem, as is introduced in the EKF by neglecting contributions from higher order statistical moments in the error covariance evolution. Moreover, the conceptual simplicity, relative ease of implementation, and computational efficiency of the EnKF make the method an attractive option for data assimilation in the meteorologic, oceanographic, and hydrologic sciences.
In summary, the EnKF propagates an ensemble of state vector trajectories in parallel, such that each trajectory represents one realization of generated model replicates. When an output measurement is available, each forecasted ensemble state vector
tf is updated by means of a linear updating rule in a manner analogous to the Kalman filter. A detailed description of the EnKF method, including the algorithmic details, is found in Evensen (1994) and so will not be repeated here.
| Case Studies |
|---|
|
|
|---|
Case Study 1: Global Optimization of a Three-Dimensional Soil Vapor Extraction Model
Thousands of sites across the United States are contaminated with volatile organic compounds (VOCs) including trichloroethane, trichloroethene, and tetrachloroethene. These industrial solvents have high vapor pressure and low solubility and generally migrate faster in the vapor phase than in the liquid phase (Jury et al., 1990). Volatile organic compound plumes in the vadose zone can grow rapidly and are more likely to spread laterally because vapor diffusion is typically four orders of magnitude greater than liquid diffusion (Fetter, 1999). Remediation of vadose zone VOC plumes is required to protect human health, the environment, and deeper groundwater resources. The VOC plumes in deep vadose zones rely on remediation techniques that differ substantially from the techniques used to remediate plumes in the saturated zone.
One of the primary remediation techniques currently used on VOCs in the vadose zone is soil vapor extraction (SVE). Soil vapor extraction is appealing because of the relatively low costs associated with installation and operation, the effectiveness of remediation, and its widespread use at contaminated sites (Lehr, 2004). This technique uses an applied vacuum to draw pore gas toward an extraction hole. In a contaminated area, the extracted soil gas will contain some fraction of the VOC in addition to air, water vapor, and CO2. As the VOC is removed from the subsurface pore gas, any dissolved, adsorbed, or liquid-phase VOC will tend to move into the vapor phase, thus reducing the total VOC plume (Hoeg et al., 2004). Depending on the off-gas concentrations and local regulations, the gas stream may need to be treated by methods such as C adsorption or burning with a catalytic agent.
The focus of the first case study is a VOC plume located in the vadose zone surrounding the primary liquid material disposal area (MDA L) at LANL. This area is located on Mesita del Buey, a narrow finger mesa situated near the eastern edge of the Pajarito Plateau (Broxton and Vaniman, 2005; McLin et al., 2005). During operations, liquid chemical waste was emplaced in 20-m-deep shafts on the mesa top. The VOCs from the shafts have subsequently leaked into the subsurface and created a vadose zone plume that extends laterally beyond the boundaries of the site and vertically to depths of more than 70 m below the ground surface. The vadose zone at this site is quite deep and SVE is being investigated as a possible corrective measure to remediate the plume. The initial investigation consisted of short-duration (
22 d) SVE tests on two extraction holes (indicated as SVE West and East) that were designed to collect a range of data including pressure responses and concentration measurements from both the extraction holes and surrounding monitoring holes. A detailed description of the VOC, pressure response data, and SVE numerical model appears in Stauffer et al. (2005, 2007a,b). Here, we focus on the calibration of a fully integrated three-dimensional SVE model using observed VOC extraction concentrations during the initial SVE test. Note that this case study of gas-phase transport is different than the typical applications of inverse modeling of soil hydraulic properties discussed above. Moreover, our numerical grid is made up of soil and rock mass, representing the geologic setting at LANL. Below, we shortly summarize the details relevant to our calibration.
Geologic Setting
The Pajarito plateau is located on the eastern flank of the Jemez volcanic center, and the rocks that form the plateau were created in two main ignimbrite eruptions that occurred at approximately 1.61 and 1.22 million yr (Izett and Obradovich, 1994). The plateau has been incised by canyons that drain into the Rio Grande. The regional aquifer is located approximately 300 m below the surface of MDA L, and no perched water was encountered during drilling at the site.
Figure 3 gives the general geography for MDA L and the surrounding area, and Fig. 4 shows the approximate site stratigraphy on a north–south cross-section of the mesa with several of the boreholes from Fig. 3 projected onto the plane. The most important rocks with respect to the current study are the ash-flow Units 1 and 2 of the Tshirege member of the Bandelier Tuff, Qbt1 and Qbt2, respectively. The uppermost unit at the site, Qbt2, is relative welded with ubiquitous vertical cooling joints. Many of the joints in this unit are filled with clay in the upper few meters, but are either open at depth or filled with powdered tuff (Neeper and Gilkeson, 1996). Unit Qbt1 is subdivided into upper and lower units, Qbt1-v and Qbt1-g, respectively; Qbt1-v is further subdivided into a nonwelded upper unit and a welded lower unit, Qbt-1vu and Qbt-1vc, respectively. Subunit Qbt-1vu is less welded than Qbt2 and has fewer cooling joints. Subunit Qbt-1vc is a welded tuff with many open vertical joints that provide rapid equilibration of pressure changes during pump tests (Neeper, 2002). Unit Qbt1-g is a nonwelded, glass-bearing ash flow that contains a few joints that appear to be continuations of joints formed in Qbt1-v that die out at depth. For a more complete description of the geologic framework of the Pajarito plateau, see Broxton and Vaniman (2005).
|
|
The primary assumptions governing vapor-phase flow and transport are as follows. First, we assume that the vapor phase is composed solely of air that obeys the ideal gas law and calculate vapor-phase density (kg m–3) as a function of vapor pressure and temperature. Density differences due to spatial variations in VOC concentrations hardly affected gaseous movement. For example, at the maximum plume concentration of 3000 µm3 m–3, the pore gas changes in density from about 1 to 1.01 kg m–3. We use Darcy's law to calculate the advective volume flux. Vapor-phase contaminant conservation is governed by the advection–dispersion equation (Fetter, 1999) where the contaminant flux (mol m–2 s–1) is given by
![]() | [14] |
denotes the porosity, Sv is vapor saturation defined as air-filled porosity divided by total porosity, Cv is the molar concentration (mol m–3) and the dispersion coefficient, Dcv, includes contributions from both dispersivity and molecular diffusion as
![]() | [15] |
) as
![]() | [16] |
i) is directional; however, in FEHM we keep only the diagonal terms of this tensor. The superscript i implies that the equation is solved for the principle directions. For example, in three dimensions, the volume flux at any point can be decomposed into three principle components, vx, vy, and vz.. An additional constraint is imposed by Henry's Law equilibrium partitioning, which requires a constant ratio between concentrations in the liquid and vapor phase as
![]() | [17] |
|
|
Calibration Details: Parameters and Initial and Boundary Conditions
In situ measured VOC concentrations at the extraction and surrounding bore holes were used to initialize the respective concentrations in the grid domain. A snapshot of the initial VOC concentration at the onset of the SVE pilot tests is presented in Fig. 5. The extraction flow rate during the test was assumed to be constant and calculated from an equation provided by the manufacturer of the orifice plate used to measure the pressure drop across a slight decrease in the diameter of the extraction pipe. We further simplified the modeling analysis by assuming no movement of the liquid phase. Furthermore, the atmospheric boundary pressure was held constant at 80 kPa, and the temperature was fixed to the yearly average of about 10°C (derived from local stations). The vertical side boundaries of the domain were no flow with respect to both heat and mass, and no flow of water or vapor was permitted across the bottom boundary, with its temperature being held constant at 25°C (Griggs, 1964). Finally, the porosity and saturation fraction of the different geologic units was fixed to measured values presented in Birdsell et al. (2002) and Springer (2005). Justification for all these assumptions in the context of the Pajarito Plateau was given in Stauffer et al. (2005).
Observed VOC concentrations in the vapor phase at the wellhead of the west and east extraction holes during the initial 22-d pilot experiment were used for SVE model calibration. Because of the significant spatial variability in permeability observed from straddle packer tests, the west and east pilot tests were calibrated separately. As calibration parameters, we selected the permeability of the rocks of the upper four geologic units depicted in Fig. 4. Initial sensitivity analysis demonstrated that deeper geologic units had marginal impact on the simulation results. To simulate the effects of vertical cracks, separate values for the horizontal and vertical permeability were optimized for each rock type. Moreover, we also optimized the permeability of asphalt because this material acts as an important diffusive barrier over almost the entire grid surface. Table 1 provides an overview of the calibration parameters and their initial uncertainty ranges. These ranges were based on straddle packer data and core permeability measurements presented in Neeper (2002) and Stauffer et al. (2007a,b).
|
|
The optimized permeabilities for the west and east pilot tests are in good agreement, generally within an order of magnitude difference. The optimized permeabilities appear reasonable and demonstrate the appropriate variability as observed in the field (Neeper, 2002; Stauffer et al., 2007a,b). Moreover, their optimized values are typically within the middle of the prior uncertainty ranges, only approaching the outer bounds for a few parameters. We hypothesize that more realistic values for these particular parameters will be obtained when fracture flow is explicitly included in the SVE model. It seems unjustified, however, to overcondition the calibration process to a single "best" parameter combination as done here, given the presence of systematic errors in our model predictions so apparent during the first 2 d for both pilot tests. In the next two case studies, therefore, we will provide a better treatment of parameter, model, and output uncertainty.
Case Study 2: Multiobjective Calibration of Soil Hydraulic Properties
We conducted a multiobjective calibration of the unsaturated soil hydraulic properties using observed tensiometric pressure data from three different depth intervals at the Spydia field site in the Lake Taupo catchment in New Zealand. The vadose zone materials at this site consist of a loamy sand to sand material with gravel fractions up to 34%. The soil encompasses a young volcanic soil (0–1.6 m depth) on top, followed by the unwelded Taupo Ignimbrite (1.6–4.4 m) and two older Paleosols (4.5–5.8 m) having a late Pleistocene to Holocene age. Their parent material was inferred to be weathered tephra or tephric loess. A detailed physical and textural analysis of the vadose zone at the experimental site is given in Wöhling et al. (2008) and so will not be repeated here.
Altogether, 15 tensiometer probes (Type UMS T4e, UMS Umweltanalytische Mess-Systeme GmbH, Munich) were installed at five depths (0.4, 1.0, 2.6, 4.2, and 5.1 m below the soil surface) using three different replicates at each depth. The pressure head was recorded at 15-min intervals using a compact field point controller (cFP2010, National Instruments, Austin, TX). In addition, precipitation was recorded by event using a 0.2-mm bucket gauge and upscaled to hourly values for use in our calculations. Daily values of potential evapotranspiration were calculated with the Penman–Monteith equation (Allen et al., 1998) using observed meteorological data from the nearby Waihora weather station. For our inverse modeling exercise, we used the HYDRUS-1D model (
im
nek et al., 1998a). This model simulates water flow in variably saturated porous media and uses the Galerkin finite element method based on the mass conservative iterative scheme proposed by Celia et al. (1990). An extensive description of this model appears in
im
nek et al. (1998a). In this study, the unsaturated soil hydraulic properties are described with the MVG model (van Genuchten, 1980; Mualem, 1976):
![]() | [18] |
![]() | [19] |
s is the saturated water content [L3 L–3],
r is the residual water content [L3 L–3],
VG [L–1] and nVG (dimensionless) are curve shape parameters, l is the pore-connectivity parameter of Mualem (1976), and Ks is the saturated hydraulic conductivity [L T–1]. In the preprocessing phase, the soil domain was discretized into a rectangular grid of finite elements using a uniform nodal distance of 0.02 m and maximum depth of 4.2 m. The numerical solution with this spacing was generally found to be accurate to within 0.5% of a much smaller nodal spacing but required far less computational time for a single forward run of the model. The initial pressure head throughout the soil profile was derived from observed tensiometric data at the onset of the simulation. Moreover, all our simulations were run with an atmospheric upper boundary condition (switching between a prescribed flux and prescribed head condition depending on the pressure head at the soil–air interface) and prescribed lower boundary pressure head derived from observed tensiometric data at the bottom of the profile (4.2-m depth).
Simulations were conducted for a 282-d period between 11 Apr. 2006 and 18 Jan. 2007. In this calibration study, we focus on the tensiometer data from the unsaturated zone only, encompassing the 0.4-, 1.0-, and 2.6-m depth intervals. Deeper intervals were influenced by lateral groundwater flow. Consistent with our knowledge of the soil profile, we used three different layers to properly characterize the unsaturated soil hydraulic properties. For each layer, we optimized the MVG parameters
s,
VG, nVG, Ks, and l. The upper and lower bounds of these parameters that define the prior uncertainty ranges are listed in Table 2
. Because of poor sensitivity, the residual water content was set to zero. This is a common assumption, reducing the number of calibration parameters and increasing the computational efficiency of the calibration.
To measure the ability of the HYDRUS-1D model to simulate the tensiometric data at the 0.4-, 1.0-, and 2.6-m depths, we defined separate RMSE values for each interval. The Pareto optimal solution space for the three criteria {F1, F2, F3} and 15 MVG parameters was estimated with AMALGAM using a population size of 100 points in combination with 20,000 function evaluations. The results of this three-criterion calibration are summarized in Fig. 7 , 8 , and 9 , and discussed below. An extensive treatment of this data set and multicriteria optimization using various different optimization algorithms is presented in Wöhling et al. (2008). Here we only show and discuss some initial results.
|
|
|
To illustrate the relative efficiency of AMALGAM, Fig. 8 presents a comparative efficiency analysis of different multiobjective optimization algorithms. This graph depicts the evolution of the sum of the best individual found objective function values for the three criteria as a function of the number of HYDRUS-1D model evaluations using the NSGA-II (Deb et al., 2002), MOSCEM-UA (Vrugt et al., 2003c), and AMALGAM (Vrugt and Robinson, 2007a; Wöhling et al., 2008) optimization algorithms. This graph is reproduced from results presented in Wöhling et al. (2008) and more details regarding this comparison can be found there. In general, results demonstrate that the AMALGAM method finds significantly better solutions than the other two nonlinear global optimization algorithms. Specifically, AMALGAM finds an overall objective function value after about 3000 runs that is better than the other two algorithms after 20,000 HYDRUS-1D model evaluations. It is probable that the performance of the other optimization algorithms would have been more similar in terms of parameter estimates and final objective function values if they would have been used in conjunction with a larger population size; however, running them with a larger population size would have significantly deteriorated their efficiency. Moreover, in this study we used commonly used values of the population. Despite these findings, the strength of AMALGAM is its good and reliable performance with a relatively small population, inspiring confidence in the efficiency of this multimethod search algorithm.
The hydraulic head predictions at the 0.4-m depth associated with the Pareto solution set is illustrated in Fig. 9 for a portion of the 228-d calibration period. Model predictions are indicated with solid lines, while the solid circles denote observations. The overall "best" prediction, having an average RMSE of about 0.14 m for the three different depths, is indicated separately with a dashed line. The Pareto prediction uncertainty ranges generally capture the observations, but can be considered quite large, especially during prolonged dry conditions. Perhaps smaller uncertainty bounds would be achievable if we used prior information in our model calibration for at least some of the parameters, such as the saturated water content (see, for instance, Mertens et al., 2004). Note that the fit of the best parameter combination can be considered quite good, with pressure head predictions that nicely fluctuate around the measured tensiometric values. A systematic delay in modeled response is found, however, after most of the rain events. This suggests preferential flow, a mechanism that was not explicitly incorporated in our model formulation; however, HYDRUS-1D hardly improved our results by mimicking this process through the use of a dual-porosity model.
Case Study 3: Parameter and State Estimation in Subsurface Media
Interwell tracer experiments provide the best possible method at hand to study the complex flow and transport properties of a subsurface system and to measure field-scale flow and transport parameters of aquifers and reservoirs. Tracer tests bridge theory and practice by providing access to the transport characteristics of a heterogeneous porous medium. Interpretation of field tracer experiments is notoriously difficult, however, due to a lack of detailed information on subsurface flow paths and mixing between pathways, incomplete knowledge about subsurface heterogeneity, and problems with parameter estimation.
Multitracer experiments are an extension of the tracer technique that in principle dramatically increases the information content relative to that of a single tracer breakthrough curve. For example, multiple tracers with different diffusion coefficients are useful to distinguish between the effects of diffusion and hydrodynamic dispersion for transport in fractured rock. Similarly, a comparison of the breakthrough curves of sorbing and nonsorbing tracers provides a direct estimate of the sorption properties along the flow path between the injection and production well. Thus, multitracer experiments help to reduce conceptual model uncertainty, providing far less ambiguous information on transport processes.
Here, we apply the SODA method to the interpretation of a multitracer experiment conducted in fractured volcanic tuffs at the C wells, near Yucca Mountain, Nevada. Cross-hole tracer tests were conducted between injection well C3 and production well C2. The purpose of this and other similar tests at this site was to establish the validity of transport models and laboratory-determined sorption parameters when applied to field-scale transport in the saturated zone beneath the proposed Yucca Mountain nuclear waste repository. A detailed description of the tracer test methods and initial interpretation was given by Reimus et al. (2003); only the aspects relevant to our modeling are repeated below. Our application is an extension of the recently published work by Vrugt et al. (2005b), which discussed the parameter estimation of a single tracer within a recursive data assimilation framework. Our current application extends this previous work by considering the measured breakthrough curves of different tracers simultaneously to estimate model parameters and states.
MultiTracer Breakthrough Curve Measurements
Breakthrough curves under forced-gradient conditions were measured for two nonsorbing solute tracers with different diffusion coefficients (Br– and pentafluorobenzoate [PFBA]) and one weakly sorbing tracer (Li+). Tracers were mixed in 12 m3 of water and injected simultaneously. Figure 10
shows the measured normalized concentrations of the three solute tracers at the production well as a function of time. By scaling the breakthrough curves to the relative mass of each tracer injected, differences between the curves can be attributed to differences in transport properties of the tracers. The attenuation of Br– relative to PFBA is evidence of diffusion from fractures into the rock matrix, and the more attenuated Li+ response is attributable to sorption (Reimus et al., 2003). Additionally, the test featured three flow interruptions (one unintentional and two intentional) during the tailing portion of the experiment. The increase in tracer concentration on resumption of flow is another signature of dual-porosity behavior, with fracture flow and diffusion into the rock matrix (Reimus et al., 2003).
|
Many modeling approaches have been developed to deal with these complexities (e.g., Cvetkovic and Shapiro, 1990; Destouni and Cvetkovic, 1991; Simmons et al., 1995; Ginn, 2001). In our case, the goal of multitracer experimentation was to identify transport processes such as diffusion and sorption, and to estimate field-scale parameters, rather than to describe the detailed pathways in between a given well pair. Such details are not really important to the goal but can stand in the way of obtaining useful results by introducing complexity in the flow model that is not critical to the task of evaluating diffusive and sorptive processes. The philosophy of our approach is to capture the site-specific details that will inevitably result in a diverse set of advective velocities with a simple yet flexible model so that focus can be on the underlying transport processes other than advection. While other approaches are possible, we have found this simplification to lead to fruitful results.
Similar to our previous work (Vrugt et al., 2005b), we apply the theory of micromixing to the interpretation of interwell tracer tests. The model has its origins in the field of chemical reaction engineering (Zwietering, 1959), and similar approaches have been adopted in the field of groundwater (e.g., Rainwater et al., 1987; Dagan and Cvetkovic, 1996). Instead of requiring the collection of complex, in situ information on the three-dimensional subsurface flow pathways, the method enforces a particular residence time distribution (RTD). Robinson and Viswanathan (2003) showed how to build this advection flow path with side exits and assign the locations and flow rates of the side exits to reproduce an arbitrary RTD. A schematic overview of the model is given in Fig. 11 .
|
In this study, we chose the probability density function of stable distributions to represent the RTD. Stable distributions are a rich class of probability laws that can be used to represent the skewness and tailing behavior in the probability density function. The characteristic function of a stably-distributed random variable P is defined as
![]() | [20] |
ST
1 and where
ST is an index of stability (
ST
(0,2]), βST is a skewness parameter (βST
[–1,1]),
is a scale parameter (
> 0), and
is a location parameter (
R). Initial sensitivity analysis and optimization runs demonstrated that the implementation of Eq. [20] would be most productive if scaling parameters in the x and y dimensions (sx and sy) are introduced, while fixing the values of βST,
, and
to 1, 60, and 60, respectively. To simplify our problem, we assumed βST = 1, forcing the stable density function to be skewed to the left, which is consistent with the data.
After definition of the RTD, the sorbing and nonsorbing tracers are modeled in the mixing model by including parameters for matrix diffusion for all tracers and sorption for Li+. It was assumed that all the tracers were transported along the same set of three-dimensional subsurface flow pathways. Consistent with laboratory sorption studies (Anghel et al., 2002), Li+ sorption was modeled using a Freundlich sorption isotherm:
![]() | [21] |
Table 3
summarizes the calibration parameters and their prior uncertainty ranges. Note that the parameters
ST, sx, and sy will affect the fitting of each of the three different tracer curves as it defines the RTD, whereas the additional diffusion and sorption parameters are tracer specific. This approach is consistent with the fact that all tracers undergo the same advective and mixing processes, while each tracer has unique sorption and diffusion behavior.
|
Parameter Estimation
When interpreting a multitracer experiment, the set of measurements available for parameter estimation is no longer a single outlet concentration, but consists of different transient mass outputs, each representing the measured breakthrough curve of one of the injected tracers. Considered in total, these individual tracer data sets probably contain conflicting information about the underlying transport system, so that a particular set of parameters might result in an excellent fit to one of the tracers while exhibiting poor predictive capabilities for another tracer. In principle, these individual breakthrough curves could be weighted in a single aggregate scalar, and a stochastic global optimization algorithm could be used to find the underlying posterior distribution of the parameters. In the absence of a compelling framework for the assignment of the weights, however, the parameter estimation problem is inherently multiobjective, and any attempt to convert it into a single-objective problem is associated with considerable subjectivity.
Here we implement the multiobjective model calibration approach considered in the previous case study. We use three RMSE objective functions, FBr, FPFBA, and FLi, to trade off the fitting of the various tracer curves. Each of these objective functions is computed using the EnKF-derived time series of forecast errors, zt in Eq. [11], for each of the respective breakthrough curves:
![]() | [22] |
The Pareto optimal solution space for the three criteria specified in Eq. [22] was estimated using a population size of 100 points, in combination with 10,000 trials with a distributed computing implementation of the AMALGAM algorithm. The calculations reported here were implemented using 50 Pentium IV 3.40 GHz processors of the LISA cluster at the SARA parallel computing center (University of Amsterdam, the Netherlands). The CPU time required for joint Pareto optimization and ensemble state estimation of the conceptual mixing model was approximately 22 h. The results of the three-criteria {FBr FPFBA, FLi} optimization are summarized in Table 3 and Fig. 12 , 13 , and 14 and discussed below.
|
|
|
A related observation is that the output prediction uncertainty ranges are much larger for the sorbing tracer Li+ than for the nonsorbing tracers Br– and PFBA. This finding demonstrates that the conceptual mixing model as currently formulated is appropriate for predicting the subsurface transport processes of nonsorbing tracers, and that most of the error in the model arises from incorrect specification of the sorption processes along the flow paths. Note that the characteristic jumping behavior of the uncertainty bounds in Fig. 12C is caused by an insufficient ensemble size.
It is instructive to contrast the results obtained using the multiobjective SODA approach with a classical multiobjective AMALGAM optimization without state-variable updating. Figure 13 presents the autocorrelation functions of the computed residuals for the SODA method (Fig. 13A, 13C, and 13E) and the classical approach (Fig. 13B, 13D, and 13F). Autocorrelation is a measure of the serial dependency of the error residuals between model predictions and corresponding data. Ideally, this measure should be centered around zero at all lags, implying a lack of bias (systematic errors). Notice that there is significant autocorrelation between the error residuals for the classic multiobjective optimization with AMALGAM without state updating. Although the classical method captures the general trends in the measured tracer data (not shown), the residuals exhibit considerable variation in bias, variance, and correlation structure. Without state updating, the single-pathway conceptual mixing model is unable to track precisely the time series of measured breakthrough data for each of the individual tracers. This results in significant serial correlation between the error residuals, the effects of which clouds the interpretation of the tests. By comparison, there is considerably less autocorrelation between the residuals when performing a multiobjective SODA calibration, suggesting that recursive state adjustments remove a large part of the bias in the model predictions. When properly acknowledging model conceptual uncertainty by state variable updating, the error residuals become serially independent and exhibit normality. This approach should yield more reliable parameter estimates and uncertainty bounds.
Finally, Fig. 14 presents normalized parameter plots of the results for the three-criteria {FBr, FPFBA, FLi} SODA calibration with the AMALGAM algorithm. Each line going from left to right across the normalized parameter plots correspond to a different parameter combination. The gray lines represent members of the Pareto set, whereas the red, blue, and black symbols refer to the optimal solutions for FBr, FPFBA, and FLi, respectively. The eight model parameters are listed along the x axis, and the y axis corresponds to the parameter values, normalized by their initial ranges, as defined in Table 3. Additionally, Table 3 lists the Pareto uncertainty intervals of the parameters estimated with the MOSCEM-UA algorithm. The plot at the right-hand side in Fig. 14 depicts a three-dimensional projection of the tradeoff surface of the three objective functions. The Pareto Rank 1 solutions are indicated with gray points, whereas the optimal solutions corresponding to the three objectives are denoted with the prior defined symbols.
The results presented in Fig. 14 demonstrate that most of the mixing-model parameters are well constrained by optimization against multiple tracer data, in that most of the parameters the Pareto uncertainty intervals occupy a small region interior to the prior-defined, physically plausible uncertainty ranges. Most significantly, the diffusion coefficients are in the same general range as laboratory-determined and field-estimated values (Reimus et al., 2003), despite the fact that a very large prior range was chosen. This suggests that there is sufficient information in the time series of measured breakthrough data to recursively estimate the model states and simultaneously estimate the RTD and tracer-specific diffusion parameters. This result lends strong support to the matrix diffusion model. Also note that the stable parameter
ST is estimated to be in vicinity of 0.5. For the special case when
ST = 0.5 and βST = 1 (fixed a priori), the stable distribution represents the Lévy distribution.
It is interesting to note that for some of the model parameters, the Pareto set is discontinuous, with different well-separated clusters of solutions in the parameter space. This is most obvious for the scaling parameters sx and sy, and highlights that different combinations of these parameters result in a similar RTD curve. This conclusion was expected; no particular physical significance can be attached to the parameters of the stable distribution. Rather, the RTD-based approach is designed to allow for flexibility in establishing a nondiffusive RTD to represent all of the site-specific, unknown details associated with the advective flow paths between the wells. The approach provides a sound foundation for identifying the diffusion and sorption processes and parameters in the absence of such detailed flow path information.
The three-dimensional projection of the objective function space of the Pareto set of solutions on the right-hand side of Fig. 14 illustrates that there is significant tradeoff in the fitting of the various breakthrough curves, suggesting that for this model structure, it is not possible to find a single set of parameter values that simultaneously provides an excellent fit to the measured breakthrough curves of each of the three tracers. The most significant tradeoffs occur between the fitting of the sorbing and nonsorbing tracers. This conjecture is consistent with our previous conclusion that most of the error in the conceptual mixing model arises from an insufficiently complex specification of the sorption processes along the flow path. Note that the optimization seeks sorption isotherm parameters that are significantly more nonlinear (lower values of nFR) than would be expected from batch sorption experiments (Anghel et al., 2002). Heterogeneity in sorption parameters within the fractured tuff formation is a possible reason for this result. Concentrations are highest within the short-residence-time paths, and the inverse model is curtailing the degree of sorption in this portion of the mixing model with the only parameter at its disposal, the exponent nFR. A better simultaneous fit to the early and late portions of the Li+ breakthrough curve would probably be obtained with a conceptual model of sorption heterogeneity, whereby short-residence-time paths have lower sorption coefficients than the longer paths. This concept will be explored in future work.
| Summary and Conclusions |
|---|
|
|
|---|
We have reviewed the historical development of the current perspectives on inverse modeling in unsaturated flow and transport modeling, and have highlighted some of our recent work on this topic at LANL. In particular, we have discussed a general-purpose multialgorithm evolutionary search approach, called AMALGAM, that can efficiently handle a high number of parameters in complex, distributed flow and transport models and find preferred parameter solutions quickly in multimodal and noisy response surfaces. In addition, we have discussed the implementation of AMALGAM on a distributed computer network using a standard MPI toolbox in Octave (open-source software). Finally, we have discussed a combined optimization and Monte Carlo based sequential nonlinear filtering method to improve the treatment of uncertainty and provide meaningful prediction uncertainty bounds on our model simulations. All these approaches have been illustrated with three case studies addressing the transport of water, vapor, and contaminants through the vadose zone.
Finally, we would like to emphasize that considerable progress in the application of inverse modeling to complex simulation models has also been reported in adjacent fields such as groundwater and surface hydrology. Methods and approaches developed there might be of great use and inspiration to solve inverse modeling problems in vadose zone hydrology. Particularly appealing are approaches that attempt to reduce the dimensionality of the optimization problem using concepts of sensitivity and principle component analysis (Tonkin and Doherty, 2005) and representer-based calibration methods (Valstar et al., 2004). Among emerging methods for uncertainty estimation not included here are Bayesian model averaging (Hoeting et al., 1999; Raftery et al., 2005; Vrugt et al., 2006a; Vrugt and Robinson, 2007b) and its maximum likelihood variant (Neuman, 2003; Ye et al., 2004, 2005). These methods seem particularly appealing because they are relatively easy to implement, accounting jointly for conceptual model and parameter uncertainty by considering a set of alternative models that may be very different from each other while being based on a common set of data.
| ACKNOWLEDGMENTS |
|---|
im
nek greatly enhanced the presentation of this manuscript. | REFERENCES |
|---|
|
|
|---|
javier/investigacion/papers/VecPar04.pdf (verified 10 Apr. 2008) Dep. de Arquitectura y Tecnología de Computadores, Univ. de Granada, Granada, Spain.
im
nek. 1999. Review of inverse estimation of soil hydraulic properties. p. 643–659. In M.Th. van Genuchten et al. (ed.) Characterization and measurement of the hydraulic properties of unsaturated porous media. Proc. Int. Worksh., Riverside, CA. 22–24 Oct. 1997. U.S. Salinity Lab., Riverside, CA.
im
nek, N. Romano, and W. Durner. 2002. Simultaneous determination of water transmission and retention properties: Inverse methods. p. 963–1008. In J.H. Dane and G.C. Topp (ed.) Methods of soil analysis. Part 4. Physical methods. SSSA Book Ser. 5. SSSA, Madison, WI.
im
nek. 2001. Indirect estimation of soil thermal properties and water flux from heat pulse measurements: Geometry and dispersion effects. Water Resour. Res. 38:1–14.
im
nek, J.W. Hopmans, and V. Clausnitzer. 1998. In situ estimation of soil hydraulic functions using a multistep soil-water extraction technique. Water Resour. Res. 34:1035–1050.[CrossRef]
im
nek, S. Shiozawa, and J.W. Hopmans. 2000. Simultaneous estimation of soil hydraulic and solute transport parameters from transient infiltration experiments. Adv. Water Resour. 23:677–688.[CrossRef]
im
nek, J., and M.Th. van Genuchten. 1996. Estimating unsaturated soil hydraulic properties from tension disc infiltrometer data by numerical inversion. Water Resour. Res. 32:2683–2696.[CrossRef]
im
nek, J., and M.Th. van Genuchten. 1997. Estimating unsaturated soil hydraulic properties from multiple tension disc infiltrometer data. Soil Sci. 162:383–398.[CrossRef]
im
nek, J., N.J. Jarvis, M.Th. van Genuchten, and A. Gärdenäs. 2003. Review and comparison of models for describing non-equilibrium and preferential flow and transport in the vadose zone. J. Hydrol. 272:14–35.[CrossRef]
im
nek, J., M.
ejna, and M.Th. van Genuchten. 1998a. The HYDRUS-1D software package for simulating the one-dimensional movement of water, heat, and multiple solutes in variably-saturated media. Version 2.0. IGWMC-TPS-70. Int. Ground Water Modeling Ctr., Colorado School of Mines, Golden.
im
nek, J., O. Wendroth, and M.Th. van Genuchten. 1998b. A parameter estimation analysis of the evaporation method for determining soil hydraulic properties. Soil Sci. Soc. Am. J. 62:894–905.
im
nek. 2001a. Calibration of a two-dimensional root water uptake model. Soil Sci. Soc. Am. J. 65:1027–1037.
im
nek. 2001b. One-, two-, and three-dimensional root water uptake functions for transient modeling. Water Resour. Res. 37:2457–2470.[CrossRef]
im
nek. 2001. Flow rate dependence of soil hydraulic characteristics. Soil Sci. Soc. Am. J. 65:35–48.This article has been cited by other articles:
![]() |
B. B. Mirus, K. S. Perkins, J. R. Nimmo, and K. Singha Hydrologic Characterization of Desert Soils with Varying Degrees of Pedogenesis: 2. Inverse Modeling for Effective Properties Vadose Zone J., May 21, 2009; 8(2): 496 - 509. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. H. Stauffer, J. A. Vrugt, H. J. Turin, C. W. Gable, and W. E. Soll Untangling Diffusion from Advection in Unsaturated Porous Media: Experimental Data, Modeling, and Parameter Uncertainty Vadose Zone J., May 21, 2009; 8(2): 510 - 522. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Simunek and S. A. Bradford Vadose Zone Modeling: Introduction and Importance Vadose Zone J., May 27, 2008; 7(2): 581 - 586. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Crop Science | |||
| Journal of Natural Resources and Life Sciences Education |
Soil Science Society of America Journal | ||||
| Journal of Plant Registrations | Journal of Environmental Quality |
The Plant Genome | |||