VZJ
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Published online 27 May 2008
Published in Vadose Zone J 7:843-864 (2008)
DOI: 10.2136/vzj2007.0078
© 2008 Soil Science Society of America
677 S. Segoe Rd., Madison, WI 53711 USA
This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Vrugt, J. A.
Right arrow Articles by Vesselinov, V. V.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Vrugt, J. A.
Right arrow Articles by Vesselinov, V. V.
GeoRef
Right arrow GeoRef Citation
Agricola
Right arrow Articles by Vrugt, J. A.
Right arrow Articles by Vesselinov, V. V.
Related Collections
Right arrow Inverse Procedures/Parameter Estimation
Right arrow Soil Physics
Right arrow Vadose Zone Processes and Chemical Transport

SPECIAL SECTION: VADOSE ZONE MODELING

Inverse Modeling of Subsurface Flow and Transport Properties: A Review with New Developments

Jasper A. Vrugta,*, Philip H. Staufferb, Th. Wöhlingc, Bruce A. Robinsond and Velimir V. Vesselinovb

a Center for Nonlinear Studies, Los Alamos National Lab., Los Alamos, NM 87545
b Earth and Environmental Sciences Division, Los Alamos National Lab., Los Alamos, NM 87545
c Lincoln Environmental Research, Ruakura Research Center, Hamilton, New Zealand
d Civilian Nuclear Program Office, Los Alamos National Lab., Los Alamos, NM 87545

* Corresponding author (vrugt{at}lanl.gov).

All rights reserved. No part of this periodical may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher.


Received 23 April 2007.



    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 Inverse Modeling: A Review
 Recent Advances in Inverse...
 Case Studies
 Summary and Conclusions
 REFERENCES
 
Many of the parameters in subsurface flow and transport models cannot be estimated directly at the scale of interest, but can only be derived through inverse modeling. During this process, the parameters are adjusted in such a way that the behavior of the model approximates, as closely and consistently as possible, the observed response of the system under study for some historical period of time. We briefly review the current state of the art of inverse modeling for estimating unsaturated flow and transport processes. We summariz how the inverse method works, discuss the historical background that led to the current perspectives on inverse modeling, and review the solution algorithms used to solve the parameter estimation problem. We then highlight our recent work at Los Alamos related to the development and implementation of improved optimization and data assimilation methods for computationally efficient calibration and uncertainty estimation in complex, distributed flow and transport models using parallel computing capabilities. Finally, we illustrate these developments with three different case studies, including (i) the calibration of a fully coupled three-dimensional vapor extraction model using measured concentrations of volatile organic compounds in the subsurface near the Los Alamos National Laboratory, (ii) the multiobjective inverse estimation of soil hydraulic properties in the HYDRUS-1D model using observed tensiometric data from an experimental field plot in New Zealand, and (iii) the simultaneous estimation of parameter and states in a groundwater solute mixture model using data from a multitracer experiment at Yucca Mountain, Nevada.

Abbreviations: CPU, central processing unit • EnKF, ensemble Kalman filter • GDPM, generalized dual-porosity model • KF, Kalman filter • LANL, Los Alamos National Laboratory • MDA, Material Disposal Area • MPI, message passing interface • MVG, Mualem–van Genuchten • PFBA, pentafluorobenzoate • RTD, residence time distribution • SLS, simple least squares • SVE, soil vapor extraction • VOC, volatile organic compound


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 Inverse Modeling: A Review
 Recent Advances in Inverse...
 Case Studies
 Summary and Conclusions
 REFERENCES
 
MODELS SIMULATING flow and transport through the vadose zone require accurate estimates of the soil water retention and hydraulic conductivity function (hereafter referred to as soil hydraulic properties) at the application scale of interest. In the past few decades, various laboratory experiments have been developed to facilitate a rapid, reliable, and cost-effective estimation of the soil hydraulic properties. For practical considerations, most of these experiments have focused on relatively small soil cores. Unfortunately, many contributions to the hydrologic literature have demonstrated an inability of these laboratory-scale measurements on small soil cores to accurately characterize flow and transport processes at larger spatial scales. This necessitates the development of alternative methods to derive the soil hydraulic properties at the application scale of the model.

Among the first to suggest the application of computer models to estimate soil hydraulic parameters were Whisler and Whatson (1968), who reported on estimating the saturated hydraulic conductivity of a draining soil by matching observed and simulated drainage. This process of iteratively adjusting the model parameters so that the model approximates, as closely and consistently as possible, the observed response of the system under study during some historical period of time is called inverse modeling. Parameter estimation using inverse modeling accommodates more flexible experimental conditions than typically utilized in laboratory experiments and facilitates estimating values of the hydraulic properties that pertain to the scale of interest, and thus is useful for upscaling. When adopting an inverse modeling approach, however, the soil hydraulic properties can no longer be estimated by direct inversion using closed form analytical equations but are determined using an iterative solution, thereby placing a heavy demand on computational resources.

In the field of vadose zone hydrology, inverse modeling usually involves the estimation of the soil water retention and unsaturated soil hydraulic conductivity characteristics using repeated numerical solutions of the governing Richards equation:

Formula 1[1]
thereby minimizing the difference between observed and model-predicted flow variables such as water content and fluxes. In Eq. [1], C(h) is the so-called soil water capacity, representing the slope of the soil water retention curve [L–1], h denotes the soil water matric head [L], t represents time [T], K is the unsaturated hydraulic conductivity tensor [L T–1], z denotes the gravitational head to be included for the vertical flow component only [L], and S is the volumetric sink term representing sources and or sinks of water [L3 L–3 T–1]. To solve Eq. [1] for the considered soil domain, appropriate initial and boundary conditions need to be specified, and time-varying sinks and sources need to be included.

We will briefly review the current state of the art of inverse modeling for estimating unsaturated flow and transport processes. We will explain how the inverse method works, discuss the historical background that led to the current perspectives on inverse modeling, and review the solution algorithms used to solve the parameter estimation problem. We will then highlight and illustrate some of our recent work at Los Alamos on self-adaptive multimethod global optimization, parallel computing, and sequential data assimilation to improve efficiency of parameter estimation in complex flow and transport applications and help assess parameter and model output prediction uncertainty.

Next, we present and discuss three different case studies that illustrate these various summarized developments. The first case study considers the calibration of a fully integrated three-dimensional vapor extraction model using observed data from a volatile organic compound in the subsurface at the Material Disposal Area near the Los Alamos National Laboratory (LANL). The second case study considers a multicriteria calibration of the Mualem–van Genuchten (MVG) parameters in the HYDRUS-1D model using observed tensiometric data from the Spydia field site in New Zealand. The last case study describes the application of a combined parameter and state estimation method to the interpretation of a multitracer experiment conducted at the Yucca Mountain field site in Nevada. This method, entitled Simultaneous Optimization and Data Assimilation (SODA) improves the treatment of input, model structural, and output error by combining the strengths of stochastic global optimization and recursive state updating using an ensemble Kalman filter.


    Inverse Modeling: A Review
 TOP
 ABSTRACT
 INTRODUCTION
 Inverse Modeling: A Review
 Recent Advances in Inverse...
 Case Studies
 Summary and Conclusions
 REFERENCES
 
We provide a short description of the underlying concept of inverse modeling and discuss the progress that has been made with particular emphasis on laboratory experiments. Excellent reviews on inverse modeling of soil hydraulic properties have been presented in Hopmans and Simunek (1999) and Hopmans et al. (2002). We suggest reading this material to get a more complete overview and background.

Mathematical Development
A schematic overview of the inverse modeling procedure appears in Fig. 1 . Consider a model {Phi} in which the discrete time evolution of the state vector {psi} is described by

Formula 2[2]
where Formula 2 represents the observed forcing (e.g., boundary conditions), β is the vector of parameter values, and t denotes time. In the current context, {Phi} represents Richards' equation augmented with parametric forms of the soil water retention and unsaturated hydraulic conductivity functions, whose exact shapes are defined by the values in β. Assume that realistic upper and lower bounds on each of the p model parameters, β = {β1, ..., βp} can be specified a priori, thereby defining the feasible space of solutions:

Formula 3[3]
If B is not the entire domain space, Formula 3p, the problem is said to be constrained.


Figure 1
View larger version (10K):
[in this window]
[in a new window]

 
FIG. 1. Schematic overview of inverse modeling: The model parameters are iteratively adjusted so that the predictions of the model, f, (represented with the solid line) approximate as closely and consistently as possible the observed response (represented with the dotted line). The symbol {oplus} represents observations of the forcing terms and responses that are subject to measurement erroros and uncertainty, and therefore may be different than the true values. Similarly, f represents the model with functional response of a rectangular box to indicate that the model is, at best, only an approximation of the underlying system. The label "output" on the y axis of the plot on the right-hand side can represent any time series of data.

 
A common approach to estimating the values of b for a particular site is to take a small soil sample from the field and conduct a transient experiment under controlled conditions with prescribed initial and boundary conditions. During this experiment, one or more flow-controlled variables are measured and collected in the vector Formula 3={Formula 31,...,Formula 3n}, where n denotes the total number of observations. After this, a simulation is performed with F given some initial guess for b (usually based on some prior information), and the vector of model predictions Y(β)={y1(β),...,yn(β)} is computed. These output predictions are directly related to the model state according to

Formula 4[4]
where the measurement operator W( ) maps the state space into the measurement or model output space. Finally, the difference between the model-simulated output and measured data is subsequently computed and stored in the residual vector, E:

Formula 5[5]
where the function G( ) allows for various user-selected linear or nonlinear transformations of the simulated and observed data.

The aim of parameter estimation or model calibration now becomes finding those values of β such that the measure E is in some sense forced to be as close to zero as possible. The formulation of a criterion that mathematically measures the size of E(β) is typically based on assumptions regarding the distribution of the measurement errors present in the observational data. The classical approach to estimating the parameters in Eq. [4] is to ignore input data uncertainty and to assume that the predictive model {Phi} is a correct representation of the underlying physical data-generating system. In line with classical statistical estimation theory, the residuals in Eq. [5] are then assumed to be mutually independent (uncorrelated) and normally distributed with a constant variance. Under these circumstances, the "best" parameter combination can be found by minimizing the following simple least square (SLS) objective function with respect to β:

Formula 6[6]
In this case, Hollenbeck and Jensen (1998) stretched the importance of model adequacy before sound statements can be made about the final parameter estimates and their uncertainty. Model adequacy is determined from

Formula 7[7]
where {sigma}T denotes the error deviation of the measurements, and Q( ) is the {chi}2 cumulative distribution with (np) degrees of freedom. This adequacy test gives us a measure of how well the optimized model fits the observations relative to their measurement precision. Using the definition in Eq. [7], models are adequate if padeq is >0.5.

Historical Background
During the last few decades, a great deal of research has been devoted to exploring the applicability and suitability of the inverse approach for the identification of soil hydraulic parameters. Most of this work has focused on laboratory experiments using small soil cores with well-defined boundary conditions to fully understand the prospects and limitations of the method and facilitate easy benchmarking against hydraulic properties derived from static or steady-state methods on similar soil samples. For our discussion, we group the early work on the use of inverse methods in vadose zone hydrology into five different categories, which we will discuss here in sequential order:

  1. The type of transient experiment and kind of initial and boundary conditions suited to yield a reliable characterization of the soil hydraulic properties were investigated by van Dam et al. (1992, 1994), Ciollaro and Romano (1995), Santini et al. (1995), Simunek and van Genuchten (1996, 1997), Inoue et al. (1998), Simunek et al. (1998b), Romano and Santini (1999), Durner et al. (1999), Wildenschild et al. (2001), Si and Bodhinayake (2005), and Zeleke and Si (2005). Various investigations have demonstrated the usefulness of the evaporation method, multistep outflow (MSO), ponded infiltration, tension infiltrometer, multistep soil water extraction, cone penetrometer, and falling head infiltration method for inverse estimation of the hydraulic parameters. These methods typically enable collection of a sufficient experimental range of data to at least be able to estimate some of the hydraulic parameters with good accuracy.
  2. The determination of the appropriate boundary conditions and most informative kind of observational data were investigated by Zachmann et al. (1981), Kool et al. (1985), Parker et al. (1985), Kool and Parker (1988), Valiantzas and Kerkides (1990), Toorman et al. (1992), Eching and Hopmans (1993), Eching et al. (1994), Durner et al. (1999), and Vrugt et al. (2002), among others. Early work reported by Kool et al. (1985) and Kool and Parker (1988) suggested that the transient experiments should cover a wide range in water contents and preferably include tensiometer measurements within the soil sample to match the observed {theta}(h) data (Eching and Hopmans, 1993). Additionally, in the case of outflow experiments, van Dam et al. (1994) has shown that more reliable parameter estimates can be obtained by incrementally increasing the pneumatic pressure in several steps instead of using a single pressure increment throughout the entire experiment. In Vrugt et al. (2001c, 2002), a global sensitivity analysis (PIMLI) was presented to show that the information content for the various soil hydraulic parameters was separated sequentially with time during a MSO experiment, and that this information could be used to help improve the identification of the global minimum in the search space.
  3. The selection of an appropriate model of the soil hydraulic properties was investigated by Zachmann et al. (1982), Russo (1988), and Zurmühl and Durner (1998). Numerous investigations have demonstrated that the MVG (Mualem, 1976; van Genuchten, 1980), Brooks and Corey (Brooks and Corey, 1964), Rossi and Nimmo (Rossi and Nimmo, 1994), and Kosugi (Kosugi, 1996, 1999) models appropriately describe the hydraulic properties of most soils. To further increase flexibility to fit experimental data, various researchers have proposed extensions of these models to describe multimodal pore size distributions, including multilevel spline approximations of the hydraulic properties (Bitterlich et al., 2004; Iden and Durner, 2007) and extensions to dual-porosity and dual-permeability models to simulate preferential flow (Simunek et al., 2003, and the many references therein).
  4. The implementation of methods to quantify the uncertainty associated with the inversely estimated parameters was investigated by Kool and Parker (1988), Hollenbeck and Jensen (1998), Vrugt and Bouten (2002), and Vrugt et al. (2003a, 2004). This work has focused on the implementation and use of traditional first-order approximation methods to estimate linear parameter uncertainty intervals (Carrera and Neuman, 1986; Kool and Parker, 1988), development of computationally more expensive response surface and Markov chain Monte Carlo sampling approaches to derive exact nonlinear confidence intervals (Toorman et al., 1992; Hollenbeck and Jensen, 1998; Romano and Santini, 1999; Vrugt et al., 2001c, 2003a), pseudo-Bayesian methods (Beven et al., 2006; Zhang et al., 2006), and multicriteria approaches to interpret the Pareto solution set of two or more conflicting objectives (Vrugt and Dane, 2005; Schoups et al., 2005a,b; Mertens et al., 2006; Wöhling et al., 2008).
  5. The construction and weighting of multiple sources of information in an objective function was investigated by van Dam et al. (1994), Clausnitzer and Hopmans (1995), Hollenbeck and Jensen (1998), and Vrugt and Bouten (2002) and search methods to efficiently locate the optimal parameters in rough, multimodal response surfaces (objective function mapped out in the parameter space) were developed and applied by Abbaspour et al. (1997, 2001), Pan and Wu (1998), Takeshita (1999), Vrugt et al. (2001c), Lambot et al. (2002), Vrugt and Bouten (2002), and Mertens et al. (2005, 2006). In recent years, Bayesian and pseudo-Bayesian approaches have been developed to weight multiple types of data in one objective function, and local and global optimization methods such as the sequential uncertainty fitting (SUFI), annealing-simplex, shuffled complex, and ant-colony methods have been proposed to find the optimal values of the soil hydraulic properties.

While initial applications of inverse modeling have focused primarily on the estimation of the unsaturated hydraulic properties from small soil cores, recent progress is also reported in the description of water uptake by plant roots (Vrugt et al., 2001a,b; Hupet et al., 2002), estimation of soil thermal properties (Hopmans et al., 2001), measurement of water content using time domain reflectometry (Huisman et al., 2004; Heimovaara et al., 2004), prediction of pore geometry based on air permeability measurements (Unsal et al., 2005), simultaneous estimation of soil hydraulic and solute transport parameters (Inoue et al., 2000), optimal allocation of electrical resistivity tomography electrodes to maximize measurement quality (Furman et al., 2004), and automated water content reconstruction of zero-offset borehole ground penetrating radar data (Rucker and Ferre, 2005). Moreover, with the ever-increasing pace of computational power and the inability of small-core measurements to adequately characterize larger scale flow and transport properties, application scales of inverse modeling have significantly increased in recent years, from the soil core to the field plot and regional scale, to provide solutions to emerging environmental problems such as salinization and groundwater pollution (Neupaier et al., 2000; Vrugt et al., 2004; Schoups et al., 2005a,b).

Among the first to apply the inverse modeling approach to a field situation were Dane and Hruska (1983), who estimated soil physical parameters using transient drainage data. The application of inverse modeling to estimate soil hydraulic properties across spatial scales is very promising, yielding parameters that represent effective conceptual representations of spatially and temporally heterogeneous watershed properties at the scale of interest. This approach is particularly powerful because it overcomes many of the difficulties associated with conventional upscaling approaches. With few exceptions, however, larger scale models are typically based on complex multidimensional governing equations, requiring significant computational resources for simulation and efficient optimization algorithms for model calibration. Moreover, unlike in small-scale experiments, boundary and initial conditions at the larger spatial scale are not as well defined because direct measurement techniques are mostly not available. Furthermore, the observational data available to characterize large-scale vadose zone processes are sparse, both in space and time. This requires the use of inverse algorithms that are efficient and can derive meaningful uncertainty estimates on the model parameters and associated model predictions. Note that inverse modeling is a powerful method to derive effective values of the soil hydraulic parameters at various spatial scales and circumvents many of the problems associated with conventional upscaling methods. For an excellent review of upscaling approaches for hydraulic properties and soil water flow, see Vereecken et al. (2007).

Parameter Estimation
For linear models, simple analytical solutions exist that conveniently minimize Eq. [6] and derive the optimal value for β at relatively low computational cost. Unfortunately, for most applications in subsurface flow and transport modeling (and many other inverse problems in hydrology), the optimal value of β can no longer be estimated by direct inversion and needs to be estimated by an iterative process. During this process, the parameters are iteratively adjusted so that the FSLS objective function is minimized and the model approximates, as closely and consistently as possible, the observed response of the system under study. Hence, this procedure is a critical component of the inverse modeling process, as the final optimized hydraulic properties might be susceptible to significant error if a wrong search procedure is used.

Because of the subjectivity and time-consuming nature of manual trial-and-error parameter estimation, there has been a great deal of research into the development of automatic methods for model calibration. Automatic methods seek to take advantage of the speed and power of computers, while being objective and easier to implement than manual methods. These algorithms may be classified as local search methodologies, when seeking for systematic improvement of the objective function using an iterative search starting from a single arbitrary initial point in the parameter space, or as global search methods in which multiple concurrent searches from different starting points are conducted within the parameter space.

One of the simplest local-search optimization methods, which is commonly used in the field of soil hydrology, is a Gauss–Newton type of derivative-based search (Marquardt, 1963; Zijlstra and Dane, 1996):

Formula 8[8]
where βk+1 is the updated parameter set and {nabla}f(βk) and H(βk) denote the gradient and Hessian matrix, respectively, evaluated at β = βk. From an initial guess of the parameters β0, a sequence of parameter sets, {β1, β2, βk+1}, is generated that is intended to converge to the global minimum of E(β) in the parameter space. Doherty (2004) and Clausnitzer and Hopmans (1995) presented general-purpose optimization software that implement this Gauss–Newton or Levenberg–Marquardt (Marquardt, 1963) type of search strategy.

The derivative-based search method defined in Eq. [8] will evolve toward the true optimum in the search space in situations where the objective function exhibits a convex response surface in the entire parameter domain. Unfortunately, numerous contributions to the hydrologic literature have demonstrated that the response surface seldom satisfies these conditions, but instead exhibits multiple optima in the parameter space with both small and large domains of attraction, discontinuous first derivatives, and curved multidimensional ridges. Local gradient-based search algorithms are not designed to handle these peculiarities, and therefore they often prematurely terminate their search with their final solution essentially being dependent on the starting point in the parameter space. Another emerging problem is that many of the hydraulic parameters typically demonstrate significant interaction because of an inability of the observed experimental data to properly constrain all of the calibration parameters. This further lowers the chance of finding a single unique solution with local search methodologies.

These considerations inspired researchers in many fields of study to develop robust global optimization methods that use multiple concurrent searches from different starting points to reduce the chance of getting stuck in a single basin of attraction. Global optimization methods that have been used for the estimation of the unsaturated soil hydraulic properties include the annealing–simplex method (Pan and Wu, 1998), genetic algorithms (Takeshita, 1999; Vrugt et al., 2001c), multilevel grid sampling strategies (Abbaspour et al., 2001; Lambot et al., 2002), ant-colony optimization (Abbaspour et al., 2001), and shuffled complex methods (Vrugt and Bouten, 2002; Vrugt et al., 2003c; Mertens et al., 2005, 2006).


    Recent Advances in Inverse Modeling
 TOP
 ABSTRACT
 INTRODUCTION
 Inverse Modeling: A Review
 Recent Advances in Inverse...
 Case Studies
 Summary and Conclusions
 REFERENCES
 
We highlight and illustrate here some of our recent work at Los Alamos on self-adaptive multimethod global optimization, parallel computing, and sequential data assimilation to improve efficiency of parameter estimation in complex flow and transport applications, and help assess parameter and model output prediction uncertainty. These methods are illustrated using three different case studies whose details and results are presented below.

Multimethod Global Optimization
The current generation of optimization algorithms typically implements a single operator for population evolution. Reliance on a single model of natural selection and adaptation presumes that a single method exists that can efficiently evolve a population of potential solutions through the parameter space and work well for a large range of problems. Existing theory and numerical benchmark experiments have demonstrated, however, that it is impossible to develop a single universal algorithm for population evolution that is always efficient for a diverse set of optimization problems (Wolpert and Macready, 1999). This is because the nature of the fitness landscape (objective function mapped out as function of β) often varies considerably between different optimization problems and often dynamically changes en route to the global optimal solution. It therefore seems productive to develop a search strategy that adaptively updates the way it generates offspring based on the local peculiarities of the response surface.

In light of these considerations, Vrugt and Robinson (2007a) and Vrugt et al. (2008) recently introduced a new concept of self-adaptive multimethod evolutionary search. This approach, entitled A MultiAlgorithm Genetically Adaptive Method (AMALGAM), runs a diverse set of optimization algorithms simultaneously for population evolution and adaptively favors individual algorithms that exhibit the highest reproductive success during the search. By adaptively changing preference to individual algorithms during the course of the optimization, AMALGAM has the ability to quickly adapt to the specific peculiarities and difficulties of the optimization problem at hand. Synthetic single- and multiobjective benchmark studies covering a diverse set of problem features, including multimodality, ruggedness, ill conditioning, nonseparability, interdependence (rotation), and high dimensionality, have demonstrated that AMALGAM significantly improves the efficiency of evolutionary search (Vrugt and Robinson, 2007a; Vrugt et al., 2008). An additional advantage of self-adaptive search is that the need for algorithmic parameter tuning is reduced, increasing the applicability to solving search and optimization problems in many different fields of study. An extensive algorithmic description and outline of AMALGAM, including comparison against other state-of-the-art single- and multiobjective optimization methods can be found in Vrugt and Robinson (2007a) and Vrugt et al. (2008). In our case studies, we illustrate the application of AMALGAM to inverse modeling of subsurface flow and transport properties. Before doing so, we first highlight recent advances in parallel computing to facilitate an efficient solution to the inverse problem for complex subsurface flow and transport models requiring significant computational time.

Parallel Computing Using Distributed Networks
The traditional implementation and application of many local and global optimization methods involves sequential execution of the algorithm using the computational power of a single central processing unit (CPU). Such an implementation works acceptably well for relatively simple optimization problems and those optimization problems with models that do not require much computational time to execute. For high-dimensional optimization problems involving complex spatially distributed models, such as are frequently used in the field of earth science, however, this sequential implementation needs to be revisited. Most computational time required for calibrating parameters in subsurface flow and transport models is spent running the model code and generating the desired output. Thus, there should be large computational efficiency gains from parallelizing the algorithm so that independent model simulations are run on different nodes in a distributed computer system.

Distributed computers have the potential to provide an enormous computational resource for solving complex environmental problems, and there is active research in this area to take better advantage of parallel computing resources. For example, in hydrology applications, parallel computing is being exploited to improve computational efficiency of individual, large-scale groundwater flow (Wu et al., 2002) and reactive transport (Hammond et al., 2005) models. Efforts to couple hydrologic models consisting of a network of individual submodels (groundwater, surface water, and atmospheric models) also are being designed in a way that submodels can be partitioned to different processors (Winter et al., 2004). Finally, parallel versions of model inversion and sensitivity analysis software such as PEST (Doherty, 2004) or the Shuffled Complex Evolution Metropolis (SCEM-UA) optimization algorithm (Vrugt et al., 2006b) have been developed.

The parallel implementation of AMALGAM is presented in Fig. 2 . In short, the master computer runs the algorithmic part of AMALGAM and generates an offspring population from the parent population using various genetic operators. This new population is distributed across a predefined number of computational nodes. These nodes (also referred to as slaves) execute the simulation model and compute the objective function of the points received. After this, the master computer collects the results and follows the various algorithmic steps to generate the next generation of points. This iterative process continues until convergence has been achieved. Various case studies presented in Vrugt et al. (2006b) have demonstrated that this setup results in an almost linear increase in speed for more complex simulation models, suggesting that the communication time between master and nodes is typically small compared with the time needed to run the model.


Figure 2
View larger version (20K):
[in this window]
[in a new window]

 
FIG. 2. Flowchart of a parallel implementation of AMALGAM. The master computer performs the various algorithmic steps in AMALGAM (on the left-hand side of the flowchart), while slave computers run the simulation model (on right-hand side of the flowchart).

 
The most important message passing interface (MPI) calls that are used to facilitate communication between the master and slave computers are: (i) MPI_Send—to send a package with parameter combinations (master) or model simulation outputs (slave); (ii) MPI_Prob—to check whether there are any incoming messages; (iii) MPI_Get_elements—to track the number of basic elements in the package; and (iv) MPI_Recv—to retrieve the information in the package. A detailed description of each of these functions appears in tutorials from the LAM Team (2006) and so will not be repeated here. An example implementation of these functions for Markov chain Monte Carlo simulation using the MPITB toolbox developed by Fernández et al. (2004) is presented in Vrugt et al. (2006b).

Bayesian and Multiobjective Inverse Modeling
Classical parameter optimization algorithms focus on the identification of a single best parameter combination, without recourse to uncertainty estimation. This seems unjustified given the presence of input (boundary conditions), output (calibration data), and model structural error (our model is only an approximation of reality) in most of our modeling efforts. Hence the assumptions underlying the classical approach to parameter estimation need to be revisited.

One response to directly confront the problem of overconditioning is to abandon the search for a single "best" parameter combination and adopt a Bayesian viewpoint, which allows the identification of a distribution of model parameters. The Bayesian approach treats the model parameters in Eq. [6] as probabilistic variables having a joint posterior probability density function, which summarizes our belief about the parameters β in light of the observed data Formula 8. An example of this approach is the SCEM-UA optimization algorithm for simultaneously estimating the traditional "best" parameter set and its underlying probability distribution within a single optimization run. Detailed investigations of the SCEM-UA-derived posterior mean, standard deviation, and Pearson correlation coefficients between the samples in the high probability density region of the parameter space facilitates the selection of an adequate model structure (Vrugt et al., 2003a), helps to assess how much complexity is warranted by the available calibration data (Vrugt et al., 2003b), and guides the development of optimal experimental design strategies (Vrugt et al., 2002). Applications of the SCEM-UA algorithms to inverse modeling of subsurface flow and transport properties are presented in Vrugt et al. (2003a, 2004), Vrugt and Dane (2005), and Schoups et al. (2005a,b).

Another response, highlighted here and illustrated in the second case study, is to pose the optimization problem in a multiobjective context (Gupta et al., 1998; Neuman, 1973). By simultaneously using a number of complementary criteria in the optimization procedure and analyzing the tradeoffs in the fitting of these criteria, the modeler is able to better understand the limitations of current model structures and gain insights into possible model improvements. As a commonplace illustration, consider the development of a personal investment strategy that simultaneously considers the objectives of high rate of return and low volatility. For this situation, there is no single optimal solution. Rather, there is a family of tradeoff solutions along a curve called the "Pareto-optimal front" in which improvement in one objective (say, high rate of return) comes only at the expense of a degradation of another objective (volatility). Development of robust algorithms to solve such optimization problems is a research direction that is currently attracting great interest.

The multiobjective AMALGAM method discussed above uses the Nondominated Sorted Genetic Algorithm (NSGA-II, Deb et al., 2002), Particle Swarm Optimizer (Kennedy et al., 2001), Adaptive Metropolis Search (Haario et al., 2001), and Different Evolution (Storn and Price, 1997) for population evolution and is significantly more robust and efficient than current Pareto optimization methods, with improvement approaching a factor of 10 for more complex, higher dimensional problems. We will illustrate the multiobjective version of AMALGAM in Case Study 2 below.

Improved Treatment of Uncertainty: Sequential Data Assimilation
Considerable progress has been made in the development and application of automated calibration methods for estimating the unknown parameters during inverse modeling. Nevertheless, major weaknesses of these calibration methods include their underlying treatment of the uncertainty as being primarily attributable to the model parameters, without explicit treatment of input, output, and model structural uncertainties. In subsurface flow and transport modeling, this assumption can be easily contested. Hence, uncertainties in the modeling procedure stem not only from uncertainties in the parameter estimates, but also from measurement errors associated with the system input and outputs, and from model structural errors arising from the aggregation of spatially distributed real-world processes into a relatively simple mathematical model. Not properly accounting for these errors results in error residuals that exhibit considerable variation in bias (nonstationarity), variance (heteroscedasticity), and correlation structures under different hydrologic conditions. If our goal is to derive meaningful parameter estimates that mimic the intrinsic properties of our underlying transport system, a more robust approach to the optimization problem is required. A few studies have discussed the treatment of input, state, and model structural uncertainties for subsurface flow and transport modeling (Valstar et al., 2004; McLaughlin and Townley, 1996; Katul et al., 1993; Parlange et al., 1993), but such approaches have not become common practice in current inverse modeling practices.

In the past few years, ensemble-forecasting techniques based on sequential data assimilation methods have become increasingly popular due to their potential ability to explicitly handle the various sources of uncertainty in geophysical modeling. Techniques based on the ensemble Kalman filter (EnKF, Evensen, 1994) have been suggested as having the power and flexibility required for data assimilation using nonlinear models. In particular, Vrugt et al. (2005a) recently presented the Simultaneous Optimization and Data Assimilation (SODA) method, which uses the EnKF to recursively update model states while estimating time-invariant values for the model parameters using the SCEM-UA (Vrugt et al., 2003d) optimization algorithm. A novel feature of SODA is its explicit treatment of errors due to parameter uncertainty, uncertainty in the initialization and propagation of state variables, model structural error, and output measurement errors. The development below closely follows that of Vrugt et al. (2005a), after which the application of this method to multitracer experiments is described.

To help facilitate the description of the classical Kalman filter (KF), we start by writing the model dynamics in Eq. [2] as a stochastic equation:

Formula 9[9]
where qt is a dynamical noise term representing errors in the conceptual model formulation. This stochastic forcing term flattens the probability density function of the states during the integration. We assume that the observation Eq. [4] also has a random additive error {varepsilon}t, called the measurement error:

Formula 10[10]
where {sigma}0 denotes the error deviation of the observations, and {psi}t* denotes the true model states at time t. At each measurement time, when an output observation becomes available, the output forecast error zt is computed:

Formula 11[11]
and the forecast states, {psi}tf, are updated using the standard KF analysis equation:

Formula 12[12]
where {psi}tu is the updated or analyzed state and Kt denotes the Kalman gain. The size of the gain depends directly on the size of the measurement and model error.

The analyzed state then recursively feeds the next state propagation step in the model:

Formula 13[13]

The virtue of the KF method is that it offers a very general framework for segregating and quantifying the effects of input, output, and model structural error in flow and transport modeling. Specifically, uncertainty in the model formulation and observational data are specified through the stochastic forcing terms q and {varepsilon}, whereas errors in the input data are quantified by stochastically perturbing the elements of X.

The SODA method is an extension of traditional techniques in that it uses the EnKF to solve Eq. [9–13]GoGoGoGo. The EnKF uses a Monte Carlo method to generate an ensemble of model trajectories from which the time evolution of the probability density of the model states and related error covariances are estimated (Evensen, 1994). The EnKF avoids many of the problems associated with the traditional extended Kalman Filter (EKF) method. For example, there is no closure problem, as is introduced in the EKF by neglecting contributions from higher order statistical moments in the error covariance evolution. Moreover, the conceptual simplicity, relative ease of implementation, and computational efficiency of the EnKF make the method an attractive option for data assimilation in the meteorologic, oceanographic, and hydrologic sciences.

In summary, the EnKF propagates an ensemble of state vector trajectories in parallel, such that each trajectory represents one realization of generated model replicates. When an output measurement is available, each forecasted ensemble state vector {psi}tf is updated by means of a linear updating rule in a manner analogous to the Kalman filter. A detailed description of the EnKF method, including the algorithmic details, is found in Evensen (1994) and so will not be repeated here.


    Case Studies
 TOP
 ABSTRACT
 INTRODUCTION
 Inverse Modeling: A Review
 Recent Advances in Inverse...
 Case Studies
 Summary and Conclusions
 REFERENCES
 
To illustrate the various methods described above, we present three different case studies. The first case study considers the calibration of a fully coupled three-dimensional vapor extraction model using measured concentrations of volatile organic compounds in the subsurface near the LANL. The second study involves the multiobjective inverse estimation of soil hydraulic properties in the HYDRUS-1D model using observed tensiometric data from an experimental field plot in New Zealand. The last case study presents a simultaneous estimation of parameter and states in a groundwater solute mixture model using data from a multitracer experiment at Yucca Mountain, Nevada.

Case Study 1: Global Optimization of a Three-Dimensional Soil Vapor Extraction Model
Thousands of sites across the United States are contaminated with volatile organic compounds (VOCs) including trichloroethane, trichloroethene, and tetrachloroethene. These industrial solvents have high vapor pressure and low solubility and generally migrate faster in the vapor phase than in the liquid phase (Jury et al., 1990). Volatile organic compound plumes in the vadose zone can grow rapidly and are more likely to spread laterally because vapor diffusion is typically four orders of magnitude greater than liquid diffusion (Fetter, 1999). Remediation of vadose zone VOC plumes is required to protect human health, the environment, and deeper groundwater resources. The VOC plumes in deep vadose zones rely on remediation techniques that differ substantially from the techniques used to remediate plumes in the saturated zone.

One of the primary remediation techniques currently used on VOCs in the vadose zone is soil vapor extraction (SVE). Soil vapor extraction is appealing because of the relatively low costs associated with installation and operation, the effectiveness of remediation, and its widespread use at contaminated sites (Lehr, 2004). This technique uses an applied vacuum to draw pore gas toward an extraction hole. In a contaminated area, the extracted soil gas will contain some fraction of the VOC in addition to air, water vapor, and CO2. As the VOC is removed from the subsurface pore gas, any dissolved, adsorbed, or liquid-phase VOC will tend to move into the vapor phase, thus reducing the total VOC plume (Hoeg et al., 2004). Depending on the off-gas concentrations and local regulations, the gas stream may need to be treated by methods such as C adsorption or burning with a catalytic agent.

The focus of the first case study is a VOC plume located in the vadose zone surrounding the primary liquid material disposal area (MDA L) at LANL. This area is located on Mesita del Buey, a narrow finger mesa situated near the eastern edge of the Pajarito Plateau (Broxton and Vaniman, 2005; McLin et al., 2005). During operations, liquid chemical waste was emplaced in 20-m-deep shafts on the mesa top. The VOCs from the shafts have subsequently leaked into the subsurface and created a vadose zone plume that extends laterally beyond the boundaries of the site and vertically to depths of more than 70 m below the ground surface. The vadose zone at this site is quite deep and SVE is being investigated as a possible corrective measure to remediate the plume. The initial investigation consisted of short-duration (~22 d) SVE tests on two extraction holes (indicated as SVE West and East) that were designed to collect a range of data including pressure responses and concentration measurements from both the extraction holes and surrounding monitoring holes. A detailed description of the VOC, pressure response data, and SVE numerical model appears in Stauffer et al. (2005, 2007a,b). Here, we focus on the calibration of a fully integrated three-dimensional SVE model using observed VOC extraction concentrations during the initial SVE test. Note that this case study of gas-phase transport is different than the typical applications of inverse modeling of soil hydraulic properties discussed above. Moreover, our numerical grid is made up of soil and rock mass, representing the geologic setting at LANL. Below, we shortly summarize the details relevant to our calibration.

Geologic Setting
The Pajarito plateau is located on the eastern flank of the Jemez volcanic center, and the rocks that form the plateau were created in two main ignimbrite eruptions that occurred at approximately 1.61 and 1.22 million yr (Izett and Obradovich, 1994). The plateau has been incised by canyons that drain into the Rio Grande. The regional aquifer is located approximately 300 m below the surface of MDA L, and no perched water was encountered during drilling at the site.

Figure 3 gives the general geography for MDA L and the surrounding area, and Fig. 4 shows the approximate site stratigraphy on a north–south cross-section of the mesa with several of the boreholes from Fig. 3 projected onto the plane. The most important rocks with respect to the current study are the ash-flow Units 1 and 2 of the Tshirege member of the Bandelier Tuff, Qbt1 and Qbt2, respectively. The uppermost unit at the site, Qbt2, is relative welded with ubiquitous vertical cooling joints. Many of the joints in this unit are filled with clay in the upper few meters, but are either open at depth or filled with powdered tuff (Neeper and Gilkeson, 1996). Unit Qbt1 is subdivided into upper and lower units, Qbt1-v and Qbt1-g, respectively; Qbt1-v is further subdivided into a nonwelded upper unit and a welded lower unit, Qbt-1vu and Qbt-1vc, respectively. Subunit Qbt-1vu is less welded than Qbt2 and has fewer cooling joints. Subunit Qbt-1vc is a welded tuff with many open vertical joints that provide rapid equilibration of pressure changes during pump tests (Neeper, 2002). Unit Qbt1-g is a nonwelded, glass-bearing ash flow that contains a few joints that appear to be continuations of joints formed in Qbt1-v that die out at depth. For a more complete description of the geologic framework of the Pajarito plateau, see Broxton and Vaniman (2005).


Figure 3
View larger version (116K):
[in this window]
[in a new window]

 
FIG. 3. Areal overview of Material Disposal Area L at the Los Alamos National Laboratory.

 

Figure 4
View larger version (19K):
[in this window]
[in a new window]

 
FIG. 4. Site stratigraphy with wells lying immediately to the east of Material Disposal Area L (after Neeper, 2002).

 
Model Formulation
Our SVE model builds on the Los Alamos porous flow simulator, FEHM, a one-, two-, and three-dimensional finite-volume heat and mass transfer code (Zyvoloski et al., 1997). This model has been used extensively for simulation of multiphase transport and has extensive capabilities to simulate subsurface flow and transport systems with complicated geometries in multiple dimensions (Stauffer et al., 1997, 2005; Stauffer and Rosenberg, 2000; Wolfsberg and Stauffer, 2004; Neeper and Stauffer, 2005; Kwicklis et al., 2006; and many others). Equations governing the conservation of phase mass, contaminant moles, and energy are solved numerically using a fully implicit Newton–Raphson scheme. A detailed description of FEHM appears in Zyvoloski et al. (1997).

The primary assumptions governing vapor-phase flow and transport are as follows. First, we assume that the vapor phase is composed solely of air that obeys the ideal gas law and calculate vapor-phase density (kg m–3) as a function of vapor pressure and temperature. Density differences due to spatial variations in VOC concentrations hardly affected gaseous movement. For example, at the maximum plume concentration of 3000 µm3 m–3, the pore gas changes in density from about 1 to 1.01 kg m–3. We use Darcy's law to calculate the advective volume flux. Vapor-phase contaminant conservation is governed by the advection–dispersion equation (Fetter, 1999) where the contaminant flux (mol m–2 s–1) is given by

Formula 14[14]
where vv represents the advective volume flux, {phi} denotes the porosity, Sv is vapor saturation defined as air-filled porosity divided by total porosity, Cv is the molar concentration (mol m–3) and the dispersion coefficient, Dcv, includes contributions from both dispersivity and molecular diffusion as

Formula 15[15]
where the molecular diffusion coefficient in FEHM is a function of the free-air diffusion coefficient (Dfree) and the tortuosity ({tau}) as

Formula 16[16]
The dispersivity tensor ({alpha}i) is directional; however, in FEHM we keep only the diagonal terms of this tensor. The superscript i implies that the equation is solved for the principle directions. For example, in three dimensions, the volume flux at any point can be decomposed into three principle components, vx, vy, and vz.. An additional constraint is imposed by Henry's Law equilibrium partitioning, which requires a constant ratio between concentrations in the liquid and vapor phase as

Formula 17[17]
where Henry's Law value and other properties relevant for trichloroethane (TCA) transport can be found in Stauffer et al. (2005, Table 2). In summary, the model is a molar-based solution to the advection–dispersion equation using Fickian transport theory. We do not account for the effects of non-Fickian diffusion; however, corrections for non-equimolar behavior are relatively small (<3%) (Fen and Abriola, 2004).


View this table:
[in this window]
[in a new window]

 
TABLE 2. Case Study 2: Lower and upper bounds for each of the HYDRUS-1D parameters used in the multicriteria optimization.

 
Three-Dimensional Model Domain and Computational Grid
The three-dimensional simulation domain is approximately centered on MDA L and includes the surrounding mesa and canyon environment from the land surface to the water table. Figure 5 shows a portion of the computational domain, the site boundary as a heavy black polygon, and an orthophoto showing roads and buildings. Material Disposal Area L is approximately 180 m east–west by 120 m north–south and the simulated domain extends beyond the site on all sides by a minimum of 100 m to minimize boundary effects. The computational grid is made up of >140,000 nodes and nearly 800,000 volume elements. The lateral extent is 410 m east–west by 370 m north–south. The grid extends vertically from an elevation of 1737 m above sea level (ASL) at the water table to 2074 m ASL on the northwestern corner of Mesita del Buey. The grid has a vertical resolution of 1 m in the top 90 m and stretches to a resolution of 25 m at the water table. The horizontal resolution is everywhere 10 m in both x and y directions. The grid captures the topography of the site and extends to the water table, >300 m below the surface of the mesa on which MDA L is situated. The deeper part of the grid, 90 m below the mesa top, has little impact on the simulations and is included with a vertical spacing of 10 m to address questions concerning plume impacts on the regional water table. The three-dimensional grid used here is an extension and refinement of the grid used in Stauffer et al. (2005) and images from that study will be helpful for visualizing the current domain and grid.


Figure 5
View larger version (35K):
[in this window]
[in a new window]

 
FIG. 5. Slice plane of the three-dimensional numerical grid. Colors depict the initial volatile organic compound (VOC) concentration before the soil vapor extraction pilot tests. The areal photograph is draped onto the digital elevation model of the site and shows the canyons on either side of the mesa. The Material Disposal Area L site boundary is indicated with the black polygon.

 
The computer code FEHM has a new capability that allows us to embed radial boreholes within an existing three-dimensional site-scale mesh (Pawar and Zyvoloski, 2006). This capability is used to reduce the total number of nodes required to capture the radial flow near the simulated SVE holes while also capturing the topography and stratigraphy at the site scale. Without this capability, we would have had to embed two three-dimensional extraction borehole meshes and all the necessary extra nodes to allow the borehole meshes to correctly connect to the existing three-dimensional grid while maintaining the Voronoi volume constraints that are required for computational accuracy. The wellbores used in the simulations each have an inner radius of 0.08 m and an outer shell radius of 2 m, with four nodes spanning this distance. Therefore, each well has one vertical line of nodes representing the open hole and four onion skins surrounding this. Both SVE holes have a total depth of 66 m with 67 nodes along the vertical. The nodes representing the open hole are assigned a permeability of 5 x 10–7 m2, providing little resistance to flow in the open hole. The first onion skins in the upper 20 m of each hole are assigned a permeability of 5 x 10–19 m2 and a diffusion coefficient (D*) of 5 x 10–19 m2 s–1 to simulate the effects of the steel casing. Nodes in the second and third onion skins are assigned the rock and tracer properties specified in a given simulation for the geologic unit in which they reside.

Calibration Details: Parameters and Initial and Boundary Conditions
In situ measured VOC concentrations at the extraction and surrounding bore holes were used to initialize the respective concentrations in the grid domain. A snapshot of the initial VOC concentration at the onset of the SVE pilot tests is presented in Fig. 5. The extraction flow rate during the test was assumed to be constant and calculated from an equation provided by the manufacturer of the orifice plate used to measure the pressure drop across a slight decrease in the diameter of the extraction pipe. We further simplified the modeling analysis by assuming no movement of the liquid phase. Furthermore, the atmospheric boundary pressure was held constant at 80 kPa, and the temperature was fixed to the yearly average of about 10°C (derived from local stations). The vertical side boundaries of the domain were no flow with respect to both heat and mass, and no flow of water or vapor was permitted across the bottom boundary, with its temperature being held constant at 25°C (Griggs, 1964). Finally, the porosity and saturation fraction of the different geologic units was fixed to measured values presented in Birdsell et al. (2002) and Springer (2005). Justification for all these assumptions in the context of the Pajarito Plateau was given in Stauffer et al. (2005).

Observed VOC concentrations in the vapor phase at the wellhead of the west and east extraction holes during the initial 22-d pilot experiment were used for SVE model calibration. Because of the significant spatial variability in permeability observed from straddle packer tests, the west and east pilot tests were calibrated separately. As calibration parameters, we selected the permeability of the rocks of the upper four geologic units depicted in Fig. 4. Initial sensitivity analysis demonstrated that deeper geologic units had marginal impact on the simulation results. To simulate the effects of vertical cracks, separate values for the horizontal and vertical permeability were optimized for each rock type. Moreover, we also optimized the permeability of asphalt because this material acts as an important diffusive barrier over almost the entire grid surface. Table 1 provides an overview of the calibration parameters and their initial uncertainty ranges. These ranges were based on straddle packer data and core permeability measurements presented in Neeper (2002) and Stauffer et al. (2007a,b).


View this table:
[in this window]
[in a new window]

 
TABLE 1. Case Study 1: Calibration parameters, their initial uncertainty ranges, and their final optimized values using AMALGAM for the western and eastern soil vapor extraction pilot tests.

 
A distributed computing implementation of AMALGAM was used to optimize the SVE model parameters for the west and east pilot tests using a simple FSLS objective function. We used a population size of 10 points, and hence 10 different slave computers, in combination with 120 computing hours on the LISA cluster at the SARA parallel computing center (University of Amsterdam, the Netherlands). Each of these nodes is equipped with a dual-core Intel Xeon 3.4-GHz processor with 4 GB of memory. The results of this single objective optimization are summarized in Table 1 and Fig. 6 and discussed below.


Figure 6
View larger version (16K):
[in this window]
[in a new window]

 
FIG. 6. Comparison of simulated (line) and observed (solid circles) volatile organic compound (VOC) outlet concentrations (ppmv is parts per million volume or µm3 m–3) as a function of time at the (A) western and (B) eastern extraction wells.

 
Figure 6 presents a time series plot of observed (solid circles) and simulated (solid line) VOC vapor-phase outlet concentrations at the western (Fig. 6A) and eastern (Fig. 6B) SVE wells. The corresponding optimized permeability values for the individual rock types are listed in Table 1. In general, the fit to the observed data can be considered quite good for both pilot tests after about 2 d, successfully capturing and simulating the process of matrix flow. During the first 2 d, however, the SVE model significantly underestimated observed VOC concentrations. This initial misfit was caused by flow through joints and fractures, a process widely observed throughout the Pajarito Plateau and the experimental site, but not explicitly included in our model. We represent fracture flow by allowing AMALGAM to optimize the horizontal and vertical permeability in ranges above measured matrix values. This implementation combines the effect of matrix and fracture flow, and does not allow us to explicitly simulate flow through fractures, which would be required to match the early-time data. Hence, much better predictions at the initial time steps are possible if we explicitly incorporate fracture flow in the model. One approach to doing this would be to augment the current SVE model with the generalized dual porosity model (GDPM) presented in Zyvoloski et al. (2008). This method assumes one-dimensional transport into and out of the matrix using multiple closely spaced nodes connected to the primary fracture nodes. This setup can capture preferential flow and transport processes and therefore will probably simulate the high initial extraction of the VOC observed in the experimental data. An example of the implementation of GDPM is discussed below in Case Study 3. We are currently in the process of including this in our SVE model as well. Note that the diurnal variations in the VOC concentrations are due to the measurements being calibrated at a single temperature. As temperature varies during a 24-h cycle, a constant concentration in the outflowing gas would thus lead to a sinusoidal measurement time series. Nevertheless, these changes appear small, and therefore we have attempted to calibrate the SVE model to the mean signal.

The optimized permeabilities for the west and east pilot tests are in good agreement, generally within an order of magnitude difference. The optimized permeabilities appear reasonable and demonstrate the appropriate variability as observed in the field (Neeper, 2002; Stauffer et al., 2007a,b). Moreover, their optimized values are typically within the middle of the prior uncertainty ranges, only approaching the outer bounds for a few parameters. We hypothesize that more realistic values for these particular parameters will be obtained when fracture flow is explicitly included in the SVE model. It seems unjustified, however, to overcondition the calibration process to a single "best" parameter combination as done here, given the presence of systematic errors in our model predictions so apparent during the first 2 d for both pilot tests. In the next two case studies, therefore, we will provide a better treatment of parameter, model, and output uncertainty.

Case Study 2: Multiobjective Calibration of Soil Hydraulic Properties
We conducted a multiobjective calibration of the unsaturated soil hydraulic properties using observed tensiometric pressure data from three different depth intervals at the Spydia field site in the Lake Taupo catchment in New Zealand. The vadose zone materials at this site consist of a loamy sand to sand material with gravel fractions up to 34%. The soil encompasses a young volcanic soil (0–1.6 m depth) on top, followed by the unwelded Taupo Ignimbrite (1.6–4.4 m) and two older Paleosols (4.5–5.8 m) having a late Pleistocene to Holocene age. Their parent material was inferred to be weathered tephra or tephric loess. A detailed physical and textural analysis of the vadose zone at the experimental site is given in Wöhling et al. (2008) and so will not be repeated here.

Altogether, 15 tensiometer probes (Type UMS T4e, UMS Umweltanalytische Mess-Systeme GmbH, Munich) were installed at five depths (0.4, 1.0, 2.6, 4.2, and 5.1 m below the soil surface) using three different replicates at each depth. The pressure head was recorded at 15-min intervals using a compact field point controller (cFP2010, National Instruments, Austin, TX). In addition, precipitation was recorded by event using a 0.2-mm bucket gauge and upscaled to hourly values for use in our calculations. Daily values of potential evapotranspiration were calculated with the Penman–Monteith equation (Allen et al., 1998) using observed meteorological data from the nearby Waihora weather station. For our inverse modeling exercise, we used the HYDRUS-1D model (Simunek et al., 1998a). This model simulates water flow in variably saturated porous media and uses the Galerkin finite element method based on the mass conservative iterative scheme proposed by Celia et al. (1990). An extensive description of this model appears in Simunek et al. (1998a). In this study, the unsaturated soil hydraulic properties are described with the MVG model (van Genuchten, 1980; Mualem, 1976):

Formula 18[18]
and

Formula 19[19]
where {theta}s is the saturated water content [L3 L–3], {theta}r is the residual water content [L3 L–3], {alpha}VG [L–1] and nVG (dimensionless) are curve shape parameters, l is the pore-connectivity parameter of Mualem (1976), and Ks is the saturated hydraulic conductivity [L T–1].

In the preprocessing phase, the soil domain was discretized into a rectangular grid of finite elements using a uniform nodal distance of 0.02 m and maximum depth of 4.2 m. The numerical solution with this spacing was generally found to be accurate to within 0.5% of a much smaller nodal spacing but required far less computational time for a single forward run of the model. The initial pressure head throughout the soil profile was derived from observed tensiometric data at the onset of the simulation. Moreover, all our simulations were run with an atmospheric upper boundary condition (switching between a prescribed flux and prescribed head condition depending on the pressure head at the soil–air interface) and prescribed lower boundary pressure head derived from observed tensiometric data at the bottom of the profile (4.2-m depth).

Simulations were conducted for a 282-d period between 11 Apr. 2006 and 18 Jan. 2007. In this calibration study, we focus on the tensiometer data from the unsaturated zone only, encompassing the 0.4-, 1.0-, and 2.6-m depth intervals. Deeper intervals were influenced by lateral groundwater flow. Consistent with our knowledge of the soil profile, we used three different layers to properly characterize the unsaturated soil hydraulic properties. For each layer, we optimized the MVG parameters {theta}s, {alpha}VG, nVG, Ks, and l. The upper and lower bounds of these parameters that define the prior uncertainty ranges are listed in Table 2 . Because of poor sensitivity, the residual water content was set to zero. This is a common assumption, reducing the number of calibration parameters and increasing the computational efficiency of the calibration.

To measure the ability of the HYDRUS-1D model to simulate the tensiometric data at the 0.4-, 1.0-, and 2.6-m depths, we defined separate RMSE values for each interval. The Pareto optimal solution space for the three criteria {F1, F2, F3} and 15 MVG parameters was estimated with AMALGAM using a population size of 100 points in combination with 20,000 function evaluations. The results of this three-criterion calibration are summarized in Fig. 7 , 8 , and 9 , and discussed below. An extensive treatment of this data set and multicriteria optimization using various different optimization algorithms is presented in Wöhling et al. (2008). Here we only show and discuss some initial results.


Figure 7
View larger version (25K):
[in this window]
[in a new window]

 
FIG. 7. Normalized parameter ranges for each of the Mualem–van Genuchten parameters in the HYDRUS-1D model using the three-criteria {F0.4, F1.0, F2.6} calibration with AMALGAM. Each line going from left to right across the plot denotes a single Pareto solution. The squared plot at the right-hand side represents a three-dimensional projection of the objective space of the Pareto solution set. The single-objective solutions for F0.4, F1.0, and F2.6 are indicated in red, blue, and black, respectively.

 

Figure 8
View larger version (12K):
[in this window]
[in a new window]

 
FIG. 8. Evolution of the best overall RMSE value for the three different depths as a function of the number of HYDRUS-1D model evaluations with the NSGA-II (Deb et al. 2002), MOSCEM-UA (Vrugt et al., 2003c), and AMALGAM multiobjective optimization algorithms. An extensive comparison of these algorithms appears in Wöhling et al. (2008).

 

Figure 9
View larger version (22K):
[in this window]
[in a new window]

 
FIG. 9. Time series plots of observed and simulated pressure heads with the HYDRUS-1D model at three different depths using a representative 125-d portion of the calibration period. The solid circles denote observed data and the gray lines represent the Pareto solutions.

 
Figure 7 presents normalized parameter plots for each of the 15 MVG parameters in the HYDRUS-1D model. The various parameters are listed along the x axis, while the y axis corresponds to the parameter values scaled according to their prior ranges defined in Table 2 to yield normalized ranges. Each line across the graph represents one Pareto solution. The lines separately indicated with symbols represent the optimal solutions for the three different criteria. The three-dimensional plot at the right-hand side depicts the Pareto solution surface in objective function space. Notice that there is significant variation in identifiability of the MVG parameters. For instance, there is considerable uncertainty associated with the saturated hydraulic conductivity for each of the three layers with Pareto solutions that span the entire prior range. On the contrary, the shape parameter nVG tends to more closely cluster in the Pareto space, with values that are consistent with the soil type found at the field site. Significant tradeoff in the fitting of the various criteria is also observed in the Pareto surface of solutions on the right-hand side. If the model would be a perfect description of reality and the input and output data were observed without error, than a single combination of the 15 MVG parameters would exist that perfectly fits the observations at the various depths. In that case, the Pareto surface would consist of a single point, with values of zero for the individual objectives. The presence of various sources of error results in an inability of the HYDRUS-1D model to perfectly describe the data with a single parameter combination and therefore results in a tradeoff in the fitting of different objectives. It seems logical that there is a strong connection between Pareto uncertainty and the various error sources, although it is not particularly clear what this relationship is.

To illustrate the relative efficiency of AMALGAM, Fig. 8 presents a comparative efficiency analysis of different multiobjective optimization algorithms. This graph depicts the evolution of the sum of the best individual found objective function values for the three criteria as a function of the number of HYDRUS-1D model evaluations using the NSGA-II (Deb et al., 2002), MOSCEM-UA (Vrugt et al., 2003c), and AMALGAM (Vrugt and Robinson, 2007a; Wöhling et al., 2008) optimization algorithms. This graph is reproduced from results presented in Wöhling et al. (2008) and more details regarding this comparison can be found there. In general, results demonstrate that the AMALGAM method finds significantly better solutions than the other two nonlinear global optimization algorithms. Specifically, AMALGAM finds an overall objective function value after about 3000 runs that is better than the other two algorithms after 20,000 HYDRUS-1D model evaluations. It is probable that the performance of the other optimization algorithms would have been more similar in terms of parameter estimates and final objective function values if they would have been used in conjunction with a larger population size; however, running them with a larger population size would have significantly deteriorated their efficiency. Moreover, in this study we used commonly used values of the population. Despite these findings, the strength of AMALGAM is its good and reliable performance with a relatively small population, inspiring confidence in the efficiency of this multimethod search algorithm.

The hydraulic head predictions at the 0.4-m depth associated with the Pareto solution set is illustrated in Fig. 9 for a portion of the 228-d calibration period. Model predictions are indicated with solid lines, while the solid circles denote observations. The overall "best" prediction, having an average RMSE of about 0.14 m for the three different depths, is indicated separately with a dashed line. The Pareto prediction uncertainty ranges generally capture the observations, but can be considered quite large, especially during prolonged dry conditions. Perhaps smaller uncertainty bounds would be achievable if we used prior information in our model calibration for at least some of the parameters, such as the saturated water content (see, for instance, Mertens et al., 2004). Note that the fit of the best parameter combination can be considered quite good, with pressure head predictions that nicely fluctuate around the measured tensiometric values. A systematic delay in modeled response is found, however, after most of the rain events. This suggests preferential flow, a mechanism that was not explicitly incorporated in our model formulation; however, HYDRUS-1D hardly improved our results by mimicking this process through the use of a dual-porosity model.

Case Study 3: Parameter and State Estimation in Subsurface Media
Interwell tracer experiments provide the best possible method at hand to study the complex flow and transport properties of a subsurface system and to measure field-scale flow and transport parameters of aquifers and reservoirs. Tracer tests bridge theory and practice by providing access to the transport characteristics of a heterogeneous porous medium. Interpretation of field tracer experiments is notoriously difficult, however, due to a lack of detailed information on subsurface flow paths and mixing between pathways, incomplete knowledge about subsurface heterogeneity, and problems with parameter estimation.

Multitracer experiments are an extension of the tracer technique that in principle dramatically increases the information content relative to that of a single tracer breakthrough curve. For example, multiple tracers with different diffusion coefficients are useful to distinguish between the effects of diffusion and hydrodynamic dispersion for transport in fractured rock. Similarly, a comparison of the breakthrough curves of sorbing and nonsorbing tracers provides a direct estimate of the sorption properties along the flow path between the injection and production well. Thus, multitracer experiments help to reduce conceptual model uncertainty, providing far less ambiguous information on transport processes.

Here, we apply the SODA method to the interpretation of a multitracer experiment conducted in fractured volcanic tuffs at the C wells, near Yucca Mountain, Nevada. Cross-hole tracer tests were conducted between injection well C3 and production well C2. The purpose of this and other similar tests at this site was to establish the validity of transport models and laboratory-determined sorption parameters when applied to field-scale transport in the saturated zone beneath the proposed Yucca Mountain nuclear waste repository. A detailed description of the tracer test methods and initial interpretation was given by Reimus et al. (2003); only the aspects relevant to our modeling are repeated below. Our application is an extension of the recently published work by Vrugt et al. (2005b), which discussed the parameter estimation of a single tracer within a recursive data assimilation framework. Our current application extends this previous work by considering the measured breakthrough curves of different tracers simultaneously to estimate model parameters and states.

MultiTracer Breakthrough Curve Measurements
Breakthrough curves under forced-gradient conditions were measured for two nonsorbing solute tracers with different diffusion coefficients (Br and pentafluorobenzoate [PFBA]) and one weakly sorbing tracer (Li+). Tracers were mixed in 12 m3 of water and injected simultaneously. Figure 10 shows the measured normalized concentrations of the three solute tracers at the production well as a function of time. By scaling the breakthrough curves to the relative mass of each tracer injected, differences between the curves can be attributed to differences in transport properties of the tracers. The attenuation of Br relative to PFBA is evidence of diffusion from fractures into the rock matrix, and the more attenuated Li+ response is attributable to sorption (Reimus et al., 2003). Additionally, the test featured three flow interruptions (one unintentional and two intentional) during the tailing portion of the experiment. The increase in tracer concentration on resumption of flow is another signature of dual-porosity behavior, with fracture flow and diffusion into the rock matrix (Reimus et al., 2003).


Figure 10
View larger version (16K):
[in this window]
[in a new window]

 
FIG. 10. Measured normalized concentrations of Br, pentafluorobenzoate, and Li+ at the production well. The interwell tracer experiment was conducted in the period between September 1998 and January 1999 at the C-wells near Yucca Mountain, Nevada.

 
Conceptual Transport Model
Incomplete knowledge of subsurface heterogeneity and lack of detailed information of subsurface flow paths are complicating uncertainties that place practical limits on the complexities of flow and transport models. Thus, tracer tests are often modeled with idealized one-dimensional conceptual representations based on single-pathway models rather than full three-dimensional numerical models of subsurface flow and transport. Unfortunately, field tracer tests are often affected by complex flow through multiple paths and can exhibit long tails, skewness, and multiple peaks. If the underlying conceptual model for flow, mixing, and dispersion is too simple, transport parameter estimates will be dominated by structural deficiencies in the mixing part of the transport model and the advantage of using multiple tracers will be negated.

Many modeling approaches have been developed to deal with these complexities (e.g., Cvetkovic and Shapiro, 1990; Destouni and Cvetkovic, 1991; Simmons et al., 1995; Ginn, 2001). In our case, the goal of multitracer experimentation was to identify transport processes such as diffusion and sorption, and to estimate field-scale parameters, rather than to describe the detailed pathways in between a given well pair. Such details are not really important to the goal but can stand in the way of obtaining useful results by introducing complexity in the flow model that is not critical to the task of evaluating diffusive and sorptive processes. The philosophy of our approach is to capture the site-specific details that will inevitably result in a diverse set of advective velocities with a simple yet flexible model so that focus can be on the underlying transport processes other than advection. While other approaches are possible, we have found this simplification to lead to fruitful results.

Similar to our previous work (Vrugt et al., 2005b), we apply the theory of micromixing to the interpretation of interwell tracer tests. The model has its origins in the field of chemical reaction engineering (Zwietering, 1959), and similar approaches have been adopted in the field of groundwater (e.g., Rainwater et al., 1987; Dagan and Cvetkovic, 1996). Instead of requiring the collection of complex, in situ information on the three-dimensional subsurface flow pathways, the method enforces a particular residence time distribution (RTD). Robinson and Viswanathan (2003) showed how to build this advection flow path with side exits and assign the locations and flow rates of the side exits to reproduce an arbitrary RTD. A schematic overview of the model is given in Fig. 11 .


Figure 11
View larger version (11K):
[in this window]
[in a new window]

 
FIG. 11. Illustration of the conceptual transport model used for characterization of the subsurface. The arrows denote the side exits to reproduce a given residence time distribution; Ct,prod denotes the simulated concentration at the production well, and Nnodes is the total number of nodes in the model.

 
In an extension of our previous work, the GDPM briefly discussed in Case Study 1 was applied in the present study to represent diffusion and sorption from the fractured network into the rock matrix. This method assumes one-dimensional transport into and out of a matrix connected to each primary porosity node (representing the fracture domain) in the mixing model. In contrast to other dual-porosity models, the GDPM method allows multiple, closely spaced matrix nodes to be used to capture sharp concentration gradients immediately adjacent to the primary (fracture) flow medium. The model domain of primary porosity and one-dimensional matrix nodes is represented in integrated finite difference form, and the computer code FEHM (Zyvoloski et al., 1997, 2008) is used to perform the transport calculations. The present study represents, to our knowledge, the first attempt to couple a residence-time-based conceptual model for advection to a matrix diffusion mass exchange model.

In this study, we chose the probability density function of stable distributions to represent the RTD. Stable distributions are a rich class of probability laws that can be used to represent the skewness and tailing behavior in the probability density function. The characteristic function of a stably-distributed random variable P is defined as

Formula 20[20]
when {alpha}ST != 1 and where {alpha}ST is an index of stability ({alpha}ST isin (0,2]), βST is a skewness parameter (βST isin [–1,1]), {gamma} is a scale parameter ({gamma} > 0), and {delta} is a location parameter ({delta} isin R). Initial sensitivity analysis and optimization runs demonstrated that the implementation of Eq. [20] would be most productive if scaling parameters in the x and y dimensions (sx and sy) are introduced, while fixing the values of βST, {gamma}, and {delta} to 1, 60, and 60, respectively. To simplify our problem, we assumed βST = 1, forcing the stable density function to be skewed to the left, which is consistent with the data.

After definition of the RTD, the sorbing and nonsorbing tracers are modeled in the mixing model by including parameters for matrix diffusion for all tracers and sorption for Li+. It was assumed that all the tracers were transported along the same set of three-dimensional subsurface flow pathways. Consistent with laboratory sorption studies (Anghel et al., 2002), Li+ sorption was modeled using a Freundlich sorption isotherm:

Formula 21[21]
where S and C are the sorbed and aqueous concentrations at node i, respectively, and Kf and nFR are sorption parameters that need to be estimated by modeling the Li+ breakthrough curve. The diffusion and sorption parameters were assumed to be constant along the flow paths, an assumption that is tantamount to assuming uniform transport properties in the formation.

Table 3 summarizes the calibration parameters and their prior uncertainty ranges. Note that the parameters {alpha}ST, sx, and sy will affect the fitting of each of the three different tracer curves as it defines the RTD, whereas the additional diffusion and sorption parameters are tracer specific. This approach is consistent with the fact that all tracers undergo the same advective and mixing processes, while each tracer has unique sorption and diffusion behavior.


View this table:
[in this window]
[in a new window]

 
TABLE 3. Case Study 3: Los Alamos porous flow simulator FEHM parameters subject to calibration, with their initial uncertainty and final optimized multicriteria ranges using the AMALGAM algorithm.

 
State Estimation
Similarly to our previous work (Vrugt et al., 2005b), the mismatch between the measured and simulated tracer concentrations at the production well (left-hand side in Fig. 11) were used to recursively update the FEHM simulated concentrations of the various tracers. Computational details on how to do this are found in Vrugt et al. (2005b). Note that in this study, we did not impose any mass constraints on the proposed sequential state updates, possibly resulting in systematic tracer removal or addition. In principle, we could add this constraint to the KF, but a posteriori analysis has demonstrated that this was not urgent, as the total mass balance was within 5%. Based on recommendations in the classic treatise of the EnKF in Evensen (1994), the time evolution of the model error was modeled as a first-order autoregressive process.

Parameter Estimation
When interpreting a multitracer experiment, the set of measurements available for parameter estimation is no longer a single outlet concentration, but consists of different transient mass outputs, each representing the measured breakthrough curve of one of the injected tracers. Considered in total, these individual tracer data sets probably contain conflicting information about the underlying transport system, so that a particular set of parameters might result in an excellent fit to one of the tracers while exhibiting poor predictive capabilities for another tracer. In principle, these individual breakthrough curves could be weighted in a single aggregate scalar, and a stochastic global optimization algorithm could be used to find the underlying posterior distribution of the parameters. In the absence of a compelling framework for the assignment of the weights, however, the parameter estimation problem is inherently multiobjective, and any attempt to convert it into a single-objective problem is associated with considerable subjectivity.

Here we implement the multiobjective model calibration approach considered in the previous case study. We use three RMSE objective functions, FBr, FPFBA, and FLi, to trade off the fitting of the various tracer curves. Each of these objective functions is computed using the EnKF-derived time series of forecast errors, zt in Eq. [11], for each of the respective breakthrough curves:

Formula 22[22]

The Pareto optimal solution space for the three criteria specified in Eq. [22] was estimated using a population size of 100 points, in combination with 10,000 trials with a distributed computing implementation of the AMALGAM algorithm. The calculations reported here were implemented using 50 Pentium IV 3.40 GHz processors of the LISA cluster at the SARA parallel computing center (University of Amsterdam, the Netherlands). The CPU time required for joint Pareto optimization and ensemble state estimation of the conceptual mixing model was approximately 22 h. The results of the three-criteria {FBr FPFBA, FLi} optimization are summarized in Table 3 and Fig. 12 , 13 , and 14 and discussed below.


Figure 12
View larger version (24K):
[in this window]
[in a new window]

 
FIG. 12. Normalized breakthrough curve model predictions for (A) Br, (B) pentafluorobenzoate, and (C) Li+ using the multiobjective SODA method. In each case, the symbols denote the measured data, the dark gray region indicates the prediction uncertainty bounds associated with the Pareto uncertainty in the parameters, and the light gray region represents conceptual model uncertainty.

 

Figure 13
View larger version (31K):
[in this window]
[in a new window]

 
FIG. 13. Autocorrelation functions of the time series of error residuals for the Br, pentafluorobenzoate, and Li breakthrough curves using (A, C, and E) a classical parameter estimation algorithm without state variable updating, and (B, D, and F) the multiobjective SODA framework with state updating. The solid lines denote the theoretical upper and lower 99% significance intervals of a time series of Gaussian error residuals.

 

Figure 14
View larger version (17K):
[in this window]
[in a new window]

 
FIG. 14. Normalized parameter plots for each of the model parameters using the three-criteria {FBr, FPFBA, FLi} SODA optimization. Each line across the graph denotes a single parameter set: shaded is the Pareto solution set; the dashed lines in red, blue, and black are the optimal solutions for FBr, FPFBA, and FLi, respectively. The plot at the right-hand side represents a three-dimensional projection of the objective space of the Pareto set of solutions. The optimal solutions of the three objectives are indicated with the same colors.

 
Figure 12 shows a comparison of measured and predicted breakthrough curves of Br (Fig. 12A), PFBA (Fig. 12B), and Li+ (Fig. 12C) using the multiobjective SODA method. The dark gray region represents the uncertainty in the model predictions associated with the Pareto uncertainty of the parameters. Additional uncertainty due to conceptual model errors is indicated by the light gray region. Note that the simulated breakthrough curves representing both parameter and conceptual model uncertainty are reasonable and bracket the measured concentrations at the production well for each of the three tracers. Adopting a strategy of recursive state estimation using the measured breakthrough curves of the various tracers facilitates the matching of the observations, and reduces the bias in the model predictions. Perhaps more importantly, the EnKF-computed time series of temporal and spatial variations in state updates along the flow path contains useful information to improve the mixing model. In addition, the output prediction uncertainty ranges associated with the Pareto uncertainty of the parameters is generally small compared with the model conceptual error. This indicates that most of the parameters in the conceptual mixing model are reasonably well determined by optimization against multiple tracer data and that further improvement in the model would require refinement of the conceptual model. We will further elaborate on this below.

A related observation is that the output prediction uncertainty ranges are much larger for the sorbing tracer Li+ than for the nonsorbing tracers Br and PFBA. This finding demonstrates that the conceptual mixing model as currently formulated is appropriate for predicting the subsurface transport processes of nonsorbing tracers, and that most of the error in the model arises from incorrect specification of the sorption processes along the flow paths. Note that the characteristic jumping behavior of the uncertainty bounds in Fig. 12C is caused by an insufficient ensemble size.

It is instructive to contrast the results obtained using the multiobjective SODA approach with a classical multiobjective AMALGAM optimization without state-variable updating. Figure 13 presents the autocorrelation functions of the computed residuals for the SODA method (Fig. 13A, 13C, and 13E) and the classical approach (Fig. 13B, 13D, and 13F). Autocorrelation is a measure of the serial dependency of the error residuals between model predictions and corresponding data. Ideally, this measure should be centered around zero at all lags, implying a lack of bias (systematic errors). Notice that there is significant autocorrelation between the error residuals for the classic multiobjective optimization with AMALGAM without state updating. Although the classical method captures the general trends in the measured tracer data (not shown), the residuals exhibit considerable variation in bias, variance, and correlation structure. Without state updating, the single-pathway conceptual mixing model is unable to track precisely the time series of measured breakthrough data for each of the individual tracers. This results in significant serial correlation between the error residuals, the effects of which clouds the interpretation of the tests. By comparison, there is considerably less autocorrelation between the residuals when performing a multiobjective SODA calibration, suggesting that recursive state adjustments remove a large part of the bias in the model predictions. When properly acknowledging model conceptual uncertainty by state variable updating, the error residuals become serially independent and exhibit normality. This approach should yield more reliable parameter estimates and uncertainty bounds.

Finally, Fig. 14 presents normalized parameter plots of the results for the three-criteria {FBr, FPFBA, FLi} SODA calibration with the AMALGAM algorithm. Each line going from left to right across the normalized parameter plots correspond to a different parameter combination. The gray lines represent members of the Pareto set, whereas the red, blue, and black symbols refer to the optimal solutions for FBr, FPFBA, and FLi, respectively. The eight model parameters are listed along the x axis, and the y axis corresponds to the parameter values, normalized by their initial ranges, as defined in Table 3. Additionally, Table 3 lists the Pareto uncertainty intervals of the parameters estimated with the MOSCEM-UA algorithm. The plot at the right-hand side in Fig. 14 depicts a three-dimensional projection of the tradeoff surface of the three objective functions. The Pareto Rank 1 solutions are indicated with gray points, whereas the optimal solutions corresponding to the three objectives are denoted with the prior defined symbols.

The results presented in Fig. 14 demonstrate that most of the mixing-model parameters are well constrained by optimization against multiple tracer data, in that most of the parameters the Pareto uncertainty intervals occupy a small region interior to the prior-defined, physically plausible uncertainty ranges. Most significantly, the diffusion coefficients are in the same general range as laboratory-determined and field-estimated values (Reimus et al., 2003), despite the fact that a very large prior range was chosen. This suggests that there is sufficient information in the time series of measured breakthrough data to recursively estimate the model states and simultaneously estimate the RTD and tracer-specific diffusion parameters. This result lends strong support to the matrix diffusion model. Also note that the stable parameter {alpha}ST is estimated to be in vicinity of 0.5. For the special case when {alpha}ST = 0.5 and βST = 1 (fixed a priori), the stable distribution represents the Lévy distribution.

It is interesting to note that for some of the model parameters, the Pareto set is discontinuous, with different well-separated clusters of solutions in the parameter space. This is most obvious for the scaling parameters sx and sy, and highlights that different combinations of these parameters result in a similar RTD curve. This conclusion was expected; no particular physical significance can be attached to the parameters of the stable distribution. Rather, the RTD-based approach is designed to allow for flexibility in establishing a nondiffusive RTD to represent all of the site-specific, unknown details associated with the advective flow paths between the wells. The approach provides a sound foundation for identifying the diffusion and sorption processes and parameters in the absence of such detailed flow path information.

The three-dimensional projection of the objective function space of the Pareto set of solutions on the right-hand side of Fig. 14 illustrates that there is significant tradeoff in the fitting of the various breakthrough curves, suggesting that for this model structure, it is not possible to find a single set of parameter values that simultaneously provides an excellent fit to the measured breakthrough curves of each of the three tracers. The most significant tradeoffs occur between the fitting of the sorbing and nonsorbing tracers. This conjecture is consistent with our previous conclusion that most of the error in the conceptual mixing model arises from an insufficiently complex specification of the sorption processes along the flow path. Note that the optimization seeks sorption isotherm parameters that are significantly more nonlinear (lower values of nFR) than would be expected from batch sorption experiments (Anghel et al., 2002). Heterogeneity in sorption parameters within the fractured tuff formation is a possible reason for this result. Concentrations are highest within the short-residence-time paths, and the inverse model is curtailing the degree of sorption in this portion of the mixing model with the only parameter at its disposal, the exponent nFR. A better simultaneous fit to the early and late portions of the Li+ breakthrough curve would probably be obtained with a conceptual model of sorption heterogeneity, whereby short-residence-time paths have lower sorption coefficients than the longer paths. This concept will be explored in future work.


    Summary and Conclusions
 TOP
 ABSTRACT
 INTRODUCTION
 Inverse Modeling: A Review
 Recent Advances in Inverse...
 Case Studies
 Summary and Conclusions
 REFERENCES
 
The application and use of inverse modeling for estimating subsurface flow and transport parameters has found widespread implementation and use in the last few decades. This has resulted in much experience with the method on small soil cores under controlled boundary conditions in the laboratory. With the ever-increasing pace of computational power and the inability of small-core measurements to adequately characterize larger scale flow and transport properties, however, typical application scales of inverse modeling have significantly increased in recent years from the soil core to the field plot and regional scale. Inverse modeling is particularly appealing to derive effective flow and transport parameters at larger spatial scales, because it overcomes many of the difficulties associated with conventional upscaling methods. Unfortunately, larger spatial scales typically involve complex simulation models, which are based on multidimensional governing equations, requiring significant computational resources for simulation and efficient optimization algorithms for model calibration. Moreover, boundary and initial conditions at larger spatial scales are not as well defined because direct measurement techniques are mostly not available. Furthermore, the observational data available to characterize large-scale vadose zone processes are sparse, both in space and time. These measurement limitations require significant additional methodological developments to appropriately address and communicate uncertainty.

We have reviewed the historical development of the current perspectives on inverse modeling in unsaturated flow and transport modeling, and have highlighted some of our recent work on this topic at LANL. In particular, we have discussed a general-purpose multialgorithm evolutionary search approach, called AMALGAM, that can efficiently handle a high number of parameters in complex, distributed flow and transport models and find preferred parameter solutions quickly in multimodal and noisy response surfaces. In addition, we have discussed the implementation of AMALGAM on a distributed computer network using a standard MPI toolbox in Octave (open-source software). Finally, we have discussed a combined optimization and Monte Carlo based sequential nonlinear filtering method to improve the treatment of uncertainty and provide meaningful prediction uncertainty bounds on our model simulations. All these approaches have been illustrated with three case studies addressing the transport of water, vapor, and contaminants through the vadose zone.

Finally, we would like to emphasize that considerable progress in the application of inverse modeling to complex simulation models has also been reported in adjacent fields such as groundwater and surface hydrology. Methods and approaches developed there might be of great use and inspiration to solve inverse modeling problems in vadose zone hydrology. Particularly appealing are approaches that attempt to reduce the dimensionality of the optimization problem using concepts of sensitivity and principle component analysis (Tonkin and Doherty, 2005) and representer-based calibration methods (Valstar et al., 2004). Among emerging methods for uncertainty estimation not included here are Bayesian model averaging (Hoeting et al., 1999; Raftery et al., 2005; Vrugt et al., 2006a; Vrugt and Robinson, 2007b) and its maximum likelihood variant (Neuman, 2003; Ye et al., 2004, 2005). These methods seem particularly appealing because they are relatively easy to implement, accounting jointly for conceptual model and parameter uncertainty by considering a set of alternative models that may be very different from each other while being based on a common set of data.


    ACKNOWLEDGMENTS
 
J.A. Vrugt is supported by a J. Robert Oppenheimer Fellowship from the LANL postdoctoral program. Computer support, provided by the SARA center for parallel computing at the University of Amsterdam, the Netherlands, is highly appreciated. Critical reviews by Jacob Dane, Jan Mertens, and Jirka Simunek greatly enhanced the presentation of this manuscript.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 Inverse Modeling: A Review
 Recent Advances in Inverse...
 Case Studies
 Summary and Conclusions
 REFERENCES
 




This article has been cited by other articles:


Home page
Vadose Zone JHome page
B. B. Mirus, K. S. Perkins, J. R. Nimmo, and K. Singha
Hydrologic Characterization of Desert Soils with Varying Degrees of Pedogenesis: 2. Inverse Modeling for Effective Properties
Vadose Zone J., May 21, 2009; 8(2): 496 - 509.
[Abstract] [Full Text] [PDF]


Home page
Vadose Zone JHome page
P. H. Stauffer, J. A. Vrugt, H. J. Turin, C. W. Gable, and W. E. Soll
Untangling Diffusion from Advection in Unsaturated Porous Media: Experimental Data, Modeling, and Parameter Uncertainty
Vadose Zone J., May 21, 2009; 8(2): 510 - 522.
[Abstract] [Full Text] [PDF]


Home page
Vadose Zone JHome page
J. Simunek and S. A. Bradford
Vadose Zone Modeling: Introduction and Importance
Vadose Zone J., May 27, 2008; 7(2): 581 - 586.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Vrugt, J. A.
Right arrow Articles by Vesselinov, V. V.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Vrugt, J. A.
Right arrow Articles by Vesselinov, V. V.
GeoRef
Right arrow GeoRef Citation
Agricola
Right arrow Articles by Vrugt, J. A.
Right arrow Articles by Vesselinov, V. V.
Related Collections
Right arrow Inverse Procedures/Parameter Estimation
Right arrow Soil Physics
Right arrow Vadose Zone Processes and Chemical Transport


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
The SCI Journals Agronomy Journal Crop Science
Journal of Natural Resources
and Life Sciences Education
Soil Science Society of America Journal
Journal of Plant Registrations Journal of
Environmental Quality
The Plant Genome