VZJ
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Published online 27 May 2008
Published in Vadose Zone J 7:533-546 (2008)
DOI: 10.2136/vzj2007.0173
© 2008 Soil Science Society of America
677 S. Segoe Rd., Madison, WI 53711 USA
This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lovejoy, S.
Right arrow Articles by Schertzer, D.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Lovejoy, S.
Right arrow Articles by Schertzer, D.
GeoRef
Right arrow GeoRef Citation
Agricola
Right arrow Articles by Lovejoy, S.
Right arrow Articles by Schertzer, D.
Related Collections
Right arrow Fractal Approaches
Right arrow Scaling
Right arrow Remote Sensing

SPECIAL SECTION: MULTISCALE MAPPING

Single- and Multiscale Remote Sensing Techniques, Multifractals, and MODIS-Derived Vegetation and Soil Moisture

S. Lovejoya, A. M. Tarquisb,*, H. Gaonac'hc and D. Schertzerd

a Dep. of Physics, McGill Univ., 3600 University St., Montreal, QC H3A 2T8 Canada
b Dep. of Applied Mathematics, E.T.S.I.A. Agricultural Engineering, Univ. Politécnica de Madrid, Ciudad Universitaria s.n. 28040, Madrid, Spain, and CEIGRAM–U.P.M., Madrid, Spain
c GEOTOP, L'Université du Québec à Montréal, Montreal, Canada
d Université de Paris Est, ENPC, CEREVE, 6-8, Avenue Blaise Pascal, Cité Descartes, 77455 Marne-la-Vallee Cedex, France

* Corresponding author (anamaria.tarquis{at}upm.es).

All rights reserved. No part of this periodical may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher.


Received 12 November 2007.



    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 Spectral Analysis
 Multifractal Analysis
 Conclusions
 Appendix: Symbol Definitions
 REFERENCES
 
Scaling processes are increasingly understood to be the result of nonlinear dynamic mechanisms repeating scale after scale from large to small scales leading to nonclassical resolution dependencies. This means that the statistical properties systematically vary in strong, power-law ways with the resolution. When present in geophysical and remotely sensed fields, it implies that when classical (single-scale) remote sensing algorithms are used to determine surrogates of various geophysical fields, they can at most be correct at the unique (and subjective) calibration resolution. Scaling analysis and modeling techniques were applied to MODIS TERRA Bands 1 through 7 and to the standard derived vegetation and soil moisture indices in order to quantitatively characterize the wide range of scaling of these fields. The scaling exponents we found are not so large; however, they act across wide scale ranges and imply large effects. For example, for the statistics near the mean, the MODIS (500-m) resolution would be biased by a factor of ~1.52 when compared with similar results from an "ideal" sensor at 1-mm resolution. Applying the standard index algorithms on lower and lower resolution satellite data, we obtained indices with significantly different statistical properties than if the same algorithm was used at the finest resolution and then degraded to an intermediate value (a difference of a factor ~1.54). This shows that the algorithms can, at best, be accurate at the unique calibration scale and this points to the need to develop resolution-independent algorithms based on the scaling exponents.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 Spectral Analysis
 Multifractal Analysis
 Conclusions
 Appendix: Symbol Definitions
 REFERENCES
 
SOIL MOISTURE is an important hydrologic state variable critical to successful hydroclimatic and environmental predictions. Soil moisture varies in both space and time because of spatiotemporal variations in precipitation, soil properties, topographic features, and vegetation characteristics (Corwin et al., 2006). Monitoring all these processes across multiple scales is a fundamental component of many environmental and natural resource issues and remote sensing has long been identified as a technology capable of supporting such development (Viña et al., 2004; Vereecken et al., 2007).

The geophysical and biological processes that determine vegetation, soil moisture, and other surface characteristics are highly nonlinear; they involve interacting structures from planetary down to millimeter scales. A consequence is that the surface radiance fields are statistically interrelated across comparable ranges. Such statistical interrelations (correlations) are the basis of most remote sensing algorithms. A typical approach identifies wavebands that are particularly sensitive to the surface features of interest—such as the vegetation or soil moisture; then semiempirical algorithms are developed to relate them to the radiances. Typically, the algorithms assume that satellite data are homogenous or roughly homogenous at scales below the resolution.

The problem with this approach is that it ignores systematic scale or resolution dependencies due to strong subpixel variability and singles out a single scale for the calibration. This scale is essentially subjective; it is primarily determined by the available technology. A symptom of this scale problem is that when new, higher resolution satellite data are used in algorithms calibrated at earlier, lower resolutions, the algorithm typically must be recalibrated.

In the last 25 yr, an understanding of these strong resolution dependencies has started to emerge, and systematic techniques are now available for analyzing and modeling such behavior. Motivated by the fractal geometry of sets (Mandelbrot, 1983) and the development of (multifractal) cascade models in turbulence, the origin of this behavior has been traced to nonlinear dynamic mechanisms that repeat scale after scale from large to small scales. Paralleling this revolution in our understanding of the consequences of wide-range scaling is the generalization of the notion of scale itself to encompass systems that are scaling but highly anisotropic (generalized scale invariance, Schertzer and Lovejoy, 1985b; Pecknold et al., 1997; Lilley et al., 2004; Lovejoy and Schertzer, 2007). We illustrate some of these ideas using MODIS data and vegetation and soil moisture surrogates derived from them. Although we briefly review the basic multifractal notions, our aim is to perform systematic scale-by-scale analyses on the MODIS data and surrogates.


    Spectral Analysis
 TOP
 ABSTRACT
 INTRODUCTION
 Spectral Analysis
 Multifractal Analysis
 Conclusions
 Appendix: Symbol Definitions
 REFERENCES
 
The data we used are calibrated MODIS TERRA radiances (product level B1), with wavelengths indicated in Table 1 ; some of the images are shown in Fig. 1 . The vegetation and soil surface moisture indices are standard products described in Rouse et al. (1973) and Lampkin and Yool (2004):

Formula 1A[1a]

Formula 1B[1b]
where {sigma}VI and {sigma}SM are the vegetation and soil surface moisture indices (for a complete list of symbols and definitions, see the appendix) and the Is are the radiances (the band number is indicated in the subscript); Eq. [1] is usually stated without reference to any particular scale. We may immediately note that, although the MODIS TERRA data have a resolution of 500 m (Bands 1 and 2 were degraded from 250 m) and the variability of the radiances and surface features continues to much smaller scales, the surrogates are defined at a single (subjective) resolution equal to that of the sensor. One of the applications of our analyses is to investigate how the relations between the surrogates and the bands used to define them change with scale: we anticipated that since the scaling properties are different, making the surrogates with data at different resolutions would produce fields with different properties.


View this table:
[in this window]
[in a new window]

 
TABLE 1. Characteristics of the MODIS data, taken over Guadalajara (central Spain) over 512 by 512 pixels, each 500 by 500 m (~250 by 250 km2), on 29 July 2006.

 

Figure 1
View larger version (119K):
[in this window]
[in a new window]

 
FIG. 1. Grey-shade images of Guadalajara (central Spain) corresponding to an area of 250 by 250 km. Left column: (top) vegetation index, (middle) Band 1, (bottom) Band 2; right column: (top) soil moisture index, (middle) Band 6, (bottom) Band 7.

 
Before using more sophisticated (multifractal) scale-by-scale analysis techniques, we first used one that is fairly familiar to geoscientists: spectral analysis. In addition to its familiarity, spectral analysis has the advantage of being very sensitive to scale breaks; it is also useful for studying anisotropy. For an intensity field I, we may define the spectral densities P(k):

Formula 2A[2a]

Formula 2B[2b]
where r is a position vector and k is a wave vector. Since P is quadratic in I, it is a second-order statistic. In the definition, we have included the theoretically motivated ensemble average (denoted <>) although, in fact, below we estimate P from a single realization using a fast Fourier algorithm and a standard Hanning window.

In Fig. 2 , we show P estimated for the vegetation and soil moisture indices. The indices were estimated from Eq. [1] at the highest resolution, i.e., they are single-scale surrogates (we discuss multiple-scale surrogates below). To make the contours clearer, we smoothed log10P with a four-point Gaussian smoother. In the figure, we see that the contours are fairly roundish, indicating that, to some approximation, the second-order statistics are isotropic. Careful scale-by-scale analyses of the anisotropy would be rewarding (see, e.g., Pflug et al., 1993; Lewis et al., 1999; Beaulieu et al., 2007), but is outside our present scope. Analysis of the individual radiance bands revealed a degree of isotropy similar to the indices shown in Fig. 2.


Figure 2
View larger version (80K):
[in this window]
[in a new window]

 
FIG. 2. A gray shade rendition of the (log) spectral densities (P) of (a) the vegetation index, and (b) the soil moisture index.

 
The roundness of the P contours justifies the use of the "isotropic" spectrum E(k) obtained by angle integrating P:

Formula 3[3]
where k is the modulus of the wave vector (the notation indicates angle integration in Fourier space). In Fig. 3a , we see E on a log–log plot showing that, with the exception of the single lowest wave number k = 1, E is quite accurately of the power-law form:

Formula 4[4]
where β is the spectral exponent. Note that sometimes angle averaging (rather than integration) is performed so that in two dimensions, the corresponding exponent is β – 1. The advantage of using the present (turbulence-based) definition is that if the process is isotropic, then β is independent of the dimension of space (e.g., one-dimensional sections of two-dimensional data will have the same exponent). We can also see from the figure that Band 5 (with strong artifacts, "banding"), indicated by blue in Fig. 3a nevertheless has good scaling except at harmonics of the basic banding wave number. In Table 2 , we indicate the β values (estimated for k ≥ 2), finding for all the bands β ~ 1.17 ± 0.08, where the uncertainty indicates the band-to-band spread in exponents.


Figure 3
View larger version (20K):
[in this window]
[in a new window]

 
FIG. 3. (a) Spectra of all five bands [E(k)] as a function of the modulus of the wave vector k (and using a Hanning window). In order from top to bottom at the point log10k = 0.7, the curves are: purple = Band 6, black = Band 1, magenta = Band 7, blue (with spikes) = Band 5, light green = Band 2, cyan = Band 4, dark green = Band 3; (b) log spectra compensated by k1.17 [i.e., log10 k1.17 E(k)] vegetation index (red, middle left) with Band 1 (black, top left) and Band 2 (green, bottom left) so that flat curves indicate k–1.17 spectra (note the greatly expanded scale, the fluctuations are small); and (c) spectrum compensated by k1.17 soil moisture (blue, bottom left) with Band 6 (purple, top left) and Band 7 (magenta, middle left).

 

View this table:
[in this window]
[in a new window]

 
TABLE 2. A comparison of the different exponents estimated in this study: β is the spectral exponent, H the basic fluctuation ("conservation") exponent, C1 the codimension characterizing the pure resolution dependence near the mean, {alpha} is the Levy index, which determines how quickly the resolution exponents vary with statistical moment (intensity) of the fields. TM indicates that estimates are based on the trace moments, Struc indicates that they are based on the fluctuations via the structure functions, qD is the exponent of the probability distribution characterizing the extremes, VI is vegetation index, SM is soil moisture index, mss indicates statistics for the multiple-scale surrogate definition of index (all the other statistics are for single-scale surrogates [sss]). For all except {alpha} (which is fit in q space), fits are across the smallest spatial scale to a factor of 4 from the lowest (except TM, which also excludes a small scale factor of 2).

 
The power-law form Eq. [4] is called scaling because it is invariant under isotropic "zooms" (E -> {lambda}–1E, k -> {lambda}k), and the exponent β is "scale invariant." To better judge the quality of the scaling, we can "compensate" the spectrum by dividing by the mean behavior; this is shown in Fig. 3b and 3c (where we have removed the single k = 1 values to magnify the ordinate). Deviations from flatness indicate deviations from k = 1.17 scaling. In this way, we can easily see that the extreme high wave-number factor of roughly 2 in scale has some extra power, presumably because of noise. Also, the single-scale surrogates (vegetation index, in red in the middle of Fig. 3b, and the soil moisture index, in blue on the bottom of Fig. 3c) are both more strongly affected at the highest wave numbers, and their β values are also lower, indicating that they also have greater variability at the lower wave numbers. Presumably, the extra high wave-number variability is because they are the results of a nonlinear operation ([Eq. 1]) at the pixel scale, which breaks the scaling. Note that variability is still present, although it is reduced by Fourier angle integration; the resulting spectrum is typical of multifractal processes. In any case, the largest (log10) deviations from power law (log–log linearity) are <0.1 across the entire range k = 2 to 128 cycles per image; since the corresponding variation of log10E is ~2.5, the spectral scaling holds to better than 4% of this range. Also note that the angle integration smoothes more effectively at the higher wave numbers since there are many more small-scale structures than large ones. Finally, note that the low wave-number break in the scaling (for k = 1) is probably caused by post-processing, which attempts to correct for atmospheric attenuation. In effect, the algorithm works as a low-pass filter, ensuring that the overall intensity across the scene is roughly constant.


    Multifractal Analysis
 TOP
 ABSTRACT
 INTRODUCTION
 Spectral Analysis
 Multifractal Analysis
 Conclusions
 Appendix: Symbol Definitions
 REFERENCES
 
Generalized Structure Functions
As indicated in Eq. [2], the spectrum is a second-order statistic; if we assume statistical horizontal translational invariance (statistical homogeneity; this is the spatial counterpart of statistical stationarity, which refers to statistical translational invariance in time), then the Wiener–Khintchin theorem shows that the spectrum is the Fourier transform of the autocorrelation function. Before the advent of multifractals, it was believed that many turbulent and turbulent-like processes could be well modeled by Gaussian processes in which the statistics are completely determined by P(k) [indeed, isotropic Gaussian processes are completely determined by E(k)]. However, to explain the extreme variability associated with the phenomenon of intermittency, cascade models were developed in which a simple mechanism repeats scale after scale in a multiplicative manner while conserving the mean of the process. The resulting statistical behavior is given by

Formula 5[5]
where {varepsilon}{lambda} is the conserved flux associated with an observable, such as an intensity, at resolution {lambda}, which is the scale ratio across which the process has been developed, L is the outer scale, l is the inner scale, and K{varepsilon}(q) is the (convex) moment scaling exponent (Kolmogorov, 1962; Mandelbrot, 1974). In remote sensing terms, Eq. [5] quantifies the rate at which spatial averaging (i.e., lower and lower resolution pixels, smaller {lambda}) smoothes the various powers of the field {varepsilon}.

The symbol {varepsilon} is used for the cascade process since the prototypical cascade quantity is the turbulent energy flux, denoted {varepsilon}, which is conserved from one scale to another in hydrodynamic turbulence. Note that we are discussing systems far from equilibrium and it is not the energy that is conserved but rather the ensemble mean flux of energy from large scales to smaller scales (in fact, {varepsilon} is only a true flux in Fourier space, i.e., a flux from low to high wave numbers). In terms of the moments in Eq. [5], this conservation implies that the (appropriately normalized) process respects <{varepsilon}{lambda}> = 1 so that K{varepsilon}(1) = 0. Actually, although this ensemble "canonical" conservation is the most general one, it is not the only one possible. Indeed, more restrictive "microcanonical" conservation principles are quite popular—in spite of the fact that the variability of the corresponding multifractal processes is much lower and that such processes do not have simplifying universal behaviors (see Schertzer and Lovejoy, 1992, and below).

The typical inner scales of turbulent processes in the atmosphere are the viscous dissipation scales, which are typically on the order of millimeters or less. In remote sensing applications, while the actual inner scale of the radiances may be of the same order, the satellite averages them to the larger scale l (here 500 m) so that the relevant resolution scale in Eq. [5] is the resolution of the images. In this case, the small subsensor scales average out most of the small-scale variability. In general, however, they will not completely smooth it out; this is the origin of the interesting strong variability of the extremes, which goes variously under the names multifractal butterfly effect, nonclassical self-organized criticality, first-order multifractal phase transitions, and divergence of statistical moments, which are discussed below (for a review of this and other multifractal properties, see Schertzer et al., 1997).

The behavior described by Eq. [5] is called multiscaling because each statistical moment is scaling but with a different exponent. Continuing with the turbulence example, the famous Kolmogorov law for isotropic turbulence relates the energy flux to velocity fluctuations ({Delta}v) across a distance {Delta}x as follows:

Formula 6[6]
where H is the scaling exponent of the fluctuations with respect to the flux and a is the exponent of the flux. The usual interpretation of this equation is that the equality is in the sense of scaling laws so that, taking the qth powers of both sides and ensemble averaging, we obtain:

obtain:

Formula 7[7]
We can see that the fluctuations of typical observables have an extra linear scaling term Hq and we have introduced the (generalized) structure function exponent {xi}(q) (the usual structure function is for q = 2, for a single realization—no ensemble averaging—the latter is called a variogram). From Eq. [6], we can see that the statistics of the fluctuations across distances {Delta}x has two components, the first because it depends of the flux {varepsilon} at low resolution {lambda} = L/{Delta}x, the second because of the scaling relation between the fluctuations and the flux ({Delta}xH); H thus characterizes the distance from the (conserved) pure multiplicative process {varepsilon}; it is the degree of non- (scale-by-scale) conservation of the process.

To check this behavior on the MODIS data and surrogates, we need only estimate the gradients using:

Formula 8[8]
where we have assumed statistical translational invariance and isotropy. It is worth noting that this definition of the fluctuation {Delta}I{lambda} is sometimes called the "poor man's wavelet"; other choices of definition are possible. Wavelets provide a systematic framework for this (see, e.g., Holschneider, 1995); in practice, however, the definition of Eq. [8] is usually adequate, the main restriction being that it is only appropriate when 0 ≤ H ≤ 1, a condition that is usually (although not always) satisfied in geophysical applications (here we will see that H ~ 0.18). For example, when H > 1, one must measure fluctuations with respect to a local linear trend; this can be done either by fractionally differentiating the process (power-law filtering, Schertzer and Lovejoy, 1987), using appropriate wavelets (Bacry et al., 1989), or using the multifractal detrended fluctuation analysis technique (Kantelhardt et al., 2002). Note that by using the Wiener–Khintchin theorem, we obtain a simple relation between the second-order structure function exponent and the spectral exponent: β = 1 + {xi}(2); this is indeed approximately verified (see Table 2).

Using Eq. [8] to estimate the fluctuation {Delta}I{lambda}, we obtain the generalized structure functions shown in Fig. 4a for Bands 1, 2, 6, and 7 and in Fig. 4b for the vegetation and soil moisture indices. We can see that, except for the smallest factor of 4 to 8, the (multiple) scaling is excellent. To quantify the differences in the scaling, in Fig. 5a and 5b we show the slopes that are our estimates of {xi}(q). We can see that (as expected) {xi}(q) is concave downward. It is interesting to note that while the vegetation index has a {xi}(q) value intermediate between its defining band {xi}(q) values, the soil moisture {xi}(q) is somewhat lower, outside the range. It is clear from the figures that a direct quantitative intercomparison of these continuous concave curves {xi}(q) is not possible; we first need to reduce the problem to a finite number of manageable parameters. Although we discuss this in much more detail below, we may already make a first quantification using the exponent H. If we assume that, due to the scale-by-scale conservation of the underlying nonlinear cascade, K(1) = 0, we see from Eq. [6] that H = {xi}(1) [actually, according to Eq. [7], H = {xi}(1) + K{varepsilon}(a), and, in general, we don't know a; however, the correction K{varepsilon}(a) is generally small and will be ignored below]. In Table 2, we compare these estimates, finding that across all seven bands, H = 0.189 ± 0.034 while HVI = 0.162 and HSM = 0.144, so that the surrogates have H values that are a little lower than those of the individual bands. To gauge the significance of this effect across the range of scales in the MODIS images (a factor of 512), we see that in the mean [{xi}(q = 1) = H], the largest and smallest fluctuations differ by a factor 5120.162 = 2.74.


Figure 4
View larger version (32K):
[in this window]
[in a new window]

 
FIG. 4. (a) Structure functions for Bands 1, 2, 6, and 7 (M = <{Delta}I{lambda}q> = <|{Delta}I({Delta}x)|q>, where {Delta}x is the number of pixels of the "lag"; see Eq. [8]) with moments of order q = 0.1, 0.3, ..., 1.7, and 1.9 shown bottom to top, along with fits across the scaling range (eight pixels up to 256); (b) left, vegetation index and right, soil moisture index single-scale surrogate (sss) with moments of order q = 0.1, 0.3, ..., 1.7, and 1.9 shown bottom to top; and (c) left, vegetation index and right, soil moisture index scale by scale surrogate structure functions (mss) with moments of order q = 0.1, 0.3, ..., 1.7, and 1.9 shown bottom to top.

 

Figure 5
View larger version (9K):
[in this window]
[in a new window]

 
FIG. 5. Single-scale structure function exponents [{xi}(q)] for (a) the vegetation index (blue, middle) with Bands 1 (black, top) and 2 (green, bottom), and (b) the soil moisture index (purple, top) with Bands 6 and 7 (pink, bottom and middle, respectively); and (c) multiscale {xi}(q) for the vegetation index (blue, top) and the soil moisture index (red, bottom).

 
It is helpful to compare these H values with those of other geophysical fields. We have already mentioned that the classical Kolmogorov turbulent result is H = 1/3. Another classical theoretical H is for passive scalars in turbulence; the Corrsin–Obhukhov law also has H = 1/3. Empirically, the topography has H = 0.4 to 0.7 (oceans and continents, respectively; Gagnon et al., 2006), whereas many infrared and visible signals from volcanic and other surfaces give H values quite close to those found here (e.g., Laferrière and Gaonac'h, 1999; Harvey et al., 2002; Gaonac'h et al., 2003; see also Lovejoy and Schertzer, 2006, for cloud radiances). In addition, Lovejoy et al (2008 and unpublished data) analyzed 10 radiance bands spanning the visible, near infrared, thermal infrared, and passive microwave wavelengths across the range ~8 to 20,000 km and showed that, to within ~1%, the latter respects the predictions of cascade models (Eq. [5]) with outer scales in the range 5000 to 18,000 km (depending on the wavelength). Also depending on the wavelength, the corresponding H values were found to be in the range 0.2 to 0.5.

It remains to characterize the nonlinear K(q). Although we do this more fully below, we can nevertheless go a step further by characterizing the slope of the {xi}(q) curve near the mean, i.e., {xi}'(1) = HK'(1). Defining C1 = K'(1) as the codimension of the mean (see below for the justification of this terminology), we obtained the mean for all seven bands: C1 = 0.032 ± 0.010 while C1VI = 0.033 and C1SM = 0.010. Although these values appear small, typical values in turbulence are only a bit larger, e.g., C1 ~ 0.07 for horizontal wind (Schmitt et al., 1992, 1994) and C1 ~ 0.04 for passive scalars in the horizontal (Lilley et al., 2004). The corresponding values in the vertical are about 1.8 times larger; this is as predicted by the 23/9D model of scaling stratification (Schertzer and Lovejoy, 1985a). Finally, the topography has C1 ~ 0.12 (pretty much the same for both continents and oceans). For an early review of these and other results, see Lovejoy and Schertzer (1995).

Since C1 characterizes the statistics of {varepsilon} near its mean, it (partially) quantifies the effect of pure changes in satellite (pixel) resolution. For example, using the mean band value C1 = 0.032, we see that for MODIS we may expect that the bias due to a 500-m resolution with respect to an "ideal" 1-mm-resolution satellite (i.e., assuming that the smallest scale of variability in the radiance field is 1 mm) is (500/0.001)0.032 = 1.52. This bias will be important when trying to calibrate satellite-derived quantities from in situ measurements.

Multiscale vs. Single-Scale Surrogates
We have already mentioned above that the vegetation and soil moisture surrogates are defined in an ad hoc way using the (subjective, finest available) resolution. The fact that the surrogates have different scaling [{xi}(q)s] implies that if the surrogates are defined at different resolutions, their statistical properties will be different; in other words, the single-scale surrogate can be (at most) correct at a single resolution. To see this more clearly, these single-scale surrogates [sss, at scale {lambda}, {sigma}{lambda}(s)] can be contrasted with the corresponding multiscale surrogates [mss, at scale {lambda}, {sigma}{lambda}(m)]. Mathematically, the difference can be expressed as

Formula 9A[9a]

Formula 9B[9b]
where the maximum scale ratio (satellite image scale/single pixel scale) L/l = {Lambda} and the notation (Ii,{Lambda}){lambda} and [{sigma}{Lambda}(s)]{lambda} denotes averaging from this finest resolution {Lambda} up to the intermediate resolution {lambda} < {Lambda}. The mss is the surrogate that would be obtained by applying an identical algorithm (Eq. [1]) to satellite data at the lower resolution {lambda}, whereas the sss is the surrogate at the same resolution {lambda} but based on the {Lambda} scale calibration. In Fig. 4c, we show that the resulting {xi}mss(q) obtained across the same range of scales as the single-scale surrogate {xi}sss (Fig. 4b) is quite different (it is statistically significantly larger; see below). To partially quantify this difference, we see from Table 2 that HVImss = 0.231 and HSMmss = 0.213, which are larger than the sss estimates above: HVIsss = 0.162 and HSMsss = 0.144. Similarly, determining the corresponding C1 values we find the mss values C1VImss = 0.049 and C1SMmss = 0.059 (c.f., the sss values C1VIsss = 0.033 and C1SMsss = 0.010, which are substantially smaller, especially the soil moisture values). Note that whenever an exponent is quoted without the indication of sss or mss, the sss value is understood; we add it explicitly here only to contrast it with the mss estimates. These statistical differences imply that if the indices are calculated at the MODIS resolution {Lambda} and then degraded to an intermediate resolution {lambda}, that the results would be quite different than if the same algorithms were used but directly on data at a lower resolution. The problem gets worse as the range of scales increases because the scaling exponents are different. For example, the exponent difference HSMmssHSMsss = 0.213 – 0.144 = 0.069 so that the difference in the mean index fluctuations across the range of scales of the MODIS image is about a factor 5000.069 ~ 1.54. Alternatively, considering the effect of the inadequate MODIS resolution [500 m rather than, say, a scale of ~1 mm, where "ideal" indices might be defined, i.e., {lambda} = (500/0.001)], then the ratio between the bias due to pixel size in the soil moisture statistics near the mean is {lambda}C1SMmss–C1SMsss=1.90 (with C1SMmssC1SMsss = 0.059 – 0.01 = 0.049). We therefore conclude that due to the scale dependence of the data, the surrogate indices cannot be valid across a very significant range of scales. This scale dependence must at the very least be taken into account during calibration with in situ data, which are typically at much smaller resolutions. This underscores the need to develop scale-invariant algorithms based on the scale-invariant exponents; see Lovejoy et al. (2001) for a discussion.

Trace Moment Analysis
We have seen that the generic statistical properties of nonlinear processes that are repeated scale after scale are characterized by a nonlinear exponent K(q), and that the observables will generally have an extra linear scaling term Hq. Since at least for low q the linear term Hq is often larger than the nonlinear K(q), in analyses it can mask the nonlinear term. It is therefore advantageous to first estimate the conserved flux {varepsilon} from the observed v and then estimate K(q) directly. From Eq. [6], we see that, in principle, this can be done by removing the {lambda}H scaling. To do this, note that if we start with a field {varepsilon}a and fractionally integrate it by H (a power-law filter kH), the resulting field will have the fluctuation statistics indicated by Eq. [6] (see Marsan et al., 1996). This suggests that, to obtain a flux from v (i.e., a conserved field with H = 0), it suffices to invert the power-law filter, i.e., to fractionally differentiate by an order H. It turns out that a finite difference approximation to this is obtained by an integer order differentiation H' > H followed by taking the absolute value of the result (the absolute value is necessary since the multiplicative cascade {varepsilon} > 0; see, however, complex and vector cascades [Schertzer and Lovejoy, 1995]). Since, usually, 0 ≤ H ≤ 1, a first-order finite difference is sufficient. One simply takes the absolute differences at the finest available resolution {varepsilon}{Lambda} and degrades them:

Formula 10[10]
where again the notation ({varepsilon}{Lambda}){lambda} indicates the average of the finest resolution data {varepsilon}{Lambda} across the intermediate resolution {lambda}. The proportionality constant (lH) is unimportant (see [Eq. 6] with v replaced by I and with a = 1, {Delta}x = l ). In one dimension (e.g., series), one can simply use {Delta}I{Lambda}(x) = I(x + l) – I(x) where the inner scale is l = L/{Lambda}, where L is again the outer scale. In higher dimensions on data on regular grids (as here), if the data are fairly isotropic, then we can simply treat each line separately and then use the one-dimensional method; however, there are more possibilities. For example, we may use finite difference moduli of gradient vectors, or finite difference Laplacians (the latter being an isotropic order 2 derivative). In two dimensions, this yields, respectively,

Formula 11A[11a]

Formula 11B[11b]
(l = L/{Lambda} is the interpixel distance). These and other variants have been extensively tested on numerical simulations and on data (see, e.g., Tessier et al., 1993; Lavallée et al., 1993); there is generally not much difference between the various choices (or between direct fractional differentiation using Fourier techniques). Due to analogies with quantum statistical mechanics, the methods based on degrading the resolution of the fluxes and averaging across the available samples is called the trace moment technique (Schertzer and Lovejoy, 1987). It is distinguished from similar partition function methods (Halsey et al., 1986) in that, in addition to spatial averaging, it also involves ensemble averaging. Since here we apply the method to individual realizations (images), the two are equivalent.

In Fig. 6a and 6b , we show the results on the sss indices. The plot shows various statistical moments of {varepsilon}{lambda} at various intermediate resolutions obtained by spatially averaging {varepsilon}{Lambda} obtained from the modulus of the gradient vector across larger scales (smaller scale ratios) {lambda}. Since the intermediate scales {Delta}x are inversely proportional to the scale ratio ({lambda} = L/{Delta}x), the smallest scales are on the right, the largest scales are on the left. As a first remark, we see that the overall range of variability is quite a bit smaller than for the corresponding structure functions; this is because for the moments shown (q < 2), the Hq term is larger than the nonlinear K(q) term. Notice that for q > 1, the effect of spatial averaging is to decrease the values (K > 0), whereas for q < 1, it increases them (K < 0). Next, we can see that, due to the nonlinear operations at the smallest scales (the definition in terms of the various bands, the modulus of the gradient operator), the scaling is not so good at the finest resolutions; the exponents K(q) (the slopes on the log–log figure) were estimated across the intermediate range shown. At the very largest scales, the scaling is also poor partly because of poor statistics (there aren't very many large-scale structures), but also because of the low wave-number atmospheric corrections mentioned above.


Figure 6
View larger version (19K):
[in this window]
[in a new window]

 
FIG. 6. Log–log plot of trace moments (<{varepsilon}{lambda}q>) using 20 values of scale ratio {lambda} per order of magnitude in scale: (a) vegetation index; (b) soil moisture index. From bottom to top, q = 0.2,0.4, ..., 1.6, and 1.8.

 
To quantify the scale-by-scale statistical differences between the bands and the indices, we can use the scaling of the moments (the slopes in Fig. 6) to estimate K(q) and compare the various moment scaling exponents; this is done in Fig. 7a , 7b, and 7c. Again, we can quantify the differences by the behavior near q = 1; numerically evaluating the slope C1 = K'(1), we obtain a second series of estimates shown in Table 2. For the bands, these are very close to the previous estimates (for the seven bands: C1 = 0.0367 ± 0.001, c.f. the previous value 0.032 ± 0.010), but for the surrogates, the values are somewhat different: C1VI = 0.064 and C1SM = 0.053, compared with the structure function estimates of 0.033 and 0.010, respectively. The fact that the two methods agree so well for the bands but not so well for the surrogates is due to the poorer scaling of the latter, itself due to their scale-dependent definitions.


Figure 7
View larger version (11K):
[in this window]
[in a new window]

 
FIG. 7. Trace moment determination of the scaling exponent of the qth order moment of the field [K(q)] for: (a) vegetation index (purple, top right) and soil moisture index (green, bottom right); (b) vegetation index (purple, top right) and Band 1 (orange, middle right) and Band 2 (green, bottom right); and (c) soil moisture index (green) and Band 6 (blue, bottom right) and Band 7 (purple, middle right).

 
Multiscaling the Probabilities and the Probability Distribution Multiple Scaling Technique
We have concentrated on the statistical moments as a simple method of characterizing scale-invariant fields; however, the moments are integrals of the probability densities, and the multiscaling of the former implies the multiscaling of the latter. In addition, under fairly general conditions, the relationship can be inverted so that knowledge of the scaling of the moments determines the scaling of the probabilities. In particular, we obtain

Formula 12[12]
which states that the probability (Pr) that a (resolution {lambda}) value {varepsilon}{lambda} exceeds the threshold {lambda}{gamma} is a power of the resolution with exponent c({gamma}) (Schertzer and Lovejoy, 1987). The approximation sign is to within constant or slowly varying factors (e.g., including subexponential factors such as log{lambda}). Note that although, in the above, we used the expression probability distribution, it is not strictly appropriate since it represents the integral of the probability density from {lambda}{gamma} to {infty} rather than the usual –{infty} to {lambda}{gamma} (the latter would correspond to a true cumulative distribution function). The exponent {gamma} is called a singularity since it quantifies the rate at which the field values {varepsilon}{lambda} diverge in the small-scale limit {lambda}->{infty} (if {gamma} < 0, it is in fact a regularity). The function c({gamma}) defined via Eq. [12] is called the (statistical) codimension function (Schertzer and Lovejoy, 1987) or sometimes the Cramer function (Mandelbrot, 1989). If it is low enough [c({gamma}) < d, where d is the dimension of the embedding space; d = 2 for single images], then d({gamma}) = dc({gamma}) is the (geometrical) fractal dimension of the set of points (at resolution {lambda}) with {varepsilon}{lambda} > {lambda}{gamma}. While the geometrical d({gamma}) must satisfy 0 ≤ d({gamma}) ≤ d, the only restriction on c({gamma}) is that it is nonnegative.

The use of c in place of d is necessary in stochastic processes since events with c > d [i.e., those that would have an impossible negative geometrical dimension d({gamma}) < 0] correspond to events that are almost surely absent in a d-dimensional space but on the contrary are almost surely present in a large enough sample. Indeed, if we have Ns independent realizations of the process, each across a range of scales {lambda}, then the effective or sampling dimension of the sample is Ds = logNs/log{lambda}, and one would expect to find all levels of activity up to extreme events such that c({gamma}s) = d + Ds, where {gamma}s is the sampling singularity (Schertzer and Lovejoy, 1989). In particular, there will be a largest moment qs = c'({gamma}s) that can be estimated from a finite sample of Ns realizations; for q > qs, the K(q) becomes (spuriously) linear following the tangent K'(qs).

As mentioned above, knowledge of K(q) is equivalent to knowledge of c({gamma}) and visa versa; indeed, the two are conveniently related via Legendre transformations (Parisi and Frisch, 1985):

Formula 13[13]
This establishes a one-to-one correspondence between orders of singularity {gamma} and statistical moments q: {gamma} = K'(q) and q = c'({gamma}). We can now see the justification for the term C1 = K'(1) introduced above; in a precise sense, it corresponds to the singularity that gives the dominant contribution to the mean; in addition, since K(1) = 0, we find C1 = c(C1) is also the codimension of the {gamma} = C1 singularities.

Before using Eq. [12] to directly estimate c({gamma}), we recall that the approximation sign in Eq. [12] means that using the approximation c({gamma}) ~ –logPr/log{lambda} is not so accurate. It is therefore better to calculate the histograms at a series of resolutions {lambda} and then, for a fixed {gamma} = log{varepsilon}{lambda}/log{lambda}, regress logPr against log{lambda}; the slope is –c({gamma}). This is the probability distribution multiple scaling technique (Lavallée et al., 1991). We show the result in Fig. 8a, 8b, and 8c . As can be seen at the larger {gamma} (the extremes), the statistics become poor so that even though, in principle, there is information up until c = d = 2, it is very noisy. The extremes can be examined by determining the histograms directly at the highest resolution. This is especially important since, in general, as mentioned above, in general "canonical" cascade processes there is a fundamental distinction that must be made between the "bare" cascade, quantities {varepsilon}{lambda},b, i.e., that developed from the outer scale L down to a smaller (but finite) scale l, and the "dressed" process, {varepsilon}{lambda},d, obtained by averaging ("dressing") a bare process down to the small-scale limit (this is a good approximation to measured fields, which typically have variability much below that of their averaging resolution). While the former has all its positive moments converge [the corresponding bare Kb(q) is always finite], the dressed process is more variable; it generally has diverging moments for all moments q > qD where qD is a critical exponent dependent on the dimension D across which the process is averaged (Mandelbrot, 1974; Schertzer and Lovejoy, 1987) (Kd(q) = Kb(q) for q < qD). In general,

Formula 14[14]
In practice, the moments of a finite data set are always finite; nevertheless, the divergence of moments implies that the K(q) estimated for q > qD will be dominated by a single (largest) singularity, the corresponding scaling will be spurious, and there will be discontinuity in the slope of K(q) at q = qD. Since there is a mathematical analogy between classical thermodynamics and multifractals, the discontinuity K'(q) at q = qD is called a first-order multifractal phase transition. Another term for the phenomenon is the multifractal butterfly effect (Lovejoy and Schertzer, 1998), a term that is justified because—just as for the classical deterministic chaos "butterfly effect"—the large-scale moments for q > qD are in fact determined by the small-scale details. Finally, the divergence of moments occurs in combination with scaling fractal structures (indeed, it is the because of the buildup of such strong variability from large to small scales that the spatial averaging across finite sets cannot sufficiently "calm" the higher order statistical moments). Elsewhere in statistical physics, the combination of fractal structures with algebraic extreme probabilities has been called self-organized criticality (SOC; Bak et al., 1987); hence, we see that multifractals provide a "nonclassical" route to SOC (Schertzer and Lovejoy, 1994). Actually, this nonclassical SOC may be more generally physically relevant since it based on a (quasi-) constant flux, whereas the classical SOC is in fact only strictly valid in the (often unrealistic) zero-flux limit.


Figure 8
View larger version (13K):
[in this window]
[in a new window]

 
FIG. 8. The codimension function [c({gamma})] of the set of points with singularities {gamma} for: (a) the seven bands, in order from top to bottom at the point {gamma} = 0.23 (below the large spike in the blue curve): blue (with spikes) = Band 5, purple = Band 6, dark green = Band 3, cyan = Band 4, orange = Band 1, magenta = Band 7, and light green = Band 2; (b) the vegetation index (red) and Bands 1 and 3 (yellow and green, respectively); and (c) the soil moisture index (orange, bottom line) and Band 6 (purple, top) and Band 7 (magenta, middle). The straight reference lines (black) have slopes of 6.6.

 
To check for the divergence of moments, we therefore calculated the probability distributions of {varepsilon}{lambda} (see Fig. 9a and 9b ). Due to the slight excess smoothing at the highest factor of 2 in scale (see the spectra, above), we averaged the (gradient-estimated) {varepsilon}{Lambda} across two by two pixels. We can see that the forms of the extreme probability tails are roughly the same for the indices and the bands from which they were derived, but it is not obvious that any asymptotic linear behavior is followed. The problem is that it is notoriously difficult to estimate the exponent of a probability tail, especially when theoretically we know neither the value of the exponent nor where the asymptotic regime begins. To quantify the tail behavior, we estimated the tail slope for the greatest factor of 2 in scale (see Table 2); for the Bands 1, 2, 6, and 7, we obtained a mean qD ~ 6.64 (the mean of all the bands is 6.4 ± 1.4) and this reference slope is placed on the distributions in Fig. 9a and 9b. We see that the agreement is quite suggestive (especially when we consider that various instrumental [smoothing] effects could be responsible for artificially reducing the values of the extreme gradients used to estimate {varepsilon}). The evidence for algebraic tails is apparently strongest for the soil moisture surrogate and the corresponding Bands 6 and 7. Alternatively, we note that the absolute logarithmic slope in the tails in Fig. 9 yields the maximum moments that can be accurately measured with the available sample, qs. It may be that qs < qD so that the data set is simply too small to observe qD (recall that as Ns increases, so does qs).


Figure 9
View larger version (19K):
[in this window]
[in a new window]

 
FIG. 9. Log–log plot of the probability (Pr) that a value {varepsilon}{lambda} (indicated by the threshold s), with resolution {lambda}, exceeds the threshold {lambda}{gamma} vs. the order of singularities ({gamma}) (see Eq. [12]): (a) vegetation index (blue, top right), Band 1 (orange, middle right) and Band 2 (green, bottom right); and (b) soil moisture index (bottom right), Bands 6 and 7 (middle and top right, respectively).

 
A Complete, Manageable, and Physically Motivated Characterization of the Processes: Universal Multifractals
We have argued that a general feature of processes whose mechanism repeats across a wide range of scales is that they are scaling, characterized by convex moment functions [K(q), c({gamma})]. If this was all that we could deduce, then multifractals would be quite unmanageable: theoretically, it would mean that an infinite number of parameters were important [e.g., each value of K(q)] or empirically it would require the determination of an infinite number of coefficients (e.g., regression slopes). In physics, however, it has generally been found that when processes are iterated sufficiently or if there is a large enough number of interactions, then only a few of their characteristics are important in the limit of many interactions or iterations. This is the idea of "universality." Perhaps the oldest and most familiar example of universality is the central limit theorem in probability theory, which is routinely invoked to justify assumptions about Gaussianity of measurement noises. Although the Gaussian example is well known, the full result—the generalized central limit theorem (Levy, 1925) for sums of independent identically distributed random variables—is less known. It states that if one appropriately centers and normalizes sums of a sufficiently large number of independent identically distributed random variables with finite variance, the result is indeed a Gaussian; however, if the finite variance requirement is dropped (so that one allows for algebraic distributions with exponents {alpha} < 2), then one obtains Levy distributions. This result is relevant to cascade processes because if one considers the "generator" of the process, log{varepsilon}, then this is the sum of the logarithms of the random cascade factors, hence one expects the generalized central limit theorem to apply to the logarithms. This would lead to logarithmic Gaussian and logarithmic Levy multifractals. Although this basic idea turns out to be correct, the nontrivial small-scale limit of the cascade process has obscured the issue, even leading to strong statements that there are no universal multifractal properties (Mandelbrot, 1989). Indeed, to obtain universal properties, one must consider the universality issue on cascades developed only across a finite range of scales, and only then—after central limit convergence has been achieved—consider the small-scale limit (see Schertzer and Lovejoy, 1997). The resulting bare universal multifractals have the following exponent functions:

Formula 15[15]
The above Legendre transformation pairs (Schertzer and Lovejoy, 1987) are valid for 0 ≤ {alpha} < 1 and 1 < {alpha} ≤ 2 [when {alpha} = 1, K(q) = C1qlogq]. Also, 0 ≤ C1 ≤ d, since if C1 > d, the process cannot be normalized on the subspace. The restriction to q ≥ 0 is necessary; since generally for {alpha} < 2, the q < 0 moments diverge. This is why we did not plot them in Fig. 5 and 7 (the exception is the {alpha} = 2 lognormal multifractal). Once again, the results for the dressed cascades apply, so that these formulae are only valid for q < qD and {gamma} < {gamma}D = K'(qD). Because of this divergence, the terms log-Levy and log-Gaussian are misnomers; the probability tails are actually somewhat stronger than those terms would imply. It could be mentioned that in the literature, a weaker form of universality has been proposed leading to log-Poisson multifractals (She and Levesque, 1994). While log-Poisson multifractals share the infinite divisibility properties of the universal (Levy-generator) multifractals (necessary for cascades continuous in scale), the corresponding processes are neither stable nor attractive so that it would be surprising for them to emerge from complex real-world processes (see Schertzer et al., 1995).

The universal form of Eq. [15] shows that only two parameters are needed to specify the conserved flux of universal multifractals ({varepsilon}); for the "observables" (v), we need the third parameter (H). We have seen above that two of these parameters (H and C1) can be estimated from {xi}(1) and {xi}'(1); the above shows that only a single additional parameter ({alpha}) is needed to characterize the entire process. This "Levy index" {alpha} can be estimated either from the radius of curvature of {xi}(q) [or K(q)] at q = 1 (equivalently, determined from the second derivative), or—in practice more accurately—by considering K(q) or {xi}(q) across a wider range of q values and exploiting the dependence on q{alpha}. We tested one of these methods (Schmitt et al., 1995) on the MODIS data (for another popular method, double trace moments, see Lavallée et al., 1993). As long as {xi}'(0) is finite, we may remove the linear part of {xi}(q) by considering the power-law "residue" r(q):

Formula 16[16]
Applied to universal multifractals (as long as {alpha} > 1), we have

Formula 17[17]
Hence we can estimate {alpha} by a linear regression of log10r(q) vs. log10q. We show this in Fig. 10a and 10b using q values from 0.1 to 3 at intervals of 0.1 [{xi}'(0) is estimated from a local quadratic fit to {xi}(q) with q = 0, 0.1, and 0.2]. As can be seen, the residue is accurately a power law; for the bands, we find (Table 2) {alpha} = 1.91 ± 0.03, while for the indices, we find {alpha}VI = 2.02 and {alpha}SM = 2.23, both of which are slightly outside the theoretically allowed range for universal processes (it can thus only be an approximate empirical characterization). This is presumably a further consequence of the poor scaling of the indices (but not the radiances). Finally, we compare these results to those of a nonlinear regression on the trace moment estimate K(q) curve. Due to the presence of the linear term in the universal K(q) (Eq. [15]), the nonlinear regression is not as accurate. We might also note that a potential problem with it is that one must only make a fit for q < min(qs, qD) [recall that for q > qs or q > qD, Kd(q) becomes linear and is spurious], and a priori, the appropriate values of qs and qD are unknown. Here we have seen that the most extreme q that can be used is on the order q ~ 6 or greater and we made our nonlinear regressions using the relatively well estimated moments q ≤ 2.


Figure 10
View larger version (13K):
[in this window]
[in a new window]

 
FIG. 10. The nonlinear part of the scaling exponent of the qth order moment of the field [r(q) = q{xi}'(0) – {xi}(q)] for: (a) the vegetation index (blue, bottom left) and Band 1 (orange, top left) and Band 2 (green, middle left); and (b) soil moisture (red, bottom) with Bands 6 and 7 (pink and blue, respectively, top, nearly superposed).

 
Using these universal multifractal parameters, we can make multifractal simulations with the same statistics or resolution dependencies (Schertzer and Lovejoy, 1987; Wilson et al., 1991; Pecknold et al., 1993); see Fig. 11a and 11b for examples with roughly the same universal multifractal parameters as the radiances. For more simulations, showing the effect not only of varying the H, C1, and {alpha} parameters but also of various scaling anisotropies, see Lovejoy and Schertzer (2007) and also the multifractal explorer site: www.physics.mcgill.ca/~gang/multifrac/index.htm (verified 14 Feb. 2008).


Figure 11
View larger version (74K):
[in this window]
[in a new window]

 
FIG. 11. Two simulations to illustrate the effect of a small amount of anisotropy with universal multifractal parameters of basic fluctuation exponent (H) = 0.18, codimension characterizing pure resolution dependence near the mean (C1) = 0.05, and Levy index ({alpha}) = 1.9: (a) the same anisotropy at all scales, so that the scale-changing operator is a standard "zoom" but on an anisotropic initial structure, the degree and type of anisotropy is independent of scale; and (b) differential anisotropy with the direction and elongation of structures slowly changing with scale and the scale-changing operator itself anisotropic so that the degree of anisotropy changes with scale.

 

    Conclusions
 TOP
 ABSTRACT
 INTRODUCTION
 Spectral Analysis
 Multifractal Analysis
 Conclusions
 Appendix: Symbol Definitions
 REFERENCES
 
Due to extreme variability across huge ranges of scale, it is often difficult or impossible to obtain adequate quantities of in situ data; remotely sensed surrogates are attractive alternatives. The classical approach combines remote data from judiciously chosen radiance bands using semiempirical algorithms that are calibrated at the best available resolutions. The problem is that such algorithms are predicated on the classical geostatistical assumption that the fields of interest are sufficiently regular and smooth so that they have no significant resolution dependencies. In effect, they postulate a priori that the subpixel variability is not serious. In this study, we have argued that on the contrary, the fields are multifractal, having strong resolution dependencies (they are singular with respect to Lebesgue measures). This implies that, at best, the surrogate fields derived in this manner can be correctly calibrated at only a single resolution; they will be incorrect at other resolutions. This shows that new, resolution-independent algorithms must be developed.

In this study, we illustrated these ideas using soil moisture and vegetation indices and surrogates obtained from red and far red MODIS satellite imagery. We systematically reviewed the key notions and analysis methods that have emerged after more than 20 yr of study of cascades and other multifractal processes. First, using the classical technique of Fourier analysis, we showed that the basic isotropic scaling was reasonably well obeyed across most of the available range of scales (here, a factor of 512). We then considered the statistics of fluctuations, which can be systematically analyzed as functions of scale and intensity (by varying the order, q) using generalized structure functions {xi}(q). This showed that the basic radiance bands had good multiscaling [i.e., scaling with concave {xi}(q)], motivating the introduction of two different surrogates: the first, the classical sss, is defined by a nonlinear combination of radiances at the finest resolution; the second, the mss, is based on the same algorithm except that it is defined on successively lower resolution satellite radiances. Although the basic algorithm is identical (the surrogates are equal to the difference in radiances at two bands divided by their sums), their resolution dependencies were found to be different. This is a consequence of the scale dependence of the radiances and the nonlinear operation defining the surrogates. The fact that the scale-by-scale properties of the sss and mss surrogates are different (a fact quantified further with the parameters H and C1) shows that unless the finest resolution of the satellite data just happens to coincide with the inner scale of the soil moisture and vegetation indices, the surrogates cannot be more than rough approximations to the indices, valid only near the resolutions at which they were calibrated. In particular, across the MODIS range of scales (a factor 512), we showed that the amplitude of the mean statistics fluctuation in mss and sss was ~1.54.

We then went on to make a more complete scale-by-scale statistical characterization of the data and surrogates, showing how to obtain and analyze the underlying conserved fluxes using a technique called trace moments, as well as a scale-invariant characterization of the probability distributions, called the probability distribution multiple scaling technique. These analysis techniques were originally developed to quantify multifractal turbulent processes and we reviewed the relevant theory while explaining the various techniques. At this level, the intercomparison of the scaling involves two exponent functions, one for the moments [K(q)], one for the probabilities [c({gamma})]. Although these are related by a Legendre transform—so that they contain essentially the same information—they still effectively involve an infinite number of parameters (e.g., a different exponent for each K or c value). To make the problem manageable, we argued that in the real world, multifractal processes are likely to be in the basin of attraction of the three parameter "universal" multifractal processes so that analysis requires the estimation of just three parameters: in addition to H and C1 mentioned above, the Levy index {alpha}, which characterizes the degree of multifractality. We then estimated the remaining {alpha} index. Overall (Table 2), we found the scaling parameters (H, C1, and {alpha}) were not too different for the different bands but were significantly different for the surrogates (which also had scaling that was less accurately followed, a consequence of the nonlinear transformation at the smallest scale). Even though the exponents H and C1 seem small, because they act across wide ranges of scale they lead to large effects. For example, we pointed out that the difference in the mean intensity fluctuation across the MODIS image scale range (factor of 512) was about 1.47 depending on whether the indices are defined scale by scale or at a fixed scale. In contrast, there is another multifractal approach to remote sensing (Lévy-Vehel and Mignot, 1994; Lévy-Vehel, 1995) that is agnostic about the existence of wide-range scaling; in practice it only considers very small ranges of scale (e.g., factors of 2–4) so that it leads to applications that are not too different from classical remote sensing techniques.

The similitude of the scaling properties of the different radiance bands does not mean that overall they have the same statistics; the analysis techniques applied in this study are essentially isotropic. Even though by spectral analysis we showed that the anisotropy is not too large, the (unanalyzed) deviations from isotropic scaling can nevertheless lead to significant variations in the morphologies of structures. This is perhaps most graphically shown in the simulations (Fig. 11). Systematic analysis of the scaling anisotropy is not easy, however, and is outside our present scope.

Since our conclusions are based on an analysis of a single scene (albeit at seven wavelengths with nearly 2 x 106 pixels in all), we expect to find differences when data from other sites are analyzed. Since the qualitative (multiscaling) behavior that we found is theoretically expected, and the corresponding exponents are very fundamental, we anticipate that differences will be primarily in the (possibly large, but "normal") sample-to-sample variability of multifractal fields coupled with possible differences in the scale-by-scale anisotropy mentioned above.

Although we criticized classical scale-bound remote sensing algorithms, we did not propose a specific alternative. The general problem is to obtain a scale-invariant characterization of the statistical interrelation between the various radiances or fields. The basic approach is to consider a "state vector," with each relevant field represented by a different component. We are thus led to scale-invariant vectors (Lie cascades, Schertzer and Lovejoy, 1995), an approach that has been discussed to some extent in Lovejoy et al. (2001); however, very little is known about vector multifractals or about the appropriate analysis techniques.


    Appendix: Symbol Definitions
 TOP
 ABSTRACT
 INTRODUCTION
 Spectral Analysis
 Multifractal Analysis
 Conclusions
 Appendix: Symbol Definitions
 REFERENCES
 

Ii Intensity in band i (Eq. [1])
{sigma}VI, {sigma}SM Vegetation index (VI), soil moisture index (SM) (Eq. [2])
P Spectral density (Eq. [2])
E Spectrum (angle integral of P; Eq. [3])
k Wave vector (Eq. [2])
r Position vector (Eq. [2])
β Spectral exponent (Eq. [4])
L Largest scale in the system, here typically the image scale
l Smallest scale in the system, here typically the pixel scale
{lambda} Scale ratio: largest/resolution scale
{Lambda} Maximum scale ratio: L/l, {Lambda} ≥ {lambda}
{varepsilon}{lambda} Conserved flux associated with an observable, such as an intensity, at resolution {lambda} (Eq. [5])
{varepsilon}{lambda}, b The "bare" {varepsilon}{lambda}, i.e., the result of a cascade over a ratio {lambda} that stops at scale {lambda}
{varepsilon}{lambda}, d The "dressed" {varepsilon}{lambda}, i.e., the result of a cascade down to infinitely small scales that is then averaged to resolution {lambda}
K(q) Scaling exponent of the qth order moment of field quantifying how the qth order statistics change with resolution {lambda} (Eq. [5])
q Order of statistical moments (e.g., Eq. [5])
qs Highest order statistic that can be reliably estimated with Ns samples at dimension d.
qD Critical dimension for the divergence of statistical moments, equal to the asymptotic (absolute) probability distribution exponent
{Delta}v{lambda} The velocity radiance across a distance {Delta}x =L/{lambda} (Eq. [6])
{Delta}I{lambda} The velocity radiance across a distance {Delta}x =L/{lambda}
{Delta}x Horizontal distance
a Exponent of the flux (Eq. [6])
H Scaling exponent of the fluctuations with respect to the flux (Eq. [6]), scale-by-scale conservation exponent
{xi}(q) Exponent of the qth order structure function (Eq. [7])
c({gamma}) (Statistical) codimension of the set of points with singularities {gamma} (Eq. [12])
{gamma} Order of singularity corresponding to a field level {varepsilon}{lambda} ({gamma} = log{varepsilon}{lambda}/log{lambda}; Eq. [12])
d Dimension of space, here (for images) d = 2
d({gamma}) Dimension of the set with singularity {gamma} [ = d – c({gamma})]
Ds Sampling dimension = logNs/log{lambda}
Ns Number of independent fields of dimension d, resolution {lambda}, here it = 1
Pr Probability
s Dummy (threshold) variable (Eq. [14])
C1 Codimension corresponding to the mean (i.e., q = 1)
{alpha} Levy index quantifying how quickly the higher and lower order statistics depart from the mean (q = 1) behavior
{alpha}' Auxiliary variable (Eq. [15])
{gamma}D Critical singularity corresponding to qD
{gamma}s Critical singularity corresponding to qs
r(q) Pure nonlinear part of K(q) (Eq. [17])


    ACKNOWLEDGMENTS
 
We are grateful to Carmelo Alonso for providing the images. This work has been partially supported by CICYT under Project MTM2006-15533-C02-02.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 Spectral Analysis
 Multifractal Analysis
 Conclusions
 Appendix: Symbol Definitions
 REFERENCES
 




This article has been cited by other articles:


Home page
Vadose Zone JHome page
S. D. Logsdon, E. Perfect, and A. M. Tarquis
Multiscale Soil Investigations: Physical Concepts and Mathematical Techniques
Vadose Zone J., May 27, 2008; 7(2): 453 - 455.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lovejoy, S.
Right arrow Articles by Schertzer, D.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Lovejoy, S.
Right arrow Articles by Schertzer, D.
GeoRef
Right arrow GeoRef Citation
Agricola
Right arrow Articles by Lovejoy, S.
Right arrow Articles by Schertzer, D.
Related Collections
Right arrow Fractal Approaches
Right arrow Scaling
Right arrow Remote Sensing


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
The SCI Journals Agronomy Journal Crop Science
Journal of Natural Resources
and Life Sciences Education
Soil Science Society of America Journal
Journal of Plant Registrations Journal of
Environmental Quality
The Plant Genome