User:DBoyd13/Linear Seismic Inversion

Introduction to Seismic Inversion
Inverse modeling is a mathematical technique where the objective is to determine the phycical properties of the subsurface of an earth region that has produced a given seismogram. Cooke and Schneider (1983) defined it as calculation of the earth’s structure and physical parameters from some set of observed seismic data. The underlying assumption in this method is that the collected seismic data are from an earth structure that matches the cross-section computed from the inversion algorithm. Some common earth properties that are inverted for include acoustic velocity, formation and fluid densities, impedance, poisson's ratio, formation compressibility, shear rigidity, porosity, saturation etc. The method has been a useful tool for geophysicists for over 20 years and can be categorized into two broad types  : Deterministic and Stochastic inversion. Deterministic inversion methods are based on comparison of the output from an earth model with the observed field data and continuously updating the earth model parameters to minimize a function, which is usually some form of difference between model output and field observation. As such, this method of inversion to which linear inversion falls under is posed as an minimization problem and the accepted earth model is the set of model parameters that minimizes the objective function in producing a numerical seismogram which best compares with collected field seismic data. On the other hand, Stochastic inversion methods are used to generate constrained models as used in reservoir flow simulation, using geostatistical tools like kriging. As opposed to deterministic inversion methods which produce a single set of model parameters, stochastic methods generate a suite of alternate earth model parameters which all obey the model constraint. However, the two methods are related as the results of deterministic models is the average of all the possible non-unique solutions of stochastic methods. Since seismic linear inversion is a deterministic inversion method, the stochastic method will not be discussed beyond this point.





Linear inversion
The deterministic nature of linear inversion requires a functional relationship which models, in terms of the earth model parameters, the seismic variable to be inverted. This functional relationship is some mathematical model derived from the fundamental laws of physics and is more often called a forward model. The aim of the technique is to minimize a function which is dependent on the difference between the convolution of the forward model with a source wavelet and the field collected seismic trace. As in the field of optimization, this function to be minimized is called the objective function and in convectional inverse modeling, is simply the difference between the convolved forward model and the seismic trace. As earlier mentioned, different types of variables can be inverted for but for clarity, these variables will be referred to as the impedance series of the earth model. In the following subsections we will describe in more detail, in the context of linear inversion as a minimization problem, the different components that are necessary to invert seismic data.

Forward model
The centerpiece of seismiclinear inversion is the forward model which models the generation of the experimental data collected. According to Wiggins (1972), it provides a functional (computational) relationship between the model parameters and calculated values for the observed traces. Depending on the seismic data collected, this model may vary from the classical wave equations for predicting particle displacement or fluid pressure for sound wave propagation through rock or fluids, to some variants of these classical equations. For example the forward model in Tarantola (1984) is the wave equation for pressure variation in a liquid media during seismic wave propagation while by assuming constant velocity layers with plane interfaces, Kanasewich and Chiu (1985) used the brachistotrone model of John Bernoulli for travel time of a ray along a path. In Cooke and Schneider (1983), the model is a synthetic trace generation algorithm expressed as in Eqn. 3, where R(t) is generated in the Z-domain by recursive formula. In whatever form the forward model appears, it is important that it not only predicts the collected field data, but also models how the data is generated. Thus, the forward model by Cooke and Schneider (1983) can only be used to invert CMP data since the model invariably assumes no spreading loss by mimicking the response of a laterally homogeneous earth to a plane-wave source

$$t=\sum_{i=1}^n\frac{\big[(x_i-x_{i-1})^2+(y_i-y_{i-1})^2+(z_i-z_{i-1})^2\big]^{\frac{1}{2}}}{v_i}\!$$ 

where t is ray travel time, x, y, z are depth coordinates and vi is the constant velocity between interfaces i − 1 and i.

$$\Big[\frac{1}{K(\vec{r})}\frac{\partial^2}{\partial t^2}-\nabla\cdot\big(\frac{1}{\rho(\vec{r}\big)}\nabla)\Big] U(\vec{r},t)= s(\vec{r},t) \!$$

where $$K(\vec{r})\!$$ represent bulk modulus, $$\rho(\vec{r})\!$$ density, $$s(\vec{r},t)\!$$ the source of acoustic waves, and $$U(\vec{r},t)\!$$ the pressure variation.

$$s(t)=w(t)*R(t)\!$$ 

where s(t) = synthetic trace, w(t) = source wavelet, and R(t) = reflectivity function.

Objective function
An important numerical process in inverse modeling is to minimize the objective function, which is a function defined in terms of the difference between the collected field seismic data and the numerically computed seismic data. Classical objective functions include sum of squares of difference between experimental and numerical data, as in the least squares methods, the sum of the magnitude of the difference between field and numerical data or some variant of these definitions. Irrespective of the definition used, numerical solution of the inverse problem is obtained as earth model that minimize the objective function.

In addition to the objective function, other constraints like known model parameters and known layer interfaces in some regions of the earth are also incorporated in the inverse modeling procedure. These constraints, according to Francis 2006, help to reduce non-uniqueness of the inversion solution by providing a priori information that is not contained in the inverted data while Cooke and Schneider (1983) reports their useful in controlling noise and when working in a geophysically well-known area.







Mathematical analysis of generalized linear inversion procedure
The objective of mathematical analysis of inverse modeling is to cast the generalized linear inverse problem into a simple matrixalgebra by considering all the components described in previous sections. viz; forward model, objective function etc. Generally, the numerically generated seismic data are non-linear functions of the earth model parameters. To remove the non-linearity and create a platform for application of linear algebra concepts, the forward model is linearized by expansion using a Taylor series as carried out below. For more details see Wiggins (1972), Cooke and Schneider (1983).

Consider a set of $$m\!$$ seismic field observations $$F_j\!$$, where $$j = 1 \cdots m\!$$ and a set of $$n\!$$ earth model parameters $$p_i\!$$ to be inverted for, where $$i=1\cdots n\!$$. The field observations can be represented in either $$\vec{F}\,(\vec{p})\!$$ or $$F_j\,(p_i)\!$$ where $$\vec{p}\!$$ and $$\vec{F}\,(\vec{p})\!$$ are vectorial representations of model parameters and the field observations as a function of earth parameters. Similarly, for $$q_i\!$$ representing guesses of model parameters, $$\vec{F}\,(\vec{q})\!$$ is the vector of numerical computed seismic data using the forward model of Sec. 1.3 Taylor's series expansion of $$\vec{F}\,(\vec{p})\!$$ about $$\vec{q}\!$$ is given below.

$$\vec{F}\,(\vec{p}) = \vec{F}\,(\vec{q})+(\vec{p}-\vec{q})\frac{\partial \vec{F}\,(\vec{q})}{\partial \vec{p}}+(\vec{p}-\vec{q})^2\frac{\partial^2 \vec{F}\,(\vec{q})}{\partial \vec{p}^2} +O(\vec{p}-\vec{q})^3\!$$

On linearization by dropping the non-linear terms (terms with (p⃗ − ⃗q) of order 2 and above), the equation becomes

$$\vec{F}\,(\vec{p}) - \vec{F}\,(\vec{q})=(\vec{p}-\vec{q})\frac{\partial \vec{F}\,(\vec{q})}{\partial \vec{p}}\!$$

Considering that $$\vec{F}\!$$ has $$m\!$$ components and $$\vec{p}\!$$ and $$\vec{q}\!$$ have $$n\!$$ components, the discrete form of Eqn. 5 results in a system of $$m\!$$ linear equations in $$n\!$$ variables whose matrix form is shown below.

$$\Delta \vec{F} = \mathbf{A}\,\Delta\vec{p}\!$$

$$\Delta\vec{F} = \begin{bmatrix}F_1(\vec{p})-F_1(\vec{q})\\\vdots\\F_m(\vec{p})-F_m(\vec{q})\end{bmatrix}\!$$

$$\Delta\vec{p} = \vec{p}-\vec{q} = \begin{bmatrix} p_1-q_1\\ \vdots	\\ p_n-q_n \end{bmatrix}\!$$</li>

<li>$$\mathbf{A} = \begin{bmatrix} \frac{\partial F_1(\vec{q})}{\partial p_1} & \frac{\partial F_1(\vec{q})}{\partial p_2} & \cdots & \frac{\partial F_1(\vec{q})}{\partial p_n} \\ \frac{\partial F_2(\vec{q})}{\partial p_1} & \cdots & \frac{\partial F_2(\vec{q})}{\partial p_{n-1}} & \frac{\partial F_2(\vec{q})}{\partial p_n} \\ \vdots	& \frac{\partial F_j(\vec{q})}{\partial p_i} & \vdots & \vdots \\ \frac{\partial F_m(\vec{q})}{\partial p_1} & \frac{\partial F_m(\vec{q})}{\partial p_2} & \cdots & \frac{\partial F_m(\vec{q})}{\partial p_n} \\ \end{bmatrix}\!$$</li> </ol>

$$\Delta \vec{F}\!$$ is called the difference vector in(Cooke and Schneider 1983). It has a size of $$n\times 1\!$$ and its components are the difference between the observed trace and the numerically computed seismic data. $$\Delta\vec{p}\!$$ is the corrector vector of size $$n\times 1\!$$ while $$\mathbf{A}\!$$ is called the sensitivity matrix. It has a size of $$m\times n\!$$ and its comments are such that each column is the partial derivative of a component of the forward function with respect to one of the unknown earth model parameters. Similarly, each row is the partial derivative of a component of the numerically computed seismic trace with respect to all unknown model parameters.

Solution algorithm
$$\vec{F}\,(\vec{q})\!$$ is computed from the forward model while $$\vec{F}\,(\vec{p})\!$$ is the experimental data. Thus, $$\Delta \vec{F}\!$$ is a known quality. On the other hand, $$\Delta\vec{p}\!$$ is unknown and is obtained by solution of Eqn. \ref{eq_inv}. This equation is theoretically solvable only when $$\mathbf{A}\!$$ is invertible, that is, if it is a square matrix so that the number of observations $$m\!$$ is equal to the number $$n\!$$ of unknown earth parameters. If this is the case, the unknown corrector vector $$\Delta\vec{p}\!$$, is solved for as shown below, using any of the classical direct or iterative solvers for solution of a set of linear equations.

<li>$$\Delta \vec{p} = \mathbf{A}^{-1}\,\Delta\vec{F}\!$$</li> </ol>

In most seismic inversion applications, there are more observations than the number of earth parameters to be inverted for i.e. $$m>n\!$$, leading to a system of equations that is mathematically over-determined. As a result, Eqn. \ref{eq_inv} is not theoretically solvable and an exact solution is not obtainable. An estimate of the corrector vector is obtained using the least squares procedure to find the corrector vector $$\Delta \vec{p}\!$$ that minimizes $$\vec{e}\,^T \vec{e}\!$$, which is the sum of the squares of the error, $$\vec{e}\!$$.

The error$$\vec{e}\!$$ is given by

<li>$$\vec{e} = \Delta\vec{F}-\mathbf{A}\,\Delta \vec{p}\!$$</li> </ol>

In the least squares procedure, the corrector vector that minimizes $$\vec{e}\,^T\vec{e}\!$$ is obtained as below.

<li>$$\begin{align} \mathbf{A}\,\Delta \vec{p} &=\Delta\vec{F}\\ \mathbf{A}^T\mathbf{A}\,\Delta \vec{p} &= \mathbf{A}^T\Delta\vec{F} \end{align}\!$$</li> </ol>

Thus,

<li>$$ \Delta \vec{p} = (\mathbf{A}^T\mathbf{A})^{-1}\,\mathbf{A}^T\Delta\vec{F}\!$$</li> </ol>

From the above discussions, the objective function is defined as either the $$L_1\!$$ or $$L_2\!$$ norm of $$\Delta \vec{p}\!$$ given by $$\sum_{j=0}^n|\Delta p_j|\!$$ or $$\sum_{j=0}^n|\Delta p_j|^2\!$$ or of $$\Delta \vec{F}\!$$ given by $$\sum_{i=0}^m|\Delta F_i|\!$$ or $$\sum_{i=0}^m|\Delta F_i|^2\!$$.

The generalized procedure for inverting any experimental seismic data for $$m = n\!$$ or $$m >n\!$$, using the mathematical theory for inverse modeling as described above is shown in Fig. 1 and described as follows.

An initial guess of the model impedance is provided to initiate the inversion process. The forward model uses this initial guess to compute a synthetic seismic data which is subtracted from the observed seismic data to calculate the difference vector.


 * 1) An initial guess of the model impedance $$\vec{q}\!$$ is provided to initiate the inversion process.
 * 2) A synthetic seismic data $$\vec{F}(\vec{q})\!$$ is computed by the forward model, using the model impedance above.
 * 3) The difference vector $$\vec{F}(\vec{p})-\vec{F}(\vec{q})\!$$ is computed as the difference between experimental and synthetic seismic data. \item
 * 4) The sensitivity matrix $$\mathbf{A}\!$$ is computed at this value of the impedance profile.
 * 5) Using $$\mathbf{A}\!$$ and the difference vector from 3 above, the corrector vector $$\Delta \vec{p}\,\!$$ is calculated. A new impedance profile is obtained as <li>$$\vec{p}=\vec{q}+\Delta \vec{p}\!$$</li></ol>
 * 6) The $$L_1\!$$ or $$L_2\!$$ norm of the computed corrector vector is compared with a provided tolerance value. If the computed norm is less than the tolerance, the numerical procedure is concluded and the inverted impedance profile for the earth region is given by $\vec{p}\! from Eqn. \ref{corr_imp}. On the other hand, if the norm is greater than the tolerance, iterations through steps 2-6 are repeated but with an updated impedance profile as computed from Eqn. 14. Fig. 2 shows a typical example of impedance profile updating during successive iteration process. According to (Cooke and Schneider 1983), use of the corrected guess from Eqn. \ref{corr_imp} as the new initial guess during iteration reduces the error.

Parameterization of the earth model space
Irrespective of the variable to be inverted for, the earth’s impedance is a continuous function of depth (or time in seismic data) and for numerical linear inversion technique to be applicable for this continuous physical model, the continuous properties have to be discretized and /or sampled at discrete intervals along the depth of the earth model. Thus, the total depth over which model properties are to be determined is a necessary starting point for the discretization. Commonly, as shown in Fig. 3, this properties are sampled at close discrete intervals over this depth to ensure high resolution of impedance variation along the earth’s depth. The impedance values inverted from the algorithm represents the average value in the discrete interval.

Considering that inverse modeling problem is only theoretically solvable when the number of discrete intervals for sampling the properties is equal to the number of observation in the trace to be inverted, a high resolution sampling will lead to a large matrix which will be very expensive to invert. Furthermore, the matrix may be singular for dependent equations, the inversion can be unstable in the presence of noise and the system may be under-constrained if parameters other than the primary variables inverted for, are desired. In relation to parameters desired, other than impedance, Cooke and Schneider (1983) gives them to include source wavelet and scale factor.

Finally by treating constraints as known impedance values in some layers or discrete intervals, the number of unknown impedance values to be solved for reduces, leading to greater accuracy in the results of the inversion algorithm.







Temperature inversion from Marescot (2010)
We start with an example to invert for earth parameter values from temperature depth distribution in a given earth region. Although this example does not directly relate to seismic inversion since no traveling acoustic waves are involved, it nonetheless introduces practical application of the inversion technique in a manner easy to comprehend, before moving on to seismic applications. In this example, the temperature of the earth is measured at discrete locations in a well bore by placing temperature sensors in the target depths. By assuming a forward model of linear distribution of temperature with depth, two parameters are inverted for from the temperature depth measurements.

The forward model is given by <li> $$\vec{F}(\vec{q}) = \vec{T} = a+bz\!$$</li></ol>

where $$\vec{q} = [a,b]\!$$. Thus, the dimension of $$\vec{q}\!$$ is 2 i.e the number of parameters inverted for is 2.

The objective of this inversion algorithm is to find $$\vec{p}\!$$, which is the value of $$[a,b]\!$$ that minimizes the difference between the observed temperature distribution and those obtained using the forward model of Eqn. 15. Considering the dimension of the forward model or the number of temperature observations to be $$n\!$$, the components of the forward model is written as

<li>$$\begin{align} T_1&=a+bz_1		\\ T_2&=a+bz_2	\\ \vdots	\\ T_{n-1}&=a+bz_{n-1}		\\ T_n&=a+bz_n	\\ \end{align}\!$$</li>

so that $$\vec{F}(\vec{q}) = T\!$$

<li>$$\mathbf{A} = \begin{bmatrix} 1 & z_1 \\ 1 & z_2 \\ \vdots & \vdots	\\ 1 & z_{n-1} \\ 1 & z_n \\ \end{bmatrix}\!$$</li></ol>

We present results from Marescot (2010) for the case of $$n = 2\!$$ for which the observed temperature values at depths were $$T_1 = 19 ^{\circ}C\!$$ at $$z=2m\!$$ and $$T_2=22^{\circ}C\!$$ at $$z=8m\!$$. These experimental data were inverted to obtain earth parameter values of $$a = 0.5\!$$ and $$b=18^{\circ}C\!$$. For a more general case with large number of temperature observations, Fig. 4 shows the final linear forward model obtained from using the inverted values of $$a\!$$ and $$b\!$$. The figure shows a good match between experimental and numerical data.

Wave travel time inversion from Marescot (2010)
This examples inverts for earth layer velocity from recorded seismic wave travel times. Fig. 5 shows the initial velocity guesses and the travel times recorded from the field, while Fig. 6a shows the inverted heterogeneous velocity model, which is the solution of the inversion algorithm obtained after 30 iterations. As seen in Fig. 6b, there is good comparison between the final travel times obtained from the forward model using the inverted velocity and the field recored travel times. Using these solutions, the ray path was reconstructed and is shown to be highly tortuous through the earth model as shown in Fig. 7.

Seismic trace inversion from Cooke and Schneider (1983)
This example, taken from Cooke and Schneider (1983), shows inversion of a CMP seismic trace for earth model impedance (product of density and velocity) profile. The seismic trace inverted is shown in Fig. 8 while Fig. 9a shows the inverted impedance profile with the input initial impedance used for the inversion algorithm. Also recorded alongside the seismic trace is an impedance log of the earth region as shown in Fig. 9b. The figures show good comparison between the recorded impedance log and the numerical inverted impedancefrom the seismic trace.