next up previous [pdf]

Next: Numerical Implementation Up: Theory Previous: Connection between time- and

The optimization formulation

Given boundary conditions 8, equation 6 describes the traveltime $t_0$ of a velocity model with a plane-wave source at the surface. For a given $t_0$, equation 7 is a first-order linear PDE on $x_0$ and thus computation of $x_0$ is straightforward. Our idea is inspired by a natural logic: if the resulting $x_0$ does not satisfy equation 5, we need to modify $v$ in a way such that the misfit decreases, and repeat the process until convergence.

Mathematically, we define a cost function $f (z,x)$ based on equation 5:

\begin{displaymath}
f (z,x) = \nabla x_0 \cdot \nabla x_0 - v_d^2 w\;,
\end{displaymath} (9)

where for convenience we use slowness-squared $w (z,x) = v^{-2} (z,x)$ instead of $v$. Note that $f$ is dimensionless. The discretized form of equation 9 reads
\begin{displaymath}
\mathbf{f} = (\nabla \mathbf{x_0} \cdot \nabla) \mathbf{x_0...
...
- \mbox{diag}(\mathbf{v_d} \star \mathbf{v_d}) \mathbf{w}\;.
\end{displaymath} (10)

In equation 10, $\mathbf{f}, \mathbf{x_0}, \mathbf{v_d}$ and $\mathbf{w}$ are all column vectors after discretizing the computational domain $(z,x)$. For example, $\mathbf{x_0}$ is the discretized column vector of $x_0 (z,x)$. The vector $\mathbf{v_d}$ may require interpolation because it is in $(t_0,x_0)$ while the discretization is in $(z,x)$. The interpolation can be done after forward mapping from $(z,x)$ to $(t_0,x_0)$ at current velocity model. We denote an operator which is a matrix $\mathbf{L}_{x_0} = \nabla \mathbf{x_0} \cdot \nabla$. The other operator $\mbox{diag}()$ expands a vector into a diagonal matrix. Finally, the symbol $\star$ stands for an element-wise vector-vector multiplication.

As is common in many optimization problems, we seek to minimize the least-squares norm of $\mathbf{f}$:

\begin{displaymath}
E [\mathbf{w}] = \frac{1}{2} \Vert\mathbf{f}\Vert^2 = \frac{1}{2} \mathbf{f}^T \mathbf{f}\;,
\end{displaymath} (11)

where the superscript $T$ stands for transpose. The Gauss-Newton method in optimization requires linearizing the cost function in equation 10:
\begin{displaymath}
\frac{\partial f}{\partial w} =
2 (\nabla x_0 \cdot \nabla...
...ial w} -
2 v_d w \frac{\partial v_d}{\partial w} - v_d^2\;.
\end{displaymath} (12)

The Fréchet derivative matrix $\mathbf{J}$ required by inversion is the discretized form of equation 12, i.e., $\mathbf{J} = \partial \mathbf{f} / \partial \mathbf{w}$. In Appendix B we find that $\mathbf{J}$ is a cascade and summation of several parts. An update $\delta \mathbf{w}$ at current $\mathbf{w}$ is found by solving the following normal equation arising from the Gauss-Newton approach (Björck, 1996):
\begin{displaymath}
\delta \mathbf{w} = \left[ \mathbf{J}^T \mathbf{J} \right]^{-1} \mathbf{J}^T (- \mathbf{f})\;.
\end{displaymath} (13)

Equations 11 and 13 together suggest a nonlinear inversion procedure for solving the original system of PDEs 5, 6 and 7. The inversion is analogous to traveltime tomography but with more complexity. The cost 9 can be interpreted as difference between modeled and observed geometrical spreadings. However, both of them depend on the model $v$, while in traveltime tomography the observed arrival times are independent of $v$. The forward modeling in our case involves two steps, which construct a curvilinear coordinate system that is sensitive to lateral velocity variations. On the other hand, the forward modeling in traveltime tomography consists of only one step. Last but not least, unlike traveltime tomography, we have observations everywhere in the computational domain, except for areas excluded due to instabilities of the numerical implementation, as we will discuss later.

Before introducing a numerical implementation, we would like to point out several important facts and assumptions that make a successful time-to-depth conversion possible by the proposed method:

For the prior model, we adopt the Dix-inverted model. In other words, we seek to refine the interval velocity given by equation 2 by taking the geometrical spreading of image rays into consideration according to equation 3.


next up previous [pdf]

Next: Numerical Implementation Up: Theory Previous: Connection between time- and

2015-03-25