next up previous [pdf]

Next: Operator approach to missing Up: Missing-data program Previous: Missing-data program

Matrix approach to missing data

Customarily, we have referred to data by the symbol $\bold d$. Now that we are dividing the data space into two parts, known and unknown (or missing), we refer to this complete space as the model (or map) space $\bold m$.

There are 15 data points in Figures 1-5. Of the 15, 4 are known and 11 are missing. Denote the known by $k$ and the missing by $u$. Then, the sequence of missing and known is $(u,u,u,u,k,u,k,k,k,u,u,u,u,u,u)$. Because I cannot print $15\times 15$ matrices, please allow me to describe instead a data space of 6 values $(m_1, m_2, m_3, m_4, m_5, m_6)$ with known values only $m_2$ and $m_3$, $\bold m$ arranged as $(u,k,k,u,u,u)$.

Our approach is to minimize the energy in the residual, which is the filtered map (model) space. We state the fitting goals $\bold 0\approx \bold F\bold m$ as:

\begin{displaymath}
\left[
\begin{array}{c}
0 \\
0 \\
0 \\
0 \\
0 \\
...
...
m_2 \\
m_3 \\
m_4 \\
m_5 \\
m_6
\end{array} \right]
\end{displaymath} (1)

Rearranging these fitting goals, and bringing the columns multiplying known data values ($m_2$ and $m_3$) to the left, we get $\bold y =-\bold F_k \bold m_k \approx \bold F_u \bold m_u$.

\begin{displaymath}
\left[
\begin{array}{c}
y_1 \\
y_2 \\
y_3 \\
y_4  ...
...array}{c}
m_1 \\
m_4 \\
m_5 \\
m_6
\end{array} \right]
\end{displaymath} (2)

Equation (2) is the familiar form of an overdetermined system of equations $\bold y \approx \bold F_u \bold m_u$ that we could solve for $\bold m_u$ as illustrated earlier by conjugate directions or by a wide variety of well-known methods.

The trouble with this matrix approach is that it is awkward to program the partitioning of the operator into the known and missing parts, particularly if the application of the operator uses arcane techniques, such as those used by the fast-Fourier-transform operator or various numerical approximations to differential, or partial differential, operators that depend on regular data sampling. Even for the modest convolution operator, we already have a library of convolution programs that handle a variety of end effects, and it would be much nicer to use the library as it is rather than recode it for all possible geometrical arrangements of missing data values.


next up previous [pdf]

Next: Operator approach to missing Up: Missing-data program Previous: Missing-data program

2014-12-03