Introduction

Registration is a technique aimed to align two objects using a particular transformation.

A typical example of registration is to have two medical images from the same patient taken at different dates. It is very likely that the patient assume a different position during each acquisition. A registration procedure would allow to take both images and find a spatial transformation to find the corresponding pixel from one image into the other.

Another typical example of registration is to have a geometrical model of an organ, let's say a bone. This model can be used to find the corresponding structure in a medical image. In this case, a spatial transformation is needed to find the correct location of the structure in the image.

ITK Registration Framework

The Insight Toolkit takes full advantage of the power provided by generic programming. Thanks to that, it have been possible to create an abstraction of the particular problems that the toolkit is intended to solve.

The registration problem have been decomposed in a set of basic elements. They are:

Target: the object that is assumed to be static.
Reference: the object that will be transformed in order to be superimpossed to the Target
Transform: the mapping that will convert one point from the Reference object space to the Target object space.
Metric: a measure that indicates how well the Target object matches the Reference object after transformation.
Mapper: the particular technique used for interpolating values when objects are resampled through the Transform.
Optimizer: the method used to find the Transform parameters that optimize the Metric.

A particular registration method is defined by selecting specific implementations of each one of these basic elements. In order to determine the registration method appropriated for a particular problem, is will be useful to answer the following questions:

Target and Reference Objects

Currently the Target and Reference objects can be of type itk::Image and itk::PointSet. Methods have been instantiated for a variety of Image to Image and PointSet to Image registration cases.

Transforms

This is a rapid description of the transforms implemented in the toolkit

Affine: The affine transform is N-Dimensional. It is composed of a NxN matrix and a translation vector. The affine transform is a linear transformation that can manage rotations, translations, shearing and scaling.

Rigid3D: This transform is specific for 3D, it supports only rotations and translations. Rotations are represented using Quaternions.

Rigid3DPerspective: A composition of a Rigid3D transform followed by a perspective projection. This transformation is intended to be used in applications like X-Rays projections.

Translation: A N-Dimensional translation internally represented as a vector.

Spline: A kernel based spline is used to interpolate a mapping from a pair of point sets.

Similarity Metrics

Metrics are probably the most critical element of a registration problem. The metric defines what the goal of the process is, they measure how well the Target object is matched by the Reference object after the transform has been applied to it. The Metric should be selected in function of the types of objects to be registered and the expected kind of missalignment. Some metrics has a rather large capture region, which means that the optimizer will be able to find his way to a maximum even if the missalignment is high. Typically large capture regions are associated with low precision for the maximum. Other metrics can provide high precision for the final registration, but usually require to be initialized quite close to the optimal value.

Unfortunately there are no clear rules about how to select a metric, other that trying some of them in different conditions. In some cases could be and advantage to use a particular metric to get an initial approximation of the transformation, and then switch to another more sensitive metric to achieve better precision in the final result.

Metrics are depend on the objects they compare. The toolkit currently offers Image To Image and PointSet to Image metrics as follows:

Mean Squares Sum of squared differences between intensity values. It requires the two objects to have intensity values in the same range.
Normalized Correlation Correlation between intensity values divided by the square rooted autocorrelation of both target and reference objects: \( \frac{\sum_i^n{a_i * b_i }}{\sum_i^n{a_i^2}\sum_i^n{b_i^2}} \). This metric allows to register objects whose intensity values are related by a linear transformation.
Pattern Intensity Squared differences between intensity values transformed by a function of type \( \frac{1}{1+x} \) and summed them up. This metric has the advantage of increase simultaneously when more samples are available and when intensity values are close.
Mutual Information Mutual information is based in an information theory concept. Mutual information between two sets measures how much can be known from one set if only the other set is known. Given a set of values \( A=\{a_i\} \). Its entropy \( H(A) \) is defined by \( H(A) = \sum_i^n{- p(a_i) \log({p(a_i)})} \) where \( p(a_i) \) are the probabilities of the values in the set. Entropy can be interpreted as a measure of the mean uncertainty reduction that is obtained when one of the particular values is found during sampling. Given two sets \( A=\{a_i\} \) and \( B=\{b_i\} \) its joint entropy is given by the joint probabilities \( p_(a_i,b_i) \) as \( H(A,B) = \sum_i^n{-p(a_i,b_i) * log( p(a_i, b_i) )} \). Mutual information is obtained by subtracting the entropy of both sets from the joint entropy, as : \( H(A,B)-H(A)-H(B) \), and indicates how much uncertainty about one set is reduced by the knowledge of the second set. Mutual information is the metric of choice when image from different modalities need to be registered.

Optimizers

The following optimization methods are available:

Gradient Descent : Advance following the direction and magnitud of the gradient scaled by a learning rate.

Regular Step Gradient Descent : Advances following the direction of the gradient and use a bipartition scheme to compute the step length.

Conjugate Gradient : Nonlinear Minimization that optimize search directions using a second order approximation of the cost function.

Levenberg Marquardt : Nonlinear Least Squares Minimization

LBFGS : Limited Memory Broyden, Fletcher, Goldfarb and Shannon minimization.

Amoeba : Nelder Meade Downhill Simplex.

One Plus One Evolutionary : Strategy that simulates the biological evolution of a set of samples in the search space.

Multiresolution

The evaluation of a metric can be very expensive in computing time. An approach often used to improve performance is to register first reduced resolution versions of the target and reference objects. The resulting transform is used as the starting point for a second registration process performed in progressively higher resolution objects.

It is usual to create first a sequence of reduced resolution version of the objects, this set of objects is called a pryramid representation . A Multiresolution method is basically a set of consecutive registration process, each one performed at a particular level of the pyramid, and using as initial transform the resulting transform of the previous process.

Multiresolution offers the double advantage of increasing performance and at the same time improving the stability of the optimization by smoothing out local minima and increasing the capture region of the process.