Image Warping and Mosaicing

Introduction

Images can be viewed as collections of light rays that pass through a single point and project onto a plane. That point is called the center of projection. Whether the image is formed through a pinhole camera or captured with a digital single-lens reflex camera, the photos we see can always be interpreted as the projection of rays onto an image plane.

With this assumption, we can mimic changing the orientation of a camera without actually doing so. The fundamental idea is to find the transformation between two projection planes, and apply that transformation to each pixel location. The transformation we are interested in is projective, and can be written as a 3x3 matrix:

$$H = \begin{bmatrix} a & b & c \\ d & e & f \\ g & h & i \end{bmatrix}$$

Although a pixel location is a 2D point $(x, y)$, we represent it in homogeneous coordinates as $[x\ y\ 1]^T$; the appended 1 lets the matrix multiplication take translation into account.

To simplify the projective transformation matrix, we can set $i = 1$. The matrix

$$\begin{bmatrix} a & b & c \\ d & e & f \\ g & h & 1 \end{bmatrix}$$

therefore has 8 degrees of freedom, and can be solved with 4 pairs of corresponding points. In reality, since we are applying this transformation to the whole image, we want more than 4 point correspondences to obtain a more robust transformation. With more than 4 pairs the system becomes overdetermined, and the parameters of the transformation matrix should be solved by least-squares approximation.

Projective transformations apply to multiple scenarios. In this project we will explore their application to image rectification and mosaicing. Image mosaicing is achieved by first warping two images onto the same projection plane and then blending them together. We will discuss concrete implementations in later sections.

 

Shooting Pictures

The first step of warping images is to collect two images that lie on different projection planes. One might think that different projection planes can easily be realized by rotating the camera. However, we must also make sure that the center of projection stays the same, so that we are measuring the same pencil of rays. In practice, this can be achieved by rotating the camera or smartphone around the optical center of the lens.

Below are three different views of the University Library reading room. They will also be our primary ingredients for image rectification and mosaicing.

Left view:

left

Middle view:

right

Right view:

mosaic2_right

 

Computing Transformation Matrix

In order to compute the transformation matrix, we first write out how the parameters $a, b, c, d, e, f, g, h$ are involved in the pixel-location transformation. Take a pair of corresponding points $(x, y)$ and $(x', y')$ as an example. Applying the transformation matrix, we get:

$$\begin{bmatrix} a & b & c \\ d & e & f \\ g & h & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ 1 \end{bmatrix} = \begin{bmatrix} ax + by + c \\ dx + ey + f \\ gx + hy + 1 \end{bmatrix} = \begin{bmatrix} wx' \\ wy' \\ w \end{bmatrix}, \quad w \in \mathbb{R}$$

Here $w$ is a scalar weight. Multiplying by $w$ only changes the scale of the homogeneous vector; it does not affect the projected point.
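As a quick numeric check (a NumPy sketch; the point values here are arbitrary, not taken from the project), scaling a homogeneous vector by any nonzero weight leaves the recovered 2D point unchanged once we divide through by the last entry:

```python
import numpy as np

# Arbitrary homogeneous point representing (x', y') = (2, 3).
p = np.array([2.0, 3.0, 1.0])

# Scale by several nonzero weights; dividing the first two entries by
# the third always recovers the same 2D point (2, 3).
points = [(w * p)[:2] / (w * p)[2] for w in (1.0, 0.5, 4.0)]
```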

With the knowledge above, and $n$ pairs of point correspondences $\{(x_1, y_1), (x_1', y_1')\}, \ldots, \{(x_n, y_n), (x_n', y_n')\}$, we can set up a least-squares problem:

$$A = \begin{bmatrix} x_1 & y_1 & 1 & 0 & 0 & 0 & -x_1 x_1' & -y_1 x_1' \\ 0 & 0 & 0 & x_1 & y_1 & 1 & -x_1 y_1' & -y_1 y_1' \\ & & & & \vdots \\ x_n & y_n & 1 & 0 & 0 & 0 & -x_n x_n' & -y_n x_n' \\ 0 & 0 & 0 & x_n & y_n & 1 & -x_n y_n' & -y_n y_n' \end{bmatrix} \quad b = \begin{bmatrix} x_1' \\ y_1' \\ \vdots \\ x_n' \\ y_n' \end{bmatrix} \quad x = \begin{bmatrix} a \\ b \\ c \\ d \\ e \\ f \\ g \\ h \end{bmatrix}$$

$$Ax = b \implies x = (A^T A)^{-1} A^T b$$

Plugging the source pixel location $[x\ y\ 1]^T$ into the computed projective transformation matrix may not give a resulting pixel location exactly equal to $[x'\ y'\ 1]^T$, but rather $[wx'\ wy'\ w]^T$ for some weight $w$. In order to recover the real projected point, we divide the first two entries by the last entry to get a normalized result.
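The least-squares solve and the normalization step can be sketched in NumPy as follows. The function names are my own; `np.linalg.lstsq` computes the same solution as forming $(A^TA)^{-1}A^Tb$ explicitly, but more stably:

```python
import numpy as np

def compute_H(src, dst):
    """Fit a 3x3 projective transform (with i = 1) mapping src -> dst.

    src, dst: (n, 2) arrays of corresponding points, n >= 4.
    """
    A, b = [], []
    for (x, y), (xp, yp) in zip(np.asarray(src, float), np.asarray(dst, float)):
        A.append([x, y, 1, 0, 0, 0, -x * xp, -y * xp])
        A.append([0, 0, 0, x, y, 1, -x * yp, -y * yp])
        b.extend([xp, yp])
    # Least-squares solution of Ax = b for the 8 unknown parameters.
    params, *_ = np.linalg.lstsq(np.array(A), np.array(b), rcond=None)
    return np.append(params, 1.0).reshape(3, 3)

def apply_H(H, pts):
    """Apply H to (n, 2) points, dividing by the last homogeneous entry."""
    pts = np.asarray(pts, float)
    hom = np.column_stack([pts, np.ones(len(pts))])
    mapped = hom @ H.T
    # Normalize: [wx', wy', w] -> (x', y').
    return mapped[:, :2] / mapped[:, 2:3]
```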

 

Image Rectification

With a means of computing the projective transformation matrix, we can produce striking warping effects such as image rectification. Since rectification involves only one image, we use that original image to choose point correspondences. What I did in this part was to choose "standard objects" (objects known to be rectangular) and pick 8 point correspondences with respect to those objects. Once we rectify the objects, the same transformation rectifies the whole image.
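A minimal sketch of computing one rectifying transform from a single object: pick the 4 corners of a known rectangular object and pair them with an axis-aligned rectangle. With exactly 4 correspondences the 8x8 system can be solved directly. The corner coordinates below are illustrative placeholders, not the points I actually clicked:

```python
import numpy as np

# Hypothetical corner picks of a rectangular object in the image
# (clockwise from top-left; coordinates are made up for illustration).
src = np.array([[412.0, 310.0], [690.0, 295.0], [705.0, 560.0], [430.0, 580.0]])
# Where those corners should land in a strict frontal view.
dst = np.array([[400.0, 300.0], [700.0, 300.0], [700.0, 560.0], [400.0, 560.0]])

# 4 correspondences -> 8 equations, 8 unknowns: solve exactly.
A, b = [], []
for (x, y), (xp, yp) in zip(src, dst):
    A.append([x, y, 1, 0, 0, 0, -x * xp, -y * xp])
    A.append([0, 0, 0, x, y, 1, -x * yp, -y * yp])
    b.extend([xp, yp])
h = np.linalg.solve(np.array(A), np.array(b))
H = np.append(h, 1.0).reshape(3, 3)
# Warping every pixel of the image through H (with the homogeneous
# normalization described earlier) rectifies the view around the object.
```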

I tried rectifying the "left view" and "right view" images presented above. They are shown again below:

Left view:

left

Right view:

mosaic2_right

As we may notice, the images above are tilted both vertically and sideways. This means that in order to rectify them, we first need to transform the upward view to eye level, and then adjust the result to a strict frontal view. The upward-to-eye-level image is generated from the original image, while the strict frontal view is generated from the adjusted eye-level image.

The first image below is the result of the upward-to-eye-level transformation, while the second is the final frontal view.

Upward to eyelevel:

rectified_left

Final frontal view:

rectified_left_further

Here are the upward-to-eye-level transformation and the final transformation for the "right view" image.

Upward to eyelevel:

rectified_right

Final frontal view:

rectified_right_further

 

Mosaicing

Image rectification has verified that the algorithm can mimic a change of projection plane. We can now apply projective transformation to a more complicated task: creating a wide-angle photo from multiple pictures taken on different projection planes.

The first step of the task is to warp every image onto the same projection plane. I designate one image as the reference, and in each mosaic I warp every other image onto this reference projection plane. The images are then stacked together. Finally, feathering is applied along the seam to soften the concatenation effect.
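The feathering step can be sketched as a linear alpha ramp across the overlap region. This is a simplified version assuming a straight vertical seam between two same-height images; in the real mosaic the overlap follows the warped image boundary, but the ramp idea is the same:

```python
import numpy as np

def feather_blend(im1, im2, overlap_cols):
    """Blend two same-height images whose last/first `overlap_cols`
    columns overlap, using a linear alpha ramp across the overlap."""
    h, w1 = im1.shape[0], im1.shape[1]
    w2 = im2.shape[1]
    out = np.zeros((h, w1 + w2 - overlap_cols) + im1.shape[2:], dtype=float)
    # Non-overlapping parts are copied through unchanged.
    out[:, :w1 - overlap_cols] = im1[:, :w1 - overlap_cols]
    out[:, w1:] = im2[:, overlap_cols:]
    # Weight of im1 falls linearly from 1 to 0 across the overlap.
    alpha = np.linspace(1.0, 0.0, overlap_cols)
    if im1.ndim == 3:
        alpha = alpha[:, None]  # broadcast over color channels
    out[:, w1 - overlap_cols:w1] = (
        alpha * im1[:, w1 - overlap_cols:] + (1 - alpha) * im2[:, :overlap_cols]
    )
    return out
```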

Here's the first mosaic group. I stitched the middle view of the University Library with its left view.

Left view:

left

Right view:

right

Mosaic without feathering:

mosaic1_unmasked

Mosaic with feathering:

mosaic1_masked

Here's the second mosaic group. I stitched the middle view of the University Library with its right view.

Left view:

right

Right view:

mosaic2_right

Mosaic without feathering:

mosaic2_unmasked

Mosaic with feathering:

mosaic2_masked

Here's the third mosaic group. I climbed up to Lawrence Berkeley Lab and took pictures of the Bay Area at night.

Left view:

mosaic3_left

Right view:

mosaic3_right

Mosaic without feathering:

mosaic3_unmasked

Mosaic with feathering:

mosaic3_masked