CS194-26 Project 3 Face Morphing

Weichao Chen

PART I - Manual Image Stitching

Homography Matrix Calculation

$H$ $\ge8$ equations)

\begin{bmatrix} a & b & c \\ d & e & f \\ g & h & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ 1 \end{bmatrix} = \begin{bmatrix} wx' \\ wy' \\ w \end{bmatrix}

$H$ $H^{-1}p$ . More on this on StackExchange

Image Rectifying

Below are examples that shows projective transformation in use.

Flyer

flyer_rectified

Screen

screen_rectified

Image Stiching

By taking two pictures with different point of views but overlapped field of view, say pic1 and pic2. Then warp pic1 using the homography matrix defined between the two pictures.

Flyer

flyer

Shelf

shelf

Kitchen

kitchen

PART II - Automatic Image Stitching

In this part we attempt to automate the image stiching process by using

HarrisCorner algorithm to compute the "cornerness" of each pixel

Adaptive Non-Maximum Suppression to select a handful of pixels with the highest level of "corerness"
Patch Texture Feature Extraction for each selected candidate corner coordinate
SSD as metrics to compare and define correspondence relationship between the corner coordinates for two images
$H$ , the homography matrix

Harris Corner and ANMS

k $H$ the homography matrix less robust. ANMS on the other hand, by computing the suppression radius for each point i.e. the distance until the closest point has a higher "cornerness" score, ensures that the selected pixels are more evenly distributed across the image.

Photo Credit: Anisha Gartia, Georgia Tech

$k = 100, C_{robust} = 0.9$

shelf_anms

Patch Texture Feature Extraction

Next up we'd calculate a high-dimensional feature descriptor at each point so we can defind correspondence between points across different images. Ideally, these descriptor should be invariant to rotation and scaling. For the sake of simplicity though, I use a 40 * 40 window to extract the gradient sub-region containing each point, down-sample such window to a 8 * 8 window (patch), convolve it against a Gaussian filter so that points close-up gets a higher weight, and then normalize to have a standard deviation of 1 and mean of 0 before I flatten the matrix into a 1 * 64 vector.

Defining Point Correspondences

Now that we have k points from two different images and the descriptor vector v associated with them, we simply calculate the pair-wise Euclidean distance using v and match the two points with the shortest distance. After that, we select the pairs where the distance is below a threshold.

shelf_matched

RANSAC

While we root for the matching algorithm to find correct correspondence, incorrect matches still exist. Yet, it majority of the matches are correct, we can use the RANSACH $(x_i, y_i), (x_j, y_j)$ $(x\prime_i, y\prime_i, w)^T = H(x_i, y_i)^T$ $(x_j, y_j)$ . If the distance is below a predefined threshold, we could the point as an inlier.

Photo Credit: Mehrdad Heydarzadeh

Mosaic Examples

Finally, here're some example using the stitching procedure

What I Learn

The intuition behind RANSAC is pretty simple, although the name is pretty intimidating. Moving forward, I'll be interested in understanding finding scale, rotation invariance feature descriptor